A New Era of NovelAI Begins Now

NovelAI
23 May 202305:21

TLDRThe transcript introduces a new era for NovelAI with the addition of painting to image generation and the unveiling of Cleo, an in-house developed AI model. Cleo, trained on 1.5 trillion tokens, boasts a Lambada score of 73% and an 8192 token context, making her the most advanced model of her size. Although experimental, Cleo is a testament to NovelAI's capability to train large language models. Initially available to Opus subscribers, Cleo will be accessible to all users in two weeks. The team thanks the community for their patience and teases more exciting developments to come.

Takeaways

  • 🎨 New feature: Painting to image generation is being introduced, allowing users to modify and color images.
  • 🐶 A cute dog is used as a visual distraction during the announcement.
  • 📅 Upcoming release: The official release of the painting feature is scheduled for Thursday, two days from the announcement.
  • 🚀 Module updates: Sigurd and Andrew are receiving new V2 modules, which are complete replacements for the original versions.
  • 🍳 Team effort: The team has been working hard, with various metaphorical 'delicacies' representing their work.
  • 🌟 Introduction of Cleo: Cleo is a custom-made model developed in-house, trained from scratch with a custom tokenizer and dataset.
  • 📚 Extensive training: Cleo was trained on 1.5 trillion tokens, providing a strong general knowledge base.
  • 🏆 Performance: Cleo achieved a Lambada score of 73%, surpassing other models of similar size.
  • 🔍 Contextual understanding: Cleo features an impressive 8192 token context, enhancing its ability to understand and generate text.
  • 🔢 Parameter size: Despite its capabilities, Cleo is compact with only 3 billion parameters.
  • 🧑‍🔬 Proof of concept: Cleo serves as a proof of concept for the team's ability to train large language models.
  • 🔧 Testing phase: Cleo is still experimental and will be initially available to Opus subscribers for testing.
  • 📈 Future plans: The team is already training larger models and has more exciting developments planned for the year.

Q & A

  • What is the main announcement regarding image generation?

    -The main announcement is that painting will be integrated into image generation, which is accessible via the image to image interface.

  • What are the new modules being introduced for Sigurd and Andrew?

    -Sigurd and Andrew are getting brand new modules of the V2 variety, which are complete replacements to the original modules.

  • What is significant about the model Cleo?

    -Cleo is the first custom made model created entirely in-house, trained from scratch with a custom tokenizer, a custom 6 terabyte pre-trained data set, custom fine tune, and a custom pre-trained model. It is designed to excel in storytelling.

  • How many tokens of data has Cleo been trained on?

    -Cleo has been trained on 1.5 trillion tokens of data.

  • What is Cleo's Lambada score and how does it compare to other models?

    -Cleo's Lambada score is 73 percent, which is better than any other similarly sized model.

  • What is the token context length of Cleo?

    -Cleo features an 8192 token context length.

  • How many parameters does Cleo have?

    -Cleo has 3 billion parameters.

  • What is the current status of Cleo in terms of availability?

    -Cleo is still somewhat experimental and is being tested by Opus subscribers. Other users should expect to get access to Cleo in two weeks.

  • What does the team have planned for the future?

    -The team has begun training much larger models and has more exciting developments planned for the year.

  • How does the team feel about the patience and support of their audience?

    -The team is grateful for the patience and support of their audience and is committed to keeping them engaged with new developments.

  • What is the significance of the painting feature being introduced?

    -The painting feature allows users to add a creative touch to their images by coloring and replacing elements within the generated image.

  • What does the team mean by 'cooking real hard' in the context of their work?

    -The phrase 'cooking real hard' is a metaphor for the team's intensive efforts and hard work in developing and refining their AI models.

Outlines

00:00

🎨 Introducing Painting to Image Gen and New AI Modules

The video begins with a casual greeting and a playful introduction to the integration of painting into the image generation process. It's highlighted that this is a departure from text generation, but the focus quickly shifts to exciting advancements. The presenter teases an upcoming feature, painting, which allows for interactive image manipulation. They also announce the release of new V2 modules for Sigurd and Andrew, emphasizing that these are not just simple updates but significant upgrades. The presenter then introduces Cleo, a custom-made AI model developed in-house, trained from the ground up with a custom tokenizer, a 6-terabyte pre-trained dataset, and a custom fine-tune process. Cleo stands out for her extensive training on 1.5 trillion tokens, resulting in superior general knowledge and performance, as evidenced by her Lambada score of 73 percent. During fine-tuning, Cleo even achieved a score of 74. As the first model with an 8192 token context and a compact 3B parameter size, Cleo represents a proof of concept, showcasing the team's capability to train large language models. Although still experimental, Cleo is made available to Opus subscribers for testing, with a wider release planned for two weeks later.

05:00

🚀 Upcoming Developments and Subscriber Previews

The script concludes with a teaser of even more innovative features to come in the future, generating excitement among the audience. It's mentioned that Opus subscribers will have the privilege of early access to these new developments. The video ends on a high note with an energetic piece of background music, reinforcing the dynamic and forward-thinking spirit of the team.

Mindmap

Keywords

💡NovelAI

NovelAI refers to an artificial intelligence system designed to generate novel-like content, including text and images. In the context of the video, NovelAI is entering a new era with significant advancements in its capabilities, which is the central theme of the announcement.

💡Image Generation

Image Generation is the process of creating images from data using AI algorithms. In the video, it is mentioned that NovelAI is expanding its capabilities to include painting to image generation, indicating a new feature that allows users to transform text descriptions into visual images.

💡Text Generation

Text Generation is the process of creating written content using AI. It is a core feature of NovelAI, and the video discusses exciting updates related to text generation, such as new modules and improvements to existing ones, which are crucial for enhancing the AI's storytelling abilities.

💡Modules

In the context of AI, modules refer to specific components or functionalities that can be added or upgraded to improve the system's performance. The video mentions new V2 modules, which are advanced versions of the original modules, suggesting that NovelAI is receiving significant upgrades.

💡Tokenizer

A Tokenizer is a component in natural language processing that breaks down text into tokens, which are the basic units of meaning. The video highlights that Cleo, a new AI model, has a custom tokenizer, which is important for understanding and generating language more effectively.

💡Pre-trained Data Set

A Pre-trained Data Set is a collection of data that an AI model is initially trained on to learn patterns and generate responses. The video mentions a custom 6-terabyte pre-trained data set for Cleo, emphasizing the extensiveness of the training material that contributes to the model's capabilities.

💡Fine Tune

Fine Tuning is the process of further training an AI model on a specific task after it has been pre-trained on a general task. The video indicates that Cleo has undergone custom fine tuning, which helps the model to perform better in targeted applications.

💡Parameter Count

The Parameter Count refers to the number of variables that an AI model has, which can influence its complexity and performance. Cleo is described as having 3 billion parameters, which is significant for its size and indicates a balance between complexity and efficiency.

💡Lambada Score

The Lambada Score is a metric used to measure the language understanding capabilities of an AI model. The video states that Cleo achieved a Lambada score of 73 percent, which is higher than other models of similar size, showcasing its advanced language comprehension.

💡Token Context

Token Context refers to the number of tokens an AI model can consider when generating text. Cleo is said to have an 8192 token context, which is an impressively high number that allows the model to generate more coherent and contextually aware responses.

💡Proof of Concept

A Proof of Concept is a demonstration that a particular idea or technology can be successfully implemented. Cleo is presented as a proof of concept for NovelAI, showing that the company can develop and train large, high-performing AI models.

💡Opus Subscribers

Opus Subscribers likely refers to a group of users who have a subscription to a premium service or early access to features. The video mentions that these subscribers will be the first to experiment with Cleo, suggesting a tiered access model for NovelAI's users.

Highlights

Introduction of painting to image generation, a new feature not related to text generation.

Image to image interface allows for color adjustments and text replacements.

Announcement of the official release of painting feature in two days, on Thursday.

Sigurd and Andrew terpy are receiving brand new V2 modules.

Cleo, the first custom made model created in-house, is introduced.

Cleo has been trained from scratch with a custom tokenizer and 6 terabyte pre-trained dataset.

Cleo is trained on 1.5 trillion tokens, offering better general knowledge.

Cleo achieves a Lambada score of 73 percent, surpassing similarly sized models.

During fine tuning, Cleo reached a Lambardo score of 74.

Cleo features an 8192 token context and is packaged in a 3 billion parameter model.

Cleo is a proof of concept model, signifying the capability to train large language models.

Training process for Cleo has been finalized, addressing data set issues and smoothing out the process.

Larger models are already in training following the success with Cleo.

Opus subscribers will have first access to Cleo while final adjustments are made.

General availability of Cleo for all users is expected in two weeks.

The team expresses gratitude for patience and support, with more exciting developments planned for the year.

Opus subscribers will be the first to experiment with Cleo.