NEW Stable diffusion 2.1 RELEASED!

Sebastian Kamph
7 Dec 202210:15

TLDRStable Fusion 2.1 has been released, addressing issues from the previous 2.0 version. The update includes a new prompting style, more diverse training data, and less restrictive content filters. It promises improved image quality for architecture, interiors, and landscapes, as well as better rendering of people and various art styles. Users are encouraged to try out the new version and share their experiences.

Takeaways

  • 🚀 Stable Fusion 2.1 has been released as an improvement over the poorly received 2.0 version.
  • 💡 The 2.0 version's models functioned differently, leading to many users getting unsatisfactory results.
  • 🌟 Negative prompts were found to be a workaround for some users to achieve better results with 2.0.
  • 🎨 Stable Fusion 2.1 promises to bring back the familiar prompting style and restore many prompts.
  • 📈 The new version includes more data, more training, and less restrictive filtering of the dataset.
  • 🖼️ There was a focus on improving the diversity and range of the data set, particularly in architecture, interior design, wildlife, and landscape scenes.
  • 👤 Version 2.0 had issues with generating images of people, which 2.1 aims to address.
  • 🏙️ The architecture and scenery rendering capabilities have been enhanced in Stable Fusion 2.1.
  • 📞 The developers have listened to user feedback and adjusted the filters to be less aggressive while still removing adult content.
  • 🖌️ The release includes better anatomy and hands, and a wider range of art styles compared to the previous version.
  • 🌐 Stable Fusion 2.1 is an open-source release available on Hugging Face for users to download and experiment with.

Q & A

  • What is the main issue with the stable Fusion 2.0 release?

    -The main issue with stable Fusion 2.0 was that it produced poor-quality images, particularly of people, due to the restrictive filtering of the data set which reduced the number of people images available for training.

  • How did users adapt to the problems with stable Fusion 2.0?

    -Users adapted by learning new ways to prompt the model effectively, including the use of negative prompts to guide the image generation process.

  • What improvements have been made in stable Fusion 2.1?

    -In stable Fusion 2.1, improvements include support for a new prompting style, bringing back many prompts, more data and training, less restrictive filtering of the data set, and better handling of architecture, interior design, wildlife, and landscape scenes.

  • What was the impact of the data set filter on stable Fusion 2.0?

    -The data set filter in stable Fusion 2.0 dramatically reduced the number of images of people, leading to difficulties in generating high-quality images of individuals.

  • How does stable Fusion 2.1 address the issue of generating images of people?

    -Stable Fusion 2.1 has been improved to better handle the generation of images of people, with a focus on improved anatomy and hands, and a less aggressive filter to allow for a more diverse range of people images.

  • What are some of the new features introduced in stable Fusion 2.1?

    -New features in stable Fusion 2.1 include the ability to render non-standard resolutions, support for extreme aspect ratios, and the capability to produce images in a wider range of art styles.

  • How does stable Fusion 2.1 handle negative prompts?

    -Stable Fusion 2.1 still utilizes negative prompts, but the implementation varies depending on the tool used. For example, Dream Studio uses a vertical bar, while Automatic 11 11 uses a special box and Invoke uses brackets.

  • What is the role of negative prompts in the stable Fusion model?

    -Negative prompts are used to guide the model away from generating certain undesirable features, such as poorly drawn faces or hands, thus improving the overall quality of the generated images.

  • Where can one find the stable Fusion 2.1 model and related resources?

    -The stable Fusion 2.1 model and related resources can be found on Hugging Face, an open-source platform where users can access the weights and checkpoint files.

  • What is the significance of the aspect ratio changes in stable Fusion 2.1?

    -The aspect ratio changes in stable Fusion 2.1 allow for the generation of wider images and more vertical images, providing better results and more creative flexibility for users.

  • How does the stable Fusion 2.1 model handle adult content?

    -The stable Fusion 2.1 model continues to filter out adult content but does so in a less aggressive manner to reduce false positives and allow for a more diverse range of images.

Outlines

00:00

🚀 Introduction to Stable Fusion 2.1 and Its Improvements

This paragraph introduces the release of Stable Fusion 2.1, reflecting on the previous version's shortcomings and highlighting the improvements made in the new version. It mentions that the 2.1 version has been developed in response to the issues faced by users with version 2.0, particularly the poor results generated by the model. The summary emphasizes the new features of Stable Fusion 2.1, such as support for a new prompting style, the return of various prompts, and a more diverse and less restrictively filtered data set. It also notes the improvements in image quality for architecture, interior design, wildlife, and landscape scenes, as well as the model's enhanced ability to generate images of people.

05:01

🌟 Enhanced Features and User Feedback in Stable Fusion 2.1

The second paragraph delves into the specific features of Stable Fusion 2.1, including its ability to render high-quality architectural concepts, natural scenery, and images of people and pop culture. It discusses the adjustments made to the filters to reduce false positives and the model's fine-tuning to achieve a balance between rendering detailed environments and accurate human figures. The summary also covers the model's capability to handle non-standard resolutions, which allows for the creation of stunning vistas and widescreen images. Additionally, it touches on the various tools and methods available for using negative prompts to refine the output of the model.

10:03

📢 Conclusion and Invitation to Try Stable Fusion 2.1

In the final paragraph, the speaker concludes the discussion on Stable Fusion 2.1 by inviting the audience to try out the new version and share their experiences. It acknowledges that many users may still be using earlier versions like 1.4 or 1.5, but encourages them to explore the advancements made in 2.1. The speaker expresses hope that the improvements and new features will entice users to make the transition and offers a platform for users to provide feedback and share their thoughts on the updated model.

Mindmap

Keywords

💡stablefusion version 2.1

Refers to the updated version of the AI model 'Stable Fusion' which is a tool for generating images based on textual prompts. The 2.1 version is an improvement over the previous 2.0 version, addressing issues that users faced in the initial release. In the context of the video, this new version is expected to provide better image quality and a more diverse range of outputs, including improved handling of architectural and natural scenery, as well as people and pop culture images.

💡Fiasco

In the context of the video, 'fiasco' is used to describe the problematic release of Stable Fusion 2.0, which did not meet user expectations and resulted in poor image generation outcomes. The term implies a significant failure or disaster, emphasizing the extent of the issues faced by users during the 2.0 version's launch.

💡Negative prompts

Negative prompts are a technique used in AI image generation where specific undesirable features or elements are listed to guide the AI model away from producing those outcomes. In the context of the video, it is mentioned that users had to learn new ways to use negative prompts to achieve better results with Stable Fusion 2.0, and that the new version 2.1 still supports this prompting style.

💡Data set

The data set refers to the collection of data used to train the AI model. In the case of Stable Fusion, the data set consists of images that the model learns from to generate new images based on textual prompts. The video discusses how the data set was filtered and how the new version 2.1 has a more diverse and wide-ranging data set, which is expected to improve the quality and variety of the generated images.

💡Architecture

In the context of the video, 'architecture' refers to the category of images related to buildings and structures. The speaker mentions that Stable Fusion 2.1 has made improvements in generating images of architecture, indicating that the model is now better at rendering architectural concepts and scenes, which was one of the areas that users had issues with in the previous version.

💡Anatomy

Anatomy, in the context of the video, pertains to the accurate representation of the human body's structure in the images generated by the AI model. The speaker notes that Stable Fusion 2.1 has improved anatomy rendering, particularly in the depiction of hands, which was a problem area in the previous version of the model.

💡Art styles

Art styles refer to the various visual aesthetics and techniques used in creating images. The video discusses how Stable Fusion 2.1 is much better at handling a range of incredible art styles, indicating an enhancement in the model's versatility and ability to generate images that match different artistic preferences.

💡Non-standard resolution

Non-standard resolution refers to image dimensions that do not conform to typical standards or formats. In the context of the video, it is mentioned that the AI model now has the capability to render images with non-standard resolutions, allowing for the creation of wide vistas and epic widescreen images, which was not as easily achievable with previous versions.

💡Open source

Open source indicates that the software's source code is made publicly available, allowing users to access, use, modify, and redistribute the software freely. In the context of the video, it is mentioned that Stable Fusion is an open-source release, meaning that the model and its components are available on platforms like Hugging Face for users to download and utilize.

💡Dream Studio

Dream Studio is mentioned as a platform or tool associated with Stable Fusion, which users can utilize to interact with the AI model. It is implied that Dream Studio provides a user interface for generating images using the Stable Fusion model and may have specific features or prompts that are unique to it.

💡yaml

YAML, which stands for 'YAML Ain't Markup Language', is a human-readable data serialization format often used for configuration files and data exchange between languages with different data structures. In the context of the video, it is mentioned as a file type that users of the automatic 1111 tool may need to use with Stable Fusion 2.1.

Highlights

Stable Fusion 2.1 has been released as an improvement over the 2.0 version, addressing previous issues.

The 2.0 release was considered a fiasco due to the significant changes in the model's functionality and user dissatisfaction with the results.

The new 2.1 version promises better performance and user experience, with faster releases as part of their commitment.

Users adapted to the 2.0 model by learning new ways to prompt, which led to better results over time.

Stable Fusion 2.1 supports a new prompting style and brings back many prompts that were previously effective.

The update includes more data, more training, and less restrictive filtering of the data set, addressing user concerns.

The previous version had issues with generating images of people, which the new version aims to improve.

The model now offers a wider range of aspect ratios for images, allowing for more diverse outputs.

The 2.1 version is expected to perform better in rendering architecture, landscapes, and scenery.

The model has been fine-tuned to capture the best of both worlds, improving upon the previous version's limitations.

The release includes improved anatomy and hands, as well as a better range of art styles.

Non-standard resolutions are supported, enabling the creation of images with extreme aspect ratios and widescreen imaging.

The filters have been adjusted based on user feedback, aiming to strike a balance between removing adult content and allowing diverse images.

The new version delivers better results for images of people and pop culture, a significant improvement over the 2.0 version.

Stable Fusion 2.1 is available on Hugging Face for those interested in exploring the open-source release.

Users can expect an improved experience with various tools like Dream Studio, automatic 11 11, and invoke, each with their ways of handling negative prompts.

The transcript encourages users to try out Stable Fusion 2.1 and share their experiences, highlighting the ongoing development and community engagement.