Stable Diffusion 3 is HERE! MASSIVE Improvements, Turbo, 3D, Can Stability AI Survive?
TLDRStability AI has announced the release of Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API, in partnership with Fireworks AI. Despite recent challenges, including CEO departure and restructuring, the company has made significant strides with their new model, which is claimed to be on par with or surpass state-of-the-art text-image generation systems. The model is currently available through the API, with plans to release model weights for self-hosting to Stability AI members soon. A new membership model has been introduced, offering various tiers with commercial use permissions and faster GPU response times. The pricing for using the API is detailed, with costs ranging from 4 cents for Turbo images to 25 cents for upscaling to 4K. The community's response to the membership model and the impact on model fine-tuning and modifications on platforms like Hugging Face remain to be seen.
Takeaways
- 🚀 **Stable Diffusion 3 Launch**: Stability AI has released Stable Diffusion 3 and its Turbo version on their developer platform API.
- 🔄 **CEO Departure**: The company's CEO, Imod, has left to work on a crypto project, which has been one of the recent challenges for Stability AI.
- 💸 **Financial Struggles**: Stability AI has faced financial difficulties, including issues with paying their GPU bills to Amazon and corporate restructuring.
- 🤝 **Partnership with Fireworks AI**: Stability AI has partnered with Fireworks AI for API orchestration, aiming to deliver an enterprise-grade solution with high service availability.
- 📈 **Performance Claims**: Stable Diffusion 3 is claimed to be equal to or better than state-of-the-art text-image generation systems like Dolly 3 and mid-Journey V6.
- 💡 **New Architecture**: The new multimodal diffusion Transformer architecture uses separate sets of weights for image and language, enhancing text understanding and spelling capabilities.
- 📊 **Pricing and Access**: A Stability AI membership is required to access the model weights, which could be a new revenue stream for the company.
- 🌐 **API Availability**: While the model is only available via API currently, an advanced open release is in the works.
- 📉 **Pricing Concerns**: The cost of using Stable Diffusion 3 is noted to be roughly ten times that of SDXL when used through the same API, raising questions about computational intensity and availability of GPUs.
- 🔗 **Community Reaction**: There is curiosity about how the community will respond to the new licensing model and the requirement of a membership for model access.
- ⏱️ **Timeline for Model Weights**: Stability AI plans to make the model weights available for self-hosting to members in the near future, although the exact timeline is unclear.
Q & A
What has been the recent situation with Stability AI?
-Stability AI has faced challenges in the past few months, including the departure of their CEO, corporate restructuring, and issues with paying their GPU bills to Amazon and Corweave.
What is the significance of the announcement of Stable Diffusion 3 and Stable Diffusion 3 Turbo?
-The announcement signifies that despite recent struggles, Stability AI has made significant progress with the release of these new models on their developer platform API, which is a positive step for the company.
How does Stability AI plan to deliver the models through their API?
-Stability AI has partnered with Fireworks AI, which is described as the fastest and most reliable API platform in the market, to deliver the Stable Diffusion 3 and Stable Diffusion 3 Turbo models.
What is the requirement for accessing the model weights of Stable Diffusion 3?
-To access the model weights, a Stability AI membership is required. This is a new approach that may be an attempt to generate additional revenue.
What are the capabilities of Stable Diffusion 3 as mentioned in the script?
-Stable Diffusion 3 is capable of creating massive scenes with text, and the scenes are more cohesive than ever. It also has a new multimodal diffusion Transformer architecture that improves text understanding and spelling capabilities.
What is the current availability of Stable Diffusion 3?
-As of the time of the script, Stable Diffusion 3 is only available through the API. Stability AI is working on an advanced release for its open release.
What is the pricing structure for using Stable Diffusion 3?
-The pricing for using Stable Diffusion 3 is based on a credit system. It costs roughly 7 cents per image generated with the standard model and around 4 cents per image with the Turbo model. Other features like upscaling to 4K, in-painting, and video generation have different pricing.
What is Stability AI Membership?
-Stability AI Membership is a new product offering that allows users to access various models hosted online, including image, video, language, and 3D models. It offers different tiers with varying levels of access and commercial usage rights.
How does Stability AI's partnership with Fireworks AI benefit the API service?
-The partnership with Fireworks AI allows Stability AI to deliver an enterprise-grade API solution with 99.9% service availability, which improves the reliability and robustness of their service.
What are the potential implications of the new licensing model for the community?
-The new licensing model may affect how people fine-tune and post modifications of the models, as well as the quantizations on platforms like Hugging Face. It might also indicate a shift in Stability AI's relationship with Hugging Face, given Amazon's involvement with both companies.
What is the community's reaction to the membership model for accessing Stable Diffusion 3?
-The community's reaction is not detailed in the script, but it is suggested that there may be mixed feelings about the membership requirement and the potential costs associated with accessing the model weights.
Outlines
🚀 Stability AI's New Release and Corporate Restructuring
Stability AI, a key player in the open-source generative AI space, has recently faced challenges including the departure of their CEO, corporate restructuring, and issues with unpaid bills. Despite these hurdles, they've announced the release of Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API, in partnership with Fireworks AI. This move aims to improve API performance and reliability. The announcement also hints at a potential new revenue stream through a required Stability AI membership for model weights, which could help address financial difficulties. The models demonstrated impressive capabilities in creating detailed and cohesive scenes from text. However, the release lacks some initially promised features, and the pricing model is significantly different, raising questions about computational intensity and availability of GPU resources.
💳 Introducing Stability AI Membership and Pricing Structure
Stability AI has introduced a new membership model, likened to Adobe's Creative Cloud, offering access to various models including image, video, language, and 3D models hosted online. The membership tiers provide different levels of access, with professional membership allowing commercial use. There's a notable absence of a current API endpoint for 3D models, and the definition of 'core models' is limited to those released up until the announcement date. The 'Stable Image Core' is the API used to access Stable Diffusion 3, with a pricing structure that offers different costs for various tasks such as image generation, outpainting, inpainting, upscaling, and video generation. The efficiency and cost of Stable Diffusion 3 are highlighted, with implications for the community's approach to fine-tuning and modifications of the models. The potential impact of the new licensing model on the open-source community and the company's relationship with platforms like Hugging Face is also discussed.
Mindmap
Keywords
💡Stability AI
💡Stable Diffusion 3
💡Stable Diffusion 3 Turbo
💡Fireworks AI
💡API
💡Model Weights
💡Multimodal Diffusion Transformer
💡Stability AI Membership
💡Commercial Access
💡Enterprise Grade API Solution
💡Regulation
Highlights
Stability AI has released Stable Diffusion 3 and Stable Diffusion 3 Turbo on their developer platform API.
The company has partnered with Fireworks AI for API orchestration and delivery.
Stable Diffusion 3 is expected to make model weights available for self-hosting with a Stability AI membership soon.
Stable Diffusion 3 demonstrates the ability to create massive, cohesive scenes from text.
The release includes a new multimodal diffusion Transformer architecture, improving text understanding and spelling capabilities.
Stable Diffusion 3 is claimed to be equal to or outperform state-of-the-art text-image generation systems.
The model is currently only available via API, with an advanced open release in the works.
Stability AI membership will be required to access the raw model weights.
The pricing for using Stable Diffusion 3 is significantly lower than its predecessor, SDXL.
Stable Diffusion 3 Turbo offers a slightly cheaper alternative for certain tasks like inpainting and outpainting.
Upscaling to 4K with Stable Diffusion 3 Turbo costs around 25 cents per image.
The new licensing model may affect how the community fine-tunes and modifies the models.
Stability AI's membership model is likened to Adobe's Creative Cloud, offering access to various models hosted online.
Commercial use of the models is restricted to professional and enterprise membership tiers.
The release comes amidst corporate restructuring and financial challenges for Stability AI.
The efficiency and cost of Stable Diffusion 3 raise questions about GPU availability and deployment strategies.
Stability AI's partnership with Fireworks AI aims to deliver an enterprise-grade API solution with high service availability.
The community's reaction to the membership model and the future of Stability AI remain to be seen.