BEST AI Art Generator? Dall E 2 vs Midjourney vs Stable Diffusion

Wade McMaster - Creator Impact
22 Dec 202207:04

TLDRThe video script offers a comparative analysis of three AI art platforms: Dolly 2, Mid-Journey, and Stable Diffusion. It evaluates their performance based on various prompts, highlighting the distinct styles each platform produces. Dolly 2 is noted for its photorealistic images, Mid-Journey for its artistic and striking visuals, and Stable Diffusion for its photorealistic yet slightly less impressive outputs. The ease of use and features of each platform are also discussed, with Dolly 2 having a user-friendly interface and Mid-Journey offering a more complex experience via Discord. The video invites viewers to share their preferences and experiences with these platforms.

Takeaways

  • 🎨 Three main AI art platforms discussed: Dali 2, Mid-Journey, and Stable Diffusion.
  • 🖼️ Dali 2 tends to create more photorealistic images.
  • 🌟 Mid-Journey produces more artistic and striking visuals.
  • 📸 Stable Diffusion provides decent photorealistic images but may not be as strong as the other two.
  • 💡 Straight-up prompts were used for comparison, but skill is required for better results.
  • 👁️ Dali 2's interface is user-friendly with features like in-painting and out-painting.
  • 🎭 Mid-Journey is more complex to use and operates through Discord.
  • 🆓 Stable Diffusion can be obtained for free but has a steeper setup process.
  • 📱 Dali 2, Mid-Journey, and Stable Diffusion each have their strengths and cater to different artistic preferences.
  • 🎨 The choice of platform depends on the desired style: photorealistic, artistic, or a mix of both.
  • 🗣️ Viewers are encouraged to share their preferences and experiences with the platforms.

Q & A

  • What are the three AI art platforms mentioned in the transcript?

    -The three AI art platforms mentioned are Dolly 2, Mid Journey, and Stable Diffusion.

  • How does the speaker describe the image of a beautiful woman with blue eyes created by Dali 2?

    -The speaker describes the image as almost photorealistic, with the only issue being that the teeth look a little bit funny.

  • What is the speaker's opinion on the oil painting of a Shaolin monk created by Mid Journey?

    -The speaker finds the Mid Journey image to be really stunning and appreciates its artistic and sharp quality.

  • Which platform does the speaker prefer for the outdoor sunny scene?

    -The speaker prefers Mid Journey for the outdoor sunny scene, as it created an artistic masterpiece with a painting style.

  • What is the speaker's overall assessment of the cyborg with glowing eyes created by the three platforms?

    -The speaker prefers Mid Journey's version of the cyborg as it offers a crazy and impressive video game style, while acknowledging that Dali 2's was simple and Stable Diffusion's was decent but not as glowing as expected.

  • How does the interface of Dali 2 compare to the other platforms mentioned?

    -The interface of Dali 2 is described as nicer, easier to use, and featuring great additional options like in painting and out painting.

  • What is the speaker's verdict on the 3D render of a turtle created by Mid Journey?

    -The speaker considers Mid Journey's 3D render of the turtle to be on another level compared to Dali 2's more basic and plain 3D render.

  • Which platform does the speaker find to be the most complex to set up, and is there a free version available?

    -Stable Diffusion is found to be the most complex to set up, but there is a free version available, although it might be more complex to use without an online interface.

  • What are the differences in the styles of images produced by the three platforms according to the speaker?

    -Dali 2 tends to produce the most photorealistic images, Mid Journey creates more artistic and well-composed images, and Stable Diffusion is better at creating photorealistic images but is considered second to both in artistic composition.

  • What is the speaker's final verdict on which platform is their favorite?

    -The speaker does not explicitly state a favorite platform, but they express a preference for the artistic style and quality of Mid Journey's outputs.

  • How can users engage with the speaker's content and share their own experiences with the platforms?

    -Users can engage by leaving comments below the video to share their thoughts, preferences, and experiences with the platforms discussed.

Outlines

00:00

🎨 Comparison of AI Art Platforms: Dolly 2, Mid-Journey, and Stable Diffusion

This paragraph compares the outputs of three main AI art platforms: Dolly 2, Mid-Journey, and Stable Diffusion. The comparison is based on the use of straightforward prompts to evaluate the quality and style of the results produced by each platform. Dolly 2 is noted for its photorealistic images, Mid-Journey for its stunning and artistic outputs, and Stable Diffusion for its standard oil painting-like results. The paragraph emphasizes that while there is skill involved in achieving desired results, the platforms offer varying styles and levels of realism straight out of the box.

05:01

🖼️ Artistic Interpretations: Styles and Preferences in AI Art

The second paragraph delves into the artistic interpretations of the AI platforms when given the same prompts. It discusses the strengths and weaknesses of each platform in terms of photorealism, artistic composition, and style. Dolly 2 is praised for its photorealistic capabilities, Mid-Journey for its artistic and visually striking images, and Stable Diffusion for its photorealistic attempts, though ranked second to Dolly 2 in this aspect. The paragraph also touches on the user interface and accessibility of the platforms, with Dolly 2 having a user-friendly interface and Mid-Journey offering high-quality imagery despite its complexity. Stable Diffusion is noted as the most complex to set up but available for free, prompting viewers to consider their preferences when choosing a platform.

Mindmap

Keywords

💡AI art platforms

AI art platforms refer to digital services or applications that utilize artificial intelligence algorithms to generate or enhance artwork. In the context of the video, these platforms are Dolly 2, Mid-Journey, and Stable Diffusion, which are used to create various types of images based on user prompts. They are central to the video's theme as they are the primary tools being compared for their effectiveness and style in producing art.

💡Photorealistic

Photorealistic refers to the quality of an image or artwork that closely resembles a photograph in terms of detail and realism. In the video, this term is used to describe the level of detail and lifelike appearance of the images generated by the AI art platforms, with Dolly 2 being noted for creating the most photorealistic images among the three platforms.

💡Artistic style

Artistic style encompasses the unique and creative visual language used by an artist or a platform to express ideas or aesthetics in a work of art. In the video, the artistic style is a significant factor in evaluating the output of the AI art platforms, with Mid-Journey being favored for its artistic and well-composed images that have a cooler style and appeal to the viewer's taste.

💡3D render

3D render refers to the process of generating a two-dimensional image or animation from a three-dimensional model. In the context of the video, it highlights the ability of the AI platforms to create images that have a 3D appearance. The comparison of the 3D render of a turtle among the platforms showcases their capability to produce depth and dimension in their outputs.

💡Ink sketch

An ink sketch is a drawing created using ink, characterized by bold lines and shading techniques. In the video, it is one of the styles that the AI platforms attempt, with the goal of achieving a rough yet visually appealing representation, such as the dragon sketch created by the platforms.

💡User interface

User interface refers to the point of interaction between a user and a computer or machine, including the design and usability of the controls and presentation. In the video, Dali 2 is noted for having a more user-friendly interface with features like in painting and out painting, which enhances the user experience and ease of creating AI art.

💡Discord

Discord is a communication platform designed for communities, including text, voice, and video chat. In the context of the video, Discord is mentioned as the medium through which Mid-Journey operates, indicating a more complex user experience compared to other platforms.

💡Free access

Free access implies that a service or product is available without any cost to the user. In the video, Stable Diffusion is mentioned as being available for free, which can be a significant factor for users considering the affordability and accessibility of the AI art platform.

💡Image composition

Image composition refers to the arrangement of visual elements within an artwork to create a harmonious and engaging final piece. In the context of the video, it is an essential aspect of evaluating the AI-generated images, with Mid-Journey being praised for its well-composed and artistically impressive scenes.

💡Photograph

A photograph is an image created by capturing light on a light-sensitive surface, often used to represent reality accurately. In the video, the term is used to describe the type of output the AI platforms aim to generate, with Dali 2 being recognized for its ability to create images that closely resemble actual photographs.

💡Facial elements

Facial elements refer to the individual features that make up a person's face, such as the eyes, nose, mouth, and overall structure. In the context of the video, the accuracy and detail of facial elements are crucial in evaluating the quality of the AI-generated images, particularly when comparing the platforms' ability to recreate a realistic human likeness.

Highlights

Three main AI art platforms are discussed: Dolly 2, Mid-Journey, and Stable Diffusion.

Dolly 2 creates almost photorealistic images with some minor imperfections.

Mid-Journey produces stunning images, though not as photorealistic as Dolly 2.

Stable Diffusion provides decent results but is considered the weakest among the three platforms.

Vision prep is noted as the best-looking image from the three platforms.

Dolly 2's image of a Shaolin monk resembles a standard oil painting.

Mid-Journey's Shaolin monk image is described as sharp, fantastic, and exciting.

Stable Diffusion's Shaolin monk image maintains a traditional oil painting look.

Dolly 2's outdoor scene image is very photo-like but not the most appealing.

Mid-Journey's outdoor scene is considered an artistic masterpiece with a painting style.

Stable Diffusion's outdoor scene is more photo-like, falling between Mid-Journey and Dolly 2.

Dolly 2's image of a cyborg with glowing eyes is simple and basic.

Mid-Journey's cyborg image is described as crazy and impressive with a video game style.

Stable Diffusion's cyborg image is good but the eyes aren't as glowing.

Dolly 2's image of a busy city street combines a painted and photographic look.

Mid-Journey's city street image is striking and artistic with vibrant colors.

Stable Diffusion's city street image is more photo-like but lacks the artistic flair of Mid-Journey.