DALL·E 3 vs MIDJOURNEY : quelle est la meilleure IA ? (comparatif complet)

Ludo Salenne
27 Oct 202341:29

TLDRThe video script presents a comprehensive comparison between Dali 3 and Midjourney, two AI image generation platforms. The comparison is based on various criteria such as understanding of requests, creativity, realism, and the ability to incorporate text into images. The test involves creating visuals based on prompts, such as a red car in the rain, a hip-hop dancer, a manga character, and a paradisiacal landscape. The results show that Dali 3 excels in creativity and illustration, particularly when used with Chat GPT, while Midjourney stands out for its ultra-realistic images. The video also highlights the importance of using English prompts for optimal results with these AI platforms.

Takeaways

  • 🔍 The video script is a comparative analysis of Dali 3 and Midjourney AI image generation capabilities.
  • 🌐 Dali 3 can be used on Chat GPT with a paid subscription to GPT4, while Midjourney requires access through Discord.
  • 🎨 Both AIs were tested on various criteria including understanding of requests, creativity, realism, and ability to include text in images.
  • 🏙️ In terms of understanding basic requests, both AIs performed well, but Dali 3 on Chat GPT slightly outperformed Midjourney.
  • 💃 For more detailed and creative requests, Dali 3 excelled in generating diverse and imaginative visuals, especially for logo creation.
  • 🖼️ When it came to generating ultra-realistic images, Midjourney delivered more lifelike and detailed visuals, particularly for a prompt involving a child running on the beach with a dog.
  • 🚫 Dali 3 could not generate images of public figures playing sports on Chat GPT due to usage restrictions.
  • 📸 Midjourney was able to create a visual of Cristiano Ronaldo playing basketball, although with some imperfections in the prompt.
  • 🌟 For integrating text into images, Dali 3 performed better, especially when using Chat GPT, providing clear and readable text within the visuals.
  • 📝 The importance of using English in prompts was highlighted, as both AIs translated requests into English before generating images.
  • 🔑 The final verdict from the comparison is that Dali 3 is superior for creativity and illustration, while Midjourney excels in ultra-realistic image generation.

Q & A

  • What is the main focus of the video script?

    -The main focus of the video script is to compare the performance of two AI image generation tools, Dali 3 and Midjourney, based on various criteria such as understanding of requests, creativity, realism, and the ability to include text in images.

  • How can Dali 3 be accessed according to the script?

    -Dali 3 can be accessed either through Chat GPT if you have a paid subscription to GPT-4, or through Bing, which is free and allows immediate image generation.

  • What is the significance of the language used in the prompts for the AI tools?

    -The language used in the prompts is significant because it affects the quality of the visuals generated by the AI. The script emphasizes the importance of using French for the test, but also mentions that using English might yield better results as the AI tools might translate and interpret the requests more accurately.

  • What are the different criteria tested in the video to determine the better AI tool?

    -The criteria tested in the video include understanding of the request, creativity level, realism level, and the ability to include text within an image.

  • What is the conclusion regarding the comparison between Dali 3 and Midjourney?

    -The conclusion is that Dali 3 seems to perform better in terms of creativity and illustration, while Midjourney excels in producing ultra-realistic visuals. The choice between using Dali 3 on Chat GPT or Bing depends on the subscription availability and the desired quality of outputs, with Chat GPT providing better quality visuals in the tested scenarios.

  • How does the script address the issue of generating images with public figures?

    -The script mentions that Dali 3 on Chat GPT cannot generate images of public figures due to usage conditions, while Midjourney can, although it may not always accurately represent the figure.

  • What is the importance of the prompt optimization in getting better results from AI tools?

    -Prompt optimization is crucial as it directly affects how the AI interprets and executes the user's request. Using precise and correctly translated language in the prompts can significantly improve the quality and relevance of the generated images.

  • What is the role of the user's proficiency in using AI tools in the effectiveness of the results?

    -The user's proficiency in formulating prompts and understanding the capabilities of the AI tools plays a significant role in the effectiveness of the results. The script suggests that for an average user, Dali 3 might be more accessible and yield better results than Midjourney.

  • How does the script suggest improving the user experience with AI tools?

    -The script suggests that understanding how AI tools interpret and process requests can help users improve their prompts. It also implies that choosing the right tool for the task (Dali 3 for creativity and Midjourney for realism) and the right platform (Chat GPT over Bing for Dali 3) can enhance the user experience.

  • What are the limitations encountered when using Dali 3 on Bing?

    -The limitations encountered when using Dali 3 on Bing include lower quality visuals and less detailed outputs compared to using Dali 3 on Chat GPT. The script suggests that while Bing is free, it may not provide the best results for image generation tasks.

  • What is the final verdict on which AI tool to use for different purposes according to the script?

    -The script concludes that for creative and illustrative tasks, Dali 3 is the better choice, while for ultra-realistic visuals, Midjourney is superior. For Dali 3, using it through Chat GPT is recommended over Bing for better quality outputs.

Outlines

00:00

🤖 Introduction to Dali 3 vs Midjourney AI Test

The video begins by introducing a test comparing Dali 3 and Midjourney, two AI platforms known for their image generation capabilities. The test aims to analyze various criteria such as performance, creativity, realism, and the ability to include text in images. The user outlines a plan to use simple prompts that are not optimized to reflect the expectations and needs of an average user. The importance of using the AI in French is highlighted, emphasizing the goal to cater to all users who will watch the comparison.

05:02

🌧️ Testing AI's Understanding of Basic Requests

The first test involves assessing the AI's comprehension of basic requests by asking them to create an image of a red car in a rainy cityscape. The results from Midjourney and Dali 3 are compared, with observations on the level of detail, the depiction of rain, and the urban environment. The user notes a slight advantage for Dali 3 on Chat GPT in terms of better translating the abstract concept of rain, but acknowledges that both AIs understood the request well.

10:03

💃 Assessing Creativity with Dancer and Crowd

The second test focuses on creativity by prompting the AIs to generate an image of a hip-hop dancer with a white cap, dancing among an impressed crowd. The user provides detailed feedback on the accuracy of the dancer's depiction, the color of the cap, and the expressions of the crowd. Dali 3 on Chat GPT is noted to have a better understanding of the 'impressed' aspect of the crowd, while Bing's Dali 3 offers a more creative take on the prompt, including a blurring effect to convey speed.

15:03

🌃 Evaluating Futuristic Manga Character

The third test asks the AIs to create a manga character in a futuristic Tokyo setting. The user critiques the results based on the depiction of the streets of Tokyo and the futuristic lighting. While Midjourney's response is deemed satisfactory, Dali 3 on Chat GPT is praised for its atmospheric portrayal of the futuristic concept, including attempts to illustrate futuristic elements like bubbles on the ground.

20:05

🌅 Comparing Creativity in Paradise Landscapes

The creativity of the AIs is further tested by asking them to create a painting of a paradisiacal landscape with an exceptional sunset. Midjourney's response is appreciated for its creative elements such as chairs and the inclusion of a sunset, indicating a paradisiacal setting. Dali 3, however, is noted for its textured painting aspect and diverse landscapes, although the user expresses some dissatisfaction with the representation of the sunset's exceptional quality.

25:07

👧 Realistic Image Creation: Girl and Dog on the Beach

The test shifts to creating ultra-realistic images with a prompt for a little girl running on the beach with her dog. Midjourney delivers highly realistic and detailed images that closely match the user's request. Dali 3 on Chat GPT also provides a variety of images with different angles and styles, but the user feels that the ultra-realistic aspect is better captured by Midjourney, making it the preferred choice for this specific task.

30:07

🖼️ Logo Design for Sneaker and Streetwear Store

The user challenges the AIs to design a logo for a sneaker and streetwear store. Midjourney's logos are creative and visually appealing but lack text integration. Dali 3 on Chat GPT, however, not only creates visually striking logos but also successfully incorporates readable text, making it the clear winner in this test. Bing's Dali 3, while providing a relevant logo, does not meet the same level of quality as Chat GPT's output.

35:09

🐻 Integrating Text into Images

The video explores the ability of the AIs to integrate text into images, starting with a request for an image of a bear holding a paper with a subscription message. Midjourney struggles with the request, misunderstanding the concept of 'paper'. Dali 3 on Chat GPT, in contrast, successfully incorporates text into the image, although it is not perfect. Bing's Dali 3 also manages to include text, but in English rather than French, highlighting the importance of language in AI interpretation.

40:09

🏀 Creating a Recognizable Public Figure

The final test asks the AIs to create a recognizable image of Cristiano Ronaldo playing basketball. Midjourney produces an image that, while not perfect, captures the essence of Ronaldo and his activity. However, Dali 3 on Chat GPT is unable to fulfill the request due to restrictions on creating images of public figures. This limitation is not present in Midjourney, which successfully creates the requested image.

📊 Conclusion and Recommendations

The user concludes that Dali 3 is superior for creativity and illustration, while Midjourney excels in ultra-realistic image generation. For Dali 3, the user recommends using it on Chat GPT rather than Bing due to better image quality and more accurate interpretations of prompts. The video ends with the user encouraging viewers to share their experiences and thoughts on the comparison, and promises more content on AI in future videos.

Mindmap

Keywords

💡Artificial Intelligence

Artificial Intelligence (AI) refers to the simulation of human intelligence in machines that are programmed to think and learn like humans. In the context of the video, AI is used to create images and visual content, showcasing its capabilities in understanding user requests and generating creative outputs.

💡Image Generation

Image Generation is the process by which computers or AI systems create new images from scratch based on given inputs or prompts. In the video, image generation is the core activity, where AI systems are tested for their ability to generate various types of images, from realistic scenes to creative illustrations.

💡Creativity

Creativity refers to the ability to produce original and imaginative ideas, often leading to the creation of new and unique content. In the video, creativity is a critical criterion for assessing the AI systems, as it is important to see how well they can generate novel and inventive images that go beyond straightforward interpretations of the prompts.

💡Realism

Realism in art and image generation refers to the accurate and true-to-life representation of subjects. In the context of the video, realism is a key aspect being evaluated, as the AI systems are tasked with generating images that closely resemble real-world scenes or photographs, such as an ultra-realistic photo of a child on the beach.

💡User Experience

User Experience (UX) refers to the overall experience a user has while interacting with a system or product, including how easy it is to use, the effectiveness of the interface, and the satisfaction derived from the interaction. In the video, UX is discussed in terms of the ease of using AI systems and the quality of the results they produce.

💡Prompt Optimization

Prompt optimization involves crafting prompts or instructions for AI systems in a way that maximizes the quality and relevance of the output. It requires understanding how the AI interprets and processes the prompts to guide it effectively. In the video, prompt optimization is crucial for achieving the desired results from the AI image generation systems.

💡Dali 3

Dali 3 is an AI system mentioned in the video that specializes in generating images based on user inputs. It is noted for its creative output and ability to produce a variety of visual content. The system is accessed through platforms like Chat GPT and Bing, and its performance is compared with another AI system, Midjourney.

💡Midjourney

Midjourney is an AI system discussed in the video that is capable of generating images. It is noted for its ability to produce highly realistic images, such as an ultra-realistic photo of a child on the beach. The system requires optimization of prompts to achieve the best results and is compared with Dali 3 for various image generation tasks.

💡Chat GPT

Chat GPT is a platform mentioned in the video that provides access to the Dali 3 AI system for image generation. It serves as an interface where users can input prompts and receive AI-generated images or text-based responses. The video compares the output quality and user experience of Dali 3 on Chat GPT with that of Bing.

💡Bing

Bing is a web search engine owned by Microsoft, which is mentioned in the video as another platform where users can access the Dali 3 AI system for image generation. The performance and output quality of Dali 3 on Bing are compared to those on Chat GPT throughout the video.

Highlights

Comparison between Dali 3 and Midjourney AI on various criteria.

Testing AI's understanding of basic and detailed requests.

Assessing creativity levels in generating images.

Examining the realism of generated images.

Capability to include text within images.

Performance of Dali 3 and Midjourney in creating a realistic car image.

Comparing the depiction of a hip-hop dancer in a crowd.

Evaluation of manga character creation.

Creating a paradisiacal landscape with a sunset.

Integrating text into images for logos and public figures.

Dali 3's inability to create public figures on Chat GPT.

Comparing the quality of ultra-realistic images of a little girl and a dog on the beach.

Creating a logo for a sneakers and streetwear store.

Testing the integration of text in images for a subscription call to action.

Assessing the ability to create personality visuals for public figures.

Recommendation to use Dali 3 for creativity and Midjourney for ultra-realism.

Advice on using Dali 3 on Chat GPT for better quality images.

Importance of using English in prompts for optimal AI performance.