BETTER than PROMPTS - The Future of AI Composition

Olivio Sarikas
3 May 2023 · 09:22

TLDR: In this video, the presenter, the self-styled 'king of AI', demonstrates advanced techniques for creating detailed and controlled AI-generated compositions. The focus is on creating a compelling image of a female hero with angel wings against a burning city backdrop. The process involves separating the subject from the background, manipulating light and color through a Curves adjustment, and integrating additional elements like wings and a cityscape. The presenter also shares tips on refining the image in Affinity Photo and enhancing it further with settings in the AI tool, such as the Euler sampler, denoising strength, and CFG scale. The video concludes with a second method that combines a portrait with an abstract background, showcasing the versatility and creativity possible with AI composition tools.

Takeaways

  • 🎨 **Image Composition Control**: The speaker emphasizes the need for more control over image composition elements, suggesting that traditional text prompts may not suffice for detailed creative work.
  • 🖼️ **Describing with Images**: To better describe an image, the process involves using another image, highlighting the power of visual references in the creative process.
  • 🚫 **Layer Separation**: The script demonstrates how to separate the foreground from the background in an image, a crucial step for detailed editing and composition.
  • 🖌️ **Selection and Masking**: The use of selection tools and brushes is fundamental for refining the image and isolating specific elements, such as the woman and the rooftop.
  • 🧚‍♀️ **Adding Elements**: The process of adding angel wings to a character is shown, illustrating how to integrate new visual elements into an existing composition.
  • 🌆 **Background Manipulation**: The speaker discusses how to incorporate a burning city into the background, showing the importance of context in creating a compelling scene.
  • 🔆 **Adjusting Brightness and Contrast**: The use of curves adjustment to modify the brightness and contrast of elements, such as the wings, is explained to achieve the desired visual effect.
  • 📐 **Resolution and Export**: The importance of setting the correct resolution and exporting the final composition is highlighted, with specific settings provided for optimal results.
  • 📝 **Prompt Refinement**: The script details how to refine prompts for AI to generate more accurate and detailed images, such as specifying a 'female hero with blonde hair'.
  • 🔍 **Upscaling and Denoising**: Techniques for upscaling images and choosing a denoising strength, which controls how far the generated result may depart from the input image, are discussed, including the specific settings used.
  • 🧩 **Combining Images**: The method of combining a portrait with an abstract background is demonstrated, showcasing how to blend different images to create a unique composition.

Q & A

  • What is the main theme of the composition the speaker wants to create?

    -The speaker wants to create a composition featuring a female hero with angel wings standing on the top of a roof, with a burning city in the background.

  • How does the speaker suggest separating the foreground from the background in an image?

    -The speaker suggests using the selection brush in add mode to select the woman and the rooftop, then refining the selection and applying it to separate the foreground from the background.

  • What is the process for adding angel wings to the female hero in the composition?

    -The process involves opening an image with wings, copying the desired wing, pasting it into the main image, positioning it on the hero's back, duplicating the layer, flipping it horizontally, and adjusting the brightness and contrast of the wings using the Curves adjustment.

  • How does the speaker describe the method of adding a burning city to the background?

    -The speaker describes the method as placing the burning city image in the composition, resizing it as needed, and positioning it behind the model and wings to create a specific scene with a nice contrast.

  • What are the settings the speaker uses for exporting the composition?

    -The composition is exported with the long side reduced to about 1600 pixels. The generation settings are the Euler sampler with 25 steps, restore faces enabled, a resolution of 512 by 768, a CFG scale of 7, a denoising strength of 0.4, and a batch size of 8.

  • How does the speaker propose to enhance the hero's face in the composition?

    -The speaker proposes to send the image to inpaint to paint out the face of the character and then use a new prompt to generate a beautiful face, using a mask and denoising with a batch size of 8.

  • What is the method for combining a portrait with an abstract background?

    -The method involves opening the portrait in Affinity Photo, placing the abstract background, and using the selection brush to remove the original background of the portrait. Then, the abstract background is re-enabled and positioned to complement the portrait.

  • How does the speaker use text to image for creating a composition?

    -The speaker uses text to image by loading the portrait with the abstract background into Automatic 1111, using a prompt that describes the desired composition, and applying various control nets and settings to achieve the final result.

  • What is the sampling method used by the speaker for creating the composition?

    -The sampling method used is DPM++ 2S a Karras with sampling steps set to 25.

  • How does the speaker adjust the brightness and contrast of the wings in the composition?

    -The speaker adjusts the brightness and contrast by selecting both wings, grouping them, and then using the Curves adjustment to push the upper right part (bright areas) up and pull the lower part (dark areas) down to create an S curve.

  • What is the speaker's approach to fixing imperfections in the composition?

    -The speaker's approach to fixing imperfections involves using the inpaint tool to address areas like the eyes and then upscaling the image for a refined final result.

  • Why did the speaker delete the part of the video where a bear holding a red balloon and a girl sitting in his lap were featured?

    -The speaker deleted that part of the video by accident and offers to recreate it during a live stream if the viewers are interested.

Outlines

00:00

🎨 'Creative Image Composition with AI'

The first paragraph introduces a method for creating compelling compositions using AI, with a focus on having more control over elements such as composition, colors, and details. The speaker, referring to themselves as the 'king of AI,' guides the audience through a step-by-step process of creating an image of a female hero with angel wings against a burning city backdrop. The process involves separating the foreground from the background, adding angel wings to the character, adjusting the brightness and contrast, and placing a burning city in the background. The paragraph concludes with exporting the image and adjusting settings for resolution and composition details.
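
The export step reduces the long side of the composition to about 1600 pixels. As a minimal sketch of that arithmetic, the helper below scales a canvas so its longer edge hits a target while keeping the aspect ratio. The function name `fit_long_side` and the sample canvas sizes are illustrative assumptions, not values from the video.

```python
from typing import Tuple


def fit_long_side(width: int, height: int, long_side: int = 1600) -> Tuple[int, int]:
    """Scale (width, height) so the longer edge equals long_side, keeping the aspect ratio."""
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    scale = long_side / max(width, height)
    # Round to whole pixels and never return a zero-sized edge.
    return max(1, round(width * scale)), max(1, round(height * scale))


# Hypothetical 2:3 portrait canvas, matching the 512 by 768 aspect ratio used later.
export_size = fit_long_side(4000, 6000)
print(export_size)
```

Keeping the exported image at the same aspect ratio as the generation resolution avoids stretching when the image is loaded into the AI tool.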

05:03

🖼️ 'Image Upscaling and Text-to-Image Techniques'

The second paragraph demonstrates advanced techniques for image manipulation and text-to-image conversion using AI. It begins with a discussion on combining a portrait with an abstract background in Affinity Photo, removing the original background, and integrating a new one. The process includes refining selections, masking, and exporting the image with specific dimensions. The paragraph then shifts to text-to-image conversion, showcasing how to input prompts and use control networks for depth and resolution, resulting in a composition of a beautiful African queen with a colorful abstract background. The speaker also mentions an accidental deletion of a part of the video and offers to recreate it in a live stream, ending with a call to action for likes and future engagement.

Keywords

💡AI Composition

AI Composition refers to the use of artificial intelligence to create or compose artwork, music, or other creative content. In the context of the video, it is about using AI to generate images and compositions that are more controlled and detailed than what can be achieved with simple prompts. The video demonstrates how AI can be used to create a complex scene with a female hero, angel wings, and a burning city in the background.

💡Rasterize

Rasterize is a term used in graphic design and image editing, which involves converting vector graphics into a raster or bitmap image. In the video, the speaker uses the 'Rasterize' function to turn the selected layer into a bitmap so that it can be manipulated with a selection brush, which is a crucial step in separating the woman and the rooftop from the background.

💡Selection Brush

A Selection Brush is a tool in photo editing software that allows users to 'paint' a selection around an object or area in an image. The video demonstrates the use of the Selection Brush in 'add mode' to select the woman and the rooftop, which is an essential technique for isolating elements within a composition for further editing.

💡Refine

Refine is a feature in image editing software that improves the accuracy of a selection. After using the Selection Brush, the speaker clicks on 'Refine' to fine-tune the selection, ensuring that the woman and the rooftop are cleanly separated from the rest of the image, which is important for a professional-looking composition.
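
Conceptually, a refined selection acts as a per-pixel mask: 1.0 where the subject is fully selected, 0.0 where it is not, and values in between along soft edges. The sketch below shows the standard blend that places a masked foreground over a new background. It uses tiny grayscale grids as stand-in data and is not Affinity Photo's actual implementation; the name `composite` is an assumption for illustration.

```python
def composite(foreground, background, mask):
    """Blend same-sized grayscale images (lists of rows, values 0-255) using a 0.0-1.0 mask."""
    result = []
    for fg_row, bg_row, m_row in zip(foreground, background, mask):
        result.append([
            # Mask 1.0 keeps the foreground pixel, 0.0 keeps the background pixel.
            round(fg * m + bg * (1.0 - m))
            for fg, bg, m in zip(fg_row, bg_row, m_row)
        ])
    return result


# Stand-in data: a bright subject, a dark background, and a mask with soft edges.
fg = [[200, 200], [200, 200]]
bg = [[20, 20], [20, 20]]
mask = [[1.0, 0.5], [0.0, 0.25]]
blended = composite(fg, bg, mask)
print(blended)
```

The intermediate mask values are what a refine step improves: they decide how smoothly hair and other fine edges fade into the new background.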

💡Wings

In the context of the video, 'wings' refer to the angel wings that the female hero is to be depicted with. The wings are added to the composition to enhance the theme of the image and to give the character a majestic and heroic appearance. The speaker selects and adjusts the wings' brightness and contrast to integrate them seamlessly into the scene.
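
The contrast adjustment applied to the wings is an S curve: bright areas are pushed up and dark areas are pulled down. As a rough illustration of that shape, the sketch below uses a smoothstep formula on values normalized to 0.0-1.0. This formula is a stand-in chosen for simplicity; a Curves tool lets the user place control points by hand, and the `strength` parameter here is hypothetical.

```python
def s_curve(value: float, strength: float = 1.0) -> float:
    """Map a 0.0-1.0 brightness through an S-shaped contrast curve."""
    v = min(max(value, 0.0), 1.0)
    # Smoothstep: leaves 0, 0.5 and 1 fixed, darkens shadows, brightens highlights.
    smooth = v * v * (3.0 - 2.0 * v)
    # strength 0.0 returns the input unchanged, 1.0 applies the full curve.
    return (1.0 - strength) * v + strength * smooth


for sample in (0.0, 0.25, 0.5, 0.75, 1.0):
    print(sample, s_curve(sample))
```

Values below the midpoint come out darker and values above it come out brighter, which is the contrast boost the S curve is meant to produce.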

💡Burning City

The 'burning city' is a dramatic element in the background of the composition that adds to the apocalyptic theme of the image. It serves as a contrasting backdrop to the hero figure, highlighting her as a beacon of hope amidst chaos. The video shows how to incorporate this element into the composition by placing it behind the hero and the wings.

💡CFG Scale

CFG Scale refers to the classifier-free guidance scale, a parameter in AI image generation that controls how strongly the generated image follows the prompt. Higher values stick more closely to the prompt, while lower values leave the model more freedom. In the video, the speaker uses a CFG scale of 7 in the settings.
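
In classifier-free guidance, the model makes two noise predictions at each step, one with the prompt and one without, and the scale sets how far the result is pushed toward the prompted one. A minimal sketch of that combination follows, with plain lists standing in for the model's predictions; the function name `apply_cfg` and the sample numbers are illustrative assumptions.

```python
def apply_cfg(uncond, cond, scale=7.0):
    """Combine unconditional and prompt-conditioned predictions with a guidance scale."""
    # scale 1.0 returns the conditioned prediction; larger values exaggerate
    # the difference between the prompted and unprompted predictions.
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Stand-in predictions, not real model output.
unconditioned = [0.0, 1.0]
conditioned = [1.0, 1.0]
guided = apply_cfg(unconditioned, conditioned, scale=7.0)
print(guided)
```

Where the two predictions agree the scale has no effect; where they differ, a scale of 7 amplifies the prompt's influence sevenfold.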

💡Denoising

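In the context of the video, denoising refers to the denoising strength setting of image-to-image generation, which controls how much the AI is allowed to change the input image: low values stay close to the original composition, while high values let the result drift further from it. The speaker uses a denoising strength of 0.4, so the generated images keep the layout of the composed image while gaining new detail.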
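
In image-to-image generation, the denoising strength sets how much noise is added to the input before it is regenerated, and many implementations also scale the number of sampling steps that actually run by the same value. The sketch below assumes the common `int(steps * strength)` rule; the exact behaviour depends on the tool and its options, so treat the numbers as an estimate rather than what any particular version does.

```python
def img2img_steps(steps: int, denoising_strength: float) -> int:
    """Estimate how many sampling steps run for a given denoising strength."""
    strength = min(max(denoising_strength, 0.0), 1.0)
    # Assumed rule: only the last `strength` fraction of the schedule is sampled.
    return min(int(steps * strength), steps)


# Settings mentioned in the video: 25 steps at a denoising strength of 0.4.
effective = img2img_steps(25, 0.4)
print(effective)
```

Under this assumption a low strength both preserves more of the input and finishes faster, which is one reason a moderate value suits refining an already composed image.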

💡Batch Size

Batch Size in the context of AI image generation refers to the number of images processed at one time. The video mentions a batch size of 8, which means that the AI system generates eight images simultaneously, allowing for a more efficient workflow.

💡Image to Image

Image to Image is a process in AI where an existing image is used as a starting point to generate a new image with certain modifications or enhancements. The video demonstrates this by using an initial composition and then refining it through upscaling and additional AI processing to create a high-resolution final image.
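
For readers who drive the same workflow from a script, the sketch below gathers the settings mentioned in the video into a request body for an image-to-image call. It only builds the dictionary and sends nothing. The field names follow the optional HTTP API of the Automatic 1111 web UI (started with `--api`, endpoint `/sdapi/v1/img2img`) and should be checked against the `/docs` page of the running install; the prompt text, the placeholder image bytes, and the `restore_faces` value are assumptions.

```python
import base64
import json


def build_img2img_payload(image_bytes: bytes, prompt: str) -> dict:
    """Build an image-to-image request body from the settings mentioned in the video."""
    return {
        # The input image is sent as a base64-encoded string.
        "init_images": [base64.b64encode(image_bytes).decode("ascii")],
        "prompt": prompt,
        "sampler_name": "Euler",
        "steps": 25,
        "width": 512,
        "height": 768,
        "cfg_scale": 7,
        "denoising_strength": 0.4,
        "batch_size": 8,
        "restore_faces": True,  # assumption: face restoration enabled
    }


# Placeholder bytes stand in for the exported composition; the prompt is illustrative.
payload = build_img2img_payload(
    b"placeholder image bytes",
    "female hero with blonde hair, angel wings, burning city",
)
print(json.dumps({k: v for k, v in payload.items() if k != "init_images"}, indent=2))
```

Keeping the settings in one function makes it easy to rerun the same composition with a different denoising strength or batch size and compare the results.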

💡ControlNet

ControlNet is a feature in some AI image generation systems that allows for more precise control over the output by conditioning it on additional input images, such as depth maps or poses. The video shows how to use ControlNet with depth and OpenPose to guide the AI in generating a composition with a specific style and compositional elements.

Highlights

The speaker introduces a new approach to AI composition that offers more control over elements, composition, and colors than traditional prompts.

Describing an image with another image is suggested as the best way to achieve detailed control in AI composition.

The process of separating the foreground from the background using selection and rasterization techniques is demonstrated.

Adding angel wings to a female hero figure is shown, with adjustments made for lighting and contrast.

The integration of a burning city background into the composition to create a dramatic scene is explained.

Exporting the composition with specific resolution and prompt parameters for further refinement is discussed.

The use of the Euler sampler, steps, restore faces, CFG scale, denoising strength, and batch size in the settings for image generation is highlighted.

A method for painting out and improving the face of a character in the composition using a new prompt is shown.

Upscaling the image with tile resampling and control net techniques for higher quality results is demonstrated.

Combining a portrait with an abstract background to create a unique composition is featured.

Techniques for removing and replacing backgrounds, as well as adding design elements like halos, are presented.

Exporting the composition at a reduced size and using text-to-image methods for further creative input is explained.

A trick for loading the composition into Automatic 1111 for additional creative control is shared.

Using ControlNet for depth and OpenPose to refine the composition is demonstrated.

Inpainting and upscaling the final image to achieve the stunning result is the final step in the process.

An accidental deletion of a part of the video is mentioned, offering a live stream as an alternative for the missed content.

The video concludes with a call to action for viewers to like the video and a promise to see them soon in future content.