A Legit Sora Competitor is Here + Testing AI Actor Emotions

Curious Refuge
4 May 2024 19:48

TLDR: This week's AI film news introduces a new Sora competitor, Vu, which generates high-quality 16-second 1080P videos from text prompts. Adobe's Video Giga GAN tool is highlighted for its ability to upscale low-resolution footage to realistic high-resolution video. Synthesia's expressive AI avatars, which respond emotionally to text inputs, are showcased. The news also covers Nvidia's large-scale NeRF scanning technology, which could revolutionize virtual production in Hollywood. Additionally, Adobe and UCSD's research on prompt-based image editing, Render's AI image editing tools, and a Chinese team's prompt-based 3D model editing are discussed. The video concludes with an AI trailer competition announcement, upcoming AI film events, and a look at the latest AI-assisted films.

Takeaways

  • 🚀 A new Sora competitor called Vu has emerged, offering video generation with prompts and producing high-quality 1080P videos up to 16 seconds long.
  • 🎥 Adobe's Video Giga GAN tool can upscale low-resolution footage to high resolution while maintaining realistic facial textures and details.
  • 🤖 Synthesia's Expressive AI avatars can now respond emotionally to text prompts, changing their facial structure based on the words spoken.
  • 📈 Synthesia recently raised $90 million, indicating significant investment in the development of AI avatars.
  • 🧍‍♂️ A video of LinkedIn co-founder Reid Hoffman in conversation with an AI clone of himself offered a glimpse of how AI could be adopted across society.
  • 🌐 Nvidia's research team has developed the ability to perform large-scale NeRF scans, allowing areas up to 25 square kilometers to be scanned and rendered in real-time.
  • 🎨 Adobe and the University of California San Diego have partnered to create prompt-based image editing, enabling contextual changes to images through text descriptions.
  • 👕 Render's online application allows users to change clothing in images using AI prompts, with results that are often seamless and consistent with the scene.
  • 🦕 A team from China has developed a method to edit 3D models using prompts, including generating models from conversation and making edits through painting.
  • 📽 An AI film news trailer competition was launched in partnership with Submachine, with the chance to win an Apple Vision Pro.
  • 🎬 The film 'To Dear Me', which integrated live-action footage with AI techniques, won the Beijing Film Festival 2024, showcasing the potential of AI in filmmaking.

Q & A

  • What is the name of the new Sora competitor mentioned in the video?

    -The new Sora competitor mentioned in the video is called Vu.

  • How does Adobe's Video Giga GAN tool enhance video footage?

    -Adobe's Video Giga GAN upscales video footage to a higher resolution in a very realistic way, improving facial textures and hair detail while avoiding the plastic or overly smooth skin common in AI-upscaled footage.

  • What is the capability of the expressive AI avatars developed by Synthesia?

    -Synthesia's expressive AI avatars can emotionally respond and change their facial structure based on the text or words that are input into the system.

  • What is the significance of the large-scale NeRF scans developed by Nvidia?

    -Nvidia's large-scale NeRF scans make it possible to scan areas up to 25 square kilometers in size and render them in real time on a computer. This technology could be used for virtual production sets in Hollywood, video games, and other applications.

  • How does the AI image editing tool from Render work?

    -The AI image editing tool from Render allows users to upload an image and make specific changes to it using AI prompts. For example, users can change clothing or other elements within an image by describing what they want to change.

  • What is the process like when using Sora to create a film, as described by the creators of the Airhead AI video?

    -The process of using Sora is described as similar to a slot machine: you input a prompt and hope for the desired outcome. There is an extensive post-processing stage, which includes color correction, asset comping, and rotoscoping. Additionally, there is a high render-to-use ratio, with 300 shots generated for every shot that makes it into the final film.

  • What is the name of the film that won the Beijing Film Festival 2024 and how was it created?

    -The film that won the Beijing Film Festival 2024 is called 'To Dear Me'. It was created using AI assistance, integrating live-action footage with style transfers and other AI concepts to produce a stylized film.

  • What is the name of the AI film that explores the concept of a cat being a gladiator?

    -The AI film that explores the concept of a cat being a gladiator is called 'Cator'.

  • How does the AI tool from China allow users to edit 3D models?

    -The AI tool from China allows users to generate a 3D model through conversation by typing in a prompt. It also enables users to edit the 3D model by simply painting and indicating the desired changes, such as opening a dinosaur's mouth or transforming a banana into a whale.

  • What is the name of the online application that provides advanced AI control features through a clean online platform?

    -The online application that provides advanced AI control features through a clean online platform is called Render.

  • What is the name of the AI video tool that allows for the creation of multi-shot compilations?

    -The AI video tool that allows for the creation of multi-shot compilations is called Hyper.

  • What is the process of creating AI video generations with Vu like?

    -Creating AI video generations with Vu involves typing in a prompt to generate videos up to 16 seconds long in 1080P resolution. The tool is capable of creating dynamic AI movement and cinematic scenes, although it is noted that the quality is not as high as Sora.
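
The 300:1 render-to-use ratio quoted for Sora in the answers above can be made concrete with a little arithmetic. The following is only a sketch: the ratio comes from the summary, while the function name and the 40-shot film length are hypothetical values chosen for illustration.

```python
def clips_to_generate(final_shots: int, render_to_use_ratio: int = 300) -> int:
    """Estimate how many clips must be generated to end up with
    `final_shots` usable shots at the given render-to-use ratio."""
    if final_shots < 0 or render_to_use_ratio <= 0:
        raise ValueError("final_shots must be >= 0 and the ratio must be > 0")
    return final_shots * render_to_use_ratio


if __name__ == "__main__":
    # Hypothetical short film with 40 shots in the final cut.
    shots_in_final_cut = 40
    total = clips_to_generate(shots_in_final_cut)
    print(f"{shots_in_final_cut} final shots -> about {total} generated clips")
```

At that ratio only about one clip in 300 survives into the edit, which is why the creators compare prompting to pulling a slot machine lever.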

Outlines

00:00

🎬 AI Film News: New Tools and Developments

The video script introduces various AI advancements in the film industry. It discusses a new Sora competitor, AI actors that can change emotions based on dialogue, and Adobe's Video Giga GAN tool for upscaling footage. The script also mentions Synthesia's expressive AI avatars, which respond emotionally to text inputs. Synthesia's capabilities are demonstrated through examples, including changing the background of a video. The segment also explores an AI clone of Reid Hoffman and large-scale NeRF scans by Nvidia, which could revolutionize virtual production sets and video games.

05:04

🖼️ AI Image and 3D Model Editing Innovations

This paragraph delves into AI-driven image editing, with Adobe and the University of California San Diego showcasing prompt-based image editing. The technology allows for contextual changes to images through descriptive prompts. Render's online application is highlighted for its advanced AI control features, enabling users to change clothing and other elements in images with ease. Additionally, a team from China presents a method for editing 3D models using prompts, offering new ways to generate and modify 3D models interactively.

10:05

📹 Emerging AI Video Tools and Industry Interviews

The script introduces Vu, a new AI video tool that rivals Sora, capable of generating videos from text prompts. Vu's video quality and cinematic potential are discussed, with examples provided. An interview with the creators of an AI video demo for Sora is mentioned, detailing the process and challenges of using Sora, including the need for extensive post-processing and the high render-to-use ratio. The paragraph also invites participation in an AI trailer competition and highlights Hyper's new gallery showcasing AI projects, emphasizing Hyper's capabilities as an AI video generator.

15:08

🌟 AI Filmmaking Courses and Community Engagement

The final paragraph promotes AI advertising and filmmaking courses, encouraging networking with artists from top studios and companies. It invites viewers to join the AI community and attend global meetups hosted by the Curious Refuge team. Upcoming events, including the Cannes Film Festival and AI on the Lot, are mentioned. The script also discusses a conversation with Nicholas Newbert, an AI art director, and his experiences with AI in the film industry. Finally, the video highlights several AI-assisted films, such as 'To Dear Me,' 'Son of Life,' 'Cator,' and a scene from AEL Art, emphasizing the growing impact of AI in filmmaking.

Keywords

💡Sora

Sora is an AI video generation tool that allows users to create videos from textual prompts. It is mentioned as a benchmark for other AI video tools, indicating its significance in the field. In the script, a new competitor to Sora called 'Vu' is introduced, suggesting the evolving nature of AI video technology.

💡AI Actor Emotions

AI Actor Emotions refers to the ability of AI avatars to express and respond to emotions based on text input. The script discusses advancements where AI avatars can change their facial expressions and tone to match the sentiment of the words they are given to speak, showcasing the growing sophistication of AI in mimicking human emotions.

💡Video Giga GAN

Video Giga GAN is a tool developed by Adobe's research team that enhances the resolution of video footage. It is highlighted for its ability to upscale low-resolution videos to high resolution without the typical artifacts associated with AI upscaling, such as plastic-like textures. The script provides examples of its effectiveness, emphasizing its potential to streamline the video upscaling process.

💡Synthesia

Synthesia is a company that created expressive AI avatars capable of emotionally responding to text inputs. The script describes how these avatars can change their facial structure to reflect emotions, which is a significant step towards more realistic and engaging AI interactions in video content creation.

💡NeRF Scans

NeRF (neural radiance field) scans reconstruct real-world environments in 3D. The script explains that Nvidia's research team has scaled the technique so that scans can cover areas up to 25 square kilometers and be rendered in real time, which has profound implications for creating realistic virtual environments for film, video games, and other applications.

💡Prompt-Based Image Editing

Prompt-Based Image Editing is a technique where AI algorithms alter images based on textual prompts provided by the user. The script mentions a collaboration between Adobe and the University of California San Diego to demonstrate this technology, where users can request specific changes to images, such as moving objects or changing scenes, through natural language descriptions.

💡Render

Render is an online application that offers advanced AI image editing features. The script provides an example of changing clothing in a photograph to a red jacket using Render's AI prompting, illustrating how AI can seamlessly integrate new elements into existing images.

💡3D Model Editing

3D Model Editing with prompts is a method where AI technology allows users to generate and modify 3D models through conversational prompts. The script describes a Chinese team's innovation where users can create or edit 3D models by painting or describing desired changes, such as opening a dinosaur's mouth or transforming a banana into a whale.

💡Vu

Vu is an AI video tool that competes with Sora, enabling users to generate short videos from textual prompts. The script compares Vu's output to Sora's, noting that while Sora may have superior quality, Vu is a noteworthy contender in the AI video generation space.

💡AI Film Making

AI Film Making is the process of creating films or film trailers using AI technologies. The script discusses an AI trailer competition and the use of AI in creating various films, such as 'To Dear Me' and 'Son of Life,' highlighting the growing role of AI in the film industry.

💡AI Art Directors

AI Art Directors are professionals who utilize AI in their creative process to design and produce visual content. The script mentions Nicholas Newbert, an AI art director who worked with Jared Leto, indicating the intersection of AI technology and traditional artistic roles.

Highlights

A new Sora competitor, Vu, has emerged, generating AI videos from text prompts.

Adobe's Video Giga GAN tool can upscale low-resolution footage to high resolution with impressive facial and hair texture details.

Synthesia's Expressive AI avatars can change their facial structure and emotional responses based on the text input.

Nvidia's research team has developed the capability to perform large-scale NeRF scans, allowing real-time rendering of areas up to 25 square kilometers.

Adobe and the University of California San Diego have collaborated to showcase prompt-based image editing, enabling context-aware image manipulation.

Render's online application allows users to change clothing in images using AI, offering a clean and user-friendly platform for advanced AI control features.

A team from China has developed a method to edit 3D models using prompts, allowing for simple modifications like opening a dinosaur's mouth or transforming objects.

Vu, the AI video tool, can generate videos up to 16 seconds long in 1080P, offering a competitive alternative to Sora with dynamic AI movement.

Shy Kids, creators of the Airhead AI video, describe using Sora as a slot-machine-like process with extensive post-processing and a 300:1 render-to-use ratio.

An AI trailer competition has been launched in partnership with Submachine, offering the chance to win an Apple Vision Pro.

Hyper has released a new gallery showcasing impressive AI projects, with the ability for users to submit their own films.

Enrollment is open for an AI advertising and filmmaking course covering the latest AI trends and techniques.

Curious Refuge is hosting meetups around the world, including an upcoming event at the Cannes Film Festival.

ChatGPT now has the ability to store information from prompts as memory, so users do not have to repeat that information in future prompts.

The film 'To Dear Me' won the Beijing Film Festival 2024, showcasing the integration of live-action footage with AI concepts.

The film 'Son of Life' demonstrates shot-to-shot consistency, using a black-and-white color grade and film grain for a stylized look.

The concept film 'Cator' humorously portrays a cat as a gladiator with convincing dialogue and high-quality visuals.

AEL Art's scene 'The Dinner' cleverly uses AI and 3D tools to create dynamic camera movements in a five-character scene.