Advanced Settings Tutorial - Kits AI

Kits AI
29 Feb 202405:46

TLDRThis tutorial video from Kits AI focuses on advanced settings for converting voices with their AI technology. The video guides users through the process of optimizing audio conversion by adjusting settings such as removing instrumentals, reducing reverb and delay, and eliminating backing vocals. It also introduces the pitch shift feature to match the audio to the selected AI model. The importance of conversion strength and volume blend is emphasized, as they directly affect the conversion's outcome. The video provides examples of how different settings impact the final audio, suggesting starting with medium conversion strength and adjusting as needed. Pre-processing effects like cut noise and smooth volume are discussed, as well as post-processing effects like compression, chorus, reverb, and delay. The presenter demonstrates how to apply these settings using the M strain Rock model and advises saving preferred settings as a preset for future use. The tutorial concludes with a comparison of the original and AI-converted audio, showcasing the improved presence and smoother audio quality achieved with the AI conversion.

Takeaways

  • 🎙️ **Audio Conversion with Kits AI**: The video provides a tutorial on advanced settings for converting voices using Kits AI for better AI voice conversions.
  • 🔍 **Advanced Settings Access**: After selecting an AI model and inputting audio, it's recommended to access the advanced settings for fine-tuning the conversion process.
  • 🎶 **Remove Instrumentals**: A feature to separate vocals from instrumentals in a full song, useful for audio with vocals, melodies, bass, drums, etc.
  • 🔊 **Audio Cleanup**: Buttons are available to reduce reverb and delay, and to remove backing vocals or ad libs for a cleaner vocal track.
  • 🎛️ **Pitch Shifting**: The pitch shift tool adjusts the audio to match the pitch level required by the selected AI model, but note that it also changes the key of the audio.
  • 🔧 **Conversion Strength**: This setting determines how much the AI voice's version of a sound is used in the conversion process, with higher settings potentially increasing mispronunciation.
  • 📈 **Volume Blending**: A lower model volume maintains original audio levels, while a higher model volume smooths out the audio for a more polished result.
  • 👻 **Dynamics Preservation**: Use a lower volume blend to preserve the dynamics of clean, dynamic recordings or for instances where the converted audio needs to fit well in a mix.
  • 🛠️ **Pre-processing Effects**: Subtle changes like cut noise and smooth volume can help clean up input audio before conversion, useful for recordings with varied volume or pitch issues.
  • ⚙️ **Post-processing Effects**: The compressor can be used for more consistent volumes and presence, while creative effects like chorus, reverb, and delay offer more flexibility for different uses.
  • 💾 **Save Presets**: Once you find settings you like, you can save them as a preset for future use, ensuring consistency across different audio conversions.
  • 📚 **In-Depth Practice**: The video concludes with a practical example, demonstrating the application of the discussed settings for a specific audio conversion.

Q & A

  • What is the main purpose of the 'remove instrumentals' feature in Kits AI?

    -The 'remove instrumentals' feature is used to separate vocals from the instrumentals in a full song, which includes vocals, melodies, bass, drums, etc.

  • How can the 'Reverb and Delay' button help in audio conversion?

    -The 'Reverb and Delay' button helps to clean up audio by reducing reverb and delay effects that are common in vocals and songs.

  • What is the function of the 'remove backing vocals' feature?

    -The 'remove backing vocals' feature assists in eliminating ad libs in hip-hop songs or backup singers from the audio.

  • How does the pitch shift tool work in Kits AI?

    -The pitch shift tool adjusts the pitch of the audio to match the range of the selected AI model. It can lower or raise the pitch, but it also changes the key of the audio.

  • What is the role of 'conversion strength' in the audio conversion process?

    -Conversion strength determines how much the input audio is altered to resemble the AI voice. Higher settings increase the character of the AI voice but may also lead to mispronunciation of words.

  • Why would someone choose a high model volume?

    -A high model volume is chosen to make the conversion smoother and more polished, which is useful for recordings with varied audio levels or less than ideal conditions.

  • What is the significance of 'volume blend' in the conversion process?

    -Volume blend helps to maintain the original audio levels or to smooth out the audio. It's important for preserving the dynamics of a clean recording or ensuring the converted audio fits well in a mix.

  • How can the 'cut noise' pre-processing effect be beneficial?

    -The 'cut noise' effect helps to mask static background noise and can be used to reduce rumble or harshness in the high end of the recording.

  • What is the purpose of the 'smooth volume' pre-processing effect?

    -The 'smooth volume' effect is used to even out recordings with varied volume levels and can assist with pitch correction if the audio is not perfectly in tune.

  • Why might someone choose to use only the compressor for post-processing effects?

    -Using only the compressor is preferred for those who want to maintain maximum flexibility with their converted audio, especially if they plan to use their own chorus, reverb, and delay plugins in a DAW (Digital Audio Workstation).

  • How can saving a preset in Kits AI benefit a user?

    -Saving a preset allows a user to quickly apply their preferred settings for future audio conversions, saving time and ensuring consistency across different projects.

Outlines

00:00

🎙️ Advanced Voice Conversion Settings with Kits AI

This paragraph introduces the video's focus on advanced settings for converting voices using Kits AI. It explains the process of audio conversion, emphasizing the importance of the advanced settings to refine the conversion for optimal results. Key features discussed include removing instrumentals, handling reverb and delay, and removing backing vocals. Additionally, the paragraph touches on pitch shifting to match the AI model's range and the significance of conversion strength and volume blend settings in achieving a high-quality conversion.

05:01

🔊 Volume Blend and Conversion Strength for Audio Dynamics

The second paragraph delves into the nuances of volume blend and conversion strength, two critical settings for voice conversion. It describes how a higher model volume can smooth out audio, making it more polished, while a lower volume blend preserves the original audio dynamics. The paragraph also discusses pre- and post-processing effects, such as cut noise, smooth volume, and the use of a compressor for post-conversion audio. It concludes with a practical application of these settings on a sample audio clip, demonstrating the conversion process and the impact of the chosen settings on the final output.

Mindmap

Keywords

💡Advanced Settings

Advanced Settings refer to the optional configurations that allow users to fine-tune the performance of a software application. In the context of the video, these settings are crucial for optimizing the conversion of audio to AI voices, enabling users to achieve better results tailored to their specific needs.

💡Remove Instrumentals

Remove Instrumentals is a feature that allows users to separate vocals from the instrumental parts of a song. This is particularly useful when converting full songs with multiple audio layers. In the video, it is mentioned as one of the first adjustments to consider when refining the audio conversion process.

💡Reverb and Delay

Reverb and Delay are audio effects that simulate the persistence of sound in a particular space and the time it takes for a repeated sound to diminish, respectively. The video discusses the removal of these effects to clean up the audio, which can be common issues in vocal tracks that may interfere with the clarity of AI voice conversion.

💡Pit Shift

Pit Shift is a tool used to adjust the pitch of an audio signal. In the video, it is mentioned as a helpful tool when the audio's pitch is not within the optimal range for the selected AI model. Adjusting the pit shift can correct the pitch but also changes the key of the audio, which is an important consideration during conversion.

💡Conversion Strength

Conversion Strength is a setting that determines the degree to which the original audio is altered to match the AI voice. It is a critical parameter as it affects how much the AI's version of the sound is used in the conversion process. The video illustrates the impact of varying this setting on the pronunciation and character of the AI voice.

💡Volume Blend

Volume Blend is a setting that controls the balance between the original audio and the AI-generated voice. It is important for achieving a polished sound or preserving the dynamics of the original recording. The video provides examples of how different volume blend settings can affect the final output.

💡Pre-processing Effects

Pre-processing Effects are audio treatments applied to the input audio before conversion. These effects, such as cut noise and smooth volume, are used to clean up and prepare the audio for a better conversion process. The video explains how these subtle changes can enhance the quality of the AI voice conversion.

💡Post-processing Effects

Post-processing Effects are applied to the audio after the conversion process. These include the compressor, chorus, reverb, and delay, which can be used to enhance the audio's presence and creative qualities. The video emphasizes the importance of understanding the desired outcome for the audio before applying these effects.

💡Dynamics

Dynamics in audio refer to the variation in volume levels throughout a recording. Preserving the dynamics is important for maintaining the natural flow and expressiveness of the original audio. The video discusses the use of volume blend to either maintain or smooth out the dynamics in the AI conversion.

💡Presets

Presets are pre-defined settings that users can save and reuse for future projects. In the context of the video, once a user finds a set of advanced settings that works well for their specific needs, they can save these as a preset for convenience and consistency in future audio conversions.

💡AI Model

An AI Model, in the context of the video, refers to a specific instance or configuration of the AI voice conversion software. Different models may be better suited for different types of audio or vocal styles. The video demonstrates how to select and adjust settings for the best match with the chosen AI model.

Highlights

An instructional video on advanced settings for converting voices with Kits AI is presented.

The video aims to improve understanding of the program for better AI voice conversions.

When converting audio, it's recommended to use the 'remove instrumentals' feature for full songs.

The 'Reverb and Delay' button helps clean up vocals and songs with reverberation.

The 'remove backing vocals' feature assists in eliminating ad libs or backup singers.

The 'pitch shift' tool is useful for audio out of the model's range, adjusting pitch without altering the key.

Conversion strength determines how much the AI voice is used in the conversion process.

High conversion strength can exaggerate certain sounds but may mispronounce words.

Medium conversion strength is suggested as a starting point for most audio.

Volume blend adjusts the AI model's volume in relation to the original audio levels.

High model volume is ideal for smoothing out audio with varied levels or less than ideal conditions.

Low volume blend preserves the dynamics of clean, dynamic recordings.

Pre-processing effects include 'cut noise' for background noise and 'smooth volume' for uneven volume levels.

Post-processing effects like 'compressor' can improve audio presence and volume consistency.

Creative post-processing options like 'chorus', 'reverb', and 'delay' can be used for specific audio needs.

For audio to be used in a DAW, it's recommended to use fewer effects for more flexibility.

The video demonstrates the conversion process using the 'M strange Rock' model on a clean studio recording.

Settings can be saved as presets for future use.

The final conversion is compared to the original audio, showing the impact of AI conversion.

The tutorial concludes with a comprehensive understanding of Kits AI's advanced settings for optimal AI voice conversions.