How to Create Voiceover Using Google Cloud Text to Speech

LearnWoo
28 May 202203:27

TLDRIn this informative video, Nian Roby from LearnVue demonstrates two methods for creating voiceovers using Google Cloud's Text-to-Speech technology. The first method involves setting up a Google Cloud account, enabling the Cloud Text-to-Speech API, and using an API key with the Wavenet extension for Chrome to convert text into speech. The second method is simpler, utilizing an audio capture extension to record speech generated directly from the Google Cloud website with customizable language, voice, speed, and pitch. Both methods are straightforward and cater to users looking to add professional voiceovers to their projects without hassle.

Takeaways

  • 📈 **Create a Google Cloud account**: Sign up at the provided link, fill in your details, and accept terms of service.
  • 💳 **Verification and Payment Info**: Provide your phone number for verification and credit card info for the final step, but no charges will be made after the complimentary $300 credit.
  • 🔍 **Enable Text-to-Speech API**: Search for 'Text-to-Speech' in the Google Cloud console and enable the API.
  • 🚫 **Free Version Limit**: The free version allows processing up to one million characters for WaveNet voices.
  • 💡 **Create API Key**: Generate an API key from the 'Credentials' section and restrict it for security purposes.
  • 🔗 **Install WaveNet Extension**: Add the WaveNet extension to your browser from the web store and paste your API key.
  • 📑 **Text Character Count**: Use the character counter (link provided) to ensure you don't exceed the free character limit.
  • 🔁 **Download Audio**: Use the WaveNet extension to download your text as an MP3 file.
  • 🎤 **Second Method Overview**: An alternative method involves using an 'Audio Capture' extension to record the speech from Google Cloud's Text-to-Speech interface.
  • 🔊 **Record and Save Audio**: With the 'Audio Capture' extension, record the speech, and then save the audio file.
  • 🌐 **No Sign-In Required**: For method two, you can access and use Google Cloud's Text-to-Speech without signing in.
  • ⚙️ **Customize Voice Settings**: Choose your language, voice, and adjust speed and pitch to your preference before generating the voiceover.

Q & A

  • What is the main topic of the video?

    -The video is about how to create text to speech voiceover using Google Cloud.

  • How many voices and languages are available in Google Cloud's text to speech tool?

    -Google Cloud's text to speech tool offers over 200 voices in more than 40 languages.

  • What is the name of the person presenting the video?

    -The presenter's name is Nian Roby from LearnVue.

  • What is the first step to create a text to speech voiceover using Google Cloud?

    -The first step is to create a Google Cloud account by following the link provided in the description.

  • What is the complimentary credit amount that Google Cloud offers for new users?

    -Google Cloud offers a complimentary credit of $300 for new users.

  • How much does it cost to process more than one million characters with WaveNet invoices?

    -It costs $16 to process more than one million characters with WaveNet invoices.

  • What is the purpose of creating credentials in the Google Cloud Text to Speech API?

    -Creating credentials is necessary to authenticate your applications and secure your API key, which is like a password that should not be shared.

  • How can you restrict the use of your API key?

    -You can restrict the use of your API key by specifying the applications or websites that are allowed to use it.

  • What is the purpose of installing the Wavenet extension in the first method?

    -The Wavenet extension is used to facilitate the text to speech process directly from the browser, allowing users to convert selected text into speech.

  • What is the second method described in the video for creating a voiceover?

    -The second method involves using an audio capture extension to record speech generated by Google Cloud's text to speech feature.

  • How can you customize the voice, speed, and pitch of the voiceover?

    -You can customize the voice, speed, and pitch by selecting the desired options in the Google Cloud Text to Speech interface before generating the voiceover.

  • What should you do after recording the speech using the audio capture extension?

    -After recording the speech, you should go back to your audio extension, finalize the recording, and save the audio file.

Outlines

00:00

🎓 Introduction to Google Cloud Text-to-Speech

In this video, Nian Roby from LearnVue introduces viewers to two methods for creating text-to-speech voiceovers using Google Cloud. The first method requires setting up a Google Cloud account, which involves visiting a provided link, filling in personal details, accepting terms of service, and providing a phone number for verification. After signing up, viewers are guided to search for and enable the Cloud Text-to-Speech API, which allows processing up to one million characters for free. Roby also explains the importance of creating and restricting an API key for security reasons. The second method is simpler and involves installing an audio capture extension and using a Google Cloud feature to input text, select language and voice, and record the speech. Both methods are demonstrated with the help of a browser extension and Google Cloud's text-to-speech functionality.

Mindmap

Keywords

💡Voiceover

Voiceover refers to a production technique where a voice—that is not part of the main action of a movie, television show, or other visual medium—is used to narrate or provide additional information. In the context of the video, voiceover is created using Google Cloud's Text to Speech service, allowing users to generate spoken words from text.

💡Google Cloud

Google Cloud is a suite of cloud computing services offered by Google, which includes storage, computing power, and various software tools. In the video, Google Cloud provides the platform for the Text to Speech API, which is central to creating the voiceover.

💡Text to Speech API

The Text to Speech API is a service that converts written text into audible speech. It is a crucial component in the video's demonstration, as it enables users to select voices and languages to synthesize speech from text.

💡Wavenet

Wavenet is a deep learning-based text-to-speech technology developed by Google. It is mentioned in the video as a tool that can be used to create voiceovers with high-quality, natural-sounding speech.

💡API Key

An API key is a unique identifier used in the context of software interactions to grant an application access to a particular service. In the video, the presenter instructs viewers to create and restrict an API key for the Google Cloud Text to Speech service to ensure secure access.

💡Complimentary Credit

Complimentary credit refers to a sum of money given for free by a service provider, which can be used to pay for certain services. Google Cloud offers a complimentary credit of $300 to new users, as mentioned in the video, allowing them to explore the platform's capabilities without immediate cost.

💡Character Counter

A character counter is a tool used to count the number of characters in a given text. In the video, it is suggested to use a character counter to ensure that the text to be converted into speech does not exceed the free usage limit of the Text to Speech API.

💡Audio Capture Extension

An audio capture extension is a browser plugin that allows users to record audio directly from their browser. In the video, the presenter demonstrates how to use an audio capture extension to record the voiceover generated by the Text to Speech API.

💡Language Selection

Language selection is the process of choosing the language in which the text will be converted to speech. The video emphasizes the ability to select from over 200 voices in 40 plus languages, showcasing the versatility of the Google Cloud Text to Speech service.

💡Voice Selection

Voice selection involves choosing the specific voice that will be used to narrate the text. The video highlights the variety of voices available, allowing users to select the one that best fits their needs.

💡Speed and Pitch

Speed and pitch refer to the rate at which the speech is delivered and the frequency of the voice's tone, respectively. The video demonstrates how users can adjust the speed and pitch of the synthesized voice to their preference using the Text to Speech service.

Highlights

Two methods are presented for creating text-to-speech voiceover using Google Cloud.

Over 200 voices and 40+ languages are available in Google Cloud's Text-to-Speech tool.

To use the first method, a Google Cloud account is required, which can be set up via a provided link.

The account setup includes country, organization details, and acceptance of terms of service.

Verification and credit card information are needed for the final step of account creation.

Google Cloud offers a complimentary credit of $300 for new users.

The free version of the Text-to-Speech API allows processing up to one million characters for WaveNet voices.

For more than one million characters, a fee of $16 applies.

The Cloud Text-to-Speech API can be enabled through the Google Cloud homepage's search bar.

API credentials need to be created and restricted to prevent unauthorized use.

The WaveNet extension for Chrome is used in the first method, requiring the API key to be pasted into it.

Text can be copied into a character counter and processed through the WaveNet extension to create voiceover.

The second method involves using an Audio Capture extension.

No sign-in is required for the second method on the Google Cloud website.

Text-to-Speech can be tested directly on the Google Cloud website with options to choose language, voice, and adjust speed and pitch.

Recording of the speech is done through the Audio Capture extension after clicking 'Speak'.

The recorded audio can be saved after the process is complete.

The video provides step-by-step guidance on both methods.

The presenter, Nian Roby from LearnVue, encourages viewers to subscribe for more content.

Questions or doubts about the methods can be addressed by leaving a comment.