What’s new with generative AI at Google Cloud

Google Cloud Tech
19 Dec 202350:57

TLDRJune Yang, Vice President of Cloud AI at Google Cloud, introduces the transformative impact of generative AI and foundation models, emphasizing their role in accelerating AI adoption for enterprises. Google Cloud's Vertex AI platform, integrated with Duet AI, offers a comprehensive suite of tools for building AI applications, supporting various data types and use cases. The event highlights include new versions of Google's foundation models, industry-specific models, and the introduction of partner models in Vertex AI Model Garden. Google Cloud's commitment to enterprise readiness is underscored by investments in data privacy, security, responsible AI, and partner collaboration.

Takeaways

  • 🚀 Google Cloud is leading the shift towards generative AI with foundation models, enabling enterprises to leverage AI without extensive ML expertise or training data.
  • 🌟 June Yang emphasized the importance of generative AI in accelerating the adoption of AI within enterprises and its role in democratizing AI for a broader range of users, including developers and business users.
  • 📈 Google Cloud's customer base for generative AI has grown by 15x since April, highlighting the significant demand and adoption of these technologies.
  • 🛠️ Vertex AI is a comprehensive AI platform that helps enterprises build AI-powered applications and services, and is the foundation for many of Google's generative AI offerings.
  • 🌐 Google Cloud offers a wide variety of models through Vertex AI Model Garden, including its own models, partner models, and open-source models, providing flexibility for customers.
  • 💡 New versions of Google's foundation models were announced, including increased input token length for PaLM 2 and expanded language support, enhancing the capabilities of these models.
  • 🎨 Upgrades to the text-to-image model, Imagen, were introduced, featuring improved image quality and a new style tuning feature for brand-aligned image generation.
  • 🔐 Google Cloud is committed to enterprise readiness, with a focus on security, privacy, compliance, and control, ensuring that generative AI can be adopted with confidence.
  • 🤖 Vertex AI Search and Conversation is now generally available, combining Google's expertise in search and conversation to help enterprises build Q&A and chatbot experiences.
  • 🌟 Partnerships with leading companies like GitLab and the World Bank were highlighted, showcasing the real-world applications and benefits of Google Cloud's generative AI technologies.

Q & A

  • What is the main focus of June Yang's presentation at Google Cloud Next 2023?

    -June Yang's main focus is on the advancements in generative AI at Google Cloud, particularly emphasizing the capabilities of foundation models and how they can be leveraged by enterprises without ML expertise.

  • What is the significance of foundation models in AI?

    -Foundation models represent a major shift in AI, as they can be used across a variety of use cases without the need for extensive training data, thus accelerating the adoption of AI in enterprises and democratizing AI beyond data scientists.

  • How does Google Cloud ensure the security and privacy of generative AI?

    -Google Cloud incorporates security, privacy, compliance, and control requirements across the entire stack, ensuring that enterprises can adopt generative AI with confidence in the protection of their data and IP.

  • What are some of the new features announced for Google's foundation models?

    -New features include increased input token length for PaLM 2, support for 38 languages, upgrades to Imagen for improved image generation and editing, and the introduction of style tuning for image models.

  • How is Google Cloud's Vertex AI platform designed to help enterprises build AI-powered applications and services?

    -Vertex AI provides a comprehensive platform that combines foundation model capabilities with search and conversation tools, allowing developers to easily build chatbots, voice bots, knowledge search, and more, all built on a world-class infrastructure.

  • What is Duet AI, and how does it integrate with Google Cloud products?

    -Duet AI is a solution built on Vertex AI that integrates generative AI capabilities into GCP products like BigQuery and Workspace, enabling enterprises to leverage these capabilities directly within their existing tools and workflows.

  • How does GitLab utilize Google's foundation models to enhance their platform?

    -GitLab uses Google's foundation models, particularly the Codey model family, to deliver AI-powered features such as code suggestions and improvements across the software development lifecycle, aiming to make users 10x more efficient.

  • What are some industry-specific foundation models mentioned in the presentation?

    -Industry-specific foundation models mentioned include Sec-PaLM for cybersecurity and Med-PaLM for the medical domain, which are designed to meet the unique needs of these sectors.

  • How does Google Cloud ensure responsible AI usage?

    -Google Cloud ensures responsible AI usage through a set of tools and practices that include safety filters, model evaluation, bias evaluation, content moderation, and recitation checks, all aimed at reducing the risk of harmful outcomes.

  • What is the role of partnerships in Google Cloud's AI strategy?

    -Partnerships play a crucial role in Google Cloud's AI strategy by collaborating with foundation model creators, open source providers, tool providers, SaaS providers, and consulting firms to offer a wide range of models and support enterprises in adopting AI.

Outlines

00:00

🌟 Introduction to Google Cloud Next 2023 and Generative AI

The video script begins with June Yang, Vice President of Cloud AI at Google Cloud, welcoming the audience to Google Cloud Next 2023. She expresses her excitement about being back in person and introduces the topic of generative AI at Google Cloud. June Yang discusses the major shift in AI with the rise of foundation models, which allows for various use cases without ML expertise and minimal training data. The goal of Google's generative AI is to bring enterprise-ready solutions to help organizations innovate faster. She also mentions Google's comprehensive technology stack, from AI infrastructure to tools for building custom models and applications. June Yang emphasizes the integration of security, privacy, compliance, and control across the stack. She invites attendees to learn more about Duet AI and highlights the significant demand from customers for Google Cloud's generative AI capabilities.

05:01

🚀 Announcements of New Foundation Model Versions

In this paragraph, June Yang announces new versions of Google's foundation model families, starting with the text and chat model PaLM 2. She addresses the increased demand for longer input token lengths and introduces the new version with a fourfold increase, allowing for processing of longer documents. June also announces the general availability of 38 languages and private preview for 100 more languages. Upgrades to the text-to-image model, Imagen, are discussed, including improved image quality and the introduction of style tuning. Additionally, the first Cloud provider to enable digital watermarking for generated images is announced, powered by Google DeepMind SynthID. Codey, the text-to-code model, has also been upgraded with a 25% quality improvement. The paragraph concludes with the announcement of the preview of streaming API for code generation and chat, improving the developer experience.

10:03

🤖 GitLab's Collaboration with Google Cloud

David DeSanto, Chief Product Officer from GitLab, joins June Yang to discuss GitLab's collaboration with Google Cloud. GitLab aims to revolutionize how development, security, and operations teams work together to build secure software. They provide an enterprise DevSecOps platform that is now enhanced with AI for greater efficiency. David addresses concerns about the misuse of software and intellectual property, outlining GitLab's vision for AI use, which includes applying AI across the software development lifecycle, leading with a privacy-first approach, transparency, and providing the best AI experience for specific use cases. GitLab's partnership with Google Cloud is highlighted, emphasizing shared commitments to enterprise readiness and privacy. The integration of PaLM 2 foundation models, including the Codey model family, is discussed, along with the introduction of GitLab Duo, a suite of AI-powered features for the software development lifecycle.

15:04

🛠️ Model Augmentation Tools and Tuning Methods

June Yang continues the discussion by introducing model augmentation tools designed to enhance the capabilities of foundation models for enterprise use. Grounding, extension, and tuning are highlighted as key tools. Grounding allows for model outputs based on user-specified source material, ensuring factual and verifiable responses. Extension enables integration with real-time data and actions via APIs, while tuning allows customization of foundation models with enterprise data for more accurate results. June Yang provides examples of each tool and announces the preview of new grounding capabilities in Vertex AI. The paragraph also covers various tuning methods, including prompt design, adapter tuning, reinforcement learning with human feedback, and full fine-tuning. The importance of model evaluation tools and frameworks is emphasized, as is the introduction of Colab Enterprise on Vertex AI, offering a common notebook interface and AI-powered code completion.

20:05

🎨 Exploring Foundation Models with Gen AI Studio

Nikita Namjoshi takes the stage to demonstrate the use of Vertex AI and Gen AI Studio to explore and customize foundation models across different data modalities. The demonstration includes using a large language model to extract structured data from text, showcasing the ability to process documents in various languages. Codey, the text-to-code model, is used to generate Dockerfiles and Python code for machine learning models. The demonstration concludes with the use of Imagen to generate images based on text prompts in different styles, highlighting the versatility of foundation models in various data types.

25:08

🔍 Vertex AI Search and Conversation Capabilities

June Yang discusses the introduction of Vertex AI Search and Conversation, formerly known as Gen App Builder, which combines Google's experience in search and conversation with foundation models. The use cases for generative AI in search and chat applications are explored, such as Q&A with internal knowledge bases and chatbots for employees and customers. The orchestration layer's role in understanding user intent and providing answers from various sources is explained. Raman Pugalumperumal from the World Bank shares their experience as an early adopter of Vertex AI Search, discussing the organization's goals and the importance of knowledge management. He highlights the benefits of generative AI-powered search in content analysis, synthesis, and generation, and the potential to elevate the value of development investments through efficient use of the World Bank's extensive data.

30:08

💬 Building Generative AI Experiences with Vertex AI

Kalyan Pamarthy demonstrates how developers can build and configure a Search and Conversation app on Vertex AI. He walks through the process of creating a search engine, selecting a data source, and indexing data. The demo showcases the capabilities of the search engine in providing relevant results, summarized answers, and citations for verification. The conversational search experience is also demonstrated, highlighting the ability to maintain context and provide follow-up answers. Kalyan emphasizes the customization options and API availability for the system, allowing for integration into various enterprise solutions.

35:11

🛡️ Ensuring Enterprise Readiness with Responsible AI

In the final paragraph, June Yang addresses the importance of enterprise readiness in the context of generative AI, focusing on data governance, privacy, security, compliance, reliability, sustainability, safety, and responsibility. She outlines Google Cloud's approach to ensuring data privacy, with a strong emphasis on the customer's ownership of their data. June Yang also discusses the security and compliance measures in place for Vertex AI, including VPC security control, customer-managed encryption keys, data residency, and access transparency. The commitment to responsible AI is highlighted through a suite of tools for safety, bias evaluation, content moderation, and more. June Yang concludes by emphasizing the ongoing journey with generative AI and the exciting potential it holds for the future.

Mindmap

Keywords

💡Generative AI

Generative AI refers to artificial intelligence systems capable of creating new content, such as text, images, or audio. In the context of the video, it highlights Google Cloud's advancements in this area, emphasizing the ability to leverage AI for various use cases without extensive ML expertise or training data.

💡Foundation Models

Foundation models are large-scale machine learning models trained on vast datasets that can perform multiple tasks with minimal additional training. They are central to Google Cloud's strategy of making AI accessible and transformative for enterprises by providing a powerful and flexible platform for various applications.

💡Vertex AI

Vertex AI is Google Cloud's comprehensive AI platform designed to help enterprises build AI-powered applications and services. It serves as the foundation for other Google Cloud AI services and integrates various generative AI capabilities, offering a range of tools for developers, data scientists, and ML engineers.

💡Duet AI

Duet AI is a product within Google Cloud that integrates generative AI capabilities into GCP products like BigQuery and Workspace. It represents the company's efforts to bring the power of enterprise-ready generative AI to a wider range of applications, enhancing innovation and efficiency.

💡AI-Powered Workflows

AI-powered workflows refer to the integration of artificial intelligence into various processes to automate tasks, improve efficiency, and provide intelligent insights. In the video, Google Cloud's partnership with GitLab exemplifies this concept by using AI to enhance software development processes, making them more secure and efficient.

💡Data Privacy

Data privacy concerns the protection of personal and sensitive information from unauthorized access and use. In the context of the video, Google Cloud emphasizes its commitment to data privacy by ensuring that customer data is not used to train their models and that robust security protocols are in place to protect customer data and intellectual property.

💡Enterprise Readiness

Enterprise readiness refers to the ability of a product or service to meet the needs of large organizations in terms of scalability, security, compliance, and overall reliability. In the video, Google Cloud outlines its efforts to ensure that their AI offerings are enterprise-ready by providing robust security controls, data governance, and responsible AI practices.

💡Model Augmentation Tools

Model augmentation tools are techniques or mechanisms used to enhance the capabilities of AI models, often by incorporating additional data or fine-tuning the models for specific tasks. In the video, Google Cloud introduces a suite of model augmentation tools designed to make foundation models more useful and accurate for enterprise applications.

💡Colab Enterprise

Colab Enterprise is a notebook service developed by Google that brings collaborative features to data scientists, allowing them to work on projects together and share their work easily. It is built on Google Cloud and offers a seamless integration with other Google Cloud services, providing a zero-configuration experience and AI-powered code completion capabilities.

💡Digital Watermarking

Digital watermarking is a technique used to embed information into digital media, such as images, in a way that is difficult to remove or alter without detection. In the context of the video, Google Cloud introduces digital watermarking for AI-generated images, powered by Google DeepMind SynthID, to ensure the integrity and traceability of generated content.

💡Model Garden

Model Garden is a feature of Google Cloud that offers a variety of curated machine learning models for use by customers. It includes models from Google, partners, and open-source projects, providing flexibility and choice for customers to select the best model for their specific use cases.

Highlights

June Yang, Vice President of Cloud AI, and Industry Solutions at Google Cloud, discusses the major shift in AI with the rise of foundation models.

Generative AI at Google Cloud aims to bring enterprise-ready AI capabilities to help organizations innovate faster.

Google Cloud's foundation models can be leveraged without ML expertise and with little to no training data, democratizing AI for a broader range of users.

Google is the only company offering a full technology stack from high-performance AI infrastructure to foundation models and custom model building tools.

Duet AI and Vertex AI are integral to Google Cloud's strategy, integrating generative AI capabilities into GCP products and providing a comprehensive AI platform for enterprises.

Since April, the number of customers engaging with Google Cloud on generative AI has grown by 15x, showing significant demand from customers for these services.

Google Cloud's foundation models have been updated, with PaLM 2 now supporting 38 languages and an increased input token length of up to 32,000 tokens.

Imagen, Google's text-to-image model, has been upgraded with improved image quality and a new style tuning feature for brand-aligned image generation.

Google Cloud is the first cloud provider to enable digital watermarking for generated images, using Google DeepMind's SynthID technology.

Codey, the text-to-code model, has been upgraded with a 25% improvement in code quality and the introduction of a streaming API for code generation and chat.

Specialized foundation models like Med-PaLM for the medical domain and Sec-PaLM for cybersecurity are being made available to a wider range of customers based on feedback and demand.

Google Cloud's Model Garden now includes Meta's Llama 2 and Code Llama, offering the widest variety of models compared to other hyperscale providers.

Google Cloud is announcing the addition of Anthropic's Claude 2 to Model Garden, further expanding the range of available models.

GitLab is partnering with Google Cloud to leverage foundation models like Codey and PaLM for AI-powered software development experiences.

GitLab Duo is introduced as a suite of AI-powered features to enhance the software development lifecycle, making developers 10x more efficient.

Google Cloud is investing in model augmentation tools like grounding, extension, and tuning to further enhance the capabilities of foundation models for enterprise use.

Vertex AI Search and Conversation, formerly known as Gen App Builder, is now generally available, combining Google's search and conversation experience with foundation models.

The World Bank is using Vertex AI Search to reimagine enterprise search, leveraging AI-generated summaries and multi-turn search functionality to increase the value of their extensive data.

Google Cloud is focusing on enterprise readiness for generative AI, emphasizing data privacy, security, compliance, reliability, sustainability, safety, and responsibility.

Google Cloud is working with partners across the entire AI stack, including foundation model creators, tool providers, SaaS providers, and consulting firms to support enterprise adoption of generative AI.