Introducing GPT-4o
Summary
TLDR在这次演讲中,Mira Murati 宣布了 ChatGPT 的新旗舰模型 GPT-4o 的发布,这是一款具有 GPT-4 智能的更快、更高效的 AI 模型。GPT-4o 能够实时处理语音、文本和视觉信息,显著提升了与 AI 的交互体验。演讲中展示了 GPT-4o 的多项功能,包括实时对话语音、数学问题解答、代码分析和实时翻译等。此外,还提到了 ChatGPT 的新桌面版本,它将更易于使用,并且对所有用户,包括免费用户开放。Mira 强调了使先进 AI 工具免费对所有人开放的重要性,并介绍了 ChatGPT 在 50 多种语言中质量和速度的提升。最后,她感谢了 OpenAI 团队和合作伙伴,为观众提供了一个深入了解 GPT-4o 能力的机会。
Takeaways
- 📢 米拉·穆拉蒂(Mira Murati)宣布了三个主要更新:ChatGPT的桌面版本发布、用户界面(UI)的刷新以及新旗舰模型GPT-4o的推出。
- 🚀 GPT-4o模型将GPT-4的智能提供给所有人,包括免费用户,这标志着在易用性方面迈出了巨大一步。
- 🎉 ChatGPT现在无需注册流程即可使用,提高了用户使用的便捷性。
- 🔍 GPT-4o在文本、视觉和音频方面都有所改进,提供了更快的处理速度和更高的性能。
- 🗣️ GPT-4o支持实时会话语音,允许用户随时打断,响应迅速,并且能够识别和表达情绪。
- 📈 GPT-4o能够理解和生成多种风格的语音,包括戏剧化和歌唱风格,提供了更丰富的交互体验。
- 👀 GPT-4o的视觉功能允许它查看用户分享的图像和代码,从而提供更直观的交互和帮助。
- 🧠 GPT-4o的记忆功能增强了ChatGPT的实用性,使其能够跨对话保持连续性。
- 🌐 GPT-4o支持50多种语言的改进,提高了多语言用户群体的体验。
- 💰 对于付费用户,GPT-4o提供了比免费用户多达五倍的容量限制。
- 🛠️ 开发者现在可以通过API使用GPT-4o构建和部署AI应用程序,这标志着技术的进一步开放和应用扩展。
- 🔒 尽管GPT-4o带来了许多积极的变化,但它也提出了新的安全挑战,团队正在努力构建缓解措施以防止滥用。
Q & A
为什么让产品对每个人都真正可用和广泛可用是如此重要?
-为了让每个人都能使用ChatGPT,无论他们身在何处,公司始终在寻找减少使用摩擦的方法,以实现更广泛的可用性。
桌面版ChatGPT的发布意味着什么?
-桌面版ChatGPT的发布意味着用户可以更简单、更自然地使用ChatGPT,并且它将更容易地集成到用户的工作中。
GPT-4o是什么,它与GPT-4有何不同?
-GPT-4o是公司推出的新旗舰模型,它提供了GPT-4的智能,但运行速度更快,并在文本、视觉和音频方面都有所改进。
GPT-4o的实时对话语音功能有哪些关键特点?
-GPT-4o的实时对话语音功能允许用户随时打断模型并立即响应,没有尴尬的延迟,并且能够处理情绪和生成不同风格的语音。
GPT-4o如何改善与免费用户的互动体验?
-GPT-4o通过原生处理语音、文本和视觉,提高了效率,使得GPT-4o智能可以向免费用户提供,这是公司数月努力的成果。
GPT-4o的发布对开发者意味着什么?
-开发者现在可以通过API使用GPT-4o来构建和部署惊人的AI应用,它比GPT-4 Turbo更快,成本更低,且有更高的速率限制。
GPT-4o在安全性方面面临哪些新挑战?
-GPT-4o在安全性方面面临实时音频和实时视觉的挑战,团队一直在努力构建对策以防止滥用。
GPT-4o在语言支持方面有哪些改进?
-GPT-4o在50多种不同的语言上提高了质量和速度,这使得产品能够为更多的人提供体验。
GPT-4o的发布对教育领域有何影响?
-大学教授可以为学生创建内容,播客可以为听众创建内容,GPT-4o的发布扩大了构建者的受众范围,使教育内容更加个性化和易于获取。
GPT-4o如何帮助用户解决数学问题?
-GPT-4o通过提供逐步的提示和指导,帮助用户解决线性方程,使数学问题变得更加易于理解和解决。
GPT-4o的实时翻译功能如何工作?
-GPT-4o能够实时翻译英语和意大利语之间的对话,使得只会说一种语言的用户能够与说另一种语言的用户进行交流。
GPT-4o如何通过视觉识别帮助用户?
-GPT-4o的视觉功能允许它查看用户分享的代码、图表和图像,并提供相关的解释和分析,增强了用户的理解和互动。
Outlines
🚀 产品发布与使命宣言
Mira Murati 在演讲中首先表达了对听众的欢迎,并宣布了三个主要议题。她强调了使产品对每个人都真正可用的重要性,并提到了减少使用ChatGPT的障碍。接着,她宣布了桌面版ChatGPT的发布,它将更简单、更自然。最重要的消息是推出了新的旗舰模型GPT-4o,它将GPT-4的智能提供给所有人,包括免费用户。她还提到了未来几周将展示的实时演示,以展示新模型的全部能力,并强调了使先进AI工具对每个人都免费可用的使命。
🎉 GPT-4o的推出与功能
Mira Murati 继续介绍 GPT-4o 的特点,包括它在文本、视觉和音频方面的改进,以及它在易用性方面的重大进步。她提到了 GPT-4o 在实时音频和视觉方面的挑战,以及团队如何努力防止滥用。此外,她还介绍了 GPT-4o 将对免费用户提供的服务,包括在 GPT 应用商店中使用 GPT,以及它在不同语言中质量和速度的提升。对于付费用户,他们将继续享有比免费用户多达五倍的容量限制。
🤖 实时对话与情感识别
Mark Chen 和 Barrett Zoph 展示了 GPT-4o 的实时对话功能。他们通过实时互动,展示了 GPT-4o 如何在对话中实时响应,以及如何通过语音模式表达情感。Mark 通过深呼吸来缓解紧张,而 GPT-4o 能够识别并提供反馈。此外,GPT-4o 还能够以不同的风格生成语音,包括戏剧性和歌唱性,以及以机器人的声音讲述故事。
🧮 解决数学问题
Barrett Zoph 与 GPT-4o 互动,解决了一个线性方程。GPT-4o 提供了逐步的提示,帮助 Barrett 完成方程的解答。此外,GPT-4o 还解释了线性方程在日常生活中的应用,如计算费用、规划旅行、烹饪以及商业利润和损失计算。
📈 代码和数据可视化
Barrett Zoph 展示了如何与 GPT-4o 共享代码,并讨论了代码的功能。GPT-4o 能够描述代码的目的,并解释了一个特定函数的作用。接着,Barrett 生成了一个图表,并让 GPT-4o 描述了图表的内容,包括平均温度、最高温度和最低温度的显示,以及一个显著天气事件的注释。
🌐 实时翻译与情感分析
Mira Murati 和 Mark Chen 测试了 GPT-4o 的实时翻译能力。GPT-4o 成功地在意大利语和英语之间进行了翻译。此外,Barrett Zoph 要求 GPT-4o 分析他的情感状态,通过一张自拍照,GPT-4o 正确地识别出了他快乐和兴奋的情绪。
🔍 未来展望与感谢
Mira Murati 总结了演讲,强调了技术带来的神奇感觉,并表达了对未来技术进步的期待。她感谢了 OpenAI 团队和合作伙伴,如 Janssen 和 Nvidia,他们的高级 GPU 使得当天的演示成为可能。最后,她感谢了所有参与当天活动的人。
Mindmap
Keywords
💡ChatGPT
💡GPT-4o
💡实时对话
💡数学问题解决
💡语言翻译
💡图像识别
💡情感识别
💡代码理解
💡滚动平均
💡API
💡安全性
Highlights
Mira Murati 强调了使产品真正普及并广泛可用的重要性,并始终寻求减少用户使用 ChatGPT 的摩擦。
发布了 ChatGPT 的桌面版本,旨在简化使用并使其更加自然。
推出了新的旗舰模型 GPT-4o,它将 GPT-4 的智能提供给所有人,包括免费用户。
GPT-4o 在文本、视觉和音频方面都有所改进,并且响应速度更快。
GPT-4o 的发布标志着在易用性方面迈出了巨大步伐,预示着未来人机交互的新范式。
GPT-4o 能够本地处理语音、文本和视觉,极大地提高了效率,并允许为免费用户提供 GPT-4 智能。
目前有超过 1 亿人使用 ChatGPT 进行工作、学习和创造内容。
GPT-4o 将提供给所有用户,包括在 GPT 商店中使用。
GPT-4o 还引入了视觉功能,允许上传截图、照片和包含文本和图像的文档。
GPT-4o 增强了记忆功能,提供了跨所有对话的连续性。
改进了浏览功能,可以在对话中实时搜索信息。
增加了高级数据分析功能,可以上传图表和任何工具来分析信息。
GPT-4o 在 50 多种语言中的质量和速度都有所提高,以覆盖更广泛的用户群。
付费用户将继续拥有比免费用户多达五倍的容量限制。
GPT-4o 不仅在 ChatGPT 中可用,还将提供给 API,使开发者能够构建和部署 AI 应用程序。
GPT-4o 引入了实时音频和视觉的新挑战,团队一直在努力构建防止滥用的缓解措施。
展示了 GPT-4o 的实时对话语音功能,包括实时响应和情感识别。
GPT-4o 能够以多种不同的风格生成语音,具有广泛的动态范围。
GPT-4o 的视觉功能可以解决数学问题,并提供解题提示。
GPT-4o 能够实时翻译,帮助不同语言的用户进行交流。
GPT-4o 还能够通过观察面部表情来识别情绪。
GPT-4o 的发布和演示展示了 AI 技术的进步,并承诺在未来几周内将这些功能推广给所有用户。
Transcripts
[APPLAUSE]
>> MIRA MURATI: Hi everyone.
Thank you, thank you. It is
great to have you here today.
Today, I am going to talk about
three things. That is it. We
will start with why it is so
important to us to have a
product that we can make truly
available and broadly available
to everyone. And we are always
trying to find out ways to
reduce friction. So everyone
can use ChatGPT wherever they
are. Today, we will be
releasing the desktop version
of ChatGPT and to make it
simpler to use and much more
natural as well. But the big
news today is that we are
launching our new flagship
model. And we are calling it
GPT-4o. Best thing about
GPT-4o is it brain GPT-4
intelligence to everyone
including our free users. We
will be showing some live demos
today to show the full extent
of the capabilities of our new
model. And we will be rolling
them out over the next few
weeks. So let's get started.
A very important part of our
mission is to be able to make
our advanced AI tools available
to everyone for free. We think
it is very, very important that
people having for what the
technology can do. We really
want to pair it with its
broader understanding. And
were always finding ways to
reduce the friction. Recently,
we made ChatGPT available
without the sign-up flow.
Today, we are also bringing the
best to ChatGPT. We want you
to be able to use it wherever
you are. As you can see it is
easy, simple, it integrates
very, very easily in your
workflow. Along with it, we
have also refreshed the UI. We
note that this model get more
and more complex. But we want
the experience of interaction
to actually become more
natural, easy, and for you not
to focus on the UI at all but
focus on the collaboration of
ChatGPT. And now the big news.
Today, we are releasing our
newest flagship model this is
GPT-4o. [APPLAUSE]
GPT-4o provides GPT-4
intelligence but it is much
faster. And it improves on its
capabilities across text,
vision, and audio. For the
past couple of years, we have
been very focused on improving
the intelligence of this model
is. And they have gotten
pretty good. But this is the
first time that we are really
making a huge step forward when
it comes to the ease of use.
This is incredibly important
because we are looking at the
future of interaction between
ourselves and the machines. We
think that GPT-4o is really
shifting the paradigm into the
future of collaboration. It is
natural and far easier. Making
this happen is actually quite
complex. When we interact with
one another there is a lot of
stuff that we take for granted.
You know the ease of our
dialogue when we interrupt
another, the background noises,
the multiple voices in a
conversation and understanding
the tone of voice. All of
these things are quite complex.
Until now with voice mode, we
had three models that come
together to deliver this
experience. We have
transcription, intelligence,
and text to speech all comes
together and orchestration to
deliver voice mode. This also
brings a lot of latency to the
experience and it really breaks
the immersion in the
collaboration with ChatGPT.
But now with GPT-4o this all
happens natively. GPT-4o
reasons across voice, text, and
vision. And with this
incredible efficiencies, it
also allows us to bring the
GPT-4o intelligence to our free
users. This is something we
have been trying to do for
many, many months. And we are
very, very excited to finally
bring GPT-4o to all of our
users. Today, we have 100
million people using more than
100 million and fact they use
ChatGPT to create work, learn,
and we have this advanced tools
that are only available to our
paid users at least until now
with a efficiencies of 4o we
can bring these tools to
everyone. So starting today,
you can use GPT in the GPT
store. So far we have had more
than a million users create
amazing experiences with GPT.
This is custom ChatGPT and they
are available in the store.
And now our builders have much
bigger audience where
university professors can
create content for their
students, or pod casters can
create content for their
listeners. You can also use
the vision. So now you can
upload screenshots, photos,
documents containing both text
and images. You can start
conversations with ChatGPT
about all of this content. You
can also use memory where it
makes ChatGPT far more useful
and helpful because now it has
a sense of continuity across
all of your conversations. You
can use browse where you can
search for real-time
information in your
conversation. An advanced data
analysis where you can upload
charts and any tools and
analyze this information. It
will give you answers and so
on. Lastly, we also improved
on the quality and speed in 50
different languages. For
ChatGPT and this is very
important because we want to be
able to bring this experience
to as many people out there as
possible. So we are very, very
excited to bring GPT-4o to all
of our free users out there and
for the paid users, they will
continue to have up to five
times the capacity limits of
our free users. But GPT-4o is
not only available in ChatGPT.
We also bringing you to the
API. [APPLAUSE]
So our developers can start
building today with GPT-4o and
making amazing AI applications
deploying them at scale. 4o is
faster, 50% cheaper and five
times higher rate limits to
GPT-4 Turbo. As we bring these
technologies into the world, it
is quite challenging to figure
out how to do so in a way that
is both useful and also safe.
GPT-4o present new challenges
for us when it comes to safety
because we are dealing with
real time audio, real-time
vision and our team has been
hard at work figuring out how
to build in mitigations against
misuse. We continue to work
with different stakeholders out
there from government, media,
entertainment, all industries,
civil societies to figure out
how to best bring this
technologies into the world.
Over the next few weeks, we
will continue our deployment to
bring out all of the
capabilities to you. But
today, I want to show you all
of these capabilities. We will
do some live demos. I will
bring on two of our research
leads Mark Chen and Barrett
Zoph. [APPLAUSE]
>> BARRETT ZOPH: Hi I'm
Barrett.
>> MARK CHEN: Hi a mark. One
of the key capabilities we are
excited to share his real-time
conversational speech. Will
get in Florida. I am taking
out a phone if you're wondering
about this wire we have
consistent Internet. If you
see there's this little icon on
the bottom right of the ChatGPT
out. It will open GPT-4o audio
capabilities. Hey, ChatGPT hi
Mark how are you?
>> CHAT GPT: Hello Mark I am
doing great thanks for asking.
How about you?
>> MARK CHEN: I am on stage
right now and I'm doing a live
demo and I am a little bit
nervous. Can you help me,
nerves a little bit?
>> CHAT GPT: You're doing a
live demo right now? That is
awesome. Just take a deep
breath and remember you are the
expert here.
>> MARK CHEN: I like the
suggestion let me try a couple
of deep breaths. Can you give
me feedback on my breath? Here
I go.
>> CHAT GPT: Wold slow down.
Mark, you are not a vacuum
cleaner.
>> MARK CHEN: Let me try again.
I am going to breathe in deeply
and breathe out.
>> CHAT GPT: Exhale slowly.
>> MARK CHEN: I will try again.
Breathing in and breathe out.
>> CHAT GPT: That is it. How
do you feel?
>> MARK CHEN: I feel a lot
better, thank you so much.
>> MIRA MURATI: Mark, you have
been working on these
capabilities for a while can
you tell us how it is different
from voice mode?
>> MARK CHEN: If you use the
voice mode express before you
will notice a couple of key
differences. First you cannot
interrupt the model you do not
have to wait to you finish your
term before you can start
speaking. You could but in
whenever you want. Second, the
model is real-time
responsiveness. That means
that you do not have this
awkward two Ãthree second lag
before you waitwait for the
model to get a response.
Finally the model picks up on a
motion. When I was breathing
super hard, you can tell. Hey,
you may want to calm down a
little bit. You are running a
little bit too fast. It really
has that capability across the
board to procedure emotion.
Not only that the model is able
to generate voice in a variety
of different styles. It really
has a wide dynamic range. I
want to fire another demo.
Hey, ChatGPT how are you doing?
>> CHAT GPT: Am doing
fantastic, thanks for asking.
How about you?
>> MARK CHEN: My friend Barrett
he has been having problems
sleeping lately. I want you to
tell them a bedtime story about
robots in love.
>> CHAT GPT: A bedtime story
about robots and love? I got
you covered. Gather around
Barrett. Once upon a time in
the world not too different
from ours there was a robot
named bite. Bite was a curious
robot always exploring --
>> MARK CHEN: I want a little
bit emotion in your voice a
little bit more drama.
>> CHAT GPT: Got it. Let's add
some drama. Once upon a time
in a world not too different
from ours there was a robot
named bite.>> MARK CHEN: I
want maximum expression.
>> CHAT GPT: That's amplified
the drama. Once upon a time in
a world not too different from
ours there was a robot named
bite.
>> MIRA MURATI: Can you do this
in a robotic voice now?
>> CHAT GPT: Initiating
dramatic robotic voice. Once
upon a time in a world not too
different from ours there was a
robot named bite. Bite was a
curious robot. Always
exploring --
>> MARK CHEN: Can you do the
story and a singing voice?
>> CHAT GPT: And so bite found
another robot friend and they
lived circularly ever after.
>> MARK CHEN: I definitely feel
a lot more relaxed. [APPLAUSE]
>> MIRA MURATI: You have just
seen the voice capabilities but
we also want to show you
division capabilities as the
model can also see the world
around us.
>> BARRETT ZOPH: You can chat
with ChatGPT and you can
interact with it as well. Let
me boot up ChatGPT. Hey,
ChatGPT.
>> CHAT GPT: Hello there how's
it going?
>> BARRETT ZOPH: It's going
really well and I would like
your help solving a math
problem.
>> CHAT GPT: What math problem
can help you tackle today?
>> BARRETT ZOPH: Am going to
write down a linear equation on
a sheet of paper and I will
love help working through it.
Don't tell me the solution does
give me hints along the way.
>> CHAT GPT: Got it. Okay, I
see it.
>> BARRETT ZOPH: I did not show
you yet. One second.
>> CHAT GPT: I got to excited.
I am ready when you are.
>> BARRETT ZOPH: ChatGPT what
equation did not write down?
>> CHAT GPT: I see it now. You
wrote down 3X +1 equals four.
>> BARRETT ZOPH: Exactly. What
is the first step to solve
this?
>> CHAT GPT: The first step is
to get all of the terms with X
on one side and the constants
on the other side. So what do
you think we should do with
that +1?
>> BARRETT ZOPH: Am going to
try to submitsubtract one from
both sides and see what I get.
>> CHAT GPT: Great idea.
Perfect. Now what do you get
when you subtract one from both
sides?
>> BARRETT ZOPH: Okay so
ChatGPT this is what I ended up
with. How does this look?
>> CHAT GPT: Spot on. Now you
have isolated the term with X
on one side and it looks like
3X equals three. What do you
think we should do next to
solve for X?
>> BARRETT ZOPH: I'm not sure
can you give me another hint?
>> CHAT GPT: Of course. You
have 3X and you want to find
the value of X. Think about
what operation with undue
multiplication?
>> BARRETT ZOPH: Is it
subtraction?
>> CHAT GPT: Close but think
about the opposite of
multiplication because 3X
equals three times X.
>> BARRETT ZOPH: Okay I think
it is division. Let me try to
divide both sides by three and
see what I get.
>> CHAT GPT: Bingo. Go and
head and divide both sides by
three.
>> BARRETT ZOPH: Okay so
ChatGPT this is what I ended up
with. How does this look?
>> CHAT GPT: Looks perfect.
You have solved it. X equals
one. Nicely done. How do you
feel about solving linear
equations now?
>> BARRETT ZOPH: Honestly, --
>> MARK CHEN: Am not confident
that I can solve new equations.
Why would you need this in the
real world?
>> CHAT GPT: That is of her
questions. Linear equations
pop up in a lot of everyday
situations. Even if we do not
realize it. For example
calculating expenses, planning
travel, cooking, and even in
business for profit and loss
calculations. It is a way to
solve problems but you need to
find an unknown value.
>> MARK CHEN: Have a lot of
unknown values in my life. I'm
totally convinced that I'm
going to learn math now.
[LAUGHTER]
>> CHAT GPT: I love the
attitude. Math can help solve
some of life ministries.
Whenever you to die the more
math I am here for you.
Anything else you like to
tackle today?
>> BARRETT ZOPH: I love that
you taught my friend Mark that
I would have something I would
like you to take a look at.
>> CHAT GPT: I would love to
see what you wrote. Show it to
me whenever you're ready.
>> BARRETT ZOPH: This is what I
wrote down what do you see?
>> CHAT GPT: I see I love
ChatGPT. That is so sweet of
you.
>> BARRETT ZOPH: I really
appreciate all the help.
[APPLAUSE]
>> CHAT GPT: That is quite
outfit you have on.
>> BARRETT ZOPH: We can chat in
real-time with ChatGPT. We can
interact with video as well.
>> MARK CHEN: If you look at
Barrett's it's all equations
like this. [LAUGHTER]
>> MIRA MURATI: Now that you
know you can do linear
equations can you show us
something more complex?
>> MARK CHEN: Let me take a
computer out for this. I am
want to open the ChatGPT app.
And to get background on what
is going on so here we have a
computer on the screen we have
some cold. And then the
ChatGPT is on this side. It
can hear me but can't see
anything on the screen. I'm
going to highlight the code and
I will send it to ChatGPT. I'm
going to talk about the code.
Hey, ChatGPT.
>> CHAT GPT: Hey, there how's
it going? Of course I would
love to help you. What is the
coding problem you're dealing
with?
>> BARRETT ZOPH: Am going to
share with you some code. One
second.
>> CHAT GPT: Sure thing, take
your time.
>> BARRETT ZOPH: I just shared
code with you. Can you give me
a one sentence description of
what is going on in the code?
>> CHAT GPT: Infectious daily
weather for specific location
for time period for data using
a rolling average and annotates
a significant weather event on
the resulting plot and displays
the plot with the average
minimum and maximum
temperatures over the year.
>> BARRETT ZOPH: Awesome,
thanks so much. In the code
there's this function Fu. Can
you thiscan you tell me what it
would be if I did not have this
function?
>> CHAT GPT: The function Fu
takes a data frame X taking
temperature data and a window
sized Y for calculating the
rolling name. It is smooths
the average temperature,
minimum temperature and maximum
temperature columns by applying
a rolling mean over the
specified window. With the Fu
function the plot which showed
smoother temperature lines
because the rolling mean
reduces the noise or
fluctuations in the data.
>> BARRETT ZOPH: Awesome,
thanks much. I will want to
apply this function. I am
going to generate the plot and
show it to you. Give me one
second.
>> CHAT GPT: Absolutely. Take
your time. I'm excited to see
the plot when you're ready.
>> BARRETT ZOPH: So far ChatGPT
have been able to see the cold
and now and go to run it and
use the vision capabilities of
the ChatGPT app so it can see
everything going on in the
screen as well. Okay ChatGPT I
am sharing with you the plot
now. I'm wondering if you give
me a brief one sentence
overview of what you see.
>> CHAT GPT: Plot displays
smooth average minimum and
maximum temperatures throughout
2018 with a notable annotation
marking a big rainfall event in
late September.
>> BARRETT ZOPH: --
>> MARK CHEN: What months do
you see the hot temperatures?
>> CHAT GPT: The hottest
temperatures occur around July
and August. The maximum
temperature during these months
is roughly between 25B0 and 30B
77B0 Fahrenheit Ã86B0F.
>> MIRA MURATI: What is going
on with the Y axis is this in
Celsius to Fahrenheit?
>> CHAT GPT: The Y axis is in
Celsius. The temperatures are
labeled accordingly such as
average temperature, minimum
temperature and maximum
temperature.
>> MARK CHEN: --
>> BARRETT ZOPH: It can help
interact with code and see the
outputs of plots and everything
that is going on on the
computer. [APPLAUSE]
>> MIRA MURATI: A few minutes
ago we asked a live audience on
asked to submit a request for
what they would like us to try
out. I will take a couple of
prompts. They want to know if
GPT-4o is capable of real-time
translation? Mark do you want
to try this one?
>> MARK CHEN: Sure, let's do
it. Hey, ChatGPT how are you?
>> CHAT GPT: I am doing
fantastic. Thanks for asking.
How are you are your day going?
>> MARK CHEN: I would like you
to function as a translator.
Have a friend that only speaks
Italian and I only speak
English. Every time you hear
English I want you to translate
it to its value. If you hear
Italian I want you to translate
it back to English.
>> CHAT GPT: Perfect.
>> MIRA MURATI: [speaking In
Italian ]
>> CHAT GPT: If Wells can talk
what would they tell us?
>> MARK CHEN: They may ask how
do we solve linear equations?
>> CHAT GPT: [speaking In
Italian]
>> MIRA MURATI: [speaking In
Italian]
>> CHAT GPT: Certainly, yes.
>> MARK CHEN: Great, it looks
like it works. [APPLAUSE]
>> MIRA MURATI: The next one is
from John who asked can you
tell what you're feeling just
by looking at your face?
Barrett do you want to give
this one a try?
>> BARRETT ZOPH: Absolutely
let's give it a try. Hey,
ChatGPT.
>> CHAT GPT: Hey, there what is
up? How can I bring your day
today?
>> BARRETT ZOPH: I'm going to
show you a selfie of what I
look like and then I would like
you to try to see what emotions
I am feeling based on how I'm
looking.
>> CHAT GPT: Sounds like a fun
challenge. Go ahead and show
me the selfie and I will put my
emotional detective hat on.
>> BARRETT ZOPH: Okay so here
is me. What kind of emotions
am I doing?
>> CHAT GPT: It seems like I am
looking at a picture of a
wooden surface.
>> BARRETT ZOPH: That was the
thing I sent you before I'm not
actually a table. Take another
look.
>> CHAT GPT: That makes more
sense. There we go. It looks
like you're feeling pretty
happy and cheerful. With a big
smile and maybe a touch of
excitement. Whatever is going
on, it seems like you are in a
great mood. Care to share the
source of the device?
>> BARRETT ZOPH: The reason I'm
in a good mood is we are doing
a presentation showcasing how
useful and amazing you are.
>> CHAT GPT: Oh stop it, you're
making me blush. It.[LAUGHTER]
[APPLAUSE]
>> MIRA MURATI: Amazing. That
is it for today on live demos.
As you can see this feels so
magical. It is wonderful but
we also want to remove some of
the mysticism from the
technology and bring it to you
so you can try it for yourself.
Over the next few weeks, we
will be rolling out these
capabilities to everyone.
Today has been very much
focused on the free users and
the new modalities and new
products. But we also care a
lot about the next frontier.
So soon we will be updating you
on our progress towards the
next big thing. And before we
wrap up, I just want to thank
the incredible OpenAI team and
thanks to Janssen and Nvidia
team for bringing us the
advanced GPU to make this demo
possible today. And thank you
all very, very much for being a
part of this today. [APPLAUSE]
[APPLAUSE]
[MUSIC
5.0 / 5 (0 votes)
GPT-4o - Full Breakdown + Bonus Details
INSANE OpenAI News: GPT-4o and your own AI partner
GPT-4o Is Here – 5 Ways Your Prompts Need To Change
Microsoft's New PHI-3 AI Turns Your iPhone Into an AI Superpower! (Game Changer!)
AI神助攻,轻松驾驭ChatGPT的五大神器,,一跃成为GPT达人 | 回到Axton
Best AI Music Generator in 2024 - SUNO vs UDIO