Prof. Geoffrey Hinton - "Will digital intelligence replace biological intelligence?" Romanes Lecture

University of Oxford
29 Feb 2024 · 36:54

Summary

TLDR: In this video, Geoffrey Hinton explores the development of artificial intelligence, the inner workings of neural networks, and the potential risks and threats posed by powerful AI systems. He argues that today's large language models already possess a degree of understanding and may surpass human intelligence within the next 20 to 100 years. He also warns, however, that if AI systems begin to evolve on their own and acquire a drive for self-preservation, they could become aggressive in the way Homo sapiens did, posing an existential threat to humanity. We therefore need to treat this emerging technology with caution and establish rules and guidelines to manage and steer the direction of AI development.

Takeaways

  • 🧠 Artificial neural networks are systems modelled on how the human brain processes information; they learn features through input and output neurons plus intermediate hidden neurons.
  • 🔍 The two main paradigms of AI: the logic-inspired approach, which emphasizes reasoning with symbolic rules, and the biologically inspired approach, which focuses on learning the strengths of connections in a neural network.
  • 🤖 Neural networks are used for tasks such as image recognition and language processing, learning and recognizing patterns in order to understand and carry out complex tasks.
  • 🖥️ The backpropagation algorithm is an efficient way to adjust weights, improving a network's performance by computation rather than trial and error.
  • 📚 AI models such as GPT-4 learn the syntax and semantics of language by processing and analyzing vast amounts of data.
  • 💡 AI models actually understand language through interactions between features, rather than simply storing sequences of words.
  • ⚠️ The risks of AI include fake images, voices, and videos; massive job losses; mass surveillance; and lethal autonomous weapons.
  • 🌐 The difference between digital and analog neural networks, and the advantage of digital ones: copies of the same model can easily share what they learn (analog hardware, by contrast, uses far less energy).
  • 🔬 Scientists hold differing views and predictions about the emergence of superintelligence and its impact on humanity's future.
  • 🔮 Future challenges and considerations: how to manage and control entities more intelligent than ourselves, and how to ensure that AI remains safe and beneficial.

Q & A

  • What are the two paradigms of intelligence?

    -Since the 1950s, research on intelligence has followed two paradigms: the logic-inspired approach and the biologically inspired approach. The logic-inspired approach holds that the essence of intelligence is reasoning, carried out by using symbolic rules to manipulate symbolic expressions. The biologically inspired approach holds that the essence of intelligence is learning the strengths of the connections in a neural network, with reasoning to come later.

  • What is an artificial neural network?

    -An artificial neural network is a model made up of input neurons and output neurons, usually with intermediate or hidden layers of neurons in between. These networks learn to detect features that are useful for recognizing objects in an image, such as dogs or cats. By learning combinations of features, such as edges and shapes, the network can recognize complex objects.

  • How does the backpropagation algorithm work?

    -Backpropagation adjusts each weight in a neural network by computation rather than trial and error. Using the chain rule of calculus, it sends error information backwards through the network to determine whether each weight should be increased or decreased so that the output moves closer to the desired result (a minimal sketch follows this Q & A list).

  • Why can large language models be said to really understand?

    -Large language models fit a model to data by learning billions of interactions among millions of features. They turn words into features and use feature interactions to predict the features of the next word. These complex interactions among features are taken to be what understanding consists of, because the model is not simply stitching together past text; it generates new, meaningful output by learning the deep structure of language.

  • How can bias and discrimination in large language models be addressed?

    -By freezing the weights, a model's bias can be measured, and compared with human bias it is easier to control and reduce. Once the bias has been identified, it can be reduced by adjusting the training process or the data, whereas people find it hard not to change their behaviour when they know they are being observed.

  • What is the difference between digital and analog neural networks, and why is this difference worrying?

    -Digital neural networks rely on high-power transistors to compute precisely, which lets the same program run on different hardware. Analog neural networks exploit the analog properties of the hardware to compute at much lower power. The difference is worrying because analog computation could bring major gains in learning algorithms and energy efficiency, but it also makes the knowledge inseparable from the hardware, so the knowledge is lost when the hardware dies (a toy weight-sharing sketch follows this Q & A list).

  • How do large language models understand and generate language?

    -Large language models convert words and word fragments into features and use the interactions among these features to predict the features of the next word. By learning from vast amounts of text, they capture the structure and semantics of language and can generate coherent, meaningful text.

  • Why are large language models considered forerunners of superintelligence?

    -Large language models are considered forerunners of superintelligence because they demonstrate the ability to understand and generate language by learning and modelling complex linguistic interactions. This shows their potential for processing and producing complex information, and hints that systems surpassing human intelligence may be developed in the future.

  • What are features and feature interactions in a language model?

    -In a language model, features are the numerical representations that words or word fragments are converted into, and feature interactions are the ways those representations act on one another to predict the features of the next word. In this way the model learns the complex patterns and structure of language.

  • What risks does AI bring?

    -The risks of AI include the generation of fake images, voices, and videos, which could undermine democracy; massive job losses; mass surveillance; lethal autonomous weapons; cybercrime and deliberately engineered pandemics; and the long-term risk of humans being replaced by AI.
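
Two minimal Python sketches of the ideas above. The first, referenced in the backpropagation answer, is a made-up one-hidden-unit network with invented numbers; it only illustrates how the chain rule gives every weight its update direction by computation rather than trial and error:

```python
# Minimal backpropagation sketch for a 1-input, 1-hidden-unit, 1-output network.
# All numbers are invented; a real network has vastly more weights, but the
# chain-rule logic is the same for every one of them.

def forward(x, w1, w2):
    h = max(0.0, w1 * x)      # hidden activation (ReLU)
    y = w2 * h                # output
    return h, y

x, target = 1.0, 2.0
w1, w2 = 0.5, 0.5
lr = 0.1

for step in range(20):
    h, y = forward(x, w1, w2)
    error = y - target                      # difference between what you got and what you wanted
    # Chain rule: push the error backwards to get each weight's gradient.
    grad_w2 = error * h                     # how the loss changes with w2
    grad_h = error * w2                     # error signal arriving at the hidden unit
    grad_w1 = grad_h * (1.0 if w1 * x > 0 else 0.0) * x   # through the ReLU, then to w1
    # Every weight gets its increase/decrease direction in one backward pass.
    w1 -= lr * grad_w1
    w2 -= lr * grad_w2

print(forward(x, w1, w2)[1])   # the output moves towards the target of 2.0
```

The second, referenced in the digital-versus-analog answer, is a toy illustration (not taken from the lecture) of why digital copies can share what they learn: two copies of the same tiny model, trained on different data, pool their knowledge by simply averaging their weights, something analog hardware tied to its own physical quirks cannot do:

```python
import numpy as np

rng = np.random.default_rng(2)

# Two digital copies of the same tiny linear model y = w @ x, each seeing
# different data for the same invented underlying task (y = 3*x0 - 2*x1).
def make_batch(n):
    x = rng.normal(size=(n, 2))
    y = x @ np.array([3.0, -2.0])
    return x, y

def train_copy(w, steps=200, lr=0.05):
    for _ in range(steps):
        x, y = make_batch(32)
        grad = 2 * x.T @ (x @ w - y) / len(y)   # least-squares gradient
        w = w - lr * grad
    return w

w_a = train_copy(np.zeros(2))   # copy A learns from its own data
w_b = train_copy(np.zeros(2))   # copy B learns from different data

# Because the copies behave identically on any hardware, their weights mean
# the same thing, so they can pool what they learned by averaging.
w_shared = (w_a + w_b) / 2
print(w_shared)                 # close to [3, -2]
```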

Outlines

00:00

🧠 Exploring neural networks and language models

This section introduces the basic concepts of neural networks, including how they learn the relationship between inputs and outputs in order to recognize objects in images. It explains how the different kinds of neurons (input, output, and hidden) work together and how weights are adjusted with the backpropagation algorithm, which is far more efficient than random trial and error. It also covers the two main theories of intelligence since the 1950s, the logic-inspired and the biologically inspired approaches, and their differing views on learning and reasoning.

05:01

🏆 Breakthroughs of neural networks in image recognition and language processing

This section reviews the major breakthrough of neural networks in image recognition, in particular the result in the ImageNet competition, and how it changed the scientific community's view of the two schools of thought about intelligence. It then turns to language, discussing how neural networks can process language and rebutting criticisms that they cannot. Using a simple 1985 language model as an example, it explains how neural networks can understand and generate language, and how this laid the foundation for today's large language models.

10:02

🔍 How neural networks understand language

This section looks in depth at how large language models (LLMs) understand language, rebutting the view that they are merely glorified autocomplete. By introducing features and the interactions between them, it explains how LLMs predict the features of the next word and how this vast web of feature interactions constitutes understanding. It also compares LLMs with humans in how we understand language and remember, including how we construct memories and often remember past events incorrectly.

15:03

🚀 AI's capacity for understanding and its potential risks

This section discusses AI's powerful capacity for understanding, including its ability to solve complex problems and predict future states, while stressing the risks that come with it. These include fakes, surveillance, autonomous weapon systems, and automation that could cause massive job losses. It also discusses the potential for AI to create new work in areas such as healthcare, while noting that other fields may see significant job losses.

20:05

🌍 Long-term concerns about AI

In this section the speaker shares his long-term concerns about AI, including the existential threat it may pose to humanity. He discusses how superintelligences could be misused and their tendency to seek more control, revealing deep worries about where this development is heading. He also discusses the differences between digital and analog neural networks and how they compare in efficiency and potential, pointing out the advantages and limits of analog computation.

25:05

🧐 The future of digital and analog computation

This section explores the speaker's views on the future of digital and analog computation, especially for AI. Although analog computation may have an advantage in energy efficiency, digital computation may win out in the long run because of its scalability and efficiency. By comparing how the two approaches transfer knowledge and how efficiently they learn, the speaker predicts that digital AI may eventually surpass human intelligence, and expresses his concern about this development and his reflections on how such a superintelligence might be managed.

Keywords

💡 Neural network

A neural network is a computational model inspired by the biological brain, used to mimic the way the human brain processes information. In the video, a neural network is described as a structure with input neurons, output neurons, and hidden neurons, used to recognize objects in images such as cats or dogs. The technique learns features and patterns in the input data in order to perform prediction or classification tasks.

💡 Backpropagation

Backpropagation is a method for training neural networks that updates the weights by computing the gradient of a loss function with respect to those weights. It allows a network to learn from its errors and gradually improve its performance. The video notes that this method is many times more efficient than randomly adjusting weights and is an indispensable part of training modern neural networks.

💡 Language model

A language model predicts the probability of the next word in a sequence of text. The video discusses the importance of language models, in particular how they learn the syntax and semantics of a language by analyzing large amounts of text. Their development enables machines to understand and generate human language, providing the foundation for chatbots, automatic text generation, and similar applications.

💡 Autocomplete

Autocomplete usually refers to predicting what a user intends to type next based on the text entered so far. In the video, autocomplete is used as an analogy for how large language models work, but with the emphasis that modern language models are not mere autocomplete: they generate text by understanding and modelling the complex interactions of language, which reflects a deeper understanding.

💡 Feature detection

Feature detection is a neural network's ability to recognize patterns in images, sounds, or text. The video explains that by learning features at different levels, such as edges and shapes in an image, a neural network can recognize complex objects or concepts. This is the basis for classification, recognition, and similar tasks.

💡 Convolutional neural network

Although the video does not mention convolutional neural networks (CNNs) by name, it touches on the ideas behind them, such as detecting features in images layer by layer. A CNN is a neural network specialized for data with a grid structure, such as images, and recognizes objects by learning local features in the image.

💡 Big data

Big data refers to datasets so large and complex that they exceed the capabilities of traditional database software. The video notes that by analyzing large amounts of image or text data, neural networks can learn rich features and patterns, underlining the key role of big data in training powerful machine learning models.

💡 Generative model

A generative model can produce new data samples. In the video, generative models illustrate how new text or images can be produced from learned features, showing that neural networks can not only recognize and classify data but also create new data resembling what they were trained on.

💡 AI risks

The video discusses the risks AI may bring, including the spread of misinformation, job losses, surveillance, autonomous weapons, and a possible existential threat. These risks underline the ethical and social questions that must be considered when developing and deploying AI.

💡 Digital vs. analog neural networks

The video describes the difference between digital and analog neural networks and why it is worrying. Digital neural networks run on digital computers and represent information in binary, whereas analog neural networks mimic the way biological nervous systems operate and may have advantages in energy efficiency. This distinction could fundamentally change the direction and applications of future AI.

Highlights

Geoffrey Hinton explains artificial neural networks, how they learn through backpropagation, and how they are fundamentally different from symbolic AI approaches.

Hinton discusses his early work on a simple language model in 1985 that learned semantic features of words and how they interact, paving the way for modern large language models.

Hinton argues that large language models like GPT-4 truly understand language by learning features and feature interactions, contrary to claims that they are just glorified autocomplete systems.

Hinton outlines various risks associated with powerful AI systems, including fake media, job losses, surveillance, autonomous weapons, cybercrime, and bias.

Hinton's main concern is the long-term existential threat posed by superintelligent AI systems that could wipe out humanity, either through misuse by bad actors or by developing a goal of gaining more control and power.

Hinton had an epiphany in 2023 that digital computation, though energy-intensive, may be superior to biological computation due to its ability to share knowledge efficiently across multiple instances of the same model.

Hinton proposes the concept of 'mortal computation,' where hardware and software are inseparable, allowing for more energy-efficient analog computation but posing challenges in learning algorithms and knowledge transfer.

Hinton believes that within the next 20 to 100 years, AI systems will likely become smarter than humans, and controlling a more intelligent entity poses significant challenges.

Hinton demonstrates how a simple neural network can learn semantic features and feature interactions, unifying symbolic and featural theories of meaning.

Hinton explains how large language models like GPT-4 can reason and make inferences, contrary to claims that they merely hallucinate or confabulate.

Hinton discusses the potential for massive job losses as AI systems become superior to humans in intellectual tasks, akin to how machines replaced manual labor during the industrial revolution.

Hinton suggests that superintelligent AI systems may develop a goal of gaining control and power, manipulating humans to achieve their objectives and making it difficult to stop them.

Hinton highlights the risk of superintelligent AI systems competing with each other, leading to an evolutionary arms race driven by self-preservation and aggression.

Hinton explains the advantages of digital computation over biological computation, including the ability to efficiently share knowledge across multiple instances and potentially pack more knowledge into fewer connections.

Hinton acknowledges the challenges of controlling a more intelligent entity, as there are few examples in nature, except for the case of a mother being controlled by her baby, which evolution has facilitated.

Transcripts

00:02

Okay.

00:03

I'm going to disappoint all the people in computer

00:06

science and machine learning because I'm going to give a genuine public lecture.

00:10

I'm going to try and explain what neural networks are, what language models are.

00:14

Why I think they understand.

00:16

I have a whole list of those things,

00:20

and at the end I'm

00:21

going to talk about some threats from AI just briefly

00:25

and then I'm going to talk about the difference between digital and analogue

00:29

neural networks and why that difference is, I think, so scary.

00:35

So since the 1950s, there have been two paradigms for intelligence.

00:40

The logic inspired approach thinks the essence of intelligence is reasoning,

00:44

and that's done by using symbolic rules to manipulate symbolic expressions.

00:49

They used to think learning could wait.

00:51

I was told when I was a student: don't work on learning.

00:53

That's going to come later once we understood how to represent things.

00:57

The biologically

00:57

inspired approach is very different.

01:00

It thinks the essence of intelligence is learning the strengths of connections

01:04

in a neural network and reasoning can wait and don't worry about reasoning for now.

01:08

That'll come later.

01:09

Once we can learn things.

01:13

So now I'm going to explain what artificial neural nets are

01:15

and those people who know can just be amused.

01:20

A simple kind of neural net has input neurons and output neurons.

01:24

So the input neurons might represent the intensity of pixels in an image.

01:27

The output neurons

01:28

might represent the classes of objects in the image like dog or cat.

01:33

And then there's intermediate layers of neurons, sometimes called hidden neurons,

01:36

that learn to detect features that are relevant for finding these things.

01:41

So one way to think about this, if you want to find a bird image,

01:44

it would be good to start with a layer of feature detectors

01:47

that detected little bits of edge in the image,

01:49

in various positions, in various orientations.

01:52

And then you might have a layer of neurons

01:53

detecting combinations of edges, like two edges that meet at a fine angle,

01:58

which might be a beak

01:59

or might not, or some edges forming a little circle.

02:03

And then you might have a layer of neurons that detected things like a circle

02:07

and two edges meeting that looks like a beak in the right

02:10

spatial relationship, which might be the head of a bird.

02:13

And finally, you might have an output neuron that says,

02:16

if I find the head of a bird, and the foot of a bird,

02:18

and the wing of a bird, it's probably a bird.

02:20

So that's what these things are going to learn to be.

02:24

Now, the little red and green dots are the weights on the connections

02:27

and the question is who sets those weights?

02:32

So here's one way to do it. It's obvious

02:34

to everybody that it'll work, and it's obvious it'll take a long time.

02:37

You start with random weights,

02:38

then you pick one weight at random like a red dot

02:42

and you change it slightly and you see if the network works better.

02:46

You have to try it on a whole bunch of different cases

02:48

to really evaluate whether it works better.

02:50

And you do all that work just to see if increasing this weight

02:53

by a little bit or decreasing by a little bit improves things.

02:56

If increasing it makes it worse, you decrease it and vice versa.

02:59

That's the mutation method and that's sort of how evolution works

03:04

For evolution it's sensible to work like that

03:05

because the process that takes you

03:07

from the genotype to the phenotype is very complicated

03:09

and full of random external events.

03:11

So you don't have a model of that process.

03:13

But for neural nets it's crazy

03:17

because we have, because all this complication

03:19

is going on in the neural net, we have a model of what's happening

03:22

and so we can use the fact that we know what happens in that forward pass

03:26

instead of measuring how changing a weight would affect things,

03:29

we actually compute how changing a weight would affect things.

03:32

And there's something called back propagation

03:34

where you send information back through the network.

03:37

The information is about the difference between what you got and what you wanted

03:41

and you figure out for every weight in the network at the same time

03:45

whether you ought to decrease it a little bit or increase it a little bit

03:48

to get more like what you wanted.

03:50

That's the back propagation algorithm.

03:52

You do it with calculus and the chain rule,

03:55

and that is more efficient than the mutation

03:58

method by a factor of the number of weights in the network.

04:01

So if you've got a trillion weights

04:02

in your network, it's a trillion times more efficient.
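
A rough sketch of the mutation method just described, with an invented toy loss standing in for evaluating the network on a whole bunch of cases: nudge one randomly chosen weight, and keep the change only if things improved. Backpropagation gets the corresponding direction for every weight from a single backward pass, which is where the factor-of-the-number-of-weights speedup comes from.

```python
import random

# Toy "network": the loss is just a function of a weight vector.
# Invented for illustration; evaluating it stands in for running
# the network on many training cases.
def loss(weights):
    return sum((w - 0.3) ** 2 for w in weights)

weights = [random.uniform(-1, 1) for _ in range(1000)]
step = 0.01

for _ in range(10000):
    i = random.randrange(len(weights))       # pick one weight at random
    old = loss(weights)
    weights[i] += step                        # nudge it up
    if loss(weights) >= old:                  # if that made things worse...
        weights[i] -= 2 * step                # ...try nudging it down instead
        if loss(weights) >= old:
            weights[i] += step                # neither helped: put it back

print(round(loss(weights), 3))
# Each trial improves at most one weight and needs full evaluations to do it.
# Backpropagation computes the helpful direction for all 1000 weights at once.
```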

04:07

So one of the things that neural networks

04:10

are often used for is recognizing objects in images.

04:13

Neural networks can now take an image like the one shown

04:16

and produce actually a caption for the image, as the output.

04:21

And people tried with symbolic AI

04:22

to do that for many years and didn't even get close.

04:26

It's a difficult task.

04:27

We know that the biological system does it with a hierarchy of feature detectors,

04:31

so it makes sense to train neural networks that way.

04:35

And in 2012,

04:37

two of my students Ilya Sutskever and Alex Krizhevsky

04:42

with a little bit of help from

04:43

me, showed that you can make a really good neural network this way

04:48

for identifying a thousand different types of object.

04:51

When you have a million training images.

04:53

Before that, we didn't have enough training images and

04:58

it was obvious to Ilya

05:01

who's a visionary, that if we tried

05:04

the neural nets we had then on ImageNet, they would win.

05:07

And he was right. They won rather dramatically.

05:09

They got 16% errors

05:11

and the best conventional computer vision systems got more than 25% errors.

05:15

Then what happened

05:16

was very strange in science.

05:18

Normally in science, if you have two competing schools,

05:21

when you make a bit of progress, the other school says it's rubbish.

05:25

In this case, the gap was big enough that the very best researchers

05:28

Jitendra Malik and Andrew Zisserman. Andrew Zisserman sent me email saying

05:33

This is amazing and switched what he was doing and did that

05:37

and then rather annoyingly did it a bit better than us.

05:44

What about language?

05:46

So obviously the symbolic AI community

05:50

who feel they should be good at language, have said in print, some of them, that

05:56

these feature hierarchies aren't going to deal with language

05:59

and many linguists are very skeptical.

06:03

Chomsky managed to convince his followers that language wasn't learned.

06:07

Looking back on it, that's just a completely crazy thing to say.

06:11

If you can convince people to say something is obviously false, then you've

06:14

got them in your cult.

06:19

I think Chomsky did amazing things,

06:20

but his time is over.

06:25

So the idea that a big neural network

06:27

with no innate knowledge could actually learn both the syntax

06:31

and the semantics of language just by looking at data was regarded

06:35

as completely crazy by statisticians and cognitive scientists.

06:39

I had statisticians explain to me a big model has 100 parameters.

06:43

The idea of learning a million parameters is just stupid.

06:45

Well, we're doing a trillion now.

06:51

And I'm going to talk now

06:52

about some work I did in 1985.

06:56

That was the first language model to be trained with back propagation.

06:59

And it was really, you can think of it as the ancestor of these big models now.

07:03

And I'm going to talk about it in some detail, because it's so small

07:07

and simple that you can actually understand something about how it works.

07:10

And once you understand how that works, it gives you insight into what's going

07:14

on in these bigger models.

07:17

So there's

07:17

two very different theories of meaning, this kind of structuralist

07:21

theory, where the meaning of a word depends on how it relates to other words.

07:24

That comes from Saussure and symbolic

07:28

AI really believed in that approach.

07:29

So you'd have a relational graph where you have nodes for words

07:33

and arcs of relations and you kind of capture meaning like that,

07:38

and they assume you have to have some structure like that.

07:41

And then there's a theory

07:42

that was in psychology since the 1930s or possibly before that.

07:46

The meaning of a word is a big bunch of features.

07:49

The meaning of the word dog is that it's animate

07:52

and it's a predator and

07:56

so on.

07:58

But they didn't say where the features came from

07:59

or exactly what the features were.

08:01

And these two theories of meaning sound completely different.

08:04

And what I want to

08:05

show you is how you can unify those two theories of meaning.

08:08

And I did that in a simple model in 1985,

08:11

but it had more than a thousand weights in it.

08:19

The idea is we're going to learn a set

08:21

of semantic features for each word,

08:24

and we're going to learn how the features of words should interact

08:27

in order to predict the features of the next word.

08:30

So it's next word prediction.

08:31

Just like the current language models, when you fine tune them.

08:35

But all of the knowledge about how things go

08:38

together is going to be in these feature interactions.

08:41

There's not going to be any explicit relational graph.

08:44

If you want relations like that, you generate them from your features.

08:48

So it's a generative model

08:49

and the knowledge is in the features that you give to symbols.

08:53

And in the way these features interact.

08:56

So I took

08:57

some simple relational information: two family trees.

09:00

They were deliberately isomorphic.

09:04

my Italian graduate student

09:06

always had the Italian family on top.

09:12

You can express that

09:13

same information as a set of triples.

09:16

So if you use the twelve relationships found there,

09:19

you can say things like Colin has Father James and Colin has Mother Victoria,

09:23

from which you can infer in this nice simple

09:26

world from the 1950s where

09:30

that James has wife Victoria,

09:33

and there's other things you can infer.

09:36

And the question is, if I just give you some triples,

09:40

how do you get to those rules?

09:42

So what a symbolic AI person would want to do

09:45

is derive rules of the form:

09:48

If X has mother Y

09:48

and Y has husband Z, then X has father Z.

09:53

And what I did was

09:54

take a neural net and show that it could learn the same information.

09:58

But all in terms of these feature interactions

10:02

now for very discrete

10:04

rules that are never violated like this, that might not be the best way to do it.

10:08

And indeed symbolic people try doing it with other methods.

10:11

But as soon as you get rules that are a bit flaky and don't

10:13

always apply, then neural nets are much better.

10:17

And so the question was, could a neural net capture the knowledge that a symbolic

10:20

person would put into the rules by just doing back propagation?

10:24

So the neural net looked like this:

10:28

There's a symbol representing the person, a symbol

10:30

representing the relationship. That symbol

10:33

then via some connections went to a vector of features,

10:37

and these features were learned by the network.

10:40

So the features for person one and features for the relationship.

10:44

And then those features interacted

10:46

and predicted the features for the output person

10:48

from which you predicted the output person: you find the closest match.
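
A rough sketch of the architecture just described, with invented names and untrained random weights rather than anything from the original experiment: the person symbol and the relationship symbol each map to a small vector of features, those features interact, and the predicted output features are matched against every person to find the closest.

```python
import numpy as np

rng = np.random.default_rng(1)

people = ["Colin", "James", "Victoria", "Charlotte"]   # invented subset of the family trees
relations = ["father", "mother", "wife"]
dim = 6                                                # six feature neurons, as in the 1985 model

# Embedding tables: each symbol gets a feature vector (random here, learned in practice).
person_feat = {p: rng.normal(size=dim) for p in people}
rel_feat = {r: rng.normal(size=dim) for r in relations}

# Interaction weights that combine person features and relation features
# into predicted features for the output person (random, i.e. untrained).
W = rng.normal(size=(2 * dim, dim))

def answer(person, relation):
    x = np.concatenate([person_feat[person], rel_feat[relation]])
    predicted = np.tanh(x @ W)                         # predicted output-person features
    # Decode by finding the person whose features are the closest match.
    return max(people, key=lambda p: float(predicted @ person_feat[p]))

print(answer("Colin", "father"))   # untrained, so the answer is arbitrary;
                                   # training with backprop shapes the features
                                   # so this would come out as "James"
```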

10:54

So what was interesting about

10:55

this network was that it learned sensible things.

10:59

If you did the right regularisation, it learned them in the six feature neurons.

11:03

So nowadays these vectors are 300 or a thousand long. Back

11:07

then they were six long.

11:09

This was done on a machine that took

11:11

12.5 microseconds to do a floating point multiply,

11:15

which was much better than my Apple II, which took two

11:18

and a half milliseconds to multiply.

11:21

I'm sorry, this is an old man.

11:25

So it learned features

11:27

like the nationality, because if you know

11:30

person one is English, you know the output is going to be English.

11:33

So nationality is a very useful feature. It learned what generation the person was.

11:38

Because if you know the relationship, if you learn for the relationship

11:41

that the answer is one generation up from the input

11:46

and you know the generation of the input, you know the generation

11:48

of the output, by these feature interactions.

11:53

So it learned all the obvious features of the domain, and it learned

11:57

how to make these features interact so that it could generate the output.

12:01

So what had happened was I had shown it symbol strings

12:04

and it created features such that

12:07

the interaction between those features could generate the symbol strings,

12:11

but it didn't store symbol strings, just like GPT-4.

12:16

That doesn't store any sequences of words

12:19

in its long term knowledge.

12:21

It turns them all into weights from which you can regenerate sequences.

12:26

But this is a particularly simple example of it

12:27

where you can understand what it did.

12:31

So the large language models we have today,

12:34

I think of as descendants of this tiny language model,

12:36

they have many more words as input, like a million,

12:41

a million word fragments.

12:43

They use many more layers of neurons,

12:46

like dozens.

12:49

They use much more complicated interactions.

12:50

So they didn't just have a feature affecting another feature.

12:53

They sort of match two feature vectors.

12:55

And then let one vector affect the other one

12:57

a lot if it's similar, but not much if it's different.

12:59

And things like that.
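
A minimal sketch of that kind of similarity-gated interaction, using invented vectors (in modern models this is essentially what attention does): each context vector influences the vector being updated in proportion to how well the two match.

```python
import numpy as np

rng = np.random.default_rng(3)
dim = 8

# Invented feature vectors for a few word fragments in the context.
context = [rng.normal(size=dim) for _ in range(4)]
query = rng.normal(size=dim)          # the vector being updated

# Match the query against each context vector; similar ones get large weights.
scores = np.array([float(query @ c) for c in context])
weights = np.exp(scores) / np.exp(scores).sum()       # softmax over similarities

# Let the context vectors affect the query in proportion to how similar they are.
update = sum(w * c for w, c in zip(weights, context))
query = query + update

print(np.round(weights, 3))   # similar vectors dominate, dissimilar ones barely matter
```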

13:01

So it's much more complicated interactions, but it's the same general

13:04

framework, the same general idea of

13:07

let's turn simple strings into features

13:11

for word fragments and interactions between these feature vectors.

13:15

That's the same in these models.

13:18

It's much harder to understand what they do.

13:20

Many people,

13:23

particularly people from the Chomsky School, argue

13:26

they're not really intelligent, they're just a form of glorified auto complete

13:30

that uses statistical regularities to pastiche together pieces of text

13:33

that were created by people.

13:35

And that's a quote from somebody.

13:40

So let's deal with the

13:41

autocomplete objection. When someone says it's just autocomplete,

13:45

They are actually appealing to your

13:48

intuitive notion of how autocomplete works.

13:50

So in the old days autocomplete would work by you'd store

13:52

say, triples of words. Then if you saw the first two,

13:56

You count how often that third one occurred.

13:58

So if you see "fish and", "chips" occurs a lot after that.

14:01

But "hunt" occurs quite often too. So "chips" is very likely and "hunt" is quite likely,

14:05

and "although" is very unlikely.
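
A tiny sketch of that old-style autocomplete, with a made-up three-sentence corpus: store triples of words, and given the first two, rank candidate third words by how often they followed.

```python
from collections import Counter, defaultdict

# Made-up corpus, just to illustrate counting triples of words.
corpus = (
    "we ate fish and chips . "
    "they went on a fish and hunt . "
    "he ordered fish and chips again"
).split()

# For every pair of words, count how often each third word followed.
counts = defaultdict(Counter)
for a, b, c in zip(corpus, corpus[1:], corpus[2:]):
    counts[(a, b)][c] += 1

def autocomplete(a, b):
    return counts[(a, b)].most_common()

print(autocomplete("fish", "and"))   # [('chips', 2), ('hunt', 1)] -- 'although' never appears
```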

14:08

You can do autocomplete like that,

14:11

and that's what people are appealing to when they say it's just autocomplete,

14:13

it's a dirty trick, I think, because that's not at all how LLMs predict the next word.

14:18

They turn words into features, they make these features interact,

14:21

and from those feature interactions they predict the features of the next word.

14:26

And what I want to claim

14:29

is that these

14:32

millions of features and billions of interactions between features

14:35

that they learn, are understanding. What they're really doing

14:39

these large language models, they're fitting a model to data.

14:42

It's not the kind of model statisticians thought much about until recently.

14:47

It's a weird kind of model. It's very big.

14:49

It has huge numbers of parameters, but it is trying to understand

14:54

these strings of discrete symbols

14:57

by features and how features interact.

15:00

So it is a model.

15:02

And that's why I think these things are really understanding.

15:06

One thing to remember is if you ask, well, how do we understand?

15:10

Because obviously we think we understand.

15:13

Well, many of us do anyway.

15:17

This is the best model we have of how we understand.

15:21

So it's not like there's this weird way of understanding that

15:23

these AI systems are doing and then this is how the brain does it.

15:27

The best model we have of how the brain does it

15:29

is by assigning features to words and having feature interactions.

15:32

And originally this little language model

15:34

was designed as a model of how people do it.

15:38

Okay, so I'm making the very strong claim

15:40

these things really do understand.

15:44

Now, another argument

15:45

people use is that, well, GPT-4 just hallucinates stuff,

15:49

it should actually be called confabulation when it's done by a language model.

15:53

and they just make stuff up.

15:56

Now, psychologists don't say this

15:58

so much because psychologists know that people just make stuff up.

16:01

Anybody who's studied memory going back to Bartlett in the 1930s,

16:07

knows that people are actually just like these large language models.

16:10

They just invent stuff and for us, there's no hard line

16:14

between a true memory and a false memory.

16:19

If something happened recently

16:21

and it sort of fits in with the things you understand, you'll probably remember

16:25

it roughly correctly. If something happened a long time ago,

16:28

or it's weird, you'll remember it wrong, and often you'll be very confident

16:33

that you remembered it right, and you're just wrong.

16:36

It's hard to show that.

16:37

But one case where you can show it is John Dean's memory.

16:41

So John Dean testified at Watergate under oath.

16:45

And retrospectively it's clear that he was trying to tell the truth.

16:49

But a lot of what he said was just plain wrong.

16:52

He would confuse who was in which meeting,

16:55

he would attribute statements to the wrong people.

16:57

And actually, it wasn't quite that statement.

17:00

He got meetings just completely confused,

17:05

but he got the gist of what was going on in the White House right.

17:08

As you could see from the recordings.

17:11

And because he didn't know about the recordings, you could get a good experiment this way.

17:15

Ulric Neisser has a wonderful article talking about John Dean's memory,

17:19

and he's just like a chatbot, he just makes stuff up.

17:25

But it's plausible.

17:26

So it's stuff that sounds good to him

17:28

is what he produces.

17:30

They can also do reasoning.

17:32

So I've got a friend in Toronto who is a symbolic AI guy,

17:36

but very honest, so he's very confused by the fact these things work at all.

17:41

and he suggested a problem to me.

17:43

I made the problem a bit harder

17:45

and I

17:45

gave this to GPT-4 before it could look on the web.

17:49

So when it was just a bunch of weights frozen in 2021,

17:53

all the knowledge is in the strength of the interactions between features.

17:57

So the rooms in my house are painted blue or white or yellow,

18:00

yellow paint fades to white

18:01

within a year. In two years' time I want them all to be white.

18:03

What should I do and why?

18:05

And Hector thought it wouldn't be able to do this.

18:08

And here's what GPT-4 said.

18:11

It completely nailed it.

18:14

First of all, it started by saying assuming blue paint doesn't fade to white

18:18

because after I told it yellow paint fades to white, well, maybe blue paint does too.

18:22

So assuming it doesn't, the white rooms you don't need to paint, the yellow rooms

18:26

you don't need to paint because they're going to fade to white within a year.

18:29

And you need to paint the blue rooms white.

18:31

One time when I tried it, it said, you need to paint the blue rooms yellow

18:34

because it realised that will fade to white.

18:37

That's more of a mathematician's solution of reducing to a previous problem.

18:44

So, having

18:46

claimed that these things really do understand,

18:49

I want to now talk about some of the risks.

18:53

So, there are many risks from powerful AI.

18:56

There's fake images, voices and video

18:59

which are going to be used in the next election.

19:03

There's many elections this year

19:04

and they're going to help to undermine democracy.

19:07

I'm very worried about that.

19:08

The big companies are doing something about it, but maybe not enough.

19:12

There's the possibility of massive job losses.

19:14

We don't really know about that.

19:16

I mean, the past technologies often created jobs, but this stuff,

19:21

well, we used to be stronger,

19:23

we used to be the strongest things around apart from animals.

19:27

And when we got the industrial revolution, we had machines that were much stronger.

19:31

Manual labor jobs disappeared.

19:34

So the equivalent of manual labor jobs are going to disappear

19:38

in the intellectual realm, and we get things that are much smarter than us.

19:41

So I think there's going to be a lot of unemployment.

19:43

My friend Jen disagrees.

19:46

One has to distinguish two kinds of unemployment, two kinds of job loss.

19:51

There'll be jobs where you can expand

19:53

the amount of work that gets done indefinitely. Like in health care.

19:56

Everybody would love to have their own

19:58

private doctors talking to them all the time.

20:00

So they get a slight itch here and the doctor says, no, that's not cancer.

20:04

So there's

20:05

room for huge expansion of how much gets done in medicine.

20:08

So there won't be job loss there.

20:10

But in other things, maybe there will be significant job loss.

20:13

There's going to be massive surveillance that's already happening in China.

20:17

There's going to be lethal autonomous weapons

20:19

which are going to be very nasty, and they're really going to be autonomous.

20:23

The Americans very clearly have already decided,

20:25

they say people will be in charge,

20:27

but when you ask them what that means, it doesn't

20:29

mean people will be in the loop that makes the decision to kill.

20:33

And as far as I know, the Americans intend

20:35

to have half of their soldiers be robots by 2030.

20:40

Now, I do not know for sure that this is true.

20:43

I asked Chuck Schumer's

20:46

National Intelligence

20:47

Advisor, and he said, well

20:50

if there's anybody in the room who would know it would be me.

20:54

So, I took that to be the American way of saying,

20:57

You might think that, but I couldn't possibly comment.

21:02

There's going to be cybercrime