Trending Songs Recently Updated Songs Popular Music Genres Add Songs

Explore

Display Bilingual:

Off Tiếng Việt 日本語 Español Português 한국어 Français 中文

This is something of a nice change. I've 00:00

given a lot of scientific talks and no 00:02

one claps and cheers when I come on. Not 00:04

normally even when I come on. 00:06

It's really exciting. It's really 00:13

wonderful to be here. I guess I should 00:15

start off assuming that not everyone in 00:17

this cavernous hall knows who I am. Who 00:19

am I? I'm I'm someone who has done some 00:22

work in AI for science who really 00:25

believes that we can use the AI systems, 00:28

these technologies, these ideas to 00:31

change the world in a very specific way 00:34

to make science go faster to enable new 00:37

discoveries. I think it's really really 00:39

wonderful. We have the opportunity to 00:42

take these tools, these ideas 00:44

and aim them toward the question of how 00:47

can we build the right AI systems so 00:49

that sick people can become healthy and 00:52

go home from the hospital. And it's been 00:54

kind of a a really wonderful and winding 00:57

journey for me to end up here. I was 01:00

originally trained as a physicist. I 01:02

thought I was going to be a laws of the 01:04

universe physicist. If I was very very 01:06

lucky, I could do something that would 01:08

end up one sentence in a textbook. 01:10

And I did physics and I went to actually 01:13

do a PhD in physics. And then kind of 01:16

what I was working on didn't really grab 01:19

me. I just it didn't feel like what I 01:22

wanted to do. So I dropped out. I didn't 01:24

start a startup. That would have been 01:26

very on point for this event, but I uh 01:28

dropped out and I ended up working at a 01:31

company that was doing computational 01:33

biology. How do we get computers to say 01:35

something smart about biology? And I 01:38

loved it. I loved it not just because it 01:40

was fun, but it was something that would 01:43

let me do what I thought I was good at. 01:44

Write code, manipulate equations, think 01:47

hard thoughts about the nature of the 01:50

world and use it toward this very 01:51

applied purpose that at the end we want 01:54

to ena we want to make medicines or we 01:57

want to enable others to make medicines. 01:59

Then I really kind of became a biologist 02:01

and a machine learner. Actually a 02:04

machine learner because I left that job 02:06

and I went back to grad school in 02:07

biohysics and chemistry and uh I no 02:09

longer had access to this incredible 02:13

computer hardware that I had when I was 02:14

working at my previous job and in fact 02:17

they had custom asics for simulating how 02:19

proteins this part of your body that 02:22

I'll talk about move. And since I didn't 02:23

have that anymore but I still wanted to 02:25

work on the same problems. Well, I 02:28

didn't want to just do the same thing 02:29

with less compute. And so I started to 02:30

learn and I was getting very interested 02:33

in statistics, in machine learning. We 02:35

didn't call it AI back then. In fact, we 02:38

didn't even call it machine learning. 02:40

That was a bit disreputable. I said, I'm 02:42

working in statistical physics. But you 02:43

know, how are we going to develop 02:46

algorithms? How are we going to learn 02:48

from data and do that instead of very 02:50

large compute? And I guess it turns out 02:52

in terms of AI in addition to very large 02:53

compute to answer new problems. And 02:56

after this I joined uh Google DeepMind 02:59

and really joining a company that wanted 03:03

to say how are we going to take these 03:06

powerful technologies and all kind of 03:09

these ideas and we they were becoming 03:11

very very readily apparent how powerful 03:13

these technologies were with 03:15

applications 03:17

uh to especially games but also to 03:18

things like data centers and others. How 03:22

are we going to take these technologies 03:23

and use them to advance science and 03:25

really push forward scientific frontier? 03:27

And how can we do this in an industrial 03:30

setting with an incredibly fast pace 03:32

working with some really smart people 03:35

working with great computer resources 03:36

and with all that you darn well better 03:38

make some progress and it's been really 03:40

really fun and the fact that I'm on this 03:42

stage indicates that we made some 03:45

progress and I think it really the 03:46

guiding principle for me has that when 03:49

we do this work that ultimately we are 03:52

building tools that will enable 03:56

scientists to make discoveries. 03:58

And what I think is really heartening 04:00

about the work we've done and the part 04:02

that really I think still just resonates 04:04

with me at my core is there about I 04:07

think 35,000 citations of Alphafold. But 04:10

within that is there are tens of 04:13

thousands of examples of people using 04:16

our tools to do science that I couldn't 04:18

do on my own but are using it to make 04:21

discoveries. be it vaccines, be it drug 04:24

development, be it how the body works. 04:27

And I think that's really really 04:30

exciting. And the part I want to talk to 04:31

you about today and the story I want to 04:33

tell you is a bit about the problem, a 04:35

bit about how we did it. And I think 04:38

especially the role of research and 04:40

machine learning research and the fact 04:43

that it isn't just off-the-shelf machine 04:44

learning and then I want to tell you a 04:46

little bit about what happens when you 04:48

make something great and how people use 04:50

it and what it does for the world. So, 04:52

I'll start with the world's shortest 04:54

biology lesson. The cell is complex. 04:56

Um, for people who have only studied 05:00

biology in high school or in college, 05:04

you might have this idea that the cell 05:06

is a couple parts that have labels 05:08

attached to them. And it's kind of 05:10

simple, but really it looks much more 05:12

like what you see on the screen. It's 05:14

dense. It's complex. Uh, in terms of 05:16

crowding, it's like the swimming pool on 05:19

the 4th of July and it's in full of 05:20

enormous complexity. Humans have about 05:24

20,000 different types of proteins. 05:27

Those are some of the blobs you see on 05:30

the screen. They come together to do 05:31

practically every function in your cell. 05:33

You can see that uh kind of green tail 05:36

is the psyllium of uh an ecoli. That's 05:38

how it moves around. And you can see in 05:42

fact how it moves around. And you can 05:44

see that thing that looks like it turns 05:46

and in fact it turns and drives this 05:48

motor. All of this is made of proteins. 05:50

When people say that DNA is the 05:52

instruction manual for life, well, this 05:55

is what it's telling you how to do. It's 05:57

telling you how to build these tiny 05:59

machines. And biology has evolved an 06:01

incredible mechanism to build the 06:04

machines it needs, literal nano 06:07

machines, and build them out of atoms. 06:09

And so your DNA gives you instructions 06:11

that say build a protein. Now you might 06:13

say your DNA is a line and so are 06:16

proteins in a certain sense. It's 06:18

instructions on how to attach one bead 06:20

after another where each bead is a 06:22

specific kind of molecular arrangement 06:24

of atoms. And you should wonder if I my 06:26

DNA is aligned and I am very much not 06:30

one-dimensional, 06:32

what happens in between? And the answer 06:34

is after you make this protein and 06:35

assemble it one piece at a time, it will 06:38

fold up spontaneously 06:41

into a shape like you've opened your 06:43

IKEA bookshelf and instead of having to 06:46

do the hard work, it simply builds 06:48

itself and you get this quite complex 06:50

structure. You can see quite typical 06:52

protein, a kynise for those of you who 06:55

are biologists in the audience over 06:56

there. And you can see this very complex 06:58

arrangement of atoms and that 07:00

arrangement is functional and and the 07:02

majority not everyone of the proteins uh 07:06

in your body undergo this transformation 07:08

and that is what functions and that is 07:11

incredibly small. 07:13

So light itself is a few hundred 07:15

nanometers in size and that's a few 07:19

nanometers in size. So it's smaller than 07:22

you can see in a microscope. And for a 07:24

long time scientists have wanted to 07:26

understand this structure because they 07:28

use it to predict how changes in that 07:31

protein might affect disease. How does 07:34

that work? How does biology work? Often 07:37

if you make a drug it is to interrupt 07:39

the function of a certain protein like 07:40

this one. 07:42

Now scientists have through an 07:44

incredible amount of cleverness figured 07:47

out the structure of lots of proteins 07:49

and it remains to this day exceptionally 07:51

difficult. Right? You shouldn't imagine 07:54

this as I want to determine the 07:56

structure of a protein. So I shall open 07:59

the lab protocol for protein structure 08:01

determination. I shall follow the steps. 08:03

It consists of cleverness of ideas of 08:06

finding many ways. In this case, I'm 08:10

describing one type of protein structure 08:12

prediction in or protein structure, 08:14

sorry, determination, experimental 08:16

measurement, where you convince that big 08:17

ugly molecule I just showed you to form 08:19

a regular crystal kind of like table 08:22

salt. No one has an easy recipe for 08:24

this. So, they try many things. They 08:26

have ideas and it's exceptionally 08:28

difficult and filled with failure like 08:32

many things in science. 08:34

And you're really looking at 08:36

kind of one way to get an idea of how 08:40

difficult this is. Just one kind of 08:42

ordinary paper that we were using. I 08:43

flipped to the back and it said, you 08:45

know, in their protocol, after more than 08:48

a year, crystals began to form. Right? 08:49

So, not only did they do all these hard 08:52

experiments, but they had to wait about 08:54

a year to find out if it worked. And 08:56

probably that year wasn't spent waiting. 08:58

It was trying a thousand other things 08:59

that didn't work as well. 09:01

Once you do that, you can take this to a 09:03

uh synretron, a modest thing. You can 09:06

see the cars rigging the outside of this 09:09

instrument so that you can shine 09:11

incredibly bright X-rays on it and get 09:13

what is called a defraction pattern and 09:15

you can solve that and you can deposit 09:18

it in what's called the PDB or the 09:20

protein datab bank. And one of the 09:22

things that enabled the work we did is 09:24

that scientists 50 years ago had the 09:27

foresight to say these are important, 09:29

these are hard. We should collect them 09:32

all in one place. So there's a data set 09:35

that represents ex essentially all the 09:37

academic output of protein structures in 09:40

the community and available to everyone. 09:43

So our work was on very public data. 09:46

About 200,000 protein structures are 09:48

known. They pretty regularly increase at 09:51

about 12,000 a year. 09:53

But this is much much smaller than the 09:57

need. 09:59

Getting the kind of input information, 10:01

the DNA that tells you about a protein 10:03

is much much much much easier. So 10:06

billions of protein sequences are being 10:09

discovered. About 3,000 times faster are 10:12

we learning about protein sequence than 10:14

protein structure. 10:16

Okay, that's all scientific content, but 10:18

I should talk to you about the little 10:21

thing we did which has this kind of 10:24

schematic diagram. 10:26

We wanted to build an AI system. In 10:28

fact, we didn't even care if it was an 10:31

AI system. That's one of the nice things 10:32

about uh working in AI for science is 10:35

you don't care how you solve it. If it 10:37

ended up being a computer program, if it 10:39

ended up being anything else, we want to 10:41

find some way to get from the left where 10:43

each of those letters represents a 10:46

specific building block of the protein 10:47

considered an order. We want to put 10:49

something in the middle in the alpha 10:51

fold and we want to end up with 10:53

something on the right. And you'll see 10:55

uh two structures there if you look 10:57

closely where the blue is our prediction 10:59

and the green is the experimental 11:02

structure that took someone a year or 11:04

two of effort. If you want to put an 11:06

economic value on it on the order of 11:07

$100,000 11:10

and you can see we were able to do this 11:13

and I want to tell you how 11:16

and there were really three components 11:19

to doing this or to do any machine 11:21

learning problem and you can say you 11:23

have data and you have compute and you 11:25

have research 11:27

and I feel like we tell too many stories 11:29

about the first two and not enough about 11:32

the third. In data, we had 200,000 11:34

protein structures. Everyone has the 11:37

same data. 11:40

In terms of compute, this isn't LLM 11:41

scale. It's the final model itself was 11:44

128 TPU v3 cores, roughly equivalent to 11:48

a GPU per core for two weeks. This is 11:52

again within the scope of say academic 11:55

resources but it's worth saying really 11:58

most of your compute when you think 12:01

about how much compute you need don't 12:03

get distracted by the number for the 12:04

final model the real cost of compute is 12:06

the cost of ideas that didn't work all 12:08

the things you had to do to get there 12:12

and then finally research and I would 12:14

say this is all but about two people 12:15

that worked on this it's a small group 12:19

of people that end up doing this So 12:21

really when you look at these machine 12:24

learning breakthroughs they're probably 12:26

fewer people than you imagine and really 12:28

this is where our work was 12:31

differentiated. We came up with a new 12:33

set of ideas on how do we bring machine 12:35

learning to this problem and I can say 12:39

earlier systems largely based on 12:41

convolutional neural networks did okay. 12:44

They certainly made progress. If you 12:46

replace that with a transformer you're 12:48

honestly about the same. If you take the 12:50

ideas of a transformer and much 12:52

experimentation and many more ideas, 12:54

then that's when you start to get real 12:57

change. And in almost all the AI systems 12:59

you can see today, a tremendous amount 13:03

of research and ideas and what I would 13:05

call midscale ideas are involved. It 13:07

isn't just about the headlines where 13:10

people will say transformers, 13:12

you know, scaling, test time inference. 13:15

These are all important but they're one 13:18

of many ingredients in a really powerful 13:20

system and in fact we can measure how 13:22

much our research was worth. So someone 13:26

Alphafold 2 is the system that is quite 13:29

famous the one that uh was quite a large 13:31

improvement. Alpha fold one was the best 13:33

in the world but someone did uh the 13:35

Alcesi lab did a very uh careful 13:37

experiment where they took Alphold 2 the 13:40

architecture and they trained it on 1% 13:43

of the available data and they could 13:46

show that alpha fold 2 trained on 1% of 13:48

the data was as accurate or more 13:51

accurate as alphafold one which was the 13:54

state-of-the-art system previously. So 13:56

there's a very clean thing that says 13:58

that the third uh the third of these 14:00

ingredients research was worth a 14:03

hundfold of the first of these 14:06

ingredients data. And I think this is 14:08

generally really really important that 14:10

one of the big as you're all thinking as 14:13

you're all in startups or thinking about 14:16

startups think about the amount to which 14:18

ideas research discoveries amplify data 14:21

amplify compute they work together with 14:26

it we wouldn't want to use less data 14:28

than we have we wouldn't want to use 14:30

less compute than we have available but 14:31

ideas are a core component when you're 14:35

doing machine learning research and they 14:37

really helped to transform the world. 14:39

>> YC's Next Batch is now taking 14:41

applications. Got a startup in you? 14:44

Apply at y combinator.com/apply. 14:46

It's never too early. And filling out 14:49

the app will level up your idea. Okay, 14:51

back to the video. We can even go back 14:54

and we can do ablations and we can say 14:56

what parts matter. And don't focus too 14:58

much on the details. We pulled this from 15:00

our paper. You can see here this is the 15:01

difference compared to the baseline. And 15:04

you take either of those and you can see 15:06

that each of the ideas that you might 15:08

remove from our final system kind of 15:10

discreet identifiable ideas some of 15:12

which were incredibly popular research 15:15

areas within the field like this work 15:18

came out and a part of it was 15:20

equivariant and people said equivariance 15:22

that is the answer alphafold is an 15:25

equivariant system and it's great we 15:27

must do more research on equivarians to 15:29

get even more great systems well I was 15:31

very confused by this because the sixth 15:34

uh row there no IPA invariant point 15:37

attention that removes all the 15:40

equavariance in alpha fold and it hurts 15:42

a bit but only a bit. Alpha fold itself 15:45

on this GDT scale that you can see on 15:48

the left graph. Alphafold 2 was about 30 15:51

GDT better than alphafold one and 15:54

equivariance explains two or three of 15:57

this. It isn't about one idea. It's 15:59

about many midscale ideas that add up to 16:02

a transformative system. And it's very 16:05

very important when you're building 16:07

these systems to think about what we 16:08

would call in this context biological 16:11

relevance. We would have ideas that were 16:13

better. We kind of got our system 16:15

grinding 1% at a time. But what really 16:18

mattered was when we crossed the 16:21

accuracy that it mattered to an 16:23

experimental biologist who didn't care 16:25

about machine learning. And you have to 16:27

get there through a lot of work and a 16:29

lot of effort. And when you do, it is 16:31

incredibly transformative. And we can 16:33

measure against uh this axis where the 16:36

dark blue axis the other systems 16:38

available at the time. And this was 16:40

assessed. Protein structure prediction 16:42

is in some ways far ahead of uh LLMs or 16:45

the general machine learning space and 16:49

having blind assessment. Since 1994, 16:50

every two years, everyone interested in 16:53

predicting the structure of proteins 16:55

gets together and predicts the structure 16:57

of a hundred proteins whose answer isn't 16:58

known to anyone except the research 17:00

group that just solved it, right? 17:02

Unpublished. And so, you really do know 17:04

what works. And we had about a third of 17:06

the error of any other group on this 17:08

assessment. But it matters because once 17:10

you are working on problems in which you 17:13

don't know the answer, you get to really 17:15

measure how good things are. And you can 17:16

really find that a lot of systems don't 17:19

live up to what people believe over the 17:21

course of their research. And because 17:24

even if you have a benchmark, we all 17:26

overfit to our ideas to the benchmark, 17:28

right? Unless you have held out. And in 17:31

fact, the problems you have in the real 17:33

world are almost always harder than the 17:36

problems you train on, right? Because 17:38

you have to learn from much data and you 17:40

apply it to very important singular 17:41

problems. So it is very very important 17:44

that you measure well both as you're 17:46

developing and when people are trying to 17:48

decide whether they should use your 17:50

system. External benchmarks are 17:52

absolutely critical to figuring out what 17:54

works and that's what really helps drive 17:57

the world forward. So just some 18:00

wonderful examples of this is typical 18:02

performance for us. These are blind 18:04

predictions. You can see they're pretty 18:05

darn good. also important we made it 18:07

available and we thought it was and we 18:10

did a lot of assessment but we decided 18:12

that it was very important to make it 18:13

available in two ways. One is that we 18:15

open source the code and we actually 18:17

open sourced the code about a week 18:18

before we released a database of 18:19

predictions starting originally at 18:22

300,000 predictions and later going to 18:24

200 million essentially every protein um 18:26

from an organism whose genome has been 18:29

sequenced. And this made an enormous 18:31

difference. And one of the most 18:34

interesting kind of sociological things 18:35

is this huge difference between when we 18:36

released a piece of code that 18:39

specialists could use and we got some 18:40

information and then when we made it 18:43

available to the world in this database 18:44

form. It was really interesting kind of 18:48

you know you release something and every 18:51

day you check Twitter to find out or 18:52

check X to find out what's going on. And 18:54

what we would really see is even after 18:58

that CASP assessment, I would say that 19:01

the structure predictors were convinced 19:03

this obviously was this enormous advance 19:05

solved the problem. But general 19:08

biologists, the people we wanted to use, 19:10

the people who didn't care about 19:12

structure prediction, they cared about 19:13

proteins to do their experiments, they 19:14

weren't as sure. They said, "Well, maybe 19:16

CASP was easy. I don't know." And then 19:18

this database came out and people got 19:21

curious and they clicked in and the 19:23

amount to which the proof was social was 19:26

extraordinary that people would look and 19:28

say how did deep mind get access to my 19:31

unpublished structure. you know, this 19:34

moment at which they really believed it 19:36

that everyone had a a protein either had 19:38

a protein that they hadn't solved or had 19:41

a friend who had a protein that was 19:43

unpublished and they could compare and 19:45

that's what really made the difference. 19:47

And having this database, this 19:49

accessibility, this ease led everyone to 19:50

try it and figure out how it worked. 19:53

Word of mouth is really how this trust 19:56

is built. And you can kind of see some 19:59

of these testimonials, right? I wrestled 20:00

for three to four months trying to do 20:03

this uh scientific task. You know, this 20:06

morning I got an alpha fold prediction 20:09

and now it's much better. I want my time 20:11

back, right? You know, you really 20:14

appreciate alphafold when you run it on 20:17

a protein that for a year refused to get 20:19

expressed and purified. Meaning they for 20:22

a year they couldn't even get the 20:24

material to start experiments. These are 20:25

really important. When you build the 20:27

right tool, when you solve the right 20:29

problem, it matters and it changes the 20:30

lives of people who are doing things not 20:34

that you would do but building on top of 20:37

your work. And I think it's just 20:39

extraordinary to see these and the 20:41

number of people I talked to. The time 20:43

that I really knew this tool mattered. 20:45

In fact, there was a special issue of 20:47

science on the nuclear pore complex a 20:49

few months after the tool came out. And 20:51

the special issue was all about this 20:54

particular very large kind of several 20:56

hundred protein system. And three out of 20:59

the four uh papers in science about this 21:02

made extensive use of alpha fold. I 21:05

think I counted over a hundred mentions 21:07

of the word alphafold in science and we 21:08

had nothing to do with it. We didn't 21:11

know it was happening. We weren't 21:12

collaborating. It was just people doing 21:14

new science on top of the tools we had 21:16

built and that is the greatest feeling 21:18

in the world. And in fact, users do the 21:19

darnest things. They will use tools in 21:22

ways you didn't know were possible. The 21:25

tweet on the left from Yoshaka Morowaki 21:28

came out two days after our code was 21:31

available. We had predicted the 21:33

structure of individual proteins, but we 21:35

consider we were working on building a 21:37

system that would predict how proteins 21:39

came together. But uh this researcher 21:40

said, "Well, I have alphapold. Why don't 21:43

I just put two proteins together and 21:45

I'll put something in between?" You 21:47

could think of this as prompt 21:49

engineering but for proteins. And 21:50

suddenly they find out this is the best 21:52

protein interaction prediction in the 21:54

world, right? That when you train on 21:56

these a really really powerful system, 21:58

it will have additional in some sense 22:00

emergent skills as long as they're 22:03

aligned. People started to find all 22:05

sorts of problems that Alphafold would 22:07

work on that we hadn't anticipated. It 22:11

was so interesting to see the field of 22:13

science in real time reacting to the 22:16

existence of these tools, finding their 22:19

limitations, finding their possibilities 22:20

and this continues and people do all 22:24

sorts of exciting work be it in protein 22:26

design be it in others on top of either 22:28

the ideas and often the systems we have 22:31

built. One application that really uh I 22:34

thought was really important is that 22:39

people have started to learn how to use 22:41

it to engineer big proteins or to use it 22:43

in part of and I want to tell this story 22:46

for two reasons. One is I think it's a 22:48

really cool application but the second 22:50

is how it really changes the work of 22:52

science and often people will say 22:54

science is all about experiments and 22:57

validation. So it's great that you have 22:59

all these alpha fold predictions. Now 23:01

all we have to do is solve all the 23:03

proteins the classic way so that we can 23:05

tell whether your predictions are right 23:08

or wrong. And they're right about one 23:10

thing. Science is about experiments. 23:13

Science is about doing these 23:15

experiments. 23:17

But they're wrong about another thing. 23:19

Um science is about making hypotheses 23:21

and testing them not about the structure 23:24

of a particular protein. In this case, 23:27

the question was they took this protein 23:29

on the left called the contractile 23:32

inject injection system, but that's a 23:34

mouthful. They like to call it the 23:36

molecular syringe. And what it does is 23:37

it attaches to a cell and injects a 23:40

protein into it. And the scientists at 23:43

the Jang Lab at uh MIT were saying, 23:45

well, can we use this protein 23:49

to do targeted drug delivery? Can we use 23:53

it to get gene editors like cast 9 into 23:55

the cell? They tried over a hundred 23:58

methods to figure out how to take this 24:01

protein, which they didn't have a 24:03

structure of. This is just kind of a 24:04

rendition after the fact, and say, how 24:05

can we change what it recognizes? I 24:08

think it's originally involved in plant 24:10

defense or something like that, and they 24:12

didn't know how to do it. And they ran 24:14

an alpha fold prediction. You can see 24:15

the one on the left. I wouldn't even say 24:16

it's a great alpha fold prediction, but 24:18

almost immediately they looked at that 24:20

and said, "Wait a minute. those legs at 24:21

the bottom are how it must recognize and 24:23

attach to cells. Why don't we just 24:26

replace those with a designed protein? 24:28

And so almost immediately as soon as 24:31

they got the alpha fold prediction, they 24:32

re-engineered to add this design protein 24:34

that you see in red uh to target a new 24:36

type of cell. And they take this system 24:40

and then they show in fact that they can 24:45

choose cells within a mouse and they can 24:47

inject proteins in this case fluorescent 24:50

proteins. So there you'll see the color 24:52

and they can target the cells they want 24:54

within a mouse brain. And so they are 24:56

using this to develop a new type of 24:58

system 25:00

of targeted drug discovery. And we see 25:02

many more examples. We see some in which 25:05

scientists are using this tool to try 25:07

thousands and thousands of interactions 25:10

to figure out which ones are likely to 25:11

be the case. In fact, discovered a new 25:14

component of how eggs and sperm come 25:16

together in fertilization. Many many of 25:18

these discoveries that are built on top 25:21

of this. And I like to think that our 25:23

work made the whole field of what's 25:26

called structural biology, biology that 25:29

deals with structures, you know, five or 25:31

10% faster. But the amount to which that 25:33

matters for the world is enormous and we 25:37

will have more of these discoveries. And 25:39

I think ultimately structure prediction 25:43

and larger AI for science should be 25:45

thought of as an incredible capability 25:47

to be an amplifier for the work of 25:49

experimentalists that we start from 25:51

these scattered observations, these 25:53

natural data. This is our equivalent of 25:55

all the words on the internet. And then 25:58

we train a general model that 26:00

understands the rules underneath it and 26:02

can fill in the rest of the picture. And 26:04

I think that we will continue to see 26:06

this pattern and it will get more 26:08

general that we will find the right 26:10

foundational data sources in order to do 26:11

this. And I think the other thing that 26:15

has really been a property is that you 26:17

start where you have data but then you 26:20

find what problems it can be applied to. 26:22

And so we find enormous advance, 26:25

enormous capability to understand 26:28

interactions in the cell or others that 26:30

are downstream of extracting the 26:33

scientific content of these predictions 26:35

and then the rules they use can be 26:39

adapted to new purposes. And I think 26:41

this is really where we see the 26:42

foundational model aspect of alpha fold 26:45

or other narrow systems. And in fact, I 26:47

think we will start to see this on more 26:50

general systems, be them LLMs or others, 26:51

that we will find more and more 26:54

scientific knowledge within them and 26:55

we'll use them for important important 26:58

purposes. And I think this is really 27:00

where this is going. And I think the 27:03

most exciting question in AI for science 27:04

is how general will it be. Will we find 27:08

a couple of narrow places where we have 27:10

transformative impact or will we have 27:12

very very broad systems? And I expect it 27:15

will ultimately be the latter as we 27:17

figure it out. Thank you. 27:19

– English Lyrics

📲 "" is trending – don’t miss the chance to learn it in the app!

By

Viewed

36,958

Language

English

Learn this song

Lyrics & Translation

[English]

This is something of a nice change. I've

given a lot of scientific talks and no

one claps and cheers when I come on. Not

normally even when I come on.

It's really exciting. It's really

wonderful to be here. I guess I should

start off assuming that not everyone in

this cavernous hall knows who I am. Who

am I? I'm I'm someone who has done some

work in AI for science who really

believes that we can use the AI systems,

these technologies, these ideas to

change the world in a very specific way

to make science go faster to enable new

discoveries. I think it's really really

wonderful. We have the opportunity to

take these tools, these ideas

and aim them toward the question of how

can we build the right AI systems so

that sick people can become healthy and

go home from the hospital. And it's been

kind of a a really wonderful and winding

journey for me to end up here. I was

originally trained as a physicist. I

thought I was going to be a laws of the

universe physicist. If I was very very

lucky, I could do something that would

end up one sentence in a textbook.

And I did physics and I went to actually

do a PhD in physics. And then kind of

what I was working on didn't really grab

me. I just it didn't feel like what I

wanted to do. So I dropped out. I didn't

start a startup. That would have been

very on point for this event, but I uh

dropped out and I ended up working at a

company that was doing computational

biology. How do we get computers to say

something smart about biology? And I

loved it. I loved it not just because it

was fun, but it was something that would

let me do what I thought I was good at.

Write code, manipulate equations, think

hard thoughts about the nature of the

world and use it toward this very

applied purpose that at the end we want

to ena we want to make medicines or we

want to enable others to make medicines.

Then I really kind of became a biologist

and a machine learner. Actually a

machine learner because I left that job

and I went back to grad school in

biohysics and chemistry and uh I no

longer had access to this incredible

computer hardware that I had when I was

working at my previous job and in fact

they had custom asics for simulating how

proteins this part of your body that

I'll talk about move. And since I didn't

have that anymore but I still wanted to

work on the same problems. Well, I

didn't want to just do the same thing

with less compute. And so I started to

learn and I was getting very interested

in statistics, in machine learning. We

didn't call it AI back then. In fact, we

didn't even call it machine learning.

That was a bit disreputable. I said, I'm

working in statistical physics. But you

know, how are we going to develop

algorithms? How are we going to learn

from data and do that instead of very

large compute? And I guess it turns out

in terms of AI in addition to very large

compute to answer new problems. And

after this I joined uh Google DeepMind

and really joining a company that wanted

to say how are we going to take these

powerful technologies and all kind of

these ideas and we they were becoming

very very readily apparent how powerful

these technologies were with

applications

uh to especially games but also to

things like data centers and others. How

are we going to take these technologies

and use them to advance science and

really push forward scientific frontier?

And how can we do this in an industrial

setting with an incredibly fast pace

working with some really smart people

working with great computer resources

and with all that you darn well better

make some progress and it's been really

really fun and the fact that I'm on this

stage indicates that we made some

progress and I think it really the

guiding principle for me has that when

we do this work that ultimately we are

building tools that will enable

scientists to make discoveries.

And what I think is really heartening

about the work we've done and the part

that really I think still just resonates

with me at my core is there about I

think 35,000 citations of Alphafold. But

within that is there are tens of

thousands of examples of people using

our tools to do science that I couldn't

do on my own but are using it to make

discoveries. be it vaccines, be it drug

development, be it how the body works.

And I think that's really really

exciting. And the part I want to talk to

you about today and the story I want to

tell you is a bit about the problem, a

bit about how we did it. And I think

especially the role of research and

machine learning research and the fact

that it isn't just off-the-shelf machine

learning and then I want to tell you a

little bit about what happens when you

make something great and how people use

it and what it does for the world. So,

I'll start with the world's shortest

biology lesson. The cell is complex.

Um, for people who have only studied

biology in high school or in college,

you might have this idea that the cell

is a couple parts that have labels

attached to them. And it's kind of

simple, but really it looks much more

like what you see on the screen. It's

dense. It's complex. Uh, in terms of

crowding, it's like the swimming pool on

the 4th of July and it's in full of

enormous complexity. Humans have about

20,000 different types of proteins.

Those are some of the blobs you see on

the screen. They come together to do

practically every function in your cell.

You can see that uh kind of green tail

is the psyllium of uh an ecoli. That's

how it moves around. And you can see in

fact how it moves around. And you can

see that thing that looks like it turns

and in fact it turns and drives this

motor. All of this is made of proteins.

When people say that DNA is the

instruction manual for life, well, this

is what it's telling you how to do. It's

telling you how to build these tiny

machines. And biology has evolved an

incredible mechanism to build the

machines it needs, literal nano

machines, and build them out of atoms.

And so your DNA gives you instructions

that say build a protein. Now you might

say your DNA is a line and so are

proteins in a certain sense. It's

instructions on how to attach one bead

after another where each bead is a

specific kind of molecular arrangement

of atoms. And you should wonder if I my

DNA is aligned and I am very much not

one-dimensional,

what happens in between? And the answer

is after you make this protein and

assemble it one piece at a time, it will

fold up spontaneously

into a shape like you've opened your

IKEA bookshelf and instead of having to

do the hard work, it simply builds

itself and you get this quite complex

structure. You can see quite typical

protein, a kynise for those of you who

are biologists in the audience over

there. And you can see this very complex

arrangement of atoms and that

arrangement is functional and and the

majority not everyone of the proteins uh

in your body undergo this transformation

and that is what functions and that is

incredibly small.

So light itself is a few hundred

nanometers in size and that's a few

nanometers in size. So it's smaller than

you can see in a microscope. And for a

long time scientists have wanted to

understand this structure because they

use it to predict how changes in that

protein might affect disease. How does

that work? How does biology work? Often

if you make a drug it is to interrupt

the function of a certain protein like

this one.

Now scientists have through an

incredible amount of cleverness figured

out the structure of lots of proteins

and it remains to this day exceptionally

difficult. Right? You shouldn't imagine

this as I want to determine the

structure of a protein. So I shall open

the lab protocol for protein structure

determination. I shall follow the steps.

It consists of cleverness of ideas of

finding many ways. In this case, I'm

describing one type of protein structure

prediction in or protein structure,

sorry, determination, experimental

measurement, where you convince that big

ugly molecule I just showed you to form

a regular crystal kind of like table

salt. No one has an easy recipe for

this. So, they try many things. They

have ideas and it's exceptionally

difficult and filled with failure like

many things in science.

And you're really looking at

kind of one way to get an idea of how

difficult this is. Just one kind of

ordinary paper that we were using. I

flipped to the back and it said, you

know, in their protocol, after more than

a year, crystals began to form. Right?

So, not only did they do all these hard

experiments, but they had to wait about

a year to find out if it worked. And

probably that year wasn't spent waiting.

It was trying a thousand other things

that didn't work as well.

Once you do that, you can take this to a

uh synretron, a modest thing. You can

see the cars rigging the outside of this

instrument so that you can shine

incredibly bright X-rays on it and get

what is called a defraction pattern and

you can solve that and you can deposit

it in what's called the PDB or the

protein datab bank. And one of the

things that enabled the work we did is

that scientists 50 years ago had the

foresight to say these are important,

these are hard. We should collect them

all in one place. So there's a data set

that represents ex essentially all the

academic output of protein structures in

the community and available to everyone.

So our work was on very public data.

About 200,000 protein structures are

known. They pretty regularly increase at

about 12,000 a year.

But this is much much smaller than the

need.

Getting the kind of input information,

the DNA that tells you about a protein

is much much much much easier. So

billions of protein sequences are being

discovered. About 3,000 times faster are

we learning about protein sequence than

protein structure.

Okay, that's all scientific content, but

I should talk to you about the little

thing we did which has this kind of

schematic diagram.

We wanted to build an AI system. In

fact, we didn't even care if it was an

AI system. That's one of the nice things

about uh working in AI for science is

you don't care how you solve it. If it

ended up being a computer program, if it

ended up being anything else, we want to

find some way to get from the left where

each of those letters represents a

specific building block of the protein

considered an order. We want to put

something in the middle in the alpha

fold and we want to end up with

something on the right. And you'll see

uh two structures there if you look

closely where the blue is our prediction

and the green is the experimental

structure that took someone a year or

two of effort. If you want to put an

economic value on it on the order of

$100,000

and you can see we were able to do this

and I want to tell you how

and there were really three components

to doing this or to do any machine

learning problem and you can say you

have data and you have compute and you

have research

and I feel like we tell too many stories

about the first two and not enough about

the third. In data, we had 200,000

protein structures. Everyone has the

same data.

In terms of compute, this isn't LLM

scale. It's the final model itself was

128 TPU v3 cores, roughly equivalent to

a GPU per core for two weeks. This is

again within the scope of say academic

resources but it's worth saying really

most of your compute when you think

about how much compute you need don't

get distracted by the number for the

final model the real cost of compute is

the cost of ideas that didn't work all

the things you had to do to get there

and then finally research and I would

say this is all but about two people

that worked on this it's a small group

of people that end up doing this So

really when you look at these machine

learning breakthroughs they're probably

fewer people than you imagine and really

this is where our work was

differentiated. We came up with a new

set of ideas on how do we bring machine

learning to this problem and I can say

earlier systems largely based on

convolutional neural networks did okay.

They certainly made progress. If you

replace that with a transformer you're

honestly about the same. If you take the

ideas of a transformer and much

experimentation and many more ideas,

then that's when you start to get real

change. And in almost all the AI systems

you can see today, a tremendous amount

of research and ideas and what I would

call midscale ideas are involved. It

isn't just about the headlines where

people will say transformers,

you know, scaling, test time inference.

These are all important but they're one

of many ingredients in a really powerful

system and in fact we can measure how

much our research was worth. So someone

Alphafold 2 is the system that is quite

famous the one that uh was quite a large

improvement. Alpha fold one was the best

in the world but someone did uh the

Alcesi lab did a very uh careful

experiment where they took Alphold 2 the

architecture and they trained it on 1%

of the available data and they could

show that alpha fold 2 trained on 1% of

the data was as accurate or more

accurate as alphafold one which was the

state-of-the-art system previously. So

there's a very clean thing that says

that the third uh the third of these

ingredients research was worth a

hundfold of the first of these

ingredients data. And I think this is

generally really really important that

one of the big as you're all thinking as

you're all in startups or thinking about

startups think about the amount to which

ideas research discoveries amplify data

amplify compute they work together with

it we wouldn't want to use less data

than we have we wouldn't want to use

less compute than we have available but

ideas are a core component when you're

doing machine learning research and they

really helped to transform the world.

>> YC's Next Batch is now taking

applications. Got a startup in you?

Apply at y combinator.com/apply.

It's never too early. And filling out

the app will level up your idea. Okay,

back to the video. We can even go back

and we can do ablations and we can say

what parts matter. And don't focus too

much on the details. We pulled this from

our paper. You can see here this is the

difference compared to the baseline. And

you take either of those and you can see

that each of the ideas that you might

remove from our final system kind of

discreet identifiable ideas some of

which were incredibly popular research

areas within the field like this work

came out and a part of it was

equivariant and people said equivariance

that is the answer alphafold is an

equivariant system and it's great we

must do more research on equivarians to

get even more great systems well I was

very confused by this because the sixth

uh row there no IPA invariant point

attention that removes all the

equavariance in alpha fold and it hurts

a bit but only a bit. Alpha fold itself

on this GDT scale that you can see on

the left graph. Alphafold 2 was about 30

GDT better than alphafold one and

equivariance explains two or three of

this. It isn't about one idea. It's

about many midscale ideas that add up to

a transformative system. And it's very

very important when you're building

these systems to think about what we

would call in this context biological

relevance. We would have ideas that were

better. We kind of got our system

grinding 1% at a time. But what really

mattered was when we crossed the

accuracy that it mattered to an

experimental biologist who didn't care

about machine learning. And you have to

get there through a lot of work and a

lot of effort. And when you do, it is

incredibly transformative. And we can

measure against uh this axis where the

dark blue axis the other systems

available at the time. And this was

assessed. Protein structure prediction

is in some ways far ahead of uh LLMs or

the general machine learning space and

having blind assessment. Since 1994,

every two years, everyone interested in

predicting the structure of proteins

gets together and predicts the structure

of a hundred proteins whose answer isn't

known to anyone except the research

group that just solved it, right?

Unpublished. And so, you really do know

what works. And we had about a third of

the error of any other group on this

assessment. But it matters because once

you are working on problems in which you

don't know the answer, you get to really

measure how good things are. And you can

really find that a lot of systems don't

live up to what people believe over the

course of their research. And because

even if you have a benchmark, we all

overfit to our ideas to the benchmark,

right? Unless you have held out. And in

fact, the problems you have in the real

world are almost always harder than the

problems you train on, right? Because

you have to learn from much data and you

apply it to very important singular

problems. So it is very very important

that you measure well both as you're

developing and when people are trying to

decide whether they should use your

system. External benchmarks are

absolutely critical to figuring out what

works and that's what really helps drive

the world forward. So just some

wonderful examples of this is typical

performance for us. These are blind

predictions. You can see they're pretty

darn good. also important we made it

available and we thought it was and we

did a lot of assessment but we decided

that it was very important to make it

available in two ways. One is that we

open source the code and we actually

open sourced the code about a week

before we released a database of

predictions starting originally at

300,000 predictions and later going to

200 million essentially every protein um

from an organism whose genome has been

sequenced. And this made an enormous

difference. And one of the most

interesting kind of sociological things

is this huge difference between when we

released a piece of code that

specialists could use and we got some

information and then when we made it

available to the world in this database

form. It was really interesting kind of

you know you release something and every

day you check Twitter to find out or

check X to find out what's going on. And

what we would really see is even after

that CASP assessment, I would say that

the structure predictors were convinced

this obviously was this enormous advance

solved the problem. But general

biologists, the people we wanted to use,

the people who didn't care about

structure prediction, they cared about

proteins to do their experiments, they

weren't as sure. They said, "Well, maybe

CASP was easy. I don't know." And then

this database came out and people got

curious and they clicked in and the

amount to which the proof was social was

extraordinary that people would look and

say how did deep mind get access to my

unpublished structure. you know, this

moment at which they really believed it

that everyone had a a protein either had

a protein that they hadn't solved or had

a friend who had a protein that was

unpublished and they could compare and

that's what really made the difference.

And having this database, this

accessibility, this ease led everyone to

try it and figure out how it worked.

Word of mouth is really how this trust

is built. And you can kind of see some

of these testimonials, right? I wrestled

for three to four months trying to do

this uh scientific task. You know, this

morning I got an alpha fold prediction

and now it's much better. I want my time

back, right? You know, you really

appreciate alphafold when you run it on

a protein that for a year refused to get

expressed and purified. Meaning they for

a year they couldn't even get the

material to start experiments. These are

really important. When you build the

right tool, when you solve the right

problem, it matters and it changes the

lives of people who are doing things not

that you would do but building on top of

your work. And I think it's just

extraordinary to see these and the

number of people I talked to. The time

that I really knew this tool mattered.

In fact, there was a special issue of

science on the nuclear pore complex a

few months after the tool came out. And

the special issue was all about this

particular very large kind of several

hundred protein system. And three out of

the four uh papers in science about this

made extensive use of alpha fold. I

think I counted over a hundred mentions

of the word alphafold in science and we

had nothing to do with it. We didn't

know it was happening. We weren't

collaborating. It was just people doing

new science on top of the tools we had

built and that is the greatest feeling

in the world. And in fact, users do the

darnest things. They will use tools in

ways you didn't know were possible. The

tweet on the left from Yoshaka Morowaki

came out two days after our code was

available. We had predicted the

structure of individual proteins, but we

consider we were working on building a

system that would predict how proteins

came together. But uh this researcher

said, "Well, I have alphapold. Why don't

I just put two proteins together and

I'll put something in between?" You

could think of this as prompt

engineering but for proteins. And

suddenly they find out this is the best

protein interaction prediction in the

world, right? That when you train on

these a really really powerful system,

it will have additional in some sense

emergent skills as long as they're

aligned. People started to find all

sorts of problems that Alphafold would

work on that we hadn't anticipated. It

was so interesting to see the field of

science in real time reacting to the

existence of these tools, finding their

limitations, finding their possibilities

and this continues and people do all

sorts of exciting work be it in protein

design be it in others on top of either

the ideas and often the systems we have

built. One application that really uh I

thought was really important is that

people have started to learn how to use

it to engineer big proteins or to use it

in part of and I want to tell this story

for two reasons. One is I think it's a

really cool application but the second

is how it really changes the work of

science and often people will say

science is all about experiments and

validation. So it's great that you have

all these alpha fold predictions. Now

all we have to do is solve all the

proteins the classic way so that we can

tell whether your predictions are right

or wrong. And they're right about one

thing. Science is about experiments.

Science is about doing these

experiments.

But they're wrong about another thing.

Um science is about making hypotheses

and testing them not about the structure

of a particular protein. In this case,

the question was they took this protein

on the left called the contractile

inject injection system, but that's a

mouthful. They like to call it the

molecular syringe. And what it does is

it attaches to a cell and injects a

protein into it. And the scientists at

the Jang Lab at uh MIT were saying,

well, can we use this protein

to do targeted drug delivery? Can we use

it to get gene editors like cast 9 into

the cell? They tried over a hundred

methods to figure out how to take this

protein, which they didn't have a

structure of. This is just kind of a

rendition after the fact, and say, how

can we change what it recognizes? I

think it's originally involved in plant

defense or something like that, and they

didn't know how to do it. And they ran

an alpha fold prediction. You can see

the one on the left. I wouldn't even say

it's a great alpha fold prediction, but

almost immediately they looked at that

and said, "Wait a minute. those legs at

the bottom are how it must recognize and

attach to cells. Why don't we just

replace those with a designed protein?

And so almost immediately as soon as

they got the alpha fold prediction, they

re-engineered to add this design protein

that you see in red uh to target a new

type of cell. And they take this system

and then they show in fact that they can

choose cells within a mouse and they can

inject proteins in this case fluorescent

proteins. So there you'll see the color

and they can target the cells they want

within a mouse brain. And so they are

using this to develop a new type of

system

of targeted drug discovery. And we see

many more examples. We see some in which

scientists are using this tool to try

thousands and thousands of interactions

to figure out which ones are likely to

be the case. In fact, discovered a new

component of how eggs and sperm come

together in fertilization. Many many of

these discoveries that are built on top

of this. And I like to think that our

work made the whole field of what's

called structural biology, biology that

deals with structures, you know, five or

10% faster. But the amount to which that

matters for the world is enormous and we

will have more of these discoveries. And

I think ultimately structure prediction

and larger AI for science should be

thought of as an incredible capability

to be an amplifier for the work of

experimentalists that we start from

these scattered observations, these

natural data. This is our equivalent of

all the words on the internet. And then

we train a general model that

understands the rules underneath it and

can fill in the rest of the picture. And

I think that we will continue to see

this pattern and it will get more

general that we will find the right

foundational data sources in order to do

this. And I think the other thing that

has really been a property is that you

start where you have data but then you

find what problems it can be applied to.

And so we find enormous advance,

enormous capability to understand

interactions in the cell or others that

are downstream of extracting the

scientific content of these predictions

and then the rules they use can be

adapted to new purposes. And I think

this is really where we see the

foundational model aspect of alpha fold

or other narrow systems. And in fact, I

think we will start to see this on more

general systems, be them LLMs or others,

that we will find more and more

scientific knowledge within them and

we'll use them for important important

purposes. And I think this is really

where this is going. And I think the

most exciting question in AI for science

is how general will it be. Will we find

a couple of narrow places where we have

transformative impact or will we have

very very broad systems? And I expect it

will ultimately be the latter as we

figure it out. Thank you.

Key Vocabulary

Start Practicing

Vocabulary

Meanings

protein

/ˈprōtēn/

B2

noun
- a large molecule composed of amino acids, essential for living organisms

AI

/ˌeɪˈaɪ/

B2

noun
- artificial intelligence; the capability of a machine to imitate intelligent human behavior

algorithm

/ˈalɡəˌriT͡Həm/

C1

noun
- a step‑by‑step procedure for calculations, data processing, and automated reasoning

data

/ˈdātə/

B1

noun
- facts or information, especially numbers, collected to be examined and used for analysis

model

/ˈmäd(ə)l/

B2

noun
- a simplified representation of a system or phenomenon used to explain or predict its behavior

verb
- to create a representation or simulation of something

prediction

/prɪˈdikSH(ə)n/

B2

noun
- a statement about what will happen in the future based on data or analysis

structure

/ˈstrʌk(t)CHər/

B2

noun
- the arrangement of parts that form a whole; the organization of a system

verb
- to arrange or organize something in a particular way

machine

/məˈSH(ə)n/

B1

noun
- a device that uses energy to perform a task

learning

/ˈLɜrniNG/

B2

noun
- the process of acquiring knowledge or skills through study, experience, or teaching

verb
- to acquire knowledge or skill through study or experience (used in the continuous form 'learning')

research

/riːˈsɜːrtʃ/

B2

noun
- systematic investigation to establish facts or principles

verb
- to investigate systematically

technology

/tekˈnɒlədʒi/

B2

noun
- the application of scientific knowledge for practical purposes

tool

/tuːl/

B1

noun
- an instrument used to carry out a particular function

verb
- to equip with a tool or set of tools

discovery

/dɪˈskʌv(ə)ri/

B2

noun
- the act of finding something new, especially in science

cell

/sel/

B1

noun
- the basic structural and functional unit of living organisms

genome

/ˈjēnəʊm/

C1

noun
- the complete set of genetic material in an organism

sequence

/ˈsēkw(ə)ns/

B2

noun
- an ordered list of elements, such as nucleotides or amino acids

verb
- to arrange in a particular order

AlphaFold

/ˈælfə ˈfoʊld/

B2

noun
- an AI system developed by DeepMind that predicts protein structures from their amino‑acid sequences

transform

/trænsˈfôrm/

B2

verb
- to change in form, appearance, or character

accurate

/ˈakyərit/

B2

adjective
- correct in all details; free from error

complex

/ˈkämplĕks/

B2

adjective
- consisting of many interrelated parts; not simple

Are there any new words in “” you don’t know yet?

💡 Hint: protein, AI… Jump into the app and start learning now!

Key Grammar Structures

Coming Soon!

We're updating this section. Stay tuned!

Related Songs