Display Bilingual:

This is something of a nice change. I've 00:00
given a lot of scientific talks and no 00:02
one claps and cheers when I come on. Not 00:04
normally even when I come on. 00:06
It's really exciting. It's really 00:13
wonderful to be here. I guess I should 00:15
start off assuming that not everyone in 00:17
this cavernous hall knows who I am. Who 00:19
am I? I'm I'm someone who has done some 00:22
work in AI for science who really 00:25
believes that we can use the AI systems, 00:28
these technologies, these ideas to 00:31
change the world in a very specific way 00:34
to make science go faster to enable new 00:37
discoveries. I think it's really really 00:39
wonderful. We have the opportunity to 00:42
take these tools, these ideas 00:44
and aim them toward the question of how 00:47
can we build the right AI systems so 00:49
that sick people can become healthy and 00:52
go home from the hospital. And it's been 00:54
kind of a a really wonderful and winding 00:57
journey for me to end up here. I was 01:00
originally trained as a physicist. I 01:02
thought I was going to be a laws of the 01:04
universe physicist. If I was very very 01:06
lucky, I could do something that would 01:08
end up one sentence in a textbook. 01:10
And I did physics and I went to actually 01:13
do a PhD in physics. And then kind of 01:16
what I was working on didn't really grab 01:19
me. I just it didn't feel like what I 01:22
wanted to do. So I dropped out. I didn't 01:24
start a startup. That would have been 01:26
very on point for this event, but I uh 01:28
dropped out and I ended up working at a 01:31
company that was doing computational 01:33
biology. How do we get computers to say 01:35
something smart about biology? And I 01:38
loved it. I loved it not just because it 01:40
was fun, but it was something that would 01:43
let me do what I thought I was good at. 01:44
Write code, manipulate equations, think 01:47
hard thoughts about the nature of the 01:50
world and use it toward this very 01:51
applied purpose that at the end we want 01:54
to ena we want to make medicines or we 01:57
want to enable others to make medicines. 01:59
Then I really kind of became a biologist 02:01
and a machine learner. Actually a 02:04
machine learner because I left that job 02:06
and I went back to grad school in 02:07
biohysics and chemistry and uh I no 02:09
longer had access to this incredible 02:13
computer hardware that I had when I was 02:14
working at my previous job and in fact 02:17
they had custom asics for simulating how 02:19
proteins this part of your body that 02:22
I'll talk about move. And since I didn't 02:23
have that anymore but I still wanted to 02:25
work on the same problems. Well, I 02:28
didn't want to just do the same thing 02:29
with less compute. And so I started to 02:30
learn and I was getting very interested 02:33
in statistics, in machine learning. We 02:35
didn't call it AI back then. In fact, we 02:38
didn't even call it machine learning. 02:40
That was a bit disreputable. I said, I'm 02:42
working in statistical physics. But you 02:43
know, how are we going to develop 02:46
algorithms? How are we going to learn 02:48
from data and do that instead of very 02:50
large compute? And I guess it turns out 02:52
in terms of AI in addition to very large 02:53
compute to answer new problems. And 02:56
after this I joined uh Google DeepMind 02:59
and really joining a company that wanted 03:03
to say how are we going to take these 03:06
powerful technologies and all kind of 03:09
these ideas and we they were becoming 03:11
very very readily apparent how powerful 03:13
these technologies were with 03:15
applications 03:17
uh to especially games but also to 03:18
things like data centers and others. How 03:22
are we going to take these technologies 03:23
and use them to advance science and 03:25
really push forward scientific frontier? 03:27
And how can we do this in an industrial 03:30
setting with an incredibly fast pace 03:32
working with some really smart people 03:35
working with great computer resources 03:36
and with all that you darn well better 03:38
make some progress and it's been really 03:40
really fun and the fact that I'm on this 03:42
stage indicates that we made some 03:45
progress and I think it really the 03:46
guiding principle for me has that when 03:49
we do this work that ultimately we are 03:52
building tools that will enable 03:56
scientists to make discoveries. 03:58
And what I think is really heartening 04:00
about the work we've done and the part 04:02
that really I think still just resonates 04:04
with me at my core is there about I 04:07
think 35,000 citations of Alphafold. But 04:10
within that is there are tens of 04:13
thousands of examples of people using 04:16
our tools to do science that I couldn't 04:18
do on my own but are using it to make 04:21
discoveries. be it vaccines, be it drug 04:24
development, be it how the body works. 04:27
And I think that's really really 04:30
exciting. And the part I want to talk to 04:31
you about today and the story I want to 04:33
tell you is a bit about the problem, a 04:35
bit about how we did it. And I think 04:38
especially the role of research and 04:40
machine learning research and the fact 04:43
that it isn't just off-the-shelf machine 04:44
learning and then I want to tell you a 04:46
little bit about what happens when you 04:48
make something great and how people use 04:50
it and what it does for the world. So, 04:52
I'll start with the world's shortest 04:54
biology lesson. The cell is complex. 04:56
Um, for people who have only studied 05:00
biology in high school or in college, 05:04
you might have this idea that the cell 05:06
is a couple parts that have labels 05:08
attached to them. And it's kind of 05:10
simple, but really it looks much more 05:12
like what you see on the screen. It's 05:14
dense. It's complex. Uh, in terms of 05:16
crowding, it's like the swimming pool on 05:19
the 4th of July and it's in full of 05:20
enormous complexity. Humans have about 05:24
20,000 different types of proteins. 05:27
Those are some of the blobs you see on 05:30
the screen. They come together to do 05:31
practically every function in your cell. 05:33
You can see that uh kind of green tail 05:36
is the psyllium of uh an ecoli. That's 05:38
how it moves around. And you can see in 05:42
fact how it moves around. And you can 05:44
see that thing that looks like it turns 05:46
and in fact it turns and drives this 05:48
motor. All of this is made of proteins. 05:50
When people say that DNA is the 05:52
instruction manual for life, well, this 05:55
is what it's telling you how to do. It's 05:57
telling you how to build these tiny 05:59
machines. And biology has evolved an 06:01
incredible mechanism to build the 06:04
machines it needs, literal nano 06:07
machines, and build them out of atoms. 06:09
And so your DNA gives you instructions 06:11
that say build a protein. Now you might 06:13
say your DNA is a line and so are 06:16
proteins in a certain sense. It's 06:18
instructions on how to attach one bead 06:20
after another where each bead is a 06:22
specific kind of molecular arrangement 06:24
of atoms. And you should wonder if I my 06:26
DNA is aligned and I am very much not 06:30
one-dimensional, 06:32
what happens in between? And the answer 06:34
is after you make this protein and 06:35
assemble it one piece at a time, it will 06:38
fold up spontaneously 06:41
into a shape like you've opened your 06:43
IKEA bookshelf and instead of having to 06:46
do the hard work, it simply builds 06:48
itself and you get this quite complex 06:50
structure. You can see quite typical 06:52
protein, a kynise for those of you who 06:55
are biologists in the audience over 06:56
there. And you can see this very complex 06:58
arrangement of atoms and that 07:00
arrangement is functional and and the 07:02
majority not everyone of the proteins uh 07:06
in your body undergo this transformation 07:08
and that is what functions and that is 07:11
incredibly small. 07:13
So light itself is a few hundred 07:15
nanometers in size and that's a few 07:19
nanometers in size. So it's smaller than 07:22
you can see in a microscope. And for a 07:24
long time scientists have wanted to 07:26
understand this structure because they 07:28
use it to predict how changes in that 07:31
protein might affect disease. How does 07:34
that work? How does biology work? Often 07:37
if you make a drug it is to interrupt 07:39
the function of a certain protein like 07:40
this one. 07:42
Now scientists have through an 07:44
incredible amount of cleverness figured 07:47
out the structure of lots of proteins 07:49
and it remains to this day exceptionally 07:51
difficult. Right? You shouldn't imagine 07:54
this as I want to determine the 07:56
structure of a protein. So I shall open 07:59
the lab protocol for protein structure 08:01
determination. I shall follow the steps. 08:03
It consists of cleverness of ideas of 08:06
finding many ways. In this case, I'm 08:10
describing one type of protein structure 08:12
prediction in or protein structure, 08:14
sorry, determination, experimental 08:16
measurement, where you convince that big 08:17
ugly molecule I just showed you to form 08:19
a regular crystal kind of like table 08:22
salt. No one has an easy recipe for 08:24
this. So, they try many things. They 08:26
have ideas and it's exceptionally 08:28
difficult and filled with failure like 08:32
many things in science. 08:34
And you're really looking at 08:36
kind of one way to get an idea of how 08:40
difficult this is. Just one kind of 08:42
ordinary paper that we were using. I 08:43
flipped to the back and it said, you 08:45
know, in their protocol, after more than 08:48
a year, crystals began to form. Right? 08:49
So, not only did they do all these hard 08:52
experiments, but they had to wait about 08:54
a year to find out if it worked. And 08:56
probably that year wasn't spent waiting. 08:58
It was trying a thousand other things 08:59
that didn't work as well. 09:01
Once you do that, you can take this to a 09:03
uh synretron, a modest thing. You can 09:06
see the cars rigging the outside of this 09:09
instrument so that you can shine 09:11
incredibly bright X-rays on it and get 09:13
what is called a defraction pattern and 09:15
you can solve that and you can deposit 09:18
it in what's called the PDB or the 09:20
protein datab bank. And one of the 09:22
things that enabled the work we did is 09:24
that scientists 50 years ago had the 09:27
foresight to say these are important, 09:29
these are hard. We should collect them 09:32
all in one place. So there's a data set 09:35
that represents ex essentially all the 09:37
academic output of protein structures in 09:40
the community and available to everyone. 09:43
So our work was on very public data. 09:46
About 200,000 protein structures are 09:48
known. They pretty regularly increase at 09:51
about 12,000 a year. 09:53
But this is much much smaller than the 09:57
need. 09:59
Getting the kind of input information, 10:01
the DNA that tells you about a protein 10:03
is much much much much easier. So 10:06
billions of protein sequences are being 10:09
discovered. About 3,000 times faster are 10:12
we learning about protein sequence than 10:14
protein structure. 10:16
Okay, that's all scientific content, but 10:18
I should talk to you about the little 10:21
thing we did which has this kind of 10:24
schematic diagram. 10:26
We wanted to build an AI system. In 10:28
fact, we didn't even care if it was an 10:31
AI system. That's one of the nice things 10:32
about uh working in AI for science is 10:35
you don't care how you solve it. If it 10:37
ended up being a computer program, if it 10:39
ended up being anything else, we want to 10:41
find some way to get from the left where 10:43
each of those letters represents a 10:46
specific building block of the protein 10:47
considered an order. We want to put 10:49
something in the middle in the alpha 10:51
fold and we want to end up with 10:53
something on the right. And you'll see 10:55
uh two structures there if you look 10:57
closely where the blue is our prediction 10:59
and the green is the experimental 11:02
structure that took someone a year or 11:04
two of effort. If you want to put an 11:06
economic value on it on the order of 11:07
$100,000 11:10
and you can see we were able to do this 11:13
and I want to tell you how 11:16
and there were really three components 11:19
to doing this or to do any machine 11:21
learning problem and you can say you 11:23
have data and you have compute and you 11:25
have research 11:27
and I feel like we tell too many stories 11:29
about the first two and not enough about 11:32
the third. In data, we had 200,000 11:34
protein structures. Everyone has the 11:37
same data. 11:40
In terms of compute, this isn't LLM 11:41
scale. It's the final model itself was 11:44
128 TPU v3 cores, roughly equivalent to 11:48
a GPU per core for two weeks. This is 11:52
again within the scope of say academic 11:55
resources but it's worth saying really 11:58
most of your compute when you think 12:01
about how much compute you need don't 12:03
get distracted by the number for the 12:04
final model the real cost of compute is 12:06
the cost of ideas that didn't work all 12:08
the things you had to do to get there 12:12
and then finally research and I would 12:14
say this is all but about two people 12:15
that worked on this it's a small group 12:19
of people that end up doing this So 12:21
really when you look at these machine 12:24
learning breakthroughs they're probably 12:26
fewer people than you imagine and really 12:28
this is where our work was 12:31
differentiated. We came up with a new 12:33
set of ideas on how do we bring machine 12:35
learning to this problem and I can say 12:39
earlier systems largely based on 12:41
convolutional neural networks did okay. 12:44
They certainly made progress. If you 12:46
replace that with a transformer you're 12:48
honestly about the same. If you take the 12:50
ideas of a transformer and much 12:52
experimentation and many more ideas, 12:54
then that's when you start to get real 12:57
change. And in almost all the AI systems 12:59
you can see today, a tremendous amount 13:03
of research and ideas and what I would 13:05
call midscale ideas are involved. It 13:07
isn't just about the headlines where 13:10
people will say transformers, 13:12
you know, scaling, test time inference. 13:15
These are all important but they're one 13:18
of many ingredients in a really powerful 13:20
system and in fact we can measure how 13:22
much our research was worth. So someone 13:26
Alphafold 2 is the system that is quite 13:29
famous the one that uh was quite a large 13:31
improvement. Alpha fold one was the best 13:33
in the world but someone did uh the 13:35
Alcesi lab did a very uh careful 13:37
experiment where they took Alphold 2 the 13:40
architecture and they trained it on 1% 13:43
of the available data and they could 13:46
show that alpha fold 2 trained on 1% of 13:48
the data was as accurate or more 13:51
accurate as alphafold one which was the 13:54
state-of-the-art system previously. So 13:56
there's a very clean thing that says 13:58
that the third uh the third of these 14:00
ingredients research was worth a 14:03
hundfold of the first of these 14:06
ingredients data. And I think this is 14:08
generally really really important that 14:10
one of the big as you're all thinking as 14:13
you're all in startups or thinking about 14:16
startups think about the amount to which 14:18
ideas research discoveries amplify data 14:21
amplify compute they work together with 14:26
it we wouldn't want to use less data 14:28
than we have we wouldn't want to use 14:30
less compute than we have available but 14:31
ideas are a core component when you're 14:35
doing machine learning research and they 14:37
really helped to transform the world. 14:39
>> YC's Next Batch is now taking 14:41
applications. Got a startup in you? 14:44
Apply at y combinator.com/apply. 14:46
It's never too early. And filling out 14:49
the app will level up your idea. Okay, 14:51
back to the video. We can even go back 14:54
and we can do ablations and we can say 14:56
what parts matter. And don't focus too 14:58
much on the details. We pulled this from 15:00
our paper. You can see here this is the 15:01
difference compared to the baseline. And 15:04
you take either of those and you can see 15:06
that each of the ideas that you might 15:08
remove from our final system kind of 15:10
discreet identifiable ideas some of 15:12
which were incredibly popular research 15:15
areas within the field like this work 15:18
came out and a part of it was 15:20
equivariant and people said equivariance 15:22
that is the answer alphafold is an 15:25
equivariant system and it's great we 15:27
must do more research on equivarians to 15:29
get even more great systems well I was 15:31
very confused by this because the sixth 15:34
uh row there no IPA invariant point 15:37
attention that removes all the 15:40
equavariance in alpha fold and it hurts 15:42
a bit but only a bit. Alpha fold itself 15:45
on this GDT scale that you can see on 15:48
the left graph. Alphafold 2 was about 30 15:51
GDT better than alphafold one and 15:54
equivariance explains two or three of 15:57
this. It isn't about one idea. It's 15:59
about many midscale ideas that add up to 16:02
a transformative system. And it's very 16:05
very important when you're building 16:07
these systems to think about what we 16:08
would call in this context biological 16:11
relevance. We would have ideas that were 16:13
better. We kind of got our system 16:15
grinding 1% at a time. But what really 16:18
mattered was when we crossed the 16:21
accuracy that it mattered to an 16:23
experimental biologist who didn't care 16:25
about machine learning. And you have to 16:27
get there through a lot of work and a 16:29
lot of effort. And when you do, it is 16:31
incredibly transformative. And we can 16:33
measure against uh this axis where the 16:36
dark blue axis the other systems 16:38
available at the time. And this was 16:40
assessed. Protein structure prediction 16:42
is in some ways far ahead of uh LLMs or 16:45
the general machine learning space and 16:49
having blind assessment. Since 1994, 16:50
every two years, everyone interested in 16:53
predicting the structure of proteins 16:55
gets together and predicts the structure 16:57
of a hundred proteins whose answer isn't 16:58
known to anyone except the research 17:00
group that just solved it, right? 17:02
Unpublished. And so, you really do know 17:04
what works. And we had about a third of 17:06
the error of any other group on this 17:08
assessment. But it matters because once 17:10
you are working on problems in which you 17:13
don't know the answer, you get to really 17:15
measure how good things are. And you can 17:16
really find that a lot of systems don't 17:19
live up to what people believe over the 17:21
course of their research. And because 17:24
even if you have a benchmark, we all 17:26
overfit to our ideas to the benchmark, 17:28
right? Unless you have held out. And in 17:31
fact, the problems you have in the real 17:33
world are almost always harder than the 17:36
problems you train on, right? Because 17:38
you have to learn from much data and you 17:40
apply it to very important singular 17:41
problems. So it is very very important 17:44
that you measure well both as you're 17:46
developing and when people are trying to 17:48
decide whether they should use your 17:50
system. External benchmarks are 17:52
absolutely critical to figuring out what 17:54
works and that's what really helps drive 17:57
the world forward. So just some 18:00
wonderful examples of this is typical 18:02
performance for us. These are blind 18:04
predictions. You can see they're pretty 18:05
darn good. also important we made it 18:07
available and we thought it was and we 18:10
did a lot of assessment but we decided 18:12
that it was very important to make it 18:13
available in two ways. One is that we 18:15
open source the code and we actually 18:17
open sourced the code about a week 18:18
before we released a database of 18:19
predictions starting originally at 18:22
300,000 predictions and later going to 18:24
200 million essentially every protein um 18:26
from an organism whose genome has been 18:29
sequenced. And this made an enormous 18:31
difference. And one of the most 18:34
interesting kind of sociological things 18:35
is this huge difference between when we 18:36
released a piece of code that 18:39
specialists could use and we got some 18:40
information and then when we made it 18:43
available to the world in this database 18:44
form. It was really interesting kind of 18:48
you know you release something and every 18:51
day you check Twitter to find out or 18:52
check X to find out what's going on. And 18:54
what we would really see is even after 18:58
that CASP assessment, I would say that 19:01
the structure predictors were convinced 19:03
this obviously was this enormous advance 19:05
solved the problem. But general 19:08
biologists, the people we wanted to use, 19:10
the people who didn't care about 19:12
structure prediction, they cared about 19:13
proteins to do their experiments, they 19:14
weren't as sure. They said, "Well, maybe 19:16
CASP was easy. I don't know." And then 19:18
this database came out and people got 19:21
curious and they clicked in and the 19:23
amount to which the proof was social was 19:26
extraordinary that people would look and 19:28
say how did deep mind get access to my 19:31
unpublished structure. you know, this 19:34
moment at which they really believed it 19:36
that everyone had a a protein either had 19:38
a protein that they hadn't solved or had 19:41
a friend who had a protein that was 19:43
unpublished and they could compare and 19:45
that's what really made the difference. 19:47
And having this database, this 19:49
accessibility, this ease led everyone to 19:50
try it and figure out how it worked. 19:53
Word of mouth is really how this trust 19:56
is built. And you can kind of see some 19:59
of these testimonials, right? I wrestled 20:00
for three to four months trying to do 20:03
this uh scientific task. You know, this 20:06
morning I got an alpha fold prediction 20:09
and now it's much better. I want my time 20:11
back, right? You know, you really 20:14
appreciate alphafold when you run it on 20:17
a protein that for a year refused to get 20:19
expressed and purified. Meaning they for 20:22
a year they couldn't even get the 20:24
material to start experiments. These are 20:25
really important. When you build the 20:27
right tool, when you solve the right 20:29
problem, it matters and it changes the 20:30
lives of people who are doing things not 20:34
that you would do but building on top of 20:37
your work. And I think it's just 20:39
extraordinary to see these and the 20:41
number of people I talked to. The time 20:43
that I really knew this tool mattered. 20:45
In fact, there was a special issue of 20:47
science on the nuclear pore complex a 20:49
few months after the tool came out. And 20:51
the special issue was all about this 20:54
particular very large kind of several 20:56
hundred protein system. And three out of 20:59
the four uh papers in science about this 21:02
made extensive use of alpha fold. I 21:05
think I counted over a hundred mentions 21:07
of the word alphafold in science and we 21:08
had nothing to do with it. We didn't 21:11
know it was happening. We weren't 21:12
collaborating. It was just people doing 21:14
new science on top of the tools we had 21:16
built and that is the greatest feeling 21:18
in the world. And in fact, users do the 21:19
darnest things. They will use tools in 21:22
ways you didn't know were possible. The 21:25
tweet on the left from Yoshaka Morowaki 21:28
came out two days after our code was 21:31
available. We had predicted the 21:33
structure of individual proteins, but we 21:35
consider we were working on building a 21:37
system that would predict how proteins 21:39
came together. But uh this researcher 21:40
said, "Well, I have alphapold. Why don't 21:43
I just put two proteins together and 21:45
I'll put something in between?" You 21:47
could think of this as prompt 21:49
engineering but for proteins. And 21:50
suddenly they find out this is the best 21:52
protein interaction prediction in the 21:54
world, right? That when you train on 21:56
these a really really powerful system, 21:58
it will have additional in some sense 22:00
emergent skills as long as they're 22:03
aligned. People started to find all 22:05
sorts of problems that Alphafold would 22:07
work on that we hadn't anticipated. It 22:11
was so interesting to see the field of 22:13
science in real time reacting to the 22:16
existence of these tools, finding their 22:19
limitations, finding their possibilities 22:20
and this continues and people do all 22:24
sorts of exciting work be it in protein 22:26
design be it in others on top of either 22:28
the ideas and often the systems we have 22:31
built. One application that really uh I 22:34
thought was really important is that 22:39
people have started to learn how to use 22:41
it to engineer big proteins or to use it 22:43
in part of and I want to tell this story 22:46
for two reasons. One is I think it's a 22:48
really cool application but the second 22:50
is how it really changes the work of 22:52
science and often people will say 22:54
science is all about experiments and 22:57
validation. So it's great that you have 22:59
all these alpha fold predictions. Now 23:01
all we have to do is solve all the 23:03
proteins the classic way so that we can 23:05
tell whether your predictions are right 23:08
or wrong. And they're right about one 23:10
thing. Science is about experiments. 23:13
Science is about doing these 23:15
experiments. 23:17
But they're wrong about another thing. 23:19
Um science is about making hypotheses 23:21
and testing them not about the structure 23:24
of a particular protein. In this case, 23:27
the question was they took this protein 23:29
on the left called the contractile 23:32
inject injection system, but that's a 23:34
mouthful. They like to call it the 23:36
molecular syringe. And what it does is 23:37
it attaches to a cell and injects a 23:40
protein into it. And the scientists at 23:43
the Jang Lab at uh MIT were saying, 23:45
well, can we use this protein 23:49
to do targeted drug delivery? Can we use 23:53
it to get gene editors like cast 9 into 23:55
the cell? They tried over a hundred 23:58
methods to figure out how to take this 24:01
protein, which they didn't have a 24:03
structure of. This is just kind of a 24:04
rendition after the fact, and say, how 24:05
can we change what it recognizes? I 24:08
think it's originally involved in plant 24:10
defense or something like that, and they 24:12
didn't know how to do it. And they ran 24:14
an alpha fold prediction. You can see 24:15
the one on the left. I wouldn't even say 24:16
it's a great alpha fold prediction, but 24:18
almost immediately they looked at that 24:20
and said, "Wait a minute. those legs at 24:21
the bottom are how it must recognize and 24:23
attach to cells. Why don't we just 24:26
replace those with a designed protein? 24:28
And so almost immediately as soon as 24:31
they got the alpha fold prediction, they 24:32
re-engineered to add this design protein 24:34
that you see in red uh to target a new 24:36
type of cell. And they take this system 24:40
and then they show in fact that they can 24:45
choose cells within a mouse and they can 24:47
inject proteins in this case fluorescent 24:50
proteins. So there you'll see the color 24:52
and they can target the cells they want 24:54
within a mouse brain. And so they are 24:56
using this to develop a new type of 24:58
system 25:00
of targeted drug discovery. And we see 25:02
many more examples. We see some in which 25:05
scientists are using this tool to try 25:07
thousands and thousands of interactions 25:10
to figure out which ones are likely to 25:11
be the case. In fact, discovered a new 25:14
component of how eggs and sperm come 25:16
together in fertilization. Many many of 25:18
these discoveries that are built on top 25:21
of this. And I like to think that our 25:23
work made the whole field of what's 25:26
called structural biology, biology that 25:29
deals with structures, you know, five or 25:31
10% faster. But the amount to which that 25:33
matters for the world is enormous and we 25:37
will have more of these discoveries. And 25:39
I think ultimately structure prediction 25:43
and larger AI for science should be 25:45
thought of as an incredible capability 25:47
to be an amplifier for the work of 25:49
experimentalists that we start from 25:51
these scattered observations, these 25:53
natural data. This is our equivalent of 25:55
all the words on the internet. And then 25:58
we train a general model that 26:00
understands the rules underneath it and 26:02
can fill in the rest of the picture. And 26:04
I think that we will continue to see 26:06
this pattern and it will get more 26:08
general that we will find the right 26:10
foundational data sources in order to do 26:11
this. And I think the other thing that 26:15
has really been a property is that you 26:17
start where you have data but then you 26:20
find what problems it can be applied to. 26:22
And so we find enormous advance, 26:25
enormous capability to understand 26:28
interactions in the cell or others that 26:30
are downstream of extracting the 26:33
scientific content of these predictions 26:35
and then the rules they use can be 26:39
adapted to new purposes. And I think 26:41
this is really where we see the 26:42
foundational model aspect of alpha fold 26:45
or other narrow systems. And in fact, I 26:47
think we will start to see this on more 26:50
general systems, be them LLMs or others, 26:51
that we will find more and more 26:54
scientific knowledge within them and 26:55
we'll use them for important important 26:58
purposes. And I think this is really 27:00
where this is going. And I think the 27:03
most exciting question in AI for science 27:04
is how general will it be. Will we find 27:08
a couple of narrow places where we have 27:10
transformative impact or will we have 27:12
very very broad systems? And I expect it 27:15
will ultimately be the latter as we 27:17
figure it out. Thank you. 27:19

– English Lyrics

📲 "" is trending – don’t miss the chance to learn it in the app!
By
Viewed
36,958
Language
Learn this song

Lyrics & Translation

[English]
This is something of a nice change. I've
given a lot of scientific talks and no
one claps and cheers when I come on. Not
normally even when I come on.
It's really exciting. It's really
wonderful to be here. I guess I should
start off assuming that not everyone in
this cavernous hall knows who I am. Who
am I? I'm I'm someone who has done some
work in AI for science who really
believes that we can use the AI systems,
these technologies, these ideas to
change the world in a very specific way
to make science go faster to enable new
discoveries. I think it's really really
wonderful. We have the opportunity to
take these tools, these ideas
and aim them toward the question of how
can we build the right AI systems so
that sick people can become healthy and
go home from the hospital. And it's been
kind of a a really wonderful and winding
journey for me to end up here. I was
originally trained as a physicist. I
thought I was going to be a laws of the
universe physicist. If I was very very
lucky, I could do something that would
end up one sentence in a textbook.
And I did physics and I went to actually
do a PhD in physics. And then kind of
what I was working on didn't really grab
me. I just it didn't feel like what I
wanted to do. So I dropped out. I didn't
start a startup. That would have been
very on point for this event, but I uh
dropped out and I ended up working at a
company that was doing computational
biology. How do we get computers to say
something smart about biology? And I
loved it. I loved it not just because it
was fun, but it was something that would
let me do what I thought I was good at.
Write code, manipulate equations, think
hard thoughts about the nature of the
world and use it toward this very
applied purpose that at the end we want
to ena we want to make medicines or we
want to enable others to make medicines.
Then I really kind of became a biologist
and a machine learner. Actually a
machine learner because I left that job
and I went back to grad school in
biohysics and chemistry and uh I no
longer had access to this incredible
computer hardware that I had when I was
working at my previous job and in fact
they had custom asics for simulating how
proteins this part of your body that
I'll talk about move. And since I didn't
have that anymore but I still wanted to
work on the same problems. Well, I
didn't want to just do the same thing
with less compute. And so I started to
learn and I was getting very interested
in statistics, in machine learning. We
didn't call it AI back then. In fact, we
didn't even call it machine learning.
That was a bit disreputable. I said, I'm
working in statistical physics. But you
know, how are we going to develop
algorithms? How are we going to learn
from data and do that instead of very
large compute? And I guess it turns out
in terms of AI in addition to very large
compute to answer new problems. And
after this I joined uh Google DeepMind
and really joining a company that wanted
to say how are we going to take these
powerful technologies and all kind of
these ideas and we they were becoming
very very readily apparent how powerful
these technologies were with
applications
uh to especially games but also to
things like data centers and others. How
are we going to take these technologies
and use them to advance science and
really push forward scientific frontier?
And how can we do this in an industrial
setting with an incredibly fast pace
working with some really smart people
working with great computer resources
and with all that you darn well better
make some progress and it's been really
really fun and the fact that I'm on this
stage indicates that we made some
progress and I think it really the
guiding principle for me has that when
we do this work that ultimately we are
building tools that will enable
scientists to make discoveries.
And what I think is really heartening
about the work we've done and the part
that really I think still just resonates
with me at my core is there about I
think 35,000 citations of Alphafold. But
within that is there are tens of
thousands of examples of people using
our tools to do science that I couldn't
do on my own but are using it to make
discoveries. be it vaccines, be it drug
development, be it how the body works.
And I think that's really really
exciting. And the part I want to talk to
you about today and the story I want to
tell you is a bit about the problem, a
bit about how we did it. And I think
especially the role of research and
machine learning research and the fact
that it isn't just off-the-shelf machine
learning and then I want to tell you a
little bit about what happens when you
make something great and how people use
it and what it does for the world. So,
I'll start with the world's shortest
biology lesson. The cell is complex.
Um, for people who have only studied
biology in high school or in college,
you might have this idea that the cell
is a couple parts that have labels
attached to them. And it's kind of
simple, but really it looks much more
like what you see on the screen. It's
dense. It's complex. Uh, in terms of
crowding, it's like the swimming pool on
the 4th of July and it's in full of
enormous complexity. Humans have about
20,000 different types of proteins.
Those are some of the blobs you see on
the screen. They come together to do
practically every function in your cell.
You can see that uh kind of green tail
is the psyllium of uh an ecoli. That's
how it moves around. And you can see in
fact how it moves around. And you can
see that thing that looks like it turns
and in fact it turns and drives this
motor. All of this is made of proteins.
When people say that DNA is the
instruction manual for life, well, this
is what it's telling you how to do. It's
telling you how to build these tiny
machines. And biology has evolved an
incredible mechanism to build the
machines it needs, literal nano
machines, and build them out of atoms.
And so your DNA gives you instructions
that say build a protein. Now you might
say your DNA is a line and so are
proteins in a certain sense. It's
instructions on how to attach one bead
after another where each bead is a
specific kind of molecular arrangement
of atoms. And you should wonder if I my
DNA is aligned and I am very much not
one-dimensional,
what happens in between? And the answer
is after you make this protein and
assemble it one piece at a time, it will
fold up spontaneously
into a shape like you've opened your
IKEA bookshelf and instead of having to
do the hard work, it simply builds
itself and you get this quite complex
structure. You can see quite typical
protein, a kynise for those of you who
are biologists in the audience over
there. And you can see this very complex
arrangement of atoms and that
arrangement is functional and and the
majority not everyone of the proteins uh
in your body undergo this transformation
and that is what functions and that is
incredibly small.
So light itself is a few hundred
nanometers in size and that's a few
nanometers in size. So it's smaller than
you can see in a microscope. And for a
long time scientists have wanted to
understand this structure because they
use it to predict how changes in that
protein might affect disease. How does
that work? How does biology work? Often
if you make a drug it is to interrupt
the function of a certain protein like
this one.
Now scientists have through an
incredible amount of cleverness figured
out the structure of lots of proteins
and it remains to this day exceptionally
difficult. Right? You shouldn't imagine
this as I want to determine the
structure of a protein. So I shall open
the lab protocol for protein structure
determination. I shall follow the steps.
It consists of cleverness of ideas of
finding many ways. In this case, I'm
describing one type of protein structure
prediction in or protein structure,
sorry, determination, experimental
measurement, where you convince that big
ugly molecule I just showed you to form
a regular crystal kind of like table
salt. No one has an easy recipe for
this. So, they try many things. They
have ideas and it's exceptionally
difficult and filled with failure like
many things in science.
And you're really looking at
kind of one way to get an idea of how
difficult this is. Just one kind of
ordinary paper that we were using. I
flipped to the back and it said, you
know, in their protocol, after more than
a year, crystals began to form. Right?
So, not only did they do all these hard
experiments, but they had to wait about
a year to find out if it worked. And
probably that year wasn't spent waiting.
It was trying a thousand other things
that didn't work as well.
Once you do that, you can take this to a
uh synretron, a modest thing. You can
see the cars rigging the outside of this
instrument so that you can shine
incredibly bright X-rays on it and get
what is called a defraction pattern and
you can solve that and you can deposit
it in what's called the PDB or the
protein datab bank. And one of the
things that enabled the work we did is
that scientists 50 years ago had the
foresight to say these are important,
these are hard. We should collect them
all in one place. So there's a data set
that represents ex essentially all the
academic output of protein structures in
the community and available to everyone.
So our work was on very public data.
About 200,000 protein structures are
known. They pretty regularly increase at
about 12,000 a year.
But this is much much smaller than the
need.
Getting the kind of input information,
the DNA that tells you about a protein
is much much much much easier. So
billions of protein sequences are being
discovered. About 3,000 times faster are
we learning about protein sequence than
protein structure.
Okay, that's all scientific content, but
I should talk to you about the little
thing we did which has this kind of
schematic diagram.
We wanted to build an AI system. In
fact, we didn't even care if it was an
AI system. That's one of the nice things
about uh working in AI for science is
you don't care how you solve it. If it
ended up being a computer program, if it
ended up being anything else, we want to
find some way to get from the left where
each of those letters represents a
specific building block of the protein
considered an order. We want to put
something in the middle in the alpha
fold and we want to end up with
something on the right. And you'll see
uh two structures there if you look
closely where the blue is our prediction
and the green is the experimental
structure that took someone a year or
two of effort. If you want to put an
economic value on it on the order of
$100,000
and you can see we were able to do this
and I want to tell you how
and there were really three components
to doing this or to do any machine
learning problem and you can say you
have data and you have compute and you
have research
and I feel like we tell too many stories
about the first two and not enough about
the third. In data, we had 200,000
protein structures. Everyone has the
same data.
In terms of compute, this isn't LLM
scale. It's the final model itself was
128 TPU v3 cores, roughly equivalent to
a GPU per core for two weeks. This is
again within the scope of say academic
resources but it's worth saying really
most of your compute when you think
about how much compute you need don't
get distracted by the number for the
final model the real cost of compute is
the cost of ideas that didn't work all
the things you had to do to get there
and then finally research and I would
say this is all but about two people
that worked on this it's a small group
of people that end up doing this So
really when you look at these machine
learning breakthroughs they're probably
fewer people than you imagine and really
this is where our work was
differentiated. We came up with a new
set of ideas on how do we bring machine
learning to this problem and I can say
earlier systems largely based on
convolutional neural networks did okay.
They certainly made progress. If you
replace that with a transformer you're
honestly about the same. If you take the
ideas of a transformer and much
experimentation and many more ideas,
then that's when you start to get real
change. And in almost all the AI systems
you can see today, a tremendous amount
of research and ideas and what I would
call midscale ideas are involved. It
isn't just about the headlines where
people will say transformers,
you know, scaling, test time inference.
These are all important but they're one
of many ingredients in a really powerful
system and in fact we can measure how
much our research was worth. So someone
Alphafold 2 is the system that is quite
famous the one that uh was quite a large
improvement. Alpha fold one was the best
in the world but someone did uh the
Alcesi lab did a very uh careful
experiment where they took Alphold 2 the
architecture and they trained it on 1%
of the available data and they could
show that alpha fold 2 trained on 1% of
the data was as accurate or more
accurate as alphafold one which was the
state-of-the-art system previously. So
there's a very clean thing that says
that the third uh the third of these
ingredients research was worth a
hundfold of the first of these
ingredients data. And I think this is
generally really really important that
one of the big as you're all thinking as
you're all in startups or thinking about
startups think about the amount to which
ideas research discoveries amplify data
amplify compute they work together with
it we wouldn't want to use less data
than we have we wouldn't want to use
less compute than we have available but
ideas are a core component when you're
doing machine learning research and they
really helped to transform the world.
>> YC's Next Batch is now taking
applications. Got a startup in you?
Apply at y combinator.com/apply.
It's never too early. And filling out
the app will level up your idea. Okay,
back to the video. We can even go back
and we can do ablations and we can say
what parts matter. And don't focus too
much on the details. We pulled this from
our paper. You can see here this is the
difference compared to the baseline. And
you take either of those and you can see
that each of the ideas that you might
remove from our final system kind of
discreet identifiable ideas some of
which were incredibly popular research
areas within the field like this work
came out and a part of it was
equivariant and people said equivariance
that is the answer alphafold is an
equivariant system and it's great we
must do more research on equivarians to
get even more great systems well I was
very confused by this because the sixth
uh row there no IPA invariant point
attention that removes all the
equavariance in alpha fold and it hurts
a bit but only a bit. Alpha fold itself
on this GDT scale that you can see on
the left graph. Alphafold 2 was about 30
GDT better than alphafold one and
equivariance explains two or three of
this. It isn't about one idea. It's
about many midscale ideas that add up to
a transformative system. And it's very
very important when you're building
these systems to think about what we
would call in this context biological
relevance. We would have ideas that were
better. We kind of got our system
grinding 1% at a time. But what really
mattered was when we crossed the
accuracy that it mattered to an
experimental biologist who didn't care
about machine learning. And you have to
get there through a lot of work and a
lot of effort. And when you do, it is
incredibly transformative. And we can
measure against uh this axis where the
dark blue axis the other systems
available at the time. And this was
assessed. Protein structure prediction
is in some ways far ahead of uh LLMs or
the general machine learning space and
having blind assessment. Since 1994,
every two years, everyone interested in
predicting the structure of proteins
gets together and predicts the structure
of a hundred proteins whose answer isn't
known to anyone except the research
group that just solved it, right?
Unpublished. And so, you really do know
what works. And we had about a third of
the error of any other group on this
assessment. But it matters because once
you are working on problems in which you
don't know the answer, you get to really
measure how good things are. And you can
really find that a lot of systems don't
live up to what people believe over the
course of their research. And because
even if you have a benchmark, we all
overfit to our ideas to the benchmark,
right? Unless you have held out. And in
fact, the problems you have in the real
world are almost always harder than the
problems you train on, right? Because
you have to learn from much data and you
apply it to very important singular
problems. So it is very very important
that you measure well both as you're
developing and when people are trying to
decide whether they should use your
system. External benchmarks are
absolutely critical to figuring out what
works and that's what really helps drive
the world forward. So just some
wonderful examples of this is typical
performance for us. These are blind
predictions. You can see they're pretty
darn good. also important we made it
available and we thought it was and we
did a lot of assessment but we decided
that it was very important to make it
available in two ways. One is that we
open source the code and we actually
open sourced the code about a week
before we released a database of
predictions starting originally at
300,000 predictions and later going to
200 million essentially every protein um
from an organism whose genome has been
sequenced. And this made an enormous
difference. And one of the most
interesting kind of sociological things
is this huge difference between when we
released a piece of code that
specialists could use and we got some
information and then when we made it
available to the world in this database
form. It was really interesting kind of
you know you release something and every
day you check Twitter to find out or
check X to find out what's going on. And
what we would really see is even after
that CASP assessment, I would say that
the structure predictors were convinced
this obviously was this enormous advance
solved the problem. But general
biologists, the people we wanted to use,
the people who didn't care about
structure prediction, they cared about
proteins to do their experiments, they
weren't as sure. They said, "Well, maybe
CASP was easy. I don't know." And then
this database came out and people got
curious and they clicked in and the
amount to which the proof was social was
extraordinary that people would look and
say how did deep mind get access to my
unpublished structure. you know, this
moment at which they really believed it
that everyone had a a protein either had
a protein that they hadn't solved or had
a friend who had a protein that was
unpublished and they could compare and
that's what really made the difference.
And having this database, this
accessibility, this ease led everyone to
try it and figure out how it worked.
Word of mouth is really how this trust
is built. And you can kind of see some
of these testimonials, right? I wrestled
for three to four months trying to do
this uh scientific task. You know, this
morning I got an alpha fold prediction
and now it's much better. I want my time
back, right? You know, you really
appreciate alphafold when you run it on
a protein that for a year refused to get
expressed and purified. Meaning they for
a year they couldn't even get the
material to start experiments. These are
really important. When you build the
right tool, when you solve the right
problem, it matters and it changes the
lives of people who are doing things not
that you would do but building on top of
your work. And I think it's just
extraordinary to see these and the
number of people I talked to. The time
that I really knew this tool mattered.
In fact, there was a special issue of
science on the nuclear pore complex a
few months after the tool came out. And
the special issue was all about this
particular very large kind of several
hundred protein system. And three out of
the four uh papers in science about this
made extensive use of alpha fold. I
think I counted over a hundred mentions
of the word alphafold in science and we
had nothing to do with it. We didn't
know it was happening. We weren't
collaborating. It was just people doing
new science on top of the tools we had
built and that is the greatest feeling
in the world. And in fact, users do the
darnest things. They will use tools in
ways you didn't know were possible. The
tweet on the left from Yoshaka Morowaki
came out two days after our code was
available. We had predicted the
structure of individual proteins, but we
consider we were working on building a
system that would predict how proteins
came together. But uh this researcher
said, "Well, I have alphapold. Why don't
I just put two proteins together and
I'll put something in between?" You
could think of this as prompt
engineering but for proteins. And
suddenly they find out this is the best
protein interaction prediction in the
world, right? That when you train on
these a really really powerful system,
it will have additional in some sense
emergent skills as long as they're
aligned. People started to find all
sorts of problems that Alphafold would
work on that we hadn't anticipated. It
was so interesting to see the field of
science in real time reacting to the
existence of these tools, finding their
limitations, finding their possibilities
and this continues and people do all
sorts of exciting work be it in protein
design be it in others on top of either
the ideas and often the systems we have
built. One application that really uh I
thought was really important is that
people have started to learn how to use
it to engineer big proteins or to use it
in part of and I want to tell this story
for two reasons. One is I think it's a
really cool application but the second
is how it really changes the work of
science and often people will say
science is all about experiments and
validation. So it's great that you have
all these alpha fold predictions. Now
all we have to do is solve all the
proteins the classic way so that we can
tell whether your predictions are right
or wrong. And they're right about one
thing. Science is about experiments.
Science is about doing these
experiments.
But they're wrong about another thing.
Um science is about making hypotheses
and testing them not about the structure
of a particular protein. In this case,
the question was they took this protein
on the left called the contractile
inject injection system, but that's a
mouthful. They like to call it the
molecular syringe. And what it does is
it attaches to a cell and injects a
protein into it. And the scientists at
the Jang Lab at uh MIT were saying,
well, can we use this protein
to do targeted drug delivery? Can we use
it to get gene editors like cast 9 into
the cell? They tried over a hundred
methods to figure out how to take this
protein, which they didn't have a
structure of. This is just kind of a
rendition after the fact, and say, how
can we change what it recognizes? I
think it's originally involved in plant
defense or something like that, and they
didn't know how to do it. And they ran
an alpha fold prediction. You can see
the one on the left. I wouldn't even say
it's a great alpha fold prediction, but
almost immediately they looked at that
and said, "Wait a minute. those legs at
the bottom are how it must recognize and
attach to cells. Why don't we just
replace those with a designed protein?
And so almost immediately as soon as
they got the alpha fold prediction, they
re-engineered to add this design protein
that you see in red uh to target a new
type of cell. And they take this system
and then they show in fact that they can
choose cells within a mouse and they can
inject proteins in this case fluorescent
proteins. So there you'll see the color
and they can target the cells they want
within a mouse brain. And so they are
using this to develop a new type of
system
of targeted drug discovery. And we see
many more examples. We see some in which
scientists are using this tool to try
thousands and thousands of interactions
to figure out which ones are likely to
be the case. In fact, discovered a new
component of how eggs and sperm come
together in fertilization. Many many of
these discoveries that are built on top
of this. And I like to think that our
work made the whole field of what's
called structural biology, biology that
deals with structures, you know, five or
10% faster. But the amount to which that
matters for the world is enormous and we
will have more of these discoveries. And
I think ultimately structure prediction
and larger AI for science should be
thought of as an incredible capability
to be an amplifier for the work of
experimentalists that we start from
these scattered observations, these
natural data. This is our equivalent of
all the words on the internet. And then
we train a general model that
understands the rules underneath it and
can fill in the rest of the picture. And
I think that we will continue to see
this pattern and it will get more
general that we will find the right
foundational data sources in order to do
this. And I think the other thing that
has really been a property is that you
start where you have data but then you
find what problems it can be applied to.
And so we find enormous advance,
enormous capability to understand
interactions in the cell or others that
are downstream of extracting the
scientific content of these predictions
and then the rules they use can be
adapted to new purposes. And I think
this is really where we see the
foundational model aspect of alpha fold
or other narrow systems. And in fact, I
think we will start to see this on more
general systems, be them LLMs or others,
that we will find more and more
scientific knowledge within them and
we'll use them for important important
purposes. And I think this is really
where this is going. And I think the
most exciting question in AI for science
is how general will it be. Will we find
a couple of narrow places where we have
transformative impact or will we have
very very broad systems? And I expect it
will ultimately be the latter as we
figure it out. Thank you.

Key Vocabulary

Start Practicing
Vocabulary Meanings

protein

/ˈprōtēn/

B2
  • noun
  • - a large molecule composed of amino acids, essential for living organisms

AI

/ˌeɪˈaɪ/

B2
  • noun
  • - artificial intelligence; the capability of a machine to imitate intelligent human behavior

algorithm

/ˈalɡəˌriT͡Həm/

C1
  • noun
  • - a step‑by‑step procedure for calculations, data processing, and automated reasoning

data

/ˈdātə/

B1
  • noun
  • - facts or information, especially numbers, collected to be examined and used for analysis

model

/ˈmäd(ə)l/

B2
  • noun
  • - a simplified representation of a system or phenomenon used to explain or predict its behavior
  • verb
  • - to create a representation or simulation of something

prediction

/prɪˈdikSH(ə)n/

B2
  • noun
  • - a statement about what will happen in the future based on data or analysis

structure

/ˈstrʌk(t)CHər/

B2
  • noun
  • - the arrangement of parts that form a whole; the organization of a system
  • verb
  • - to arrange or organize something in a particular way

machine

/məˈSH(ə)n/

B1
  • noun
  • - a device that uses energy to perform a task

learning

/ˈLɜrniNG/

B2
  • noun
  • - the process of acquiring knowledge or skills through study, experience, or teaching
  • verb
  • - to acquire knowledge or skill through study or experience (used in the continuous form 'learning')

research

/riːˈsɜːrtʃ/

B2
  • noun
  • - systematic investigation to establish facts or principles
  • verb
  • - to investigate systematically

technology

/tekˈnɒlədʒi/

B2
  • noun
  • - the application of scientific knowledge for practical purposes

tool

/tuːl/

B1
  • noun
  • - an instrument used to carry out a particular function
  • verb
  • - to equip with a tool or set of tools

discovery

/dɪˈskʌv(ə)ri/

B2
  • noun
  • - the act of finding something new, especially in science

cell

/sel/

B1
  • noun
  • - the basic structural and functional unit of living organisms

genome

/ˈjēnəʊm/

C1
  • noun
  • - the complete set of genetic material in an organism

sequence

/ˈsēkw(ə)ns/

B2
  • noun
  • - an ordered list of elements, such as nucleotides or amino acids
  • verb
  • - to arrange in a particular order

AlphaFold

/ˈælfə ˈfoʊld/

B2
  • noun
  • - an AI system developed by DeepMind that predicts protein structures from their amino‑acid sequences

transform

/trænsˈfôrm/

B2
  • verb
  • - to change in form, appearance, or character

accurate

/ˈakyərit/

B2
  • adjective
  • - correct in all details; free from error

complex

/ˈkämplĕks/

B2
  • adjective
  • - consisting of many interrelated parts; not simple

Are there any new words in “” you don’t know yet?

💡 Hint: protein, AI… Jump into the app and start learning now!

Key Grammar Structures

Coming Soon!

We're updating this section. Stay tuned!

Related Songs