Display Bilingual:

I think the like dystopian view of AI or 00:00
AI as like taking all our jobs and all 00:03
of that is like not correct. And I think 00:06
the future of work is more human, is 00:08
more interactive, is more multimodal. 00:11
Once the making of things gets easier, 00:13
the bottleneck goes back to to to like 00:16
how many ideas you can have. I wouldn't 00:18
put learning to code as the top thing. I 00:21
would put learn to make things. Learn to 00:23
make things with code. Learn to make 00:25
things with video. learn to make like 00:27
anything with AI. 00:29
[Music] 00:32
>> Welcome back to the breakdown. I'm Dave. 00:37
This is Tom. And today we're joined by 00:39
Amjad Msad. Amjad is founder and CEO of 00:40
Replet, which was a YC 2018 company. Uh 00:43
welcome to the show. 00:46
>> Thank you. 00:47
>> So today, uh we want to talk about kind 00:48
of your beginnings in helping people 00:50
learn to code. what you're doing now in 00:52
replacing a lot of that work and then 00:55
where we see the future going in terms 00:57
of how people make things. So, Replet 00:58
started um I guess in 2016, you were in 01:01
YC in 2018. You started out as a tool to 01:03
make it really easy for people learning 01:07
to program to set up development 01:09
environments purely on the web. Um but 01:11
now you're really taking off in AI 01:14
assisted coding. Tell us tell us the 01:16
latest. Yeah. So, um the mission has 01:18
always been to make programming more 01:20
accessible. Uh then we sort of uh you 01:22
know after YC we sort of updated it to 01:25
be a little more ambitious. We started 01:27
talking about a billion software 01:29
developers and it just sounded absurd at 01:31
the time and 01:33
>> this is all prel right. 01:34
>> Yeah. Prelm. There was a moment of time 01:35
I don't know if you remember that but 01:37
like 20 in the like 2015 AI hype. There 01:39
was like a bit of a NLP natural language 01:42
hype as well. There's a bunch of AI 01:45
companies that ended up being just like 01:47
humans behind the scene. They all went 01:48
broke, but um there was like a a glimpse 01:50
of potentially doing NLP on code. So, we 01:53
had an idea that, you know, it's it's 01:57
probably coming. I I even had it in my 01:58
in my seed deck that like at some point 02:00
like we'll collect enough data to like 02:03
train models. It wasn't until like GPG2 02:04
in 2020 that I felt like it was was 02:08
going to be possible. So we you know we 02:11
had we had built all these um primitives 02:13
like the development environment the 02:16
sort of hosting environment all the 02:18
other stuff around it. Uh and I felt 02:19
like if if we just like added um like AI 02:22
agents like it will be able to 02:25
orchestrate all the stuff and it'll be 02:27
it'll be really great. agents were like 02:28
every time we tried them. Uh I think the 02:31
first time we tried it like 2021 didn't 02:33
work, 22 it didn't work and then 2024 02:35
like early early in the in the year we 02:39
just felt like it's getting close. 02:42
>> Even GPT40 could like be coherent for 02:44
like um 2 minutes somehow just ended up 02:47
doing a big bat. The company was not 02:51
doing super well. 02:54
>> Yeah. was I was about to ask was there 02:55
like a bet the company moment where it's 02:56
like we were teaching people to code now 02:57
we're no longer doing that now we're 02:59
going to enable everyone to build apps 03:00
>> yeah yeah yeah we had grown the company 03:02
quite a bit um and we were burning way 03:03
too much money uh and we decided to do a 03:07
layoff 03:11
>> and so we we um cut like perhaps 50 03:11
people and then like another like 15 20 03:14
people left and so we were like less 03:17
than half of the company and so I just 03:18
put everything on on replet agents I 03:21
just felt like this is the thing it has 03:22
to work. It felt like it was close. 03:25
Yeah. I don't know. I was just like burn 03:27
the boat boats kind of moments. It's 03:29
just like we got to do this. I mean, 03:30
it's I felt like this was the this was 03:32
the the thing that would make the 03:35
company work. And honestly, like if claw 03:37
3.5 hadn't come out mid US building 03:40
agent, we would have probably failed. 03:43
>> Yeah. 03:45
>> Because um like I said, like GPT40 would 03:45
like stay coherent for 2 3 minutes. Quad 03:48
3.5 was the first one that like could 03:50
work for like you know five to 10 03:52
minutes and actually actually work and 03:54
the code generations 03:56
>> and that's sort of the approach that I 03:57
think is most successful for startups 03:58
kind of building on the very edge of 04:00
what is possible you know you start out 04:01
on a mission it's the techn is not quite 04:02
there yet you start building and you 04:04
sort of you you skate where the puck is 04:06
going you know it sort of catches up 04:07
with you 04:09
>> yeah and you you want to keep keep doing 04:09
that 04:11
>> totally 04:11
>> yeah I remember you came to speak at YC 04:12
like 6 months ago and right after you 04:14
had launched this and at the time you 04:16
said we're very far from fully automated 04:18
software development. 04:20
>> Do you still feel that way? 04:22
>> No, not at all. I mean, I I feel like 04:23
I've, you know, every time I make a sort 04:25
of some kind of prediction, it feels 04:27
like bold and like, you know, but I've 04:29
been I've been consistently wrong and 04:31
like uh the way things are moving uh is 04:33
a lot faster. Just the level of 04:36
autonomy. Uh like I said, like maybe 3.5 04:37
was like 5 to 10 minutes. Like 3.7 04:40
perhaps can get up to like four five 04:43
minutes to an hour. The uh 4.0 No, in 04:45
the in the system card and opus, they 04:48
said they made it work for seven hours. 04:51
>> Wow. 04:53
>> 7 hours. 04:54
>> That's incredible. 04:55
>> That's always been the limiting factor 04:56
for making agents is like, can they stay 04:57
coherent? Can is the context length 04:59
actually useful to reason over. If an 05:01
LLM can work for for seven hours, that's 05:04
basically like a human worker. 05:06
>> Yeah. And presumably they're working at 05:09
much faster speed than a human would do. 05:10
So they're kind of completing perhaps a 05:12
week's work in those seven hours. The 05:13
only thing that I think is is missing 05:15
and a big limiting factor for actually 05:17
automating a lot of work is computer 05:19
use. 05:21
>> Computer use kind of sucks. 05:22
>> You know, I'm sure you've you've tried 05:24
it. And this is the difference between 05:25
Replet agent really uh giving it one 05:27
prompt going all all the way in an app 05:30
versus having to kind of babysit it a 05:32
little bit and like test with it. 05:35
>> And we had a company in the last batch 05:36
called browser use that's um that's 05:37
working browser automation. 05:39
>> Browser Use is great. And another one 05:40
called pig which is doing the same for 05:41
kind of windows desktops. 05:42
>> And uh I think advice I would give 05:44
founders today is um taking either 05:45
browser use or windows automation with 05:48
pig and trying to apply that into 05:50
enterprise into vertical 05:52
>> into a vertical industry. 05:54
>> The moment this technology worked those 05:56
two companies are just going to 05:57
>> totally and I think I think we're like 05:59
weeks or perhaps single digit months 06:00
away from it working really really well 06:02
and so now is the time to get started 06:03
using those technologies. 06:05
>> Absolutely 100%. And this is this is 06:06
what we're focused on. So you know 06:08
replet agent v1 to v2 was a huge jump in 06:09
autonomy. Uh v3 is the most autonomous 06:12
thing. So we're already working on it 06:15
and um the interesting thing for us is 06:17
the underlying technology and how it 06:21
would enable autonomy. There are like a 06:23
few important things. One is uh uh you 06:25
know being transactional. The ability to 06:29
roll back is is very important. So you 06:31
you would want you'd want it to be safe 06:34
for agents in the same way that gits 06:37
made it safe for human programmers to 06:38
kind of experiment and create branches 06:41
whatever you want the same thing for for 06:43
agents like if an agent kind of messes 06:45
up a DB migration it should be able to 06:47
roll back and also it should be able to 06:49
like sample 06:51
>> uh across different different paths. Uh 06:53
I think this is very very important for 06:55
autonomy. If you look at uh like for 06:57
example when when uh Anthropic publishes 06:59
their Swedbench score they publish one 07:02
with that sampling and one with sampling 07:04
and it goes from 70% to 80%. 07:07
>> And so this is the idea that you sort of 07:09
you spawn multiple agents each one will 07:11
take a a shot at it and then you figure 07:13
out which one works and and choose that 07:15
that branch effectively. We built a 07:16
infrastructure that is um uh fully 07:19
transactional and moves in lock step 07:22
meaning the file system is a snap 07:25
snapshot fail based file system the DB 07:27
is a snapshot based uh u DB and so a as 07:29
you're we're making commits uh to the 07:34
entire system including the virtual 07:36
machine as you're going uh and so what 07:38
we can do is also we can we can fork it 07:40
and branch it out. So if if we had 07:42
computer use that's working well you can 07:44
you can just like right now what they do 07:46
with the sampling is they do they have 07:49
some kind of judge that does ranking 07:50
>> which you know it's not a true verifier 07:52
a true verifier is a is a is a test 07:55
right and a computer use test so you you 07:57
can sample out and really pick the best 07:59
branch that's actually working 08:01
>> that's incredible 08:02
>> and and repeat that on and on and 08:03
reliability gets really really high 08:05
>> so fast forward six or 12 months you're 08:08
not spawning one agent you're spawning 08:10
is Is it five or 10 or a million? Like 08:12
how 08:14
>> I think this is where it's going to get 08:15
interesting, which is you'd want to give 08:17
the user the ability to set compute 08:20
budgets. I I think we're starting to see 08:22
that with um with the O style models of 08:23
like here's here's how much budget. But 08:26
but look, I mean, if if you give us 08:28
$1,000, we'll spend them if you know. 08:29
So, uh 08:32
>> that's really cool. I've not thought of 08:33
that before. Um but the idea of like 08:34
spawning multiple branches then just 08:36
picking the best one and being able to 08:37
do that in parallel. I just the human 08:38
brain doesn't naturally work like that. 08:40
We think sequentially. We can't. 08:42
>> It reminds me of this legend. I don't 08:44
know if this is actually true, but at 08:45
Apple when Steve was running it, he 08:46
would intentionally have teams doing 08:48
basically the same things and then see 08:49
which one did the best job. It's like 08:51
>> I've heard OpenAI does that now. 08:53
Interesting. Uh I I've heard that like 08:54
the Codeex project like had multiple 08:56
different different teams. So in the 08:58
literature, you'll see that uh small 09:00
small models sampled uh will beat larger 09:03
models. So Sonnet sampled is probably 09:07
better than than Opus companies. uh 09:09
haven't tried that where instead of 09:11
hiring one senior engineer you hire like 09:13
10 junior engineers and like give them 09:16
the same task and then 09:18
>> it's just very expensive with humans but 09:20
with LM it's relatively cheap you can 09:21
just do that do a hundred of them and 09:23
just pick the best one every time 09:25
>> so how are people using replet agent 09:26
like and and who are the people using it 09:28
>> you know it goes back to sort of our 09:30
early vision that if you make 09:32
programming easy uh then uh then more 09:33
and more people would want to would want 09:36
to do it actually this is one of the 09:38
first thing uh that that PG PG and I 09:39
sort of connected on um before we got 09:41
into YC actually PG um found us on 09:44
HackerNews and so we started this email 09:46
relationship um and so he he told me 09:48
there's like a super linear relationship 09:52
with how easy programming is versus how 09:54
many people would want to do it. The 09:56
optimizing function of rapid has always 09:58
been like just keep lowering the the 10:00
barrier to entry and and that's how you 10:02
grow users and you grow uh customers. 10:04
right now. Um, basically people from 10:07
kind of every walk of life, we've seen 10:11
users, product managers tend to be tend 10:13
to be a very a great um use case and 10:15
users for us and we've had customers uh 10:19
product managers uh who uh are able to 10:21
make significant impact on the business 10:25
without talking to engineers at all like 10:28
you know running AB tests or 10:30
optimizations or or things like that. I 10:31
mean it's empowering. I mean it gets us 10:33
to think about the really the seams 10:35
between these roles like you know 10:39
product manager, designer, engineer. We 10:40
actually just created a new new product 10:43
group and and and typically you know 10:45
product uh you know head of product is 10:47
have has a bunch of like product 10:49
managers reporting to them but we're 10:51
actually having uh this this group have 10:52
has designers, engineers and and PMs and 10:55
the the idea is they're all using AI all 10:57
the time to to prototype in some cases 10:59
go all the way to production. there 11:02
isn't this sort of waterfall model where 11:04
you know there's a lot of inefficiency 11:07
you know like communication problems 11:09
between these different teams they can 11:10
move incredibly fast and I think that 11:12
that starts to change how tech companies 11:13
work my experience of this was when 11:15
whenever I was working in a startup the 11:17
list of ideas and the backlog would 11:20
always would be infinitely long 11:21
basically and the bottleneck was always 11:23
engineering time and now I'm doing my 11:24
own personal projects I write my to-do 11:26
list or my idea list and then I get to 11:28
work and my the ideas just get done and 11:30
suddenly the bottleneck is like my 11:32
ability to have ideas 11:33
>> and it's just such a weird experience 11:35
looking at a to-do list. It's just empty 11:37
being like what do I do next? I've heard 11:38
from a team that has like a really large 11:40
uh replet deployment in their company 11:42
that their founders using Replet and 11:44
that's stressing the engineers out 11:47
because oh I did this in a weekend like 11:49
can't you do it in a 11:50
>> what do you guys show for yourself and 11:52
so are these previously technical people 11:53
or non techchnical people they're 11:55
building the first version are they 11:57
deploying that production or are they 11:59
giving to engineers and say hey build 12:00
this like what what do you see typically 12:02
>> we advise to like work with with 12:04
engineering but that doesn't always 12:06
happen I think It's um understandable 12:07
that a lot of PMs and designers want to 12:10
go straight to users. What we're seeing 12:12
is they go to like beta users and test 12:14
users and I think this works really well 12:16
but in some cases they just like put it 12:18
in production. Uh and so right now we're 12:20
having discussions with all these 12:22
companies especially the engineering 12:24
leaders are unhappy about this like 12:25
who's on call for these services you 12:28
>> who's responsible for the bug 12:30
>> who's responsible for it. There's a lot 12:32
of questions that are coming up. The 12:34
obvious answer to all these questions is 12:36
like agents are responsible. 12:37
>> So what's the limiting factor today? I 12:39
buy code a thing and I'm like yoloing it 12:41
into production. Like what are what do 12:43
the engineers typically object like? 12:45
What what's going wrong? 12:47
>> Security is is is a big one. Like um 12:48
LLMs are fallible like like humans are. 12:51
They tend to write some uh there's some 12:54
components that they do terribly at like 12:57
for example off 13:00
>> uh like they all kind of suck at it. all 13:01
use like you know old methods of of 13:04
assaulting and hashing and so that's 13:06
been that's been a big sticking point 13:09
and we've seen a lot of examples out 13:11
there right now of some catastrophes 13:14
luckily it hasn't been like there hasn't 13:16
been a major catastrophe I think it's 13:19
coming but uh there's been uh like solo 13:21
founders who would leak um you know API 13:24
keys or would make it really easy to get 13:26
around the login security protections 13:28
and there are a lot of tools out there 13:31
that are like really not trying to take 13:32
responsibility for that and saying, "Oh, 13:34
it's it's the user's problem, the user's 13:36
fault." For us, we think that as a 13:38
platform that is marketing uh for the 13:41
non-developer, you actually have a 13:44
responsibility. We're trying to take 13:46
away 13:48
uh some of the things that we think LMS 13:50
should not do today. Uh so O for 13:53
example, we have a built-in O. So if you 13:55
go to wrap and just say add o, it will 13:58
pull in an o component that we built 14:00
from scratch. It has capture on it. It 14:02
has all the security you know bells and 14:05
whistles. So you don't have to worry 14:07
about all of that. 14:09
>> We integrated with your database. You 14:10
have a user management portal on on your 14:12
site as much as possible. Building in 14:14
these components that are tricky. I 14:18
think payments is the other one. I don't 14:20
think you'd want the LLMs to kind of do 14:22
do the payments. 14:24
>> And they're not that different, right? 14:24
like the sort of checkout. You know, you 14:25
might have a one time checkout, you 14:27
might have subscription, pay as you go, 14:28
but there are not unlimited versions of 14:29
payments. There's like two or three or 14:31
four. 14:33
>> And this is the analogy is today humans 14:33
don't write their own version of 14:36
payments or their own version of O. They 14:37
use providers that have built these 14:39
components already. So, seems like we're 14:41
seeing the same thing play out and 14:42
that's the best way to do it. 14:43
>> Yeah, 100%. And then the other thing we 14:44
added uh we partnered with is a a great 14:46
security company called Samrip. And so 14:48
right now uh when you go to deploy a 14:50
replet app, we run a security scan. We 14:53
run a code security scan and we give you 14:56
like a report of warnings and errors and 14:58
things like that and the agents can try 15:01
to fix them for you. 15:02
>> Okay. And so security is one of the big 15:04
kind of bottlenecks to fully deploying 15:06
to production. I can imagine there must 15:08
be other things that coming down the 15:10
line like sort of scalability, you know, 15:11
looking for like N plus1 database 15:13
queries, uh performance bottlenecks. 15:14
What do you have in your mind like a 15:17
clear set of of blockers that need to be 15:18
overcome before this is truly like one 15:20
click to deploy? 15:22
>> Yeah, I I mean the big the biggest one 15:23
is just going to be humans like social 15:25
like you know there's just going to be 15:28
mistrust and I think it's just going to 15:30
have has has to play out. So you know 15:32
leaving that aside like enterprises has 15:34
to adapt to all of that stuff having 15:36
some way to also scan for scalability 15:38
like figuring out you know uh like doing 15:43
fuzzing or whatever it is or having some 15:45
kind of adversarial agent that's you 15:48
know trying to to to break your app I 15:50
think um you know that's one big thing I 15:53
think integrating with a company's 15:55
ecosystem so one thing we're adding is 15:57
the ability to bring in your design 15:58
system 16:00
>> that's cool 16:01
>> so in addition to kind of us providing 16:01
these components if you go to any 16:03
company they have a lot of these 16:05
components built out as raplet is 16:06
getting deployed into these large 16:08
companies how can we hook into their 16:10
their internal systems 16:12
>> this makes me think about the kind of 16:13
spectrum of these different coding 16:15
tools. Um, on one end of the spectrum, 16:17
you have kind of what I would call the 16:19
the power tools, the cursors, the wind 16:20
surf that let developers use this as a 16:22
way to amplify their efforts. And on the 16:25
full other end of the spectrum, you have 16:27
more of the like consumerf facing, hey, 16:29
if you want to make an app, like you can 16:31
now make an app. And it sounds like you 16:32
guys are kind of in the middle. Um, 16:34
you're helping companies get stuff done, 16:36
but you're doing it for people who don't 16:39
look like the tra traditional developer. 16:41
How do you see this world playing out? 16:43
Are there going to be like n different 16:44
tools or will we converge to one place 16:46
on that spectrum? 16:49
>> AGI is convergence obviously but but 16:50
leaving that aside, it's really hard to 16:52
plan for that world. I think that the 16:53
battle for how to incrementally make 16:56
product uh engineers more productive is 17:00
just it's an obvious one. The market 17:03
there is obvious. There's a lot of 17:05
companies going after it. Both the 17:06
application companies like cursor, the 17:09
underlying model companies are trying to 17:11
go there. I mean um you know claude code 17:12
is competing with cursor. Cursor uses a 17:15
cloud. I think it's a bit of a blood 17:17
bath there but it's the market is 17:19
obvious and the market is really large. 17:21
I would guess that there's going to be 17:22
more of a consolidation there. 17:25
>> Maybe it's not you know one but it's 17:27
probably two or three uh at best. I 17:29
think the sort of the the market we're 17:31
in is a lot larger. Uh so the 17:34
addressable humans is a lot more. You 17:37
know, we're talking about a billion, but 17:40
it could be more. Like really any 17:41
knowledge worker should be able to like 17:42
solve problems with software. My 17:45
conception of replet right now is the 17:46
what we want it to be is a universal 17:49
problem solver. It'll solve problems in 17:51
your personal lives or you know solve 17:53
problems in uh in your work and and all 17:54
of that. And so I think that that market 17:57
is uh will probably be like a little 17:59
more diverse and I think companies will 18:02
kind of figure out where um where they 18:03
slot in trying to solve uh autonomous 18:06
programming with the focus on the 18:09
non-engineer. 18:12
>> Uh we want you to not worry about 18:13
security, not worry about systems, not 18:15
worry about any of that. We want you to 18:18
really come in with your ideas to to 18:20
Replet and be uh an agents manager. And 18:22
we're trying to do it in a way that is 18:26
like as human as as possible and kind of 18:28
fits into into into the workflow. One 18:31
big difference between engineers and 18:33
sort of non-engineers, be it executives 18:36
or product managers, you're not on your 18:37
desk like 8 hours a day. Mobile is a big 18:40
part of that. We have like a really 18:43
great mobile app. And we're thinking 18:44
about this way of like ambient building. 18:46
Maybe you start an app on your uh in 18:48
your desktop. You go away with your 18:50
phone. You're in boring meeting. You get 18:51
a notification from the agent saying you 18:53
I'm done with this. Do you want 18:55
something else? You got texted. And so 18:56
we're trying to kind of build that 18:58
thing. 19:00
>> Yeah. A question I had um related to 19:01
that point is is about the user 19:04
interface. So for something like cursor 19:06
or wind surf, it's pretty obvious. The 19:08
primary UI element is like the code. You 19:10
you see code, you have a little chat 19:13
window, but primarily it's about diffs. 19:14
It's about kind of changes to code. And 19:16
for a tool like Replet, the primary 19:19
interface is like the graphical user 19:21
interface. You know, it's the it's the 19:23
buttons and the the wizzywig like you 19:25
see what you're building. 19:28
>> Exactly. And that's great for building 19:29
user interfaces, but when you're trying 19:30
to build more complicated sort of 19:32
logical flows, I found it a little bit 19:34
difficult cuz I couldn't there's no way 19:36
I couldn't see the code. I can't 19:37
visualize what's going on behind the 19:38
scenes. And it's sort of it's almost a 19:40
black box. Fast forwarding here, if 19:41
you're trying to build more complex 19:43
internal workflows, how do they how does 19:44
a product manager or an operations 19:46
manager at a big company visualize that 19:48
workflow and the kind of logical 19:50
branching that happens? 19:51
>> Yeah. So, if if you look back in the 19:52
history of computing, there was always 19:55
this vision of visual programming, it 19:57
never worked uh very well because 20:00
ultimately it's about Turing 20:03
completeness like these systems are not 20:04
universal computing devices. And now we 20:07
go to codegen. Obviously codegen is is 20:10
turing complete. Um but you're 20:12
interfacing with it primarily via 20:15
natural language. Natural language is 20:16
fuzzy. It's really hard to know whether 20:18
it's doing the right thing. I think the 20:20
synthesis of these two things is 20:22
probably coming where you are 20:24
interfacing with natural language but 20:27
you can instead of like just staring at 20:29
code uh there's maybe an interface or 20:31
like a different view on top of code. 20:34
You can imagine being able to do you 20:37
know small talk. 20:40
>> Yeah. So small talk is this uh 20:42
>> it's the first object-oriented um uh 20:44
programming um system and you know Alan 20:46
K would say it's like it is actually OP 20:50
where everything comes after it is not 20:53
um but the interesting thing about it in 20:55
small talk you can actually the way you 20:57
interact with code is not via files but 21:00
via objects via like logical objects and 21:01
so there's some kind of prior arc there 21:05
and I think the world we're headed in 21:07
where there's some kind of abstraction 21:08
over code that allows people to like 21:10
understand it. 21:12
>> Yeah, I think that's really interesting. 21:13
I there's some there's like an open 21:14
space there whether it's like pseudo 21:16
code is like looks like English but it's 21:17
a little bit more structured or it's a 21:19
visual drag and drop. I don't know. It's 21:21
>> Yeah, I think back to building products 21:22
with a team of engineers, designers, and 21:24
other folks. The interactions that I 21:27
would have as the like product lead with 21:29
those teams was verbal. It would be 21:31
written like abstracted ideas. We would 21:33
draw stuff on the whiteboard together. 21:36
We'd make system diagrams. We'd look at 21:37
the results of the app and we would test 21:40
it and point out like oh this thing is 21:42
broken that's too slow and it feels to 21:43
me like that sort of interface which is 21:45
very multimodal and very flexible is 21:48
probably the best end result like I 21:50
think we will get to something like that 21:52
where the author of products will be 21:54
doing that but the teams they're talking 21:56
to are not other humans they're agents 21:58
doing these things. Has has there been 22:00
an attempt in in sort of PM land to have 22:01
a little more formalism around 22:04
communication or 22:06
>> Yes. And I would not say it's been good. 22:08
The main thing is just like the PRD, 22:10
right? The product spec, right? And it's 22:12
just become, in my opinion, uh 22:14
oftentimes a performative 22:16
>> work artifact that you create just so 22:18
that you can have a thing to get your 22:21
promotion, right? If you're at a big 22:22
company. Uh but they're not actually 22:23
that useful. To me, the most useful 22:25
interactions are just these like 22:27
whiteboard conversations, right? What do 22:28
we wanted to do here? Oh, we got to 22:30
think about that. Oh, we didn't consider 22:31
this. Okay, let's re rethink this whole 22:33
thing. Like those sorts of 22:35
conversations. 22:36
>> AI can play a role in that as well. So, 22:36
I um you know this startup granola uh 22:38
that allows you to kind of record 22:42
meetings and they released this like 22:43
team version that like all the meetings 22:45
uh gets transcribed and and go there and 22:48
they have a mobile app right now where 22:50
like you can put on the table and like 22:51
it's trying. So I I was thinking maybe 22:53
we should go gr granola maximalism where 22:55
look you shouldn't fight the trend in 22:59
which companies are becoming 23:03
increasingly oral as opposed to like 23:04
written um and because like you know 23:06
people are talking on Slack people are 23:10
in meetings people like are 23:12
communicating via prompts with with 23:14
agents uh but you you would want a set 23:15
of AI tools that is actually like 23:18
creating that that that record and in 23:21
background that's like searchable and 23:24
organizable and and all of that. 23:26
>> Yeah. I wonder when we'll have our first 23:28
AI in like oral meetings. You know, you 23:29
you're jamming with your designer and 23:32
you the AI like chips in and says, 23:34
"Well, how about this idea?" 23:36
>> Yeah. And and this is where I think the 23:37
like dystopian view of AI or AI as like 23:40
taking all our jobs and all that is like 23:43
not correct. And I think the future of 23:46
work is more human, is more interactive, 23:48
is more multimodal, is more fun in my 23:51
opinion. 23:53
>> YC's next batch is now taking 23:54
applications. Got a startup in you? 23:56
Apply at y combinator.com/apply. 23:58
It's never too early and filling out the 24:01
app will level up your idea. Okay, back 24:04
to the video. All right, so so last time 24:07
we chatted, uh, you had launched Replet 24:08
agent and things were growing really 24:10
crazy. I imagine that has continued. 24:12
Any, anything you can share there? Maybe 24:14
soon we'll we'll share some numbers but 24:16
uh since replet agent launch we're 24:18
growing 45% 24:20
compound monthly uh average. 24:22
>> These are like metrics that we tell YC 24:24
companies during the batch when they 24:27
have a base of like no users to try to 24:28
achieve and you're doing this at larger 24:30
scale. 24:32
>> Yes. But you know it put a lot of strain 24:32
strain on the company and our systems 24:34
were still relatively relatively small. 24:36
I I feel like it can get to to your head 24:38
and you can start optimizing for for the 24:40
wrong thing. It's very easy in AI to 24:43
increase AR while users are not happy 24:47
>> because they're spending a lot more and 24:50
like not getting the results and in some 24:52
cases maybe shouldn't grow that that 24:55
fast because like you'd want users to 24:57
get a better experience for less less 24:59
money and so it's one thing that we try 25:01
not to obsess we actually don't have AR 25:03
goals at at Replet we have like more 25:05
product goals retention goals just like 25:07
other methods 25:09
>> yeah because the the sort of bad pattern 25:10
with some AI companies you grow topline 25:11
revenue very very quickly the churn is 25:13
like approaching 100% and eventually 25:15
that just catches up 25:17
>> and the gross margins are horrible too 25:18
and so it's like 25:19
>> you know the more growth you have the 25:21
worse the company's doing financially 25:23
and so 25:24
>> so how do investors see this space you 25:25
you must have talked with a bunch can 25:27
they tell the difference 25:29
>> it's all kind of a blur for them because 25:30
investors when they I mean I'm just 25:32
going to generalize here but when they 25:35
start looking at the space they'll use 25:36
everything for three minutes and 25:38
everything for three minutes looks the 25:40
same so I I think I think it will it 25:41
will start to clarify and these products 25:44
will start to uh I suppose to converge 25:47
diverge more with the different focuses 25:50
in the areas we're talking about. I 25:52
think over the next year it'll be like a 25:54
little more clearer but I think a lot of 25:57
them are are just like you know super 25:59
when we talk to them they're super 26:01
confused. They don't understand these 26:02
systems. They don't understand where 26:04
they're going. 26:05
>> I think that's a great segue into um the 26:06
the sort of technology underlying some 26:08
of these tools. I'd love to dig in a 26:10
little bit deeper. We've we've seen some 26:11
announcements from cursor and wind surf 26:13
about how they're kind of layering 26:15
whether it's um Claude or Gemini or um 26:16
the OpenAI models with then their own 26:20
layers on top the fast apply APIs stuff 26:23
like that fast apply models I guess can 26:26
you give us an overview of how it works 26:28
at Replet? 26:29
>> Yeah, a lot of it is you're patching 26:30
problems with the underlying frontier 26:32
models. Yeah. 26:34
>> So the reason fast supply was important 26:35
because none of the models uh were doing 26:37
very well at diffs. Can you explain what 26:40
what supply is? 26:42
>> When you're trying to like edit a file 26:44
as an LLM, the best thing to do is to 26:46
like uh create a create a diff. Uh but 26:48
these models are actually not very good 26:51
at creating diffs. They're actually not 26:53
very good at countering the lines in the 26:54
source code. So they get confused about 26:56
a lot of these things. So for a while, a 26:58
lot of companies were just like 27:01
rewriting the entire entire file. 27:02
>> And so diff for for nontechnical users 27:04
is sort of like remove these three lines 27:06
and inject these other three lines 27:08
instead. like find and replace almost. 27:10
>> That's right. That's right. And so you 27:12
needed some way to um make the model 27:14
generate some something of a diff, 27:19
>> but it's actually not good enough to 27:21
merge it. And so you need another model 27:23
to actually do the merge. So 27:25
>> instead of rewriting like it's a 27:27
thousand line file, just like output the 27:29
entire thousand lines, 27:30
>> it's very slow. It's very expensive. So 27:31
you prompt the model to be as lazy as 27:34
possible. 27:36
>> But in that case, it's really hard to 27:37
apply. Okay, so you need another model 27:38
that's like doing the application. Okay, 27:40
>> you can train a model or you can use 27:42
Gemini Flash or some of these smaller 27:43
smaller models. And so in some cases, 27:46
you need to to train a model or 27:48
fine-tune a model to do a better job at 27:50
it. In other cases, you just also patch 27:52
a bunch of other models to to do it. 27:54
It's engineering. It's engineering as 27:56
opposed to like research, I would say. 27:58
>> And I noticed um you guys don't expose 28:01
the underlying model to the user. With 28:04
other tools like cursor and wind surf, 28:06
there's a drop down. It's like I want to 28:07
you know let's see what Gemini thinks of 28:09
this problem. You don't do that. Why is 28:11
that? 28:13
>> A a big part of our research efforts is 28:13
in eval 28:16
>> and I think this is like an underrated 28:18
part of like you know co AI coding. So 28:20
we spend a ton of time evaling new 28:23
models writing evals generating evals 28:26
just data crunching trying to figure out 28:29
what the users are are getting or are 28:31
feeling. We built a lot of systems try 28:33
to understand um how these systems are 28:35
are performing the moment a new frontier 28:38
model uh lands we are evaluating almost 28:41
immediately. Jim and I for example like 28:44
uh you know couple months ago were 28:47
making the rounds really great model 28:48
really awesome at like oneshotting in 28:51
some cases better than claude agentic 28:52
work wasn't at like a tool calling at 28:55
you know some of the things like that 28:58
but users just see the hype and they're 29:00
like oh like yeah give me Gemini that's 29:03
>> how much notice do you have do they like 29:06
drop it on you and you like scrambling 29:07
or do you have like days or weeks before 29:09
that to to try to you know test it out 29:10
>> well we we have good partnerships with 29:12
these companies we We have a great 29:14
partnership with Google. We have a great 29:15
partnership with uh with Anthropic, even 29:17
with OpenAI. We have a close 29:19
relationship. So, a lot of them give us 29:20
give us a heads up, give us some early 29:22
checkpoints and we we try we we kind of 29:24
play with all of them. And in the case 29:27
of anthropic, we're always like the 29:28
first day kind of launching because we 29:30
end up building. A lot of times we're 29:32
sort of anticipating where they're 29:34
going. 29:35
>> Yeah. 29:35
>> Because you kind of like you can tell 29:36
like 3.5 3.7 there there's some 29:37
direction you can tell like where 4.0 is 29:41
going to land. So you start architecting 29:43
the systems but you know ultimately a 29:45
lot of our engineering efforts still 29:47
infrastructure like the the distributed 29:48
network file system that snapshot based 29:51
network file system like it took us 2 29:54
years to build like there's nothing off 29:56
the shelf to do that a lot of the 29:58
security stuff is really really 30:00
difficult like replet is one of the few 30:01
places in the world where you can get 30:04
like a virtual machine in the cloud um 30:05
you know by just like creating an 30:08
account and that's like a it's really 30:09
hard to kind of run this and protect 30:12
against this. You have crypto miners, 30:14
you have all sorts of stuff like that. 30:16
And Replet also uses um uh Nyx OS under 30:18
the hood. Nixos is a like fully 30:22
declarative also transactional uh 30:24
operating system generator as it were. 30:27
Uh we have like this multi- terabyte uh 30:29
hard drive in all the different regions 30:32
that we have compute. um that that that 30:34
cached all the packages in the in the 30:37
world and that gets attached to every 30:39
container and and again all of that I 30:40
mean I keep coming back to this idea of 30:43
transactionality like you would want the 30:45
system to be fully functional. You would 30:47
want it to be um you would want it to be 30:49
safe in order to like experiment with 30:52
something go back do the sampling with 30:54
with agents. So really a lot of the a 30:56
lot of the work is just like the 30:59
engineering of this infrastructure and 31:01
it's like not as a parent not as sexy as 31:03
sort of like we're trained this model 31:05
here's the um but but I think this is 31:06
where you can build like a uh a bit of a 31:09
lead. 31:13
>> Yeah. I when VCs talk about moes uh the 31:13
thing that I translate it to in my head 31:16
is what is the compounding advantage 31:18
that you might be able to achieve. 31:20
Right. It's not really a defensive mode. 31:21
It's more just I'm ahead and by being 31:23
ahead in some vector it allows me to 31:25
continue to move faster. That's right. 31:27
>> And it sounds like this is a great 31:29
example of that. 31:30
>> True modes are often not obvious until 31:30
like decades perhaps many decades into 31:32
into the company. Like you wouldn't know 31:34
what like Netflix's mode. Uh but 31:36
obviously they have one like Disney 31:39
tried to like compete with them and 31:40
everyone was like you know down on on on 31:42
Netflix but turned out they have a mode 31:44
that is this uh content production 31:45
system that they that they built. 31:48
>> So Amad you you started with the mission 31:49
of like making it easier for people to 31:51
learn to code. You've accomplished a lot 31:53
of that. Now you're pushing the envelope 31:55
of what it even means to code. I've got 31:57
young kids. I want them to be productive 32:00
creators in the world. What should I 32:02
tell them to do? Should they learn to 32:04
code? What does that even mean? 32:06
>> Look, I think if you want to go the like 32:08
professional software developer route, I 32:11
think getting a computer science degree 32:13
and like learning fundamentals makes 32:15
sense. But if you want to be a a 32:17
creator, if you want to be a generalist 32:20
in this world, I don't think it's 32:22
necessary anymore to learn to code in 32:23
the more like traditional ways of 32:25
learning to code. I think you pick it up 32:27
by osmosis almost like like go go go to 32:29
replet and you know and start using it 32:32
and at some point you're going going to 32:34
run into some issue where you're going 32:36
to have to look at code or you're going 32:37
to have to look at logs just by being 32:39
resourceful having to Google around and 32:41
all that. You'll start picking it up. 32:42
And by the way, this is how our 32:44
generation kind of learned how to code, 32:45
right? when you were talking I'm like 32:46
that's what I did. 32:48
>> Yeah. And somehow uh it just became very 32:49
industrial and very formal over the 32:52
years like the way we made web apps is 32:55
like you just like start a notepad with 32:57
a you know HTML file and now you have to 32:59
like learn like webpack or whatever you 33:01
know it's like so um I think the future 33:03
of work is not really clear what is you 33:06
you know we can sort of like have some 33:09
like you know idea of like where where 33:11
the world is headed and so with with my 33:14
children I want them to have like as 33:15
broad uh based knowledge as possible. Uh 33:18
I want them to be as generalist as 33:21
possible. I want them to be as 33:23
generative as possible like being able 33:25
to like create a lot of ideas because 33:27
once the making of things gets easier. 33:29
The bottleneck goes back to to to like 33:32
how many ideas you can have. 33:35
>> Um and so I I I wouldn't put learning to 33:37
code as the top thing. I would put learn 33:40
to make things. Learn to make things 33:43
with code. Learn to make things with 33:45
video. learn to make like anything with 33:47
AI. 33:49
>> And so what do you think happens with 33:49
SAS generally if we're very soon able to 33:52
say create me a version of Google 33:54
Calendar or clone Docyign? What happens 33:57
to SAS? Do you think 33:59
>> we have stories today of um a lot of 34:00
people replacing hundreds of thousands 34:04
of dollars worth of SAS um with Replet? 34:07
The other day I heard a story from 34:10
someone who um you know they got quoted 34:12
$150,000 for a piece of software. He 34:15
went and made it in replet sold it to 34:18
his employer for $32,000 34:20
cost him $400. Companies that have a 34:23
platform developer community around it 34:28
and and plug-in ecosystem and things 34:30
like that. I think those are safe. 34:32
>> You're not going to be able to vibe code 34:34
Salesforce. I think the the vertical SAS 34:35
is is in trouble and I think it's my 34:39
guess it's already probably showing in 34:42
some of the metrics. 34:44
>> Okay, last question. What advice would 34:45
you give to um founders starting out 34:47
right now? 34:50
>> You know the best advice is what we what 34:51
you what you actually pointed out 34:53
earlier is work on the edge of what's 34:55
possible. 34:57
uh because uh one evolution of of AI or 34:59
models will make your business valuable 35:03
and suddenly you're first on market. I 35:05
find it rare to see founders that are 35:07
actually like sitting down trying to 35:10
actually predict the future and and and 35:11
maybe that's something that was you know 35:14
ill advised in the past but I think 35:17
right now trying to actually figure out 35:19
where things are headed is very very 35:21
important uh skill to have. So like make 35:23
some prediction uh figure out like how 35:26
to create like a crappy product that 35:28
would get better immediately as you 35:32
switch switch the model. I mean computer 35:34
use is a is is a great example. 35:36
>> Oh it's been an absolute pleasure. Thank 35:38
you so much for coming on. 35:39
>> Pleasure. Thank you. 35:40
>> See you next time. Thanks. 35:41
[Music] 35:45

– English Lyrics

📚 Don’t just sing along to "" – train your ears, learn vocab, and become a language pro in the app!
By
Viewed
34,984
Language
Learn this song

Lyrics & Translation

[English]
I think the like dystopian view of AI or
AI as like taking all our jobs and all
of that is like not correct. And I think
the future of work is more human, is
more interactive, is more multimodal.
Once the making of things gets easier,
the bottleneck goes back to to to like
how many ideas you can have. I wouldn't
put learning to code as the top thing. I
would put learn to make things. Learn to
make things with code. Learn to make
things with video. learn to make like
anything with AI.
[Music]
>> Welcome back to the breakdown. I'm Dave.
This is Tom. And today we're joined by
Amjad Msad. Amjad is founder and CEO of
Replet, which was a YC 2018 company. Uh
welcome to the show.
>> Thank you.
>> So today, uh we want to talk about kind
of your beginnings in helping people
learn to code. what you're doing now in
replacing a lot of that work and then
where we see the future going in terms
of how people make things. So, Replet
started um I guess in 2016, you were in
YC in 2018. You started out as a tool to
make it really easy for people learning
to program to set up development
environments purely on the web. Um but
now you're really taking off in AI
assisted coding. Tell us tell us the
latest. Yeah. So, um the mission has
always been to make programming more
accessible. Uh then we sort of uh you
know after YC we sort of updated it to
be a little more ambitious. We started
talking about a billion software
developers and it just sounded absurd at
the time and
>> this is all prel right.
>> Yeah. Prelm. There was a moment of time
I don't know if you remember that but
like 20 in the like 2015 AI hype. There
was like a bit of a NLP natural language
hype as well. There's a bunch of AI
companies that ended up being just like
humans behind the scene. They all went
broke, but um there was like a a glimpse
of potentially doing NLP on code. So, we
had an idea that, you know, it's it's
probably coming. I I even had it in my
in my seed deck that like at some point
like we'll collect enough data to like
train models. It wasn't until like GPG2
in 2020 that I felt like it was was
going to be possible. So we you know we
had we had built all these um primitives
like the development environment the
sort of hosting environment all the
other stuff around it. Uh and I felt
like if if we just like added um like AI
agents like it will be able to
orchestrate all the stuff and it'll be
it'll be really great. agents were like
every time we tried them. Uh I think the
first time we tried it like 2021 didn't
work, 22 it didn't work and then 2024
like early early in the in the year we
just felt like it's getting close.
>> Even GPT40 could like be coherent for
like um 2 minutes somehow just ended up
doing a big bat. The company was not
doing super well.
>> Yeah. was I was about to ask was there
like a bet the company moment where it's
like we were teaching people to code now
we're no longer doing that now we're
going to enable everyone to build apps
>> yeah yeah yeah we had grown the company
quite a bit um and we were burning way
too much money uh and we decided to do a
layoff
>> and so we we um cut like perhaps 50
people and then like another like 15 20
people left and so we were like less
than half of the company and so I just
put everything on on replet agents I
just felt like this is the thing it has
to work. It felt like it was close.
Yeah. I don't know. I was just like burn
the boat boats kind of moments. It's
just like we got to do this. I mean,
it's I felt like this was the this was
the the thing that would make the
company work. And honestly, like if claw
3.5 hadn't come out mid US building
agent, we would have probably failed.
>> Yeah.
>> Because um like I said, like GPT40 would
like stay coherent for 2 3 minutes. Quad
3.5 was the first one that like could
work for like you know five to 10
minutes and actually actually work and
the code generations
>> and that's sort of the approach that I
think is most successful for startups
kind of building on the very edge of
what is possible you know you start out
on a mission it's the techn is not quite
there yet you start building and you
sort of you you skate where the puck is
going you know it sort of catches up
with you
>> yeah and you you want to keep keep doing
that
>> totally
>> yeah I remember you came to speak at YC
like 6 months ago and right after you
had launched this and at the time you
said we're very far from fully automated
software development.
>> Do you still feel that way?
>> No, not at all. I mean, I I feel like
I've, you know, every time I make a sort
of some kind of prediction, it feels
like bold and like, you know, but I've
been I've been consistently wrong and
like uh the way things are moving uh is
a lot faster. Just the level of
autonomy. Uh like I said, like maybe 3.5
was like 5 to 10 minutes. Like 3.7
perhaps can get up to like four five
minutes to an hour. The uh 4.0 No, in
the in the system card and opus, they
said they made it work for seven hours.
>> Wow.
>> 7 hours.
>> That's incredible.
>> That's always been the limiting factor
for making agents is like, can they stay
coherent? Can is the context length
actually useful to reason over. If an
LLM can work for for seven hours, that's
basically like a human worker.
>> Yeah. And presumably they're working at
much faster speed than a human would do.
So they're kind of completing perhaps a
week's work in those seven hours. The
only thing that I think is is missing
and a big limiting factor for actually
automating a lot of work is computer
use.
>> Computer use kind of sucks.
>> You know, I'm sure you've you've tried
it. And this is the difference between
Replet agent really uh giving it one
prompt going all all the way in an app
versus having to kind of babysit it a
little bit and like test with it.
>> And we had a company in the last batch
called browser use that's um that's
working browser automation.
>> Browser Use is great. And another one
called pig which is doing the same for
kind of windows desktops.
>> And uh I think advice I would give
founders today is um taking either
browser use or windows automation with
pig and trying to apply that into
enterprise into vertical
>> into a vertical industry.
>> The moment this technology worked those
two companies are just going to
>> totally and I think I think we're like
weeks or perhaps single digit months
away from it working really really well
and so now is the time to get started
using those technologies.
>> Absolutely 100%. And this is this is
what we're focused on. So you know
replet agent v1 to v2 was a huge jump in
autonomy. Uh v3 is the most autonomous
thing. So we're already working on it
and um the interesting thing for us is
the underlying technology and how it
would enable autonomy. There are like a
few important things. One is uh uh you
know being transactional. The ability to
roll back is is very important. So you
you would want you'd want it to be safe
for agents in the same way that gits
made it safe for human programmers to
kind of experiment and create branches
whatever you want the same thing for for
agents like if an agent kind of messes
up a DB migration it should be able to
roll back and also it should be able to
like sample
>> uh across different different paths. Uh
I think this is very very important for
autonomy. If you look at uh like for
example when when uh Anthropic publishes
their Swedbench score they publish one
with that sampling and one with sampling
and it goes from 70% to 80%.
>> And so this is the idea that you sort of
you spawn multiple agents each one will
take a a shot at it and then you figure
out which one works and and choose that
that branch effectively. We built a
infrastructure that is um uh fully
transactional and moves in lock step
meaning the file system is a snap
snapshot fail based file system the DB
is a snapshot based uh u DB and so a as
you're we're making commits uh to the
entire system including the virtual
machine as you're going uh and so what
we can do is also we can we can fork it
and branch it out. So if if we had
computer use that's working well you can
you can just like right now what they do
with the sampling is they do they have
some kind of judge that does ranking
>> which you know it's not a true verifier
a true verifier is a is a is a test
right and a computer use test so you you
can sample out and really pick the best
branch that's actually working
>> that's incredible
>> and and repeat that on and on and
reliability gets really really high
>> so fast forward six or 12 months you're
not spawning one agent you're spawning
is Is it five or 10 or a million? Like
how
>> I think this is where it's going to get
interesting, which is you'd want to give
the user the ability to set compute
budgets. I I think we're starting to see
that with um with the O style models of
like here's here's how much budget. But
but look, I mean, if if you give us
$1,000, we'll spend them if you know.
So, uh
>> that's really cool. I've not thought of
that before. Um but the idea of like
spawning multiple branches then just
picking the best one and being able to
do that in parallel. I just the human
brain doesn't naturally work like that.
We think sequentially. We can't.
>> It reminds me of this legend. I don't
know if this is actually true, but at
Apple when Steve was running it, he
would intentionally have teams doing
basically the same things and then see
which one did the best job. It's like
>> I've heard OpenAI does that now.
Interesting. Uh I I've heard that like
the Codeex project like had multiple
different different teams. So in the
literature, you'll see that uh small
small models sampled uh will beat larger
models. So Sonnet sampled is probably
better than than Opus companies. uh
haven't tried that where instead of
hiring one senior engineer you hire like
10 junior engineers and like give them
the same task and then
>> it's just very expensive with humans but
with LM it's relatively cheap you can
just do that do a hundred of them and
just pick the best one every time
>> so how are people using replet agent
like and and who are the people using it
>> you know it goes back to sort of our
early vision that if you make
programming easy uh then uh then more
and more people would want to would want
to do it actually this is one of the
first thing uh that that PG PG and I
sort of connected on um before we got
into YC actually PG um found us on
HackerNews and so we started this email
relationship um and so he he told me
there's like a super linear relationship
with how easy programming is versus how
many people would want to do it. The
optimizing function of rapid has always
been like just keep lowering the the
barrier to entry and and that's how you
grow users and you grow uh customers.
right now. Um, basically people from
kind of every walk of life, we've seen
users, product managers tend to be tend
to be a very a great um use case and
users for us and we've had customers uh
product managers uh who uh are able to
make significant impact on the business
without talking to engineers at all like
you know running AB tests or
optimizations or or things like that. I
mean it's empowering. I mean it gets us
to think about the really the seams
between these roles like you know
product manager, designer, engineer. We
actually just created a new new product
group and and and typically you know
product uh you know head of product is
have has a bunch of like product
managers reporting to them but we're
actually having uh this this group have
has designers, engineers and and PMs and
the the idea is they're all using AI all
the time to to prototype in some cases
go all the way to production. there
isn't this sort of waterfall model where
you know there's a lot of inefficiency
you know like communication problems
between these different teams they can
move incredibly fast and I think that
that starts to change how tech companies
work my experience of this was when
whenever I was working in a startup the
list of ideas and the backlog would
always would be infinitely long
basically and the bottleneck was always
engineering time and now I'm doing my
own personal projects I write my to-do
list or my idea list and then I get to
work and my the ideas just get done and
suddenly the bottleneck is like my
ability to have ideas
>> and it's just such a weird experience
looking at a to-do list. It's just empty
being like what do I do next? I've heard
from a team that has like a really large
uh replet deployment in their company
that their founders using Replet and
that's stressing the engineers out
because oh I did this in a weekend like
can't you do it in a
>> what do you guys show for yourself and
so are these previously technical people
or non techchnical people they're
building the first version are they
deploying that production or are they
giving to engineers and say hey build
this like what what do you see typically
>> we advise to like work with with
engineering but that doesn't always
happen I think It's um understandable
that a lot of PMs and designers want to
go straight to users. What we're seeing
is they go to like beta users and test
users and I think this works really well
but in some cases they just like put it
in production. Uh and so right now we're
having discussions with all these
companies especially the engineering
leaders are unhappy about this like
who's on call for these services you
>> who's responsible for the bug
>> who's responsible for it. There's a lot
of questions that are coming up. The
obvious answer to all these questions is
like agents are responsible.
>> So what's the limiting factor today? I
buy code a thing and I'm like yoloing it
into production. Like what are what do
the engineers typically object like?
What what's going wrong?
>> Security is is is a big one. Like um
LLMs are fallible like like humans are.
They tend to write some uh there's some
components that they do terribly at like
for example off
>> uh like they all kind of suck at it. all
use like you know old methods of of
assaulting and hashing and so that's
been that's been a big sticking point
and we've seen a lot of examples out
there right now of some catastrophes
luckily it hasn't been like there hasn't
been a major catastrophe I think it's
coming but uh there's been uh like solo
founders who would leak um you know API
keys or would make it really easy to get
around the login security protections
and there are a lot of tools out there
that are like really not trying to take
responsibility for that and saying, "Oh,
it's it's the user's problem, the user's
fault." For us, we think that as a
platform that is marketing uh for the
non-developer, you actually have a
responsibility. We're trying to take
away
uh some of the things that we think LMS
should not do today. Uh so O for
example, we have a built-in O. So if you
go to wrap and just say add o, it will
pull in an o component that we built
from scratch. It has capture on it. It
has all the security you know bells and
whistles. So you don't have to worry
about all of that.
>> We integrated with your database. You
have a user management portal on on your
site as much as possible. Building in
these components that are tricky. I
think payments is the other one. I don't
think you'd want the LLMs to kind of do
do the payments.
>> And they're not that different, right?
like the sort of checkout. You know, you
might have a one time checkout, you
might have subscription, pay as you go,
but there are not unlimited versions of
payments. There's like two or three or
four.
>> And this is the analogy is today humans
don't write their own version of
payments or their own version of O. They
use providers that have built these
components already. So, seems like we're
seeing the same thing play out and
that's the best way to do it.
>> Yeah, 100%. And then the other thing we
added uh we partnered with is a a great
security company called Samrip. And so
right now uh when you go to deploy a
replet app, we run a security scan. We
run a code security scan and we give you
like a report of warnings and errors and
things like that and the agents can try
to fix them for you.
>> Okay. And so security is one of the big
kind of bottlenecks to fully deploying
to production. I can imagine there must
be other things that coming down the
line like sort of scalability, you know,
looking for like N plus1 database
queries, uh performance bottlenecks.
What do you have in your mind like a
clear set of of blockers that need to be
overcome before this is truly like one
click to deploy?
>> Yeah, I I mean the big the biggest one
is just going to be humans like social
like you know there's just going to be
mistrust and I think it's just going to
have has has to play out. So you know
leaving that aside like enterprises has
to adapt to all of that stuff having
some way to also scan for scalability
like figuring out you know uh like doing
fuzzing or whatever it is or having some
kind of adversarial agent that's you
know trying to to to break your app I
think um you know that's one big thing I
think integrating with a company's
ecosystem so one thing we're adding is
the ability to bring in your design
system
>> that's cool
>> so in addition to kind of us providing
these components if you go to any
company they have a lot of these
components built out as raplet is
getting deployed into these large
companies how can we hook into their
their internal systems
>> this makes me think about the kind of
spectrum of these different coding
tools. Um, on one end of the spectrum,
you have kind of what I would call the
the power tools, the cursors, the wind
surf that let developers use this as a
way to amplify their efforts. And on the
full other end of the spectrum, you have
more of the like consumerf facing, hey,
if you want to make an app, like you can
now make an app. And it sounds like you
guys are kind of in the middle. Um,
you're helping companies get stuff done,
but you're doing it for people who don't
look like the tra traditional developer.
How do you see this world playing out?
Are there going to be like n different
tools or will we converge to one place
on that spectrum?
>> AGI is convergence obviously but but
leaving that aside, it's really hard to
plan for that world. I think that the
battle for how to incrementally make
product uh engineers more productive is
just it's an obvious one. The market
there is obvious. There's a lot of
companies going after it. Both the
application companies like cursor, the
underlying model companies are trying to
go there. I mean um you know claude code
is competing with cursor. Cursor uses a
cloud. I think it's a bit of a blood
bath there but it's the market is
obvious and the market is really large.
I would guess that there's going to be
more of a consolidation there.
>> Maybe it's not you know one but it's
probably two or three uh at best. I
think the sort of the the market we're
in is a lot larger. Uh so the
addressable humans is a lot more. You
know, we're talking about a billion, but
it could be more. Like really any
knowledge worker should be able to like
solve problems with software. My
conception of replet right now is the
what we want it to be is a universal
problem solver. It'll solve problems in
your personal lives or you know solve
problems in uh in your work and and all
of that. And so I think that that market
is uh will probably be like a little
more diverse and I think companies will
kind of figure out where um where they
slot in trying to solve uh autonomous
programming with the focus on the
non-engineer.
>> Uh we want you to not worry about
security, not worry about systems, not
worry about any of that. We want you to
really come in with your ideas to to
Replet and be uh an agents manager. And
we're trying to do it in a way that is
like as human as as possible and kind of
fits into into into the workflow. One
big difference between engineers and
sort of non-engineers, be it executives
or product managers, you're not on your
desk like 8 hours a day. Mobile is a big
part of that. We have like a really
great mobile app. And we're thinking
about this way of like ambient building.
Maybe you start an app on your uh in
your desktop. You go away with your
phone. You're in boring meeting. You get
a notification from the agent saying you
I'm done with this. Do you want
something else? You got texted. And so
we're trying to kind of build that
thing.
>> Yeah. A question I had um related to
that point is is about the user
interface. So for something like cursor
or wind surf, it's pretty obvious. The
primary UI element is like the code. You
you see code, you have a little chat
window, but primarily it's about diffs.
It's about kind of changes to code. And
for a tool like Replet, the primary
interface is like the graphical user
interface. You know, it's the it's the
buttons and the the wizzywig like you
see what you're building.
>> Exactly. And that's great for building
user interfaces, but when you're trying
to build more complicated sort of
logical flows, I found it a little bit
difficult cuz I couldn't there's no way
I couldn't see the code. I can't
visualize what's going on behind the
scenes. And it's sort of it's almost a
black box. Fast forwarding here, if
you're trying to build more complex
internal workflows, how do they how does
a product manager or an operations
manager at a big company visualize that
workflow and the kind of logical
branching that happens?
>> Yeah. So, if if you look back in the
history of computing, there was always
this vision of visual programming, it
never worked uh very well because
ultimately it's about Turing
completeness like these systems are not
universal computing devices. And now we
go to codegen. Obviously codegen is is
turing complete. Um but you're
interfacing with it primarily via
natural language. Natural language is
fuzzy. It's really hard to know whether
it's doing the right thing. I think the
synthesis of these two things is
probably coming where you are
interfacing with natural language but
you can instead of like just staring at
code uh there's maybe an interface or
like a different view on top of code.
You can imagine being able to do you
know small talk.
>> Yeah. So small talk is this uh
>> it's the first object-oriented um uh
programming um system and you know Alan
K would say it's like it is actually OP
where everything comes after it is not
um but the interesting thing about it in
small talk you can actually the way you
interact with code is not via files but
via objects via like logical objects and
so there's some kind of prior arc there
and I think the world we're headed in
where there's some kind of abstraction
over code that allows people to like
understand it.
>> Yeah, I think that's really interesting.
I there's some there's like an open
space there whether it's like pseudo
code is like looks like English but it's
a little bit more structured or it's a
visual drag and drop. I don't know. It's
>> Yeah, I think back to building products
with a team of engineers, designers, and
other folks. The interactions that I
would have as the like product lead with
those teams was verbal. It would be
written like abstracted ideas. We would
draw stuff on the whiteboard together.
We'd make system diagrams. We'd look at
the results of the app and we would test
it and point out like oh this thing is
broken that's too slow and it feels to
me like that sort of interface which is
very multimodal and very flexible is
probably the best end result like I
think we will get to something like that
where the author of products will be
doing that but the teams they're talking
to are not other humans they're agents
doing these things. Has has there been
an attempt in in sort of PM land to have
a little more formalism around
communication or
>> Yes. And I would not say it's been good.
The main thing is just like the PRD,
right? The product spec, right? And it's
just become, in my opinion, uh
oftentimes a performative
>> work artifact that you create just so
that you can have a thing to get your
promotion, right? If you're at a big
company. Uh but they're not actually
that useful. To me, the most useful
interactions are just these like
whiteboard conversations, right? What do
we wanted to do here? Oh, we got to
think about that. Oh, we didn't consider
this. Okay, let's re rethink this whole
thing. Like those sorts of
conversations.
>> AI can play a role in that as well. So,
I um you know this startup granola uh
that allows you to kind of record
meetings and they released this like
team version that like all the meetings
uh gets transcribed and and go there and
they have a mobile app right now where
like you can put on the table and like
it's trying. So I I was thinking maybe
we should go gr granola maximalism where
look you shouldn't fight the trend in
which companies are becoming
increasingly oral as opposed to like
written um and because like you know
people are talking on Slack people are
in meetings people like are
communicating via prompts with with
agents uh but you you would want a set
of AI tools that is actually like
creating that that that record and in
background that's like searchable and
organizable and and all of that.
>> Yeah. I wonder when we'll have our first
AI in like oral meetings. You know, you
you're jamming with your designer and
you the AI like chips in and says,
"Well, how about this idea?"
>> Yeah. And and this is where I think the
like dystopian view of AI or AI as like
taking all our jobs and all that is like
not correct. And I think the future of
work is more human, is more interactive,
is more multimodal, is more fun in my
opinion.
>> YC's next batch is now taking
applications. Got a startup in you?
Apply at y combinator.com/apply.
It's never too early and filling out the
app will level up your idea. Okay, back
to the video. All right, so so last time
we chatted, uh, you had launched Replet
agent and things were growing really
crazy. I imagine that has continued.
Any, anything you can share there? Maybe
soon we'll we'll share some numbers but
uh since replet agent launch we're
growing 45%
compound monthly uh average.
>> These are like metrics that we tell YC
companies during the batch when they
have a base of like no users to try to
achieve and you're doing this at larger
scale.
>> Yes. But you know it put a lot of strain
strain on the company and our systems
were still relatively relatively small.
I I feel like it can get to to your head
and you can start optimizing for for the
wrong thing. It's very easy in AI to
increase AR while users are not happy
>> because they're spending a lot more and
like not getting the results and in some
cases maybe shouldn't grow that that
fast because like you'd want users to
get a better experience for less less
money and so it's one thing that we try
not to obsess we actually don't have AR
goals at at Replet we have like more
product goals retention goals just like
other methods
>> yeah because the the sort of bad pattern
with some AI companies you grow topline
revenue very very quickly the churn is
like approaching 100% and eventually
that just catches up
>> and the gross margins are horrible too
and so it's like
>> you know the more growth you have the
worse the company's doing financially
and so
>> so how do investors see this space you
you must have talked with a bunch can
they tell the difference
>> it's all kind of a blur for them because
investors when they I mean I'm just
going to generalize here but when they
start looking at the space they'll use
everything for three minutes and
everything for three minutes looks the
same so I I think I think it will it
will start to clarify and these products
will start to uh I suppose to converge
diverge more with the different focuses
in the areas we're talking about. I
think over the next year it'll be like a
little more clearer but I think a lot of
them are are just like you know super
when we talk to them they're super
confused. They don't understand these
systems. They don't understand where
they're going.
>> I think that's a great segue into um the
the sort of technology underlying some
of these tools. I'd love to dig in a
little bit deeper. We've we've seen some
announcements from cursor and wind surf
about how they're kind of layering
whether it's um Claude or Gemini or um
the OpenAI models with then their own
layers on top the fast apply APIs stuff
like that fast apply models I guess can
you give us an overview of how it works
at Replet?
>> Yeah, a lot of it is you're patching
problems with the underlying frontier
models. Yeah.
>> So the reason fast supply was important
because none of the models uh were doing
very well at diffs. Can you explain what
what supply is?
>> When you're trying to like edit a file
as an LLM, the best thing to do is to
like uh create a create a diff. Uh but
these models are actually not very good
at creating diffs. They're actually not
very good at countering the lines in the
source code. So they get confused about
a lot of these things. So for a while, a
lot of companies were just like
rewriting the entire entire file.
>> And so diff for for nontechnical users
is sort of like remove these three lines
and inject these other three lines
instead. like find and replace almost.
>> That's right. That's right. And so you
needed some way to um make the model
generate some something of a diff,
>> but it's actually not good enough to
merge it. And so you need another model
to actually do the merge. So
>> instead of rewriting like it's a
thousand line file, just like output the
entire thousand lines,
>> it's very slow. It's very expensive. So
you prompt the model to be as lazy as
possible.
>> But in that case, it's really hard to
apply. Okay, so you need another model
that's like doing the application. Okay,
>> you can train a model or you can use
Gemini Flash or some of these smaller
smaller models. And so in some cases,
you need to to train a model or
fine-tune a model to do a better job at
it. In other cases, you just also patch
a bunch of other models to to do it.
It's engineering. It's engineering as
opposed to like research, I would say.
>> And I noticed um you guys don't expose
the underlying model to the user. With
other tools like cursor and wind surf,
there's a drop down. It's like I want to
you know let's see what Gemini thinks of
this problem. You don't do that. Why is
that?
>> A a big part of our research efforts is
in eval
>> and I think this is like an underrated
part of like you know co AI coding. So
we spend a ton of time evaling new
models writing evals generating evals
just data crunching trying to figure out
what the users are are getting or are
feeling. We built a lot of systems try
to understand um how these systems are
are performing the moment a new frontier
model uh lands we are evaluating almost
immediately. Jim and I for example like
uh you know couple months ago were
making the rounds really great model
really awesome at like oneshotting in
some cases better than claude agentic
work wasn't at like a tool calling at
you know some of the things like that
but users just see the hype and they're
like oh like yeah give me Gemini that's
>> how much notice do you have do they like
drop it on you and you like scrambling
or do you have like days or weeks before
that to to try to you know test it out
>> well we we have good partnerships with
these companies we We have a great
partnership with Google. We have a great
partnership with uh with Anthropic, even
with OpenAI. We have a close
relationship. So, a lot of them give us
give us a heads up, give us some early
checkpoints and we we try we we kind of
play with all of them. And in the case
of anthropic, we're always like the
first day kind of launching because we
end up building. A lot of times we're
sort of anticipating where they're
going.
>> Yeah.
>> Because you kind of like you can tell
like 3.5 3.7 there there's some
direction you can tell like where 4.0 is
going to land. So you start architecting
the systems but you know ultimately a
lot of our engineering efforts still
infrastructure like the the distributed
network file system that snapshot based
network file system like it took us 2
years to build like there's nothing off
the shelf to do that a lot of the
security stuff is really really
difficult like replet is one of the few
places in the world where you can get
like a virtual machine in the cloud um
you know by just like creating an
account and that's like a it's really
hard to kind of run this and protect
against this. You have crypto miners,
you have all sorts of stuff like that.
And Replet also uses um uh Nyx OS under
the hood. Nixos is a like fully
declarative also transactional uh
operating system generator as it were.
Uh we have like this multi- terabyte uh
hard drive in all the different regions
that we have compute. um that that that
cached all the packages in the in the
world and that gets attached to every
container and and again all of that I
mean I keep coming back to this idea of
transactionality like you would want the
system to be fully functional. You would
want it to be um you would want it to be
safe in order to like experiment with
something go back do the sampling with
with agents. So really a lot of the a
lot of the work is just like the
engineering of this infrastructure and
it's like not as a parent not as sexy as
sort of like we're trained this model
here's the um but but I think this is
where you can build like a uh a bit of a
lead.
>> Yeah. I when VCs talk about moes uh the
thing that I translate it to in my head
is what is the compounding advantage
that you might be able to achieve.
Right. It's not really a defensive mode.
It's more just I'm ahead and by being
ahead in some vector it allows me to
continue to move faster. That's right.
>> And it sounds like this is a great
example of that.
>> True modes are often not obvious until
like decades perhaps many decades into
into the company. Like you wouldn't know
what like Netflix's mode. Uh but
obviously they have one like Disney
tried to like compete with them and
everyone was like you know down on on on
Netflix but turned out they have a mode
that is this uh content production
system that they that they built.
>> So Amad you you started with the mission
of like making it easier for people to
learn to code. You've accomplished a lot
of that. Now you're pushing the envelope
of what it even means to code. I've got
young kids. I want them to be productive
creators in the world. What should I
tell them to do? Should they learn to
code? What does that even mean?
>> Look, I think if you want to go the like
professional software developer route, I
think getting a computer science degree
and like learning fundamentals makes
sense. But if you want to be a a
creator, if you want to be a generalist
in this world, I don't think it's
necessary anymore to learn to code in
the more like traditional ways of
learning to code. I think you pick it up
by osmosis almost like like go go go to
replet and you know and start using it
and at some point you're going going to
run into some issue where you're going
to have to look at code or you're going
to have to look at logs just by being
resourceful having to Google around and
all that. You'll start picking it up.
And by the way, this is how our
generation kind of learned how to code,
right? when you were talking I'm like
that's what I did.
>> Yeah. And somehow uh it just became very
industrial and very formal over the
years like the way we made web apps is
like you just like start a notepad with
a you know HTML file and now you have to
like learn like webpack or whatever you
know it's like so um I think the future
of work is not really clear what is you
you know we can sort of like have some
like you know idea of like where where
the world is headed and so with with my
children I want them to have like as
broad uh based knowledge as possible. Uh
I want them to be as generalist as
possible. I want them to be as
generative as possible like being able
to like create a lot of ideas because
once the making of things gets easier.
The bottleneck goes back to to to like
how many ideas you can have.
>> Um and so I I I wouldn't put learning to
code as the top thing. I would put learn
to make things. Learn to make things
with code. Learn to make things with
video. learn to make like anything with
AI.
>> And so what do you think happens with
SAS generally if we're very soon able to
say create me a version of Google
Calendar or clone Docyign? What happens
to SAS? Do you think
>> we have stories today of um a lot of
people replacing hundreds of thousands
of dollars worth of SAS um with Replet?
The other day I heard a story from
someone who um you know they got quoted
$150,000 for a piece of software. He
went and made it in replet sold it to
his employer for $32,000
cost him $400. Companies that have a
platform developer community around it
and and plug-in ecosystem and things
like that. I think those are safe.
>> You're not going to be able to vibe code
Salesforce. I think the the vertical SAS
is is in trouble and I think it's my
guess it's already probably showing in
some of the metrics.
>> Okay, last question. What advice would
you give to um founders starting out
right now?
>> You know the best advice is what we what
you what you actually pointed out
earlier is work on the edge of what's
possible.
uh because uh one evolution of of AI or
models will make your business valuable
and suddenly you're first on market. I
find it rare to see founders that are
actually like sitting down trying to
actually predict the future and and and
maybe that's something that was you know
ill advised in the past but I think
right now trying to actually figure out
where things are headed is very very
important uh skill to have. So like make
some prediction uh figure out like how
to create like a crappy product that
would get better immediately as you
switch switch the model. I mean computer
use is a is is a great example.
>> Oh it's been an absolute pleasure. Thank
you so much for coming on.
>> Pleasure. Thank you.
>> See you next time. Thanks.
[Music]

Key Vocabulary

Start Practicing
Vocabulary Meanings

code

/koʊd/

B1
  • noun
  • - source code or computer program instructions
  • verb
  • - to write computer code

build

/bɪld/

A2
  • verb
  • - to construct or create something

learn

/lɜrn/

A1
  • verb
  • - to gain knowledge or skill

make

/meɪk/

A1
  • verb
  • - to create or produce

idea

/aɪˈdiə/

A1
  • noun
  • - a thought or concept

agent

/ˈeɪdʒənt/

B2
  • noun
  • - an AI program or software agent

model

/ˈmɑdl/

B2
  • noun
  • - an AI or machine learning model

program

/ˈproʊgræm/

B1
  • noun
  • - a computer program
  • verb
  • - to write a program

work

/wɜrk/

A1
  • noun
  • - employment or task
  • verb
  • - to function or operate

human

/ˈhjuːmən/

A1
  • adjective
  • - relating to people
  • noun
  • - a person

interactive

/ˌɪntərˈæktɪv/

B1
  • adjective
  • - allowing interaction

future

/ˈfjutʃər/

A2
  • noun
  • - time yet to come

easy

/ˈizi/

A1
  • adjective
  • - not difficult

access

/ˈækses/

A2
  • noun
  • - permission to use
  • verb
  • - to reach or use

possible

/ˈpɑsəbl/

A2
  • adjective
  • - able to happen

grow

/groʊ/

A2
  • verb
  • - to develop or increase

need

/nid/

A1
  • verb
  • - to require

change

/tʃeɪndʒ/

A2
  • noun
  • - alteration
  • verb
  • - to make different

problem

/ˈprɑbləm/

A2
  • noun
  • - a difficulty

way

/weɪ/

A1
  • noun
  • - a method or manner

Are there any new words in “” you don’t know yet?

💡 Hint: code, build… Jump into the app and start learning now!

Key Grammar Structures

Coming Soon!

We're updating this section. Stay tuned!

Related Songs