Trending Songs Recently Updated Songs Popular Music Genres Add Songs

Explore

Display Bilingual:

Off 한국어 Español Tiếng Việt Português Français 中文日本語

I think the like dystopian view of AI or 00:00

AI as like taking all our jobs and all 00:03

of that is like not correct. And I think 00:06

the future of work is more human, is 00:08

more interactive, is more multimodal. 00:11

Once the making of things gets easier, 00:13

the bottleneck goes back to to to like 00:16

how many ideas you can have. I wouldn't 00:18

put learning to code as the top thing. I 00:21

would put learn to make things. Learn to 00:23

make things with code. Learn to make 00:25

things with video. learn to make like 00:27

anything with AI. 00:29

[Music] 00:32

>> Welcome back to the breakdown. I'm Dave. 00:37

This is Tom. And today we're joined by 00:39

Amjad Msad. Amjad is founder and CEO of 00:40

Replet, which was a YC 2018 company. Uh 00:43

welcome to the show. 00:46

>> Thank you. 00:47

>> So today, uh we want to talk about kind 00:48

of your beginnings in helping people 00:50

learn to code. what you're doing now in 00:52

replacing a lot of that work and then 00:55

where we see the future going in terms 00:57

of how people make things. So, Replet 00:58

started um I guess in 2016, you were in 01:01

YC in 2018. You started out as a tool to 01:03

make it really easy for people learning 01:07

to program to set up development 01:09

environments purely on the web. Um but 01:11

now you're really taking off in AI 01:14

assisted coding. Tell us tell us the 01:16

latest. Yeah. So, um the mission has 01:18

always been to make programming more 01:20

accessible. Uh then we sort of uh you 01:22

know after YC we sort of updated it to 01:25

be a little more ambitious. We started 01:27

talking about a billion software 01:29

developers and it just sounded absurd at 01:31

the time and 01:33

>> this is all prel right. 01:34

>> Yeah. Prelm. There was a moment of time 01:35

I don't know if you remember that but 01:37

like 20 in the like 2015 AI hype. There 01:39

was like a bit of a NLP natural language 01:42

hype as well. There's a bunch of AI 01:45

companies that ended up being just like 01:47

humans behind the scene. They all went 01:48

broke, but um there was like a a glimpse 01:50

of potentially doing NLP on code. So, we 01:53

had an idea that, you know, it's it's 01:57

probably coming. I I even had it in my 01:58

in my seed deck that like at some point 02:00

like we'll collect enough data to like 02:03

train models. It wasn't until like GPG2 02:04

in 2020 that I felt like it was was 02:08

going to be possible. So we you know we 02:11

had we had built all these um primitives 02:13

like the development environment the 02:16

sort of hosting environment all the 02:18

other stuff around it. Uh and I felt 02:19

like if if we just like added um like AI 02:22

agents like it will be able to 02:25

orchestrate all the stuff and it'll be 02:27

it'll be really great. agents were like 02:28

every time we tried them. Uh I think the 02:31

first time we tried it like 2021 didn't 02:33

work, 22 it didn't work and then 2024 02:35

like early early in the in the year we 02:39

just felt like it's getting close. 02:42

>> Even GPT40 could like be coherent for 02:44

like um 2 minutes somehow just ended up 02:47

doing a big bat. The company was not 02:51

doing super well. 02:54

>> Yeah. was I was about to ask was there 02:55

like a bet the company moment where it's 02:56

like we were teaching people to code now 02:57

we're no longer doing that now we're 02:59

going to enable everyone to build apps 03:00

>> yeah yeah yeah we had grown the company 03:02

quite a bit um and we were burning way 03:03

too much money uh and we decided to do a 03:07

layoff 03:11

>> and so we we um cut like perhaps 50 03:11

people and then like another like 15 20 03:14

people left and so we were like less 03:17

than half of the company and so I just 03:18

put everything on on replet agents I 03:21

just felt like this is the thing it has 03:22

to work. It felt like it was close. 03:25

Yeah. I don't know. I was just like burn 03:27

the boat boats kind of moments. It's 03:29

just like we got to do this. I mean, 03:30

it's I felt like this was the this was 03:32

the the thing that would make the 03:35

company work. And honestly, like if claw 03:37

3.5 hadn't come out mid US building 03:40

agent, we would have probably failed. 03:43

>> Yeah. 03:45

>> Because um like I said, like GPT40 would 03:45

like stay coherent for 2 3 minutes. Quad 03:48

3.5 was the first one that like could 03:50

work for like you know five to 10 03:52

minutes and actually actually work and 03:54

the code generations 03:56

>> and that's sort of the approach that I 03:57

think is most successful for startups 03:58

kind of building on the very edge of 04:00

what is possible you know you start out 04:01

on a mission it's the techn is not quite 04:02

there yet you start building and you 04:04

sort of you you skate where the puck is 04:06

going you know it sort of catches up 04:07

with you 04:09

>> yeah and you you want to keep keep doing 04:09

that 04:11

>> totally 04:11

>> yeah I remember you came to speak at YC 04:12

like 6 months ago and right after you 04:14

had launched this and at the time you 04:16

said we're very far from fully automated 04:18

software development. 04:20

>> Do you still feel that way? 04:22

>> No, not at all. I mean, I I feel like 04:23

I've, you know, every time I make a sort 04:25

of some kind of prediction, it feels 04:27

like bold and like, you know, but I've 04:29

been I've been consistently wrong and 04:31

like uh the way things are moving uh is 04:33

a lot faster. Just the level of 04:36

autonomy. Uh like I said, like maybe 3.5 04:37

was like 5 to 10 minutes. Like 3.7 04:40

perhaps can get up to like four five 04:43

minutes to an hour. The uh 4.0 No, in 04:45

the in the system card and opus, they 04:48

said they made it work for seven hours. 04:51

>> Wow. 04:53

>> 7 hours. 04:54

>> That's incredible. 04:55

>> That's always been the limiting factor 04:56

for making agents is like, can they stay 04:57

coherent? Can is the context length 04:59

actually useful to reason over. If an 05:01

LLM can work for for seven hours, that's 05:04

basically like a human worker. 05:06

>> Yeah. And presumably they're working at 05:09

much faster speed than a human would do. 05:10

So they're kind of completing perhaps a 05:12

week's work in those seven hours. The 05:13

only thing that I think is is missing 05:15

and a big limiting factor for actually 05:17

automating a lot of work is computer 05:19

use. 05:21

>> Computer use kind of sucks. 05:22

>> You know, I'm sure you've you've tried 05:24

it. And this is the difference between 05:25

Replet agent really uh giving it one 05:27

prompt going all all the way in an app 05:30

versus having to kind of babysit it a 05:32

little bit and like test with it. 05:35

>> And we had a company in the last batch 05:36

called browser use that's um that's 05:37

working browser automation. 05:39

>> Browser Use is great. And another one 05:40

called pig which is doing the same for 05:41

kind of windows desktops. 05:42

>> And uh I think advice I would give 05:44

founders today is um taking either 05:45

browser use or windows automation with 05:48

pig and trying to apply that into 05:50

enterprise into vertical 05:52

>> into a vertical industry. 05:54

>> The moment this technology worked those 05:56

two companies are just going to 05:57

>> totally and I think I think we're like 05:59

weeks or perhaps single digit months 06:00

away from it working really really well 06:02

and so now is the time to get started 06:03

using those technologies. 06:05

>> Absolutely 100%. And this is this is 06:06

what we're focused on. So you know 06:08

replet agent v1 to v2 was a huge jump in 06:09

autonomy. Uh v3 is the most autonomous 06:12

thing. So we're already working on it 06:15

and um the interesting thing for us is 06:17

the underlying technology and how it 06:21

would enable autonomy. There are like a 06:23

few important things. One is uh uh you 06:25

know being transactional. The ability to 06:29

roll back is is very important. So you 06:31

you would want you'd want it to be safe 06:34

for agents in the same way that gits 06:37

made it safe for human programmers to 06:38

kind of experiment and create branches 06:41

whatever you want the same thing for for 06:43

agents like if an agent kind of messes 06:45

up a DB migration it should be able to 06:47

roll back and also it should be able to 06:49

like sample 06:51

>> uh across different different paths. Uh 06:53

I think this is very very important for 06:55

autonomy. If you look at uh like for 06:57

example when when uh Anthropic publishes 06:59

their Swedbench score they publish one 07:02

with that sampling and one with sampling 07:04

and it goes from 70% to 80%. 07:07

>> And so this is the idea that you sort of 07:09

you spawn multiple agents each one will 07:11

take a a shot at it and then you figure 07:13

out which one works and and choose that 07:15

that branch effectively. We built a 07:16

infrastructure that is um uh fully 07:19

transactional and moves in lock step 07:22

meaning the file system is a snap 07:25

snapshot fail based file system the DB 07:27

is a snapshot based uh u DB and so a as 07:29

you're we're making commits uh to the 07:34

entire system including the virtual 07:36

machine as you're going uh and so what 07:38

we can do is also we can we can fork it 07:40

and branch it out. So if if we had 07:42

computer use that's working well you can 07:44

you can just like right now what they do 07:46

with the sampling is they do they have 07:49

some kind of judge that does ranking 07:50

>> which you know it's not a true verifier 07:52

a true verifier is a is a is a test 07:55

right and a computer use test so you you 07:57

can sample out and really pick the best 07:59

branch that's actually working 08:01

>> that's incredible 08:02

>> and and repeat that on and on and 08:03

reliability gets really really high 08:05

>> so fast forward six or 12 months you're 08:08

not spawning one agent you're spawning 08:10

is Is it five or 10 or a million? Like 08:12

how 08:14

>> I think this is where it's going to get 08:15

interesting, which is you'd want to give 08:17

the user the ability to set compute 08:20

budgets. I I think we're starting to see 08:22

that with um with the O style models of 08:23

like here's here's how much budget. But 08:26

but look, I mean, if if you give us 08:28

$1,000, we'll spend them if you know. 08:29

So, uh 08:32

>> that's really cool. I've not thought of 08:33

that before. Um but the idea of like 08:34

spawning multiple branches then just 08:36

picking the best one and being able to 08:37

do that in parallel. I just the human 08:38

brain doesn't naturally work like that. 08:40

We think sequentially. We can't. 08:42

>> It reminds me of this legend. I don't 08:44

know if this is actually true, but at 08:45

Apple when Steve was running it, he 08:46

would intentionally have teams doing 08:48

basically the same things and then see 08:49

which one did the best job. It's like 08:51

>> I've heard OpenAI does that now. 08:53

Interesting. Uh I I've heard that like 08:54

the Codeex project like had multiple 08:56

different different teams. So in the 08:58

literature, you'll see that uh small 09:00

small models sampled uh will beat larger 09:03

models. So Sonnet sampled is probably 09:07

better than than Opus companies. uh 09:09

haven't tried that where instead of 09:11

hiring one senior engineer you hire like 09:13

10 junior engineers and like give them 09:16

the same task and then 09:18

>> it's just very expensive with humans but 09:20

with LM it's relatively cheap you can 09:21

just do that do a hundred of them and 09:23

just pick the best one every time 09:25

>> so how are people using replet agent 09:26

like and and who are the people using it 09:28

>> you know it goes back to sort of our 09:30

early vision that if you make 09:32

programming easy uh then uh then more 09:33

and more people would want to would want 09:36

to do it actually this is one of the 09:38

first thing uh that that PG PG and I 09:39

sort of connected on um before we got 09:41

into YC actually PG um found us on 09:44

HackerNews and so we started this email 09:46

relationship um and so he he told me 09:48

there's like a super linear relationship 09:52

with how easy programming is versus how 09:54

many people would want to do it. The 09:56

optimizing function of rapid has always 09:58

been like just keep lowering the the 10:00

barrier to entry and and that's how you 10:02

grow users and you grow uh customers. 10:04

right now. Um, basically people from 10:07

kind of every walk of life, we've seen 10:11

users, product managers tend to be tend 10:13

to be a very a great um use case and 10:15

users for us and we've had customers uh 10:19

product managers uh who uh are able to 10:21

make significant impact on the business 10:25

without talking to engineers at all like 10:28

you know running AB tests or 10:30

optimizations or or things like that. I 10:31

mean it's empowering. I mean it gets us 10:33

to think about the really the seams 10:35

between these roles like you know 10:39

product manager, designer, engineer. We 10:40

actually just created a new new product 10:43

group and and and typically you know 10:45

product uh you know head of product is 10:47

have has a bunch of like product 10:49

managers reporting to them but we're 10:51

actually having uh this this group have 10:52

has designers, engineers and and PMs and 10:55

the the idea is they're all using AI all 10:57

the time to to prototype in some cases 10:59

go all the way to production. there 11:02

isn't this sort of waterfall model where 11:04

you know there's a lot of inefficiency 11:07

you know like communication problems 11:09

between these different teams they can 11:10

move incredibly fast and I think that 11:12

that starts to change how tech companies 11:13

work my experience of this was when 11:15

whenever I was working in a startup the 11:17

list of ideas and the backlog would 11:20

always would be infinitely long 11:21

basically and the bottleneck was always 11:23

engineering time and now I'm doing my 11:24

own personal projects I write my to-do 11:26

list or my idea list and then I get to 11:28

work and my the ideas just get done and 11:30

suddenly the bottleneck is like my 11:32

ability to have ideas 11:33

>> and it's just such a weird experience 11:35

looking at a to-do list. It's just empty 11:37

being like what do I do next? I've heard 11:38

from a team that has like a really large 11:40

uh replet deployment in their company 11:42

that their founders using Replet and 11:44

that's stressing the engineers out 11:47

because oh I did this in a weekend like 11:49

can't you do it in a 11:50

>> what do you guys show for yourself and 11:52

so are these previously technical people 11:53

or non techchnical people they're 11:55

building the first version are they 11:57

deploying that production or are they 11:59

giving to engineers and say hey build 12:00

this like what what do you see typically 12:02

>> we advise to like work with with 12:04

engineering but that doesn't always 12:06

happen I think It's um understandable 12:07

that a lot of PMs and designers want to 12:10

go straight to users. What we're seeing 12:12

is they go to like beta users and test 12:14

users and I think this works really well 12:16

but in some cases they just like put it 12:18

in production. Uh and so right now we're 12:20

having discussions with all these 12:22

companies especially the engineering 12:24

leaders are unhappy about this like 12:25

who's on call for these services you 12:28

>> who's responsible for the bug 12:30

>> who's responsible for it. There's a lot 12:32

of questions that are coming up. The 12:34

obvious answer to all these questions is 12:36

like agents are responsible. 12:37

>> So what's the limiting factor today? I 12:39

buy code a thing and I'm like yoloing it 12:41

into production. Like what are what do 12:43

the engineers typically object like? 12:45

What what's going wrong? 12:47

>> Security is is is a big one. Like um 12:48

LLMs are fallible like like humans are. 12:51

They tend to write some uh there's some 12:54

components that they do terribly at like 12:57

for example off 13:00

>> uh like they all kind of suck at it. all 13:01

use like you know old methods of of 13:04

assaulting and hashing and so that's 13:06

been that's been a big sticking point 13:09

and we've seen a lot of examples out 13:11

there right now of some catastrophes 13:14

luckily it hasn't been like there hasn't 13:16

been a major catastrophe I think it's 13:19

coming but uh there's been uh like solo 13:21

founders who would leak um you know API 13:24

keys or would make it really easy to get 13:26

around the login security protections 13:28

and there are a lot of tools out there 13:31

that are like really not trying to take 13:32

responsibility for that and saying, "Oh, 13:34

it's it's the user's problem, the user's 13:36

fault." For us, we think that as a 13:38

platform that is marketing uh for the 13:41

non-developer, you actually have a 13:44

responsibility. We're trying to take 13:46

away 13:48

uh some of the things that we think LMS 13:50

should not do today. Uh so O for 13:53

example, we have a built-in O. So if you 13:55

go to wrap and just say add o, it will 13:58

pull in an o component that we built 14:00

from scratch. It has capture on it. It 14:02

has all the security you know bells and 14:05

whistles. So you don't have to worry 14:07

about all of that. 14:09

>> We integrated with your database. You 14:10

have a user management portal on on your 14:12

site as much as possible. Building in 14:14

these components that are tricky. I 14:18

think payments is the other one. I don't 14:20

think you'd want the LLMs to kind of do 14:22

do the payments. 14:24

>> And they're not that different, right? 14:24

like the sort of checkout. You know, you 14:25

might have a one time checkout, you 14:27

might have subscription, pay as you go, 14:28

but there are not unlimited versions of 14:29

payments. There's like two or three or 14:31

four. 14:33

>> And this is the analogy is today humans 14:33

don't write their own version of 14:36

payments or their own version of O. They 14:37

use providers that have built these 14:39

components already. So, seems like we're 14:41

seeing the same thing play out and 14:42

that's the best way to do it. 14:43

>> Yeah, 100%. And then the other thing we 14:44

added uh we partnered with is a a great 14:46

security company called Samrip. And so 14:48

right now uh when you go to deploy a 14:50

replet app, we run a security scan. We 14:53

run a code security scan and we give you 14:56

like a report of warnings and errors and 14:58

things like that and the agents can try 15:01

to fix them for you. 15:02

>> Okay. And so security is one of the big 15:04

kind of bottlenecks to fully deploying 15:06

to production. I can imagine there must 15:08

be other things that coming down the 15:10

line like sort of scalability, you know, 15:11

looking for like N plus1 database 15:13

queries, uh performance bottlenecks. 15:14

What do you have in your mind like a 15:17

clear set of of blockers that need to be 15:18

overcome before this is truly like one 15:20

click to deploy? 15:22

>> Yeah, I I mean the big the biggest one 15:23

is just going to be humans like social 15:25

like you know there's just going to be 15:28

mistrust and I think it's just going to 15:30

have has has to play out. So you know 15:32

leaving that aside like enterprises has 15:34

to adapt to all of that stuff having 15:36

some way to also scan for scalability 15:38

like figuring out you know uh like doing 15:43

fuzzing or whatever it is or having some 15:45

kind of adversarial agent that's you 15:48

know trying to to to break your app I 15:50

think um you know that's one big thing I 15:53

think integrating with a company's 15:55

ecosystem so one thing we're adding is 15:57

the ability to bring in your design 15:58

system 16:00

>> that's cool 16:01

>> so in addition to kind of us providing 16:01

these components if you go to any 16:03

company they have a lot of these 16:05

components built out as raplet is 16:06

getting deployed into these large 16:08

companies how can we hook into their 16:10

their internal systems 16:12

>> this makes me think about the kind of 16:13

spectrum of these different coding 16:15

tools. Um, on one end of the spectrum, 16:17

you have kind of what I would call the 16:19

the power tools, the cursors, the wind 16:20

surf that let developers use this as a 16:22

way to amplify their efforts. And on the 16:25

full other end of the spectrum, you have 16:27

more of the like consumerf facing, hey, 16:29

if you want to make an app, like you can 16:31

now make an app. And it sounds like you 16:32

guys are kind of in the middle. Um, 16:34

you're helping companies get stuff done, 16:36

but you're doing it for people who don't 16:39

look like the tra traditional developer. 16:41

How do you see this world playing out? 16:43

Are there going to be like n different 16:44

tools or will we converge to one place 16:46

on that spectrum? 16:49

>> AGI is convergence obviously but but 16:50

leaving that aside, it's really hard to 16:52

plan for that world. I think that the 16:53

battle for how to incrementally make 16:56

product uh engineers more productive is 17:00

just it's an obvious one. The market 17:03

there is obvious. There's a lot of 17:05

companies going after it. Both the 17:06

application companies like cursor, the 17:09

underlying model companies are trying to 17:11

go there. I mean um you know claude code 17:12

is competing with cursor. Cursor uses a 17:15

cloud. I think it's a bit of a blood 17:17

bath there but it's the market is 17:19

obvious and the market is really large. 17:21

I would guess that there's going to be 17:22

more of a consolidation there. 17:25

>> Maybe it's not you know one but it's 17:27

probably two or three uh at best. I 17:29

think the sort of the the market we're 17:31

in is a lot larger. Uh so the 17:34

addressable humans is a lot more. You 17:37

know, we're talking about a billion, but 17:40

it could be more. Like really any 17:41

knowledge worker should be able to like 17:42

solve problems with software. My 17:45

conception of replet right now is the 17:46

what we want it to be is a universal 17:49

problem solver. It'll solve problems in 17:51

your personal lives or you know solve 17:53

problems in uh in your work and and all 17:54

of that. And so I think that that market 17:57

is uh will probably be like a little 17:59

more diverse and I think companies will 18:02

kind of figure out where um where they 18:03

slot in trying to solve uh autonomous 18:06

programming with the focus on the 18:09

non-engineer. 18:12

>> Uh we want you to not worry about 18:13

security, not worry about systems, not 18:15

worry about any of that. We want you to 18:18

really come in with your ideas to to 18:20

Replet and be uh an agents manager. And 18:22

we're trying to do it in a way that is 18:26

like as human as as possible and kind of 18:28

fits into into into the workflow. One 18:31

big difference between engineers and 18:33

sort of non-engineers, be it executives 18:36

or product managers, you're not on your 18:37

desk like 8 hours a day. Mobile is a big 18:40

part of that. We have like a really 18:43

great mobile app. And we're thinking 18:44

about this way of like ambient building. 18:46

Maybe you start an app on your uh in 18:48

your desktop. You go away with your 18:50

phone. You're in boring meeting. You get 18:51

a notification from the agent saying you 18:53

I'm done with this. Do you want 18:55

something else? You got texted. And so 18:56

we're trying to kind of build that 18:58

thing. 19:00

>> Yeah. A question I had um related to 19:01

that point is is about the user 19:04

interface. So for something like cursor 19:06

or wind surf, it's pretty obvious. The 19:08

primary UI element is like the code. You 19:10

you see code, you have a little chat 19:13

window, but primarily it's about diffs. 19:14

It's about kind of changes to code. And 19:16

for a tool like Replet, the primary 19:19

interface is like the graphical user 19:21

interface. You know, it's the it's the 19:23

buttons and the the wizzywig like you 19:25

see what you're building. 19:28

>> Exactly. And that's great for building 19:29

user interfaces, but when you're trying 19:30

to build more complicated sort of 19:32

logical flows, I found it a little bit 19:34

difficult cuz I couldn't there's no way 19:36

I couldn't see the code. I can't 19:37

visualize what's going on behind the 19:38

scenes. And it's sort of it's almost a 19:40

black box. Fast forwarding here, if 19:41

you're trying to build more complex 19:43

internal workflows, how do they how does 19:44

a product manager or an operations 19:46

manager at a big company visualize that 19:48

workflow and the kind of logical 19:50

branching that happens? 19:51

>> Yeah. So, if if you look back in the 19:52

history of computing, there was always 19:55

this vision of visual programming, it 19:57

never worked uh very well because 20:00

ultimately it's about Turing 20:03

completeness like these systems are not 20:04

universal computing devices. And now we 20:07

go to codegen. Obviously codegen is is 20:10

turing complete. Um but you're 20:12

interfacing with it primarily via 20:15

natural language. Natural language is 20:16

fuzzy. It's really hard to know whether 20:18

it's doing the right thing. I think the 20:20

synthesis of these two things is 20:22

probably coming where you are 20:24

interfacing with natural language but 20:27

you can instead of like just staring at 20:29

code uh there's maybe an interface or 20:31

like a different view on top of code. 20:34

You can imagine being able to do you 20:37

know small talk. 20:40

>> Yeah. So small talk is this uh 20:42

>> it's the first object-oriented um uh 20:44

programming um system and you know Alan 20:46

K would say it's like it is actually OP 20:50

where everything comes after it is not 20:53

um but the interesting thing about it in 20:55

small talk you can actually the way you 20:57

interact with code is not via files but 21:00

via objects via like logical objects and 21:01

so there's some kind of prior arc there 21:05

and I think the world we're headed in 21:07

where there's some kind of abstraction 21:08

over code that allows people to like 21:10

understand it. 21:12

>> Yeah, I think that's really interesting. 21:13

I there's some there's like an open 21:14

space there whether it's like pseudo 21:16

code is like looks like English but it's 21:17

a little bit more structured or it's a 21:19

visual drag and drop. I don't know. It's 21:21

>> Yeah, I think back to building products 21:22

with a team of engineers, designers, and 21:24

other folks. The interactions that I 21:27

would have as the like product lead with 21:29

those teams was verbal. It would be 21:31

written like abstracted ideas. We would 21:33

draw stuff on the whiteboard together. 21:36

We'd make system diagrams. We'd look at 21:37

the results of the app and we would test 21:40

it and point out like oh this thing is 21:42

broken that's too slow and it feels to 21:43

me like that sort of interface which is 21:45

very multimodal and very flexible is 21:48

probably the best end result like I 21:50

think we will get to something like that 21:52

where the author of products will be 21:54

doing that but the teams they're talking 21:56

to are not other humans they're agents 21:58

doing these things. Has has there been 22:00

an attempt in in sort of PM land to have 22:01

a little more formalism around 22:04

communication or 22:06

>> Yes. And I would not say it's been good. 22:08

The main thing is just like the PRD, 22:10

right? The product spec, right? And it's 22:12

just become, in my opinion, uh 22:14

oftentimes a performative 22:16

>> work artifact that you create just so 22:18

that you can have a thing to get your 22:21

promotion, right? If you're at a big 22:22

company. Uh but they're not actually 22:23

that useful. To me, the most useful 22:25

interactions are just these like 22:27

whiteboard conversations, right? What do 22:28

we wanted to do here? Oh, we got to 22:30

think about that. Oh, we didn't consider 22:31

this. Okay, let's re rethink this whole 22:33

thing. Like those sorts of 22:35

conversations. 22:36

>> AI can play a role in that as well. So, 22:36

I um you know this startup granola uh 22:38

that allows you to kind of record 22:42

meetings and they released this like 22:43

team version that like all the meetings 22:45

uh gets transcribed and and go there and 22:48

they have a mobile app right now where 22:50

like you can put on the table and like 22:51

it's trying. So I I was thinking maybe 22:53

we should go gr granola maximalism where 22:55

look you shouldn't fight the trend in 22:59

which companies are becoming 23:03

increasingly oral as opposed to like 23:04

written um and because like you know 23:06

people are talking on Slack people are 23:10

in meetings people like are 23:12

communicating via prompts with with 23:14

agents uh but you you would want a set 23:15

of AI tools that is actually like 23:18

creating that that that record and in 23:21

background that's like searchable and 23:24

organizable and and all of that. 23:26

>> Yeah. I wonder when we'll have our first 23:28

AI in like oral meetings. You know, you 23:29

you're jamming with your designer and 23:32

you the AI like chips in and says, 23:34

"Well, how about this idea?" 23:36

>> Yeah. And and this is where I think the 23:37

like dystopian view of AI or AI as like 23:40

taking all our jobs and all that is like 23:43

not correct. And I think the future of 23:46

work is more human, is more interactive, 23:48

is more multimodal, is more fun in my 23:51

opinion. 23:53

>> YC's next batch is now taking 23:54

applications. Got a startup in you? 23:56

Apply at y combinator.com/apply. 23:58

It's never too early and filling out the 24:01

app will level up your idea. Okay, back 24:04

to the video. All right, so so last time 24:07

we chatted, uh, you had launched Replet 24:08

agent and things were growing really 24:10

crazy. I imagine that has continued. 24:12

Any, anything you can share there? Maybe 24:14

soon we'll we'll share some numbers but 24:16

uh since replet agent launch we're 24:18

growing 45% 24:20

compound monthly uh average. 24:22

>> These are like metrics that we tell YC 24:24

companies during the batch when they 24:27

have a base of like no users to try to 24:28

achieve and you're doing this at larger 24:30

scale. 24:32

>> Yes. But you know it put a lot of strain 24:32

strain on the company and our systems 24:34

were still relatively relatively small. 24:36

I I feel like it can get to to your head 24:38

and you can start optimizing for for the 24:40

wrong thing. It's very easy in AI to 24:43

increase AR while users are not happy 24:47

>> because they're spending a lot more and 24:50

like not getting the results and in some 24:52

cases maybe shouldn't grow that that 24:55

fast because like you'd want users to 24:57

get a better experience for less less 24:59

money and so it's one thing that we try 25:01

not to obsess we actually don't have AR 25:03

goals at at Replet we have like more 25:05

product goals retention goals just like 25:07

other methods 25:09

>> yeah because the the sort of bad pattern 25:10

with some AI companies you grow topline 25:11

revenue very very quickly the churn is 25:13

like approaching 100% and eventually 25:15

that just catches up 25:17

>> and the gross margins are horrible too 25:18

and so it's like 25:19

>> you know the more growth you have the 25:21

worse the company's doing financially 25:23

and so 25:24

>> so how do investors see this space you 25:25

you must have talked with a bunch can 25:27

they tell the difference 25:29

>> it's all kind of a blur for them because 25:30

investors when they I mean I'm just 25:32

going to generalize here but when they 25:35

start looking at the space they'll use 25:36

everything for three minutes and 25:38

everything for three minutes looks the 25:40

same so I I think I think it will it 25:41

will start to clarify and these products 25:44

will start to uh I suppose to converge 25:47

diverge more with the different focuses 25:50

in the areas we're talking about. I 25:52

think over the next year it'll be like a 25:54

little more clearer but I think a lot of 25:57

them are are just like you know super 25:59

when we talk to them they're super 26:01

confused. They don't understand these 26:02

systems. They don't understand where 26:04

they're going. 26:05

>> I think that's a great segue into um the 26:06

the sort of technology underlying some 26:08

of these tools. I'd love to dig in a 26:10

little bit deeper. We've we've seen some 26:11

announcements from cursor and wind surf 26:13

about how they're kind of layering 26:15

whether it's um Claude or Gemini or um 26:16

the OpenAI models with then their own 26:20

layers on top the fast apply APIs stuff 26:23

like that fast apply models I guess can 26:26

you give us an overview of how it works 26:28

at Replet? 26:29

>> Yeah, a lot of it is you're patching 26:30

problems with the underlying frontier 26:32

models. Yeah. 26:34

>> So the reason fast supply was important 26:35

because none of the models uh were doing 26:37

very well at diffs. Can you explain what 26:40

what supply is? 26:42

>> When you're trying to like edit a file 26:44

as an LLM, the best thing to do is to 26:46

like uh create a create a diff. Uh but 26:48

these models are actually not very good 26:51

at creating diffs. They're actually not 26:53

very good at countering the lines in the 26:54

source code. So they get confused about 26:56

a lot of these things. So for a while, a 26:58

lot of companies were just like 27:01

rewriting the entire entire file. 27:02

>> And so diff for for nontechnical users 27:04

is sort of like remove these three lines 27:06

and inject these other three lines 27:08

instead. like find and replace almost. 27:10

>> That's right. That's right. And so you 27:12

needed some way to um make the model 27:14

generate some something of a diff, 27:19

>> but it's actually not good enough to 27:21

merge it. And so you need another model 27:23

to actually do the merge. So 27:25

>> instead of rewriting like it's a 27:27

thousand line file, just like output the 27:29

entire thousand lines, 27:30

>> it's very slow. It's very expensive. So 27:31

you prompt the model to be as lazy as 27:34

possible. 27:36

>> But in that case, it's really hard to 27:37

apply. Okay, so you need another model 27:38

that's like doing the application. Okay, 27:40

>> you can train a model or you can use 27:42

Gemini Flash or some of these smaller 27:43

smaller models. And so in some cases, 27:46

you need to to train a model or 27:48

fine-tune a model to do a better job at 27:50

it. In other cases, you just also patch 27:52

a bunch of other models to to do it. 27:54

It's engineering. It's engineering as 27:56

opposed to like research, I would say. 27:58

>> And I noticed um you guys don't expose 28:01

the underlying model to the user. With 28:04

other tools like cursor and wind surf, 28:06

there's a drop down. It's like I want to 28:07

you know let's see what Gemini thinks of 28:09

this problem. You don't do that. Why is 28:11

that? 28:13

>> A a big part of our research efforts is 28:13

in eval 28:16

>> and I think this is like an underrated 28:18

part of like you know co AI coding. So 28:20

we spend a ton of time evaling new 28:23

models writing evals generating evals 28:26

just data crunching trying to figure out 28:29

what the users are are getting or are 28:31

feeling. We built a lot of systems try 28:33

to understand um how these systems are 28:35

are performing the moment a new frontier 28:38

model uh lands we are evaluating almost 28:41

immediately. Jim and I for example like 28:44

uh you know couple months ago were 28:47

making the rounds really great model 28:48

really awesome at like oneshotting in 28:51

some cases better than claude agentic 28:52

work wasn't at like a tool calling at 28:55

you know some of the things like that 28:58

but users just see the hype and they're 29:00

like oh like yeah give me Gemini that's 29:03

>> how much notice do you have do they like 29:06

drop it on you and you like scrambling 29:07

or do you have like days or weeks before 29:09

that to to try to you know test it out 29:10

>> well we we have good partnerships with 29:12

these companies we We have a great 29:14

partnership with Google. We have a great 29:15

partnership with uh with Anthropic, even 29:17

with OpenAI. We have a close 29:19

relationship. So, a lot of them give us 29:20

give us a heads up, give us some early 29:22

checkpoints and we we try we we kind of 29:24

play with all of them. And in the case 29:27

of anthropic, we're always like the 29:28

first day kind of launching because we 29:30

end up building. A lot of times we're 29:32

sort of anticipating where they're 29:34

going. 29:35

>> Yeah. 29:35

>> Because you kind of like you can tell 29:36

like 3.5 3.7 there there's some 29:37

direction you can tell like where 4.0 is 29:41

going to land. So you start architecting 29:43

the systems but you know ultimately a 29:45

lot of our engineering efforts still 29:47

infrastructure like the the distributed 29:48

network file system that snapshot based 29:51

network file system like it took us 2 29:54

years to build like there's nothing off 29:56

the shelf to do that a lot of the 29:58

security stuff is really really 30:00

difficult like replet is one of the few 30:01

places in the world where you can get 30:04

like a virtual machine in the cloud um 30:05

you know by just like creating an 30:08

account and that's like a it's really 30:09

hard to kind of run this and protect 30:12

against this. You have crypto miners, 30:14

you have all sorts of stuff like that. 30:16

And Replet also uses um uh Nyx OS under 30:18

the hood. Nixos is a like fully 30:22

declarative also transactional uh 30:24

operating system generator as it were. 30:27

Uh we have like this multi- terabyte uh 30:29

hard drive in all the different regions 30:32

that we have compute. um that that that 30:34

cached all the packages in the in the 30:37

world and that gets attached to every 30:39

container and and again all of that I 30:40

mean I keep coming back to this idea of 30:43

transactionality like you would want the 30:45

system to be fully functional. You would 30:47

want it to be um you would want it to be 30:49

safe in order to like experiment with 30:52

something go back do the sampling with 30:54

with agents. So really a lot of the a 30:56

lot of the work is just like the 30:59

engineering of this infrastructure and 31:01

it's like not as a parent not as sexy as 31:03

sort of like we're trained this model 31:05

here's the um but but I think this is 31:06

where you can build like a uh a bit of a 31:09

lead. 31:13

>> Yeah. I when VCs talk about moes uh the 31:13

thing that I translate it to in my head 31:16

is what is the compounding advantage 31:18

that you might be able to achieve. 31:20

Right. It's not really a defensive mode. 31:21

It's more just I'm ahead and by being 31:23

ahead in some vector it allows me to 31:25

continue to move faster. That's right. 31:27

>> And it sounds like this is a great 31:29

example of that. 31:30

>> True modes are often not obvious until 31:30

like decades perhaps many decades into 31:32

into the company. Like you wouldn't know 31:34

what like Netflix's mode. Uh but 31:36

obviously they have one like Disney 31:39

tried to like compete with them and 31:40

everyone was like you know down on on on 31:42

Netflix but turned out they have a mode 31:44

that is this uh content production 31:45

system that they that they built. 31:48

>> So Amad you you started with the mission 31:49

of like making it easier for people to 31:51

learn to code. You've accomplished a lot 31:53

of that. Now you're pushing the envelope 31:55

of what it even means to code. I've got 31:57

young kids. I want them to be productive 32:00

creators in the world. What should I 32:02

tell them to do? Should they learn to 32:04

code? What does that even mean? 32:06

>> Look, I think if you want to go the like 32:08

professional software developer route, I 32:11

think getting a computer science degree 32:13

and like learning fundamentals makes 32:15

sense. But if you want to be a a 32:17

creator, if you want to be a generalist 32:20

in this world, I don't think it's 32:22

necessary anymore to learn to code in 32:23

the more like traditional ways of 32:25

learning to code. I think you pick it up 32:27

by osmosis almost like like go go go to 32:29

replet and you know and start using it 32:32

and at some point you're going going to 32:34

run into some issue where you're going 32:36

to have to look at code or you're going 32:37

to have to look at logs just by being 32:39

resourceful having to Google around and 32:41

all that. You'll start picking it up. 32:42

And by the way, this is how our 32:44

generation kind of learned how to code, 32:45

right? when you were talking I'm like 32:46

that's what I did. 32:48

>> Yeah. And somehow uh it just became very 32:49

industrial and very formal over the 32:52

years like the way we made web apps is 32:55

like you just like start a notepad with 32:57

a you know HTML file and now you have to 32:59

like learn like webpack or whatever you 33:01

know it's like so um I think the future 33:03

of work is not really clear what is you 33:06

you know we can sort of like have some 33:09

like you know idea of like where where 33:11

the world is headed and so with with my 33:14

children I want them to have like as 33:15

broad uh based knowledge as possible. Uh 33:18

I want them to be as generalist as 33:21

possible. I want them to be as 33:23

generative as possible like being able 33:25

to like create a lot of ideas because 33:27

once the making of things gets easier. 33:29

The bottleneck goes back to to to like 33:32

how many ideas you can have. 33:35

>> Um and so I I I wouldn't put learning to 33:37

code as the top thing. I would put learn 33:40

to make things. Learn to make things 33:43

with code. Learn to make things with 33:45

video. learn to make like anything with 33:47

AI. 33:49

>> And so what do you think happens with 33:49

SAS generally if we're very soon able to 33:52

say create me a version of Google 33:54

Calendar or clone Docyign? What happens 33:57

to SAS? Do you think 33:59

>> we have stories today of um a lot of 34:00

people replacing hundreds of thousands 34:04

of dollars worth of SAS um with Replet? 34:07

The other day I heard a story from 34:10

someone who um you know they got quoted 34:12

$150,000 for a piece of software. He 34:15

went and made it in replet sold it to 34:18

his employer for $32,000 34:20

cost him $400. Companies that have a 34:23

platform developer community around it 34:28

and and plug-in ecosystem and things 34:30

like that. I think those are safe. 34:32

>> You're not going to be able to vibe code 34:34

Salesforce. I think the the vertical SAS 34:35

is is in trouble and I think it's my 34:39

guess it's already probably showing in 34:42

some of the metrics. 34:44

>> Okay, last question. What advice would 34:45

you give to um founders starting out 34:47

right now? 34:50

>> You know the best advice is what we what 34:51

you what you actually pointed out 34:53

earlier is work on the edge of what's 34:55

possible. 34:57

uh because uh one evolution of of AI or 34:59

models will make your business valuable 35:03

and suddenly you're first on market. I 35:05

find it rare to see founders that are 35:07

actually like sitting down trying to 35:10

actually predict the future and and and 35:11

maybe that's something that was you know 35:14

ill advised in the past but I think 35:17

right now trying to actually figure out 35:19

where things are headed is very very 35:21

important uh skill to have. So like make 35:23

some prediction uh figure out like how 35:26

to create like a crappy product that 35:28

would get better immediately as you 35:32

switch switch the model. I mean computer 35:34

use is a is is a great example. 35:36

>> Oh it's been an absolute pleasure. Thank 35:38

you so much for coming on. 35:39

>> Pleasure. Thank you. 35:40

>> See you next time. Thanks. 35:41

[Music] 35:45

– English Lyrics

📚 Don’t just sing along to "" – train your ears, learn vocab, and become a language pro in the app!

By

Viewed

34,984

Language

English

Learn this song

Lyrics & Translation

[English]

I think the like dystopian view of AI or

AI as like taking all our jobs and all

of that is like not correct. And I think

the future of work is more human, is

more interactive, is more multimodal.

Once the making of things gets easier,

the bottleneck goes back to to to like

how many ideas you can have. I wouldn't

put learning to code as the top thing. I

would put learn to make things. Learn to

make things with code. Learn to make

things with video. learn to make like

anything with AI.

[Music]

>> Welcome back to the breakdown. I'm Dave.

This is Tom. And today we're joined by

Amjad Msad. Amjad is founder and CEO of

Replet, which was a YC 2018 company. Uh

welcome to the show.

>> Thank you.

>> So today, uh we want to talk about kind

of your beginnings in helping people

learn to code. what you're doing now in

replacing a lot of that work and then

where we see the future going in terms

of how people make things. So, Replet

started um I guess in 2016, you were in

YC in 2018. You started out as a tool to

make it really easy for people learning

to program to set up development

environments purely on the web. Um but

now you're really taking off in AI

assisted coding. Tell us tell us the

latest. Yeah. So, um the mission has

always been to make programming more

accessible. Uh then we sort of uh you

know after YC we sort of updated it to

be a little more ambitious. We started

talking about a billion software

developers and it just sounded absurd at

the time and

>> this is all prel right.

>> Yeah. Prelm. There was a moment of time

I don't know if you remember that but

like 20 in the like 2015 AI hype. There

was like a bit of a NLP natural language

hype as well. There's a bunch of AI

companies that ended up being just like

humans behind the scene. They all went

broke, but um there was like a a glimpse

of potentially doing NLP on code. So, we

had an idea that, you know, it's it's

probably coming. I I even had it in my

in my seed deck that like at some point

like we'll collect enough data to like

train models. It wasn't until like GPG2

in 2020 that I felt like it was was

going to be possible. So we you know we

had we had built all these um primitives

like the development environment the

sort of hosting environment all the

other stuff around it. Uh and I felt

like if if we just like added um like AI

agents like it will be able to

orchestrate all the stuff and it'll be

it'll be really great. agents were like

every time we tried them. Uh I think the

first time we tried it like 2021 didn't

work, 22 it didn't work and then 2024

like early early in the in the year we

just felt like it's getting close.

>> Even GPT40 could like be coherent for

like um 2 minutes somehow just ended up

doing a big bat. The company was not

doing super well.

>> Yeah. was I was about to ask was there

like a bet the company moment where it's

like we were teaching people to code now

we're no longer doing that now we're

going to enable everyone to build apps

>> yeah yeah yeah we had grown the company

quite a bit um and we were burning way

too much money uh and we decided to do a

layoff

>> and so we we um cut like perhaps 50

people and then like another like 15 20

people left and so we were like less

than half of the company and so I just

put everything on on replet agents I

just felt like this is the thing it has

to work. It felt like it was close.

Yeah. I don't know. I was just like burn

the boat boats kind of moments. It's

just like we got to do this. I mean,

it's I felt like this was the this was

the the thing that would make the

company work. And honestly, like if claw

3.5 hadn't come out mid US building

agent, we would have probably failed.

>> Yeah.

>> Because um like I said, like GPT40 would

like stay coherent for 2 3 minutes. Quad

3.5 was the first one that like could

work for like you know five to 10

minutes and actually actually work and

the code generations

>> and that's sort of the approach that I

think is most successful for startups

kind of building on the very edge of

what is possible you know you start out

on a mission it's the techn is not quite

there yet you start building and you

sort of you you skate where the puck is

going you know it sort of catches up

with you

>> yeah and you you want to keep keep doing

that

>> totally

>> yeah I remember you came to speak at YC

like 6 months ago and right after you

had launched this and at the time you

said we're very far from fully automated

software development.

>> Do you still feel that way?

>> No, not at all. I mean, I I feel like

I've, you know, every time I make a sort

of some kind of prediction, it feels

like bold and like, you know, but I've

been I've been consistently wrong and

like uh the way things are moving uh is

a lot faster. Just the level of

autonomy. Uh like I said, like maybe 3.5

was like 5 to 10 minutes. Like 3.7

perhaps can get up to like four five

minutes to an hour. The uh 4.0 No, in

the in the system card and opus, they

said they made it work for seven hours.

>> Wow.

>> 7 hours.

>> That's incredible.

>> That's always been the limiting factor

for making agents is like, can they stay

coherent? Can is the context length

actually useful to reason over. If an

LLM can work for for seven hours, that's

basically like a human worker.

>> Yeah. And presumably they're working at

much faster speed than a human would do.

So they're kind of completing perhaps a

week's work in those seven hours. The

only thing that I think is is missing

and a big limiting factor for actually

automating a lot of work is computer

use.

>> Computer use kind of sucks.

>> You know, I'm sure you've you've tried

it. And this is the difference between

Replet agent really uh giving it one

prompt going all all the way in an app

versus having to kind of babysit it a

little bit and like test with it.

>> And we had a company in the last batch

called browser use that's um that's

working browser automation.

>> Browser Use is great. And another one

called pig which is doing the same for

kind of windows desktops.

>> And uh I think advice I would give

founders today is um taking either

browser use or windows automation with

pig and trying to apply that into

enterprise into vertical

>> into a vertical industry.

>> The moment this technology worked those

two companies are just going to

>> totally and I think I think we're like

weeks or perhaps single digit months

away from it working really really well

and so now is the time to get started

using those technologies.

>> Absolutely 100%. And this is this is

what we're focused on. So you know

replet agent v1 to v2 was a huge jump in

autonomy. Uh v3 is the most autonomous

thing. So we're already working on it

and um the interesting thing for us is

the underlying technology and how it

would enable autonomy. There are like a

few important things. One is uh uh you

know being transactional. The ability to

roll back is is very important. So you

you would want you'd want it to be safe

for agents in the same way that gits

made it safe for human programmers to

kind of experiment and create branches

whatever you want the same thing for for

agents like if an agent kind of messes

up a DB migration it should be able to

roll back and also it should be able to

like sample

>> uh across different different paths. Uh

I think this is very very important for

autonomy. If you look at uh like for

example when when uh Anthropic publishes

their Swedbench score they publish one

with that sampling and one with sampling

and it goes from 70% to 80%.

>> And so this is the idea that you sort of

you spawn multiple agents each one will

take a a shot at it and then you figure

out which one works and and choose that

that branch effectively. We built a

infrastructure that is um uh fully

transactional and moves in lock step

meaning the file system is a snap

snapshot fail based file system the DB

is a snapshot based uh u DB and so a as

you're we're making commits uh to the

entire system including the virtual

machine as you're going uh and so what

we can do is also we can we can fork it

and branch it out. So if if we had

computer use that's working well you can

you can just like right now what they do

with the sampling is they do they have

some kind of judge that does ranking

>> which you know it's not a true verifier

a true verifier is a is a is a test

right and a computer use test so you you

can sample out and really pick the best

branch that's actually working

>> that's incredible

>> and and repeat that on and on and

reliability gets really really high

>> so fast forward six or 12 months you're

not spawning one agent you're spawning

is Is it five or 10 or a million? Like

how

>> I think this is where it's going to get

interesting, which is you'd want to give

the user the ability to set compute

budgets. I I think we're starting to see

that with um with the O style models of

like here's here's how much budget. But

but look, I mean, if if you give us

$1,000, we'll spend them if you know.

So, uh

>> that's really cool. I've not thought of

that before. Um but the idea of like

spawning multiple branches then just

picking the best one and being able to

do that in parallel. I just the human

brain doesn't naturally work like that.

We think sequentially. We can't.

>> It reminds me of this legend. I don't

know if this is actually true, but at

Apple when Steve was running it, he

would intentionally have teams doing

basically the same things and then see

which one did the best job. It's like

>> I've heard OpenAI does that now.

Interesting. Uh I I've heard that like

the Codeex project like had multiple

different different teams. So in the

literature, you'll see that uh small

small models sampled uh will beat larger

models. So Sonnet sampled is probably

better than than Opus companies. uh

haven't tried that where instead of

hiring one senior engineer you hire like

10 junior engineers and like give them

the same task and then

>> it's just very expensive with humans but

with LM it's relatively cheap you can

just do that do a hundred of them and

just pick the best one every time

>> so how are people using replet agent

like and and who are the people using it

>> you know it goes back to sort of our

early vision that if you make

programming easy uh then uh then more

and more people would want to would want

to do it actually this is one of the

first thing uh that that PG PG and I

sort of connected on um before we got

into YC actually PG um found us on

HackerNews and so we started this email

relationship um and so he he told me

there's like a super linear relationship

with how easy programming is versus how

many people would want to do it. The

optimizing function of rapid has always

been like just keep lowering the the

barrier to entry and and that's how you

grow users and you grow uh customers.

right now. Um, basically people from

kind of every walk of life, we've seen

users, product managers tend to be tend

to be a very a great um use case and

users for us and we've had customers uh

product managers uh who uh are able to

make significant impact on the business

without talking to engineers at all like

you know running AB tests or

optimizations or or things like that. I

mean it's empowering. I mean it gets us

to think about the really the seams

between these roles like you know

product manager, designer, engineer. We

actually just created a new new product

group and and and typically you know

product uh you know head of product is

have has a bunch of like product

managers reporting to them but we're

actually having uh this this group have

has designers, engineers and and PMs and

the the idea is they're all using AI all

the time to to prototype in some cases

go all the way to production. there

isn't this sort of waterfall model where

you know there's a lot of inefficiency

you know like communication problems

between these different teams they can

move incredibly fast and I think that

that starts to change how tech companies

work my experience of this was when

whenever I was working in a startup the

list of ideas and the backlog would

always would be infinitely long

basically and the bottleneck was always

engineering time and now I'm doing my

own personal projects I write my to-do

list or my idea list and then I get to

work and my the ideas just get done and

suddenly the bottleneck is like my

ability to have ideas

>> and it's just such a weird experience

looking at a to-do list. It's just empty

being like what do I do next? I've heard

from a team that has like a really large

uh replet deployment in their company

that their founders using Replet and

that's stressing the engineers out

because oh I did this in a weekend like

can't you do it in a

>> what do you guys show for yourself and

so are these previously technical people

or non techchnical people they're

building the first version are they

deploying that production or are they

giving to engineers and say hey build

this like what what do you see typically

>> we advise to like work with with

engineering but that doesn't always

happen I think It's um understandable

that a lot of PMs and designers want to

go straight to users. What we're seeing

is they go to like beta users and test

users and I think this works really well

but in some cases they just like put it

in production. Uh and so right now we're

having discussions with all these

companies especially the engineering

leaders are unhappy about this like

who's on call for these services you

>> who's responsible for the bug

>> who's responsible for it. There's a lot

of questions that are coming up. The

obvious answer to all these questions is

like agents are responsible.

>> So what's the limiting factor today? I

buy code a thing and I'm like yoloing it

into production. Like what are what do

the engineers typically object like?

What what's going wrong?

>> Security is is is a big one. Like um

LLMs are fallible like like humans are.

They tend to write some uh there's some

components that they do terribly at like

for example off

>> uh like they all kind of suck at it. all

use like you know old methods of of

assaulting and hashing and so that's

been that's been a big sticking point

and we've seen a lot of examples out

there right now of some catastrophes

luckily it hasn't been like there hasn't

been a major catastrophe I think it's

coming but uh there's been uh like solo

founders who would leak um you know API

keys or would make it really easy to get

around the login security protections

and there are a lot of tools out there

that are like really not trying to take

responsibility for that and saying, "Oh,

it's it's the user's problem, the user's

fault." For us, we think that as a

platform that is marketing uh for the

non-developer, you actually have a

responsibility. We're trying to take

away

uh some of the things that we think LMS

should not do today. Uh so O for

example, we have a built-in O. So if you

go to wrap and just say add o, it will

pull in an o component that we built

from scratch. It has capture on it. It

has all the security you know bells and

whistles. So you don't have to worry

about all of that.

>> We integrated with your database. You

have a user management portal on on your

site as much as possible. Building in

these components that are tricky. I

think payments is the other one. I don't

think you'd want the LLMs to kind of do

do the payments.

>> And they're not that different, right?

like the sort of checkout. You know, you

might have a one time checkout, you

might have subscription, pay as you go,

but there are not unlimited versions of

payments. There's like two or three or

four.

>> And this is the analogy is today humans

don't write their own version of

payments or their own version of O. They

use providers that have built these

components already. So, seems like we're

seeing the same thing play out and

that's the best way to do it.

>> Yeah, 100%. And then the other thing we

added uh we partnered with is a a great

security company called Samrip. And so

right now uh when you go to deploy a

replet app, we run a security scan. We

run a code security scan and we give you

like a report of warnings and errors and

things like that and the agents can try

to fix them for you.

>> Okay. And so security is one of the big

kind of bottlenecks to fully deploying

to production. I can imagine there must

be other things that coming down the

line like sort of scalability, you know,

looking for like N plus1 database

queries, uh performance bottlenecks.

What do you have in your mind like a

clear set of of blockers that need to be

overcome before this is truly like one

click to deploy?

>> Yeah, I I mean the big the biggest one

is just going to be humans like social

like you know there's just going to be

mistrust and I think it's just going to

have has has to play out. So you know

leaving that aside like enterprises has

to adapt to all of that stuff having

some way to also scan for scalability

like figuring out you know uh like doing

fuzzing or whatever it is or having some

kind of adversarial agent that's you

know trying to to to break your app I

think um you know that's one big thing I

think integrating with a company's

ecosystem so one thing we're adding is

the ability to bring in your design

system

>> that's cool

>> so in addition to kind of us providing

these components if you go to any

company they have a lot of these

components built out as raplet is

getting deployed into these large

companies how can we hook into their

their internal systems

>> this makes me think about the kind of

spectrum of these different coding

tools. Um, on one end of the spectrum,

you have kind of what I would call the

the power tools, the cursors, the wind

surf that let developers use this as a

way to amplify their efforts. And on the

full other end of the spectrum, you have

more of the like consumerf facing, hey,

if you want to make an app, like you can

now make an app. And it sounds like you

guys are kind of in the middle. Um,

you're helping companies get stuff done,

but you're doing it for people who don't

look like the tra traditional developer.

How do you see this world playing out?

Are there going to be like n different

tools or will we converge to one place

on that spectrum?

>> AGI is convergence obviously but but

leaving that aside, it's really hard to

plan for that world. I think that the

battle for how to incrementally make

product uh engineers more productive is

just it's an obvious one. The market

there is obvious. There's a lot of

companies going after it. Both the

application companies like cursor, the

underlying model companies are trying to

go there. I mean um you know claude code

is competing with cursor. Cursor uses a

cloud. I think it's a bit of a blood

bath there but it's the market is

obvious and the market is really large.

I would guess that there's going to be

more of a consolidation there.

>> Maybe it's not you know one but it's

probably two or three uh at best. I

think the sort of the the market we're

in is a lot larger. Uh so the

addressable humans is a lot more. You

know, we're talking about a billion, but

it could be more. Like really any

knowledge worker should be able to like

solve problems with software. My

conception of replet right now is the

what we want it to be is a universal

problem solver. It'll solve problems in

your personal lives or you know solve

problems in uh in your work and and all

of that. And so I think that that market

is uh will probably be like a little

more diverse and I think companies will

kind of figure out where um where they

slot in trying to solve uh autonomous

programming with the focus on the

non-engineer.

>> Uh we want you to not worry about

security, not worry about systems, not

worry about any of that. We want you to

really come in with your ideas to to

Replet and be uh an agents manager. And

we're trying to do it in a way that is

like as human as as possible and kind of

fits into into into the workflow. One

big difference between engineers and

sort of non-engineers, be it executives

or product managers, you're not on your

desk like 8 hours a day. Mobile is a big

part of that. We have like a really

great mobile app. And we're thinking

about this way of like ambient building.

Maybe you start an app on your uh in

your desktop. You go away with your

phone. You're in boring meeting. You get

a notification from the agent saying you

I'm done with this. Do you want

something else? You got texted. And so

we're trying to kind of build that

thing.

>> Yeah. A question I had um related to

that point is is about the user

interface. So for something like cursor

or wind surf, it's pretty obvious. The

primary UI element is like the code. You

you see code, you have a little chat

window, but primarily it's about diffs.

It's about kind of changes to code. And

for a tool like Replet, the primary

interface is like the graphical user

interface. You know, it's the it's the

buttons and the the wizzywig like you

see what you're building.

>> Exactly. And that's great for building

user interfaces, but when you're trying

to build more complicated sort of

logical flows, I found it a little bit

difficult cuz I couldn't there's no way

I couldn't see the code. I can't

visualize what's going on behind the

scenes. And it's sort of it's almost a

black box. Fast forwarding here, if

you're trying to build more complex

internal workflows, how do they how does

a product manager or an operations

manager at a big company visualize that

workflow and the kind of logical

branching that happens?

>> Yeah. So, if if you look back in the

history of computing, there was always

this vision of visual programming, it

never worked uh very well because

ultimately it's about Turing

completeness like these systems are not

universal computing devices. And now we

go to codegen. Obviously codegen is is

turing complete. Um but you're

interfacing with it primarily via

natural language. Natural language is

fuzzy. It's really hard to know whether

it's doing the right thing. I think the

synthesis of these two things is

probably coming where you are

interfacing with natural language but

you can instead of like just staring at

code uh there's maybe an interface or

like a different view on top of code.

You can imagine being able to do you

know small talk.

>> Yeah. So small talk is this uh

>> it's the first object-oriented um uh

programming um system and you know Alan

K would say it's like it is actually OP

where everything comes after it is not

um but the interesting thing about it in

small talk you can actually the way you

interact with code is not via files but

via objects via like logical objects and

so there's some kind of prior arc there

and I think the world we're headed in

where there's some kind of abstraction

over code that allows people to like

understand it.

>> Yeah, I think that's really interesting.

I there's some there's like an open

space there whether it's like pseudo

code is like looks like English but it's

a little bit more structured or it's a

visual drag and drop. I don't know. It's

>> Yeah, I think back to building products

with a team of engineers, designers, and

other folks. The interactions that I

would have as the like product lead with

those teams was verbal. It would be

written like abstracted ideas. We would

draw stuff on the whiteboard together.

We'd make system diagrams. We'd look at

the results of the app and we would test

it and point out like oh this thing is

broken that's too slow and it feels to

me like that sort of interface which is

very multimodal and very flexible is

probably the best end result like I

think we will get to something like that

where the author of products will be

doing that but the teams they're talking

to are not other humans they're agents

doing these things. Has has there been

an attempt in in sort of PM land to have

a little more formalism around

communication or

>> Yes. And I would not say it's been good.

The main thing is just like the PRD,

right? The product spec, right? And it's

just become, in my opinion, uh

oftentimes a performative

>> work artifact that you create just so

that you can have a thing to get your

promotion, right? If you're at a big

company. Uh but they're not actually

that useful. To me, the most useful

interactions are just these like

whiteboard conversations, right? What do

we wanted to do here? Oh, we got to

think about that. Oh, we didn't consider

this. Okay, let's re rethink this whole

thing. Like those sorts of

conversations.

>> AI can play a role in that as well. So,

I um you know this startup granola uh

that allows you to kind of record

meetings and they released this like

team version that like all the meetings

uh gets transcribed and and go there and

they have a mobile app right now where

like you can put on the table and like

it's trying. So I I was thinking maybe

we should go gr granola maximalism where

look you shouldn't fight the trend in

which companies are becoming

increasingly oral as opposed to like

written um and because like you know

people are talking on Slack people are

in meetings people like are

communicating via prompts with with

agents uh but you you would want a set

of AI tools that is actually like

creating that that that record and in

background that's like searchable and

organizable and and all of that.

>> Yeah. I wonder when we'll have our first

AI in like oral meetings. You know, you

you're jamming with your designer and

you the AI like chips in and says,

"Well, how about this idea?"

>> Yeah. And and this is where I think the

like dystopian view of AI or AI as like

taking all our jobs and all that is like

not correct. And I think the future of

work is more human, is more interactive,

is more multimodal, is more fun in my

opinion.

>> YC's next batch is now taking

applications. Got a startup in you?

Apply at y combinator.com/apply.

It's never too early and filling out the

app will level up your idea. Okay, back

to the video. All right, so so last time

we chatted, uh, you had launched Replet

agent and things were growing really

crazy. I imagine that has continued.

Any, anything you can share there? Maybe

soon we'll we'll share some numbers but

uh since replet agent launch we're

growing 45%

compound monthly uh average.

>> These are like metrics that we tell YC

companies during the batch when they

have a base of like no users to try to

achieve and you're doing this at larger

scale.

>> Yes. But you know it put a lot of strain

strain on the company and our systems

were still relatively relatively small.

I I feel like it can get to to your head

and you can start optimizing for for the

wrong thing. It's very easy in AI to

increase AR while users are not happy

>> because they're spending a lot more and

like not getting the results and in some

cases maybe shouldn't grow that that

fast because like you'd want users to

get a better experience for less less

money and so it's one thing that we try

not to obsess we actually don't have AR

goals at at Replet we have like more

product goals retention goals just like

other methods

>> yeah because the the sort of bad pattern

with some AI companies you grow topline

revenue very very quickly the churn is

like approaching 100% and eventually

that just catches up

>> and the gross margins are horrible too

and so it's like

>> you know the more growth you have the

worse the company's doing financially

and so

>> so how do investors see this space you

you must have talked with a bunch can

they tell the difference

>> it's all kind of a blur for them because

investors when they I mean I'm just

going to generalize here but when they

start looking at the space they'll use

everything for three minutes and

everything for three minutes looks the

same so I I think I think it will it

will start to clarify and these products

will start to uh I suppose to converge

diverge more with the different focuses

in the areas we're talking about. I

think over the next year it'll be like a

little more clearer but I think a lot of

them are are just like you know super

when we talk to them they're super

confused. They don't understand these

systems. They don't understand where

they're going.

>> I think that's a great segue into um the

the sort of technology underlying some

of these tools. I'd love to dig in a

little bit deeper. We've we've seen some

announcements from cursor and wind surf

about how they're kind of layering

whether it's um Claude or Gemini or um

the OpenAI models with then their own

layers on top the fast apply APIs stuff

like that fast apply models I guess can

you give us an overview of how it works

at Replet?

>> Yeah, a lot of it is you're patching

problems with the underlying frontier

models. Yeah.

>> So the reason fast supply was important

because none of the models uh were doing

very well at diffs. Can you explain what

what supply is?

>> When you're trying to like edit a file

as an LLM, the best thing to do is to

like uh create a create a diff. Uh but

these models are actually not very good

at creating diffs. They're actually not

very good at countering the lines in the

source code. So they get confused about

a lot of these things. So for a while, a

lot of companies were just like

rewriting the entire entire file.

>> And so diff for for nontechnical users

is sort of like remove these three lines

and inject these other three lines

instead. like find and replace almost.

>> That's right. That's right. And so you

needed some way to um make the model

generate some something of a diff,

>> but it's actually not good enough to

merge it. And so you need another model

to actually do the merge. So

>> instead of rewriting like it's a

thousand line file, just like output the

entire thousand lines,

>> it's very slow. It's very expensive. So

you prompt the model to be as lazy as

possible.

>> But in that case, it's really hard to

apply. Okay, so you need another model

that's like doing the application. Okay,

>> you can train a model or you can use

Gemini Flash or some of these smaller

smaller models. And so in some cases,

you need to to train a model or

fine-tune a model to do a better job at

it. In other cases, you just also patch

a bunch of other models to to do it.

It's engineering. It's engineering as

opposed to like research, I would say.

>> And I noticed um you guys don't expose

the underlying model to the user. With

other tools like cursor and wind surf,

there's a drop down. It's like I want to

you know let's see what Gemini thinks of

this problem. You don't do that. Why is

that?

>> A a big part of our research efforts is

in eval

>> and I think this is like an underrated

part of like you know co AI coding. So

we spend a ton of time evaling new

models writing evals generating evals

just data crunching trying to figure out

what the users are are getting or are

feeling. We built a lot of systems try

to understand um how these systems are

are performing the moment a new frontier

model uh lands we are evaluating almost

immediately. Jim and I for example like

uh you know couple months ago were

making the rounds really great model

really awesome at like oneshotting in

some cases better than claude agentic

work wasn't at like a tool calling at

you know some of the things like that

but users just see the hype and they're

like oh like yeah give me Gemini that's

>> how much notice do you have do they like

drop it on you and you like scrambling

or do you have like days or weeks before

that to to try to you know test it out

>> well we we have good partnerships with

these companies we We have a great

partnership with Google. We have a great

partnership with uh with Anthropic, even

with OpenAI. We have a close

relationship. So, a lot of them give us

give us a heads up, give us some early

checkpoints and we we try we we kind of

play with all of them. And in the case

of anthropic, we're always like the

first day kind of launching because we

end up building. A lot of times we're

sort of anticipating where they're

going.

>> Yeah.

>> Because you kind of like you can tell

like 3.5 3.7 there there's some

direction you can tell like where 4.0 is

going to land. So you start architecting

the systems but you know ultimately a

lot of our engineering efforts still

infrastructure like the the distributed

network file system that snapshot based

network file system like it took us 2

years to build like there's nothing off

the shelf to do that a lot of the

security stuff is really really

difficult like replet is one of the few

places in the world where you can get

like a virtual machine in the cloud um

you know by just like creating an

account and that's like a it's really

hard to kind of run this and protect

against this. You have crypto miners,

you have all sorts of stuff like that.

And Replet also uses um uh Nyx OS under

the hood. Nixos is a like fully

declarative also transactional uh

operating system generator as it were.

Uh we have like this multi- terabyte uh

hard drive in all the different regions

that we have compute. um that that that

cached all the packages in the in the

world and that gets attached to every

container and and again all of that I

mean I keep coming back to this idea of

transactionality like you would want the

system to be fully functional. You would

want it to be um you would want it to be

safe in order to like experiment with

something go back do the sampling with

with agents. So really a lot of the a

lot of the work is just like the

engineering of this infrastructure and

it's like not as a parent not as sexy as

sort of like we're trained this model

here's the um but but I think this is

where you can build like a uh a bit of a

lead.

>> Yeah. I when VCs talk about moes uh the

thing that I translate it to in my head

is what is the compounding advantage

that you might be able to achieve.

Right. It's not really a defensive mode.

It's more just I'm ahead and by being

ahead in some vector it allows me to

continue to move faster. That's right.

>> And it sounds like this is a great

example of that.

>> True modes are often not obvious until

like decades perhaps many decades into

into the company. Like you wouldn't know

what like Netflix's mode. Uh but

obviously they have one like Disney

tried to like compete with them and

everyone was like you know down on on on

Netflix but turned out they have a mode

that is this uh content production

system that they that they built.

>> So Amad you you started with the mission

of like making it easier for people to

learn to code. You've accomplished a lot

of that. Now you're pushing the envelope

of what it even means to code. I've got

young kids. I want them to be productive

creators in the world. What should I

tell them to do? Should they learn to

code? What does that even mean?

>> Look, I think if you want to go the like

professional software developer route, I

think getting a computer science degree

and like learning fundamentals makes

sense. But if you want to be a a

creator, if you want to be a generalist

in this world, I don't think it's

necessary anymore to learn to code in

the more like traditional ways of

learning to code. I think you pick it up

by osmosis almost like like go go go to

replet and you know and start using it

and at some point you're going going to

run into some issue where you're going

to have to look at code or you're going

to have to look at logs just by being

resourceful having to Google around and

all that. You'll start picking it up.

And by the way, this is how our

generation kind of learned how to code,

right? when you were talking I'm like

that's what I did.

>> Yeah. And somehow uh it just became very

industrial and very formal over the

years like the way we made web apps is

like you just like start a notepad with

a you know HTML file and now you have to

like learn like webpack or whatever you

know it's like so um I think the future

of work is not really clear what is you

you know we can sort of like have some

like you know idea of like where where

the world is headed and so with with my

children I want them to have like as

broad uh based knowledge as possible. Uh

I want them to be as generalist as

possible. I want them to be as

generative as possible like being able

to like create a lot of ideas because

once the making of things gets easier.

The bottleneck goes back to to to like

how many ideas you can have.

>> Um and so I I I wouldn't put learning to

code as the top thing. I would put learn

to make things. Learn to make things

with code. Learn to make things with

video. learn to make like anything with

AI.

>> And so what do you think happens with

SAS generally if we're very soon able to

say create me a version of Google

Calendar or clone Docyign? What happens

to SAS? Do you think

>> we have stories today of um a lot of

people replacing hundreds of thousands

of dollars worth of SAS um with Replet?

The other day I heard a story from

someone who um you know they got quoted

$150,000 for a piece of software. He

went and made it in replet sold it to

his employer for $32,000

cost him $400. Companies that have a

platform developer community around it

and and plug-in ecosystem and things

like that. I think those are safe.

>> You're not going to be able to vibe code

Salesforce. I think the the vertical SAS

is is in trouble and I think it's my

guess it's already probably showing in

some of the metrics.

>> Okay, last question. What advice would

you give to um founders starting out

right now?

>> You know the best advice is what we what

you what you actually pointed out

earlier is work on the edge of what's

possible.

uh because uh one evolution of of AI or

models will make your business valuable

and suddenly you're first on market. I

find it rare to see founders that are

actually like sitting down trying to

actually predict the future and and and

maybe that's something that was you know

ill advised in the past but I think

right now trying to actually figure out

where things are headed is very very

important uh skill to have. So like make

some prediction uh figure out like how

to create like a crappy product that

would get better immediately as you

switch switch the model. I mean computer

use is a is is a great example.

>> Oh it's been an absolute pleasure. Thank

you so much for coming on.

>> Pleasure. Thank you.

>> See you next time. Thanks.

[Music]

Key Vocabulary

Start Practicing

Vocabulary

Meanings

code

/koʊd/

B1

noun
- source code or computer program instructions

verb
- to write computer code

build

/bɪld/

A2

verb
- to construct or create something

learn

/lɜrn/

A1

verb
- to gain knowledge or skill

make

/meɪk/

A1

verb
- to create or produce

idea

/aɪˈdiə/

A1

noun
- a thought or concept

agent

/ˈeɪdʒənt/

B2

noun
- an AI program or software agent

model

/ˈmɑdl/

B2

noun
- an AI or machine learning model

program

/ˈproʊgræm/

B1

noun
- a computer program

verb
- to write a program

work

/wɜrk/

A1

noun
- employment or task

verb
- to function or operate

human

/ˈhjuːmən/

A1

adjective
- relating to people

noun
- a person

interactive

/ˌɪntərˈæktɪv/

B1

adjective
- allowing interaction

future

/ˈfjutʃər/

A2

noun
- time yet to come

easy

/ˈizi/

A1

adjective
- not difficult

access

/ˈækses/

A2

noun
- permission to use

verb
- to reach or use

possible

/ˈpɑsəbl/

A2

adjective
- able to happen

grow

/groʊ/

A2

verb
- to develop or increase

need

/nid/

A1

verb
- to require

change

/tʃeɪndʒ/

A2

noun
- alteration

verb
- to make different

problem

/ˈprɑbləm/

A2

noun
- a difficulty

way

/weɪ/

A1

noun
- a method or manner

Are there any new words in “” you don’t know yet?

💡 Hint: code, build… Jump into the app and start learning now!

Key Grammar Structures

Coming Soon!

We're updating this section. Stay tuned!

Related Songs