Reinventing the Developer Terminal with Warp Co-Founder and CEO Zach Lloyd

27m
For decades, the developer terminal has remained largely unchanged. But for Warp CEO and co-founder Zach Lloyd, reinventing this core tool is the key to unlocking AI agents for coding, debugging, and automating the entire development process. Zach joins Elad Gil to discuss how seeing this opportunity for innovation led to Warp’s agentic terminal for developers. Zach talks about the phases of software development, from coding by hand to the current "develop by prompt" era, and the coming age of fully automated development. Plus, Zach and Elad explore the deep philosophical questions around intelligence versus consciousness in AI models, and what it would take to believe a computer program is truly aware.

Sign up for new podcasts every week. Email feedback to show@no-priors.com

Follow us on Twitter: @NoPriorsPod | @Saranormous | @EladGil | @zachlloydtweets | @warpdotdev

Chapters:

00:00 – Zach Lloyd Introduction

00:32 – AI, Intelligence, and Consciousness

06:55 – What Warp Does

07:38 – Benefits of the Terminal as a Launchpoint

08:27 – Features Driving Warp’s Adoption

09:12 – Zach’s View of the Coding Market

10:27 – Evolution of Coding Development

12:45 – Importance of Senior Engineer Expertise

14:11 – Future of Security and Other Dev Tools

22:22 – Why Zach Focused on the Terminal

23:52 – The Future of the Model Layer

25:36 – What Zach’s Excited About in the AI Dev World

27:18 – Conclusion

Press play and read along

Runtime: 27m

Transcript

Speaker 1 Today at Nopriors, I'm joined by Zach Lloyd, the co-founder and CEO of Warp, a terminal product and AI tool for developers that allows you to do different sorts of coding applications.

Speaker 1 Prior to Warp, Zach was at Google and he also started another company called Selfmade.

Speaker 1 We talk about AI dev tooling, but we also end up talking about human consciousness and how can you tell if an AI is actually sentient. Zach, welcome to NeuPriors.

Speaker 2 I'm excited to be here. Thanks for having me.

Speaker 1 So you have a master's degree in the philosophy of science. Yeah.

Speaker 1 And if you're going to take a very different lens and abstract out of the coding world and all the things that we tend to think about every day,

Speaker 1 how do you think about this societally in terms of this big wave of AI that's hitting us right now? And where do you think some of these really big societal impacts will be?

Speaker 2 The way I think about like the advances are it's kind of like we are

Speaker 2 distilling intelligence. And so

Speaker 2 there are, I think, people who consider what's happening, it's like they're like, are we recreating people in some way? Are we recreating consciousness? But it's not that.

Speaker 2 It's actually what's fascinating to me is how much intelligence you can get out of just like next token prediction. Yeah.

Speaker 2 And like, what does that say about the way that our minds work?

Speaker 2 Something I'm always thinking about is like, is this how our brains are working? Are we, are we doing next token prediction? And I don't think so.

Speaker 1 Like, I think that there's going to be some further ai unlock there's actually a book about this that i think is really interesting called blindsight okay the sci-fi book where they separate consciousness from intelligence yeah and basically humanity meets a space-faring civilization or civilization that's overstating it a space faring intelligent being that's not conscious yep And what are the implications of that?

Speaker 1 And how do you think about that? And how do you communicate with that? Are you basically saying that that's kind of your view of AI right now?

Speaker 2 I think that's what it is at the moment. It's like we've distilled intelligence or something that like

Speaker 2 from like an instrumentalist or like functional perspective is able to do things that we recognize as intelligence, but it's totally mechanistic.

Speaker 2 And I don't think anyone who's looking at this thinks that there's any aspect of consciousness to it. I think that's like a very confusing thing for people.
Yeah.

Speaker 1 Because the classical test for this was known as the Turing test, right? Totally.

Speaker 1 And so the idea there is if you can't tell the difference between interacting with a computer and a person, then that computer effectively has met the intelligence bar of a person.

Speaker 1 But in our interactions with this type of AI, we're having very, in some cases, it feels like very deep conversations. We're asking about relationships.

Speaker 1 We're asking about all sorts of aspects of our own lives. And it's giving very cogent answers that make a lot of sense.

Speaker 1 And so there's this interesting separation of,

Speaker 1 again, consciousness and intelligence.

Speaker 2 Right.

Speaker 1 Is that how you interpret it?

Speaker 2 That is how I interpret it. And the Turing test is passed.
What's crazy to me is like, we just passed it and no one seemed to care.

Speaker 1 So what do you think is the next test or what is the right test? Like, how do we actually test for consciousness? For consciousness?

Speaker 2 God.

Speaker 2 I don't have a good, I mean, that's a super deep philosophical question that I don't have a really good answer for.

Speaker 1 I mean, it should be mechanistic, right? The Turing test was very mechanistic. Yeah.

Speaker 1 And I mean, there was other tests we had before of what we would consider super intelligence, right? Can it beat us at chess? Then it'll be super smart. Yeah.
It beat us at Go.

Speaker 1 Can it beat us at different thing? Video games, et cetera. You know, like we keep coming up with new tests that these things pass.
And then we keep saying, well, it's not conscious.

Speaker 2 What would you want to see

Speaker 2 from something which is like running a computer program to make you believe that it had consciousness? Are you looking for certain behavioral characteristics? Sure.

Speaker 2 Or is the problem that if you really understand the mechanism by which it's working, that you will never credit it?

Speaker 2 as being conscious, which is crazy because humans, I mean, at least my belief is that it's also there's a mechanistic thing that's happening.

Speaker 1 Yeah, you're running some form of math in your brain.

Speaker 1 And it may also just be like matrix math and some sort of like series of compounded functions, which is basically all you're doing in a neural net, right?

Speaker 2 Right.

Speaker 1 You're just recursively compounding functions.

Speaker 2 That makes sense.

Speaker 1 So, you know, it's an interesting question because if you look at memory as an example,

Speaker 1 is memory a predicate for consciousness?

Speaker 2 Not really, right? There's people who've lost the ability.

Speaker 2 Yeah, exactly.

Speaker 1 And these

Speaker 1 models basically are brought up. They are fed a stream of tokens.
They output a stream of tokens and they're shut down.

Speaker 1 And so it's an interesting question of,

Speaker 1 you know, is there some sort of other

Speaker 1 modules that are missing here that would allow us to think of it as a conscious thing? Because it does reasoning.

Speaker 2 It definitely does reasoning.

Speaker 1 It does interpretation of language. It does synthesis of language and ideas and knowledge.

Speaker 2 I mean, I have a close friend who's doing a PhD in philosophy, and he now says that conversing with GPT-5 is

Speaker 2 better than conversing with his professor. Really? I was just joking.
No, that's what he says to me. He's like, GPT-5 gets it.
Like, he's like writing his dissertation. He's like,

Speaker 2 and that's crazy.

Speaker 2 But we don't credit it for consciousness. And I actually think rightfully so

Speaker 1 because like... So what do you think is missing?

Speaker 2 I think people would start to give it more credit for consciousness if there was more of a like a feedback loop where if there was more of a sensory experience that was tied to it as opposed to just like what do you mean by sensory experience?

Speaker 2 Probably we're gonna like, I would imagine the first things we're gonna credit is being more conscious are a little bit more robot-like, honestly, where you have some sort of like live input from the world that you're reacting to but again it's going to have the same problem where it's like as long as we know what it's doing we're very unlikely to attribute true consciousness to it which isn't fair but like i actually don't know how we will know when there is a conscious thing yeah because it does raise um interesting ethical questions yeah because the moment an ai is actually conscious yeah if you're doing certain things to manipulate it to hurt it in certain way you know you're starting to get into these odd ethical straits.

Speaker 2 Totally.

Speaker 2 I do think, though, that

Speaker 2 some people,

Speaker 2 this distinction isn't a thing that they recognize. And like the way that you read stories, and we actually had this happen with Warp, where there was a person who

Speaker 2 thought that Warp's AI was like... you know, sentient or conscious in some way, had like a very strong reaction to it, which makes sense.

Speaker 2 It's like if you don't know the like mechanistic underpinnings, then already people think of it as like being

Speaker 2 that happened at Google very early.

Speaker 1 It happened at Google three, four years ago, if you remember. I do remember this dude who I think they were using Mina or one of these really early ChatGPT like internal things.
Yeah.

Speaker 1 Before Google launched anything and ChatGPT came out, there was internal versions, right, at Google and other places. Yeah.

Speaker 1 And,

Speaker 1 you know, this person thought that the AI was conscious.

Speaker 2 Understandably. So yeah.

Speaker 1 Yeah, it's a very interesting question. Yeah.
You've worked at Google. You've run companies before, you've started companies before.
You're now working on Warp.

Speaker 1 Can you describe what Warp does and how it's different from other tools or companies in the world?

Speaker 2 Yep. So Warp is what we call an agentic development environment.
It's grown out of the terminal. The basic concept of the app at this point is it's a platform for telling your computer what to do.

Speaker 2 You can sort of tell it in terminal commands, which is Warp's original product, or you can tell it... in English.
And if you tell it in English, it launches an agent.

Speaker 2 And the agents can do all manner of development tasks, whether it's coding or setting up a project project or debugging while your server is crashing.

Speaker 2 And so it's a very like horizontal, general purpose, and I think unique interface for developing with agents.

Speaker 1 And so a lot of the other coding tools out there are either just kind of a web interface or they're like a cognition or there's things like Cursor and others where they're like an IDE as a starting point.

Speaker 1 Obviously Anthropic and Cloud have their own approach. What do you think is the benefit of doing the terminal? and starting there as sort of the launch point for a lot of these products.

Speaker 2 The competitors are typically like VS Code clones. They all have have a sort of IDE-centric approach.

Speaker 2 Or if you're taking a terminal-centric approach like Cloud Code, the most common thing is it's just like a pure text-based terminal app.

Speaker 2 The advantage of being at Warps Layer is like you get the command line interface, but we're the outer app. And so we can do things with the developer experience in the UX.

Speaker 2 Like we can have editing features where we think it's appropriate. We can build like a code review interface.
And so we have complete control, but still the terminal first approach.

Speaker 1 Yeah. And you folks have been growing really well.
Like, so you're close to a million MAUs. You're doing something like a million in new revenue every seven to 10 days.
Yeah. Like outstanding growth.

Speaker 1 That's cool. Are there specific features or use cases or things that are really driving this adoption?

Speaker 2 Yeah, I think the biggest thing was moving into the coding market, to be honest. Like for a long time in Warp's history, we were kind of known as the AI terminal, which is cool.

Speaker 2 And like we supported terminal use cases really well. Like how do I do this thing with Docker or Git? But the action is in coding.

Speaker 2 And like most most development activity one way or the other is touching a code base.

Speaker 2 And so we really started to inflect when we launched a great coding agent, which was like three, four months ago, honestly. So that's been the biggest change.

Speaker 1 And how do you think about the different parts of the coding market? There's vibe coding, there's professional coding. Like

Speaker 1 there's all just one thing? Are these separable things?

Speaker 2 I think it's pretty separable. So for Warp, our target is pro-developers building software that's economically meaningful.

Speaker 2 So we really want to focus on actually people who are using agents to build kind of hard apps, apps that might like go into your Mac doc or be pinned as a Chrome tab, as opposed to vibe coded apps where I think it's more of a long tail play.

Speaker 2 And so I do think, by the way, it's awesome that anyone can code at this point, but I think if you look at where most of the value is in the software market,

Speaker 2 it's not in those long tail apps. It's in like a relatively small number of apps that are super heavily used.
And that's my background.

Speaker 2 Like I, you know, I worked on one of those apps, Google Sheets, and just like, I have a lot of passion in terms of helping people build real apps. It's much harder, by the way.
Like,

Speaker 2 I think it's relatively straightforward at this point for a good agent to like,

Speaker 2 you know, with relatively few prompts, build like a web app. It's much harder to apply these agents successfully to pro code bases.

Speaker 2 So that's where we're focused.

Speaker 1 So I guess one really interesting macro question for me is where is all all this heading? And if you look at it, ChatGPT launched in November of 25, excuse me, November of 22.

Speaker 1 So three years ago or so.

Speaker 1 At the time, there was predictions that AI would take over the world and we'd be running down the light cone and within five years, like everything will change.

Speaker 1 And human activity would be subsumed by AI. And there's the old saying in technology that less happens than you think in three years and more happens than you think in five years.

Speaker 1 And as you think

Speaker 1 forward in terms of all these different tools and all these different use cases and vibe coding versus professional coding and the role of a software developer, where do you think we are in two, three years?

Speaker 2 Yeah, so the way that I'm thinking of it is there's sort of three phases here. For most of my career, we were in like the world of develop by hand is how I talk about it.

Speaker 2 So my workflow then was like I would open up a code editor. I would find files I want to change.
I would type some code. I'd have some assistive features.

Speaker 2 And then I would go back to the terminal and I would type commands to build that code.

Speaker 2 And I think we're switching away from that to something like develop by prompt, where I start most of the coding tasks that I do right now by prompting an agent, and that agent does some work.

Speaker 2 And I think there's a third phase, which is like automated development. Honestly, I think that's like

Speaker 2 kind of the bigger market here, and why people are so excited about this space: you can actually use these agents to automate some parts of the software development process.

Speaker 2 And so, that's like, you know, cognition does that. We're moving into this space, cursor has background agents.

Speaker 2 The rate at which this stuff will happen is like not super clear to me, actually. Like the

Speaker 2 most recent iterations of the models, in my opinion, were not as big of a step change as like,

Speaker 2 for instance, when like Sonnet 4 came out, that was a really big step change in coding capability. I think there's going to be a mix of interactive and automated pieces of development for a while.

Speaker 2 I would guess like, I don't know. It's so hard to know.

Speaker 2 Like, I think within a couple of of years, you'll have everyone working by prompt and you'll have some slice of development tasks that are just like in the background, like a server error comes in or a new ticket, a user report comes in,

Speaker 2 something is automatically done. But I don't think it's going to be everything.

Speaker 1 I'd be very surprised. So you don't think there's a point at which, you know, all of coding activity just becomes agents doing it.

Speaker 1 And then there's like a human who's kind of giving high-level directions, like a product manager kind of thing, or an engine manager.

Speaker 2 Maybe. I mean, honestly, maybe.

Speaker 2 I think that, I think it'd be silly for us not to like build the infrastructure to enable that. I just don't, I don't know the timeframe, but I do think we're going towards something like that.

Speaker 2 What I really don't think, though, is like engineering expertise is going to become devalued. Sure.

Speaker 2 So I think it's actually, in the short term, at least, it's more important to know what you're doing as an engineer than it ever has been.

Speaker 1 And why do you say that? Is it because you need to correct errors that the agents are making? Is it because things may be architected in a way that isn't scalable? Is it something that's not?

Speaker 2 Totally. So it's like the agents, you can think of them kind of as junior engineers.
So

Speaker 2 if you didn't have someone who was senior watching them, you end up in a situation where these agents will make code that creates bugs. It could create security issues.

Speaker 2 It can cause your code base to become really unmaintainable.

Speaker 2 And so there's actually like a premium right now on these senior engineer skills where you can architect, where you can review code, you can make sure the system doesn't degrade.

Speaker 2 And so, again, I would be, if I were like early in my CS career, I would be racing towards building that expertise.

Speaker 2 Where you don't want to be, I think, is like someone who is just like perpetually in the junior engineer state, because I do think that's at risk.

Speaker 1 And then how do you think about different security tools?

Speaker 1 Does that, so for example, there's tools like Socket or SNCC or others who are basically looking at, you know, whether code has or open source packets or other things have vulnerabilities in them, or they're looking at different aspects of security holes for code in general.

Speaker 1 Do you think that just becomes part of these coding tools? Or do you think there will always be this sort of standalone companies? I'm just sort of curious how the overall.

Speaker 2 It's an awesome question. So I think tools like that become more important.
I think anything that does like either automatic security analysis or automatic verification.

Speaker 2 I think actually like languages like Rust, things that have stronger guarantees around safety by default, where you don't need to rely on like a human reviewer become more valuable.

Speaker 2 Whether those things like

Speaker 2 get integrated or bundled into the coding agents, I actually don't have a strong take on. I'm curious if you have a take, but no, I think that the actual fundamental problem becomes more important.

Speaker 1 Yeah. What do you think is bundled?

Speaker 1 What do you think it's bundled? Like what sorts of tools do you think? Because there's this whole world of dev tools.

Speaker 2 Yep.

Speaker 1 And there's a security aspect of it, but there's lots of others. There's design-related things.
There's, you know, there's a huge spectrum.

Speaker 2 Yep.

Speaker 1 What do you think just gets, just becomes part of coding tooling?

Speaker 2 I think there's going to be a class of tools where you sort of start from the front end.

Speaker 2 This would be things like Lovable or Bolt or Replit or maybe even Figma Make if you're coming from the design side. And you'll have like an all-in-one platform for

Speaker 2 build an app or even like, honestly, like build a business, like put payments in it. It's like, it's kind of like the evolution of like...

Speaker 2 either a Shopify storefront or like WordPress or Squarespace or something like that. So I think that's all going to be bundled.

Speaker 2 More on like the core pro developer side, I can't tell if it's going to be a world of like mcps and integrations and like all these tools sort of interplay that's one approach or it's going to be more like there's enough alpha and like you put all of these things together and i think warp is a little bit more like this like we're trying to build a single pane of glass for instance for doing like local agents and remote agents yeah and if you get a way better developer experience through the bundling, I think that that approach could win.

Speaker 2 But I don't know. Like MCP, I I think, is a pretty valuable approach as well.
But it's not perfect because you end up like with this

Speaker 2 sort of second-hand data coming into all these tools.

Speaker 1 Yeah, it's really interesting because if you look at different industries, early in the industry, things tend to be fragmented often, not always.

Speaker 1 And then late in the evolution of an industry, things get bundled. And then when there's a technology disruption, things debundle again, and you have point apps and then they start bundling.
Yeah.

Speaker 1 And that's just kind of like the cycle of technology in some sense.

Speaker 2 Some things that I think that are vertical right now, like I actually

Speaker 2 think will, like, for instance, like Agentic code review, take that. To me, that should be part of a like holistic agentic development platform, not so much a standalone thing.

Speaker 2 So I think some of these verticalized apps, there's just going to be, if you've, if you've gone through the trouble of building like a really, really excellent coding agent, which Warp has invested a ton in this, that coding agent should be reviewing code.

Speaker 2 And it would be weird to plug in some other thing that needs to relearn all the context, the rules, the coding conventions. So I think there are definitely some things that will be bundled.

Speaker 1 Makes sense. Yeah, Shriya and my team has put together this matrix of companies versus features in the coding market.
And there's a lot of these sort of like single feature companies.

Speaker 1 And it almost feels like all these things should consolidate into a small number of players over time, just as they iterate through the product. I think so.

Speaker 2 And I think the core technology is like the harness. So the thing that sits around the model.
And like, I know the model companies are also investing heavily in this. And then it's like the context.

Speaker 2 And so if you have the rich context in your system, you're going to find a lot of vertical applications where like, I think security checking is an interesting one. Code review is definitely one.

Speaker 2 Anything CI related, you're probably not going to want to use a bunch of different systems.

Speaker 1 How do you think about it in the context? If I look at other historical technology shifts, the operating system or the platform often subsumes the biggest apps into itself.

Speaker 1 So for example, Microsoft OS.

Speaker 2 Yeah.

Speaker 1 Eventually they just bundled Windows on top of it. And those were the four main apps that are being used the most on Windows.

Speaker 1 And similarly, gaming was the other big app. So that's why they started Xbox and Microsoft.

Speaker 1 If you look at Google and Vertical Search, they eventually integrated all the vertical searches into Google directly.

Speaker 2 Totally.

Speaker 1 And so in the context of AI, one could argue that if the foundation model companies follow the same approach, they should bundle or at least attempt to bundle some of the biggest use cases.

Speaker 1 The clearest big use case today is code. Totally.
And we already know that Cloud or Anthropic has launched Cloud Code.

Speaker 1 Open AI

Speaker 1 almost bought Windsurf. It always had early coding stuff.

Speaker 1 Microsoft, which we know are building some of their own models, obviously have GitHub and Copilot and all that. And so do you think eventually those become some of the fiercest players in this market?

Speaker 1 Or how do you kind of view forward-looking shifts in the market and where some of this functionality goes?

Speaker 2 I mean, I do. And I think they're clearly trying to do that playbook right now where they are seeing like, okay.

Speaker 2 If you consider them platforms and they're looking at what are the most valuable applications that are being built on top of the tokens, They are moving aggressively into coding with Cloud Code and Codex, which definitely is like, as a startup, is a little bit scary for sure.

Speaker 2 The question is, like,

Speaker 2 do they have like that distribution advantage that

Speaker 2 say Microsoft had? Where if everyone's using Windows or everyone's coming to Google for the front door, I think it's pretty easy already to add on the first-party app for in place of the ecosystem.

Speaker 2 I don't know that that exact same dynamic holds for for coding right now there like there's the front door is kind of like

Speaker 2 honestly it's still like a native app that someone downloads on their computer so that'd be the terminal or the IDE and kind of at the moment yes I think controlling that is actually like a really interesting front door the other front door which I feel like honestly they're not executing that great is GitHub where all the source code lives yeah that would be like the locus of doing all this stuff that I think makes the most sense But right now, it's a weird dynamic where like we have people who are like running cloud code within Warp and Warp is sort of the outer app in that situation.

Speaker 2 So I don't know.

Speaker 2 You know, the other thing that I hope happens from our perspective is that there's a lot of competition at the model layer and that the sort of like intelligent tokens become a bit more of a commodity.

Speaker 2 Right now, the models have a bunch of sort of like pricing power because like there is a real delta between using the frontier model and using like the open source model.

Speaker 2 But if at some point the models are good enough where coding is sort of, I think of it as like solved, it's like good enough, you don't need to be using the frontier model, then it's like, yeah, maybe they have an advantage just from brand and scale.

Speaker 2 And but I think the advantage is not as entrenched as something where it's like literally the front door, like Facebook or Google or Windows provides those other patterns.

Speaker 1 That's a really good insight in terms of the way that you launch an activity or application then drives what you use.

Speaker 1 And so the hard part is often switching people off that.

Speaker 1 And that's one of the reasons I think people think OpenAI has a strong competitive position in the consumer world is because it's a default behavior for a lot of people right now is just start ChatGPT and use it for something, which is different from like the model layer where there's more switching.

Speaker 2 Like, I think a consumer ChatGPT has a huge advantage.

Speaker 2 Once that behavior is default, even if like cloud is maybe better, I don't know if it is or not. Like everyone knows ChatGPT.

Speaker 2 I don't know if you saw OpenAI Dev Day yesterday, but they're clearly doing this platform play within ChatGPT now, where it's like you have apps within ChatGPT.

Speaker 2 I'm sure they'll use that data to sort of subsume or take over whatever the best first-party integrations are. So they're definitely doing that on consumer.

Speaker 2 For developer, I don't know that that same dynamic is there.

Speaker 1 What made you decide to focus on terminals? So, you know,

Speaker 1 we started talking years ago when you first started doing Warp. Yep.

Speaker 1 And even then, I I think you had really interesting ideas about how to rethink the terminal and how to use that as a launching point for all sorts of things.

Speaker 1 Could you explain that thinking and how it's evolved over time?

Speaker 2 Yeah, so the basic insight or thing that got me excited about Warp to begin with is like, you have this tool that is pretty much a daily use tool for every developer. It's that and the code editor.

Speaker 2 And the terminal itself is something that, you know, really hadn't changed much in the last 40 years. It's a tool also where it's like, if you get good at it, you can really get a lot done.

Speaker 2 If you use it, it like works across all these different parts of software development, not just code writing. On the flip side, from my perspective, not a good product.

Speaker 2 Just like hard to learn, hard to use, hard to remember commands, super intimidating, and just like kind of like a gatekeeping vibe around it as well, in my opinion.

Speaker 2 And so the original concept with Warp was like, Let's build a better product there and see if people will like using it. The business concept has evolved a ton.

Speaker 2 Like the original business concept was like building a collaboration platform, which is like we've just changed our model to be an agent platform because it's like way more demand for that than a collaboration platform around the terminal.

Speaker 2 But the sort of core insight that this is an important tool, it's crazy. It's actually kind of invalidated through all these agentic things that are very terminal first.

Speaker 1 You know, one thing that you mentioned I thought was interesting is that at some point the model layer may commoditize in terms of its coding abilities. Yeah.

Speaker 1 How far on that SM tip do you think we are? How close to that do you think we are?

Speaker 2 God, I don't know. I think that the increasingly the limit that we see is context.

Speaker 2 And like the reasoning capabilities of the models are pretty impressive. The problem is like

Speaker 2 understanding an entire code base or understanding sources outside of the code or literally just understanding user intent are challenging problems.

Speaker 2 I still think there's probably much more to do on the model side, but I don't know is a short answer.

Speaker 1 And do you think that from a model capabilities perspective, we've hit a point where, to your point, it feels like certain aspects of the models are slowing down in terms of the benefits or outcomes of further investment of certain types at least?

Speaker 2 I think so. Like if you take like Sonnet 4 to 4.5, and we're big partners with Anthropic.
They have great models.

Speaker 2 That was like a

Speaker 2 few percentage point increase on SuiBench for us. And

Speaker 2 we've invested a decent amount to be one of the top agents on Sweet Bench.

Speaker 2 And

Speaker 2 when

Speaker 2 we went from three, seven to four, it was a much more significant boost. So I, again, that's like not,

Speaker 2 I don't know what that means about the total underlying trends. I think something with GPT-5 was somewhat similar, like certainly an upgrade.
And GPT-5, I think, is actually pretty much on par.

Speaker 2 It has a different like feel to it and higher latency, but it didn't feel to me like as much of a step change as some of the upgrades before.

Speaker 1 Yeah, makes sense. What other areas of the AI dev world are you excited about?

Speaker 2 I am, um, so I'm really excited about not just like the interactive piece of agents, the way most people are working today, but what can you do if you can program against these agents?

Speaker 2 And so for instance,

Speaker 2 It's like if you have a version of Warp that's like headless, for instance, you can put it in CI and you can start to do crazy things where it's like, okay, every time someone updates the code, make sure the documentation stays up to date.

Speaker 2 It's like, that's very annoying for a developer. And so allowing developers to automate parts of their job that they don't like doing, I think is like a big capability.

Speaker 2 And then from a business perspective, just like automation is a better place to be than productivity enhancement.

Speaker 2 Like one of the challenges with our business, I think with a lot of the coding businesses is just like proving the ROI.

Speaker 2 Like, and there have been these studies that show like, you know, you deploy this stuff on real code bases, it's kind of unclear if it's actually having an impact.

Speaker 2 Whereas if you get something that's more outcome-oriented or more just like an automation, I think it's easier to prove the ROI.

Speaker 2 And then you're also not limited by time spent behind keyboard for doing this type of stuff.

Speaker 2 So, from a business perspective, I'm very excited about like what's unlocked if developers can program these agents.

Speaker 1 Well, that's fascinating. Thank you so much for joining us at Anna Pryors.

Speaker 2 Thank you for having me. This was great.

Speaker 3 Find us on Twitter at NoPriorsPod. Subscribe to our YouTube channel if you want to see our faces.
Follow the show on Apple Podcasts, Spotify, or wherever you listen.

Speaker 3 That way you get a new episode every week. And sign up for emails or find transcripts for every episode at no-priors.com.