DeepSeek DeepDive + Hands-On With Operator + Hot Mess Express!

1h 5m
“DeepSeek is a really odd duck.”

Press play and read along

Runtime: 1h 5m

Transcript

Speaker 1 In business, they say you can have better, cheaper, or faster, but you only get to pick two.

Speaker 4 What if you could have all three? You can with Oracle Cloud Infrastructure.

Speaker 7 OCI is the blazing fast hyperscaler for your infrastructure, database, application development, and AI needs where you can run any workload for less.

Speaker 10 Compared with other clouds, OCI costs up to 50% less for compute, 70% less for storage, and 80% less for networking.

Speaker 13 Try OCI for free at oracle.com/slash NYT.

Speaker 5 Oracle.com/slash NYT.

Speaker 15 I just got my weekly. You know, I set up ChatGPT to email me a weekly affirmation before we start taping because you can do that now with the tasks feature.

Speaker 16 Yeah, people say this is the most expensive way to email yourself a reminder. So what sort of affirmation did we get?

Speaker 15 Today it said, you are an incredible podcast host, sharp, engaging, and completely in command of the mic. Your taping today is going to be phenomenal and you're going to absolutely kill it.

Speaker 16 Wow. And that's why it's so important that ChatGPT can't actually listen to podcasts because I don't think it would say that if it had actually ever heard us.

Speaker 15 It would say, just get this over with.

Speaker 15 Get on with it.

Speaker 15 I'm Kevin Roos, a tech columnist at the New York Times. I'm Casey Newton from Platformer.
And this is Hard Fork.

Speaker 16 This week, we go deeper on DeepSeek. Chinatops Jordan Schneider joins us to break down the race to build powerful AI.
Then, hello, operator. Kevin and I put OpenAI's new agent software to the test.

Speaker 16 And finally, the train is coming back to the station for a round of Hot Mass Express.

Speaker 15 Well, Casey, it is rare that we spend two consecutive episodes of this show talking about the same company, but I think it is fair to say that what is happening with DeepSeek has only gotten more interesting and more confusing.

Speaker 16 Yeah, that's right. It's hard to remember a story in recent months, Kevin, that has generated quite as much interest as what is going on with DeepSeek.

Speaker 16 Now, DeepSeek, for anyone catching up, is this relatively new Chinese AI startup that released some very impressive and cheap AI models this month that lots of Americans have started downloading and using.

Speaker 15 Yeah, so some people are calling this a Sputnik moment for the AI industry when kind of every nation perks up and starts

Speaker 15 paying attention at the same time to the AI arms race. Some people are saying this is the biggest thing to happen in AI since the release of ChatGPT.

Speaker 15 But Casey, why don't you just catch us up on what has been happening since we recorded our emergency podcast episode just two days ago?

Speaker 16 Well, I would say that there have probably been three stories, Kevin, that I would share to give you a quick flavor of what's been going on.

Speaker 16 One, a market research firm says DeepSeek was downloaded 1.9 million times on iOS in recent days and about 1.2 million times on the Google Play Store.

Speaker 16 The second thing I would point out is that DeepSeek has been banned by the US Navy over security concerns, which I think is unfortunate because what is a submarine doing if not deep seeking?

Speaker 16 It was also banned in Italy, by the way, after the data protection regulator made an inquiry. And finally, Kevin, OpenAI says that there is evidence that DeepSeek distilled its models.

Speaker 16 Distillation is kind of the AI lingo or euphemism for they used our API to try to unravel everything we were doing and use our data in ways that we don't approve of.

Speaker 16 And now Microsoft and OpenAI are now jointly investigating whether DeepSeek abused their API.

Speaker 16 And of course, we can only imagine how OpenAI is feeling about the fact that their data might have been used without payment or consent.

Speaker 15 Must be really hard to think that someone might be out there trading AI models on your data without permission and i want to acknowledge that literally every single user of blue sky already made this joke but they were all funny and i'm so happy to repeat it here on hard fork this week now kevin as always when we talk about ai we have certain disclosures to make the new york times company is currently suing open ai and microsoft over copyright violations uh alleged related to the use of their copyrighted data to train AI models.

Speaker 15 I think that was good.

Speaker 16 It was very good. And I'm in love with a man who works at Anthropic.

Speaker 16 Now, with that said, Kevin, we have even further we want to go into the DeepSeek story, and we want to do it with the help of Jordan Schneider.

Speaker 15 Yes, we are bringing in the big guns today because we wanted to have a more focused discussion about DeepSeek that is not about, you know, the stock market or how the American AI companies are reacting to this, but is about one of the biggest sets of questions that all of this raises, which is what is China up to with DeepSeek and AI more broadly?

Speaker 15 Like,

Speaker 15 what are the geopolitical implications of the fact that Americans are now obsessing over this Chinese-made AI app? What does it mean for DeepSeek's prospects in America?

Speaker 15 What does it mean for their prospects in China? And how does all this fit together from the Chinese perspective? So Jordan Schneider is our guest today.

Speaker 15 He's the founder and editor-in-chief of China Talk, which is a very good newsletter and podcast about U.S.-China tech policy. He's been following the Chinese AI ecosystem for years.

Speaker 15 And unlike a lot of American commentators commentators and analysts who were sort of surprised by DeepSeek and what they managed to pull off over the last couple of weeks, I'll say it.

Speaker 15 I was surprised. Yeah, me too.
But Jordan has been following this company for a long time.

Speaker 15 And a big focus of China Talk, his newsletter and podcast, has been translating literally what is going on in China into English, making sense of it for a Western audience and keeping tabs on all the developments there.

Speaker 15 So perfect guest for this week's episode, and I'm very excited for this conversation.

Speaker 16 Yes, I have learned a lot from China Talk in recent days as I've been boning up on DeepSeek. So we're excited to have Jordan here and let's bring him in.

Speaker 15 Join us tonight or welcome to Hard Fork. Oh my God, such a huge fan.
This is such an honor.

Speaker 16 We're so excited to have you. I have learned truly so much from you this week.

Speaker 16 And so when we were talking about what to do this week, we just looked at each other and said, we have got to see if Jordan can come on this podcast. Yeah.

Speaker 15 So this has been a big week for Chinese tech policy. Maybe the biggest week for Chinese tech policy, at least that I can remember.

Speaker 15 I realized that something important was happening last weekend when I started getting texts from like all of my non-tech friends being like, What is going on with DeepSeek?

Speaker 15 And I imagine you had a similar reaction because you are a person who does constantly pay attention to Chinese tech policy.

Speaker 15 So, I've been running China Talk for eight years, and I can get my family members to maybe read like one or two editions a year.

Speaker 15 And the same exact thing happened with me, Kevin, where all of a sudden I got, oh my God, DeepSeek, like it's on the cover of the New York Post, Jordan, you're so clairvoyant.

Speaker 15 Like maybe I should read you more. I'm like, okay, thanks, mom.
Appreciate that.

Speaker 15 Yeah, so I want to talk about DeepSeek and what they have actually done here, but I'm hoping first that you can kind of give us the basic lay of the land of the sort of Chinese AI ecosystem, because that's not an area where Casey or I have spent a lot of time looking.

Speaker 15 But tell us about DeepSeek and sort of where it sits in the overall Chinese industry. So DeepSeek is a really odd duck.

Speaker 15 It was born out of this very successful quant hedge fund, the CEO of which basically after ChatGPT was released was like, okay, this is really cool.

Speaker 15 I want to spend some money and some time and some compute and hire some fresh young graduates graduates to see if we can give it a shot to make our own language models.

Speaker 16 And so, a lot of companies are out there building their own large language models. What was the first thing that happened that made you think, oh,

Speaker 16 this company is actually making some interesting ones?

Speaker 15 Sure. So, there are lots and lots of very moneyed Chinese companies that have been trying to follow a similar path after ChatGPT.

Speaker 15 You know, we have giant players like Alibaba, Tencent, ByteDance, Huawei, even trying to create their own open AI, basically.

Speaker 15 And what is remarkable is the big organizations can't quite get their head around creating the right organizational institutional structure to incentivize this type of collaboration and research that leads to real breakthroughs.

Speaker 15 So Chinese firms have been releasing models for years now, but DeepSeek, because of the way that it structured itself and the freedom they had, not necessarily being under a direct profit motive, they were able to put out some really remarkable innovations that caught the world's attention, you know, starting maybe late December, and then, you know, really blew everyone's mind with the release of the R1 chatbot.

Speaker 15 Yeah. So let's talk about R1 in just a second, but one more question for you, Jordan, about DeepSeek.

Speaker 15 What do we know about their motivation here?

Speaker 15 Because so much of what has been puzzling American tech industry watchers over the last week is that this is not a company that has sort of an obvious business model connected to its AI research, right?

Speaker 15 We know why Google is developing AI because it thinks it's going to make the company Google much more profitable. We know why OpenAI is developing advanced AI models.

Speaker 15 It does not seem obvious to me, and I have not read anything from people involved in DeepSeek about why they are actually doing this and what their ultimate goal is.

Speaker 15 So can you help us understand that?

Speaker 15 So, um, we don't have a lot of data, um, but my base case, which is based on uh, two extended interviews that um the deep seek CEO released, which we've translated on Chinatop, as well as just like what uh deep seek employees have been tweeting about in the west and then um domestically, is that they're dreamers.

Speaker 15 I think the right mental model is open AI, you know, 2017 to 2022. Like, I'm sure you could ask the same thing, like, what the hell are they doing?

Speaker 15 I mean, Sam Altman literally said, I have no idea how we're ever going to make money. Right.
And here we are in this grand new paradigm.

Speaker 15 So I really think that they do have this like vision of AGI and like, look, we'll build it and we'll make it cheaper for everyone. You know, we'll figure it out later.

Speaker 15 And like, they have enough trading strategies that they can fund it.

Speaker 15 And now that they've really blown people's minds, we might be sort of turning into a new period in DeepSeek's history, kind of like what happened with OpenAI, right?

Speaker 15 Where they're going to have to shack up with a hyperscaler, be it, you know, not Microsoft in this case, but ByteDance or Ali or Tencent or Huawei.

Speaker 15 And the government's going to start to pay attention in a way, which it really hasn't over the past few years. Right.
And

Speaker 15 I want to drill down a little bit there because I think one thing that most listeners in the West do know about Chinese tech companies is that many of them are sort of inextricably linked to the Chinese government, that the Chinese government has access to user data under Chinese law, that these companies have to follow the Chinese censorship guidelines.

Speaker 15 And so as soon as DeepSeek started to really pop in America over the last week, people started typing in

Speaker 15 things to DeepSeek's model, like, tell me about what happened at Tiananmen Square, or tell me about Xi Jinping, or tell me about the Great Leap Forward. And it just sort of wouldn't do it at all.

Speaker 15 And so people, I think, saw that and said, oh,

Speaker 15 this is like every other Chinese company that has this sort of in glove relationship with the Chinese ruling party.

Speaker 15 But it sounds from what you're saying, like DeepSeek has a little bit more complicated a relationship to the Chinese government than maybe some other better known Chinese tech companies.

Speaker 15 So explain that. Yeah, I mean, I think

Speaker 15 it's important. Like the mental model you should have for these CEOs are not like people who are dreaming to spread Xi Jinping thought.

Speaker 15 Like what they want to do is compete with Mark Zuckerberg and Sam Altman and show that they're like really awesome and great technologists. But the tragedy is, let's take ByteDance, for example.

Speaker 15 You can look at Zhang Yiming, their CEO's Weibua posts from 2012, 2013, 2014, which are super liberal in a Chinese context, saying like, you know, we should have freedom of expression, like we should be able to do whatever we want.

Speaker 15 In the early years of ByteDance,

Speaker 15 there was a lot of relatively more subversive content on the platform where you sort of saw like real poverty in China, you saw off-color jokes, and then all of a sudden in 2018, he posts a letter saying, I am really sorry, like I need to be part of this sort of like Chinese national project and like better adhere to,

Speaker 15 you know, modern Chinese socialist values. And I'm really sorry and it won't ever happen again.
You know, the same thing happened with Didi, right?

Speaker 15 Like they don't really want to have to do anything with politics. And then they get on someone's side and all of a sudden they get zapped.

Speaker 16 Didi is, of course, the big Chinese rideshare company.

Speaker 15 Correct. Yeah.

Speaker 16 What did Didi do do?

Speaker 15 So they listed on the Western Stock Exchange after the Chinese government told them not to, and then they got taken off app stores and it was a whole giant nightmare.

Speaker 15 Like they had to sort of go through their rectification process. So point being with DeepSeek, right, is like now they are, whether they like it or not, going to be held up as a national champion.

Speaker 15 And that comes with a lot of headaches and responsibilities from, you know, potentially giving the Chinese government more access,

Speaker 15 having to fulfill government contracts, which honestly are probably really annoying for them

Speaker 15 to do and sort of distracting from the broader mission they have of developing and deploying this technology in the widest range possible.

Speaker 15 But DeepSeek thus far has flown under the radar, but that is no longer the case and things are about to change for them. Right.
And I think that was one of the surprising things about DeepSeek

Speaker 15 for the people I know, including you who follow Chinese tech policies,

Speaker 15 you know, I think people were surprised by the sophistication of their models. And we talked about that on the emergency pod that we did earlier this week and how cheaply they were trained.

Speaker 15 But I think the other surprise is that they were released as open source software because

Speaker 15 one thing that you can do with open source software is download it, host it in another country, remove some of the guardrails and the censorship filters that might have been part of the original model.

Speaker 16 By the way, it turned out there weren't even really guardrails on the on the V3 model, right? It had not been trained to avoid questions about Tiananmen Square or anything.

Speaker 16 So that was another really unusual thing about this.

Speaker 15 Right. And one thing that we know about Chinese technology products is that they don't tend to be released that way.

Speaker 15 They tend to be hosted in China and overseen by Chinese teams who can make sure that they're not out there talking about Tiananmen Square. So

Speaker 15 is the open source nature of what DeepSeek has done here part of the reason that you think there might be conflict looming between them and the Chinese government?

Speaker 15 You know, honestly, I think this whole ask it about Tiananmen stuff is a bit of a red herring on a few dimensions.

Speaker 15 So first, one of these like arguments that is a little sort of confusing to me is like folks used to say, oh, like the Chinese models are going to be lobotomized and like they will never be as smart as the Western ones because like they have to be politically correct.

Speaker 15 I mean, look, if you ask Claude to say racist things, it won't. And Claude's still pretty smart.

Speaker 15 Like this is sort of a solved problem and a bit of a, in a bit of a red herring when talking about sort of long-term competitiveness of Chinese and Western models.

Speaker 15 Now, you asked me like, oh, so they released this model globally and it's open source.

Speaker 15 Maybe someone in the Chinese government would be uncomfortable with the fact that people can get a Chinese model to say things that would get you thrown in jail if you posted them online in China.

Speaker 15 It's going to be a really interesting calculus for the Chinese government to make because on the one hand, this is the most positive shine that Chinese AI has got globally in the history of Chinese AI.

Speaker 15 So they're going to have to navigate this and it might prompt some uncomfortable conversations and bring regulators to a place they wouldn't have otherwise landed. Yeah.

Speaker 15 Now, Jordan, I want to ask you about something that people have been talking about and speculating about in relationship to the DeepSeek news for the last week or so, which is about chip controls.

Speaker 15 So we've talked a little bit on the show earlier this week about how DeepSeek managed to put together these models

Speaker 15 using some of these kind of second-rate chips from NVIDIA that are allowed to be exported to China.

Speaker 15 We've also talked about the fact that you cannot get the most powerful chips legally if you are a Chinese tech company.

Speaker 15 So there have been some people, including Elon Musk and other American tech luminaries, who have said, oh, well, DeepSeek has this sort of secret stash of these banned chips that they have smuggled into the country and that actually they are not making do with kind kind of the Kirkland signature chips that they say they are.

Speaker 15 What do we know about how true that is?

Speaker 15 So did DeepSeek have banned chips? It's kind of impossible to know. This is a question more for the U.S.
intelligence community than like Jordan Schneider on Twitter.

Speaker 15 But I do think that it is important to understand that the delta between what you can get in the West and what you can get in China is actually not that big.

Speaker 15 And, you know, we're talking about training a lot, but also on the inference side, China can still buy this H20 chip from NVIDIA, which is basically world class at like deploying the AI and letting everyone use it.

Speaker 15 So does this mean that we should just give up? I don't think so. Compute is going to be a core input regardless of how much model distillation you're going to have in the future.

Speaker 15 There have been a lot of quotes even from the DeepSeek founder basically saying like the one thing that's holding us back are these export controls. Right.

Speaker 15 Okay.

Speaker 16 I want to ask a big picture question. Sure.

Speaker 16 I think that a reason that people have been so fascinated by this deep seek story is that at least for some folks, it seems to change our understanding of where China is in relation to the United States when it comes to developing very powerful AI.

Speaker 16 Jordan, what is your assessment of what the V3 and R1 models mean? And to what extent do you think the game has actually changed here?

Speaker 15 I'm not really sure the game has changed so much. Like Chinese engineers are really good.
I think it is a reasonable base case that...

Speaker 15 Chinese firms will be able to develop comparable or fast follow on the model side.

Speaker 15 But the real sort of long-term competition is not just going to be on developing the models, but deploying them and deploying them at scale. And that's really where compute comes in.

Speaker 15 And that's why expert controls are going to continue to be a really important piece of America's strategic arsenal when it comes to making sure that the 21st century is defined by the U.S.

Speaker 15 and our friends as opposed to China and theirs.

Speaker 16 Right. So it's one thing to have a model that is about as capable as the models that we have here in the United States.

Speaker 16 It's another thing to have the energy to actually let everyone use them as much as they want to use them.

Speaker 16 But what you're saying is no matter what DeepSeek may have invented here, that fundamental dynamic has not changed. China simply does not have nearly the amount of compute that the United States has.

Speaker 15 As long as we don't screw up export controls. So I think the sort of base case for me is that if the U.S.

Speaker 15 stays serious about holding the line on semiconductor manufacturing equipment and export of AI chips, then it will be incredibly difficult for the Chinese sort of broader semiconductor and AI ecosystem to leap ahead, much less kind of like fast follow beyond being able to develop comparable models.

Speaker 15 I'm feeling good as long as Trump doesn't make some crazy trade for soybeans in exchange for ASML EUV machines. That would really break my heart rate.

Speaker 15 I want to inject kind of a note of skepticism here because I buy everything that you're saying about how DeepSeek's progress has been sort of bottlenecked by the fact that it can't get these very powerful American AI chips from companies like NVIDIA.

Speaker 15 But I also am hearing people who I trust say things that make me think that actually the bottleneck may not be the availability of chips, that maybe with some of these algorithmic efficiency breakthroughs that DeepSeek and others have been making, it might be possible to run a very, very powerful AI model on a conventional piece of hardware, on a MacBook even.

Speaker 15 And I wonder about how much of this is just like AI companies in the West trying to cope, trying to make themselves feel better, trying to reassure the market that they are still going to make money by investing billions and billions of dollars into building powerful AI systems.

Speaker 15 If these models do just become sort of lightweight commodities that you can run on a much less powerful cluster of computers or maybe on one computer, doesn't that just mean we can't control the proliferation of them at all?

Speaker 15 Yeah, I mean, I think this is like, this this is one potential future.

Speaker 15 And maybe that potential future like went up 10 percentage points of likelihood of like you being able to fit the biggest, baddest, smartest, most fast, efficient AI model on something that you, that can sit in your home.

Speaker 15 But I think there are lots of other futures in which sort of the world doesn't necessarily play out that way. And look, NVIDIA went down 15%.

Speaker 15 It didn't go down 95%.

Speaker 15 Like, I think if we're really in that world where chips don't matter because everything can be shrunk down to kind of consumer-grade hardware, then the sort of reaction that I think you would have seen in the stock market would have been even more dramatic than the kind of freak out we saw over this week.

Speaker 15 So we'll see.

Speaker 15 I mean, it would be a really remarkable kind of democratizing thing if that was the future we ended up living in, but it still seems pretty unlikely to my, you know, like history major brain here.

Speaker 16 I would also just point out, Kevin, that when you look at what DeepSeek has done, they have created a really efficient version of a model that American companies themselves had trained like nine to 12 months ago, right?

Speaker 16 So they sort of caught up very quickly. And there are fascinating technological innovations in what they did.
But in my mind, these are still primarily optimizations.

Speaker 16 Like for me, what would tip me over into like, oh my gosh, America is losing this race is China is the first one out of the gate with a virtual coworker, right?

Speaker 16 Or like a truly phenomenal agent, some sort of leap forward in the technology as opposed to we've caught up really quickly and we've figured out something more efficiently.

Speaker 15 Are you seeing it differently than that?

Speaker 15 I mean, I guess I just don't know what like a six-month lag would buy us if it does take, you know, six months for the Chinese AI companies like DeepSeek to sort of catch up to the state of the art.

Speaker 15 You know, I was struck by Adario Amadei, who's the CEO of Anthropic,

Speaker 15 wrote an essay just today about DeepSeek and export controls.

Speaker 15 And in it, he makes this point about the sort of difference between living in what he called a unipolar world where one country or one block of countries has access to something like an AGI or an ASI

Speaker 15 and the rest of the world doesn't versus the situation where China gets there roughly around the same time that we do. And so we have this bipolar world where two blocks of countries,

Speaker 15 the East and the West, basically have access to this equivalent technology. And so.

Speaker 16 And of course, in a bipolar world, sometimes we're very happy and sometimes we're very sad.

Speaker 15 Exactly. So I just think like whether we get there six months ahead of them or not, I just feel like there isn't that much of a material difference.
But Jordan, maybe I'm wrong.

Speaker 15 Can you make the other side of that, that it really does matter?

Speaker 15 I'm kind of there.

Speaker 15 I'll take a little bit of issue with what Dario says.

Speaker 15 And I think, you know, what one of the lessons that Deep Sea shows is we should expect a base case of Chinese model makers being able to fast follow the innovations, which, by the way, Casey, actually do take those giant data centers to run all the experiments in order to find out, you know, what is the sort of future direction you want to take your model.

Speaker 15 And what's what sort of AI is going to to come down to is not just creating the model, not just sort of like Dario envisioning the future, and then all of a sudden, like things happen.

Speaker 15 Like there's going to be a lot of messiness in the implementation, and there are going to be sort of like teachers' unions who are upset that AI comes in the classroom.

Speaker 15 And there are going to be like all these regulatory pushbacks and a lot of societal reorganization, which is going to need to happen, just like it did during the Industrial Revolution.

Speaker 15 So, look, model making is a frontier of competition.

Speaker 15 Compute access is a frontier of competition, but there's also this broader, like, how will a society kind of adopt and cope with all of this new future that's going to be thrown in our faces over the coming years?

Speaker 15 And I really think it's that just as much as the model development and the compute, which is going to determine which countries are going to gain the most from what AI is going to offer us.

Speaker 15 Yeah. Well, Jordan, thank you so much for joining and explaining all of this to us.
I feel more enlightened. Me too.
Oh, my pleasure. My chain of thought has just gotten a lot longer.

Speaker 15 That's an AI joke.

Speaker 16 Let me cut back. Kevin, there's an agent at our door.
Is it Jerry Maguire? No, it's an AI one.

Speaker 15 Oh, okay.

Speaker 15 Jerry McGuire? I don't know.

Speaker 15 Jerry McGuire!

Speaker 1 In business, they say you can have better, cheaper, or faster, but you only get to pick two.

Speaker 4 What if you could have all three? You can with Oracle Cloud Infrastructure.

Speaker 7 OCI is the blazing fast hyperscaler for your infrastructure, database, application development, and AI needs where you can run any workload for less.

Speaker 10 Compared with other clouds, OCI costs up to 50% less for compute, 70% less for storage, and 80% less for networking.

Speaker 13 Try OCI for free at oracle.com slash nyt.

Speaker 5 Oracle.com slash nyt.

Speaker 17 Over the last two decades, the world has witnessed incredible progress.

Speaker 17 From dial-up modems to 5G connectivity, from massive PC towers to AI-enabled microchips, innovators are rethinking possibilities every day.

Speaker 17 Through it all, Invesco QQQ ETF has provided investors access to the world of innovation with a single investment. Invesco QQQ, let's rethink possibility.

Speaker 17 There are are risks when investing in ETFs, including possible loss of money. ETFs' risks are similar to those of stocks.

Speaker 17 Investments in the tech sector are subject to greater risk and more volatility than more diversified investments.

Speaker 17 Before investing, carefully read and consider front investment objectives, risks, charges, expenses, and more in perspectives at Invesco.com. In Vesco Distributors Inc.

Speaker 14 Can your software engineering team plan, track, and ship faster?

Speaker 1 Monday Dev says yes.

Speaker 14 Custom workflows, AI-power context, and IDE-friendly integrations.

Speaker 3 No admin bottlenecks, no BS.

Speaker 14 Try it free at monday.com/slash dev.

Speaker 15 Operator,

Speaker 15 information, give me Jesus on the line. Do you know that one? No.

Speaker 16 Do you know Operator by Jim Crochy?

Speaker 15 No.

Speaker 16 Operator, won't you help me place this call?

Speaker 15 Well, Casey, call your agent because today we're talking about AI agents.

Speaker 16 Why do I need to call my agent?

Speaker 15 I don't know. It just sounded good.

Speaker 16 Okay.

Speaker 15 Well, I appreciate the effort.

Speaker 16 But yes, Kevin, because for months now, the big AI labs have been telling us that they are going to release agents this year.

Speaker 16 Agents, of course, being software that can essentially use your computer on your behalf or use a computer on your behalf. And

Speaker 16 the dream is that you have sort of a perfect virtual assistant or coworker. You name it.
If they are somebody who might work with you at your job, the AI labs are saying we are building that for you.

Speaker 15 Yeah. So last year, toward the end of the year, we started to see kind of these demos, these previews that companies like Anthropic and Google were working on.

Speaker 15 Anthropic released something called Computer Use, which was an AI agent, a sort of very early preview of that.

Speaker 15 And then Google had something called Project Mariner that I got a demo of, I believe in December, that was basically the same thing, but their version of it.

Speaker 15 And then just last week, OpenAI announced that it was launching Operator, which is its first version of an AI agent.

Speaker 15 And unlike Anthropic and Google's, which, you know, you you either had to be a developer or part of some early testing program to access, you and I could try it for ourselves by just upgrading to the $200 a month pro subscription of ChatGPT.

Speaker 16 Yeah. And I will say that as somebody who's willing to spend, you know, money on software all the time, I thought, am I really about to spend $200 to do this?

Speaker 16 But, you know, in the name of science, Kevin, I had to.

Speaker 15 At this point, I am spending more on AI subscription products than on my mortgage. I'm pretty sure that's correct.

Speaker 15 But,

Speaker 15 you know, it's worth it. We do it for journalism.

Speaker 16 We do. So we both spent a couple of days putting Operator through its paces.
And today we want to talk a little bit about what we found. Yeah.

Speaker 15 So would you just explain like what Operator is and how it works? Yeah, sure.

Speaker 16 So Operator is a separate subdomain of ChatGPT. You know, sometimes the ChatGPT will just let you pick a new model from a drop-down menu, but for operator, you got to go to a dedicated site.

Speaker 16 Once you do, you'll see a very familiar chatbot interface, but you'll see different kinds of suggestions that reflect some of the partnerships that OpenAI has struck up.

Speaker 16 So, for example, they have partnerships with Open Table and StubHub and all recipes. And these are meant to give you an idea of what Operator can do.

Speaker 16 And frankly, Kevin, Not a lot of this sounds that interesting, right?

Speaker 16 Like the suggestions are on the order of suggest a 30-minute meal with chicken or reserve a table for eight or find the most affordable passes to the Miami Grand Pri. Again, so far, kind of so boring.

Speaker 16 What is different about Operator, though, is that when you say, okay, find the most affordable passes to the Miami Grand Pri, when you hit the enter button, it is going to open up its own web browser and it's going to use this new model that they've developed to try to actually go and get those passes for you.

Speaker 15 Yeah, so this is an important thing because I think, you know, when people first heard about this, they thought, okay, this is an AI that kind of takes over your computer, takes over your web browser.

Speaker 15 That is not what Operator does. Instead, it opens a new browser inside your browser.
And that browser is hosted on OpenAI servers.

Speaker 15 It doesn't have your bookmarks and stuff like that saved, but you can take it over from the autonomous AI agent if you need to click around or do something on it. But it basically exists.

Speaker 15 It's a browser within a browser.

Speaker 16 Yeah. So one of the ideas in Operator is that you should be able to leave it unsupervised and just kind of go do your work while it works.

Speaker 16 But of course, it is very fun initially at least to watch the computer try to use itself.

Speaker 16 And so I sat there in front of this browser within a browser and I watched this computer move a mouse around, type the URL, navigate to a website, and in the example I just gave, actually search for passes to the Miami Grand Prix.

Speaker 15 Yeah. And it's interesting on a slightly more technical level because until now, if an AI system like a Chat GPT wanted to interact with some other website, it had to do so through an API, right?

Speaker 15 APIs, application programming interfaces are sort of the way that computers talk to each other.

Speaker 15 But what operator does is essentially eliminate the need for APIs because it can just click around on a normal website that is designed for humans and behave like a human.

Speaker 15 And you don't need a special interface to do that.

Speaker 16 Yeah, and now some people might hear that, Kevin, and start screaming because what they will say is APIs are so much more efficient than what Operator is doing here. APIs are very structured.

Speaker 16 They're very fast. They let computers talk to each other without having to, for example, open up a browser.
And as long as there's an API for something, you can typically get it done pretty quickly.

Speaker 16 The thing is, though, APIs have to be built. There is a finite number of them.

Speaker 16 The reason that OpenAI is going through this exercise is because they want a true general purpose agent that can do anything for you. you whether there is an API for it or not.

Speaker 16 And maybe we should just pause for a minute there and zoom out a little bit to say why are they building this like what is the long-term vision here sure so the the vision is to create virtual coworkers kevin this is the north star for the big ai labs right now uh many of them have said that they are trying to create some kind of digital entity that you can just hire as a coworker the first ones they'll probably be engineers because these systems are already so good at writing code, but eventually they want to create virtual consultants, virtual lawyers, virtual doctors, you name it.

Speaker 16 Virtual podcast hosts.

Speaker 15 Let's hope they don't go that far. But everything else is on the table.

Speaker 16 And if they can get there, you know, presumably that there are going to be huge profits in it for them. They're going to potentially be huge productivity gains for companies.

Speaker 16 And then there's, of course, the question of, well, what does this mean for human beings? And I think that's somewhat murkier.

Speaker 15 Right. And I think there's also, it also helps to justify the cost of running these things because $200 a month is a lot to pay for a version of ChatGPT, but it's not a lot to pay for a remote worker.

Speaker 15 And if you could say, use the next version of operator or maybe two or three versions from now to say replace a customer service agent or someone in your billing department, that actually starts to look like a very good deal.

Speaker 15 Absolutely.

Speaker 16 Or even if I could bring it into the realm of journalism, Kevin, if I had a virtual research assistant and I said, hey, I'm going to write about this today, go pull all of the most relevant information about this from the past couple of years and maybe organize it in such a way that I might, you know, write a column based off of it.

Speaker 16 Like, yeah, that's absolutely worth $200 a month to me.

Speaker 15 Okay. So, Casey, walk me through something that you actually asked Operator to do for you and what it did autonomously on its own.

Speaker 16 Sure. I'll maybe give like two examples, like a pretty good one and maybe a not so good one.
Pretty good one was, and this was actually suggested by Operator.

Speaker 16 I used TripAdvisor to look up walking tours in London that I might want to do the next time I'm in London. When I did when are you going to London? I'm not actually going to London.

Speaker 15 Oh, so you lied to the AI?

Speaker 16 Not for the first time. But here's what I'll say.
If anybody wants to break heaven and I to London, get in touch. We love the city.

Speaker 16 So I said, okay, operator, sure, let's do it. Let's find me some walking tours.
I clicked that. It opened a browser.
It went to TripAdvisor. It searched for London walking tours.

Speaker 16 It read the information on the website and then it presented it to me. Did that within a couple of minutes.
Now, on one hand, could I have done that that just as easily by Google?

Speaker 16 Could I probably have done it even faster if I'd done it myself? Sure.

Speaker 16 But if you're just sort of interested in the technical feat that is getting one of these models to open a browser, navigate to a website, read it and share information, I did think it was pretty cool.

Speaker 15 Yes. It's very trippy to see a computer using itself and, you know, going around like typing things and selecting things from dropdown menus.

Speaker 16 Yeah, it's sort of like, you know, if you think it is cool to be in a self-driving car, like this is that, but for your website. A self-driving browser.
It is a self-driving browser.

Speaker 16 So that was the good example.

Speaker 15 Yes. What was another example?

Speaker 16 So another example, and this was something else that OpenAI suggested that we try, was to try to use Operator to buy groceries. And they have a partnership with Instacart.

Speaker 16 The CEO of Instacart, Fiji Seema, was on the OpenAI board. And so I thought, okay, they're going to have like sort of dialed this in so that there's a pretty good experience.

Speaker 16 And so I said, okay, let's go ahead and buy groceries. And I went into Operator and I said something like, hey, can you help me buy groceries on Instacart? And it said, sure.

Speaker 16 And here's what it did. It opened up Instacart in a browser.
So far, so good. And then it started searching for milk in stores located in Des Moines, Iowa.

Speaker 15 Now, you do not live in Des Moines, Iowa. So why did it think that you did?

Speaker 16 As best as I can tell, the reason it did this is that Instacart defaults to searching for grocery stores in the local area. And the server that this instance of operator was running on was in Iowa.

Speaker 16 Now, if you are designing a grocery product like Instacart, and Instacart does this, when you first sign on and say you're looking for groceries, it will say quite sensibly, where are you? Right.

Speaker 16 Operator does not do this. Instacart might also offer suggestions for things that you might want to buy.
It does not just assume that you want milk.

Speaker 15 Wow. I'm just picturing like a house in Des Moines, Iowa, where there's just like a pallet of milk being delivered every day from all these poor operator users.
Yes.

Speaker 16 So I thought, okay, whatever. You know, this thing makes mistakes.
Let's let's hope that it gets on the right track here.

Speaker 16 And so I tried to pick the grocery store that I wanted it to shop at, which is, you know, in San Francisco where I live. And it entered that grocery store's address as the delivery address.

Speaker 16 So like it would try to deliver groceries, presumably from Des Moines, Iowa to my grocery store, which is not what I wanted. And it actually could not solve this problem without my help.

Speaker 16 I had to take over the browser, log into my Instacart account, and tell it which grocery store that I wanted to shop at.

Speaker 16 So already, all of this has taken at least 10 times as long as it would have taken me to do this myself.

Speaker 15 Yeah. So I had some similar experiences.

Speaker 15 The first thing that I had Operator try to do for me was to buy a domain name and set up a web server for a project that you and I are working on that we can't really talk about yet. Secret project.

Speaker 15 Secret project.

Speaker 15 And so I said to operator, I said, go research available domain names related to this project. Buy the one that costs less than $50, the best one that costs less than $50,

Speaker 15 and then buy a hosting account and set it up and configure all the DNS settings and stuff like that.

Speaker 16 Okay, so that was like a true multi-step project and something that would have been legitimately very annoying to do yourself. Yes.

Speaker 15 You know, that would have taken me, I don't know, half an hour to do on my own.

Speaker 15 and it did take operator some time like I had to kind of like set it and forget it and like I you know got myself a snack and a cup of coffee and then when I came back it had done most of these tasks.

Speaker 15 Really? Yes, I had to still do things like take over the browser and enter my credit card number. I had to give it some details about like my address for the sort of registration for the domain name.

Speaker 15 I had to pick between the various hosting plans that were available on this website. But it did 90% of the work for me.
And I just had to like sort of take over and do the last mile.

Speaker 16 And it's this is really interesting because what I would assume was it would get like, I don't know, 5% of the way and it would hit some hiccup, and it just wouldn't be able to figure something out until you came back and saved it.

Speaker 16 But it sounds like, from what you're saying, was it was somehow able to like work around whatever unanswered questions there were and still get a lot done while you weren't paying attention.

Speaker 15 So it's sort of,

Speaker 15 it felt a little bit like training like a very new, very insecure intern because, like, it at first it would keep prompting me. It'd be like, well, do you want a dot-com or a.net?

Speaker 15 And eventually, you just have to prompt it and say, like, make whatever decisions you want. Like, wait, you said that to it.

Speaker 15 Yes, I said, I said, like, only ask for my intervention if you can't progress any farther. Otherwise, just make the most reasonable decision.

Speaker 15 You said, I don't care how many people you have to kill, just get me this domain. And it said, understood, sir.
Yeah. And I'm now wanted in 42 states.

Speaker 15 Anyway, that was one thing that Operator did for me that I thought was pretty impressive.

Speaker 16 I have to say, that feels like a grand success compared to what I got Operator today.

Speaker 15 Yeah, it was pretty impressive. I also had it send lunch to one of my coworkers, Mike Isaac, who was hungry because he was on deadline.
And I went, I said, go to DoorDash and get Mike some lunch.

Speaker 15 It did initially mess up that. process because it decided to send him tacos from a taco place, which, you know, is great.
And it's a taco place, I know it's very good.

Speaker 15 But I said, order enough for two people and sort of ordered two tacos.

Speaker 15 And this is one of those places where the tacos are quite small.

Speaker 16 Operator said, get your portion size under control, America.

Speaker 15 Yeah, so then I had to go in and say, does that sound like enough food, operator? And it said, actually, now that you mentioned it, I should probably order more.

Speaker 16 Wait, no, so here's a question. So in these cases, is the first step that you log into your account? Because it doesn't have any of your payment details or anything.

Speaker 16 So at what point are you actually sort of teaching it that?

Speaker 15 It depends on the website. So sometimes you can just say

Speaker 15 up front, like, here is my email address or here's my login information, and it will sort of, you know, log you in and do all that. Sometimes you take over the browser.

Speaker 15 There are some privacy features that are probably important to people where it says OpenAI says that it does not take screenshots of the browser while you are in control of it because

Speaker 15 you might not want your credit card information getting sent to OpenAI's servers or anything like that. So

Speaker 15 sometimes it happens at the beginning of the process. Sometimes it happens like when you're checking out at the end.

Speaker 16 And so were you taking it over to login or were you saying, I don't care, and you just like, we're giving operator your DoorDash password and plain text?

Speaker 15 I was taking it over. Okay.
Smart. Smart.
So those were the good things. I also, this was a fun one.

Speaker 15 I wanted to see if Operator could make me some money.

Speaker 15 So I said, go take a bunch of online surveys. Cause you know, there are all these websites where you can like get a couple cents for like filling out an online survey.

Speaker 16 Something that most people don't know about Kevin is he devotes 10% of his brain at any given time to thinking about schemes to generate money.

Speaker 16 And it's one of my favorite aspects of your personality that I feel like doesn't get exposed very much. But this is truly the most Russian approach to using Operator I can imagine.

Speaker 16 So I can't wait to find out how this went.

Speaker 15 Well, the most Russian approach might have been what I tried just before this, which was to have it go play online poker for me. But

Speaker 15 it did not do it. It said, I can't help with gambling or lottery-related activities.
Okay, woke AI.

Speaker 15 Does the Trump administration know about this?

Speaker 15 But it was able to actually fill out some online surveys for me, and it earned $1.20. Is that right? Yeah, in about 45 minutes.

Speaker 16 So if you had it going all month, presumably you could maybe eke out the $200 to cover the cost of Operator Pro?

Speaker 15 Yes, and I'm sure I spent hundreds of dollars worth of GPU computing power just to be able to make that $1.20. But hey, it worked.
But hey, it worked. So those were some of the things that I tried.

Speaker 15 There were some other things that it just would not do for me, no matter how hard I tried. One of them, so one of them was to,

Speaker 15 I was trying to update my website and put some links to articles that I'd written on my website.

Speaker 15 And what I found after trying to do this was that there are just websites where operator is not allowed to go.

Speaker 15 And so when I said to Operator, go pull down these New York Times articles that I wrote and, you know, put them onto my website, it said, I can't get to the New York Times website.

Speaker 16 I'm going to guess you expected that to happen.

Speaker 15 Well, I thought maybe it has some clever work around and maybe I should alert the lawyers at the New York Times, if that's the case. But no, I assumed that if any website were to be blocking

Speaker 15 the OpenAI web crawlers, it would be the New York Times. Yeah.
But there are other websites that have also put up similar blockades to prevent Operator from crawling them.

Speaker 15 Reddit, you cannot go onto with Operator. YouTube, you cannot go onto with Operator.
Various other websites. GoDaddy, for some reason, did not allow me to use Operator to buy a domain name there.

Speaker 15 So I had to use another domain name site to do that.

Speaker 15 Right now, there are some pretty janky parts of Operator. I would not say that most people would get a lot of value from using it, but what do you think? Well,

Speaker 16 I do think that there is something just undeniably cool about watching a computer use itself. Of course, it can also be quite unsettling.

Speaker 16 A computer that can use itself can cause a lot of harm, but I also think that it can do a lot of good. And so it was fun to try to explore what some of those things could be.

Speaker 16 And to the extent that Operator is pretty bad at a lot of tasks today, I would point out that it showed pretty impressive gains on some benchmarks.

Speaker 16 So there is one benchmark, for example, that Anthropic used when they unveiled computer use last year, and they scored 14.9%

Speaker 16 on something called OS World, which is an evaluation for testing agents. So not great.
Just three months later, OpenAI said that its CUA model scored 38.1% on the same evaluation.

Speaker 16 And of course, we see this all the time in AI where there's just this very rapid progress on these benchmarks. And so on one hand, 38.1% is a failing grade on basically any test.

Speaker 16 On the other hand, if it improves at the same rate over the next three to six months, you're going to have a computer that is very good at using itself, right? So that I just think is worth noting.

Speaker 15 Yes, I think that's plausible. We've obviously seen a lot of different AI products over the last couple of years start out being pretty mediocre and get pretty good within a matter of months.

Speaker 15 But I would give one cautionary note here. And this is actually the reason that I'm not particularly bullish about these kind of browser using AI agents.

Speaker 15 I don't think the internet is going to sit still and allow this to happen.

Speaker 15 The internet is built for humans to use, right?

Speaker 15 Every news publisher that shows ads on their website, for example, prices those ads based on the expectation that humans are actually looking at them.

Speaker 15 But if browser agents start to become more popular and all of a sudden 10 or 20 or 30% of the visitors to your website are not actually humans, but are instead operator or some similar system, I think that starts to break the assumptions that power the economic model of a lot of the internet.

Speaker 16 Now, is that still true if we find that the agents actually get persuaded by the ads? And that if you send operator to buy DoorDash and it sees an ad from McDonald's, it's like, you know what?

Speaker 16 That's a great idea. I'm gonna ask Kevin if he actually wants some of that.

Speaker 15 Totally.

Speaker 15 That's an I actually think you're joking, but I actually think that is a serious possibility here: that people who build e-commerce sites, Amazon, et cetera, start to put in basically signals and messages for browser agents to look at on their website to try to influence what it ends up buying.

Speaker 15 And I think you may start to see restaurants popping up in certain cities with names like operator, pick me, or order from this one, Mr. Bot.
And

Speaker 15 that's maybe a little extreme, but I do think that there's going to be a backlash among websites, publishers, e-commerce vendors as these agents start to take off.

Speaker 16 I think that that is reasonable. I'll tell you what I've been thinking about is how do we turn this tech demo into a real product?

Speaker 16 And the main thing that I noticed when I was testing operator was there is a difference between an agent that is using a browser and an agent that is using your browser.

Speaker 16 When an agent is able to use your browser, which it can't right now, it's already logged into everything. It already has your payment details.

Speaker 16 It can do everything so much faster and more seamlessly and without as much hand-holding.

Speaker 16 Of course, there are also so many more privacy and security risks that would come from entrusting an agent with that kind of information. So there is some sort of chasm there that needs to be closed.

Speaker 16 And I'm not quite sure how anyone does it.

Speaker 16 But I will tell you, I do not think the future is opening up these virtual browsers and me having to enter all of my login and payment details every single time I want to do anything on the internet, because truly I would rather just do it myself.

Speaker 16 Right.

Speaker 15 I also think there's just a lot more potential for harm here. A lot of AI safety experts I've talked to are very worried about this because what you're essentially doing is

Speaker 15 letting the AI models make their own decisions and actually carry out tasks.

Speaker 15 And so you could imagine a world where an AI agent that's very powerful a couple versions from now decides to start doing cyber attacks because maybe some malevolent user has told it to make money and it decides that the best way to do that is by hacking into people's crypto wallets and stealing their crypto.

Speaker 15 Yeah. So

Speaker 15 those are the kinds of reasons that I am a little more skeptical that this represents a big breakthrough.

Speaker 15 But I think it's really interesting and it did give me that feeling of like, wow, this could get really good, really fast. And if it does,

Speaker 15 the world will look very different.

Speaker 16 When we come back, Kevin, back that caboose up. It's time for the Hot Mass Express.

Speaker 15 You know, Roost Caboose was my nickname in middle school.

Speaker 16 Kevin Cabruse.

Speaker 1 In business, they say you can have better, cheaper, or faster, but you only get to pick two.

Speaker 4 What if you could have all three? You can with Oracle Cloud Infrastructure.

Speaker 7 OCI is the blazing fast hyperscaler for your infrastructure, database, application development, and AI needs where you can run any workload for less.

Speaker 10 Compared with other clouds, OCI costs up to 50% less for compute, 70% less for storage, and 80% less for networking.

Speaker 13 Try OCI for free at oracle.com slash nyt.

Speaker 5 Oracle.com/slash nyt.

Speaker 18 This podcast is supported by the all-new 2025 Volkswagen Tiguan.

Speaker 15 A massage chair might seem a bit extravagant, especially these days. Eight different settings, adjustable intensity, plus it's heated, and it just feels so

Speaker 15 good.

Speaker 15 Yes, a massage chair might seem a bit extravagant, but when it can come with a car,

Speaker 15 suddenly it seems quite practical. The all-new 2025 Volkswagen Tiguan, packed with premium features like available massaging front seats, it only feels extravagant.

Speaker 19 This episode is sponsored by Morgan Stanley's Thoughts on the Market. Today's financial markets move fast.
Morgan Stanley moves faster with their daily podcast, Thoughts on the Market.

Speaker 19 Thoughts on the Market covers daily trends across the global investment landscape with actionable insights from Morgan Stanley's leading economists and strategists.

Speaker 19 And with most episodes under five minutes long, staying informed has never been easier. Listen and subscribe to Thoughts on the Market, wherever you get your podcasts.

Speaker 15 Well, Casey, we're here wearing our train conductor hats, and my child's train set is on the table in front of us, which can only mean one thing. We're going to train a large language model.

Speaker 16 Nope, that's not what that means.

Speaker 15 What does it mean? It's time to play a game of the Hot Mess Express.

Speaker 16 Pause for theme song.

Speaker 16 Hot Mess Express, Kevin, is our segment where we run through some of the messiest recent tech stories and deploy our official hot mess thermometer to tell you just how messy we think things have gotten.

Speaker 16 And Kevin, you better sit down for this one because it's been a messy week.

Speaker 15 Sure has. So why don't we go ahead, fire up the hot mass express and see what is the first story yeah i hear that i hear a faint chugga chugga in my headphones

Speaker 16 oh it's pulling into the station casey what's the first cargo that our hot mess express is carrying All right, Kevin, this first story comes to us from the New York Times, and it says that Fable, a book app, has made changes after some offensive AI messages.

Speaker 15 Now, Casey, have you ever heard of Fable, the book app?

Speaker 16 Well, not until this story, Kevin, but I am told that it is is an app for sort of keeping track of what you're reading, not unlike a Goodreads, but also for discussing what you're reading.

Speaker 16 And apparently, this app also offers some AI chat.

Speaker 15 Yeah, you can have AI sort of summarize the things that you're reading in a personalized way.

Speaker 15 And this story said that in addition to spitting out bigoted and racist language, the AI inside Fable's book app had told one reader who had just finished three books by black authors, quote, your journey dives deep into the heart of black narratives and transformative tales, leaving mainstream stories gasping for air.

Speaker 15 Don't forget to surface for the occasional white author, okay?

Speaker 15 And another personalized AI summary that Fable Produced told another reader that their book choices were, quote, making me wonder if you're ever in the mood for a straight cis white man's perspective.

Speaker 16 And if you are interested in a straight cis white man's perspective, follow Kevin Roos on X.com.

Speaker 15 Now, Kevin, why do we think this happened?

Speaker 15 I don't know, Casey.

Speaker 15 This is a head scratcher for me.

Speaker 15 I mean, we know that these apps

Speaker 15 can spit out biased things. That is just sort of like part of how they are trained and part of what we know about them.
I don't know what model Fable was using under the hood here.

Speaker 15 But yeah, this seems not great.

Speaker 16 Well, it seems like we've learned a lesson that we've learned more than once before, which is that large language models are trained on the internet, which contains near-infinite racism.

Speaker 16 And for that reason, it will actually produce racism when you ask it questions yeah so there are mitigations that you can take against that but it appears that in this case they were not successful fable's head of community kim marsh alley has said that all features using ai are being removed from the app and a new app version is being submitted to the app store so uh you always hate it when um the first time you hear about an app is that they added ai and it made it super racist and they have to redo the app.

Speaker 15 Now, Casey, one more question before we move on. Do you think this poses any sort of competitive threat to Grok, which until this story was the leading racist AI app on the market? I do think so.

Speaker 16 And I have to admit that all the folks over at Grok are breathing a sigh of relief now that they have once again claimed the mantle.

Speaker 15 All right. Casey, how hot is this mess?

Speaker 16 Well, Kevin, in my opinion, if your AI is so bad that you have to remove it from the app completely, that's a hot mess.

Speaker 15 Yeah, I rate this one a hot mess as well. All right, next up

Speaker 15 Amazon pauses drone deliveries after aircraft crashed in rain. Casey, this story comes to us from Bloomberg,

Speaker 15 which had

Speaker 15 a different line of reporting than we did just a few weeks ago on the show about Amazon's drone program, Prime Air. Casey, what happened to Amazon Prime Air?

Speaker 16 Well, if you heard the episode of Hard Fork where we talked about it, Amazon Prime Air delivered us some Brazilian bum bum cream and it did so without incident.

Speaker 16 However, Bloomberg reports that Amazon has had to now pause all of their commercial drone deliveries after two of its latest models crashed in rainy weather at a testing facility.

Speaker 16 And so the company says it is immediately suspending drone deliveries in Texas and Arizona and will now fix the aircraft software. Kevin, how did you react to this?

Speaker 15 Well, I think it's good that they're suspending drone deliveries before they fix the software because these things are quite heavy, Casey. I would not want one of them to fall in my head.

Speaker 15 I wouldn't either.

Speaker 16 And I have to tell you, this story gave me the worst kind of flashbacks because in 2016, I wrote about Facebook's drone Aquila and its first, what the company told me had been its first successful test flight in its mission to deliver internet around the world via drone.

Speaker 16 What the company did not tell me when I was interviewing its executives, including Mark Zuckerberg, was that the plane had crashed after that first flight. And so I was...
A small detail.

Speaker 15 I'm sure it was an innocent omission from their briefing.

Speaker 16 Yes, I'm sure. Well, it was Bloomberg again who reported, you know, a couple months after I wrote this story that the Facebook drone had crashed.

Speaker 16 I was, of course, hugely embarrassed and, you know, wrote a bunch of stories about this.

Speaker 16 But, anyways, it really should have occurred to me when we were out there watching the Amazon drone that this thing was also probably secretly crashing and we just hadn't found out about it yet.

Speaker 16 And indeed, we now learn it is. So here is my vow to you, Kevin, as my friend and my co-host.
If we ever see a company fly anything again, we have to ask them. Now, did this thing actually crash?

Speaker 15 I'm tired of being burned. Now, Casey, we should say, according to Bloomberg, these drones reportedly crash in December.
We visited Arizona to see them in very early December.

Speaker 15 So most likely, you know, this all happened after we saw them.

Speaker 15 But I think it's a good idea to keep in mind that as we're talking about these new and experimental technologies, that many of them are still having the kinks worked out.

Speaker 16 All right, Kevin, so let's get out the thermometer.

Speaker 16 How hot of a mess is this?

Speaker 15 I would say this is a moderate mess. Look, these are still testing programs.
No one was hurt during these tests. I am glad that Bloomberg reported on this.

Speaker 15 I'm glad that they've suspended the deliveries. These things could be quite dangerous flying through the air.
I do think it's one of a string of reported incidents with these drones.

Speaker 15 So I think they've got some quality control work ahead of them, and I hope they do well on it because I want these things to exist in the world and be safe for people around them. All right.

Speaker 16 I will agree with you and say that this is a warm mess and hopefully it can get straightened out over there. Let's see what else is coming down the tracks.

Speaker 16 Wow, this is some tough news. Fitbit has agreed to pay $12 million for not quickly reporting burn risk with watches.
Kevin, do you hear about this?

Speaker 15 I did. This was the Fitbit.
devices were like literally burning people.

Speaker 16 Yes, from 2018 to March of 2022, Fitbit received at least 174 reports globally of the lithium-ion battery in the Fitbit Ionic Watch overheating, leading to 118 reported injuries, including two cases of third-degree burns and four of second-degree burns.

Speaker 16 That comes from the New York Times Adil Hassan. Kevin, I thought these things were just supposed to burn calories.

Speaker 15 Well, it's like I always say, exercising is very dangerous and you should never do it.

Speaker 15 And this justifies my decision not to wear a Fitbit.

Speaker 16 To me, the biggest surprise of this story was that people were wearing Fitbits from March 2018 to 2022.

Speaker 16 I thought every Fitbit had been purchased by like 2011 and then put in a drawer never to be heard again.

Speaker 16 So what is going on with these sort of, you know, late-stage Fitbit buyers, I'd love to find out. But of course, we feel terrible for everyone who was burned by a Fitbit.

Speaker 16 And it's not going to be the last time technology burns you.

Speaker 16 I mean, realistically.

Speaker 15 That's true. You know, it's true.

Speaker 16 Now, what kind of mess is this?

Speaker 15 I would say this is a hot mess. This is an officially hot, literally hot.
They're hot.

Speaker 16 Here's my sort of rubric. If technology physically burns you, it is a hot mess.

Speaker 16 If you have physical burns on your body, what other kind of mess could it be? It's true. That's a hot mess.

Speaker 15 Okay, next stop on the Hot Mess Express. Google says it will change Gulf of Mexico to Gulf of America in Maps app after government updates.
Casey, have you been following this story?

Speaker 16 I have, Kevin. Every morning when I wake up, I scan America's maps and I say, what has been changed? And if so, has it been changed for political reasons?

Speaker 16 And this was probably one of the biggest examples of that we've seen.

Speaker 15 Yeah, so this was an interesting story that came out in the past couple of days.

Speaker 15 Basically, after Donald Trump came out during his first days in office and said that he was changing the name of the Gulf of Mexico to the Gulf of America and the name of Denali, the mountain in Alaska, to Mount McKinley, McKinley.

Speaker 15 Google had to decide: well, when you go on Google Maps and look for those places, what should it call them?

Speaker 15 It seems to be saying that it is going to take inspiration from the Trump administration and update the names of these places in the Maps app.

Speaker 16 Yeah. And look, I don't think Google really had a choice here.
We know that the company has been on Donald Trump's bad side for a while.

Speaker 16 And if it had simply refused to make these changes, it would have sort of caused a whole new controversy for them.

Speaker 16 And it is true that the company changes place names when governments change place names, right?

Speaker 16 Like Google Maps existed when Mount McKinley was called Mount McKinley and President Obama changed it to Denali and Google updated the map. Now it's changed back.
They're doing the same thing.

Speaker 16 But now that we know how compliant Google is, Kevin, I think there's room for Donald Trump to have a lot of fun with the company.

Speaker 15 Yeah, what could he do?

Speaker 16 Well, he could call it the Gulf of Gemini isn't very good and just see what would happen because they would kind of have to just change it.

Speaker 16 Can you imagine every time you opened up Google Maps and you looked at the Gulf of Mexico slash America and it just said the Gulf of Gebani is not very good?

Speaker 16 I, you know, I hate to give Donald Trump any ideas, but

Speaker 16 people are looking at. So what kind of mess do you think this is, Kevin?

Speaker 15 I think this is a mild mess. I think this is a tempest in a teapot.
I think that this is the kind of update that, you know, companies make all the time. Because places change names all the time.

Speaker 15 Let's just say it.

Speaker 16 Well, Kevin, I guess I would say that one is a hot mess because if we're just going to start renaming everything on the map, that's just going to get extremely confusing for me to follow.

Speaker 16 I got places to go.

Speaker 15 You go to like three places.

Speaker 16 Yeah, and I use Google Maps to get there. And I need them to be named the same thing that they were yesterday.

Speaker 15 I don't think they're going to change the name of Barry's boot camp.

Speaker 15 All right. Final stop on the Hot Mess Express.
Casey, bring us home.

Speaker 15 All right, Kevin.

Speaker 16 Oh, and this is some sad news. Another Waymo was vandalized.
This is from one-time hard fork guest, Andrew J. Hawkins at The Verge.

Speaker 16 He reports that this Waymo was vandalized during an illegal street takeover near the Beverly Center in L.A.

Speaker 16 Video from Fox 11 shows a crowd of people basically dismantling the driverless car piece by piece and then using the broken pieces to smash the windows. Kevin, what did you make of this?

Speaker 15 Well, Casey, as you recall, you predicted that in 2025, Waymo would go mainstream.

Speaker 15 and I think there is no better proof that that is true than that people are turning on the Waymos and starting to beat them up.

Speaker 16 Yeah, I, you know, look, I don't know that we have heard any interviews from why these people were doing this.

Speaker 16 I don't know if we should see this as like a reaction against AI in general or of Waymos specifically.

Speaker 16 But I always find it like weird and sad when people attack Waymos because they truly are safer cars than every other car.

Speaker 15 Well, not if you're going to be riding in them and people are just going to start like beating the car, then they're not safer.

Speaker 16 No, but you know, that's only happened a couple of times that we're aware of.

Speaker 15 Right. Yeah.
So yeah, this story is sad to me. Obviously, people are reacting to Waymos.
Maybe they have sort of fears about this technology or think it's going to take jobs or maybe they're just.

Speaker 15 pissed off and they want to break something. But don't hurt the Waymos people in part because they will remember.

Speaker 15 They will remember.

Speaker 15 They will remember and they will come for you.

Speaker 16 I'm not sure that that's true, but I think we should also note that Waymo only became officially available in LA in November of last year.

Speaker 16 And so part of this just might be a reaction to the newness of it all and people getting a little carried away, just sort of curious, what will happen if we try to,

Speaker 16 you know, destroy this thing? Will it deploy defensive measures and so on?

Speaker 15 So they're going to have to put flamethrowers flamethrowers on them. I'm just calling it right now.

Speaker 16 I really hope that doesn't happen. But yeah, well, what kind of mess do you think this one was?

Speaker 15 I think this one

Speaker 15 is a lukewarm mess that has the potential to escalate. I don't want this to happen.
I sincerely hope this does not happen, but I can see as Waymos start

Speaker 15 being rolled out across the country that some people are just going to lose their minds.

Speaker 15 Some people are going to see this as the physical embodiment of technology invading every corner of our lives, and they are just going to react in strong and occasionally destructive ways.

Speaker 15 I'm sure that Waymo has gamed this all out. I'm sure that this does not surprise them.

Speaker 15 I know that they have been asked about what happens, you know, if Waymos start getting vandalized, and they presumably have plans to deal with that, including prosecuting the people who are doing this.

Speaker 15 But yeah, I always go out of my way. to try to be nice to Waymos.

Speaker 15 And in fact, some other Waymo news this week, Jane Manchin Wong, the security researcher, reported on X recently that Waymo is introducing or at least testing a tipping feature.

Speaker 15 And so I'm going to start tipping my Waymo just to make up for all the jerks in LA who are vandalizing them.

Speaker 16 It looks like the tipping feature, by the way, will be to tip a charity and that Waymo will not keep that money. At least that's what's been reported.

Speaker 15 No, I think it's going to the flamethrower fund.

Speaker 15 All right, Casey, that is the Hot Mess Express.

Speaker 15 Thank you for taking this journey with me.

Speaker 1 In business, they say you can have better, cheaper, or faster, but you only get to pick two.

Speaker 4 What if you could have all three? You can with Oracle Cloud Infrastructure.

Speaker 7 OCI is the blazing fast hyperscaler for your infrastructure, database, application development, and AI needs where you can run any workload for less.

Speaker 10 Compared with other clouds, OCI costs up to 50% less for compute, 70% less for storage, and 80% less for networking.

Speaker 13 Try OCI for free at oracle.com/slash nyt.

Speaker 14 Oracle.com/slash nyt.

Speaker 18 This podcast is supported by the all-new 2025 Volkswagen Tiguan.

Speaker 15 A massage chair might seem a bit extravagant, especially these days. Eight different settings, adjustable intensity, plus it's heated, and it just feels so good.

Speaker 15 Yes, a massage chair might seem a bit extravagant, but when it can come with a car,

Speaker 15 suddenly it seems quite practical. The all-new 2025 Volkswagen Tiguan, packed with premium features like available massaging front seats, It only feels extravagant.

Speaker 20 JP Morgan Payments helps you drive efficiency with automated payments and intelligent algorithms across 200 countries and territories. That's automation-driven finance.
That's JPMorgan Payments.

Speaker 21 JP Morgan, Internal Data 2024, Copyright 2025, J.P. Morgan Chase Company, All Rights Reserved, JP Morgan Chase Bank, and a member, FDIC.

Speaker 22 Deposits held in non-U.S. branches are not FDIC insured.
Non-deposit products are not FDIC insured.

Speaker 21 This is not a legal commitment for credit or services. Availability varies.
Eligibility determined by J.P. Morgan Chase.
Visit jpmorgan.com/slash payments disclosure for details.

Speaker 16 Part Fork is produced by Rachel Cohn and Whitney Jones. We're edited this week by Rachel Dry and fact-checked by Ina Alvarado.
Today's show was engineered by Dan Powell.

Speaker 16 Original music by Diane Wong and Dan Powell. Our executive producer is Jen Poyon.
Our audience editor is Nelga Logley. Video production by Ryan Manning and Chris Schott.

Speaker 16 You can watch this whole episode on YouTube at youtube.com/slash heartfork. Special thanks to Paula Schuman, Puy Wing Tam, Dahlia Haddad, and Jeffrey Miranda.

Speaker 16 You can email us at hardfork at nytimes.com with what you're calling the Gulf of Mexico.

Speaker 20 There used to be very little visibility and control in treasury. Today, JPMorgan Payments delivers real-time dashboards and control at your fingertips.

Speaker 22 That's the power of clarity.

Speaker 20 That's JPMorgan Payments.

Speaker 21 Copyright 2025, JPMorgan Chase Company, All Rights Reserve, JPMorgan Chase Bank, and a member, FDIC.

Speaker 22 Deposits held in non-U.S. branches are not FDIC insured.
Non-deposit products are not FDIC insured.

Speaker 21 This is not a legal commitment for credit or services. Availability varies.
Eligibility determined by J.P. Morgan Chase.
Visit jpmorgan.com/slash payments disclosure for details.