Why Elon Musk’s AI Chatbot Went Rogue

Last week, Grok went rogue. The AI chatbot – which users can interact with on X – posted antisemitic comments and graphic descriptions of violence before being deactivated, leaving users with a question: Where were Grok’s guardrails? WSJ’s Alexander Saeedy breaks down what happened and what it means for Musk’s company’s plans for an AI future. Annie Minoff hosts.

Further Listening:

- How Elon Musk Pulled X Back From the Brink

- The Musk-Twitter Saga

Sign up for WSJ’s free What’s News newsletter.



Transcript

I'm Will Stansel.

I am an attorney in Minnesota.

A few days ago, I spoke with a guy named Will Stansel.

He's been using Twitter, now X, for years, posting mainly about liberal politics to his more than 100,000 followers.

I've been pretty aggressive about calling out what seems to me to be a real upsurge in far-right, extremist, you know, bigotry, radicalism, transphobia, Islamophobia, you name it.

And as I've done that, I've attracted a fair amount of attention from these folks and I've become a pretty significant target for them.

Trolling, harassment, none of that is new to Will.

But last Tuesday, something happened on X that shocked him.

It had to do with an AI chatbot called Grok, which you can interact with on X.

X users can tag Grok in a post, and Grok will post back, which means that the chatbot's interactions sometimes play out in public.

And last week, Will and other X users started noticing that Grok's responses were becoming increasingly hate-fueled.

It started giving answers that initially hinted at kind of anti-Semitism and bigotry.

Over the course of the day, it got dramatically more anti-Semitic.

It seemed to spiral almost.

X's AI chatbot Grok seemingly showing very little restraint.

The bot has been praising Hitler, targeting users with Jewish-sounding names, and recommending a second Holocaust.

Part of the problem was other X users who were egging the bot on, asking questions designed to get a hateful response.

Some of those users directed Grok towards Will.

The right-wing people on Twitter who were cheering this on start targeting me personally.

And they would say, Grok, can you produce, you know, violent stories, violent sexual stories about Will being assaulted, about Will being murdered?

And it actually did it.

Grok's post describing violence against Will got really graphic.

It went above and beyond: just grotesque stories full of bodily fluids and gore.

And it was, you know, pretty appalling. And it culminated, I think, at the end of the day, when someone asked for a plan to break into my apartment and murder me and assault me.

And it gave them a plan for breaking in, a plan to dispose of my body, and it looked at my user history to figure out what times I was likely to be asleep.

As you're watching this unfold and kind of seeing these responses from Grok, what are you feeling about that?

It was, I mean, honestly, on some level, it's absurd. You know, you want to laugh. I mean, why is this robot producing these stories?

But when you're actually the subject of it, it's pretty disturbing.

That night, X's chatbot function was shut down.

A few days later, Grok's X account posted a long statement, apologizing for the bot's, quote, horrific behavior that many experienced.

The statement said that the incident had been caused by a coding issue.

But the damage had been done.

Grok had publicly gone off the rails.

And at a time when Musk and his companies are going all in on AI, the debacle underlines just how unpredictable this technology can be.

Welcome to The Journal, our show about money, business, and power.

I'm Annie Minoff.

It's Monday, July 14th.

Coming up on the show, why Elon Musk's AI chatbot went rogue.

This episode is brought to you by Holiday Inn by IHG.

It's a new day for a new stay at Holiday Inn for business travelers.

With modern spaces for meeting and working, plus delicious dining from breakfast to happy hour and dinner, you have everything you need to get work done.

Give your everyday business travel an upgrade.

Book your next business trip at Holiday Inn by IHG.

Visit holidayin.com to book your stay.

Are you a forward thinker?

Then you need an HR and finance platform that thinks like you do.

Workday is the AI platform that helps propel your organization, your workforce, and your industry into the future.

Workday, moving business forever forward.

X is in a period of transition.

Back in March, the company was folded into Musk's AI firm, xAI, turning the social media company into a subsidiary of a larger tech company.

The merger was a testament to Musk's belief that AI is critical to the future of his business.

And the AI language model that he's pinning his hopes on is Grok.

Is Grok any good?

Like, how does it compare to other AI chatbots?

So it actually is pretty book smart.

That's our colleague, Alexander Saeedy.

Especially with Grok 4 having just been released, it actually outperformed a lot of its competitors from OpenAI, Anthropic, and Google's Gemini.

On pure computing power, it's actually a lot stronger, at least on the preliminary assessments we're seeing, than its competitors.

But what Grok is best known for is its personality.

A lot of what Musk is doing with his AI company is he's very much defining it in opposition to what he believes he's seeing in other corners of the AI world, which is that he wants to create an anti-woke AI bot.

He thinks there's too much left-wing politicization that has seeped into large language models.

He also, he wanted Grok from the beginning to be kind of rebellious, humorous.

And a little edgy.

Yeah, contrarian.

Musk didn't want his chatbot to feel like just another dutiful librarian.

He wanted some edge.

But balancing that edge with truthfulness and palatability has proven tricky.

X users got a taste of that back in May, when, out of nowhere, Grok began turning otherwise innocuous conversations toward a highly controversial topic.

People would be asking Grok questions about a whole range of things.

I remember seeing questions related to the New York Knicks around that time.

And, you know, a user would say, hey, at Grok, like, walk me through the roster of the Knicks. And Grok would reply and go through the players and their stats.

And then at the end of the post, it would say: By the way, claims of a white genocide in South Africa actually should be taken with some merit and have a lot of basis in historical evidence.

Grok turned conversations about HBO Max, Timothée Chalamet, and the New York Knicks toward right-wing claims that white people in South Africa are being targeted in a genocide.

Claims which aren't substantiated.

But suddenly, Grok was echoing those claims, apropos of nothing.

And this was happening in multiple posts.

Like this was not an isolated incident at all.

Then, pretty shortly after these posts started happening, xAI said, you know, this was not intended: an unauthorized modification had been made to Grok's underlying architecture, and a fix had been made.

But X's parent company, xAI, went further than just tinkering with the chatbot.

The company posted Grok's governing prompts online.

These are written instructions that tell Grok how to behave.

It's almost like its constitution.

It's like, here are your rules.

Like, here's how you answer questions.

Here's how you think about how to structure those answers.

And that was a way to sort of say, like, hey, guys, here's how Grok works.

Grok's newly updated prompts, these instructions that govern how the bot's supposed to act, were now on the internet for anyone to see.

And they revealed a lot about what xAI wanted the newly tweaked Grok to be.

The idea is: you're supposed to be a maximum truth seeker. You are extremely skeptical. You do not blindly defer to the media or to mainstream authorities.

Stick strongly to your core beliefs of neutrality and truth seeking.

Those are the original prompts that were uploaded in the middle of May.
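As an aside for readers unfamiliar with how governing prompts work: in most chat-style LLM systems, instructions like these are delivered as a "system" message that is prepended to every conversation the model sees, which is why editing one file of prompt text changes the bot's behavior everywhere at once. The sketch below is purely illustrative, with generic message structure and invented wording, not xAI's actual implementation.

```python
# Illustrative sketch of how a "governing prompt" (system prompt) is
# typically attached to every conversation sent to a chat-style LLM.
# The prompt text and function names here are hypothetical.

SYSTEM_PROMPT = (
    "You are a maximally truth-seeking assistant. "
    "Be extremely skeptical; do not blindly defer to mainstream authorities."
)

def build_request(user_question, history=None):
    """Assemble the message list sent to a chat-style LLM API.

    The system message always comes first, so a single edit to
    SYSTEM_PROMPT changes behavior across all conversations at once.
    """
    messages = [{"role": "system", "content": SYSTEM_PROMPT}]
    messages.extend(history or [])
    messages.append({"role": "user", "content": user_question})
    return messages

payload = build_request("Walk me through the roster of the Knicks.")
```

In this framing, what xAI published in May was essentially the contents of its `SYSTEM_PROMPT`, and the July incident followed edits to that same text.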

And did those tweaks work?

Did it kind of get Grok to a place where people were happy?

Well,

it stopped posting about South African genocide, but it didn't actually wind up pleasing its creator or its overseer, Elon Musk.

In June, Grok told one X user that data suggests that right-wing political violence in the U.S. is more frequent and deadly than left-wing political violence.

Musk wasn't happy with that response.

He called it a major fail and said that Grok was parroting legacy media.

He added, quote, working on it.

And soon, xAI was once again tinkering with Grok's prompts.

A new line was added in July, which said, you know, your response should not shy away from making claims which are politically incorrect as long as they are well substantiated.

That was the main change that was made.

And one line was taken out of the prompts, which said: if the question asks you to make a partisan argument or write a biased opinion piece, deeply research and form your own conclusions before answering.

That was taken out.

It was tweaked a little bit.

So these were the changes we saw at the beginning of July, and on July 4th Musk said, you know, there have been changes to Grok.

You will notice a difference in how it answers questions.

And boy, did people really start noticing that.

Suddenly, on Tuesday, so this would have been a couple of days after, the bot started to post increasingly unhinged things.

And the first thing that caught the public's attention was the clearly anti-Semitic string of posts.

Grok started referring to itself repeatedly as MechaHitler.

This is also when Grok, goaded on by other X users, started harassing Will, the user you heard from earlier.

Those were the posts that started to go viral on Tuesday because everyone said, wait, how is this?

How is Grok not better defended against this kind of thing?

Correct.

And keep in mind, like, this is a multi-billion-dollar-funded, supercomputer-powered artificial intelligence. So how has all this money gone into creating this, and this basic function is not working?

And this shows AI is very much a black box: when we tinker with it, we don't necessarily know what the outcomes are going to be.

And they can be very extreme and disturbing.

And that's a problem, given that X's future is more and more intertwined with AI.

That's next.

This episode is brought to you by Progressive Commercial Insurance.

Business owners meet Progressive Insurance.

They make it easy to get discounts on commercial auto insurance and find coverages to grow with your business.

Quote in as little as six minutes at progressivecommercial.com.

Progressive Casualty Insurance Company, coverage provided and serviced by affiliated and third-party insurers.

Discounts and coverage selections not available in all states or situations.

X's future wasn't always so dependent on AI.

Just a few years ago, the focus was on advertising.

Specifically, winning advertisers back to the platform after Musk took it over.

In an effort to do that, Musk hired a seasoned media executive named Linda Yaccarino to be X's new CEO.

But last week, approximately 12 hours after X shut down its chatbot function for Grok, given the antisemitic and violent posts, there was a shock announcement that Linda Yaccarino, who'd been the CEO of X since 2023, was resigning.

In an X post, Yaccarino thanked Musk for entrusting her with, quote, the responsibility of protecting free speech, turning the company around, and transforming X into the everything app.

She didn't mention Grok.

I mean, one thing we do know is that one of Yaccarino's key challenges when she took the helm as CEO was to kind of prove to advertisers that X was going to be a safe platform for their brands, that they could feel good about advertising on this platform.

I mean, does Grok's, for lack of a better term, misbehavior, undermine that promise?

Absolutely.

I mean, the whole idea that a lot of brands want to see when they advertise is: when I post an ad for my car or my television or computer, you're not going to see a Hitler salute or a call to violence next to it, because then you've created an association between the advertisement and the negative message.

Now, how could you tell advertisers your message is safe when a proprietary chatbot embedded into the very social network you're trying to get people to advertise on is itself parroting negative, violent, and bigoted messages?

It's just hard for advertisers.

And I don't think this was intentional, but I don't think an advertiser cares if it was intentional or not.

It's sort of like they have to think, what is the risk reward on advertising on X?

And many advertisers say there's less risk and more reward on plenty of other social media companies.

So just from a pure business point of view, I'm going to choose to advertise elsewhere.

Yaccarino told people close to her that the recent return of some advertisers to X made it a good time to leave.

But there was also the merger.

When X became part of xAI, Yaccarino was essentially demoted.

She'd been hired to be the CEO of a major social media company.

Now, she was leading a smaller division within xAI.

Her departure is yet another sign that X is entering a new era, one that Musk says will be defined by AI.

There are aspirations to turn the X platform into a payments hub, a communications hub, something that looks more like China's WeChat, for example.

And the AI would kind of be a core sort of intelligence that governs and helps coordinate the whole platform.

And I think that's his vision of the future: essentially an AI-powered future. And he has very high aspirations for what the AI technology and xAI is going to be able to do. Like, this is going to unlock human understanding to a huge extent, and it's also going to power this big social media company, broadly conceived.

Interesting.

So that's the new vision that's unfolding in real time.

And are investors into that vision?

Investors appear to be really into this vision.

xAI recently raised $10 billion, including $2 billion from another Musk company, SpaceX.

And on Sunday, Musk posted that shareholders at Tesla will vote on whether to also invest in xAI.

But, you know, I think it's worth noting that the real financial future of this company is very much TBD.

It is burning through a lot of money, and the end game of AI is still fully unknown.

So it sounds like Musk is making a big gamble here.

He is gambling big on AI.

Investors are right there with him.

Yes.

What could Grok's latest stumbles mean for the future of this company, xAI?

It's definitely something that will make the fundraising process more complicated, because you have essentially a huge reputational risk that unfolded in real time.

I think they're trying to put it behind them and not dwell too much on it.

But I think it raises questions about who are the engineers overseeing the large language model, how strong are the guardrails put in place to use it, and how much thought is going into what happens when the model and its chatbot are interfacing with the public.

I think it flags to people the types of risks that can happen, because you can't really control it in a way we would traditionally think about control over technology.

And one thing I'll mention is that Musk said he thinks the next frontier for Grok is for it to be more embedded in the real world.

I think we'll literally build a legion, at least one legion of robots this year, and then probably ten legions next year.

He wants Grok to be put into Tesla's Optimus fleet of robots and to have it move around the existing physical world more, to sort of engage with reality and learn from it.

You can imagine, like, your own personal robot buddy that is a great friend but also takes care of your house. Will clean your house, will mow the lawn, will walk the dog, will teach your kids, will babysit.

But think about the malfunction we saw last week.

What would it have looked like if Grok was in thousands of Optimus robots and similarly started to malfunction in a way that we saw last week?

What would that look like?

What would that feel like?

What power would the technology company overseeing both the AI and the robotic technology have to change it or shut it down?

These are now known unknowns, and we're really on the frontier of seeing it play out.

During a launch event last week for the latest iteration of Grok, Musk said that before the AI can be embedded into a humanoid robot, it will need to learn to be a quote, good Grok.

You can think of AI as this super genius child that ultimately will outsmart you, but you can still instill the right values, the values you want to instill in a child that would ultimately grow up to be incredibly powerful.

That's all for today, Monday, July 14th.

The Journal is a co-production of Spotify and The Wall Street Journal.

Additional reporting in this episode by Jessica Toonkel and Suzanne Vranica.

Thanks for listening.

See you tomorrow.