
Leopold Aschenbrenner - China/US Super Intelligence Race, 2027 AGI, & The Return of History
Chatted with my friend Leopold Aschenbrenner on the trillion dollar nationalized cluster, CCP espionage at AI labs, how unhobblings and scaling can lead to 2027 AGI, dangers of outsourcing clusters to Middle East, leaving OpenAI, and situational awareness.
Watch on YouTube. Listen on Apple Podcasts, Spotify, or any other podcast platform. Read the full transcript here.
Follow me on Twitter for updates on future episodes. Follow Leopold on Twitter.
Timestamps
(00:00:00) – The trillion-dollar cluster and unhobbling
(00:20:31) – AI 2028: The return of history
(00:40:26) – Espionage & American AI superiority
(01:08:20) – Geopolitical implications of AI
(01:31:23) – State-led vs. private-led AI
(02:12:23) – Becoming Valedictorian of Columbia at 19
(02:30:35) – What happened at OpenAI
(02:45:11) – Accelerating AI research progress
(03:25:58) – Alignment
(03:41:26) – On Germany, and understanding foreign perspectives
(03:57:04) – Dwarkesh’s immigration story and path to the podcast
(04:07:58) – Launching an AGI hedge fund
(04:19:14) – Lessons from WWII
(04:29:08) – Coda: Frederick the Great
Get full access to Dwarkesh Podcast at www.dwarkesh.com/subscribe
Listen and Follow Along
Full Transcript
Okay, today I'm chatting with my friend Leopold Aschenbrenner.
He grew up in Germany, graduated valedictorian of Columbia when he was 19, and then he had a very interesting gap year, which we'll talk about. And then he was on the OpenAI super alignment team, may it rest in peace.
And now he, with some anchor investments from Patrick and John Collison and Daniel Gross and Nat Friedman, is launching an investment firm. So, Leopold, I know you're off to a slow start, but life is long and I wouldn't worry about it too much. You'll make up for it in due time.
But thanks for coming on the podcast. Thank you.
You know, I first discovered your podcast when your best episode had, you know, like a couple hundred views. And so it's just been, it's been amazing to follow your trajectory and it's a delight to be on.
Yeah. Yeah.
Well, I think in the Sholto and Trenton episode, I mentioned that a lot of the things I've learned about AI, I've learned from talking with them. And the third part of this triumvirate, probably the most significant in terms of the things that I've learned about AI, has been you.
We'll get a lot of the stuff on the record now. Great.
Okay, first thing to get on record. Tell me about the trillion dollar cluster.
By the way, I should mention, so the context of this podcast is today, you're releasing a series called Situational Awareness. We're going to get into it.
First question about that is tell me about the trillion dollar cluster. Yeah, so unlike basically most things that have come out of Silicon Valley recently, you know, AI is kind of this industrial process.
You know, the next model doesn't just require, you know, some code. It's building a giant new cluster.
You know, now it's building giant new power plants. You know, pretty soon it's going to be building giant new fabs.
And, you know, since ChatGPT, this kind of extraordinary sort of techno capital acceleration has been set into motion. I mean, basically, you know, exactly a year ago today, you know, NVIDIA had their first kind of blockbuster earnings call, right? Where it like went up 25% after hours and everyone was like, oh, my God, AI, it's a thing.
You know, I mean, I think within a year, you know, NVIDIA data center revenue has gone from like, you know, a few billion a quarter to like, you know, 20, 25 billion a quarter now and, you know, continue to go up like, you know, big tech CapEx is skyrocketing. And, you know, it's funny because it's both there's this sort of this kind of crazy scramble going on.
But in some sense, it's just the continuation of straight lines on a graph, right? There's this long-run trend, basically almost a decade, of the training compute of the largest AI systems growing by about half an order of magnitude, 0.5 OOMs, a year. And you can just play that forward. So GPT-4, rumored or reported to have finished pre-training in 2022: the cluster size there was rumored to be about 25,000 A100s, per SemiAnalysis. If you do the math on that, it's maybe a $500 million cluster, very roughly 10 megawatts.
And just play that forward, half an OOM a year. So 2024, that's a cluster that's 100 megawatts, about 100,000 H100 equivalents, and it costs in the billions. Play it forward two more years, 2026, and that's a gigawatt cluster. That's the size of a large nuclear reactor, like the power of the Hoover Dam, it costs tens of billions of dollars, and it's about a million H100 equivalents. 2028, that's a 10 gigawatt cluster, more power than most U.S. states use, like 10 million H100 equivalents, costing hundreds of billions of dollars. And then 2030, the trillion-dollar cluster: 100 gigawatts, over 20% of U.S. electricity production, 100 million H100 equivalents. And that's just the training cluster, right? That's the one largest training cluster. And then there are more inference GPUs as well. Once there are products, most of the GPUs are going to be inference GPUs. And U.S. power production has barely grown for decades. Now we're really in for a ride.
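A minimal back-of-envelope sketch, in Python, of the scaling trend described above. The anchor figures (a roughly $500M, ~10 MW, ~10,000-H100-equivalent GPT-4-era cluster in 2022, growing half an order of magnitude per year) are the rough numbers quoted in the conversation, not official data; dollar costs are listed as quoted, since cost per FLOP also falls over time:

```python
# Back-of-envelope sketch of "half an order of magnitude (~3.16x) per year"
# cluster growth, anchored on the rough GPT-4-era figures quoted above
# (~2022: ~10 MW, ~10k H100-equivalents). Illustrative only, not a forecast.
HALF_OOM_PER_YEAR = 10 ** 0.5   # ~3.16x growth per year

BASE_YEAR, BASE_POWER_MW, BASE_H100E = 2022, 10, 1e4
QUOTED_COST = {2022: "~$500M", 2024: "billions", 2026: "tens of billions",
               2028: "hundreds of billions", 2030: "~$1T"}

for year in (2022, 2024, 2026, 2028, 2030):
    growth = HALF_OOM_PER_YEAR ** (year - BASE_YEAR)
    power_gw = BASE_POWER_MW * growth / 1000
    h100e = BASE_H100E * growth
    print(f"{year}: ~{power_gw:g} GW, ~{h100e:,.0f} H100-equivalents, "
          f"cost {QUOTED_COST[year]}")
```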
So when I had Zuck on the podcast, he was claiming not a plateau per se, but that AI progress would be bottlenecked specifically by this constraint on energy, like, for gigawatt data centers you're going to have to build another Three Gorges Dam or something. I know that there are companies, according to public reports, planning things on the scale of a gigawatt data center. But a 10 gigawatt data center, who's going to be able to build that? A 100 gigawatt center, like a state's worth of power, are you going to pump that into one physical data center? How is this going to be possible? What is Zuck missing?

I mean, you know, I don't know. Six months ago, 10 gigawatts was the talk of the town. I feel like now people have moved on; 10 gigawatts is happening. There's The Information report on OpenAI and Microsoft planning a $100 billion cluster. So is that the gigawatt one or the 10 gigawatt one? I don't know, but if you try to map out how expensive the 10 gigawatt cluster would be, that's maybe a couple hundred billion, so it's on that scale. And they're planning it, they're working on it. So it's not just my crazy take. AMD forecasted a $400 billion AI accelerator market by 2027, and AI accelerators are only part of the expenditures.
I think a trillion dollars of total AI investment by 2027 is something we're very much on track for. I think the trillion-dollar cluster is going to take a bit more acceleration.
But we saw how much ChatGPT unleashed, right? And so every generation, the models are going to be kind of crazy, and it's going to shift the Overton window. And then, obviously, the revenue comes in. These are forward-looking investments; the question is, do they pay off? So we estimated the GPT-4 cluster at around $500 million. By the way, that's a common mistake people make: people say $100 million, but that's just the rental price. They're like, ah, you rent the cluster for three months. But if you're building the biggest cluster, you've got to build the whole cluster. You've got to pay for the whole cluster. You can't just rent it for three months.
But I mean, really, you know,
once you're trying to get into this sort of
hundreds of billions, eventually you gotta get to like
100 billion a year revenue.
I think this is where it gets really interesting
for the big tech companies, right?
Because their revenues are on the order of hundreds of billions, right?
So it's like 10 billion fine, you know,
and it'll pay off the, you know,
2024-size training cluster. But really, where it'll be gangbusters for big tech is at $100 billion a year. So the question is how feasible $100 billion a year of AI revenue is. It's a lot more than right now, but if you believe in the trajectory of the AI systems as I do, which we'll probably talk about, it's not that crazy. There are something like 300 million-ish Microsoft Office subscribers, right? They have Copilot now, and I don't know what they're selling it for, but suppose you sold some sort of AI add-on for a hundred bucks a month, and a third of Microsoft Office subscribers subscribed to that. That'd be a hundred billion right there. A hundred dollars a month is a lot, you know.
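As a quick sanity check on that arithmetic (purely illustrative numbers from the conversation, not Microsoft's actual subscriber counts or pricing):

```python
# Sanity check of the Copilot revenue math above. All inputs are the rough,
# hypothetical figures from the conversation, not real pricing data.
office_subscribers = 300e6      # ~300 million Office subscribers (rough)
adoption_rate = 1 / 3           # suppose a third buy an AI add-on
price_per_month = 100           # hypothetical $100/month add-on

annual_revenue = office_subscribers * adoption_rate * price_per_month * 12
print(f"~${annual_revenue / 1e9:.0f}B per year")   # ~$120B
```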
It's a lot, it's a lot. For a third of Office subscribers?

Yeah, but for the average knowledge worker, that's a few hours of productivity a month. You'd have to be expecting pretty lame AI progress for it not to hit a few hours of productivity a month.

Okay, sure, so let's assume all this. What happens in the next few years? What can the AI that's trained on the one gigawatt data center do? What can the one on the 10 gigawatt data center do? Just map out the next few years of AI progress for me.
Yeah, I think probably the sort of 10 gigawatt-ish range
is sort of my best guess
for when you get the sort of true AGI.
I mean, yeah, I think it's sort of like
one gigawatt data center.
And again, I think actually compute is overrated
and we're going to talk about that,
but we will talk about compute right now.
So, you know, I think in '25, '26, we're going to get models that are basically smarter than most college graduates. And I think a lot of the economic usefulness really depends on unhobbling. Basically, the models are smart, but they're limited, right? There's this chatbot, and then there are things like being able to use a computer, being able to do agentic long-horizon tasks. And I think by '27, '28, if you extrapolate the trends, and we'll talk about that more later and I talk about it in the series, we hit something basically as smart as the smartest experts. The unhobbling trajectory points to something that looks much more like an agent than a chatbot, almost like a drop-in remote worker. And I mean, I think this is the question on the economic returns.
I think a lot of the intermediate AI systems could be really useful, but, you know, it actually just takes a lot of schlep to integrate them, right? Like GPT-4, you know, whatever, 4.5, you know, probably there's a lot you could do with them in a business use case, but, you know, you really got to change your workflows to make them useful. And it's just like, there's a lot of, you know, it's a very Tyler Cowen-esque take.
It just takes a long time to diffuse. We're in SF and so we miss that, or whatever. But I think in some sense, the way a lot of these systems want to be integrated is you get this sort of sonic boom, where the intermediate systems could have done it, but it would have taken schlep, and before you do the schlep to integrate them, you get much more powerful systems that are unhobbled. So they're this agent, this drop-in remote worker, and then you're interacting with them like a coworker, right? You can do Zoom calls with them, you're Slacking them, you're like, ah, can you do this project, and then they go off and go away for a week, write a first draft, get feedback on it, run tests on their code, and then they come back and show it to you and you tell them a few more things. And that'll be much easier to integrate.
And so, you know, it might be that actually you need a bit of overkill to make the sort of transition easy and to really harvest the gains. What do you mean by the overkill? Overkill on the model capabilities? Yeah, yeah.
So basically, the intermediate models could do it, but it would take a lot of schlep. And so then it's actually the drop-in remote worker kind of AGI, the one that can automate cognitive tasks, that ends up mattering. The intermediate models would have made the software engineer more productive, but will the software engineer adopt them? And then the '27 model is, well, you just don't need the software engineer. You can literally interact with it like a software engineer and it'll do the work of a software engineer.

So the last episode I did was with John Schulman.
Yeah. And I was asking about basically this.
And one of the questions I asked is, we have these models that have been coming out in the last year, and none of them seem to have significantly surpassed GPT-4, and certainly not in the agentic way in which you'd interact with them as a coworker. They'll brag that they got a few extra points on MMLU or something. And even GPT-4o, it's cool that it can talk like Scarlett Johansson or something. But like...
And honestly, I'm going to use that. I guess not anymore.
Okay, but the whole co-worker thing... This is going to be a run-on question, but you can address it in any order.
But it makes sense to me why they'd be good at answering questions. They have a bunch of data about how to complete Wikipedia text or whatever.
Where is the equivalent training data that enables it to understand what's going on in the Zoom call? How does this connect with what they were talking about in Slack? What is the cohesive project they're going after based on all this context? Where is that training data coming from?

Yeah, so I think a really key question for AI progress in the next few years is how hard it is to unlock the test-time compute overhang. So right now, GPT-4 answers a question and it can do a few hundred tokens of chain of thought, and that's already a huge improvement, right? That was a big unhobbling. Before, you'd answer a math question by just shotgunning it.
And if you tried to answer a math question by saying the first thing that came to mind, you wouldn't be very good. So GPT-4 thinks for a few hundred tokens. And if I think at, say, a hundred tokens a minute...

You think at much more than a hundred tokens a minute?

I don't know. If I think at like a hundred tokens a minute, then what GPT-4 does is maybe equivalent to me thinking for three minutes or whatever. Suppose GPT-4 could think for millions of tokens. That's plus four OOMs, four orders of magnitude, on test-time compute, just on one problem. It can't do that right now. It kind of gets stuck, right? Like, write some code: you can do a little bit of iterative debugging, but eventually it just gets stuck on something, it can't correct its errors, and so on. So in a sense there's this big overhang. In other areas of ML, there's this great paper on AlphaGo where you can trade off train-time and test-time compute, and if you can use four OOMs more test-time compute, that's almost like a three-and-a-half-OOM bigger model. Because, again, at a hundred tokens a minute, a few million tokens is a few months of working time. There's a lot more you can do in a few months of working time than in what models get right now.
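For concreteness, here is the rough conversion behind that "months of working time" point, assuming the ~100 tokens per minute figure used in the conversation (illustrative arithmetic only):

```python
# How much human "working time" do N tokens of chain of thought correspond
# to, at the ~100 tokens/minute rate assumed above? Illustrative only.
TOKENS_PER_MIN = 100
WORK_HOURS_PER_MONTH = 160          # ~40-hour weeks

for tokens in (300, 10_000, 1_000_000, 4_000_000):
    hours = tokens / TOKENS_PER_MIN / 60
    months = hours / WORK_HOURS_PER_MONTH
    print(f"{tokens:>9,} tokens ~ {hours:7.1f} hours ~ {months:5.2f} work-months")
```

A few hundred tokens comes out to a few minutes of thinking, and a few million tokens to a few months, which is the gap the test-time compute overhang refers to.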
So the question is how hard it is to unlock that. And the short-timelines AI world is the one where it's not that hard. The reason it might not be that hard is that there are only really a few extra kinds of tokens you need to learn. You need to learn the error-correction tokens, the tokens where you're like, ah, I think I made a mistake.
Let me think about that again. You need to learn the kind of planning tokens.
That's kind of like, I'm going to start by making a plan. Here's my plan of attack.
And then I'm going to write a draft. And I'm going to like, now I'm going to critique my draft.
I'm going to think about it. So these are not things that models can do right now, but the question is how hard that is. And in some sense there are two paths to agents, right? When Sholto was on your podcast, he talked about scaling leading to more nines of reliability, and that's one path.

I think the other path is the unhobbling path, where the model needs to learn this kind of System 2 process. If it can learn that System 2 process, it can just use millions of tokens, think with them, and be coherent. Here's an analogy: when you drive, most of the time you're on autopilot, right? You're just driving and you're doing fine. But sometimes you hit a weird construction zone or a weird intersection, and then sometimes I'm like, to my girlfriend in the passenger seat, ah, be quiet for a moment, I need to figure out what's going on.
Right. Right.
And that's like going from autopilot to System 2 jumping in, and you're thinking about how to do it. So scaling is improving that System 1 autopilot. It's the brute-force way to get to agents: you just improve that system. But if you can get that System 2 working, then I think you could quite quickly jump to something more agentified, where the test-time compute overhang is unlocked.

What's the reason to think that this is an easy win, in the sense that there's some loss function that easily enables you to train the model to do System 2 thinking?
Yeah. There are not a lot of animals that have System 2 thinking. It took a long time for evolution to give us System 2 thinking.

Yeah.

The pre-training, like, listen, I get it. You've got trillions of tokens of internet text. I get that you model all of that and you get all these pre-training capabilities. What's the reason to think that this is an easy unhobbling?

Yeah, so, okay, a bunch of things.
So, first of all, pre-training is magical, right? It gave us this huge advantage for models of general intelligence, because you just predict the next token. But predicting the next token, and this is a common misconception, what it really does is let the model learn these incredibly rich representations. These representation-learning properties are the magic of deep learning: instead of learning some surface-level artifacts, the models learn these models of the world, and that's also why they can generalize, because they learned the right representations. So you pre-train these models and you have this raw bundle of capabilities that's really useful, this almost unformed raw mass. And the unhobbling we've done from GPT-2 to GPT-4 was taking that raw mass and, for example, RLHF-ing it into a really good chatbot, and that was a huge win. In the original InstructGPT paper, the RLHF'd versus non-RLHF'd model is like a 100x model-size win on human preference ratings. Models started to be able to do simple chain of thought and so on. But you still have all these raw capabilities, and I think there's still a huge amount you're not doing with them. By the way, I think this pre-training advantage is also the difference with robotics. People used to say robotics was a hardware problem, but the hardware stuff is getting solved; the thing is, you don't have this huge advantage of being able to bootstrap yourself with pre-training, all this unsupervised learning. You have to start right away with the RL, self-play, and so on. All right, so now the question is why some of this unhobbling and RL might work. And again, there's this advantage of bootstrapping, right? Your Twitter bio is "being pre-trained," right? But you're actually not being pre-trained anymore.
You were pre-trained in grade school and high school. At some point, you transitioned to being able to learn by yourself. You weren't able to do that in elementary school; middle school, probably high school, is maybe when it started, and you needed some guidance. By college, if you're smart, you can teach yourself. And models are just starting to enter that regime. So it's probably a little bit more scaling, and then you've got to figure out what goes on top, and it won't be trivial. A lot of deep learning is like this: it seems very obvious in retrospect, and there's some obvious cluster of ideas, some thing that seems a little dumb but just works, but there are a lot of details you have to get right.
So I'm not saying this, you know, we're going to get this, you know, next month or whatever. I think it's going to take a while to like really figure out the details.
A while for you is like half a year or something?

I don't know. Maybe a month, six months. Between six months and three years, you know. But I think it's possible.
And I think this is also very related to the issue of the data wall. One intuition on learning by yourself: in pre-training, the words are kind of flying by, right? It's like the teacher is lecturing at you, the words are flying by, and the models are just getting a little bit from it. But that's not what you do when you learn by yourself. When you learn by yourself, say you're reading a dense math textbook, you're not just skimming it; you wouldn't learn that much from it. And if you just skim through and reread and reread the math textbook, like if you just repeated the data, you'd just memorize it. What you actually do is you read a page and kind of think about it.
You have some internal monologue going on. You have a conversation with a study buddy.
You try a practice problem. You know, you fail a bunch of times at some point it clicks and you're like, this made sense, then you read a few more pages.
And we've kind of bootstrapped our way to models just starting to be able to do that: read it, think about it, try problems. And the question is whether all this self-play, synthetic data, RL stuff can make that work.
Yep.
So basically translating, translating like in context, right?
Like right now there's like in context learning, right?
Super sample efficient.
There's that, you know, in the Gemini paper, right?
It just like learns the language in context.
And then you're pre-training, not at all sample efficient.
But, you know, what humans do is they kind of like, they do in context learning.
You read a book, you think about it until eventually it clicks. But then you somehow distill that back into the weights.
And in some sense, that's what RL is trying to do. RL is super finicky, but when RL works, it's kind of magical, because it's the best possible data for the model. It's like when you try a practice problem, you fail, and at some point you figure it out in a way that makes sense to you. That's the best possible data for you, because it's the way you would have solved the problem. And that's what RL is, rather than just reading how somebody else solved the problem in a way that doesn't initially click. Yeah.
By the way, if that take sounds familiar, it's because it was part of the question I asked John Schulman, which goes to illustrate the thing I said in the intro: a bunch of the things I've learned about AI come from these dinners we do before the interviews, where I ask, what should I ask John Schulman? What should I ask Dario? Okay, suppose this is the way things go and we get these unhobblings. Yeah.
And the scaling, right? So it's like you have this baseline just enormous force of scaling, right? Where it's like GPT-2 to GPT-4, you know, GPT-2, it could kind of like, it was amazing, right? It could string together plausible sentences, but, you know, it could, it could barely do anything. It was kind of like preschooler.
And then GPT-4 is, you know, it's writing code. It like, you know, can do hard math.
It's sort of like a smart high schooler. So there's this big jump, and in the essay series I go through and count the orders of magnitude of compute scale-up and of algorithmic progress. And scaling alone, by '27, '28, is going to do another preschool-to-high-school-sized jump on top of GPT-4. At a per-token level, that will already be incredibly smart. It'll get you some more reliability. And then you add these unhobblings that make it look much less like a chatbot and more like this agent, like a drop-in remote worker.
And you know, that's when things really get going. Okay.
I want to ask more questions about this. I think, yeah, let's zoom out.
Okay. So suppose you're right about this.
Yeah. And I guess this is because of the 2027 cluster. Is it 10 gigawatts in 2027, or is '28 the 10 gigawatt one?
Okay, so maybe it'll be pulled forward. Okay, sure.
Something. Yeah.
And so I guess that's like a 5.5-level model by 2027, whatever that's called, right? What does the world look like at that point? You have these remote workers who can replace people. What is the reaction to that in terms of the economy, politics?

Yeah, so I think 2023 was a really interesting year to experience as somebody who was really following the AI stuff.

What were you doing in 2023?

I mean, OpenAI.

Oh yeah, yeah.

And I'd been thinking about this and talking to a lot of people in the years before, and it was this kind of weird thing. You almost didn't want to talk about AI or AGI; it was kind of a dirty word. And then in 2023, people saw ChatGPT for the first time and it just exploded. It triggered huge capital expenditures from all these firms, the explosion in revenue from NVIDIA, and so on. And things have been quiet since then.
But the next thing has been in the oven. And I expect these kind of g-forces to intensify every generation. People see the models; people haven't counted on them, so they're going to be surprised, and the models will be kind of crazy. And then revenue is going to accelerate. Suppose you do hit $10 billion by the end of this year, and it just continues on this doubling trajectory, revenue doubling every six months or so. You're not actually that far from $100 billion; maybe that's 2026. And at some point, what happened to NVIDIA is going to happen to big tech: their stocks are going to explode.
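A tiny sketch of that doubling arithmetic, assuming the hypothetical $10 billion annual run-rate at the end of 2024 and a doubling every six months (not a forecast, just the back-of-envelope math):

```python
# Back-of-envelope for "revenue doubling every six months" from a
# hypothetical $10B/year run-rate at the end of 2024. Illustrative only.
revenue_b = 10.0    # assumed $B per year at end of 2024

for label in ("mid-2025", "end of 2025", "mid-2026", "end of 2026"):
    revenue_b *= 2
    print(f"{label}: ~${revenue_b:.0f}B/year")    # 20, 40, 80, 160
```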
And I mean, I think a lot more people are going to feel it, right? I mean, I think the, I think 2023 was the sort of moment for me where it went from kind of AGI as a sort of theoretical abstract thing, and you'd make the models to like, I see it, I feel it. And like, I see the path, I see where it's going.
I think I can see the cluster it's trained on, the rough combination of algorithms, the people, how it's happening. And I think most of the world is not there yet; most of the people who feel it are right here, right? But I think a lot more of the world is going to start feeling it.
Um, and I think that's going to start being kind of intense. Okay.
So right now, who feels it? You go on Twitter and there are these GPT wrapper companies, like, whoa, GPT-4o is going to change our business.

I'm so bearish on the wrapper companies, right? Because the wrapper companies are betting on stagnation. They're betting that you have these intermediate models and it takes so much time to integrate them. And I'm really bearish because I'm like, we're just going to sonic boom you. We're going to get the unhobbled models, we're going to get the drop-in remote worker, and then your stuff is not going to matter.

Okay, sure, sure.
So that's done. Now, who...
So SF is paying attention now
or this crowd here is paying attention.
Who is going to be paying attention
in 2026, 2027?
And presumably, these are years
in which the hundreds of billions of capex
is being spent on AI.
I mean, I think the national security state
is going to be starting to pay a lot of attention.
And, you know, I hope we get to talk about that a little bit. Okay, let's talk about it now.
What happens? Yeah. Like, what is the sort of political reaction immediately? Yeah.
And even, like, internationally? What do people see? Like, right now, I don't know if Xi Jinping reads the news and sees, like, oh my God, look at the MMLU score on that. What are you doing about this, comrade?

Yeah, I don't know what to say about Xi Jinping.

So at what point does he see the GPT that's a drop-in remote worker replacement and has $100 billion in revenue?
There's a lot of businesses that have $100 billion in revenue and people don't like, aren't staying up all night talking about it. The question, I think the question is, when does the CCP and when does the sort of American national security establishment realize that super intelligence is going to be absolutely decisive for national power, right? And this is where, you know, the sort of intelligence explosion stuff comes in, which, you know, we should also talk about later.
You know, it's sort of like, you know, you have AGI, you have this sort of drop in remote worker that can replace, you know, you or me, at least that sort of remote jobs, you know, kind of jobs. And then, you know, I think fairly quickly, you know, I mean, by default, you know, you turn the crank, you know, one or two more times, you know, and then you get a thing that's smarter than humans.
But I think even more than just turning the crank a few more times, you know, I think one of the first jobs to be automated is going to be that of sort of an AI researcher engineer. And if you can automate AI research, you know, I think things can start going very fast.
You know, right now there's already this trend of, you know, half an order of magnitude a year of algorithmic progress, you know, suppose, you know, at this point, you know, you're going to have GPU fleets in the tens of millions for inference, you know, or more. And, you know, you're going to be able to run like a hundred million human equivalents of these sort of automated AI researchers.
And if you can do that, you know, you can maybe do, you know, a decade's worth of sort of ML research progress in a year, you know, you get some sort of 10x speed up. And if you can do that, I think you can make the jump to kind of like, AI that is vastly smarter than humans, you know, within a year, a couple years.
And then you know, that broadens, right? So you have this, you have this sort of initial acceleration of AI research that broadens to like, you apply R&D to a bunch of other fields of technology. And the sort of like extremes, you know, at this point you have like a billion just super intelligent researchers, engineers, technicians, everything, you know, superbly competent, all the things, you know, they're going to figure out robotics.
So we talked about it being a software problem. Well, you know, you have a billion of super smart, smarter than the smartest human researchers, AI researchers on your cluster, you know, at some point, they're going to be able to figure out robotics.
And then again, that expands. I think if you play this picture forward, it is fairly unlike any other technology, in that a couple years of lead could be utterly decisive in, say, military competition. If you look at Gulf War I, the Western coalition forces had something like a 100 to 1 kill ratio, right? And that was because they had better sensors on their tanks, more precision missiles, GPS, stealth, maybe 20 or 30 years of technological lead, and they just completely crushed them. Now think about superintelligence applied to broad fields of R&D.
And then, you know, the sort of industrial explosion as well, you have the robots, you're just making lots of material, you know, I think that could compress, I mean, basically compress kind of like a century worth of technological progress into less than a decade. And that means that, you know, a couple years could mean a sort of Gulf War I style, like, you know, advantage in military affairs.
Including a decisive advantage that even preempts nukes, right? How do you find the stealthy nuclear submarines? Right now that's a problem of having the sensors and the software to detect where they are. With superintelligence you can do that, you can find them. You have millions or billions of mosquito-sized drones, and they take out the nuclear submarines, they take out the mobile launchers, they take out the other nukes. Anyway, I think it's enormously destabilizing and enormously important for national power. And at some point people are going to realize that. Not yet, but they will. And when they do, I don't think it will just be the AI researchers in charge.
And I think the CCP is going to mount an all-out effort to infiltrate American AI labs: billions of dollars, thousands of people, the full force of the Ministry of State Security. The CCP is going to try to outbuild us, right? Power in China, the electric grid: they added as much power in the last decade as the entire US electric grid.
So like the 100 gigawatt cluster, at least 100 gigawatts is going to be a lot easier for them to get. And so I think sort of, you know, by this point, I think it's going to be like an extremely intense sort of international competition.
Okay. So in this picture, one thing I'm uncertain about is whether it's more like what you say, where it's more of an implosion of you have developed an AGI and then you make it into an AI researcher.
And for a while, a year or something, you're only using this ability to make hundreds of millions of other AI researchers and then like the thing that comes out of this really frenetic process is a super intelligence and then that goes out in the world and is developing robotics and helping you take over other countries and whatever. I mean, I think it's a little bit more, you know, it's a little bit more kind of like, you know, it's not like, you know, on and off.
It's a little bit more gradual, but it's sort of like it's an explosion that starts narrowly.
It can do cognitive jobs.
The highest-ROI use for cognitive jobs is making the AI better, and things like solving robotics. And as you solve robotics, now you can do R&D in biology and other technology.
You know, initially you start with the factory workers, you know, they're wearing the glasses and the AirPods, you know, and the A.I. is instructing them.
Right. Because, you know, you kind of make any worker into a skilled technician and then you have the robots come in.
And anyway, so it sort of expands. This process expands.
Meta's Ray-Bans are a complement to their Llama.

Well, you know, it's like whatever.
Like, the fabs in the U.S. are constrained on skilled workers, right? Sure. Even if you don't have robots yet, you have the cognitive superintelligence, and it can make all those workers into skilled workers immediately.
But that's, you know, it's a very brief period. You know, robots will come soon.
Sure. Okay.
Okay. So suppose this is actually how the tech progresses.
In the United States, maybe because these companies are already experiencing hundreds of billions of dollars of AI revenue. At this point, you know, companies are borrowing, you know, hundreds of billions of more in the corporate debt markets, you know.
But why does a CCP bureaucrat, some 60-year-old guy, look at this and see anything more than, oh, Copilot has gotten better now? Why would they act on it?

I mean, this is much more than Copilot has gotten better now.
I mean, this is like, yeah, so to shift the production of an entire country to dislocate energy that is otherwise being used for consumer goods or something and to make that all feed into the data centers. What, because like part of this whole story is you realize the super intelligence is coming soon, right? And I guess you realize it, maybe I realize it.
I'm not sure how much I realize it, but will the national security apparatus in the United States and will the CCP realize it? Yeah. I mean, look, I think in some sense, this is a really key question.
I think we have a few more years of mid-game, basically, where you have a few more 2023s, and that just starts updating more and more people, and the trend lines will become clear. I think you will see some amount of the COVID dynamic, right? February of 2020 honestly feels a lot like today: it feels like this utterly crazy thing is impending, is coming. You kind of see the exponential, and yet most of the world just doesn't realize it, right? The mayor of New York is like, go out to the shows, and worrying about this is just Asian racism or whatever. But at some point people saw the exponential, and then kind of crazy, radical reactions came. Right.
Okay. So by the way, what were you doing during COVID, in like February 2020?

Uh, like freshman, sophomore, what?

Junior.

Hmm. But still like a kid, a 17-year-old junior or something?

Yeah.
And then you bought, like, did you short the market or something?
Yeah, yeah, yeah.
Did you sell at the right time?
Yeah.
Okay. Yeah, so there will be, like, a March 2020 moment,
the thing that was COVID, but here.
Now, then you can, like, make the analogy that you make in the series
that this will then cause the reaction of, like, we've got to do the Manhattan Project for America here. I wonder what the politics of this will be like, because the difference here is it's not just, we need the bomb to beat the Nazis. It's, we're building this thing that's making all our energy prices rise a bunch, it's automating a bunch of our jobs, and the climate change stuff, people are going to be like, oh my God, it's making climate change worse, and it's helping big tech. Politically, this doesn't seem like a dynamic where the national security apparatus or the president is like, we have to step on the gas here and make sure America wins.

Yeah, I mean, again, I think a lot of this really depends on how much people are feeling it, how much people are seeing it.
I think there's a thing where basically our generation is so used to peace, American hegemony, and nothing really mattering. But extraordinary things happening in the world and intense international competition are very much the historical norm. In some sense there's been this very unique period of a few decades, but look at the history of the world. In World War II, something like 50% of GDP went to wartime production. The U.S. borrowed over 60% of GDP, and I think Germany and Japan over 100%. In World War I, the UK, France, and Germany all borrowed over 100% of GDP. And much more was on the line. People talk about World War II being so destructive, 20 million Soviet soldiers dying, something like 20% of Poland. But things like that happened all the time, right? In the Seven Years' War, something like 20 or 30% of Prussia died. In the Thirty Years' War, up to 50% of large swaths of Germany died.
And, you know, I think the question is, will these sort of like, will people see that the stakes here are really, really high and that basically is sort of like history is actually back? And I think, you know, I think the American national security state thinks very seriously about stuff like this. They think very seriously about competition with China.
I think China very much thinks of this as its historical mission, the rejuvenation of the Chinese nation. It thinks a lot about national power, a lot about the world order. And then there's a real question on timing, right? Do they start taking this seriously only when the intelligence explosion is already happening, quite late, or do they start taking it seriously two years earlier? That matters a lot for how things play out. But at some point they will, and at some point they will realize that this will be utterly decisive, not just for some proxy war somewhere, but for whether liberal democracy can continue to thrive, whether the CCP will continue existing. And I think that will activate forces that we haven't seen in a long time.
The great power conflict thing definitely seems compelling. All kinds of different things seem much more likely when you think from a historical perspective, when you zoom out beyond the liberal democracy we've had the pleasure to live in in America for, let's say, 80 years: dictatorships, obviously war, famine, whatever. I was reading The Gulag Archipelago, and one of the chapters begins with Solzhenitsyn saying that if you had told Russian citizens under the czars that the result of all these new technologies in the 20th century would not be some great Russian revival, with Russia becoming a great power and its citizens made wealthy, but instead tens of millions of Soviet citizens tortured by millions of beasts in the worst possible ways, they wouldn't have believed you. They'd have called you a slanderer.
Yeah. And you know, the possibilities for dictatorship with superintelligence are sort of even crazier,
right? Imagine you have a perfectly loyal military and security force. That's it: no more rebellions, no more popular uprisings. You have perfect lie detection, you have surveillance of everybody, you can perfectly figure out who the dissenters are and weed them out. No Gorbachev who had some doubts about the system would ever have risen to power. No military coup would ever have happened. And I think there's a real way in which part of why things have worked out is that ideas can evolve. Time heals a lot of wounds and settles a lot of debates. A lot of people had really strong convictions, but a lot of those have been overturned by time, because there's been this continued pluralism and evolution.
I think there's a way in which kind of like, you know, if you take a CCP-like approach to kind of like truth, truth is what the party says. And you supercharge that with superintelligence.
I think there's a way in which that could just be like locked in and enshrined for, you know, a long time. And I think the possibilities are pretty terrifying.
You know, your point about history, and living in America for the past 80 years: I think this is one of the things I took away from growing up in Germany, that a lot of this stuff feels more visceral. My mother grew up in the former East, my father in the former West. They met shortly after the Wall fell; the end of the Cold War was this extremely pivotal moment for me, because it's the reason I exist. And then there's growing up in Berlin, with the former Wall. My great-grandmother, who is still alive and very important in my life, was born in '34. She grew up during the Nazi era and World War II; she saw the firebombing of Dresden from the country cottage where they were staying as kids. Then she spent most of her life in the East German communist dictatorship. She'd tell me about '53, when there was the popular uprising and Soviet tanks came in, and her husband was telling her to get home really quickly, get off the streets. She had a son who tried to ride a motorcycle across the Iron Curtain and was put in a Stasi prison for a while.
You know, and then finally, you know, when she's almost 60, you know, is the first time she lives in, you know, a free country and a wealthy country. And, you know, when I was a kid, she was, the thing she always really didn't want me to do was like get involved in politics because like joining a political party was just, you know, it was very bad connotations for her.
Anyway, and she sort of raised me when I was young, you know, and so it, you know, it doesn't feel that long ago. It feels very close.
Yeah. So I wonder when we're talking today about the CCP, listen, the people in China who will be doing their version of the project will be AI researchers who are somewhat westernized, who interact with either got educated in the West or have colleagues in the West.
Are they going to sign up for the CCP project that's going to hand over control to Xi Jinping? What's your sense on... I mean, it's just like fundamentally they're just people, right? Like can't you like convince them about the dangers of superintelligence or something? Will they be in charge though? I mean, in some sense, this is also the case, you know, in the US or whatever.
This is sort of the rapidly depreciating influence of the lab employees. Right now, AI lab employees have so much power over this, right? You saw this.

But they're going to get automated.

Well, yeah, you saw this in the November events, so much power, right? But both: they're going to get automated, and they're going to lose all their power. It'll just be a few people in charge with their armies of automated AIs. And also, it'll be the generals and the national security state. There are those classic scenes from the Oppenheimer movie: the scientists built it, and then the bomb was shipped away and it was out of their hands. I actually think it's good for lab employees to be aware of this. You have a lot of power now, but maybe not for that long, so use it wisely. I do think they would benefit from some more organs of representative democracy.

What do you mean by that?

Oh, I mean, in the OpenAI board events, employee power was exercised in a very direct-democracy way, and I feel like how some of that went really highlighted the benefits of representative democracy and having some deliberative organs.
Interesting. Yeah.
Well, let's go back to the $100 billion in revenue, or whatever, at these companies, and the gigawatt clusters.

Yeah.

The companies are deploying, or trying to build, clusters that are this big.
Yeah. Where are they building it? Because if you say it's the amount of energy that would require for a small or medium-sized U.S.
state, does Colorado then get no power? And is it happening in the United States or somewhere else?

Oh, I mean, in some sense this is the thing I always find funny. You talk about Colorado getting no power. The easy way to get the power would be to displace less economically useful stuff. It's like, whatever, buy up the aluminum smelting plant that has a gigawatt and replace it with the data center, because that's important. That's not actually happening, because a lot of these power contracts are really long-term locked in, and obviously people don't like things like this. So it seems like in practice what it's requiring, at least right now, is building new power. That might change, and I think that's when things get really interesting, when it's like, no, we're just dedicating all of the power to the AGI. Okay, so right now it's building new power. Ten gigawatts, I think, is quite doable; it's a few percent of U.S.
natural gas production. You know, when you have the 10 gigawatt training cluster, you have a lot more inference.
So that starts getting more. You know, I think 100 gigawatt, that starts getting pretty wild.
You know, that's, you know, again, it's like over 20 percent of U.S. electricity production.
I think it's pretty doable, especially if you're willing to go for like natural gas. But I do think it is incredibly important, incredibly important that these clusters are in the United States.
And why does it matter it's in the US? I mean, look, I think there's some people who are, you know, trying to build clusters elsewhere. And, you know, there's like a lot of free flowing Middle Eastern money that's trying to build clusters elsewhere.
I think this comes back to the sort of like national security question we talked about earlier. Like, would you, I mean, would you do the Manhattan Project in the UAE, right? And I think, I think basically like putting, putting the clusters, you know, I think you can put them in the US, you can put them in sort of like ally democracies, but I think once you put them in kind of like, you know, dictatorships, authoritarian dictatorships, you kind of create this, you know, irreversible security risk, right? So, I mean, one cluster is there, much easier for them to exfiltrate the weights.
You know, they can like literally steal the AGI, the super intelligence. It's like they got a copy of the, you know, of the atomic bomb, you know, and they just got a direct replica of that.
And it makes it much easier to them. I mean, we're tied to China, you can ship that to China.
So that's a huge risk. Another thing is they can just seize the compute, right? Like maybe right now, they just think of this, I mean, in general, I think people, you know, i think the issue here is people are thinking of this as a you know chat gpt big tech product clusters but i think the cluster is being planned now you know three to five years out like it will be the like agi super intelligence clusters and so anyway so like when things get hot you know they might just seize the compute and i don't know supposedly put like you know 25 of the compute capacity in these sort of middle eastern decatorships well they seize that and now it's sort of a ratio of compute of three to one.
And, you know, we still have more, but even with only 25% of compute there, I think it starts getting pretty hairy. Three to one is not that great of a ratio; you can do a lot with that amount of compute.
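(As an aside, the three-to-one figure is just the arithmetic of where the compute sits: if some fraction of total compute can be seized, the remaining ratio is what's left divided by that fraction. A tiny illustrative sketch, my own and not from the conversation:)

```python
# If a fraction `share_abroad` of total compute sits in seizable locations,
# seizing it leaves the home coalition with (1 - share_abroad) / share_abroad
# times the adversary's compute.
def ratio_after_seizure(share_abroad: float) -> float:
    return (1 - share_abroad) / share_abroad

for share in (0.10, 0.25, 0.33):
    print(f"{share:.0%} of compute seizable -> roughly {ratio_after_seizure(share):.1f} : 1")
# 10% -> 9:1, 25% -> 3:1, 33% -> ~2:1; the ratio degrades quickly as the seizable share grows
```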
And then look, even if they don't actually do this, even if they don't actually seize the compute, even if they don't actually steal the weights, there's just a lot of implicit leverage you get. They get the seat at the AGI table. And I don't know why we're giving authoritarian dictatorships the seat at the AGI table.

Okay, so there's going to be a lot of compute in the Middle East if these deals go through. First of all, who is it? Is it just every single big tech company trying to figure out how they can get in on it?
Not everyone, some.
Okay, okay. I guess there's reports, I think Microsoft or… Yeah, yeah, yeah, yeah.
Which we'll get into. Yeah.
So, UAE gets a bunch of compute because we're building the clusters there. Yeah.
And why, so let's say they have 25% of, why does the compute ratio matter? Is it... If it's about them being able to kick off
the intelligence explosion,
isn't it just some threshold where you have 100 million AI researchers or you don't?

I mean, you can do a lot with, you know, 33 million extremely smart scientists. And again, first of all, that might be enough to build the crazy bioweapons, right? And then you're in a world where, wow, they stole the weights, they seized the compute, and now they can build these crazy new WMDs that will be possible with superintelligence. You've just proliferated the stuff, and it'll be really powerful. And also, I think 3x on compute isn't actually that much. And so a thing I worry a lot about is this: I think the riskiest situation is if we're in some sort of really tight, neck-and-neck, feverish international struggle, right? If we're really close with the CCP and we're months apart. The situation we want to be in, and could be in if we played our cards right,
is a little bit more like, you know, the U.S. building the atomic bomb versus the German project being way behind, years behind. And if we have that, I think we just have so much more wiggle room to get safety right.
We're going to be building these crazy new WMDs, things that completely undermine nuclear deterrence, amid intense competition. And that's so much easier to deal with if you don't have somebody right on your tail, where you've got to go maximum speed, you have no wiggle room, and you're worried that at any time they can overtake you. I mean, they can also just try to outbuild you, right? China might literally win if they can steal the weights, because they can outbuild you, and they maybe have less caution, both good and bad caution, like whatever unreasonable regulations we have. Or you're just in this really tight race. And if you're in this really tight race, this feverish struggle, I think that's when there's the greatest peril of self-destruction.

So then presumably the companies that are trying to build clusters in the Middle East realize this. Is it just that it's impossible to do this in America, and if you want American companies to do this at all, then you do it in the Middle East or not at all? And then you're just like, I'm trying to build a Three Gorges Dam cluster.
I mean, there's a few reasons. One of them is just that people aren't thinking about this as the AGI, superintelligence cluster. They're just like, ah, cool clusters for my chatbot.

But the ones they're building, the plans right now, are they clusters where they realize, we're going to do one training run in this thing we're building? Because if you're doing ones for inference, presumably you could spread them out across the country or something.

I just think it's harder to distinguish between inference and training compute. And so people can claim it's training compute, but, sorry, they might say it's inference compute, and actually it's useful for training compute too.

Because of synthetic data and things like that?

Yeah, the future of training, like RL, looks a lot like inference, for example. Or you just kind of end up connecting them in time.
You know, it's like a lot of raw material. You know, it's like, you know, it's placing your uranium refinement facilities there.
So a few reasons, right? One is just like they don't think about this as the AGI cluster. Another is just like easy money from the Middle East, right? Another one is like, you know, people saying, some people think that, you know, you can't do it in the US.
And, you know, I think we actually face a sort of real system competition here, because again, some people think it's only autocracies that can do this, that can kind of like top down, mobilize the sort of industrial capacity, the power, you know, get the stuff done fast. And again, this is the sort of thing, you know, we haven't faced in a while.
But, you know, during the Cold War, there really was this intense system competition, right? Like East versus West Germany: West Germany, kind of liberal democratic capitalism, versus communist state planning. And,
you know, now it's obvious that the sort of, you know, the free world would win. But, you know,
even as late as '61, you know, Paul Samuelson was predicting the Soviet Union would outgrow the United States because they were able to mobilize industry better. And so, yeah, there are some people who, you know, shitpost about loving America by day.
But then in private, they're betting against America. They're betting against the liberal order.
And I basically just think it's a bad bet. And the reason I think it's a bad bet is that I think this stuff is just really possible in the U.S.
And so to make it possible in the U.S., there's some amount that we have to get our act together, right? So I think there's basically two paths to doing it in the U.S. One is you just got to be willing to do natural gas.
And there's ample natural gas, right? You put your cluster in West Texas. You put it in, you know, Southwest Pennsylvania by the Marcellus Shale.
10 gigawatt cluster is super easy. 100 gigawatt cluster, also pretty doable.
You know, I think, you know, natural gas production in the United States has, you know, almost doubled in a decade. If you do that, you know, one more time over the next, you know, seven years or whatever, you know, you could power multiple trillion dollar data centers.
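(To see why one more doubling of natural gas production could plausibly cover multiple 100-gigawatt-class clusters, here is a rough sketch using the same kind of illustrative assumptions as before; these are my own numbers, not Leopold's.)

```python
# How much continuous electric power could an extra ~37 Tcf/year of natural gas support?
EXTRA_GAS_TCF = 37          # assumed increment from one more doubling of U.S. production
BTU_PER_CUBIC_FOOT = 1037   # typical energy content of natural gas
EFFICIENCY = 0.45           # assumed combined-cycle thermal-to-electric efficiency
BTU_PER_KWH = 3412
HOURS_PER_YEAR = 8760

thermal_btu = EXTRA_GAS_TCF * 1e12 * BTU_PER_CUBIC_FOOT
electric_twh = thermal_btu * EFFICIENCY / BTU_PER_KWH / 1e9   # kWh -> TWh
continuous_gw = electric_twh * 1000 / HOURS_PER_YEAR

print(f"~{electric_twh:.0f} TWh/year, i.e. ~{continuous_gw:.0f} GW of continuous power")
print(f"~{continuous_gw / 100:.0f} clusters of 100 GW each")
# on the order of five or six 100 GW clusters, i.e. "multiple trillion-dollar data centers"
```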
But the issue there is, you know, a lot of people have sort of made these climate commitments. They're not just government.
It's actually the private companies themselves, right? The Microsoft, the Amazons, and so on. They have these climate commitments, so they won't do natural gas.
And, you know, I admire the climate commitments, but I think at some point, you know, the national interest and national security kind of is more important. The other path is like, you know, you can do this sort of green energy megaprojects, right? You do the solar and the batteries and the, you know, the SMRs and geothermal.
But if we want to do that, there needs to be sort of a sort of broad deregulatory push, right? So like you can't have permitting take a decade, right? So you got to reform FERC. You got to like have, you know, blanket NEPA exemptions for this stuff.
You know, there's like inane state level regulations, you know, that are like, yeah, you could build, you know, you can build the solar panels and batteries next to your data center, but it'll still take years because, you know, you actually have to hook it up to the state electrical grid, you know, and you have to like use governmental powers to create rights of way to kind of like, you know, have multiple clusters and connect them, you know, and have thick cables basically. And so look, I mean, ideally we do both, right? Ideally we do natural gas and the broad regulatory agenda.
I think we have to do at least one. And then I think this possible stuff is just possible in the United States.
Yeah. I think a good analogy for this, by the way, before the conversation I was reading, there's a good book about World War II industrial mobilization in the United States called Freedom's Forge.
Yeah. And I guess when we think back on that period, especially if you're from, if you read like the Patrick Collison Fast and the Progress Study stuff, it's like, you had state capacity back then and people just got shit done.
But now it's a clusterfuck. Wasn't it all the case? No.
So it was really interesting. So you have people who are from the Detroit auto industry side, like Knudsen, who are running mobilization for the United States.
And they were extremely competent. But then at the same time, you had labor and agitation, which is actually very analogous to the climate pledges and climate change concern we have today.
Yeah. Where they would have these strikes while literally into 1941 that would cost millions of man hours worth of time when we're trying to make tens of millions, sorry, tens of thousands of planes a
month or something. And they would just debilitate factories before, you know, trivial, like pennies on the dollar kind of concessions from capital.
And it was concerns that, oh, the auto companies are trying to use the pretext of a potential war to actually prevent paying labor that money deserves.
And so what climate change is today, like you would think, ah, fuck, America's fucked. Like, we're not going to be able to build this shit.
Like if you look at NEPA or something. But I didn't realize how debilitating labor was in World War II.
It was just, you know, before that, in like '39 or whatever, the American military was in total shambles, right? You read about it and it reads a little bit like, you know, the German military today, right? Military expenditures, I think, were less than 2% of GDP. All the European countries had gone, even in peacetime, above 10% of GDP in this sort of rapid mobilization.
There's nothing, you know, like, we're making kind of like no planes, there's no military contracts, everything had been starved during the Great Depression. But there was this latent capacity.
And, you know, at some point, the United States got their act together. I mean, the thing I'll say is, I think this applies the other way around too, basically, to China, right? And I think sometimes people kind of count them out a little bit.
And they're like the export controls and so on. And, you know, they're able to make seven nanometer chips.
Now, I think there's a question of how many they could make. But I think there's at least a possibility that they're going to be able to mature that ability and make a lot of seven-nanometer chips. And there's a lot of latent industrial capacity in China; they are able to build a lot of power fast. Maybe that isn't activated for AI yet, but at some point, the same way a lot of people in the U.S. and the United States government are going to wake up, at some point the CCP is going to wake up.

Yep. Okay, going back to the question of these companies: presumably, are they blind to the fact that there's going to be some sort of... Well, okay, so they realize scaling is a thing, right? Obviously their whole plans are contingent on scaling. And so they understand that we're going to be in 2028 building the 10-gigawatt data centers.
And at this point, that the people who can keep up are big tech just potentially at like the edge of their capabilities. Then sovereign wealth fund funded things.
And also big major countries like America, China, whatever. So what's their plan? If you look at these AI labs, what's their plan given this landscape? Do they not want the leverage of being in the United States? I mean, I think, I don't know.
I think, I mean, one thing the Middle East does offer is capital, but it's like America has plenty of capital, right? It's like, you know, we have trillion dollar companies. Like, what are these Middle Eastern states? They're kind of like trillion dollar oil companies.
We have trillion dollar companies and we have very deep financial markets and it's like, you know, Microsoft could issue hundreds of billions of dollars of bonds and they can pay for these clusters. I mean, look, I think another argument being made, um, and I think it's worth taking seriously is an argument that look, if we don't work with the UAE or with these Middle Eastern countries, um, they're just going to go to China.
Right. And so, you know, we, you know, they're going to build data centers.
They're going to pour money into AI regardless. And if we don't work with them, you know, they'll just support China.
And look, I mean, I think, I think there's some merit to the argument. And in the sense that I think we should be doing basically benefit sharing with them, right? I think we should talk about this later.
But I think basically sort of on the road to AGI, there should be kind of like two tiers of coalitions should be the sort of narrow coalition of democracies. That's sort of the coalition that's developing AGI.
And then there should be a broader coalition where we kind of go to other countries, including, you know, dictatorships, and we're willing to offer them, you know, we're willing to offer them some of the benefits of AI, some of the sharing. And so it's like, look, if the UAE wants to use AI products, if they want to run, you know, meta recommendation engines, if they want to run, you know, like the last generation models, that's fine.
I think by default, they just wouldn't have had this seat at the AGI table. And so it's like, yeah, they have some money, but a lot of people have money.
And the only reason they're getting this core seat at the AGI table, the only reason these dictators will have this enormous amount of leverage over this extremely national-security-relevant technology, is because we're kind of getting them excited and offering it to them.

Yeah, who specifically is doing this? Is it just the companies who are going there to fundraise? Like, the AGI is happening and you can fund it or you can't?

It's been reported that Sam is trying to raise seven trillion or whatever for a chip project, and it's unclear how many of the clusters will be there and so on, but stuff is definitely happening. I mean, look, I think another reason I'm a little bit suspicious of this argument, that if the US doesn't work with them they'll go to China, is I've heard from multiple people, and this wasn't from my time at OpenAI and I haven't seen the memo, but I have heard from multiple people that at some point several years ago, OpenAI leadership had laid out a plan to fund and sell AGI by starting a bidding war between the governments of, you know, the United States, China, and Russia.
And so, you know, it's kind of surprising to me that they're willing to sell AGI to the Chinese and Russian governments. But also, there's something that sort of feels a bit eerily familiar about kind of starting this bidding war and then kind of like playing them off each other.
And well, you know, if you don't do this,
China will do it.
So anyway.
Interesting.
Okay.
So that's pretty fucked up.
But given that that's...
Okay.
So suppose that you're right about
we ended up in this place
because we got...
The way one of our friends put it
is that the Middle East has like
no other place in the world
billions of dollars or trillions of dollars
up for persuasion.
And...
This is true.
And what we have in the form of Sam Altman. Like, you know, after the Microsoft board, it's only the dictators.
But so let's say you're right that you shouldn't have gotten them excited about AGI in the first place. But now we're in a place where they are excited about AGI.
And they're like, fuck, we don't want to just have GPT-5 while you're going to be off building superintelligence. This atoms-for-peace thing doesn't work for us.
And if you're in this place, don't they already have the leverage? Aren't you... you might as well just...

I don't think so. I think the UAE on its own is not competitive, right? I mean, they're already export controlled.
Like, you're not actually supposed to ship NVIDIA chips over there, right? It's not like they have any of the leading AI labs. They have money, but it's actually hard to just translate money into this.

But the other thing you've been saying in laying out your vision is very much that there's this almost industrial process: you put in the compute, and then you put in the algorithms...

Yes.

...you add that up and you get AGI on the other end.

Yes.

If it's something more like that, then the case for somebody being able to catch up rapidly seems more compelling than if it's something bespoke.

Well, if they can steal the algorithms and if they can steal the weights.
And that's really, that's really where sort of, I mean, we should talk about this. This is really important.
And I think, you know... So right now, how easy would it be for an actor to steal these things? Not the stuff that's released, the Scarlett Johansson's voice things, but the RL things you're talking about, the unhobblings?

I mean, extremely easy, right? Even DeepMind doesn't make a claim that it's hard. DeepMind put out their frontier safety framework or whatever, and they lay out security levels, zero to four, where four is the one resistant to state actors, and they say we're at level zero. And just recently there was an indictment of a guy who stole a bunch of really important AI code and went to China with it. All he had to do to steal the code was copy it into Apple Notes and export it as a PDF, and that got past their monitoring. And Google probably has the best security of any of the AI labs, because they have the Google infrastructure.
I mean, roughly I would think of this as the security of a startup, right? And what does the security of a startup look like? It's not that good. It's easy to steal.

So even if that's the case, a lot of your post is making the argument that the reason we're going to get the intelligence explosion is that somebody with the intuition of an Alec Radford can come up with all these ideas, that intuition is extremely valuable, and you scale that up.
But if it's just the intuition, then that's not going to be just in the code, right? And also, because of export controls, these countries are going to have slightly different hardware. You're going to have to make different trade-offs and probably rewrite things to be compatible with that.
Including all these things, is it just a matter of getting the right pen drive and plugging it into the gigawatt data center next to the Three Gorges Dam, and then you're off to the races?

I mean, look, there are different things here. One threat model is just stealing the weights themselves. The weights one is particularly insane, right? Because they can just steal the literal end product, like making a replica of the atomic bomb, and then they're just ready to go. And I think that one becomes extremely important around the time we have AGI and superintelligence, because China can build a big cluster. By default, we'd have a big lead, right? Because we have the better scientists. But if we make the superintelligence and they just steal it, they're off to the races.
Weights are a little bit less important right now. Because, you know, who cares if they steal the GPT-4 weights, right? Like, whatever.
And so, you know, we still have to get started on weight security now. Because, look, if we think AGI by '27, this stuff is going to take a while. It's not just going to be like, oh, we do some access control. If you actually want to be resistant to Chinese espionage, it needs to be much more intense. The thing, though, that I think people aren't paying enough attention to is the secrets, as you say. The compute stuff is sexy, we talk about it, but I think people underrate the secrets. There's the half an order of magnitude a year of algorithmic progress that happens just by default; that's huge. If we have a few-year lead, by default that's a 10x, 30x, 100x bigger effective cluster, if we protect them.
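(The 10x, 30x, 100x figure is just compounding the half order of magnitude per year over the length of the lead. A quick sketch of that arithmetic, assuming the half-OOM-per-year rate he cites:)

```python
# Effective compute multiplier from keeping algorithmic secrets for `years_of_lead`,
# assuming ~0.5 orders of magnitude (OOM) of algorithmic progress per year.
OOM_PER_YEAR = 0.5

def effective_compute_multiplier(years_of_lead: float) -> float:
    return 10 ** (OOM_PER_YEAR * years_of_lead)

for lead in (2, 3, 4):
    print(f"{lead}-year lead -> ~{effective_compute_multiplier(lead):.0f}x effective compute")
# 2 years -> ~10x, 3 years -> ~32x, 4 years -> 100x
```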
And then there's this additional layer of the data wall, right? We have to get through the data wall, and that means we actually have to figure out some sort of basic new paradigm, the AlphaGo step two. AlphaGo step one learns from human imitation.
AlphaGo step two is the sort of self-play RL. And everyone's working on that right now.
And maybe we're going to crack it. And, you know, if China can't steal that, then they're stuck.
If they can steal it, they're off to the races. But whatever that thing is, is it something you can literally write down on the back of a napkin? Because if it's that easy, then why is it that hard for them to figure it out? And if it's more about the intuitions, then don't you just have to hire Alec Radford? Like, what are you copying down?

Well, I think there's a few layers to this. At the top is the fundamental approach. On pre-training, it's: unsupervised learning, next-token prediction, train on the entire internet, and you actually get a lot of juice out of that already. That one's very quick to communicate. Then there are a lot of details that matter, and you were talking about this earlier: probably the thing people are going to figure out is going to be somewhat obvious, some kind of clear, not-that-complicated thing that'll work, but there will be a lot of details to getting it right.

But if that's true, then again, why do we think that getting state-level security in these startups will prevent China from catching up? If it's just like, oh, we know some sort of self-play RL will be required to get past the data wall, and if it's as easy as you say in some fundamental sense... I mean, it's going to be solved by 2027, you say, right? It's like not that hard.
I just think, you know, all the leading AI labs are in the United States, and they have this huge lead. I mean, by default, China actually has some good LLMs. Why do they have good LLMs? They're just using the open-source code, right? Llama or whatever. And so I think people really underrate both the divergence on algorithmic progress and the lead the US would have by default. Because all this stuff was published until recently, right? The Chinchilla scaling laws were published. There's a bunch of MoE papers.
There's transformers. And all that stuff was published.
And so that's why open source is good. That's why China can make some good models.
That stuff is now, I mean, at least they're not publishing it anymore. And if we actually kept it secret, it would be this huge edge.
To your point about tacit knowledge and Alec Radford, there's another layer at the bottom that is something about the large-scale engineering work to make these big training runs work. I think that is a little bit more tacit knowledge. But I think China will be able to figure that out. That's sort of engineering schlep.

So they're going to figure out how to do that, but not how to get the RL thing working?

I mean, look, I don't know.
Germany during World War II, you know, they went down the wrong path. They did heavy water and that was wrong.
And there's actually an amazing anecdote in The Making of the Atomic Bomb on this, right? Secrecy was actually one of the most contentious issues early on as well. And part of it was that Szilard really thought this nuclear chain reaction was possible.
And so an atomic bomb was possible. And he went around and he was like, this is going to be of enormous strategic importance, military importance.
And a lot of people didn't believe it or they're kind of like, well, maybe this is possible, but, you know, I'm going to act as though it's not possible. And, you know, science should be open and all these things.
And anyway, in these early days, there had been some incorrect measurements made on graphite as a moderator, which Germany had too. And so they thought graphite was not going to work, we have to do heavy water. But then Fermi made some new measurements on graphite, and they indicated that graphite would work. This is really important. And then Szilard kind of assaulted Fermi with another secrecy appeal.
And Fermi was just kind of, he was pissed off, you know, had a temper tantrum. You know, he was like, he thought it was absurd, you know, like, come on, this is crazy.
But, you know, Szilard persisted. I think they roped in another guy, Pegram.
And then Fermi didn't publish it. And, you know, that was just in time.
Because Fermi not publishing meant that the Nazis didn't figure out graphite would work. They went down this path of heavy water.
And that was the wrong path. That was why this, you know, this is a key reason why this sort of German project didn't work out.
They were kind of way behind. And, you know, I think we face a similar situation.
And are we just going to instantly leak the how-do-we-get-past-the-data-wall, what's-the-next-paradigm stuff? Or are we not? And the reason this would matter is that being one year ahead would be a huge advantage, as opposed to a world where you deploy AI over time and they're just going to catch up anyway.
I mean, I interviewed Richard Rhodes, the guy who wrote The Making of the Atomic Bomb.

Yeah.

And one of the anecdotes he had was: so they realized America had the bomb, obviously we dropped it on Japan. And Beria, the guy who ran the NKVD, a famously ruthless guy, just evil, goes to, I forget the name of the guy, the Soviet scientist who was running their version of the Manhattan Project. He says, comrade, you will get us the American bomb.
And the guy says, well, listen, their implosion device actually is not optimal. We should make it a different way.
And Beria says, no, you will get us the American bomb, or your family will be camp dust. The thing that's relevant about that anecdote is that the Soviets would actually have had a better bomb if they hadn't copied the American design, at least initially. Which suggests there's this pattern in history, not just the Manhattan Project, of parallel invention: because the tech tree implies that a certain thing is next, in this case self-play, RL, whatever, people are just working on that, and they're going to figure it out around the same time. There's not going to be that much gap in who gets it first. Wasn't it famously the case that a bunch of people invented something like the light bulb around the same time, and so forth? So is it just that, yeah, that might be true, but the one year or the six months or whatever, the two years, makes all the difference?

I don't know if it'll be two years, though. I mean, I actually think if we lock down the labs, since we have much better scientists and we're way ahead, it would be two years. But I think even six months or a year would make a huge difference.
And this gets back to the intelligence explosion dynamics. A year might be the difference between a system that's sort of human-level and a system that is vastly superhuman. It might be like five OOMs. I mean, even on the current pace, right? Take the MATH benchmark: three years ago, on these really difficult high-school competition math problems, we were at a few percent, couldn't solve anything. Now it's solved. And that was at the normal pace of AI progress. You didn't have a billion superintelligent researchers.
So a year is a huge difference. And then particularly after superintelligence, right? Once this is applied to lots of elements of R&D, once you get the industrial explosion with the robots and so on, I think a year, a couple of years, might be like decades' worth of technological progress.
And might, you know, again, it's like Gulf War I, right, 20, 30 years of technological lead, totally decisive. You know, I think it really matters.
The other reason it really matters is, suppose they steal the weights, suppose they steal the algorithms, and they're close on our tails. Suppose we still pull out ahead, right? We're just a little bit faster, we're three months ahead.
I think the sort of like world in which we're really neck and neck, you know, you only have a three month lead are incredibly dangerous, right? And we're in this feverish struggle where like if they get ahead, they get to dominate, you know, sort of maybe they'd get a decisive advantage. They're building clusters like crazy.
They're willing to throw all caution to the wind. We have to keep up.
There's some crazy new WMDs popping up. And then we're going to be in this situation where crazy new military technology, crazy new WMDs, deterrence and mutually assured destruction keep changing every few weeks.
And it's like, no, completely unstable, volatile situation. That is incredibly dangerous.
So I think, both because the technologies are dangerous and from the alignment point of view, it might be really important during the intelligence explosion to have the six months of wiggle room to say, look, we're going to dedicate more compute to alignment during this period, because we have to get it right and we're feeling uneasy about how it's going. And so I think, in some sense, one of the most important inputs to whether we will destroy ourselves or whether we will get through this incredibly crazy period is whether we have that buffer.

Before we go further object-level on this, I think it's very much worth noting that almost nobody, at least nobody I talk to, thinks about the geopolitical implications of AI. And I think I have some object-level disagreements that we'll get into, or at least things I want to iron out.
I may not disagree in the end. But the basic premise is that, obviously, if you keep scaling, and obviously if people realize that this is where intelligence is headed, it's not just going to be the same old world, where it's like, what model are we deploying tomorrow and what's the latest? Like, people on Twitter are like, oh, GPT-4o is going to shake your expectations or whatever.
You know, COVID is really interesting because within a year or something, when March 2020 hit, it became clear to the world, like, presidents, CEOs, media, the average person:
there are other things happening in the world right now, but the main thing we as a world are dealing with right now is COVID. Soon it'll be the same with AGI.
Yeah. Okay.
And then so... This is the quiet period.
You know, you want to go on vacation, you want to, yeah, maybe now's the last time you can have some kids. You know, my girlfriend sometimes complains that when I'm off doing work or whatever, I'm not spending time with her. She threatens to replace me with GPT-6 or whatever, and I'm like, GPT-6 will also be too busy doing research.

Okay, anyway, so let's answer the question of why other people aren't talking about the national security angle.

I made this mistake with COVID, right? In February of 2020, I thought it was just going to sweep the world, all the hospitals would collapse, and then it'd be over. And a lot of people thought this at the beginning of COVID.
They shut down their offices a month or whatever. I think the thing I just really didn't price in was the societal reaction.
Right. And and within weeks, you know, Congress spent over 10 percent of GDP on like COVID measures.
The entire country was shut down. It was, I don't know, I didn't price it in with COVID sufficiently.
I don't know, why do people underrate it? I mean, I think there's a way in which being in the trenches actually gives you a less clear picture of the trend lines. You only have to zoom out a little, like a few years, but you're in the trenches, you're trying to get the next model to work, and there's always something that's hard. For example, you might underrate algorithmic progress because you're like, ah, things are hard right now, or the data wall or whatever. But if you zoom out just a few years and actually try to count how much algorithmic progress was made in the last few years, it's enormous. But I also just don't think people think about this stuff. I think smart people really underrate espionage, right? And part of the security issue is that people don't realize how intense state-level espionage can be. Like, this Israeli company had software that could just zero-click hack any iPhone, right? They just put in your number, and then it's a straight download of everything. The United States infiltrated an air-gapped atomic weapons program, right? Wild. Intelligence agencies have stockpiles of zero-days. When things get really hot, I don't know, maybe we'll send special forces, right,
to go to the data center or something. Or, I mean, China does this: they threaten people's families, right? They're like, look, if you don't cooperate, if you don't give us the intel... There's a good book along these lines, Inside the Aquarium, which is by a Soviet GRU defector; the GRU was military intelligence. Ilya recommended this book to me. And reading it, I was just kind of shocked at how intense state-level espionage is.
The whole book was about like they go to these European countries and they try to get all the technology and recruit all these people to get the technology.
I mean, yeah, maybe one anecdote. So this eventual defector, you know, he's being trained.
He goes to the kind of GRU spy academy.
And so then to graduate from the spy academy, sort of before you're sent abroad, you kind of had to pass a test to show that you can do this. And the test was, you know, you had to, in Moscow, recruit a Soviet scientist and recruit them to give you information, sort of like you would do in the foreign country.
But of course, for whomever you recruited, the penalty for giving away sort of secret information was death. And so to graduate from the Soviet spy, this GRU spy academy, you had to condemn a countryman to death.
States do this stuff.

I started reading the book because I saw it in the series.
And I was actually wondering, the fact that you use this anecdote, and that it's a book recommended by Ilya, is this some sort of Easter egg?

We'll leave that as an exercise for the reader.
Okay, so... The beatings will continue until the morale improves.
Yeah. So, suppose that we live in the world in which these secrets are locked down, but China still realizes that this progress is happening in America.
So, in that world, especially if they realize, and I guess it's a very interesting open question. I mean, the secrets probably won't be locked down.
Okay, but suppose... We're probably going to live in the bad world.
Yeah. It's going to be really bad.
Hmm. Why are you so confident that they won't be locked down?

I mean, I'm not confident they won't be locked down, but it's just... it's not happening.

So tomorrow the lab leaders get the message. How hard is it, like, what do they have to do? Do they get more security guards, do they air-gap the... well, you know, what do they do?

So again, I think basically there are two reactions here, which are: we're already secure (you're not), and fatalism, it's impossible. I think the thing you need to do is stay ahead of the curve of basically how AGI-pilled the CCP is. So right now, you've got to be resistant to kind of normal economic espionage. They're not. I mean, I probably wouldn't be talking about this stuff if the labs were, right? Because I wouldn't want to wake the CCP up more. But they're not; this stuff is really trivial for them to do right now.
I mean, anyway, so they're not resistant to that. I think it would be possible for a private company to be resistant to it, right? You know, both of us have friends in the quantitative trading world, and I think those secrets are actually shaped kind of similarly. They've said, yeah, if I got on a call for an hour with somebody from a competing firm, most of our alpha would be gone; it's a list of details of really how to do it.

You're going to worry about that pretty soon.

Yeah, well, anyway. So all the alpha could be gone, but in fact their alpha persists, often for many years and decades. So this doesn't seem to happen. And so I think there's a lot you could do if you went from current startup security, where you can basically just look through the window and look at the slides, to good private-sector security: hedge funds, the way Google treats customer data or whatever.
That'd be good right now. The issue is, you know, basically the CCP will also get more AGI-pilled.
And at some point, we're going to face kind of the full force of, you know, the Ministry of State Security. And again, you were talking about smart people underrating espionage and the sort of insane capabilities of states.
I mean, this stuff is wild, right? There are papers showing you can find out where you are on a video game map just from sounds. States can do a lot with electromagnetic emanations. At some point, you've got to be working from a SCIF; your cluster needs to be air-gapped and basically be a military base. You need intense security clearance procedures for employees; all their stuff is monitored; they basically have security guards. You can't use other dependencies; it's all got to be intensely vetted, and all your hardware has to be intensely vetted.
And, you know, I think basically if they actually really face the full force of state-level espionage, I don't think this is a thing private companies can do. Both, I mean, empirically, right? Microsoft recently had executives' emails hacked by Russian hackers, and government emails they hosted hacked by state actors. But also, there's just a lot of stuff that only the people behind the security clearances know and only they deal with. And so, I think to actually resist the full force of espionage, you're going to need the government.
Anyway, so I think basically we could do it by always being ahead of the curve. I think we're just going to always be behind the curve.
And I think maybe unless we get the sort of government project. Okay, so going back to the naive perspective of we're very much coming at this from there's going to be a race in the CCP, we must win.
Listen, I understand bad people are in charge of the Chinese government, like the CCP and everything. But just stepping back in a sort of galactic perspective, humanity is developing AGI.
And do we want to come at this from the perspective of, we need to beat China to this? Our super intelligent Jupiter brain descendants won't know who China is. Like China will be some distant memory that they have, America too.
So shouldn't it be a more, the initial approach just come to them like, listen, this is super intelligence. This is something like we come from a cooperative perspective.
Why immediately rush into it from a hawkish, competitive perspective?

I mean, look, the thing I want to say is that a lot of the stuff I talk about in the series is primarily descriptive, right? And so on the China stuff, yeah, in some ideal world it's all merry cooperation. But again, I think people wake up to AGI. The issue, particularly on whether we can make a deal, whether we can make an international treaty, really relates to the stability of international arms control agreements. We did very successful arms control on nuclear weapons in the 80s, and the reason it was successful is that the new equilibrium was stable, right? You go down from, whatever, 60,000 nukes to 10,000 nukes.
Well, it's like, who cares, right? You know, like it's still mutually assured destruction. Suppose a rogue state kind of went from zero nukes to one nukes.
It's like, who cares? We still have way more nukes than you. I mean, it's still not ideal for destabilization, but it, you know, it'd be very different if the arms control agreement had been zero nukes, right? Because if it had been zero nukes, then it's just like one rogue state makes one nuke.
The whole thing is destabilized. Breakout is very easy.
Your adversary state starts making nukes. And so basically, when you're going down to very low levels of arms, or when you're in a very dynamic technological situation, arms control is really hard, because breakout is easy. There are some other stories about this from the 1920s and 1930s: all the European states had done disarmament, and Germany did this crash program to build the Luftwaffe, and that was able to massively destabilize things. Not because they were the first, but because they were able to pretty easily build a modern air force when the others didn't really have one.
And that really destabilized things. And so I think the issue with AGI and superintelligence is the explosiveness of it.
Right. So if you have an intelligence explosion, if you're able to go from AGI to superintelligence, and that superintelligence is decisive, either because a year later you develop some crazy WMD or because you have some super-hacking ability that lets you completely deactivate the enemy arsenal.
That means, suppose you're trying to put in a brake, like, we're both going to cooperate and we're going to go slower on the cusp of AGI or whatever. There is going to be such an enormous incentive to race ahead, to break out.
We're just going to do the intelligence explosion; if we can get three months ahead, we win. I think that makes basically any arms control agreement that comes in a situation where it's close very unstable.

That's really interesting. This is very analogous to a debate I had with Rhodes on the podcast, where he argued for nuclear disarmament. But if some country tries to break out and starts developing nuclear weapons, the six months or whatever that you would get is enough to get international consensus and invade the country and prevent them from getting nukes.
And I thought that was sort of... That's not a stable equilibrium.
It just seemed really tough. Yeah.
But on this, right? Maybe it's a bit easier, because you have AGI, and so you can monitor the other person's cluster or something. Data centers, you can actually see them from space. You can see the energy draw they're getting. As you were saying, there are a lot of ways to get information from an environment if you're really dedicated. And also, unlike nukes, where you have the submarines, the planes, the bunkers, the mountains, whatever, in so many different places, a data center, your 100-gigawatt data center, we can blow that shit up if we're concerned, right? Just some cruise missile or something.

Yeah, it's very vulnerable to sabotage. That gets to the insane vulnerability and volatility of this period post-superintelligence. Basically, you have the intelligence explosion, you have these vastly superhuman things on your cluster, but you haven't done the industrial explosion yet, you don't have your robots yet, you haven't covered the desert in robot factories yet.
And that is the crazy moment where, say the United States is ahead and the CCP is somewhat behind, there's actually an enormous incentive for a first strike, right? Because if they can take out your data center, they know you're about to have this commanding and decisive lead. They know, if we can just take out this data center, then we can stop it.
And they might get desperate. And so I think basically we're going to get into a position, I think it's going to be pretty hard to defend early on.
I think we're basically going to be in a position where we're protecting data centers with the threat of nuclear retaliation. Which maybe sounds kind of crazy, though.
Is this the inverse of the Eliezer thing? Are we going to tie the data centers to New York?

Nuclear deterrence for data centers. I mean, this is Berlin.
In the late 50s, early 60s, both Eisenhower and Kennedy multiple times made the threat of full-on nuclear war against the Soviets if they tried to encroach on West Berlin. It's sort of insane.
It's kind of insane that that went well. But basically, I think that's going to be the only option for the data centers. It's a terrible option. This whole scheme is terrible, right? Being in this neck-and-neck race at that point is terrible. And, you know, I have some uncertainty on how easy that decisive advantage will be. I'm pretty confident that if you have superintelligence and you have two years, you have the robots, you're able to get that 30-year lead. Look, then you're in this Gulf War I situation.
You have your like, you know, millions or billions of like mosquito-sized drones that can just take it out. I think there's even a possibility you can kind of get a decisive advantage earlier.
So there are these stories as well about colonization in the 1500s, where a few hundred Spaniards were able to topple the Aztec empire, and I think a couple of other empires as well. Each of these had a few million people, and it was not a godlike technological advantage.
It was some technological advantage. It was, I mean, it was some amount of disease.
And then it was kind of like cunning strategic play. And so I think there's a, there's a possibility that even sort of early on, you know, you haven't gone through the full industrial explosion yet.
You have superintelligence, but, you know, you're able to manipulate the opposing generals, claim you're allying with them.
Then you have you have some you know, you have sort of like some crazy new bioweapons.
Maybe maybe there's even some way to like pretty easily get a paradigm that like deactivates enemy nukes.
Anyway, so I think this stuff could get pretty wild.
Here's what I think we should do.
I really don't want this volatile period.
And so a deal with China would be nice. It's going to be really tough if you're in this unstable equilibrium.
I think, basically, we want to get in a position where it is clear that the United States, that a sort of coalition of democratic allies will win. It's clear the United States, it's clear to China.
That will require having locked down the secrets. That will require having built the 100 gigawatt cluster in the United States and having done the natural gas and doing what's necessary.
And then when it is clear that the democratic coalition is well ahead, then you go to China and you offer them a deal. China will know they're not going to win; they're going to be very scared of what's going to happen. We're going to know we're going to win, but we're also very scared of what's going to happen, because we really want to avoid this kind of breakneck race right at the end, where things could really go awry.
And, you know, and then and so then we offer them a deal. I think there's an incentive to come to the table.
I think there's a sort of more stable arrangement you can do. It's a sort of an atoms for peace arrangement.
And we're like, look, we're going to respect you. We're not going to use superintelligence against you. You can do what you want. You're going to get your slice of the galaxy.
We're going to benefit share with you. We're going to have some compute agreement where there's some ratio of compute that you're allowed to have, and that's enforced with opposing AIs or whatever.
And we're just not going to do this kind of volatile sort of WMD arms race to the death. And sort of it's like a new world order that's US-led, that's sort of democratic-led, but that respects China, lets them do what they want.
Okay, there's so much to... There's so much there.
First on the galaxies thing, I think it's just a funny anecdote, so I kind of want to tell it. And this, we were at an event, and I'm respecting Chatham House rules here.
I'm not revealing anything about it, but we were talking to somebody influential, or Leopold was. Afterwards, that person asked the group: Leopold told me that he's not going to spend any money on consumption until he's ready to buy galaxies. And the guy goes, I honestly don't know if he meant galaxies like the private plane brand, Galaxy, or the physical galaxies. And there was an actual debate. Like, he went away to the restroom and there was an actual debate among people who are very influential about what he could have meant by galaxies. And the other people, who knew you better, were like, no, he means the galaxies.
I mean, the galaxies. I mean, I think it'll be interesting.
I mean, yeah, I think there's a, I mean, there's two ways to buy the galaxies. One is like at some point, you know, it's like post superintelligence.
You know, there's some crazy...

I love this. Okay, so what happens is, he's out there.
I'm laughing my ass off. I'm not even saying it.
People are like having this debate. And then so Leopold comes back and the guy, somebody's like, oh, Leopold, we're having this debate about whether you meant you want to buy the galaxy or you want to buy the other thing.
And Leopold assumes they must mean not the private plane versus the actual galaxy, but rather: do you want to buy the property rights to the galaxy, or actually just send out the probes right now?

Exactly, exactly.
Oh, my God. All right.
Back to China. There's a whole bunch of things I could ask about that plan, about whether you're going to get credible, promised, you will get some part of galaxies, whether they care about that.
I mean, you have AIs to help you enforce stuff. Okay, sure.
We'll leave that aside. That's a different rabbit hole.
The thing I want to ask is... But it has to be the thing we need.
The only way this is possible is if we lock it down. I see.
If we don't lock it down, we are in this feverish struggle. The greatest peril mankind will have ever seen.
So, given that during this period, instead of just taking their chances, and they don't really understand how this AI governance scheme is gonna work, or how they're gonna check whether they'd actually get the galaxies. The data centers, they can't be built underground.
They have to be built above ground. Taiwan is right off the coast of us.
We need the chips from there. Why aren't we just gonna invade? Listen, worst case scenario is they win the superintelligence, which they're on track to do anyways.
Wouldn't this instigate them to either invade Taiwan or blow up the data center in Arizona or something like that? Yeah. I mean, look, I mean, you talked about the data center one and then, you know, you probably have to like threaten nuclear retaliation to protect that.
They might also just blow it up. There's also maybe ways they can do it without sort of attribution, right? Like you pay...
Stuxnet. Stuxnet.
Yeah. I mean, this is part of, we'll talk about this later, but I think we need to be working on the Stuxnet for the Chinese project. But the Taiwan thing: you know, I talk about AGI by '27 or whatever. Do you know about the "terrible 20s"? No? Okay, well, in Taiwan-watcher circles people often talk about the late 2020s as the maximum period of risk for Taiwan, because military modernization cycles and basically extreme fiscal tightening on the military budget in the United States over the last decade or two have meant that we're in this kind of trough in the late 20s in terms of overall naval capacity.
And, you know, that's sort of when China is saying they want to be ready. So there's some sort of parallel timeline there.
Yeah, look, it looks appealing to invade Taiwan. I mean, maybe not, because, you know, basically there's remote cutoff of the chips. And so then it doesn't mean they get the chips, it just means the machines are deactivated.
But look, imagine if during the Cold War all of the world's uranium deposits had been in Berlin. Berlin, as it was, almost caused a nuclear war multiple times. So God help us all.
Well, Groves had a plan after the war: America would go around the world getting the rights to every single uranium deposit, because they didn't realize how much uranium there was in the world and they thought this was feasible. Not realizing, of course, that there are huge deposits in the Soviet Union itself.
Right, right. East Germany too.
There's a lot of East German workers who kind of got screwed. Oh, interesting.
Got cancer. Okay, so the framing we've been talking about that we've been assuming, and I'm not sure I buy yet, is that the United States, this is our leverage, this is our data center, China is a competitor.
Right now, obviously, that's not the way things are progressing. Private companies control these AIs.
They're deploying them. It's a market-based thing.
Why will it be the case that it's like the United States has this leverage or is doing this thing versus China is doing this thing? Yeah, I mean, look, on the project, there's sort of descriptive and prescriptive claims or normative positive claims. I think the main thing I'm trying to say is, look, we're at these SF parties or whatever, and I think people talk about AGI, and they're always just talking about the private AI labs.
And I think I just really want to challenge that assumption.
It just seems like, seems pretty likely to me, you know, as we've talked about for reasons we've
talked about, that it looks like the national security state is going to get involved. And, you know, I think there are a lot of ways this could look, right? Is it nationalization? Is it a public-private partnership? Is it a kind of defense contractor-like relationship? Is it a sort of government project that soaks up all the people? And so there's a spectrum there.
But I think people are just vastly underrating the chances of this more or less looking like a government project. And look, do you think, when we have literal superintelligence on our cluster, right? And it's like, you have a hundred billion, sorry, a billion superintelligent scientists that can hack everything.
They can Stuxnet the Chinese data centers, they're starting to build the robo armies. You really think they'll be a private company and the government wouldn't be like, oh my God, what is going on? Yeah. Suppose there's no China.
Suppose there are people like North Korea who theoretically at some point will be able to do superintelligence, but they're not on our heels and they don't have the ability to be on our heels. In that world, are you advocating for the national project, or do you prefer the private path forward? Yes. I mean, two responses to this. One is, you still have Russia, you still have these other countries. You know, you've got to have Russia-proof security, right?
It's like you can't you can't just have Russia steal all your stuff.
And maybe their clusters aren't going to be as big, but they're still going to be able to make the crazy bioweapons and the mosquito-sized drone swarms, you know, and so on.
And so I think the security component is just actually a pretty large component of the project, in the sense that I currently do not see another way where we don't instantly proliferate this to everybody. And so, yeah, you still have to deal with Russia, Iran, North Korea, and, you know, Saudi and Iran are going to be trying to get it because they want to screw each other, and Pakistan and India because they want to screw each other.
There's like this enormous destabilization still. That said, look, I agree with you.
If somehow things had shaken out differently, and AGI had come in 2005, a moment of unparalleled American hegemony, I think there would have been more scope for less government involvement. But again, as we were talking about earlier, I think that would have been this very unique moment in history. And basically in almost all other moments in history, there would have been a great power competitor.
So, okay, so let's get into this debate.
So, my position here is if you look at the people who are involved in the Manhattan Project itself,
many of them regretted their participation, as you said.
Now, we can infer from that that we should sort of start off with a cautious approach to the nationalized ASI project. Then you might say, well, listen, obviously the super...
Did they regret their participation because of the project or because of the technology itself? I think they did regret it, but I think it's about the nature of the technology, not about the project. I think they also probably had a sense that different decisions would have been made if it wasn't some concerted effort that everybody had agreed to participate in.
That if it wasn't in the context of, we need to race to beat Germany and Japan, you might not develop it. So that's the technology part, but also, you wouldn't actually have hit them with... You know, it's the destructive potential, the military potential.
It's not because of the project. It is because of the technology.
And that will unfold regardless. You know, I think this underrates the power of modeling.
Imagine you go through the 20th century in, you know, a decade.
Yes, great. Let's just run that example.
Suppose there actually was some reason the 20th century would be run through in one decade. Do you think, because of that, the technologies that happened through the 20th century shouldn't have been privatized? That it should have been a more concerted, government-led project? You know, look, there is a history of dual-use technologies, right? And so I think AI, in some sense, is going to be dual use in the same way.
And so there's going to be lots of civilian uses of it, right? Like nuclear energy itself, right? It was like, you know, there's the government project developed the military angle of it. And then, you know, it was like, you know, then the government worked with private companies.
There's a sort of like real like flourishing of nuclear energy until, you know, the environmentalists stopped it. You know, planes, right? Like Boeing, right? Actually, you know, the Manhattan Project wasn't the biggest defense R&D project during World War II.
It was the B-29 bomber, right? Because they needed the bomber that had long enough range to reach Japan to destroy their cities. And then Boeing, which made that, made the B-47, made the B-52, you know, the plane the U.S.
military uses today. And then they used that technology later on to, you know, build the 707.
And sort of the... But what does later on mean in this context? Because in the other...
Like, I get what it means after a war to privatize. But if you have...
The government has ASI. Maybe just let me back up and explain my concern.
So you have the only institution in our society which has a monopoly on violence. And then we're going to give it, in a way that's not broadly deployed, access to the ASI.
The counterfactual, and this maybe sounds silly, but listen, we're going to go through higher and higher levels of intelligence. Private companies will be required by regulation to increase their security.
But they'll still be private companies and they'll deploy this and they're going to release the AGI. Now McDonald's and JP Morgan and some random startup are now more effective organizations because they have a bunch of AGI workers.
And it'll be sort of like the Industrial Revolution in the sense that the benefits were widely diffused. If you don't end up in a situation like that, then the...
I mean, even backing up, like what is it we're trying to... Why do we want to win against China? We want to win against China because we don't want a top-down authoritarian system to win.
Now, if the way to beat that is that the most important technology humanity will have has to be controlled by a top-down government, like, what was the point? Maybe we should run our cards with privatization.
That's the way we get to the classical liberal, market-based system we want for the ASIs. Yeah.
All right. So a lot to talk about here.
Yeah. I think, yeah, maybe I'll start a bit about like actually looking at what the private world would look like.
And I think this is part of where the sort of, there's no alternative comes from. And then let's look like, look at like what the government project looks like, what checks and balances look like, and so on.
All right, private world. First of all, a lot of people right now talk about open source.
And I think there's this sort of misconception that AGI development is going to be like, oh, it's going to be some beautiful decentralized thing and some giddy community of coders who get to collaborate on it. That's not how it's going to look, right? With a $100 billion, $1 trillion cluster, it's not going to be that many people that have it, or the algorithms. Right now open source is kind of good because people just use the stuff that was published, so the algorithms were published, or, you know, like Mistral, they just leave DeepMind, take all the secrets with them, and replicate it. But that's not going to continue being the case. And so, the open source alternative... I mean, also people say stuff like, 10^26, it'll be in my phone. No, it won't. Moore's law is really slow.
I mean, AI chips are getting better. But the $100 billion computer will not cost $1,000 within your lifetime or whatever.
So it's going to be, it's going to be like two or three, you know, big players on the private world. And so look, a few things.
So first of all, you talk about the enormous power that superintelligence will have and that the government will have. I think it's pretty plausible that the alternative world is that one AI company has that power, right? If we're talking about a lead, it's like, what, I don't know, OpenAI with a six-month lead.
And then you're talking about basically the most powerful weapon ever, and you're making this radical bet on a private company CEO as the benevolent dictator.
No, no, not necessarily. Like any other thing that's privatized, we don't count on them being benevolent. Just look, think for example of somebody who manufactures industrial fertilizer.
Yes. Right? This...
The person with this factory, if they went back to an ancient civilization, they could, like, blow up Rome. They could probably blow up Washington, D.C.
Indeed. And I think in your series, you talk about Tyler Cowen's phrase of muddling through.
And I think even with privatization, people sort of underrate that there are actually a lot of private actors who have the ability to like, there's a lot of people who control the water supply or whatever. And we can count on cooperation and market-based incentives to basically keep a balance of power.
Sure. I gather things are proceeding really fast.
Yes. But we have a lot of historical evidence that this is the thing that works best.
So look, I mean, what do we do with nukes, right? The way we keep the sort of nukes in check is not like, you know, a sort of beefed up Second Amendment, where like each state has their own like little nuclear arsenal. And like, you know, Dario and Sam have their own little nuclear arsenal.
No, no, it's like, it's institutions, it's constitutions, it's laws, it's courts. And so, so I don't actually I'm not sure that this, you know, I'm not sure that the sort of balance of power analogy holds.
In fact, you know, sort of the government having the biggest guns was sort of like an enormous civilizational achievement, right?
Like Landfrieden in the sort of Holy Roman Empire, right?
You know, if somebody from the town over kind of committed a crime on you, you know, you didn't kind of start a sort of a, you know, a big battle between the two towns.
No, you take it to a court of the Holy Roman Empire and they would decide.
And it's a big achievement.
Now, the thing about the industrial fertilizer, I think the key difference is speed and offense-defense balance issues, right? So it's like the 20th century compressed into 10 years, or a few years. That is an incredibly scary period.
And it is incredibly scary because you're going through this enormous array of destructive technology and an enormous amount of military advance. I mean, you would have gone from bayonets and horses to tank armies and fighter jets in a couple of years, and then to nukes and ICBMs and so on, just in a matter of years. And so it is that speed that creates, the way I think about it, this initial, just incredibly volatile, incredibly dangerous period. And somehow we have to make it through that.
And that's going to be incredibly challenging. That's where you need the kind of government project.
If you can make it through that, then you get to: now the situation has been stabilized, we don't face this imminent national security threat. It's like, yes, there were WMDs that came along the way, but either we've managed to have a sort of stable offense-defense balance, right, like I think bioweapons initially are a huge issue: an attacker can just create a thousand different synthetic viruses and spread them.
And it's like going to be really hard for you to kind of like make a defense against each, but maybe at some point you figure out the kind of like, you know, universal defense against every possible virus. And then you're in a stable situation and on the offense defense balance.
Or you do the thing you do with planes, where there are certain capabilities that the private sector isn't allowed to have. You've figured out what's going on, you restrict those, and then you can let the civilian uses go ahead.
So I'm skeptical of this because.
Well, sorry.
I mean, the other important thing is: so I talked about how maybe it's one company with all this power, and I think it is unprecedented, because the industrial fertilizer guy cannot overthrow the U.S. government. I think it's quite plausible that the AI company with superintelligence can overthrow the government.
Multiple AI companies, right? And I buy that one of them could be ahead, so it's not obvious that it'll be multiple.
I think, again, if there's a six-month lead, maybe there are two or three, I agree. But if there are two or three, then what you have is just a crazy race between these two or three companies. Whatever, Demis and Sam, they're just like, I don't want to let the other one win, and they're both developing their nuclear arsenals and the robot armies. It's just, also, come on, the government is not going to let these people... Is Dario going to be the one developing the super hacking Stuxnet and deploying it against the Chinese data center? The other issue, though, is it won't just be two or three. There'll be two or three, and it'll be China and Russia and North Korea, because in the private lab world there is no way they'll have security that is good enough.
I think we're also assuming that somehow if you nationalize it, the security, especially in the world where this stuff is priced in by the CCP, that now you've got it nailed down.
And I'm not sure why we would expect that to be the case. But on this...
The government's the only one who does this stuff. So if it's not Sam or Dario, we don't trust them to be a benevolent dictator or whatever.
Well, then we're just relying on corporate governance. So here we're counting on... Because if the worry is that you can cause a coup, the same capabilities are going to be true of the government project, right? And so the modal president in 2025, maybe Donald Trump, will be the person having the capabilities that you don't trust Sam or Dario to have. And why? Okay, I agree that I'm worried if Sam or Dario have a one-year lead on ASI, and in that world I'm concerned about this being privatized.
But in that exact same world, I'm very concerned about Donald Trump having the capability. And potentially for living in a world where the takeoff is slower than you anticipate.
In that world, I very much want the private companies. So in no part of this matrix is it obviously true that the government-led project is better than the private project.
Let's talk about the government project a little bit and checks and balances. In some sense, I think my argument is a sort of Burkean argument, which is: American checks and balances have held for over 200 years and through crazy technological revolutions.
You know, the US military could kill every civilian in the United States. But you could make that same argument:
The private-public balance of power has held for hundreds of years, and then, like... Corporate, but yeah, why has it held? Because the government has the biggest guns.
And never before has a single CEO or a random nonprofit board had the ability to launch nukes. And so, again, what is the track record of the government checks and balances versus the track record of the private company checks and balances? Well, the AI lab, you know, the first stress test went really badly, that didn't really work, you know? I mean, even worse in the sort of private company world.
So it's both like, it is not just the two, it is like the two private companies and the CCP and they just like instantly have all the shit. And then it's, you know, they probably won't have good enough internal control.
So it's like, not just like the random CEO, but it's like, you know, rogue employees that can kind of like use these super intelligences to do whatever they want. And this won't be true of the government? Like the rogue employees won't exist on the project? Well, the government actually like, you know, has decades of experience and like actually really cares about the stuff.
I mean, they deal with nukes, they deal with really powerful technology. This is the stuff that the national security state cares about. Again, let's talk about the government checks and balances a little bit. What are checks and balances in the government world? First of all, I think it's actually quite important that you have some amount of international coalition, and I talked about these two tiers before. Basically, I think the inner tier is modeled on the Quebec Agreement, right? This was Churchill and Roosevelt. They agreed secretly: we're going to pool our efforts on nukes, but we're not gonna use them against each other.
And we're not gonna use them against anyone else without the other's consent. And I think basically, look: bring in the UK, they have DeepMind; bring in the Asian states who have the chip supply chain; bring in more of the NATO, close democratic allies for talent and industrial resources.
So you have those checks and balances in terms of more international countries at the table. Somewhat separately, you then have the second tier of coalitions, which is the Atoms for Peace thing, where you go to a bunch of countries, including the UAE, and you're like, look, there's a deal similar to the NPT stuff: you're not allowed to do the crazy military stuff, but we're going to share the civilian applications. We're in fact going to help you and share the benefits in this new sort of post-superintelligence world order.
All right. US checks and balances, right? So obviously Congress is going to have to be involved, right? Appropriating trillions of dollars.
I think probably ideally you have, um, Congress needs to kind of like confirm whoever's running this. So you have Congress, you have like different factions of the government, you have the courts, I expect the First Amendment to continue being really important.
Maybe that sounds kind of crazy to people. But I actually think, again, these are institutions that have withstood the test of time in a really powerful way.
You know, eventually, this is why, honestly, alignment is important: the AIs, you program the AIs to follow the Constitution. And why does the military work? Generals are not allowed to follow unlawful orders.
They're not allowed to follow unconstitutional orders. You have the same thing for the AIs.
So what's wrong with this argument? Where you say, listen, maybe you have a point in the world where we have extremely fast takeoff. It's like one year from AGI to ASI.
Yeah. And then you have the years after ASI where you have this extraordinary explosion of technological progress.
Maybe you have a point. Yeah.
We don't know. You have these arguments.
We'll like get into the weeds on them about why that's a more likely world. But like maybe that's not the world we live in.
Yeah. And in the other world, I'm like very on the side of making sure that these things are privately held.
Now, why? I mean, I don't know. When you nationalize, that's a one-way function, you can't go back. Why not wait until we have more evidence on which of those worlds we live in? I think rushing the nationalization might be a bad idea while we're not sure. Okay, I'll just respond to that first. I mean, I don't expect us to nationalize tomorrow. If anything, I expect it to be kind of like COVID, where it happens kind of too late.
Like, ideally you nationalize early enough to actually lock stuff down. It'll probably be kind of chaotic, and you're going to be trying to do this crash program to lock stuff down, and it'll be kind of late.
It'll be kind of clear what's happening. We're not going to nationalize when it's not clear what's happening.
On the whole "historically, these institutions have held up well" point: first of all, they've actually almost broken a bunch of times.
This is the argument that some people make when they say we shouldn't be that concerned about nuclear war: listen, we've had the nuke for 80 years and we've been fine so far, so the risk must be low.
And then the answer to that is no, actually, it is a really high risk. And the reason we've avoided is like people have gone through a lot of effort to make sure that this thing doesn't happen.
I don't think that giving the government ASI without knowing what that implies is going through a lot of effort. And I think the base rate, like, you can talk about America. I think America is very exceptional, not just in terms of dictatorship, but every other country in history has had a complete drawdown of wealth because of war, revolution, or something. America is very unique in not having that. And the historical base rate, we're talking about great power competition.
I think that's something we haven't been thinking about the last 80 years, but it's really big. Dictatorship is also something that is just the default state of mankind.
And I think relying on institutions, which in an ASI world... It's fundamentally different: right now, if the government tried a takeover, it's much harder when it doesn't have ASI, right? There are people who have AR-15s and there are things that make it harder. It doesn't matter, it could crush the AR-15s.
No, I think it's actually pretty hard. The reason is Vietnam and Afghanistan were pretty hard.
They could just nuke the whole country. Yeah, yeah, I agree.
But like, I'm... They could.
I agree. It's similar with ASI.
Yeah, I think it's just easier if you have what you're talking about. But there are institutions, there are constitutions, there are legal restraints, there are courts, there are checks and balances.
The crazy bet is the bet on private company CEOs... The same thing, by the way, isn't the same thing true of nukes, where we have these institutional agreements about non-proliferation and whatever? And we're still very concerned about that being broken and somebody getting nukes.
And like, you should stay up at night worrying about that. It's a precarious situation.
But ASI is going to be a really precarious situation as well. And like, given how precarious nukes are, we've done pretty well.
And so what does privatization in this world even mean? I mean, I think the other thing is... Like, what happens after? I mean, the other thing, you know, because we're talking about whether the government project is good or not, and I have very mixed feelings about this as well. Again, I think my primary argument is: if you're at the point where this thing has vastly superhuman hacking capabilities, if you're at the point where this thing can develop crazy bioweapons, ones that are targeted, that could kill everybody but the Han Chinese, or that would wipe out entire countries, where you're talking about building robo armies, about drone swarms, again, the mosquito-sized drones that could take you out, you know.
The United States national security state is going to be intimately involved with this. And I think, again, a lot of what I think the government project looks like is basically a joint venture between the cloud providers, some of the labs, and the government. And so I think there is no world in which the government isn't intimately involved in this crazy period. At the very least, the intelligence agencies need to be running security for these labs.
So they're already kind of like, they're controlling everything, they're controlling access to everything. Then they're going to be like, probably again, if we're in this like really volatile international situation, like a lot of the initial applications, it'll, it'll suck.
It's not what I want to use ASI for. We'll be like trying to somehow stabilize this crazy situation.
Somehow we need to prevent proliferation of some crazy new WMDs and the undermining of mutually assured destruction vis-à-vis North Korea and Russia and China. And so I basically think, in your world, there's much more of a spectrum than you're acknowledging here.
And I think basically even the world in which it's private labs involves extremely heavy government involvement. And really what we're debating is what form of government project, but it is going to look much more like the national security state than anything.
It does look like a startup as it is right now. And I think that, yeah.
Look, I think something like that makes sense. If it's like the Manhattan Project, then I'm very worried, where it's like, this is part of the U.S. military. Whereas if it's more like, listen, you've got to talk to Jake Sullivan before you run the next training run...
It's like Lockheed Martin Skunk Works, part of the US military.
It's like they call the shots.
Yeah, I don't think that's great.
I think that's bad.
I think it would be bad if that happened with ASI.
And what is the scenario?
What is the alternative?
Okay, so it's closer to my end of the spectrum where, yeah, you do have to talk to Jake Sullivan before you can launch the next training cluster.
But there's many companies who are still going for it. Yeah.
And the government will be intimately involved in the security. Yeah.
But, like, three different companies are still trying to build it. Is Dario launching the Stuxnet attack? Yeah.
What do you mean? Sorry, Dario launching, okay.
Instead of Washington. Dario is deactivating the Chinese data centers.
I think this is similar to the story you could tell about, there's a lot of, like like literally big tech right now. Yeah.
I think Satya, if he wanted to, could probably get his engineers to figure out the zero days in Windows and how to infiltrate the president's computer so that we can... Maybe shut it down.
No, no, no. But right now, I'm saying Satya could do that, right? Because he knows the zero days.
What do you mean? The government wouldn't let them do that.
Yeah, I think there's a story you could tell where they could pull off a coup or whatever, but there are multiple companies. Okay, fine, I agree. I'm just saying, something closer to... So what's wrong with the scenario where there are multiple companies going for it, but the AI is still broadly deployed, and alignment works in the sense that you can make sure, at the system-prompt level, that it can't help people make bioweapons or something, but these are still broadly deployed? I mean, I expect the AIs to be broadly deployed, even if it's a government project. I mean, look, first of all, I think the Metas of the world, open-sourcing their AIs that are two years behind or whatever, are still going to, you know... And so there's going to be some question of either the offense-defense balance is fine, and so even if they open source two-year-old AIs, it's fine.
Or it's like, there's some restrictions on the most extreme dual use capabilities. Like, you know, you don't let private companies sell kind of crazy weapons.
And that's great. And that will help with the diffusion.
And, you know, you know, after the government project, you know, there's going to be this initial tense period. Hopefully that's stabilized.
And then look, yeah, like Boeing, they're going to go out and they're going to do all the flourishing civilian applications. And, you know, like nuclear energy, all the civilian applications will have their day.
I think part of my argument here is that-
And how does that proceed, right? Because in the other world, there are existing stocks of capital that are worth...
Yeah, the clusters, there'll still be Google clusters.
And so Google, because they got the contract from the government, they'll be the ones that control the ASI, but why are they trading with anybody else? Why is a random startup getting... It'll be the same companies that would be doing it anyway. But in this world, they're just contracting with the government, or they're DPA'd and all their compute goes to the government.
And, but in the world- It sounds like it's very natural. It's sort of how other dual-use tech has worked.
After you get the ASI and then we're building the robot armies and building fusion reactors or whatever, that's... Only the government will get to build robot armies.
Yeah, now I'm worried. Or like the fusion reactors and stuff.
That's what we do with nukes. It's the same situation we have today.
Because if you already have the robot armies and everything, the existing society doesn't have some leverage where it makes sense for the government to trade. Yeah, they do, in the sense that they have a lot of capital that the government wants, and there are other things.
Like why was Boeing privatized after? Government has the biggest guns. And the way we regulate it is institutions, constitutions, legal restraints.
Tell me what privatization looks like in the ASI world afterwards. Afterwards, like the Boeing example, right? It's like you have this government.
Who gets it? Like Google, Microsoft. And who are they selling it to? Like they already have the robot factory.
It's like, why are they selling it to us? They already have... they don't need our... this is chump change in the ASI world, because we didn't get the ASI broadly deployed throughout this takeoff. So we don't have the robots.
We don't have the fusion reactors or whatever decades of advanced science you're talking about. So what are they trading with us for? Trading with whom for? Everybody who was not part of the project.
They've got that technology that's decades ahead. Yeah, I mean, look, that's a whole other issue of, like, how does, like, economic distribution work or whatever? I don't know.
That'll be rough. Yeah, I think this is a worry.
Basically, I'm kind of like, I don't see the alternative. The alternative is you overturn a 500-year civilizational achievement of Landfrieden, and you basically instantly leak the stuff to the CCP, and either you barely scrape out ahead, but you're in this fever struggle.
You're proliferating crazy WMDs. It's this enormously dangerous situation, enormously dangerous on alignment, because you're in this crazy race at the end, and you don't have the ability to take six months to get alignment right.
The alternative is you aren't actually bundling your efforts to win the race against the authoritarian powers. And so, you know, I don't like it. I wish the thing we used the ASI for was to cure the diseases and do all the good in the world. But it is my prediction that in the end game.
What will be at stake will not just be cool products, but whether liberal democracy survives, whether the CCP survives, what the world order for the next century will be. And when that is at stake, forces will be activated that are way beyond what we're talking about now. And in the crazy race at the end, the national security implications will be the most important. You know, like World War II: yeah, nuclear energy had its day.
But in the initial kind of period, when, you know, when this technology was first discovered, you had to stabilize the situation, you had to get nukes, you had to do it right. And then, and then the civilian applications had their day.
I think a closer analogy to what this is, because nuclear, I agree that nuclear energy is a thing that happens later on, and it's like dual use in that way. But it's, it's something that happened like literally a decade after nuclear weapons were developed.
Yeah, because everything took longer, right? Whereas with AI, immediately all the applications are unlocked. And it's closer to, literally, I mean, this is an analogy people explicitly make in the context of AGI: assume your society had 100 million more John von Neumanns.
Yeah, yeah. And I don't think, like, if that was literally what happened, if tomorrow you just have 100 million more of them, the approach should have been, well, some of them will convert to ISIS and we need to be really careful about that.
And then, oh, you know, what if a bunch of them are born in China? And then, do we have to nationalize the John von Neumanns? I'm like, no, I think it will be generally a good thing. And I'd be concerned about one power getting all the John von Neumanns.
I mean, I think the issue is the bottling up, in an intensely short period of time, of this enormous unfolding of technological progress, of an industrial explosion. And I think we do worry about the 100 million John von Neumanns.
And it's like the rise of China. Why are we worried about the rise of China? Because it's a billion people and they're able to do a lot of industry and a lot of technology.
But this is like the rise of China times 100, because it's not just a billion people, it's a billion superintelligences doing crazy things, in a very short period.
Let's talk practically. Because if the goal is we need to beat China, part of that is protecting...
I mean, that's one of the goals, right? Yeah, I agree. One of the goals is to beat China.
And also just like manage this incredibly crazy scary period. Right.
So part of that is making sure we're not leaking algorithmic secrets to them. Yep.
Part of that is all...
Huh?
I mean building the trillion dollar cluster, right?
That's right.
Yeah, but like isn't your whole point that Microsoft can release corporate bonds that are...
I think Microsoft can do the like hundreds of billions of dollars cluster.
Yeah.
I think the trillion dollar cluster is closer to a national effort.
I thought that your earlier point was that American capital markets are deep and so forth.
They're pretty good. I mean, I think it's possible it's private. But it's gonna be like...
And by the way, at this point we have AGI that's rapidly accelerating productivity.
I think the trillion dollar cluster is going to be planned before the AGI. I think it's sort of like you get the AGI on the like 10 gigawatt cluster, like intelligent.
Maybe you have like one more year where you're kind of doing some final unhobbling to fully unlock it. Then you have the intelligence explosion.
And meanwhile, the trillion dollar cluster is almost finished. And then you do your superintelligence on your trillion dollar cluster, or you run it on your trillion dollar cluster. And by the way, you have not just your trillion dollar cluster but hundreds of millions of GPUs on inference clusters everywhere. And I think private companies in this world have the capital and can raise capital. Then will you need the government to force it to happen fast? Well, I was just about to ask: wouldn't it be the case that, we know companies are on track to be able to do this and beat China if they're unhindered by climate pledges or whatever?
Well, that's part of what I'm saying. If that's the case, if it really matters that we beat China, there's all kinds of practical difficulties of like, will the AI researchers actually join the AI effort? If they do, there's going to be three different teams at least who are currently doing pre-training on different companies.
Now who decides? At some point you're going to have to YOLO the hyperparameters of the trillion dollar cluster. Who decides that? Just merging extremely complicated research and development processes across very different organizations, this is somehow supposed to speed up America against the Chinese? Why don't we just let... Brain and DeepMind merged, and it was a little messy but it was fine. It was pretty messy, and it was also the same company, and also much earlier on in the process. Pretty similar, right? Different code bases and lots of different infrastructure and different teams, and it wasn't the smoothest of all processes, but, you know, DeepMind is doing, I think, very well.
I mean, look, you give the example of COVID, and the COVID example is, listen, we woke up to it, maybe it was late, but then we deployed all this money, and the government COVID response was a clusterfuck overall. And the only part of it that worked, I agree, Operation Warp Speed, was enabled by the government.
It was literally just giving the permission that you can actually do it. Well, no, it was also making the big contracts.
Advanced market commitments or whatever. But I agree.
But it was like fundamentally it was like a private sector led effort. Yeah.
That was the only part of COVID that worked. I mean, I think, again, I think the project will look closer to Operation Warp Speed.
And it's not even, I mean, I think you'll have all the companies involved in the government project. I'm not that sold that merging is that difficult.
You know, you select one code base and you run pre-training on GPUs with one code base, and then you do the second RL step on the other code base with TPUs. I think it's fine.
I mean, to the topic of like, will people sign up for it? They wouldn't sign up for it today. I think this would be kind of crazy to people.
But also, this is part of the secrets thing, people gossiping at parties or whatever. I don't think anyone has really gotten up in front of these people and been like, look, the thing you're building is the most important thing for the national security of the United States, for whether the free world will have another century ahead of it. The thing you're doing is really important, for your country, for democracy. And don't talk about the secrets, and it's not just about, oh, DeepMind or whatever, it's about these really important things. And so, I don't know, again, we're talking about the Manhattan Project, right? This stuff was really contentious initially, but at some point it was clear that this stuff was coming, it was clear that there was real exigency on the military and national security front.
And I think a lot of people will come around. On whether it will be competent, I agree.
I mean, this is, again, where a lot of the stuff is more predictive, in the sense that I think this is reasonably likely, and I think not enough people are thinking about it. A lot of people think about AI lab politics or whatever, but nobody is really thinking about the project. Should I make you more pessimistic about it? Like, we don't have a plan for it.
We need to do it very soon because AGI is upon us. Yeah.
Then, fuck, the only capable, competent technical institutions capable of making AI right now are private companies. So they'll play that leading role.
It'll be a sort of a partnership, basically. But, you know, the other thing is, again, we talked about World War Two, and American unpreparedness at the beginning of World War II was a complete shambles, right? And so there is a sort of, I think America has a very deep bench of just incredibly competent managerial talent.
You know, I think there are a lot of really dedicated people. And basically an Operation Warp Speed-style public-private partnership, something like that, is sort of what I imagine it would look like.
Yeah. I mean, the recruiting the talent is an interesting question because the same sort of thing where initially for the Manhattan Project, you had to convince people we've got to beat the Nazis and you got to get on board.
I think a lot of them maybe regretted how much they accelerated the bomb. And I want, I think this is generally a thing with war where- I mean, I think they're also wrong to regret it, but.
Yeah, why? What's the reason for regretting it? I think there's a world in which you don't have, the way in which nuclear weapons were developed after the war was pretty explosive because there was a precedent that you actually can use nuclear weapons. Then because of the race that was set up, you immediately go to the H-bomb.
I mean, I think my view is, again, this is related to the view on AI and maybe some of our disagreement is like, that was inevitable. Like, of course, like, you know, there's this, you know, world war.
And then obviously there was the, you know, Cold War right after, of course, like, you know, the military and technology angle of this would be like, you know, pursued with ferocious intensity. And I don't really think there's a world in which that doesn't happen.
We're just like, ah, we're all not going to build nukes. And also just like nukes went really well.
I think that could have gone terribly, right? Again, I think this is not physically possible with nukes, this sort of pocket nukes for everybody. But compare WMDs that are proliferated and democratized, where all the countries have them, against the US leading on nukes and then building this new world order that was US-led, or at least a few great powers, and a non-proliferation regime for nukes, a partnership and a deal that's like, look, no military application of nuclear technology, but we're gonna help you with the civilian technology.
We're gonna enforce safety norms on the rest of the world. That worked, and it could have gone so much worse. Okay, so zooming out, I mean, I say this a bit in the piece, but actually the A-bomb, at Hiroshima and Nagasaki, was just like the firebombing. Yeah, I think the thing that really changed the game was the super, you know, the H-bombs and ICBMs. And then I think that's really when it took it to a whole new level.
I think part of me thinks when you say we will tell the people that for the free world to survive, we need to pursue this project. It sounds similar to World War II.
So World War II is a sad story, obviously, the fact that it happened. But also like the victory is sad in the sense that Britain goes in to protect Poland.
And at the end, the USSR, which, as your family knows, is incredibly brutal, ends up occupying half of Europe. And part of the pitch is, we're protecting the free world.
That's why we've got to rush the AI. And if we end up with the American AI Leviathan, I think there's a world where we look back on this and it has the same sort of twisted irony that Britain going into World War II to protect Poland had.
Look, I mean, I think there's going to be a lot of unfortunate things that happen. I'm just like, I'm just hoping we make it through.
I mean, to the point of it's like, I really don't think the pitch will only be the sort of like, you know, the race. I think the race will be sort of a backdrop to it.
I think the general, look, it's important that democracies shape this technology, we can't just leak this stuff to North Korea, is going to be important.
I think also just for safety, including alignment, including the creation of new WMDs, I'm not currently sold there's another path. If you just have the breakneck race, both internationally, because you're just instantly leaking all the stuff including the weights, and the commercial race, Demis and Dario and Sam all want to be first, it's incredibly rough for safety. And then you say, okay, safety regulation. But the safety regulation people talk about is like, oh well, NIST, and they take years and figure out what the expert consensus is and then write some guidelines. But I think the alignment angle during the intelligence explosion is not going to be a process of years of bureaucracy where you then write some standards. I think it looks much more like a war, and you have a fog of war. It's like, look, is it safe to do the next OOM? And it's like, ah, we're three OOMs into the intelligence explosion.
We don't really understand what's going on anymore. A bunch of our generalization scaling curves are looking not great.
You know, some of our automated AI researchers that are doing alignment are saying it's fine, but we don't quite trust them. In this test the AIs started doing naughty things, and ah, but then we hammered it out and then it was fine. And, ah, should we go ahead? Should we take another six months? Also, by the way, China just stole the weights, or they're about to deploy the robo army. What do we do? I think it is this crazy situation, and basically you're relying much more on a sane chain of command than you are on some deliberative regulatory scheme. I wish you were able to do the deliberative regulatory scheme. And this is the thing about the private companies too. They all claim they're gonna do safety, but I think it's really rough when you're in the commercial race and they're startups, and startups are startups. You know, I think they're not fit to handle WMDs.
Yeah, I'm coming closer to your position. But part of me also, so with the responsible scaling policies, I was told by people who are advancing that, that the way to think about this, because they know I'm like a libertarian type of person.
Yeah, yeah, yeah. And the way they approached me about it was that fundamentally, this is a way to protect market-based development of AGI in the sense that if you didn't have this at all, then you would have this sort of misuse and then it would have to be nationalized.
And the RSPs are a way to make sure that through this deployment, you can still have a market-based order, but then there's these safeguards that make sure that things don't go off the rails. And I wonder if the...
It seems like your story seems self-consistent, but it does feel... I know this was never your position, so I'm not like, I'm not looping you into this, but sort of a Martin Bailey almost in the sense of.
Well, look, here's what I think about RSP-type stuff, or the safety regulation that's happening now. I think it's important for helping us figure out what world we're in and flashing the warning signs, right? And so the story I've been telling is sort of what I think the modal version of this decade is, but there are lots of ways it could be wrong. We should talk about the data wall more. Again, I think there's a world where this stuff stagnates, there's a world where we don't have AGI. And so basically, the RSP thing is preserving the optionality: let's see how this stuff goes, but we need to be prepared. If the red lights start flashing, if we're getting the automated AI researcher, then it's crunch time.
Then it's time to go. I think I can be on the same page on that.
We should have a very, very strong prior on pursuing a market-based way unless you're right about what the explosion looks like, the intelligence explosion. Don't move yet, but in that world where it really does seem like Alec Radford can be automated and that is the only bottleneck to getting ASI.
Okay, I think we can leave it at that.
I can, yeah, I'm somewhat of the way there.
Okay, okay.
Yeah, I hope it goes well. It's going to be very stressful.
And again, right now is the chill time.
Enjoy your vacation while it lasts.
It's funny to look out over... this is San Francisco.
Yeah. OpenAI is right there. You know, Anthropic's there.
I mean, again, this is kind of like, you guys have this enormous power over how it's going to go for the next couple of years, and that power is depreciating.
Yeah. Who is you guys? Like, you know, people at labs.
Yeah, yeah, yeah. But it is a sort of crazy world.
And you're talking about like, you know, I feel like you talk about like, oh, maybe they'll nationalize too soon. It's like, you know, almost nobody like really like feels it, sees what's happening.
And I think this is the thing that I find stressful about all this stuff. Look, maybe I'm wrong. But if I'm right, we're in this crazy situation where there are like a few hundred guys that are paying attention, and it's daunting. I went to Washington a few months ago, and I was talking to some people who are doing AI policy stuff there, and I was asking them how likely they think nationalization is. Yeah.
And they said, oh, you know, like,
it's really hard to nationalize stuff.
It's been a long time since we've done it.
There's these very specific procedural constraints on what kinds of things can be nationalized.
And then I asked, well, ASI,
so does that mean, because there are constraints
in the Defense Production Act or whatever,
that won't be nationalized?
The Supreme Court would overturn that? And they were like, yeah, I guess that would be nationalized.
That's the short summary of my post or my view on the project. Okay, so before we go further on the AI stuff, let's just back off.
Okay.
We began the conversation, I think people would be confused.
You graduated valedictorian of Columbia when you were 19.
Uh-huh.
So you got to college when you were 15.
Right.
And you were in Germany, then you got to college at 15.
Yeah.
How the fuck did that happen?
I really wanted out of Germany.
Mm-hmm.
I, you know, I went to kind of a German public school.
It was not a good environment for me.
In what sense?
It's just like no peers?
Yeah, look, I mean, it wasn't, yeah.
There's also just a sense in which sort of like,
there's this particularly sort of German cultural sense.
I think in the US, you know,
there's all these like amazing high schools
and like sort of an appreciation of excellence. And in Germany, there's really this sort of tall poppy syndrome almost, right? Where, you know, you're the curious kid in class and you want to learn more, and instead of the teacher being like, ah, that's great,
they kind of resent you for it. And they're like trying to crush you.
I mean, there's also no kind of elite universities for undergraduates, which is kind of crazy. Basically, the meritocracy was kind of crushed in Germany at some point.
Also, I mean, there's a sort of incredible sense of complacency across the board. I mean, one of the things that always puzzles me is that even just going to a U.S. college was this kind of radical act. And it doesn't seem radical to anyone here, because it's like, ah, this is obviously the thing you do: you can go to Columbia, you go to Columbia. But it is very unusual, and it's wild to me, because this is where stuff is happening, you can get so much better an education, and America is where all the stuff is, and people don't do it. So anyway, I skipped a few grades, and I think at the time it seemed very normal to me to go to college at 15 and come to America. Now one of my sisters is turning 15, and I look at her, and now I understand how my mother felt.
And as you got to college, you were presumably the only 15-year-old.
Yeah.
Was it just, like, normal for you to be a 15-year-old? What were the initial years like at the time?
Yeah. So again, it's like, now I understand why my mother was worried. And, you know, I worked on my parents for a while, and eventually I persuaded them. But yeah, it felt very normal at the time.
And it was great. It was also great because I actually really liked college.
And in some sense, it sort of came at the right time for me. Where, you know, I mean, for example, I really appreciated the sort of like liberal arts education.
And, you know, like the core curriculum and reading sort of core works of political philosophy and literature.
And you did what, econ?
I mean, my majors were statistics and economics. But, you know, Columbia has a sort of pretty heavy core curriculum and liberal arts education. And honestly, I shouldn't have done all the majors. The best courses were the courses where there's some amazing professor and it's some history class. That's honestly the thing I would recommend people spend their time on in college.
Was there one professor or class that stood out that way?
A few. There's a class by Richard Betts on war, peace, and strategy. Adam Tooze is obviously fantastic, and has written very riveting books.
Yeah, you should have him on the podcast, by the way. I've tried.
I think you've tried for me. Yeah, you've got to get him on the pod, man.
Oh, it'd be so good. Okay.
So then in a couple of years... We were talking to Tyler Cowen recently, and he said that the way he first encountered you was you wrote this paper on economic growth and existential risk. And he said, when I read it,
I couldn't believe that a 17-year-old had written it.
I thought if this was an MIT dissertation, I'd be impressed.
So you were like, how did you go from,
I guess you were a junior then,
you're writing pretty novel economic papers.
Why did you get interested in this kind of thing, and what was the process to get into it?
I don't know, I just get interested in things. In some sense it feels very natural to me: I get excited about a thing, I read about it, I immerse myself. I think I can learn information very quickly and understand it. On the paper, I mean, one thing, at least for the way I work, I feel like moments of peak productivity matter much more than average productivity. There are some jobs, you know, like CEO or something, where average productivity really matters. But I often feel like I have periods, a couple of months, where there's a sort of efflorescence, and the other times I'm sort of computing stuff in the background. And at some point, you know, like writing the series, this is also kind of similar.
And it's just like you, you write it and, and it's, it's like, it's really flowing. And, um, that's sort of what ends up mattering.
I think even for CEOs, it might be the case that peak productivity is very important. Following Chatham House rules: one of our friends in a group chat has pointed out how many famous CEOs and founders have been bipolar, manic. Right, right, right.
Which is very much the peak. Like, the call option on your productivity is the most important thing.
And you get it by just increasing the volatility through bipolar. Uh-huh.
Okay, so that's interesting. And so you get interested in economics first.
First of all, why economics? Like, you could have read about anything at this moment. You kind of got a slow start on ML, right? You wasted all these years on econ.
There's an alternative world. You're like on the super alignment team at 17 instead of 21 or whatever it was.
Oh, no. I mean, in some sense, I'm still doing economics, right? You know, what is a graph? I'm looking at the log-log plots and figuring out what the trends are and thinking about the feedback loops and equilibrium arms control dynamics.
I think it is a way of thinking that I find very useful. Dario and Ilya seeing scaling early, in some sense, that is a sort of very economic way of thinking. It's also the sort of empirical-physics way, you know; a lot of them are physicists.
I think the economists usually can't code well enough and that's their issue.
But I think it's that sort of way of thinking. I mean, the other thing is, I thought a lot of the sort of core ideas of economics were just beautiful. And, you know, in some sense I feel like I was a little duped, where it's like, actually, econ academia is kind of decadent now. You know, for example, the paper I wrote, it's a long paper, it's 100 pages of math or whatever.
I think the core takeaway I can, you know, kind of give the core intuition for in like, you know, 30 seconds and it makes sense. And it's like, you don't actually need the math.
I think that's the sort of the best pieces of economics are like that where you do the work but you do the work to kind of uncover insights that weren't obvious to you before. Once you've done the work it's like some sort of like mechanism falls out of it that like makes a lot of crisp intuitive sense that like explains some facts about the world that you can then use in arguments.
I think, you know, a lot of econ 101 is like this, and it's great. A lot of econ in the 50s and the 60s was like this. And, you know, Chad Jones papers are often like this.
I really like Chad Jones papers for this. You know, I think, you know, why did I ultimately not pursue econ academia was a number of reasons.
One of them was Tyler Cowen. You know, he kind of took me aside and he was kind of like, look, I think you're one of the top young economists I've ever met, but also you should probably not go to grad school.
Oh, interesting. Really? I didn't realize that.
Well, yeah, and it was good because he kind of introduced me to the, I don't know, like the Twitter weirdos. And I think the takeaway from that was kind of, you know, got to move out West.
Wait, Tyler introduced you to the Twitter weirdos? A little bit, yeah. Or just kind of like the sort of...
It took the 60-year-old economist to introduce you to that Twitter?
Yeah, well, you know, I had been... I went from Germany, completely on the periphery, to a US elite institution, and sort of got some vibe of the sort of meritocratic elite US society.
And then sort of, yeah, basically there was a sort of trajectory from there to being like, look, find the true American spirit.
I got to come out here. But the other reason I didn't become an economist was because, or at least econ academia, was because I think sort of econ academia has become a bit decadent.
Maybe it's just ideas getting harder to find, and maybe it's that the sort of beautiful, simple things have been discovered. But what are econ papers these days? It's like 200 pages of empirical analysis on what happened to educational outcomes when Wisconsin bought 100,000 more textbooks.
And I'm really happy that work happened.
I think it's important work, but I think it is not covering these sort of fundamental insights and mechanisms in society.
Or even the theory work is kind of like, here's a really complicated model. And the model spits out, you know, if the Fed does X, you know, then Y happens.
You have no idea why that happened because it's like gazillion parameters and they're all calibrated in some way. And it's some computer simulation.
You have no idea about the validity, you know. Yeah.
So I think the sort of, you know, the most important insights are the ones where you have to do a lot of work to get them. But then there's this crisp intuition.
Yeah. The P versus NP of, uh, sure.
Yeah. Um, that's really interesting.
So just going back to your time in college: you say that peak productivity kind of explains this paper and things, but being valedictorian, getting straight A's or whatever, is very much an average-productivity phenomenon, right?
So there's one award for the highest GPA, which I won, but the valedictorian is, like, among the people who have the highest GPA, and then selected by faculty.
Okay.
Yeah, so it's not just peak productivity. It's just... I generally just love this stuff. I was curious, and I thought it was really interesting, and I loved learning about it, and it made sense to me, and it was very natural. I think one of my faults is I'm not that good at eating glass or whatever; I think there are some people who are very good at it. The sort of moments of peak productivity come when I'm just really excited and engaged and love it. And, you know, if you like the courses, that's what you get in college.
Yeah, yeah.
It's the Bruce Banner quote in Avengers. You know, "I'm always angry."
I'm always excited. I'm always curious.
That's why it's always peak productivity. So it's interesting, by the way, when you were in college, I was also in college.
I think you were, despite being a year younger than me, ahead of me in college, maybe two years ahead. And we met around this time.
Yeah, yeah, yeah. We also met, I think, through the Tyler Cowen universe.
Yeah, yeah, yeah. And it's very insane how small the world is.
Yeah. I think I, did I reach out to you? I must have.
Yes, I don't know. About when I had a couple of videos and they had a couple hundred views or something.
Yeah, it's a small world. I mean, this is the crazy thing about the AI world, right? It's kind of like, it's the same few people at the SF parties, and they're the ones, you know, running the models at DeepMind and OpenAI and Anthropic. And, I mean, I think some other friends of ours have mentioned this, who are now later in their career and very successful, that they actually met all the people who are also very successful in Silicon Valley now back when they were in their early 20s. I mean, look, why is it a small world? I think one of the things is some amount of, you know, some sort of agency.
And I think in a funny way, this is a thing I sort of took away from the sort of Germany experience where it was, I mean, look, I, it was crushing. I really didn't like it.
And it was such an unusual move to kind of skip grades, and such an unusual move to come to the United States. And, you know, these things I did were unusual moves.
There's some amount where just trying to do it, and then it was fine and it worked, reinforced that you don't just have to conform to what the Overton window is. You can just try to do the thing; you know, most people can be wrong.
I don't know, things like that. And I think that was kind of a, you know, valuable kind of like early experience that was sort of formative.
Okay, so after college, what did you do? I did econ research for a little bit, you know, in Oxford and stuff. And then I worked at Future Fund.
Yeah. Okay, so tell me about it.
Yeah, yeah, yeah. It was a foundation that was funded by Sam Bankman-Fried. I mean, we were our own thing; we were based in the Bay. At the time, this was in sort of early '22, it was just this incredibly exciting opportunity, right? It was basically like a startup foundation, which doesn't come along that often, that we thought would be able to give away billions of dollars, that we thought would be able to kind of remake how philanthropy is done from first principles, that we thought would be able to have this great impact. The causes we focused on were biosecurity, AI, finding exceptional talent and putting them to work on hard problems.
And, you know, like a lot of the stuff we did, I was I was really excited about, you know, like academics who would, you know, usually take six months would send us emails like, ah, you know, this is great. This is so quick and, you know, and straightforward.
You know, in general, I often find that with a little bit of encouragement, a little bit of sort of empowerment, kind of removing excuses, making the process easy,
you can get people to do great things. I think on the Future Fund, the other thing, for context, for people who might not realize: not only were you guys planning on deploying billions of dollars, but it was a team of four people.
Yeah, yeah, yeah. So you at 18 are on a team of four people that is in charge of deploying billions of dollars.
Yeah. I mean, just I mean, yeah.
And Future Fund, you know... yeah, I mean, so that was sort of the heyday, right?
And then obviously, in sort of November of '22, it was revealed that Sam was this giant fraud, and from one day to the next the whole thing was gone.
That was just really tough. I mean, obviously it was devastating, devastating obviously for the people that had their money on FTX. Closer to home, all these grantees, we wanted to help them, and we thought they were doing amazing projects, but instead of helping them we ended up saddling them with a giant problem. Personally, you know, it was a startup, right? And so I'd worked 70-hour weeks every week for basically a year on this to build it up. We were a tiny team. And then from one day to the next it was all gone, and not just gone, it was associated with this giant fraud. And so that was incredibly tough.
Yeah. And then were there any signs early on that SBF was...?
Yeah, and look, obviously I didn't know he was a fraud. I would have never worked there if I did. And, you know, we were a separate thing; we weren't working with the business. I mean, I do think there are some takeaways for me. I think one takeaway was, I had this tendency, I think people in general have this tendency, to kind of give successful CEOs a pass on their behavior because, you know, they're successful CEOs and that's how they are.
And that's just successful CEO things. And, you know, I didn't know Sam Bankman-Fried was a fraud, but I knew SBF and I knew he was extremely risk-taking.
Right. I knew he he was narcissistic.
He didn't tolerate disagreement well. You know, sort of by the end, he and I just didn't get along well.
And sort of I think the reason for that was like there's some biosecurity grants he really liked because they're kind of cool and flashy. And at some point I'd kind of run the numbers and it didn't really seem that cost effective.
And I pointed that out, and he was pretty unhappy about that. And so I knew his character.
And I think one takeaway for me was that it's really worth paying attention to people's character, including people you work for and successful CEOs. And, you know, that can save you a lot of pain down the line.
Okay, so after that, FTX implodes and you're out. And then you got into, you went to OpenAI, the super alignment team had just started.
I think you were like part of the initial team. And so what was the original idea? What was compelling about that for you to join? Yeah, totally.
So, I mean, what was the goal of the super alignment team? You know, the alignment team at OpenAI, you know, other labs sort of like several years ago kind of had done sort of basic research and they developed RLHF, reinforcement learning from human feedback. And that was sort of a, you know, ended up being really successful technique for controlling sort of current generation of AI models.
What we were trying to do was basically be the basic research bet to figure out what is the successor to RLHF. And the reason we needed that is, basically, RLHF probably won't scale to superhuman systems.
RLHF relies on sort of human raters who kind of thumbs-up, thumbs-down: you know, the model said something, it looks fine, looks good to me. At some point, you know, the superhuman models, the superintelligence, it's going to write, you know, a million lines of, you know, crazy complex code.
You don't know at all what's going on anymore. And so how do you kind of steer and control these systems? How do you add side constraints? You know, the reason I joined was I thought this was an important problem and I thought it was just a really solvable problem, right? I thought this was basically, you know, I think there's, I still do, I mean, even more so do.
I think there's a lot of just really promising sort of ML research on alignment, on sort of aligning superhuman systems. And maybe we should talk about that a bit more later.
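A heavily simplified sketch of the reward-modeling step that RLHF builds on, the human thumbs-up/thumbs-down comparison described above. This is an illustrative toy, not OpenAI's implementation; the model, data, and dimensions are made-up placeholders.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

# Toy reward model trained on pairwise human preferences (the thumbs-up /
# thumbs-down signal RLHF relies on). Real systems use a language-model
# backbone; a small MLP over fixed-size "response embeddings" stands in here.
class RewardModel(nn.Module):
    def __init__(self, dim=128):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, 1))

    def forward(self, x):                 # x: (batch, dim) response embedding
        return self.net(x).squeeze(-1)    # scalar reward per response

reward_model = RewardModel()
opt = torch.optim.Adam(reward_model.parameters(), lr=1e-3)

# Fake preference data: the human rater preferred response A over response B.
preferred = torch.randn(32, 128)
rejected = torch.randn(32, 128)

# Bradley-Terry loss: push reward(preferred) above reward(rejected).
loss = -F.logsigmoid(reward_model(preferred) - reward_model(rejected)).mean()
loss.backward()
opt.step()

# The learned reward then scores policy outputs during RL fine-tuning. The
# scaling worry: once outputs are millions of lines of complex code, human
# raters can no longer produce reliable preferred/rejected labels at all.
```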
But so- It was so solvable, you solved it in a year. It's all over now.
It's all quite tough. Anyway, so look, OpenAI wanted to do this really ambitious effort on alignment, and Ilya was backing it, and I liked a lot of the people there, and so I was really excited. You know, on alignment there are always a lot of people kind of making hay about it, and I appreciate people highlighting the importance of the problem, but I was just really into: let's just try to solve it. Let's do the ambitious effort, the Operation Warp Speed for solving alignment. And it seemed like an amazing opportunity to do so.
Okay. And now basically the team doesn't exist. I think the head of it has left; yeah, both heads of it have left, Jan and Ilya. That was the news last week.
What happened? Why did the thing break down? I think OpenAI sort of decided to take things in a somewhat different direction. Meaning what? I mean, that super alignment isn't the best way to frame the...
No, I mean, look, obviously, sort of after the November board events, you know, there were personnel changes. I think Ilya leaving was just incredibly tragic for OpenAI.
And, you know, I think some amount of reprioritization, I think some amount of, you know, I mean, there's been some reporting on the super alignment compute commitment. You know, there's this 20% compute commitment as part of, you know, how a lot of people were recruited.
You know, it's like, we're going to do this ambitious effort on alignment and, you know, some amount of, you know, not keeping that and deciding to go in a different direction. Okay, so now Jan has left, Elia has left.
So this team itself has dissolved, but you were the sort of first person who left or was forced to leave. You were, the information reported that you were fired for leaking.
What happened? Is this accurate? Yeah. Look, why don't I, why don't I tell you what they claim I leaked and you can tell me what you think.
Yeah. So OpenAI did claim to employees that I was fired for leaking.
And, you know, I and others have sort of pushed them to say what the leak is. And so here's their response in full.
You know, sometime last year, I had written a sort of brainstorming document on preparedness, on safety and security measures we need in the future on the path to AGI. And I'd shared that with three external researchers for feedback.
So that's it. That's the leak.
You know, I think for context, it was totally normal at OpenAI at the time to share sort of safety ideas with external researchers for feedback. You know, it happened all the time.
You know, the doc was sort of my ideas. Before I shared it, I reviewed it for anything sensitive.
The internal version had a reference to a future cluster, but I redacted that for the external copy. There's a link in there to some slides of mine, internal slides, but that was a dead link to the external people I shared it with.
The slides, the slides weren't shared with them. And so, obviously, I pressed them to sort of tell me what is the confidential information in this document.
And what they came back with was a line in the doc about planning for AGI by 2027/28 and not setting timelines for preparedness. You know, I wrote this doc a couple of months after the super alignment announcement.
We had put out, you know, this sort of four year planning horizon. I didn't think that planning horizon was sensitive.
You know, it's the sort of thing Sam says publicly all the time. I think John said it on your podcast a couple of weeks ago.
Um, anyway, so that's it. That's it.
So that's pretty thin for, uh, if, if the cause was leaking, that seems pretty thin. Was there anything else to it? Yeah.
I mean, so that was, that was the leaking claim. I mean, you can say a bit more about sort of what happened.
Yeah. So one thing was, last year I had written an internal memo about OpenAI's security.
I thought it was, you know, egregiously insufficient. I thought it wasn't sufficient to protect against the theft of model weights or key algorithmic secrets from foreign actors.
So I wrote this memo. I shared it with a few colleagues, a couple of members of leadership
who sort of mostly said it was helpful. But then a couple of weeks later, a sort of major security incident occurred.
And that prompted me to share the memo with a couple of members of the board. And after I did that, days later, it was made very clear to me that leadership was very unhappy with me having shared this memo with the board. Apparently the board had hassled leadership about security. And then I got sort of an official HR warning for this memo, for sharing it with the board. The HR person told me it was racist to worry about CCP espionage.
And they said it was sort of unconstructive. And, you know, look, I think I probably wasn't at my most diplomatic.
You know, I definitely could have been more politically savvy. But, you know, I thought it was a really, really important issue.
And, you know, the security incident had made me really worried. Anyway, so I guess the reason I bring this up is, when I was fired, it was made very explicit that the security memo was a major reason for my being fired. You know, I think it was something like: the reason that this is a firing and not a warning is because of the security memo.
But you were sharing it with the board.
The warning I'd gotten for the security memo. Anyway, the other thing that might also be helpful context is the sort of questions they asked me when they fired me.
So, you know, this was a bit over a month ago. I was pulled aside for a chat with a lawyer that quickly turned very adversarial.
And, you know, the questions were all about my views on AI progress, on AGI, on the level of security appropriate for AGI, on, you know, whether government should be involved in AGI, on, you know, whether I and Superalignment were loyal to the company, on, you know, what I was up to during the OpenAI board events, you know, things like that. And, you know, then they, you know, chatted to a couple of my colleagues and then they came back and told me I was fired.
And, you know, they'd gone through all of my digital artifacts from the time at my, you know, time at OpenAI, you know, messages, docs. And that's when they found, you know, the leak.
Yeah. And so anyway, so the main claim they made was this leaking allegation.
You know, that's what they told employees. They, you know, the security memo.
There's a couple other allegations they threw in. One thing they said was that I was unforthcoming during the investigation because I didn't initially remember who I'd shared the doc with, the sort of preparedness brainstorming doc, only that I had sort of spoken to some external researchers about these ideas.
And, you know, look, the doc was over six months old. You know, I'd spent a day on it.
You know, it was a Google doc I shared with my OpenAI email. It wasn't a screenshot or anything I was trying to hide.
It simply didn't stick because it was such a non-issue. And then they also claimed that I was engaging on policy in a way that they didn't like.
And so what they cited there was that I had spoken to a couple of external researchers, you know, somebody at a think tank, about my view that AGI would become a government project, you know, as we discussed.
In fact, I was speaking to lots of people in the field about that at the time. I thought it was a really important thing to think about.
Anyway, and so they found a DM that I'd written to a friendly colleague five or six months ago where I relayed this, and they cited that. And I had thought it was well within OpenAI norms to kind of talk about high-level issues on the future of AGI with external people in the field.
So anyway, so that's what they alleged. That's what happened.
You know, I've spoken to kind of a few dozen former colleagues about this since, and I think the sort of universal reaction is kind of like, you know, that's insane. I was sort of surprised as well.
You know, I had been promoted just a few months before. I think, you know, I think Ilya's comment for the promotion case at the time was something like, you know, Leopold's amazing.
We're lucky to have him. But look, I mean, I think the thing I understand and I think in some sense is reasonable is like, you know, I think I ruffled some feathers and, you know, I think I was I was probably kind of annoying at times.
You know, it's like the security stuff: I kind of repeatedly raised that, and maybe not always in the most diplomatic way. You know, I didn't sign the employee letter during the board events, despite pressure to do so.
And you were one of like eight people or something? Not that many people.
I think the two senior-most people who didn't sign were Andrej and... yeah, and they've since left. And, you know, I mean, on the letter, by the way: by the time that letter was going around on Monday morning, I think probably it was appropriate for the board to resign.
I think they kind of like lost too much credibility and trust with the employees. But I thought the letter had a bunch of issues.
I mean, I think one of them was it just didn't call for an independent board. I think it's sort of like basics of corporate governance to have an independent board.
Anyway, you know, there were other things. You know, in sort of other discussions,
I pressed leadership for OpenAI to abide by its public commitments. You know, I raised a bunch of tough questions about whether it was consistent with the OpenAI mission and consistent with the national interest to sort of partner with authoritarian dictatorships to build the core infrastructure for AGI.
So, you know, look, you know, it's a free country, right? That's what I love about this country. You know, we talked about it.
And so they have no obligation to keep me on staff. And, you know, I think in some sense it would have been perfectly reasonable for them to come to me and say: look, we're taking the company in a different direction, we disagree with your point of view, we don't trust you enough to toe the company line anymore, and thank you so much for your work at OpenAI, but I think it's time to part ways.
I think that would have made sense. I think, you know, we did start sort of materially diverging on sort of views on important issues.
I'd come in very excited and aligned with OpenAI, but that sort of changed over time. And look, I think there would have been a very amicable way to part ways.
And I think it's a bit of a shame that this is the way it went. All that being said, I think I really want to emphasize there are just a lot of really incredible people at OpenAI, and it was an incredible privilege to work with them.
And overall, I'm just extremely grateful for my time there. When you left, now there's been reporting about an NDA that former employees have to sign in order to have access to their vested equity.
Did you sign such an NDA?
No. My situation was a little different in that I was basically right before my cliff.
But then they still offered me the equity, but I didn't want to sign the non-disparagement. You know, freedom is priceless.
And how much was the equity?
Like, close to a million dollars.
So it was definitely a thing you and others were aware of, that this is a choice that OpenAI is explicitly offering you?
Yeah.
And presumably the people on OpenAI staff knew that they're being offered equity, but they had to sign this NDA that has these conditions that you can't, for example, give the kind of statements about your thoughts on AGI and OpenAI that you're giving on this podcast right now.
Look, I don't know what the whole situation is.
I certainly think sort of vested equity is pretty rough if you're conditioning that onto an NDA. It might be a somewhat different situation if it's a sort of severance agreement.
Right. But an OpenAI employee who had signed it presumably could not give the podcast that you're giving today.
Quite plausibly not. Yeah.
I don't know. Okay.
So analyzing the situation here, I guess if you were to, yeah, the board thing is really tough because if you were trying to defend them, you would say, well, listen, you were just kind of going outside the regular chain of command, and maybe there's a point there. Although, the way in which the person from HR thinks that you have an adversarial relationship with...
Or you're supposed to have an adversarial relationship with the board, where to give the board some information which is relevant to whether OpenAI is fulfilling its mission and whether it can do that in a better way is part of the leak, as if the board is, that is supposed to ensure that OpenAI is following its mission is some sort of external actor. That seems pretty...
I mean, to be clear, the leak allegation was just that document I shared for feedback. This is just sort of a separate thing that they cited, and they said I wouldn't have been fired if not for the security memo.
They said you wouldn't have been fired...
The reason this is a firing and not a warning is because of the warning you had gotten for the security memo.
Oh. Before you left, the incidents with the board happened, where Sam was fired and then rehired as CEO, and now he's on the board.
Now, Ilya and Jan, who are the heads of the Super Alignment team, and Ilya, who is a co-founder of OpenAI, obviously the most significant in terms of stature, a member of OpenAI from a research perspective, they've left. Seems like, especially with regards to Super Alignment stuff and just generally with OpenAI, a lot of this sort of personnel drama has happened over the last few months.
What's going on? Yeah, there's a lot of drama about OpenAI. Yeah, so why is there so much drama? You know, I think there would be a lot less drama if all OpenAI claimed to be was sort of building ChatGPT or building business software.
I think where a lot of the drama comes from is, you know, OpenAI really believes they're building AGI, right? And it's not just, you know, a claim that you make for marketing purposes, you know, whatever. You know, there's this report that Sam is raising, you know, $7 trillion for chips.
And it's like that stuff only makes sense if you really believe in AGI. And so I think what gets people sometimes is sort of the cognitive dissonance between sort of really believing in AGI, but then sort of not taking some of the other implications seriously.
You know, this is going to be incredibly powerful technology, both for good and for bad. And that implicates really important issues like the national security issues we spoke about, like, you know, are you protecting the secrets from the CCP? Like, you know, does America control the core AGI infrastructure or does it, you know, a Middle Eastern dictator control the core AGI infrastructure? And then, I mean, I think the thing that, you know, really gets people is the sort of tendency to kind of then make commitments and sort of like, you know, they say they take these issues really seriously.
They make big commitments on them, but then sort of frequently don't follow through. Right.
So, you know, again, as mentioned, there was this commitment around super alignment compute, you know, sort of 20% of compute for this long-term safety research effort. And I think, you know, you and I could have a totally reasonable debate about what is the appropriate level of compute for super alignment.
But that's not really the issue. The issue is that this commitment was made and it was used to recruit people.
And, you know, it was, it was very public. Um, and it was made because, you know, there's a recognition that there would always be something more urgent than a long-term safety research effort, you know, like some new product or whatever.
And, um, but then in fact, they just, you know, really didn't keep the commitment. And so, you know, there was always something more urgent than long-term safety research.
I mean, I think, I think another example of this is, um, you know, when I raised these issues about security, you know, they would tell me, you know, securities are number one priority. But then, you know, invariably, when it came time to sort of invest serious resources, when it came time to make trade-offs, to sort of take some pretty basic measures, security would not be prioritized.
And so, yeah, I think it's the cognitive dissonance, and I think it's the sort of unreliability, that causes a bunch of the drama.
So let's zoom out and talk about a big part of the story. A big motivation of the way in which we must proceed with regards to geopolitics and everything is that once you have the AGI, pretty soon after you proceed to ASI,
to superintelligence, because you have these AGIs, which can function as researchers into further AI progress. And within a matter of years, maybe less, you go to something that is like superintelligence.
And then from there, then you do, according to your story, do all this research and development into robotics and pocket nukes and whatever other crazy shit. Yeah.
But at a high level... Good stuff.
Okay. But there's, I'm skeptical of this story for many reasons.
Yes. At a high level, it's not clear to me this input-output model of research is how things actually happen in research.
We can look economy-wide, right? Patrick Collison and others have made this point that compared to 100 years ago, we have 100x more researchers in the world. It's not like progress is happening 100x faster.
So it's clearly not the case that you can just pump more population into research and get more research out the other end. I don't know why it would be different for the AI researchers themselves.
Okay, great. So this is getting into some good stuff. This is a classic disagreement I have with Patrick and others. All right, so, you know, obviously inputs matter, right? The United States produces a lot more scientific and technological progress than, you know, Liechtenstein, right? Or Switzerland. And even if I made Patrick Collison dictator of Liechtenstein or Switzerland, and Patrick Collison was able to implement his utopia of ideal institutions, keeping the talent pool fixed.
He's not able to do some crazy high-skilled immigration thing or, you know, some crazy genetic breeding scheme or whatever he wants to do. Keeping the talent pool fixed, but amazing institutions.
I claim that still, even if you made Patrick Collison dictator of Switzerland, maybe you get some factor, but Switzerland is not going to be able to outcompete the United States in scientific and technological progress. Obviously, magnitudes matter.
Okay. No, actually, I'm not sure I agree with this.
There's been many examples in history where you have small groups of people who are part of like Bell Labs or Skunk Works or something. There's a couple hundred researchers.
Open AI, right? A couple hundred researchers. They do make...
Highly selected though, right? You know, it's like saying, you know, the 500,000... That's part of why Patrick Collison as a dictator is going to do a good job of this.
Well, yes. If you can highly select all the best AI researchers in the world, you might only need a few hundred.
But if you... You know, that's the talent pool.
It's like you have the, you know, 300 best AI researchers in the world. But there's, there has been, it's not the case that from 100 years to now, there haven't been, the population has increased massively.
A lot of the, in fact, you would expect the density of talent to have increased in the sense that malnutrition and other kinds of poverty, whatever, that have debilitated past talent at the same sort of level is no longer debilitated. Yeah.
So to the 100x point, right? I don't know if it's 100x, I think it's easy to inflate these things, but probably at least 10x. And so people are sometimes like, ah, you know, come on, ideas haven't gotten that much harder to find. Why would you have needed this 10x increase in research effort? Whereas to me, I think this is an extremely natural story.
And why is it a natural story? It's a straight line on a log-log plot. This is sort of a deep learning researcher's dream, right? What is this log-log plot? On the x-axis, you have log cumulative research effort.
On the y-axis, you have log GDP, or OOMs of algorithmic progress, or log transistors per square inch, or, for the sort of experience curve for solar, the log of the price for a gigawatt of solar. It's extremely natural for that to be a straight line. It's a classic. Basically, the first thing is very easy; then you have to have log increments of cumulative research effort to find the next thing. And so in some sense I think this is a natural story. Now, one objection people then make is: oh, isn't it suspicious that we increased research effort 10x and ideas also just got exactly 10x harder to find?
And so it perfectly, you know, equilibrates. And to that I say, it's just an equilibrium.
It's an endogenous equilibrium, right? It's like saying, isn't it a coincidence that supply equals demand and the market clears? Right. And it's the same thing here.
Right.
So, you know, how much ideas have gotten harder to find is a function
of how much progress you've made.
And then, you know, what the overall growth rate has been is a function of how much ideas
have gotten harder to find in ratio to how much you've been able to increase research
effort.
What is the sort of growth of research effort?
So in some sense, I think the story is sort of like fairly natural.
And you see this and you see this not just economy wide. You see it in kind of experience curve for all sorts of individual technologies.
So I think there's some process like this. I think it's totally plausible that, you know, institutions have gotten worse by some factor.
Obviously, there's some sort of exponent of diminishing returns on more people, right? So like serial time is better than just parallelizing. But still, I think it's like clearly inputs matter.
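A minimal numerical sketch of the straight-line-on-a-log-log-plot story described above, with made-up numbers; the elasticity and effort values here are purely illustrative, not figures from the series.

```python
import numpy as np

# Toy "ideas get harder to find" model: progress is a power law in cumulative
# research effort, i.e. a straight line on a log-log plot.
gamma = 0.3                        # assumed elasticity, chosen only for illustration
effort = np.logspace(0, 4, 50)     # cumulative research effort, 1x up to 10,000x
progress = effort ** gamma         # log(progress) = gamma * log(effort)

# Each successive increment of progress needs multiplicatively more effort:
# a 10x rise in cumulative effort buys only 10**gamma (about 2x) more progress,
# so measured research productivity falls even as total progress keeps climbing.
slopes = np.diff(np.log(progress)) / np.diff(np.log(effort))
print(np.allclose(slopes, gamma))  # True: constant slope on the log-log plot
```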
Yeah, I agree. But if the coefficient of how fast they diminish as you grow, the input is high enough, then in the abstract, the fact that inputs matter isn't that relevant.
Okay. So, I mean, we're talking to a very high level, but just like take it down to the actual concrete thing here.
OpenAI has a staff of at most low hundreds who are directly involved in the algorithmic progress in future models. If it was really the case that you could just arbitrarily scale this number and you could have much faster algorithmic progress and that would result in much better AIs for OpenAI, basically, then it's not clear why OpenAI doesn't just go out and hire every single person with 150 IQ, of which there are hundreds of thousands in the world.
And my story there is that there are transaction costs to managing all these people that don't just go away if you have a bunch of AIs, and that these tasks aren't easy to parallelize. And I'm not sure how you would explain the fact that... like, why doesn't OpenAI go on a recruiting binge of every single genius in the world?
All right, great.
So let's talk about the OpenAI example, and let's talk about the automated AI researchers. I mean, in the OpenAI case, just kind of look at the inflation of AI researcher salaries over the last year.
I mean, I think like, I don't know, I don't know what it is, you know, 4x, 5X. It's kind of crazy.
So they're clearly really trying to recruit the best AI researchers in the world. And, you know, I don't know.
They do find the best AI researchers in the world. And I think my response to your thing is like, you know, almost all of these 150 IQ people, you know, if you just hire them tomorrow, they wouldn't be good AI researchers.
They wouldn't be an Alec Radford.
But they're willing to make investments that take years to pan out.
The data centers they're buying right now will come online in 2026 or something.
Why wouldn't they be able to make every 150 IQ person?
Some of them won't work out.
Some of them won't have the traits we like.
But some of them by 2026 will be amazing AI researchers.
Why aren't they making that bet?
Yeah.
And so sometimes this happens, right?
Like smart physicists have been really good at AI research.
You know, it's like all the anthropic co-founders.
But like if you talk to, I had Dario on the podcast. They have this very careful policy of like we're not going to just hire arbitrarily.
We're going to be extremely selective. Yep.
So training is not as easily scalable, right? So training is very hard. You know, if you just hired, you know, a hundred thousand people, it's like, I mean, you couldn't train them all.
It'd be really hard to train them all. You know, you wouldn't be doing any AI research.
Like, you know, there's, costs to bringing on a new person and training them. This is very different with AIs, right? And I think this is, it's really important to talk about the sort of like advantages the AIs will have.
So it's like, you know, training, right? It's like, what does it take to be an Alec Radford? You know, you need to be a really good engineer, right? The AIs, they're going to be an amazing engineer. They're going to be amazing at coding.
You can just train them to do that. They need to have, you know, not just be a good engineer, but have really good research intuitions and like really understand deep learning.
And this is stuff that, you know, Alec Radford or somebody like him has acquired over years of research, over just being deeply immersed in deep learning, having tried lots of things himself and failed. The AIs, you know, they're going to be able to read every research paper ever written, every experiment ever run at the lab, gain the intuitions from all of this.
They're going to be able to learn in parallel from all of each other's experiences. I don't know what else... you know, it's like, what does it take to be an Alec Radford? Well, there's a sort of cultural acclimation aspect of it, right? If you hire somebody new and there's politicking, maybe they don't fit in well. In the AI case, you just make replicas, right? There's a motivation aspect to it, right? So it's like, suppose you could just duplicate Alec Radford, and before I run every experiment, I have him spend, you know, a decade's worth of human time double-checking the code and thinking really carefully about it. I mean, first of all, I don't have that many Alec Radfords, and, you know, he wouldn't care, and he would not be motivated.
But, you know, with the AIs, you can just be like: look, I have 100 million of you guys, I'm just going to put you on really making sure this code is correct, there are no bugs, this experiment is thought through, every hyperparameter is correct. Final thing I'll say is, the 100 million human-equivalent AI researchers, that is just a way to visualize it. That doesn't mean you're going to have literally 100 million copies. There are trade-offs you can make between serial speed and parallelism, so you might make the trade-off: look, we're going to run them at, you know, 10x, 100x serial speed. It's going to result in fewer tokens overall because of sort of inherent trade-offs, but then we have, I don't know what the numbers would be, you know, a hundred thousand of them running at 100x human speed and thinking. And there's other things you can do on coordination: you can kind of share latent space, attend to each other's context.
There's basically this huge range of possibilities of things you can do. The 100 million thing is more... I mean, another illustration of this is, I run the math in my series, and it's basically: in 2027, 2028, you have this automated AI researcher, and you're going to be able to generate an entire internet's worth of tokens every single day. So it's clearly a huge amount of intellectual work that you can do.
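A rough back-of-envelope version of the human-equivalent arithmetic gestured at here, in the spirit of the "100 tokens a minute" framing that comes up below. Every input is an assumption picked for illustration, not a figure from Situational Awareness.

```python
# Back-of-envelope "human-equivalent AI researcher" arithmetic.
# All inputs below are illustrative assumptions, not the series' actual numbers.
tokens_per_minute = 100                 # one GPT-N token ~ one token of internal monologue
working_hours_per_day = 8
human_tokens_per_day = tokens_per_minute * 60 * working_hours_per_day   # 48,000

fleet_tokens_per_day = 5e12             # assumed inference budget: 5 trillion tokens/day
human_equivalents = fleet_tokens_per_day / human_tokens_per_day
print(f"{human_equivalents:,.0f} human-researcher-equivalents")          # ~100 million

internet_tokens = 5e12                  # assumed size of a web-scale text corpus
print(f"{fleet_tokens_per_day / internet_tokens:.1f} internets' worth of tokens per day")
```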
I think the analogous thing there is that today we generate more patents in a year than were generated across, like, half a century during the actual physics revolution in the early 20th century.
And are you making more physics progress in a year today than we were? So, yeah, you're going to generate all these tokens. Are you generating as much codified knowledge as humanity has been able to generate in the initial creation of the internet? No, no, probably not.
Internet tokens are usually final output, right? Right. A lot of these tokens, if we talked about the unhobbling, right? Right.
I think of a kind of like, you know, a GPT-N token is sort of like one token of my internal monologue. Yeah.
Right. And so that's how I do this math on human equivalents.
You know, it's like 100 tokens a minute, and then, you know, humans working for X hours, and what is the equivalent there?
I think this goes back to something we were talking about earlier, about why we haven't seen the huge revenues. People often ask this question: if you took GPT-4 back 10 years and you showed people this, they'd think this has already automated half the jobs.
And so there's a sort of modus ponens, modus tollens here, where part of the explanation is like, oh, it's just on the verge, you need to do these unhobblings. And part of that is probably true.
But there is another lesson to learn there, which is that just looking at face value at a set of abilities, there's probably more sort of hobblings that you don't realize that are hidden behind the scenes. I think the same will be true of the AGI that you have running as AI researchers.
I think a lot of things... I basically agree, right? I think my story here is, like I talk about, I think there's going to be some long tail, right? And so maybe it's like, you know, 2026, 2027, you have the proto automated engineer, and it's really good at engineering.
It doesn't have the research intuition yet. You don't quite know how to put them to work.
But, you know, even the underlying pace of AI progress is already so fast, right? In three years, from not being able to do any kind of math at all to now crushing these math competitions. And so you have the initial thing in, like, 2026, 2027; maybe it's an automated research engineer, it speeds you up by 2x, and you go through a lot more progress in that year.
By the end of the year, you've figured out the remaining unhobblings, you've got a smarter model. And then that thing, or maybe it's two years, you know, that thing really can automate 100 percent. And again, they don't need to be doing everything. They don't need to be making coffee. Maybe there's a bunch of tacit knowledge in a bunch of other fields, but AI researchers at AI labs really know the job of an AI researcher. And in some sense, there are lots of clear metrics.
It's all virtual. There's code.
It's things you can kind of develop and train for. So, I mean, another thing is how do you actually manage a million AI researchers? Humans, the sort of comparative ability we have that we've been especially trained for is like working in teams.
And despite this fact, we have for thousands of years, we've been learning about how we work together in groups. And despite this, management is a clusterfuck, right? It's like most companies are badly managed.
It's really hard to do this stuff. For AIs, the sort of like, we talk about AGI, but it'll be some bespoke set of abilities, some of which will be higher than humans, some of which will be at human level.
And so it'll be some bundle, and we'll need to figure out how to put these bundles together with their human overseers, with the equipment and everything. And the idea that as soon as you get the bundle, you'll figure out how to get mil...
like just shove millions of them together and manage them. I'm just very skeptical of.
Like any other revolution, technological revolution in history has been very piecemeal. Much more piecemeal than you would expect on paper if you just thought about what is the industrial revolution.
Well, we dig up coal that powers the steam engines. You use the steam engines to run these railroads that helps us get more coal out.
And there's sort of a Factorio story you can tell where in, like, six hours you can be pumping thousands of times more coal, but in real life it often takes centuries. Right.
In fact, with electrification, there's this famous study about how it took decades after electricity for factories to change from the pulley and water-wheel based system we had for steam engines to one that works with more spread-out electrical motors and everything. I think this will be the same kind of thing.
It might take, like, decades to actually get millions of AI researchers to work together.
Okay, great, this is great. Okay, so a few responses to that. First of all, I totally agree with the kind of real-world-bottlenecks type of thing. I think it's easy to underrate. Basically, what we're doing is we're removing the labor constraint: we automate labor and we kind of explode technology. But there's still lots of other bottlenecks in the world. And so I think this is part of why the story is it starts pretty narrow, at the thing where you don't have these bottlenecks.
And then only over time, as we let it,
it kind of expands to sort of broader areas.
This is part of why I think it's like initially
this sort of AI research explosion, right?
It's like AI research doesn't run
into these real world bottlenecks.
It doesn't require, you know, like plow a field
or dig up coal.
It's just, you're just doing AI research.
The other thing, you know, the other thing about-
I love how like in your model, AI research,
it's not complicated like flipping a burger. It's doing AI research.
Well, no, but it's, I mean, this is because people make these arguments of like, oh, you know, AGI won't do anything because it can't flip a burger. I'm like, yeah, it won't be able to flip a burger, but it's gonna be able to do algorithmic progress, you know? And then when it does algorithmic progress, it'll figure out how to flip a burger, you know? And then we'll have a burger flipping robot.
You know, look, the other thing is, again, the sort of quantities are a lower bound, right? We can definitely run a hundred million of these. Probably one of the first things we're gonna try to figure out is how to translate quantity into quality, right? And so even at the baseline rate of progress, you're quickly getting smarter and smarter systems, right? If we said it was, you know, four years between the preschooler and the high schooler, right? So I think pretty quickly there's probably some simple algorithmic changes you find, you know, if instead of one Alec Radford you have 100. You don't even need 100 million.
And then, and then you get even smarter systems. And now these systems are, you know, they're capable of sort of creative, complicated behavior.
You don't understand. Maybe there's some way to like use all this test time compute in a more unified way rather than all these parallel copies.
And, you know, so they won't just be quantitatively superhuman. They'll pretty quickly become kind of qualitatively superhuman.
You know, it's sort of like you're a high school student, you're trying to wrap your mind around kind of standard physics.
And then there's some like super smart professor who is like, quantum physics, it all makes sense to him. And you're just like, what is going on? And sort of, I think pretty quickly, you kind of enter that regime, just given even the underlying pace of AI progress, but even more quickly than that, because you have this sort of accelerated force of now this automated AI research.
I agree that over time you would. I'm not denying that ASI is possible. I'm just asking, how is this happening in a year?
Okay, first of all, I think the story is basically a little bit more continuous, right? Like I think already, as I talked about, in '25, '26 you're basically gonna have models as good as a college graduate, and I don't know where the unhobbling is going to be, but I think it's possible that even then you have kind of the proto-automated engineer. So I think there is a bit of a smear, kind of an AGI smear or whatever, where there are unhobblings you're missing, there are ways of connecting them you're missing, there's some level of intelligence you're missing. But at some point you are going to get the thing that is the 100% automated Alec Radford, and once you have that, things really take off. Yeah.
Okay. So let's go back to the unhobblings.
Yeah. Is there, we're going to get a bunch of models by the end of the year.
Is there something, let's suppose we didn't get some capacity by the end of the year. Yeah.
Is there some such capacity which, if lacking, would suggest that AGI is going to take longer than you are projecting? Yeah. I mean, I think there's two kind of key things.
There's the unhobbling and there's the data wall, right?
I think we should talk about the data wall for a moment.
I think the data wall is, you know, even though all this stuff has been about crazy AI progress, actually sort of underrated. I think there's a real scenario where we're just stagnant. Because we've been riding this tailwind of: it's really easy to bootstrap, you just do unsupervised learning, next-token prediction, it learns these amazing world models, bam, great model. And you just gotta buy some more compute, do some simple efficiency changes. And again, so much of deep learning, all these big gains on efficiency, have been pretty dumb things, right? You add a normalization layer, you fix the scaling laws, and these have already been huge things, let alone the obvious ways in which these models aren't good. Anyway, so data wall, big deal.
You know, to put some numbers on this: Common Crawl, what's online, is like 30 trillion tokens, and Llama 3 was trained on 15 trillion tokens. So you're basically already using all the data.
And then, you know, you can get somewhat further by repeating it. So there's an academic paper by, you know, Boaz Barak and some others that does scaling laws for this.
And they're basically like, yeah, you can repeat it some, but after 16 times of repetition the returns basically go to zero. You're just completely screwed.
And so, I don't know, say you can get another 10x on data beyond, say, Llama 3 and GPT-4. You know, Llama 3 is already kind of at the limit of all the data; maybe you can get 10x more by repeating data. So maybe that's at most a 100x-better model than GPT-4, and 100x effective compute from GPT-4 is not that much. If you do half an order of magnitude a year of compute and half an order of magnitude a year of algorithmic progress, that's kind of like two years from GPT-4.
So, you know, GPT-4 finished pre-training in '22, so that's like '24. So I think one thing that really matters, which I think we won't quite know by the end of the year, but by '25, '26: are we cracking the data wall?
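To make that back-of-the-envelope concrete, here is a rough sketch of the arithmetic being described; the token counts, the ~16-repetition cap, and the OOM-per-year rates are just the round numbers quoted in the conversation, not precise estimates.

```python
import math

# Round numbers quoted above (order-of-magnitude only).
web_tokens = 30e12        # ~30T usable tokens online (Common Crawl scale)
trained_tokens = 15e12    # Llama-3-class models already train on ~15T tokens
data_headroom = 10        # "maybe you can get 10x more" via repetition (returns
                          # are said to die off after ~16 epochs)

# Under compute-optimal scaling, training compute grows roughly with the square
# of the data, so ~10x more effective data supports ~100x more effective compute.
effective_compute_gain = data_headroom ** 2        # ~100x over GPT-4

# At ~0.5 OOM/year of physical compute plus ~0.5 OOM/year of algorithmic
# progress, you burn ~1 OOM of effective compute per year.
ooms_available = math.log10(effective_compute_gain)   # ~2 OOMs
years_of_runway = ooms_available / 1.0                 # ~2 years past GPT-4

print(f"~{ooms_available:.0f} OOMs of headroom, ~{years_of_runway:.0f} years at 1 OOM/year")
```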
Okay, so suppose we had three orders of magnitude less data in Common Crawl and on the internet than we just happen to have now. For decades, with the internet and other things, we've been rapidly increasing the stock of data that humanity has. Is it your view that, for contingent reasons, we just happen to have enough data to train models that are just powerful enough, at the 4.5 level, where they can kick off the self-play RL loop? Or is it that, if it had been three OOMs higher, then progress would have been slightly faster? In that world, we would have been looking back like, oh, how hard it would have been to kick off the RL explosion with just 4.5, but we would have figured it out.
And then so in this world, we would have gotten to GPT-3, and then we'd have to kick off some sort of RL explosion. But we would have still figured it out.
The sort of... we didn't just luck out on the amount of data we happen to have in the world. I mean, three OOMs is pretty rough, right? Like three OOMs less data means like a six-OOMs-smaller, six-OOMs-less-compute model under Chinchilla scaling laws. You know, that's basically capping out at like GPT-2. So I think that would be really rough.
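For reference, a minimal sketch of the Chinchilla relation being invoked: under compute-optimal training, parameters and data scale together, so training compute goes roughly with the square of the data. This illustrates the scaling relation only; it is not a claim about any particular model.

```python
# Chinchilla-optimal: params N and data D scale together, and training compute
# C ~ 6*N*D, so C scales roughly as D^2. Losing k OOMs of data therefore costs
# about 2k OOMs of compute-optimal training compute.
def compute_ooms_lost(data_ooms_lost: float) -> float:
    return 2.0 * data_ooms_lost

print(compute_ooms_lost(3.0))  # 3 OOMs less data -> ~6 OOMs less compute,
                               # i.e. roughly a GPT-2-scale model, per the point above
```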
I think you do make an interesting point about the contingency. You know, I guess earlier we were talking about this sort of like when in the sort of human trajectory are you able to learn from yourself? And so, you know, if we go with that analogy again, like if you'd only gotten the preschooler model, it can't learn from itself.
You know, if you'd only gotten the elementary schooler model, it can't learn from itself. And, you know, maybe GPT-4, the smart high schooler, is really where it starts.
Ideally you have a somewhat better model, but then it really is able to learn from itself or learn by itself. So yeah, I think there's an interesting point there. I think maybe with one OOM less data I would be more iffy, but it's maybe still doable. I think I'd feel chiller if we had, you know, one or two more.
It would be an interesting exercise to get probability distributions on how contingent AGI is across data. Okay, the thing that makes me skeptical of this story is that... it totally makes sense why pre-training works so well.
Yeah. These other things, they're stories of, in principle, why they ought to work.
Yes. Like, humans can learn this way and so on.
Yes. And maybe they're true.
Uh-huh. But I worry that a lot of this case is based on sort of first principles, evaluation of how learning happens, that fundamentally we don't understand how humans learn.
And maybe there's some key thing we're missing. On the sort of sample efficiency, yeah, humans actually, maybe there's, you say, well, the fact that these things are way less sample efficient in terms of learning than humans are suggests that there's a lot of room for improvement.
Another perspective is that we are just on the wrong path altogether, right?
That's why they're so sample inefficient when it comes to pre-training.
Yeah.
So, yeah, I'm just like, there's a lot of first-principles arguments stacked on top of each other, where you get these unhobblings and then you get to AGI. Then, because of these reasons why you can stack all these things on top of each other, you get to ASI. And I'm worried that there's too many steps of this sort of first-principles thinking.
I mean, we'll see, right? I mean, on the sort of sample efficiency thing, again, sort of first principles, but I think, again, there's this clear sort of missing middle. And so, you know, and sort of like, you know, people hadn't been trying.
Now people are really trying, you know, and so it's sort of, you know, I think often again in deep learning something like the obvious thing works. And there's a lot of details to get right.
So it might take some time, but it's now people are really trying. So I think we get a lot of signal in the next couple of years, you know, on hobbling.
I mean, what is the signal on unhobbling that would be interesting? I think the question is basically: are you making progress on this test-time compute thing? Is this thing able to think over a longer horizon than just a couple hundred tokens, which is what chain of thought unlocked?
And on that point in particular, many of the people with longer timelines who have come on the podcast have made the point that with training this long-horizon RL, it's not... I mean, we were talking earlier about how they can think for five minutes but not for longer.
But it's not because they can't physically output an hour's worth of tokens. It's just really, at least from what I understand, what they say.
Right, like even Gemini has like a million tokens of context, and the million of context is actually great for consumption. And it solves one important unhobbling, which is the sort of onboarding problem, right? Which is, you know, a new coworker, in their first five minutes, like a new smart high school intern, first five minutes, not useful at all.
A month in, you know, much more useful, right? Because they've like looked at the monorepo and understand how the code works and they've read your internal docs. And so being able to put that in context, great, solves this onboarding problem.
Yeah, but they're not good at sort of the production of a million tokens.
Yeah, right, but on the production of a million tokens...
There's no public evidence that there's some easy loss function where you can...
GPT-4 has gotten a lot better since launch. Actually, the GPT-4 gains since launch I think are a huge indicator. You talked about this with John on the podcast; John said this was mostly post-training gains, right? If you look at the LMSYS scores, it's like 100 Elo or something, a bigger gap than between Claude 3 Opus and Claude 3 Haiku, and the price difference between those is 60x.
But it's not more agentic. It's better in the same chat, at math, right? It went from like 40 percent... The crux is whether...
No, but I think it indicates that clearly there's stuff to be done on unhobbling. I think the interesting question is: this time a year from now, is there a model that is able to think for a few thousand tokens coherently, cohesively, agentically? And again, this is where I'd feel better if we had an OOM or two more data, because the scaling just gives you this sort of tailwind, right? Where, for example, tools: talking to people who try to make things work with tools, GPT-4 is really when tools start to work. You can kind of make them work with GPT-3.5, but it's just really tough.
And so just having GPT-4, you can kind of help it learn tools in a much easier way. And so, just a bit more tailwind from scaling.
And then, yeah.
And I don't know if it'll work, but it's a key question.
Oh, yeah.
I think it's a good place to sort of close that part where we know what the crux is and what the progress, what evidence of that would look like.
On the AGI to superintelligence part, maybe it's the case that the gains are really easy right now and you can just let loose an Alec Radford, give him a compute budget, and he comes out the other end with something additive: change this part of the code, here's a compute multiplier, change this other part.
What about other parts of the world? Okay, maybe here's an interesting way to ask this. Yes.
Yeah. How many other domains in the world are like this, where you think you could get the equivalent of, in one year, you just throw enough intelligence across multiple instances, and you just come out the other end with something that is remarkably decades, centuries ahead? Yeah.
Like, you start off with no flight, you're the Wright brothers, a million instances of GPT-6, and you come out the other end with Starlink. Yeah.
Is that your model of how things work? I think you're exaggerating the timelines a little bit, but decades' worth of progress in a year or something, I think that's a reasonable prompt. So I think this is where the sort of automated AI researcher comes in, because it gives you this enormous tailwind on all the other stuff, right? You automate AI research with your automated Alec Radfords, you come out the other end, you've done another five OOMs, you have a thing that is vastly smarter.
Not only is it vastly smarter, you've been able to make it good at everything else, right?
The robots are important, because for a lot of other things you do actually need to try things in the physical world.
I don't know. Maybe you can do a lot in simulation.
Those are the really quick worlds. I don't know if you saw the last NVIDIA GTC and it was all about the digital twins and just having all your manufacturing processes and simulation.
I don't know. Again, if you have these superintelligent cognitive workers, can they just make simulations of everything, kind of AlphaFold-style, and then make a lot of progress in simulation possible?
But I also just think you're going to get the robots. Again, I agree about like, there are a lot of real world bottlenecks, right.
And so, I don't know, it's quite possible that we're going to have crazy drone swarms, but also lawyers and doctors still need to be humans because of regulation. But I think you kind of start narrowly, you broaden, and then there are the worlds in which you let them loose, which, again, because of these competitive pressures, we will have to do to some degree on various national security applications.
I think quite rapid progress is possible. The other thing, though, is that in the sort of explosion afterward, there are kind of two components. There's the A, right, in the production function, the growth of technology, and that's massively accelerated by this.
Now you have a billion superintelligent scientists and engineers and technicians, superbly competent at everything. You also just automated labor, right? And so even without the whole technological-explosion thing, you have this industrial explosion, at least if you let them loose, which is like: now you can just build. You can cover Nevada, and you start with one robot factory that's producing more robots.
And basically this like just the cumulative process because you've taken labor out of the equation. Yeah, that's super interesting.
Although when you increase the K or the L without increasing the A, you can look at the Soviet Union or China, where they rapidly increased inputs. And that does have the effect of being geopolitically game-changing. It is remarkable: you go to Shanghai, and they built these crazy cities in a decade, a couple of decades, right? Right, right. I mean, the closest thing, people talk about 30% growth rates or whatever.
Asian Tigers, 10% a year. It's totally possible.
But without productivity gains, it's not like the Industrial Revolution, where from the perspective of you're looking at a system from the outside, your goods have gotten cheaper, they can manufacture more things. But it's not like the next century is coming at you.
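A toy version of the distinction being drawn here, using a standard Cobb-Douglas production function; the exponent and the 10x numbers are made up purely for illustration.

```python
# Toy Cobb-Douglas: Y = A * K^alpha * L^(1-alpha).
# Growing inputs (K, L) alone gives big but linear gains; growing A (technology)
# multiplies everything and compounds into a different trajectory.
def output(A: float, K: float, L: float, alpha: float = 0.3) -> float:
    return A * (K ** alpha) * (L ** (1 - alpha))

base = output(A=1.0, K=1.0, L=1.0)
inputs_only = output(A=1.0, K=10.0, L=10.0)   # input mobilization: 10x K and 10x L
with_tech = output(A=10.0, K=10.0, L=10.0)    # same inputs, plus 10x technology

print(round(inputs_only / base, 2))  # ~10x: impressive, but linear in the piled-up inputs
print(round(with_tech / base, 2))    # ~100x: the A term multiplies everything
```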
Yeah, it's both. It's both.
So it's, you know, both that are important. The other thing I'll say is like all this stuff, I think the magnitudes are really, really important, right? So, you know, we talked about a 10x of research effort or maybe 10, 30x over a decade.
You know, even without any kind of self-improvement type loop, even in the sort of GPT-4-to-AGI story, we're talking about an order of magnitude of effective compute increase a year, right? Half an order of magnitude of compute, half an order of magnitude of algorithmic progress, and that translates into effective compute. And so you're doing a 10x a year, basically, on your labor force, right? So it's a radically different world if you're doing a 10x or 30x in a century versus a 10x a year on your labor force.
So the magnitudes really matter. They also really matter on the sort of intelligence explosion, right? So like just the automated AI research part.
So, you know, one story you could tell there is like, well, ideas get harder to find, right? Algorithmic progress is going to get harder. Yeah, right now you have the easy wins, but in like four or five years, there'll be fewer easy wins.
And so the sort of automated AI researchers are just going to be what's necessary to just keep it going, right? Because it's gotten harder. But that's sort of, it's like a really weird knife-edge assumption in economics where you assume it's just enough.
But isn't that the equilibrium story you were just telling with why the economy as a whole has 2% economic growth? Because you just proceeded on the equilibrium. I guess you were saying by the time you get to the equilibrium.
But the result of the equilibrium here is it's like way faster. At least, you know, and it's at least, and it depends on the sort of exponents, but it's basically, it's the increase is like, suppose you need to like 10x effective research effort in AI research in the last, you know, four or five years to keep the pace of progress.
We're not just getting a 10x, you're getting, you know, a million x or 100,000x. The magnitudes really matter. And one way to think about this is that you have kind of two exponentials: you have your sort of normal economy, that's growing at 2% a year, and you have your AI economy, and that's growing at like 10x a year. And it's starting out really small, but it's just way faster, and eventually it's going to overtake. Right.
And even if you have you can almost sort of just do the simple revenue extrapolation. Right.
You think your AI economy, you know, that has some growth rate. I mean, it's very simplistic way and so on.
But there's there's this sort of 10x a year process. And that will eventually kind of like, you're going to transition the sort of whole economy from, as it broadens from the sort of, you know, 2% a year to the sort of much faster growing process.
And I don't know, I think that's very consistent with historical stories of change, right? There's this sort of long-run hyperbolic trend; it manifested in the change in growth mode at the Industrial Revolution, but there's just this long-run hyperbolic trend. And now you have another sort of change in growth mode. Yeah, yeah.
I mean, that was one of the questions I asked Tyler when I had him on the podcast: the fact that after 1776 you go from a regime of negligible economic growth to 2%. Yeah, it's really interesting. From the perspective of somebody in the Middle Ages or before, 2% is equivalent to the sort of 10%. And I guess you're projecting even higher for the AI economy.
Yeah, I mean, it depends. With all this stuff, I have a lot of uncertainty, right? So a lot of the time I'm trying to kind of tell the modal story. I think it's important to be kind of concrete and visceral about it. And, you know, I have a lot of uncertainty basically over how the 2030s play out.
And basically the thing I know is it's going to be fucking crazy. But, you know, exactly what, you know, where the bottlenecks are and so on, I think that'll be kind of like.
So let's talk through the numbers here. You say hundreds of millions of AI researchers. So right now GPT-4o is like 15 bucks for a million tokens outputted, and a human thinks 150 tokens a minute or something, and if you do the math on that, I think for an hour's worth of human output it's like 10 cents or something.
Cheaper than a human worker.
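Spelling out that back-of-the-envelope with the numbers quoted here (roughly $15 per million output tokens, ~150 tokens per minute of human-equivalent output); these are the conversation's round numbers, not current price quotes.

```python
# Rough cost of an "hour of human-equivalent output" at the quoted prices.
price_per_million_tokens = 15.0   # dollars per 1M output tokens (as quoted above)
human_tokens_per_minute = 150     # rough rate of human thinking, in tokens
tokens_per_hour = human_tokens_per_minute * 60                    # 9,000 tokens
cost_per_hour = tokens_per_hour / 1e6 * price_per_million_tokens
print(f"${cost_per_hour:.2f} per human-equivalent hour")          # ~$0.14, i.e. ~10-15 cents
```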
Cheaper than a human worker. But it can't do the job yet.
That's right. That's right.
But by the time you're talking about models that are trained on the 10-gigawatt cluster, then you have something that is three or four orders of magnitude more expensive via inference, something like that. So that's like $100 an hour of labor. And now you're having hundreds of millions of such laborers. Is there enough compute to do this kind of labor with a model that is a thousand times bigger?
Great.
Okay, great question. So I actually don't think inference costs for sort of frontier models are necessarily going to go up that much.
So I mean, one historical data point is...
But isn't the test-time compute thing going to make it go up even higher?
I mean, we're just doing it per token, right? I'm just saying, suppose each model token was the same as a human token, at 100 tokens a minute. So yeah, it'll use more tokens, but the per-token calculation is already pricing that in. The question is per-token pricing, right? And so GPT-3, when it launched,
was like actually more expensive than GPT-4 now.
And so over, you know, fast increases in capability, inference cost has remained constant. That's sort of wild. I think it's worth appreciating, and I think it gestures at the underlying pace of algorithmic progress. I think there's a more theoretically grounded way to see why inference costs would stay constant, and it's the following story. On Chinchilla scaling laws, half of the additional compute you allocate to bigger models and half of it you allocate to more data, right? But also, if we go with the basic story of half an order of magnitude a year of more compute and half an order of magnitude a year of algorithmic progress, you're also kind of saving half an order of magnitude a year.
And so that kind of would exactly compensate for making the model bigger. The caveat on that is, obviously, not all training efficiencies are also inference efficiencies; a bunch of the time they are, and separately you can find inference efficiencies.
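One loose way to see the offsetting claim, under two explicit simplifications that go beyond what is said here: that Chinchilla-optimal parameter count grows with the square root of training compute (so raw per-token cost grows at half the rate of compute), and that roughly a quarter OOM per year of the algorithmic gains carries over to inference efficiency. A sketch under those assumptions, not a derivation of the actual claim:

```python
# Net change in per-token inference cost, in OOMs per year, under the stated assumptions.
compute_growth_ooms = 0.5                       # ~0.5 OOM/year more training compute (quoted)
param_growth_ooms = compute_growth_ooms / 2     # params ~ sqrt(compute) -> ~0.25 OOM/year
inference_efficiency_ooms = 0.25                # assumed share of algorithmic gains at inference
net_cost_change_ooms = param_growth_ooms - inference_efficiency_ooms
print(net_cost_change_ooms)  # ~0: per-token cost roughly flat under these assumptions
```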
So I don't know, given this historical trend, given the sort of baseline theoretical reason, I think it's not a crazy baseline assumption that the frontier models are not necessarily going to get more expensive per token.
Oh really? Okay, that's wild.
We'll see, we'll see. I mean, the other thing is, even if they get like 10x more expensive, then you have 10 million instead of 100 million, so it's not really...
But okay, so part of
the intelligence explosion is that each of them has to run experiments that are GPT-4-sized, and the results... So that takes up a bunch of compute. Yes. Then you consolidate the results of experiments, and what is synthesized?
I mean, you have a much bigger inference fleet anyway than your training fleet.
Sure. Okay.
But I think the experiment compute is a constraint.
Yeah. Okay.
I'm going back to maybe a sort of bigger, fundamental thing we're talking about here, about how we're projecting this. In your series you say we should denominate the probability of getting to AGI in terms of orders of magnitude of effective compute.
Effective here accounting for the fact that there's a quote-unquote compute multiplier if you have better algorithms. And I'm not sure that it makes sense to be confident that this is a sensible way to project progress.
It might be, but I have a lot of uncertainty about it. It seems similar to somebody trying to project when we're going to get to the moon.
And they're looking at the Apollo program from the 1950s or something. And they're like, we have some amount of effective jet fuel.
And if we get more efficient engines, then we have more effective jet fuel. And so we're going to like probability of getting to the moon based on the amount of effective jet fuel we have.
And I don't deny that jet fuel is important to launch rockets, but that seems like an odd way to denominate when you're going to get to the moon. Yeah.
Yeah. So, I mean, I think these cases are pretty different.
I don't know. I don't know how rocket science works, but I didn't get the impression that there's some clear scaling behavior with, like, the amount of jet fuel.
I think, with AI, first of all, the scaling laws have just held, right? A friend of mine pointed this out, and I think it's a great point: if you concatenate the original Kaplan scaling laws paper, which I think went from 10 to the negative 9 to 10 petaflop-days, and then concatenate the additional compute from there to GPT-4, and you assume some algorithmic progress, the scaling laws have held over probably 15 OOMs. I know it's rough, but they've held for a lot of OOMs.
They held for the specific loss function they're trained on, which is predicting the next token.
Whereas the progress you are forecasting will be required for further progress. Yes. In capabilities. Yeah, and specifically, we know that scaling can't keep working because of the data wall, and so there's some new thing that has to happen. And I'm not sure whether you can extrapolate that same scaling curve to tell us whether these unhobblings will also... Like, is this not on the same graph? The unhobblings are just a separate thing. Yeah, exactly.
So this is sort of like... Yeah, so, a few things here. Okay, on the effective compute scaling: in some sense, I think people center the scaling laws because they're easy to explain, the sort of why-does-scaling-matter story. The scaling laws came way after people, or at least Dario and Ilya, realized that scaling mattered. And I think almost more important than the loss curve is just, in general... you know, there's this great quote from Dario on your podcast.
It's just like, you know, Ilya was like, the models, they just want to learn, you know, you make them bigger, they learn more. And, and that just applied just across domains, generally, you know, all the capabilities.
And you can look at this in benchmarks. Again, like you say, there's the headwind of the data wall, and I'm sort of bracketing that and talking about it separately. The other thing is the unhobblings, right? If you just put them on the effective compute graph, these unhobblings would be kind of huge, right? Though I think, what does it even mean, what is on the y-axis here, like, say, score on this benchmark or whatever, right? And so, you know, we mentioned the LMSYS differences; RLHF, again, can be as good as a 100x; chain of thought, just going from a simple prompting change, can be like a 10x effective compute increase on math benchmarks.
I think this is useful to illustrate that unhobblings are large, but I kind of think of them as slightly separate things. And the way I think about it is that at a per-token level, I think GPT-4 is not that far away from a token of my internal monologue.
Even 3.5 to 4 took us from the bottom of the human range to the top of the human range on a lot of high school tests. And so it's like a few more 3.5 to 4 jumps per token basis, per token intelligence.
And then you've got to unlock the test time, you've got to solve the onboarding problem, make it use a computer. And then you're getting real close.
I'm reminded of... And again, the story might be wrong, but I think it is strikingly plausible.
I agree. And so I'm just...
I think actually, I mean, the other thing I'll say is like, you know, I say this 2027 timeline, I think it's unlikely, but I do think there's worlds that are like AGI next year. And that's basically if the test time compute overhang is really easy to crack.
If it's really easy to crack, then you do like four OOMs of test-time compute, from a few hundred tokens to a few million tokens, quickly. And then maybe it only takes one or two of those 3.5-to-4 jumps per token, plus it uses test-time compute, and you basically have the proto-automated engineer.
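For a sense of scale, here is the conversion of those token budgets into human-equivalent thinking time at the ~100 tokens-per-minute rate used elsewhere in the conversation; this is just a unit conversion, not a capability claim.

```python
# Converting token budgets into human-equivalent thinking time at ~100 tokens/minute.
def tokens_to_human_time(tokens: float, tokens_per_minute: float = 100) -> str:
    minutes = tokens / tokens_per_minute
    if minutes < 60:
        return f"~{minutes:.0f} minutes"
    if minutes < 60 * 40:
        return f"~{minutes / 60:.0f} hours"
    return f"~{minutes / (60 * 40):.1f} work-weeks"   # 40-hour weeks

print(tokens_to_human_time(300))        # a few hundred tokens ~ a few minutes of thought
print(tokens_to_human_time(3_000_000))  # a few million tokens ~ months of focused work
```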
So I'm reminded of Stephen Pinker releasing this book, what is it, The Better Angels of Our Nature, a couple of years ago or something. And he describes the secular decline in violence and war and everything, and you can just plot the line from the end of World War II, in fact from before World War II; those were just aberrations, whatever.
And basically, as soon as it happens, Ukraine, Gaza, everything is like... So, impending ASI and crazy global conflict.
Right, right. Impending ASI and crazy new WMDs.
I think this is the sort of thing that happens in history, where you see a straight line and you're like, oh my gosh. And then, just as soon as you make that prediction... who was that famous author?
So yeah, again, people are predicting deep learning will hit a wall every year; maybe one year they're right, but it's gone a long way and it hasn't hit a wall. And we don't have that much more to go. And, you know, so yeah.
I guess I think this is a sort of plausible story, and let's just run with it and see what it implies.
Yeah.
So, in your series, you talk about alignment from the perspective of... this is not about some doomer scheme to get the 0.01% probability distribution where things don't go off the rails.
It's more about just controlling the systems, making sure they do what we intend them to do. If that's the case, and we're going to be in this sort of geopolitical conflict with China, and part of that will involve, and what we're worried about, is them making these CCP bots that go out and take the red flag of Mao across the galaxies or something.
Then shouldn't we be worried about alignment as something that, if in the wrong hands, this is the thing that enables brainwashing, sort of dictatorial control? This seems like a worrying thing. This should be part of the sort of algorithmic secrets we keep hidden, right? How to align these models, because that's also something the CCP can use to control their models.
I mean, I think in the world where you get the democratic coalition... yeah. I mean, also, alignment is often dual-use, right? Like RLHF: the alignment team developed it, it was great, it was a big win for alignment, but it also obviously makes these models useful, right? But yeah, alignment enables the CCP bots. Alignment is also what you need to get the sort of, you know, whatever, US AIs to follow the Constitution and disobey unlawful orders and respect separation of powers and checks and balances. And so, yeah, you need alignment for whatever you want to do.
It's just, it's, it's the sort of underlying technique. Tell me what you make of this take.
I've been starting with this a little bit. Okay.
So fundamentally there's many different ways the future could go.
Yeah.
There's one path in which the Eliezer-type crazy AIs with the nanobots take the future and turn everything into gray goo or paperclips. And the more you solve alignment, the more that path of the decision tree is circumscribed. And then the more you solve alignment, the more it is just different humans and the visions they have. And, of course, we know from history that things don't turn out the way you expect, so it's not like you can decide the future. But it will appear...
It's part of the beauty of it, right? Yeah. You want these mechanisms of error correction, pluralism.
But from the perspective of anybody who's looking at the system, it will be like: I can control where this thing is going to end up. And so the more you solve alignment, and the more you circumscribe the different futures that could result from AI, the more that accentuates the conflict between humans and their visions of the future.
And so in the world where alignment is solved and the world in which alignment is solved is the world in which you have the most sort of human conflict over where to take AI. Yeah, I mean, by removing the worlds in which the AIs take over, then the remaining worlds are the ones where it's like the humans decide what happens.
And then as we talked about, there's a whole lot of worlds and how that could go. And I worry, so when you think about alignment and this is just controlling these things, just think a little forward and there's worlds in which hopefully, you know, human descendants or some version of things in the future merge with super intelligences, and they have the rules of their own, but they're in some sort of law and market-based order.
I worry about if you have things that are conscious and should be treated with rights. If you read about what alignment schemes actually are, and then you read these books about what actually happened during the Cultural Revolution, what happened when Stalin took over Russia, and you have very strong monitoring from different instances where, one, everybody's tasked with watching each other.
You have brainwashing. You have red teaming, where you have the spy stuff you were talking about, where you try to convince somebody you're on like a defector and you see if they defect with you.
And if they do, then you realize they're an enemy. And then you take, and listen, maybe I'm stretching the analogy too far.
But the ease with which these alignment techniques actually map onto something you could have read about during, like, Mao's Cultural Revolution is a little bit troubling. Yeah, I mean, look, I think sentient AI is a whole other topic.
I don't know if we want to talk about it. I agree that like it's going to be very important how we treat them.
You know, in terms of what you're actually programming these systems to do, again, alignment is just a technical problem, and a technical solution enables the CCP bots. I mean, in some sense, I almost feel like the model, when we're talking about checks and balances, is sort of like the Federal Reserve or Supreme Court justices.
And there's a funny way in which they're kind of this like very dedicated order, you know, Supreme Court justices. And it's amazing.
They're actually quite high quality, right? And they're like really smart people. They really believe in the Constitution.
They love the Constitution. They believe in their principles.
They have, you know, these wonderful backgrounds, and yeah, they have different persuasions, but they have, I think, very sincere debates about what is the meaning of the Constitution, what is the best actuation of these principles. I guess that's good. By the way, recommendation: SCOTUS oral arguments are like the best podcast, you know, when I run out of high-quality content on the internet. I mean, I think there's going to be a process of figuring out what the constitution should be. I think this constitution has worked for a long time.
You start with that. Maybe eventually things change enough that you want to add to that.
But anyway, you want them to like, you know, for example, for the checks and balances, they like, they really love the constitution and they believe in it and they take it really seriously. And like, look, at some point, yeah, you are going to have like AI police and AI military, but I think sort of like, you know, being able to ensure that they like, you know, believe in it in the way that like a Supreme Court justice does or like in the way that like a Federal Reserve, you know, official takes their job really seriously.
Yeah. And I guess the big open question is whether if you do the project or something like the project.
The other important thing is like a bunch of different factions need their own AIs. Right.
And so it's important that each political party gets to have their own, you know... whatever crazy values you might totally disagree with, it's really important that they get to have their own kind of superintelligence. And again, I think it's important that these sort of classical liberal processes play out, including different people of different persuasions and so on. And, I don't know, the AI advisors might not... they might not follow the advice or whatever, but I think it's important.
Okay, so speaking of alignment, you seem pretty optimistic.
So let's run through the source of the optimism. Yeah.
I think there, you laid out different worlds in which we could get AI. Yeah.
There's one that you think is low probability, of next year, where GPT-4 plus scaffolding plus unhobblings gets you to AGI. And there are ones where it takes much longer, and there's the modal world, where it's a couple of years out. Yeah. So GPT-4 seems pretty aligned, in the sense that I don't expect it to go off the rails. Maybe with scaffolding things might change, but it looks pretty good.
Yeah, exactly. So maybe you keep turning these cranks, you keep going up, and one of the cranks gets to ASI.
Yeah. Is there any point at which the sharp left turn happens? Is it when you start, is it the case that you think plausibly when they act more like agents, this is the thing to worry about? Yeah.
Is there anything qualitative that you expect to change with regards to the alignment picture? Yeah, so I don't know if I believe in this concept of a sharp left turn, but I do think there are important qualitative changes that happen between now and kind of somewhat superhuman systems, early on in the intelligence explosion, and then important qualitative changes that happen from early in the intelligence explosion to true superintelligence in all its power and might. And let's talk about both of those. So, okay, the first part of the problem is one we're gonna have to solve ourselves, right? We're gonna have to align the initial AIs in the intelligence explosion, the sort of automated Alec Radfords.
I think there are, I mean, two important things that change from GPT-4, right? So one of them is, if you believe the story on synthetic data, RL, or self-play to get past the data wall, and if you believe the unhobbling story, at the end you're going to have things that are agents, right, including making long-term plans; they're somehow able to act over long horizons, right? But you need that; that's the prerequisite to be able to do the automated AI research. And so I basically think pre-training is sort of alignment-neutral, in the sense that it has all these representations, it has representations of doing good things as well as representations of doing bad things, but it's not, like, scheming against you or whatever. I think the sort of misalignment can arise once you're doing more kind of long-horizon training, right? And so you're training, again, an oversimplified example, but to illustrate: you're training an AI to make money.
And if you're just doing that with reinforcement learning, it might learn to commit fraud or lie or deceive or seek power, simply because those are successful strategies in the real world, right? RL basically explores: maybe it figures out, oh, it tries to hack, and then it gets some money, and that made more money, and if that's successful, if that gets reward, that's just reinforced. So basically, I think there are more serious misalignments, kind of misaligned long-term goals, that could arise between now and then, or that necessarily have the ability to arise if you're able to get long-horizon systems.
That's one. What you want to do in that situation is you want to add side constraints, right? So you want to add, you know, don't lie, don't deceive, don't commit fraud.
And so how do you add those side constraints, right? The sort of basic idea you might have is like RLHF, right? You're kind of like, yeah, it has this goal of like, you know, make money or whatever, but you're watching what it's doing. It starts trying to like, you know, lie or deceive or fraud or whatever, break the law.
You're just kind of like thumbs down, don't do that, you anti-reinforce that. The sort of critical issue that comes in is that these AI systems are getting superhuman, and they're going to be able to do things that are too complex for humans to evaluate.
So again, even early on in the intelligence explosion, the automated AI researchers and engineers, they might write millions, billions, trillions of lines of complicated code. They might be doing all sorts of stuff you just don't understand anymore.
And so, you know, in the million lines of code, is it somewhere kind of, you know, hacking, or exfiltrating itself, or trying to go for the nukes or whatever? You don't know anymore. Right. And so this sort of thumbs-up, thumbs-down, pure RLHF doesn't fully work anymore.
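A toy sketch of the "goal plus side constraints, thumbs-down what you don't want" idea described just above, written as simple reward shaping; the function and penalty here are hypothetical illustrations, not how any lab's RLHF pipeline actually works, and the catch is exactly the one just raised: this only helps while humans can still spot the violations.

```python
# Toy reward shaping: pursue a task reward, but anti-reinforce trajectories that
# violate side constraints (lying, fraud, etc.). Illustrative only; real RLHF
# trains a reward model from human comparisons rather than hand-coded penalties.
from typing import List

def shaped_reward(task_reward: float, violations: List[str],
                  penalty: float = 10.0) -> float:
    return task_reward - penalty * len(violations)

# A trajectory that "made money" but did so by deceiving gets thumbs-downed:
print(shaped_reward(task_reward=5.0, violations=["deception"]))   # -5.0
print(shaped_reward(task_reward=5.0, violations=[]))               # 5.0
# The problem described above: once the behavior is too complex for humans to
# evaluate, you can no longer reliably populate the `violations` list.
```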
Second part of the picture.
And we should maybe talk more about this first part of the picture. I think there's a hard technical problem of what you do sort of post-RLHF, but I think it's a solvable problem, and there are various things I'm bullish on; I think there are ways in which deep learning has shaken out favorably.
The second part of the problem is you're going from your initial systems in the intelligence explosion to superintelligence, and it's many OOMs; by the end of it you have a thing that's vastly smarter than humans. I think the intelligence explosion is really scary from an alignment point of view, because if you have this rapid intelligence explosion, less than a year, two years or whatever, you're going, say in the period of a year, from systems where failure would be bad but not catastrophic, where something goes awry and it says a bad word, to systems where failure is: it exfiltrated itself.
It starts hacking the military. It can do really bad things.
You're going, in less than a year, from a world in which it's some descendant of current systems, and you kind of understand it and it has good properties, to something that potentially has a very alien and different architecture, right? After having gone through another decade of ML advances. I think one example there that's very salient to me is legible and faithful chain of thought, right? A lot of the time when we're talking about these things, we're talking about: it has tokens of thinking, and then it uses many tokens of thinking, and maybe we bootstrap ourselves because it's pre-trained, it learns to think in English, and we do something else on top so it can do the longer chains of thought. And so it's very plausible to me that for the initial automated alignment researchers, we don't need to do any complicated mechanistic interpretability; you just literally read what they're thinking, which is a huge advantage, right? However, that's very likely not the most efficient way to do it. There's probably some way to have a recurrent architecture where it's all internal states, which is a much more efficient way to do it. That's what you get by the end of the year. You're going in this year from, like, RLHF plus some extensions of it works, to: it's vastly superhuman. It's to us like an expert in the field might be to an elementary or middle schooler.
And so I think it's this incredibly hairy period for alignment. The thing you do have is the automated AI researchers, right? And so you can use the automated AI researchers to also do alignment.
And so in this world, why are we optimistic that the project is being run by people who are thinking?
I think, so here's something to think about.
OpenAI started off with people who were very explicitly thinking about exactly these kinds of things.
Yes.
But are they still there?
No, no, but here's the thing.
No, no, even the people who are there, even the current leadership, it's exactly these things; you can find them in interviews and in their blog posts talking about them. And what happens is, as you were talking about, and Jan talked about it too, this is not just you, Jan talked about it in his tweet thread: when there is some trade-off that has to be made, like we need to do this flashy release this week and not next week because Google I/O is the next week, then the trade-off is made in favor of the more careless decision. Uh-huh. Whereas when we have the government or the national security advisor or the military or whatever, which is much less familiar with this kind of discourse...
Is he naturally thinking in this way about, ah, I'm worried the chain of thought isn't faithful, and how do we think about the features that are represented here? Why should we be optimistic that a project run by people like that will be thoughtful about these kinds of considerations?
I mean, they might not be.
You know, I agree. I think, um, all right, a few thoughts, right? First of all, I think the private world, even if they sort of nominally care is extremely tough for alignment.
A couple of reasons. One, you just have the race between the sort of commercial labs, right? And it's like, you don't have any headroom there to like, be like, ah, actually we're going to hold back for three months, like get this right.
And, you know, we're going to dedicate 90% of our compute to automated alignment research instead of just pushing the next OOM. The other thing, though, is in the private world, China has stolen your weights, China has your secrets, they're right on your tails, you're in this feverish struggle, no room at all for maneuver.
So it's absolutely essential to get alignment right, and to get it right during this intelligence explosion, you need to have that room to maneuver and you need to have that clear lead. And, again, maybe you've made the deal or whatever, but I think you're in an incredibly tough spot if you don't have this clear lead.
So I think the private world is kind of rough there. On whether people will take it seriously: you know, I have some faith in the normal mechanisms of a liberal society. If alignment is an issue, which we don't fully know yet, the science will develop, we're going to get better measurements of alignment, and the case will be clear and obvious. I worry about worlds where the evidence is ambiguous, and I think a lot of the scariest intelligence-explosion scenarios are worlds in which the evidence is ambiguous.
But again, it's sort of like, if evidence is ambiguous, then that's the worlds in which you really want the safety margins. And that's also the worlds in which kind of like running the intelligence explosion is sort of like, you know, running a war, right? It's like, ah, the evidence is ambiguous.
We have to make these really tough trade-offs. And you better have a really good chain of command for that.
And it's not just like, you know, YOLOing it. Ah, let's go.
You know, it's cool. Yeah.
Let's talk a little bit about Germany. We're making the analogy to World War II.
And you made a really interesting point many hours ago. At this point.
Oh, no. We should start after.
You know, after the marathon. The fact that throughout history, World War II is not unique, at least when you think in proportion to the size of the population.
Yeah. There are these other sorts of catastrophes where a substantial portion of the population has been killed off.
Yeah. After that, the nation recovers and they get back to their heights.
Uh-huh. So, what's interesting after World War II is that Germany especially, and maybe Europe as a whole, obviously they experienced vast economic growth in the direct aftermath because of catch-up growth.
Mm-hmm. But subsequently, we just don't think of Germany as… No, we're not talking about Germany potentially launching an intelligence explosion and they're going to get in seat at the AI table.
We were talking about Iran and North Korea and Russia. We didn't talk about Germany, right? Well, because they're our allies.
Yeah, yeah, yeah. But, so, what happened? I mean, between World War II and now, it didn't, like, come back, unlike after the Seven Years' War or something, right? Yeah, yeah, yeah.
I mean, look, I'm generally very bearish on Germany. But in this context, I think you're underrating it a little bit. I think it's probably still one of the top five most important countries in the world. And Europe overall still has a GDP that's close to the size of the United States' GDP. And there are things that Germany is actually kind of good at, right? Like state capacity: the roads are good and they're clean and they're well maintained.
And, you know, in some sense, a lot of this is the sort of flip side of things that I think are bad about Germany. Right.
So in the U.S., it's a little bit like there's a bit more of a sort of Wild West feeling to the United States. Right.
And it includes the kind of crazy bursts of creativity. It includes political candidates where there's a much broader spectrum; both an Obama and a Trump are somebody you just wouldn't see in the much more confined kind of German political debate. I wrote a blog post about this at some point, on Europe's political stupor. But anyway, there's this sort of punctilious rule-following that is good in terms of keeping your kind of state capacity functioning, but there's also, I think, a sort of very constrained view of the world in some sense. And I think after World War II there's a real backlash against anything elite, you know: no elite high schools or elite colleges, and sort of no meritocracy.
Why is that the logic?
Excellence isn't cherished, you know, there's, yeah.
Why is that the logical intellectual thing to rebel against if you're trying to overcorrect from the Nazis? Yeah. Was it because the Nazis were very much into elitism? What was, I don't understand why that's a logical sort of counter reaction.
I don't know, maybe it was sort of a counterreaction against the whole Aryan-race thing. I mean, I also just think there was a certain amount of... I mean, look at the end of World War I versus the end of World War II for Germany, right? A common narrative is that the Peace of Versailles was too strict on Germany. But the peace imposed after World War II was much more strict, right? It was a complete... I mean, the whole country was destroyed. In most of the major cities, over half of the housing stock had been destroyed, right? In some birth cohorts, like 40% of the men had died.
Half the population displaced. Oh, yeah.
I mean, almost 20 million people displaced, right? Huge, crazy, right? And the borders are way smaller than the Versailles borders. Yeah, exactly. And a sort of complete imposition of a new political system, on both sides. Yeah. But in some sense that worked out better than the post-World War I peace,
where then there was this kind of resurgence of German nationalism and in some sense a thing that has been a pattern. So it's sort of like, it's unclear if you want to wake the sleeping beast.
I do think that at this point, it's gotten a bit too sleepy. Yeah.
I do think it's an interesting point about we underrate the American political system and I've been making the same correction myself. There was this book
written by a Chinese economist
called China's worldview. And overall, I wasn't a big fan, but they made a really interesting point in there.
Yeah. Which was the way in which candidates rise up through the Chinese hierarchy for politics, for administration.
In some sense, it selects for, you're not gonna get some Marjorie Taylor Greene or somebody running some... Don't get that in Germany either.
Right. But he explicitly made the point in the book that that also means we're never gonna get a Henry Kissinger or Barack Obama in China.
We're gonna get, like, by the time they end up on the Politburo, there'll be some 60-year-old bureaucrat who's never ruffled any feathers. Yeah.
Yeah. I mean, I think, I think there's something really important about the sort of like very raucous political debate.
And, um, I mean, yeah, in general kind of like, you know, there's the sense in which in America, you know, lots of people live in their kind of like own world. I mean, like we live in this kind of bizarre little like bubble in San Francisco and people, you know, and, and, um, but I, I think that's important for the sort of evolution of ideas, error correction, that sort of thing.
Um, you know, there's other ways in which the German system is more functional. Yeah.
But it's interesting that there's major mistakes, right? Like the defense spending, right? And then, you know, Russia invades Ukraine and you're like, wow, what did we do?
That's a really good point, right? The main issue is everybody agrees.
Exactly, yeah, it's a consensus blob kind of thing, right? And on the China point, you know, just having this experience of reading German newspapers, I think about how much more poorly I would understand the German debate and state of mind from just kind of afar.
I worry a lot about, you know, or I think it is interesting just how kind of impenetrable China is to me. It's a billion people, right? Right.
And like, you know, almost everything else is really globalized. You have a globalized internet and I kind of have a sense of what's happening in the UK.
You know, even if I didn't read German newspapers, I would probably sort of have a sense of what's happening in Germany. But I really don't feel like I have a sense of what the state of mind is, what the state of political debate is, for a sort of average Chinese person, or an average Chinese elite. And yeah, I find that distance kind of worrying. And, you know, there's some people who do this and they do really great work, where they go through the party documents and the party speeches, but it seems to require a lot of interpretive ability, where there are very specific words in Mandarin that will have one connotation and not another. But yeah, I think it's sort of interesting that, given how globalized everything is, and now that we have basically perfect translation machines, it's still so impenetrable.
That's really interesting. I'm sort of ashamed, almost, that I haven't done this yet. I think many months ago, when Alexey interviewed me on his YouTube channel, I said I'm meaning to go to China to actually see for myself what's going on. And I should. So by the way, if anybody listening has a lot of context on China, or, if I went to China, could introduce me to people, please email me.
Because you gotta do some pods, and you gotta find some of the Chinese AI researchers, man.
I know. I was thinking about it at some point... again, there's the fact that I don't know if they can speak freely.
I was thinking of they had these papers and on the paper they'll say who's a co-author. It's funny because I was thinking of just emailing, cold emailing everybody, like here's my Calendly.
Let's talk. I just want to see what is the vibe? Even if they don't tell me anything.
I'm just like, what kind of person is this? How westernized are they? But as I was saying this, I just remembered that, in fact, ByteDance, according to mutual friends we have at Google, cold emailed every single person on the Gemini paper and said, if you come work for ByteDance, we'll make you an L8 engineer, you'd report directly to the CTO. And in fact, this actually, I'm gonna go-
That's how the secrets go over, right?
Right. No, I meant to ask this earlier, but suppose they hired one. If there's only a hundred or so people, or maybe less, who are working on the key algorithmic secrets.
If they hired one such person, is all the alpha gone that these labs have? If this person was intentional about it, they could get a lot. I mean, they couldn't get the sort of like, I mean, actually, you could probably just also exfiltrate the code.
They could get a lot of the key ideas. Again, up until recently stuff was published, but they could get a lot of the key ideas if they tried. I think there's a lot of people who don't actually look around to see what the other teams are doing, but they could. It's scary, right? I think the project makes sort of more sense there, where you can't just recruit a Manhattan Project engineer away. And these are secrets that can be used for probably every training run in the future, that are maybe the key to the data wall, without which they can't go on, that are going to be worth, given the sort of multipliers on compute, hundreds of billions, trillions of dollars. And all it takes is, you know, China offering $100 million to somebody and being like, ah, come work for us, right?
And yeah, I mean, I'm really uncertain on how seriously China is taking AGI right now. One anecdote that was related to me, on the topic of anecdotes, by another researcher in the field: at some point they were at a conference with a Chinese AI researcher, and he was talking to him and said, I think it's really good that you're here.
And like, you know, we got to have the international coordination and stuff. And apparently this guy said that I'm the kind of most senior most person that they're going to let leave the country to come to things like this.
Wait, what's the takeaway? As in they're not letting really senior researchers leave the country. Interesting.
Kind of classic, you know, Eastern Bloc move. Yeah.
I don't know if this is true, but it's what I heard. That's what you're saying.
So I thought the point you made earlier about being exposed to German newspapers, and also, because earlier you were interested in economics and law and national security, the variety in your intellectual diet has exposed you to thinking about the geopolitical questions here in ways others aren't...
I mean, this is the first episode I've done where we've talked about things like this, which is, now that I think about it, weird, given that this is an obvious thing I should have been thinking about. Anyways, so that's one thing we've been missing.
What are you missing? You're thinking about national security, so you can't say national security. What perspective are you probably underexposed to as a result? And China, I guess you mentioned.
Yeah, so I think the China one is an important one. I mean, I think another one would be a sort of very Tyler Cowen-esque take, which is: you're not exposed to how a normal person in America will use AI, probably not, and that being kind of a bottleneck to the diffusion of these things.
I'm overrating the revenue because I'm kind of like, ah, everyone in SF is adopting it. But the kind of Joe Schmo engineer at a company, will they be able to integrate it? And then also the reaction to it, right? I mean, I think this was a question a few hours ago, about, you know, won't people kind of rebel against this and not want to do the project. I don't know, maybe they will.
Yeah, here's a political reaction that I didn't anticipate. So Tucker Carlson was recently on Joe Rogan.
I already told you about this, but I'm just gonna tell the story again. You should.
So, Tucker Carlson is on Joe Rogan. Yeah.
And they start talking about World War II. Aha.
And Tucker says, well, listen, I'm gonna say something that my fellow conservatives won't like, but I think nuclear weapons are immoral. I think it was obviously immoral that we used them on Nagasaki and Hiroshima.
And then he says, in fact, nuclear weapons are always immoral. Uh-huh.
Except when we would use them on data centers. In fact, it would be immoral not to use them on data centers because look, these people in Silicon Valley, these fucking nerds are making super intelligence.
And they say that it could enslave humanity. We made machines to serve humanity, not to enslave humanity.
And they're just going on and making these machines. And so we should, of course, be nuking the data centers.
And that is definitely not a political reaction I was expecting in 2024. I mean, who knows? It's going to be crazy.
It's going to be crazy. The thing we learned with COVID is that the left-right reactions that you would anticipate just based on hunches...
It completely flipped multiple times. Initially, like, kind of the right was like, you know...
It's so contingent.
And the left was like, this is racist. And then it flipped.
The left was really into COVID. Yeah.
And the whole thing also is just so blunt and crude. And so I think, probably in general, people really underrate this: people like to make sort of complicated, technocratic AI policy proposals.
And I think, especially if things go fairly rapidly in the run-up to AGI, there might not actually be that much space for complicated, clever proposals. It might just be much cruder reactions.
Yeah. Look, and then also when you mentioned the spies and national security getting involved and everything, and you can talk about that in the abstract.
But now that we're living in San Francisco and we know many of the people who are doing the top AI research, it's also a little scary to think about. People I personally know, I'm friends with... it's not unfeasible, if they have secrets in their head that are worth $100 billion or something: kidnapping, assassination, sabotage of their family, or...
Yeah, it's really bad. I mean, this is to the point on security. You know, right now it's just really foreign, but at some point, as it becomes really serious, you're gonna want the security guards.
Yeah. So presumably you have thought about the fact that people in China will be listening to this and reading your series.
And somehow you made the trade-off that...
It's better to let the whole world know...
Yeah.
And also including China, and wake them up to AGI, which is part of the thing you're worried about, trying to wake people up to AGI, rather than staying silent. I'm just curious, walk me through how you've thought about that trade-off.
Yeah, look, I think this is a tough trade-off. I thought about this a bunch. You know, I think people in the PRC will read this. I think there's some extent to which the cat is out of the bag: AGI being a thing people are thinking about very seriously is not new anymore. A lot of these takes are kind of old. You know, I had similar views a year ago; I might not have written it up a year ago, in part because I think the cat wasn't out of the bag enough. I think the other thing is, to be able to manage this challenge, much broader swaths of society will need to wake up, right? And if we're gonna get the project, we actually need a broad bipartisan understanding of the challenges facing us. And so, I think it's a tough trade-off, but I think the need to wake up people in the United States, in the Western world, in the democratic coalition, is ultimately imperative.
And I think my hope is more people here will read it than the PRC. You know, and I think people sometimes underrate the importance of just kind of like writing it up, laying out the strategic picture.
And, you know, I think you've done actually a great service to sort of mankind in some sense by, you know, with your podcast. And, you know, I think it's overall been good.
Okay. So, you know, on the topic of Germany, we were talking at some point about your immigration story, right? I feel like you have a kind of interesting story you haven't told, and I think you should tell it.
So a couple years ago, I was in college. I was 20, about to turn 21, I think it was.
Yeah, you came from India when you were really young, right?
Right, yeah, yeah.
So until I was eight or nine, I lived in India, and then we moved around all over the place. But because of the backlog for Indians.
Yeah, the green card backlog. Yeah.
We've been in the queue for like decades. Even though you came at eight, you're still on the H1B.
Yeah. And when you're 21, you get kicked off the queue and you have to restart the process.
I'm on my dad's, my dad's a doctor and I'm on his H1B as a dependent. But when you're 21, you get kicked off.
Yeah. And so I'm 20 and it just kind of dawns on me that this is my situation.
Yeah. And you're completely screwed.
Right. And so I also had the experience that, with my dad, we've moved all around the country.
They have to prove that for him, as a doctor, you can't get native talent. Yeah.
And you can't start a startup. Yeah.
So where can you not get native talent? And even getting the H-1B for you would have been, like, a 20% lottery. So if you're lucky, you're in this.
And they had to prove that they can't get native talent, which meant, for him, we lived in North Dakota for three years, West Virginia for three years, Maryland, West Texas.
Yeah.
And so it kind of dawned on me, this is my situation. As I turn 21, I'll be on this lottery. Even if I get the lottery, I'll be a fucking code monkey for the rest of my life, because this thing isn't going to let up.
Yeah.
Can't do a startup.
Exactly.
And so at the same time, I had been reading, for the last year... I'd been super obsessed with Paul Graham essays. My plan at the time was to make a startup or something. I was super excited about that.
Yeah.
And it just occurred to me that I couldn't do this. That like, this is just not in the cards for me.
And so I was kind of depressed about it. I remember I kind of just, I was in a daze through finals because it had just occurred to me and I was really really anxious about it.
And I remember thinking to myself at the time that if somehow I end up getting my green card before I turn 21, there's no fucking way I'm becoming a code monkey because the feeling of dread that I have is this realization that I'm just going to have to be a code monkey. And I realized that's my default path.
If I hadn't sort of made a proactive effort not to do that, I would have graduated college as a computer science student and I would have just done that. And that's the thing I was super scared about.
So that was an important sort of realization for me. Anyway, so COVID happened, and because of that, since there weren't foreigners coming, the backlog cleared fast.
And by the skin of my teeth, like a few months before I turned 21. So crazy.
Extremely contingent reasons. So crazy.
I ended up getting a green card. Yeah.
Because I got a green card, I could, you know. The whole podcast, right? Exactly.
I graduated college and I was like bumming around. Yeah.
And I got, was like, I graduated semester early. I'm going to like do this podcast, see what happens.
And if I didn't have a green card, the best case scenario... in fact, you know, it only existed because... yeah.
You know, I think it's hard to see what the impact of immigration reform is, right? What is the impact of clearing, you know, whatever, 50,000 green cards in the backlog? And you're such an amazing example of how all of this is only possible... and it's, yeah, I mean, it's just incredibly tragic, right? This is so dysfunctional.
Yeah, it's insane.
I'm glad you did it. I'm glad you kind of, you know, tried the unusual path.
Well, yeah, but I could only do it because, obviously, I was extremely fortunate that I got the green card. And I had a little bit of saved-up money. I got a small grant out of college, thanks to the Future Fund, to do this for basically the equivalent of six months.
And so it turned out really well. And then at each time and I was like, oh, okay, podcast, come on.
Like I wasted a few months on this. Let's now go do something real.
Something big would happen. I would, Jeff Bezos would...
You kept with it. Yeah, yeah.
But there would always be, just like the moment I'm about to quit the podcast, something like, Jeff Bezos would say something nice about me on Twitter. The Ilya episode gets like half a million views.
And then now, this is my career, but it was sort of very, looking back on it, incredibly contingent that things worked out the right way. Yeah.
I mean, look, if the AGI stuff goes down, you know, it'll be the most important kind of, you know, source of... it'll be how maybe most of the people who kind of end up building AGI first heard about it. Yeah, yeah, yeah.
Also, you're very much linked with the story in many ways. First, I got like a $20,000 grant from the Future Fund right out of college.
And that sustained me for six months or however long it was. And without that, I wouldn't- Tiny grant is kind of crazy.
Yeah, 10 grand or what was it? No, it's just, it's tiny, but it goes to show kind of how far small grants can go. Yeah.
Sort of the Emergent Ventures too. Exactly.
The Emergent Ventures. And, well, the last year I've been in San Francisco, we've just been in close contact the entire time, just bouncing ideas back and forth.
Basically, the alpha I have... I think people would be surprised by how much I got from you, Sholto, Trenton, and a couple others. I mean, it's been an absolute pleasure.
Yeah, likewise. It's been super fun.
Okay, so some random questions for you. Yeah.
If you could convert to Mormonism and you could really believe it, would you do it? Would you push the button? Well, okay. Okay.
Before I answer that question, one sort of observation about the Mormons. So actually, there's an article that actually made a big impact on me.
I think it was by McKay Coppins at some point, you know, in the Atlantic or whatever about the Mormons. And I think the thing he kind of, you know, and I think he even was like interviewed Mitt Romney in it and so on.
And I think the thing I thought was really interesting in this article was he talked about how the experience of growing up different, growing up very unusual, especially if you grow up Mormon outside of Utah, you know, like the only person who doesn't drink caffeine, you don't drink alcohol, you're kind of weird. How that kind of got people prepared for being willing to be outside of the norm later on.
And like, you know, Mitt Romney, you know, was willing to kind of take stands alone, you know, in his party, because he believed, you know, what he believed is true. And I don't know, I mean, probably not to the same way, but I feel a little bit like this from kind of having grown up in Germany and really not having like this sort of German system and having been kind of an outsider or something.
I think there's a certain amount in which, yeah, growing up an outsider gives you kind of unusual strength later on to be willing to say what you think. And anyway, so that is one thing I really appreciate about the Mormons, at least the ones that have grown up outside of Utah.
I think, you know, the fertility rates, they're good. They're important.
They're going down as well, right? Right. This is the thing that really clinched the kind of fertility decline story.
Even the Mormons? Yeah, even the Mormons, right? You're like, oh, this is like a sort of good story. The Mormons will replace everybody.
I don't know if it's good, but it's like, at least, come on, at least some people will maintain high fertility. But no, even the Mormons... basically, once these religious subgroups with high fertility rates grow big enough, they're too close in contact with normal society and become normalized, and fertility rates drop from, I don't remember the exact numbers, maybe like four to two, in the course of 10, 20 years.
Anyway, so now people point to the Amish or whatever, but I'm just like, it's probably just not scalable. And if you grow big enough, then the sort of overwhelming force of modernity kind of gets you. Yeah. No, if I could convert to Mormonism... look, I don't believe it, right? If I believed it, I obviously would convert to Mormonism, right? Because you can choose the world in which you do believe it.
I think there's something really valuable in believing in something greater than yourself, in having a certain amount of faith. You do, right? You know, feeling some sort of duty to the thing greater than yourself.
And, you know, maybe my version of this is somewhat different. I feel like there's some sort of historical weight on how this might play out, and I feel some sort of duty to make that go well. I feel some sort of duty to our country, to the national security of the United States. And I think that can be a force for a lot of good.
Going back to the opening, I think the thing that's especially impressive about that is: look, there's people at the company who have, through years and decades of building up savings from working in tech, probably tens of millions liquid, more than that in terms of their equity.
And the person, very many people were concerned about the clusters in the Middle East and the secrets leaking to China and all these things. But the person who actually made a hassle about it.
And I think hassling people is so underrated. I think that one person who made a hassle about it is the 22-year-old who has less than a year at the company, who doesn't have savings built up, who isn't like a solidified member of the...
I think that's sort of like...
Maybe it's me being naive and, you know, not knowing how big companies work, but I think I'm sometimes a bit of a speech deontologist. You know, I kind of believe in saying what you think. Sometimes friends tell me I should be more of a speech consequentialist.
No, I really think the number of people who, when they have the opportunity to talk to the person, will just bring up the thing...
I've been with you in multiple contexts, and I guess I shouldn't reveal who the person is or what the context was, but I've just been very impressed that the dinner begins, and by the end, somebody who has a major voice in how things go is seriously thinking about a worldview they would have found incredibly alien before the dinner. And I've been impressed with it: just give them the spiel and hassle them.
I mean, look, I think I feel this stuff pretty viscerally now. You know, there was a time when I thought about this stuff a lot, but it was kind of like econ models and these sort of theoretical abstractions. You talk about human brain size or whatever.
And I think since at least last year, I feel like I can see it. I just feel it.
And I think I can sort of see the cluster that AGI is going to be trained on. And I can see the kind of rough combination of algorithms and the people that be involved and how this is going to play out.
Yeah. And I think, look, we'll see how it plays out.
There's many ways this could be wrong. There's many ways it could go.
But I think this could get very real. Yeah.
Should we talk about what you're up to next? Sure. Yeah.
Okay. So you're starting an investment firm.
Yep. Anchor investments from Nat Friedman, Daniel Gross, Patrick Collison, John Collison.
First of all, why is this the thing to do? You believe the AGI is coming in a few years. Why the investment firm? Good question.
Fair question. Okay, so I mean, a couple of things.
One is just, you know, I think we talked about this earlier, but it's like the screen doesn't go blank, you know, when sort of AGI or superintelligence happens. I think people really underrate the sort of, basically the sort of decade after it.
You have the intelligence explosion. That's maybe the most sort of wild period, but I think the decade after is also going to be wild.
And, you know, this combination of human institutions, but superintelligence, you have crazy kind of geopolitical things going on, you have the sort of broadening of this explosive growth. And basically, yeah, I think it's going to be a really important period.
I think capital will really matter. Eventually, we're going to go to the stars, we're going to go to the galaxies.
So anyway, part of the answer is just like, look, I think done right, there's a lot of money to be made. I think if AGI were priced in tomorrow, you could maybe make 100x.
Probably you can make even way more than that because of the sequencing. And capital matters.
I think the other reason is just some amount of freedom and independence. And I think there's some people who are very smart about this AI stuff and who are kind of like see it coming.
But I think almost all of them are kind of constrained in, you know, they're in some, you know, some other position where they can't really talk about this stuff. And, you know, in some sense, I've really admired sort of the thing you've done, which is I think it's really important that there's sort of voices of reason on this stuff publicly or people who are in positions to kind of advise important actors and so on.
And so I think, basically, the thing this investment firm will be is kind of, you know, a brain trust on AI. It's going to be all about that situational awareness.
We're going to have the best situational awareness in the business. You know, we're going to have way more situational awareness than any of the people who manage money in New York.
We're definitely going to, you know, we're going to do great on investing. But it's the same sort of situational awareness that I think is going to be important for understanding what's happening, being a voice of reason publicly and sort of being able to be in a position to advise.
Yeah. I...
the book about Peter Thiel... Yeah.
They had an interesting quote about his hedge fund. I think it got terrible returns.
So this isn't the example. Well, they blew up.
That's the sort of bare case, right? It's like too theoretical. Sure, yeah.
But they had an interesting quote that it's like basically a think tank inside of a hedge fund. Yeah, that's what I'm trying to build.
Right. Yeah.
So presumably you've thought about the ways in which these kinds of things can blow up. There's a lot of interesting business history books about people who got the thesis right but timed it wrong, where they buy the idea that the internet's going to be a big deal, but they sell at the wrong time and buy at the wrong time during the dot-com boom, and so they miss out on the gains, even though they're right about the...
Anyways, yeah. What is that trick to preventing that kind of thing? Yeah, I mean, look, obviously you can't, you know, not blowing up is sort of like, you know, task number one and two or whatever.
I mean, you know, I think this investment firm is going to just be betting on AGI, betting on AGI and superintelligence before the decade is out, taking that seriously, making the bets you would make if you took that seriously. So, you know, if that's wrong, the firm is not going to do that well. The thing you have to be resistant to is one or a couple or a few kind of individual calls, right? It's like, AI stagnates for a year because of the data wall, or, you know, you got the call wrong on when revenue would go up. And so anyway, that's pretty critical.
You have to get timing right. I do think in general that the sort of sequence of bets on the way to AGI is actually pretty critical.
And I think a thing people underrate. So, all right.
I mean, yeah. So like, where does the story start? Right.
So, like, obviously the only bet over the last year was NVIDIA. And, you know, it's obvious now, but very few people did it.
This is sort of also, you know, a classic debate I and a friend had with another colleague of ours, where this colleague was really into TSM, you know, TSMC. And he was just kind of like, well, these fabs are going to be so valuable.
And also with NVIDIA, there's just a lot of NVIDIA-idiosyncratic risk, right? It's like, maybe somebody else makes better GPUs. That was basically right.
But sort of only NVIDIA had the AI beta, right? Because only NVIDIA was kind of like large fraction AI, the next few doublings would just like meaningfully explode their revenue. Whereas TSMC was, you know, a couple percent AI.
So even though there's going to be a few doublings of AI, it's not gonna make that big of an impact. All right, so it's sort of like the only place to find AI beta, basically, was NVIDIA for a while.
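To make the AI-beta point concrete, here's a toy calculation (the revenue shares below are hypothetical, purely for illustration, not either company's actual numbers): what a couple of AI doublings do to total revenue depends almost entirely on how large a fraction of revenue is AI today.

```python
# Toy illustration of "AI beta": the revenue shares are made up.
def revenue_after_ai_doublings(total: float, ai_share: float, doublings: int) -> float:
    """Total revenue after the AI slice doubles `doublings` times,
    holding the non-AI slice flat."""
    ai = total * ai_share
    non_ai = total * (1 - ai_share)
    return non_ai + ai * (2 ** doublings)

for name, ai_share in [("mostly-AI chip vendor", 0.60), ("broad foundry", 0.02)]:
    before = 100.0  # index both companies to 100 units of revenue
    after = revenue_after_ai_doublings(before, ai_share, doublings=2)
    growth = (after / before - 1) * 100
    print(f"{name}: {before:.0f} -> {after:.0f} ({growth:.0f}% growth from two AI doublings)")
```

With these made-up numbers, two doublings of the AI slice nearly triple the mostly-AI vendor's revenue while moving the broad foundry by only a few percent, which is the sense in which only the former had the AI beta.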
You know, now it's broadening, right? So now TSM is like, you know, 20% AI by like 27 or something is what they're saying. One more doubling, it'll be kind of like a large fraction of what they're doing.
And, you know, there's a whole stack: there's people making memory and CoWoS packaging, and, you know, power... utility companies are starting to get excited about AI, and they're like, oh, power production in the United States will grow, you know, not 2.5% but 5% over the next five years. And I'm like, no, it'll grow more.
You know, at some point, like a Google or something becomes interesting. People are excited about them with AI because it's like, oh, AI revenue will be 10 billion or tens of billions. And I'm kind of like, ah, I don't really care about them before then. I care about it once you get the AI beta, right? And so at some point, Google will get, you know, $100 billion of revenue from AI. Probably their stock will explode. They're going to become a 5 trillion, 10 trillion dollar company. Anyway, so the timing there is very important. You have to get the timing right. You have to get the sequence right.
You know, at some point, actually, I think there's going to be a real headwind to equities from real interest rates, right? So basically, in these sort of explosive-growth worlds, you would expect real interest rates to go up a lot, on basically both sides of the equation, right? On the supply side, or on the sort of demand-for-money side, because people are going to be making these crazy investments, you know, initially in clusters and then in the robo-factories or whatever, right? And so they're going to be borrowing like crazy; they want all this capital. And then on the sort of consumer-saving side, right, to give up all this capital... you know, it's the Euler equation, the standard intertemporal trade-off of consumption. Standard.
Very standard. Possibly, you know, some of our friends have a paper on this.
You know, basically, if consumers expect real growth rates to be higher, interest rates are going to be higher, because they're less willing to give up consumption today for consumption in the future. Anyway, so at some point, real interest rates will go up, and if sort of eta is greater than one, that actually means higher growth rate expectations mean equities go down, because the interest rate effect outweighs the growth rate effect. And so at some point, there's like the big bond short.
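For readers who want the textbook version of the logic being gestured at here, a rough sketch under a standard Ramsey/CRRA setup (the symbols and the specific functional forms are my gloss, not something spelled out in the conversation): with curvature eta (the inverse of the elasticity of intertemporal substitution), time preference rho, expected consumption growth g, and a Gordon-style valuation of a dividend stream D growing at g,

$$ r \approx \rho + \eta\, g, \qquad P \approx \frac{D}{r - g} = \frac{D}{\rho + (\eta - 1)\, g}. $$

Higher expected growth pushes real rates up in any case, and if eta is greater than one the discount-rate effect outweighs the cash-flow-growth effect, so valuations fall even as growth expectations rise; that is the sense in which the bond short, and the equity headwind, show up in this scenario.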
You got to get that right. You got to get it right, that nationalization.
You got, yeah, anyway. So there's this whole sequence of things.
You got to get that right.
Unknown unknowns.
Unknown unknowns, yeah. And so, look, you've got to be really, really careful about your overall risk positioning, right? Because if you expect these kinds of crazy events to play out, there's going to be crazy things you didn't foresee. You do also want to make the kinds of bets that are tailored to your scenarios, in the sense that you want to find bets that are bets on the tails, right? You know, I don't think anyone is expecting interest rates to go above 10%, like real interest rates. But, you know, I think there's at least a serious chance of that before the decade is out.
And so, you know, maybe there's some like cheap insurance you can buy on that, you know, that pays off. Very silly question.
In these worlds, are financial markets where you make these kinds of bets going to be respected? Like, is my Fidelity account going to mean anything when we have the 50% economic growth? Like, who's going to be like, we've got to respect his property rights?
The bond short, the sort of 50% a year growth, that's pretty deep into it. I mean, again, there's this whole sequence of things. But yeah, no, I think property rights will be restricted, again, in the sort of modal world, the project, yeah. At some point, there's going to be figuring out the property rights for the galaxies.
That'll be interesting. That will be interesting.
So there's an interesting question about going back to your strategy about, well, the 30s will really matter a lot about how the rest of the future goes. And you want to be in a position of influence by that point because of capital.
It's worth considering, as far as I know, there's probably a whole bunch of literature on this. I'm just riffing.
But the landed gentry before the beginning of the Industrial Revolution, I'm not sure if they were able to leverage their position in a sort of Georgist or Piketty-type sense in order to accrue the returns that were realized through the Industrial Revolution.
And I don't know what happened.
At some point, they just weren't the landed gentry.
But I'd be concerned that even if you make great investment calls, you'll be like the guy who owned a lot of land, farmland, before the Industrial Revolution. And the guy who's actually going to make a bunch of money is the one with the steam engine.
Even if he doesn't make that much money, most of the benefits are sort of widely diffused and so forth. I mean, I think the analog is like you sell your land, you put it all in sort of the people who know, the people who are building the new industry.
I think the, I mean, I think the sort of like real depreciating asset, you know, for me is human capital, right? Yeah, no, look, I'm serious, right? It's like, you know, there's something about like, you know, I don't know, I was like valedictorian of Columbia, you know, the thing that made you special is you're smart, right? But actually, like, you know, that might not matter in like four years, you know, because it's actually automatable. Right.
And so anyway, a friend joked that the sort of investment firm is perfectly hedged for me. It's like, you know, either like AGI this decade.
And yeah, your human capital is depreciated, but you've turned that into financial capital. Or, you know, like no AGI this decade, in which case maybe the firm doesn't do that well.
But, you know, you're still in your 20s and you're still smart. Excellent.
And what's your story for why AGI hasn't been priced in? The story... Financial markets are supposed to be very efficient.
It's very, very hard to get an edge. Here, naively, you just say, well, I've looked at these scaling curves and they imply that we're going to be buying much more compute and energy than the analysts realize.
Shouldn't those analysts be broke by now? What's going on? Yeah. I mean, I used to be a true EMH guy.
I was an economist, you know. I think the thing I changed my mind on is that I think there can be groups of people, smart people, say they're in San Francisco, who do just have an edge over the rest of society in kind of seeing the future. And so, like COVID, right? I think there was honestly a kind of similar group of people who just saw that and called it completely correctly.
And, you know, they shorted the market, they did really well, you know, a bunch of other sort of things like that. So, you know, why is AGI not priced in? It's sort of like, you know, why hasn't the government nationalized the labs yet? Right.
It's like, you know, society hasn't priced it in yet; it hasn't completely diffused. And, you know, again, I just think not that many people take these ideas seriously.
Yeah, yeah. A couple other sort of ideas that I was playing around with, with regards to... that we didn't get a chance to talk about, but the systems competition.
Very interesting.
Um, the, uh, the, one of my favorite books about World War II is a Victor Davis Hanson summary of everything. And he explains why the Allies made better decisions than the Axis.
Why did they? And so obviously there were some decisions the Axis made that were pretty... Like Blitzkrieg, whatever.
That was sort of by accident, though. In what sense? That they just had the infrastructure left over? Well, no, I mean, the sort of, I think, I mean, I don't, I mean, I think sort of my read of it is Blitzkrieg wasn't kind of some like ingenious strategy.
It was just kind of, it was like more like their hand was forced. I mean, this is sort of the very Adam Tuzian story of World War II, right? But it was, you know, there's sort of this long war versus short war.
I think it's actually kind of an important concept. I think sort of Germany realized that if they were in a long war, including the United States, you know, they would not be able to compete industrially.
So their only path to victory was like, make it a short war, right? And that sort of worked much more spectacularly than they thought, right? And sort of take over France and take over much of Europe. And so then, you know, the decision invade the Soviet Union, it was, you know, it was, it was, look, if it was, it was about the Western front in some sense, because it was like, we've got to get the resources.
You know, we don't, we're actually, we don't actually have a bunch of the stuff we need, like, you know, oil and so on. You know, Auschwitz was actually just this giant chemical plant to make kind of like synthetic oil and a bunch of these things.
It's the largest industrial project in Nazi Germany. And so, you know, and sort of they thought, well, you know, we completely crushed them in World War I, you know, it'll be easy, we'll invade them, we'll get the resources, and then we can fight on the Western Front.
And even during the sort of whole invasion of the Soviet Union, even though kind of like a large amount of the sort of, you know, the sort of deaths happened there, you know, like a large fraction of German industrial production was actually, you know, like planes and naval, you know, and so on. Those directed, you know, towards the Western Front and towards, you know, the Western allies.
Well, and then so the point that Hanson was making was... By the way, I think this concept of like long war and short war is kind of interesting with respect to thinking about the China competition, which is like, you know, I worry a lot about kind of, you know, decline of sort of American, like latent American industrial capacity.
You know, like I think China builds like 200 times more ships than we do right now, you know, in some crazy way. And so it's like, maybe we have this superiority, say in the non-AI worlds, we have the superiority in military material to kind of like win a short war, at least, you know, kind of defend Taiwan in some sense.
But if it actually goes on, you know, maybe China is much better able to mobilize industrial resources in a way that we just don't have the same ability to anymore. I think this is also relevant to the AI thing, in the sense that it may come down to sort of a game about building, right? Including: maybe AGI takes the trillion-dollar cluster, not the hundred-billion-dollar cluster. Or maybe AGI is on the hundred-billion-dollar cluster, but it really matters if you can run 10x, if you can do one more order of magnitude of compute for your superintelligence or whatever. You know, maybe right now they're behind, but they just have this sort of raw latent industrial capacity to outbuild us. Yeah, and that matters both in the run-up to AGI and after, right? Where it's like, you have the superintelligence on your cluster.
Now it's time to kind of like expand the explosive growth. And you know, like, will we let the robo factories run wild? Like maybe not, but like maybe China will.
Or like, you know, will we, will, yeah, will we produce the, how many, how many of the drones will we produce? Um, and I think, yeah, so there's some sort of like outbuilding in the industrial explosion that I worry about. You've got to be one of the few people in the world who is both concerned about alignment, but also wants to make sure that we'll let the robot factories proceed once we get the ASI to beat out China.
Which is like very... It's all part of the picture.
Yeah, yeah, yeah. Yeah.
But by the way, speaking of the ASIs and the robot factories, one of the interesting things... Robo armies too.
Yeah, one of the interesting things, there's this question of what you do with industrial scale intelligence, and obviously it's not chatbots. Yeah.
But I think it's very hard to predict in a net. Yeah, yeah.
But the history of oil is very interesting, where in the, I think it's in the 1860s that we figure out how to refine oil, some geologist. And so then Standard Oil gets started.
There's this huge boom. It changes American politics.
Entire legislators are getting bought out by oil interest. And presidents are getting elected based on the divisions about oil and breaking them up and everything.
And all of this has happened, the world has been revolutionized, before the car has even been invented.
And so when the light bulb was invented, I think it was like 50 years after oil refining had been discovered, the majority of Standard Oil's history is before the car is invented. Kerosene lamps.
Exactly. So it's just used for lighting.
Oh, and then they thought oil would just no longer be relevant?
Yeah, yeah. So there was a concern that Standard Oil would go bankrupt
when the light bulb was invented.
And, but then there's sort of...
You realize that there's immense amount of compressed energy here.
You're going to have billions of gallons of this stuff a year.
And it's hard to sort of predict in advance what you can do with that.
Yep, that's right. And then later on, it turns out, oh, transportation, cars, that's what it'd be used for.
Anyways, with intelligence, maybe one answer is the intelligence explosion. Right.
But even after that, so you have all these ASIs and you have enough compute, especially the compute they'll build, to run… Hundreds of millions of GPUs will hum. Yeah.
But what are we doing with that? It's very hard to predict in advance. And I think it will be very interesting to figure out what the Jupiter brains will be doing.
So, look, there's situational awareness of where things stand now. Uh-huh.
And we've gotten a good dose of that. Obviously, a lot of the things we're talking about now, you couldn't have prejudged many years back in the past.
And part of your worldview implies that things will accelerate because of AI getting into the process. Many other things that are unpredictable fundamentally.
Basically how people will react, how the political system will react, how foreign adversaries will react. Those things will become evident over time.
So the situational awareness is not just knowing where the picture stands now, but being in a position to react appropriately to new information, to change your worldview as a result, to change your recommendations as a result. Yep.
What is the appropriate way to think about situational awareness as a continuous process rather than as a one-time thing you realized? Yep. No, I think this is great.
Look, I think there's a sort of mental flexibility and willing to change your mind that's really important. I actually think this is sort of like how a lot of brains have been broken in the AGI debate, right? The sort of the doomers who actually, you know, I think were really prescient on AGI and thinking about the stuff, you know, like a decade ago, but, you know, they haven't actually updated on the empirical realities of deep learning.
They're sort of like, their proposals are really kind of naive and unworkable. It doesn't really make sense.
You know, there's people who come in with sort of a predefined ideology. They're just kind of like, you know, little bit.
They like to shitpost about technology but they're not actually thinking through it. Either they're sort of stagnationists who think this stuff is only going to be a chatbot so of course it isn't risky or they're just not thinking through the kind of actually immense national security implications and how that's going to go.
I actually think there's kind of a risk in having written the stuff down and put it online. I think this sometimes happens to people, a sort of calcification of the worldview, because now they've publicly articulated this position, and maybe there's some evidence against it, but they're clinging to it. And so I actually want to give the big disclaimer: I think it's really valuable to paint a very concrete and visceral picture, and I think this is currently my best guess on how this decade will go.
I think if it goes anywhere like this, it will be wild. But, you know, given the rapid pace of progress, we're going to keep getting a lot more information.
And, you know, I think it's important to sort of keep your head on straight about that. You know, I feel like the most important thing here, and this relates to some of the stuff we've talked about, the world being surprisingly small and so on, is this. I feel like I used to have this worldview of, look, there's important things happening in the world, but there's people who are taking care of it, you know, the people in government. And again, even the AI labs I'd idealized: people are on it, surely they must be on it, right? And I think just some of this personal experience, even seeing how COVID went... there's not somebody else who's just kind of on it and making sure this goes well, however it goes. You know, the thing that I think will really matter is that there are good people who take this stuff as seriously as it deserves, who are willing to take the implications seriously, who have situational awareness, are willing to change their minds, are willing to stare the picture in the face. And, you know, I'm counting on those good people.
All right, that's a great place to close, Leopold.
Thanks so much, Dwarkesh.
This is an absolute joy. Hey, everybody.
I hope you enjoyed that episode with Leopold. There's actually one more riff about German history that he had after a break, and it was pretty interesting, so I didn't want to cut it out.
So I've just included it after this outro. You can advertise on the show now, so if you're interested, you can reach out at the form in the description below. Other than that, the most helpful thing you can do is just share the episode if you enjoyed it: send it to group chats, Twitter, wherever else you think people who might like this episode might congregate. And other than that, I guess, here's this riff on Frederick the Great. See you on the next one.
I mean, I think the actual funny thing is, you know, a lot of the sort of German history stuff we've talked about is sort of like not actually stuff I learned in Germany. It's sort of like stuff that I learned after.
And there's actually, you know, a funny thing where I kind of would go back to Germany over Christmas or whatever. And suddenly you understand the street names.
You know, it's like Gneisenau and Scharnhorst, and they're all these Prussian military reformers, and you finally understood, you know, Sanssouci, and you're like, ah, that was Frederick's. You know, Frederick the Great is this really interesting figure, where, so, he's in some sense this kind of gay lover of the arts, right? He only wants to speak French. You know, he plays the flute.
He composes, he has all the great artists of his day, you know, over at Sanssouci. And he actually had this really tough upbringing, where his father was this really stern Prussian military man.
And Frederick the Great as a child, as sort of a 17-year-old or whatever, basically had a male lover.
And what his father did was imprison his son and then, I think, hang his male lover in front of him. And again, his father was this kind of very stern Prussian guy.
He was this kind of gay lover of the arts, but then later on, Frederick the Great turns out to be, you know, one of the most successful Prussian conquerors, right? He gets Silesia, he wins the Seven Years' War. You know, also an amazing military strategist. Amazing military strategy at the time consisted of, like, he was able to flank the army, and that was crazy, that was brilliant. And then they almost lose the Seven Years' War at the very end, you know, but the Russian czar changes, and he's like, ah, I'm actually kind of a Prussia stan, you know, I'm into this stuff, and then he lets them off, he's like, let's let their army be. Okay. And, anyway, sort of a, yeah, kind of bizarre, interesting figure in German history.