Invest Like the Best with Patrick O'Shaughnessy - 加文·贝克 - 英伟达对决谷歌,规模法则与AI经济学 - [最佳投资之道,第451期] 封面

加文·贝克 - 英伟达对决谷歌,规模法则与AI经济学 - [最佳投资之道,第451期]

Gavin Baker - Nvidia v. Google, Scaling Laws, and the Economics of AI - [Invest Like the Best, EP.451]

本集简介

本周的嘉宾是加文·贝克。加文是阿特雷迪斯管理公司的管理合伙人兼首席投资官,他此前多次做客本节目。 我永远不会忘记2017年第一次见到加文的情景。他对市场的兴趣、对世界的求知欲,是我见过的所有投资者中最具感染力的。他对当今科技领域的了解堪称百科全书,我很幸运能每隔一两年就在这档播客中邀请他做客。 加文二十多年前就开始以投资者身份关注英伟达,这让他对这家公司以及整个半导体生态系统的发展演变拥有了罕见的视角。自我们一年前的上一次对话以来,许多事情已经改变,现在正是重新探讨这一话题的绝佳时机。 在这次对话中,我们讨论了加文感兴趣的一切——英伟达的GPU、谷歌的TPU、不断变化的AI格局、AI公司的数学原理与商业模式,以及其间的一切。我们还讨论了“太空数据中心”的构想,他一如既往地以热情和逻辑阐述了这一观点。 最后,因为我之前已经问过他我传统的结尾问题,这次我换了一个不同的问题,这引出了他从未向我讲述过的整个投资生涯起源故事。 因为加文是我认识的最具热情的思想者和投资者之一,这些对话总是我最喜爱的。希望你喜欢这次与加文·贝克的最新对话。 如需完整节目笔记、文字稿及提及内容的链接,请访问节目页面⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠here⁠⁠⁠⁠⁠⁠⁠⁠。⁠⁠⁠⁠⁠⁠⁠⁠ ----- 本集由由⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Ramp⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠赞助。Ramp的使命是帮助公司以降低开支、释放团队时间专注于更有价值项目的方式管理支出。前往⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ramp.com/invest⁠免费注册,即可获得250美元欢迎奖励。 ----- 本集由本集由⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Ridgeline⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠赞助。Ridgeline为投资经理打造了一套完整、实时、现代化的操作系统,通过一体化实时云平台处理交易、投资组合管理、合规、客户报告等众多功能。前往⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠ridgelineapps.com⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠了解更多平台信息。 ----- 本集由⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠AlphaSense⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠赞助。AlphaSense通过前沿AI技术和海量顶级可靠商业内容,彻底革新了研究流程。《像最好的投资者那样投资》的听众现在可前往⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠Alpha-Sense.com/Invest⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠免费试用,亲身体验AlphaSense和Tegus如何助你更快做出更明智的决策。 ----- 本集的编辑与后期制作由The Podcast Consultant(⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠https://thepodcastconsultant.com⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠⁠)提供。 节目笔记: (00:00:00) 欢迎来到《像最好的投资者那样投资》 (00:04:00) 认识加文·贝克 (00:06:00) 理解Gemini 3 (00:09:05) 预训练的扩展定律 (00:12:12) 谷歌 vs 英伟达 (00:14:53) 谷歌作为最低成本的Token生产者 (00:25:18) AI可以自动化任何可验证的任务 (00:30:23) AI的看空观点:边缘AI (00:33:43) 从智能到实用性的转变 (00:36:34) 财富500强企业中的AI采用 (00:41:40) 前沿模型与行业动态 (00:48:05) 中国的失误与Blackwell的地缘政治杠杆 (00:50:11) OpenAI的红色警报 (00:52:50) 太空中的数据中心 (00:58:00) AI中的周期 (01:01:14) 电力作为瓶颈 (01:03:51) AI原生创业者 (01:05:41) 半导体风险投资 (01:09:26) SaaS行业正在犯的错误 (01:14:53) 一系列泡沫 (01:16:31) AI需要什么,就会得到什么 (01:17:53) 投资是对真理的追寻 (01:18:37) 加文的投资起源故事

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

这里有个值得思考的问题:如果你的财务团队突然每月多出一周时间,你会让他们做什么工作?

Here's an interesting question to think about: If your finance team suddenly had an extra week every month, what would you have them work on?

Speaker 0

大多数CFO都不知道答案,因为他们的财务团队总在最后一刻疲于处理丢失的报销单、发票编码和收据追踪。

Most CFOs don't know because their finance teams are grinding it out on lost expense reports, invoice coding, and tracking down receipts until the last possible minute.

Speaker 0

这正是Ramp立志要解决的问题。

That's exactly the problem that Ramp set out to solve.

Speaker 0

审视财务工作中那些人人暗自厌恶的环节,思考为什么这些事还需要人力来完成?

Looking at the parts of finance everyone quietly hates and asking why are humans doing any of this?

Speaker 0

事实证明根本不需要。

Turns out they don't need to.

Speaker 0

Ramp的人工智能能自动处理85%的费用审核,准确率高达99%。

Ramp's AI handles 85% of expense reviews automatically with 99% accuracy.

Speaker 0

这意味着你的财务团队将不再是处理事务的部门,而会成为思考战略的团队。

Which means your finance team stops being the department that processes stuff and starts being the team that thinks about stuff.

Speaker 0

这才是真正的转变:使用Ramp的公司不只是节省时间——他们是在重新分配时间价值。

Here's the real shift: Companies using Ramp aren't just saving time, they're reallocating it.

Speaker 0

当竞争对手花两周时间做账时,你已经在规划下一季度了。

While competitors spend two weeks closing their books, you're already planning next quarter.

Speaker 0

当他们忙着整理电子表格时,你在思考新的定价策略、新市场以及下一美元的投资回报从何而来。

While they're cleaning up spreadsheets, you're thinking about new pricing strategy, new markets, and where the next dollar of ROI comes from.

Speaker 0

这种差异会不断累积。

That difference compounds.

Speaker 0

访问ramp.com/invest试用Ramp,看看当必须做的工作不再阻碍你想做的工作时,你的团队能获得多少杠杆效应。

Go to ramp.com/invest to try Ramp and see how much leverage your team gains when the work you have to do stops getting in the way of the work that you want to do.

Speaker 0

对我来说,Ridgeline不仅仅是一个软件供应商,更是创新道路上真正的合作伙伴。

To me, Ridgeline isn't just a software provider, it's a true partner in innovation.

Speaker 0

他们正在重新定义资产管理技术的可能性,帮助公司更快扩展、更智能运营并保持领先优势。

They're redefining what's possible in asset management technology, helping firms scale faster, operate smarter, and stay ahead of the curve.

Speaker 0

我想分享一个他们如何产生实际影响的真实案例。

I want to share a real world example of how they're making a difference.

Speaker 0

让我向您介绍布莱恩。

Let me introduce you to Brian.

Speaker 0

布莱恩,请介绍一下你自己并谈谈你的职责。

Brian, please introduce yourself and tell us a bit about your role.

Speaker 1

我叫布莱恩·斯特兰奇。

My name is Brian Strang.

Speaker 1

我是技术运营负责人,在国会资产管理公司工作。

I'm the technical operations lead, and I work at Congress Asset Management.

Speaker 0

你如何描述与Ridgeline的合作体验?

How would you describe your experience working with Ridgeline?

Speaker 1

Ridgeline是技术合作伙伴,而非软件供应商,他们的人员真心在乎客户。

Ridgeline is a technology partner, not a software vendor, and the people really care.

Speaker 1

我经常接到销售电话,但从不理会。

I get sales calls all the time, and I ignore them.

Speaker 1

Ridgeline很快就赢得了我的信任。

Ridgeline sold me very quickly.

Speaker 1

我们的规模从70亿增长到230亿,目标是达到500亿。

We went from 7,000,000,000 to 23,000,000,000, and the goal is 50,000,000,000.

Speaker 1

Ridgeline显然是帮助我们实现规模扩张的领先者。

Ridgeline was the clear front runner to help us scale.

Speaker 0

在您看来,Ridgeline最突出的特点是什么?

In your view, what most distinguishes Ridgeline?

Speaker 1

他们重新构想了这个行业应有的运作方式。

They reimagined how this industry should work.

Speaker 1

很明显他们的运营水平与众不同。

It was obviously they were operating on another level.

Speaker 0

值得联系Ridgeline,看看能为您的公司带来怎样的变革。

It's worth reaching out to Ridgeline to see what the unlock can be for your firm.

Speaker 0

访问RidgelineApps.com预约演示。

Visit RidgelineApps dot com to schedule a demo.

Speaker 0

投资中最困难的部分之一就是在众人之前洞察行业变化。

One of the hardest parts of investing is seeing what's shifting before everyone else does.

Speaker 0

AlphaSense正帮助投资者做到这一点。

AlphaSense is helping investors do exactly that.

Speaker 0

你可能已经知道AlphaSense这个市场情报平台,它受到全球75%顶级对冲基金的信任,提供访问超过5亿优质信息来源,包括公司文件、经纪商研究报告、新闻、行业期刊以及超过20万份专家电话会议记录。

You may already know Alphasense as the market intelligence platform trusted by 75% of the world's top hedge funds, providing access to over 500,000,000 premium sources from company filings and broker research to news, trade journals, and over 200,000 expert transcript calls.

Speaker 0

你可能不知道的是,他们最近推出了一项颠覆性的AI驱动的渠道核查功能。

What you might not know is that they've recently launched something game changing AI powered channel checks.

Speaker 0

渠道核查能让你在上市公司财报或共识修正数据公布前数周,就获得由专家提供的实时视角。

Channel checks give you a real time expert driven perspective on public companies weeks before they show in earnings or consensus revisions.

Speaker 0

AlphaSense利用AI访谈员每月与真实人类专家进行数千次电话访谈,通过向不同专家提出统一问题,确保信号清晰、可比且实用。

AlphaSense uses an AI interviewer to run thousands of expert calls with real human experts every month, asking consistent questions across experts so the signals are clean, comparable, and useful.

Speaker 0

随着访谈内容实时更新,您可以获取完整文字记录,并覆盖所有主要行业。

You get live updates as interviews come in, full transcript access, and coverage across every major sector.

Speaker 0

即时比较专家见解,分析季度间情绪变化趋势及关键绩效指标。

Instantly compare insights across experts and analyze quarter over quarter trends in sentiment and key performance indicators.

Speaker 0

对于试图领先快速变化市场的投资者而言,这已是基本配置。

For investors trying to stay ahead of the fast moving markets, it's already table stakes.

Speaker 0

大家好,欢迎各位的到来。

Hello, and welcome, everyone.

Speaker 0

我是帕特里克·奥肖内西,这里是《像最佳投资者一样投资》。

I'm Patrick O'Shaughnessy, and this is Invest Like The Best.

Speaker 0

本节目是对市场、理念、故事和策略的开放式探索,旨在帮助您更好地投资时间和金钱。

This show is an open ended exploration of markets, ideas, stories, and strategies that will help you better invest both your time and your money.

Speaker 0

如果您喜欢这些对话并想深入了解,请查看我们的季刊《Colossus Review》,其中包含对塑造商业和投资人物的深度剖析。

If you enjoy these conversations and wanna go deeper, check out Colossus Review, our quarterly publication with in-depth profiles of the people shaping business and investing.

Speaker 0

您可以在joincolossus.com上找到《Colossus Review》以及我们所有的播客内容。

You can find Colossus Review along with all of our podcasts at joincolossus.com.

Speaker 2

帕特里克·奥肖内西是Positive Sum公司的首席执行官。

Patrick O'Shaughnessy is the CEO of Positive Sum.

Speaker 2

帕特里克及播客嘉宾表达的所有观点仅代表其个人意见,并不反映Positive Sum的观点。

All opinions expressed by Patrick and podcast guests are solely their own opinions and do not reflect the opinion of Positive Sum.

Speaker 2

本播客仅供信息参考,不应作为投资决策的依据。

This podcast is for informational purposes only and should not be relied upon as a basis for investment decisions.

Speaker 2

Positive Sum的客户可能持有本播客中讨论的证券头寸。

Clients of positive sum may maintain positions in the securities discussed in this podcast.

Speaker 2

了解更多信息,请访问psum.vc。

To learn more, visit psum.vc.

Speaker 0

我永远不会忘记2017年第一次见到加文·贝克时的情景。

I will never forget when I first met Gavin Baker in 2017.

Speaker 0

我发现他对市场的兴趣、对世界的好奇心,是我见过最具感染力的投资者之一。

I find his interest in markets, his curiosity about the world to be as infectious as any investor that I've ever come across.

Speaker 0

他对当今科技领域的动态了如指掌。

He is encyclopedic on what is going on in the world of technology today.

Speaker 0

我很幸运能每隔一两年就在这档播客中邀请他做客。

And I've had the good fortune to host him every year or two on this podcast.

Speaker 0

在这次对话中,我们聊遍了所有让加文感兴趣的话题。

In this conversation, we talk about everything that interests Gavin.

Speaker 0

我们探讨了英伟达、谷歌及其TPU芯片、不断变化的人工智能格局、AI公司的数学与商业模式,以及其间的一切。

We talk about Nvidia, Google and its TPUs, the changing AI landscape, the math and business models around AI companies, and everything in between.

Speaker 0

我们甚至讨论了太空数据中心的疯狂构想,他用一贯的热情与逻辑阐述了这个想法。

We even discussed the crazy idea of data centers in space, which he communicates with his usual passion and logic.

Speaker 0

最后,在这次对话结束时,因为我之前已经问过他我的传统结束问题,所以我问了他一个不同的问题,这引发了他整个投资起源故事的讨论,这是我以前从未听过的。

In closing, at the end of this conversation, because I've asked him my traditional closing question before, I asked him a different question, which led to a discussion of his entire investing origin story that I had never heard before.

Speaker 0

因为加文是我认识的最充满热情的思想家和投资者之一,这些对话始终是我最喜欢的。

Because Gavin is one of the most passionate thinkers and investors that I know, these conversations are always amongst my most favorite.

Speaker 0

希望你们喜欢这次与加文·贝克系列讨论的最新一期。

I hope you enjoy this latest in the series of discussions with Gavin Baker.

Speaker 0

我很想聊聊你是如何在具体过程中应对AI世界不断涌现的新事物的,因为变化实在太频繁了。

I would love to talk about how you, like, in the nitty gritty process, new things that come out in this AI world because it's happening so constantly.

Speaker 0

我对这个非常感兴趣,发现很难跟上节奏。

I'm extremely interested in it, I find it very hard to keep up.

Speaker 0

而且,我有几个常读的博客和一些会联系的朋友,但就拿Gemini三号作为最近的例子来说吧。

And I, you know, I have a couple of blogs that I go read and friends that I call, but like maybe let's take Gemini three as like a recent example.

Speaker 0

等它发布的时候,带我去你办公室看看。

When that comes out, take me into your office.

Speaker 0

比如你在做什么具体工作?

Like what are you doing?

Speaker 0

考虑到这些更新如此频繁,你和你的团队是如何处理像这样的更新的?

How do you and your team process an update like that given how often these things are happening?

Speaker 1

我认为首要的是你必须亲自使用它。

I mean, I think the first thing is you have to use it yourself.

Speaker 1

我只想说,我对如此多知名且资深的投资者对AI得出非常明确的结论感到惊讶。

And I would just say I'm amazed at how many famous and August investors are reaching really definitive conclusions about AI.

Speaker 1

不,是基于免费层级。

Well, no, based on the free tier.

Speaker 1

免费版就像是在和一个10岁的孩子打交道。

The free tier is like you're dealing with a 10 year old.

Speaker 1

没错。

Right.

Speaker 1

然后你根据这个10岁孩子的表现来判断成年人的能力。

And you're making conclusions about the 10 year old's capabilities as an adult.

Speaker 1

其实你完全可以选择付费,我认为确实需要购买最高级别的服务,无论是Jim My Ultra、Super Grock还是其他什么,每月200美元的高端套餐是必须的。

And you could just pay, and I do think actually you do need to pay for the highest tier, whether it's Jim My Ultra, Super Grock, whatever it is, you have to pay the $200 per month tiers.

Speaker 1

而那些就像是完全成熟的30、35岁的成年人。

Whereas those are like a fully fledged 30, 35 year old.

Speaker 1

很难从一个8岁或10岁的孩子身上推断出35岁成年人的样子。

It's really hard to extrapolate from an eight or a 10 year old to the 35 year old.

Speaker 1

然而很多人正在这么做。

And yet a lot of people are doing that.

Speaker 1

第二点是,OpenAI内部有篇帖子提到,在很大程度上,OpenAI是靠Twitter(现称X)维系的。

And the second thing is there was a, you know, an insider post about OpenAI and they said to a large degree, OpenAI runs on Twitter Vox.

Speaker 1

我个人认为AI的发展动态主要发生在X平台上。

And I just think AI happens on X.

Speaker 1

平台上曾出现过一些令人难忘的时刻。

There've been some really memorable moments.

Speaker 1

比如Meta的PyTorch团队和Google的Jax团队在X上爆发过一场大战。

Like there was a giant fight between the PyTorch team at Meta and the Jax team at Google on X.

Speaker 1

最后双方实验室负责人不得不公开介入,声明禁止各自团队成员诋毁对方实验室。

And the leaders of each lab had to step in publicly say, no one from my lab is allowed to say bad things about the other lab.

Speaker 1

我尊重他们。

And I respect them.

Speaker 1

事情就这样结束了。

And that is the end of that.

Speaker 1

是啊。

Yeah.

Speaker 1

公司之间都在互相评论对方的帖子。

Companies are all commenting on each other's posts.

Speaker 1

你知道,研究论文会发表出来。

You know, the research papers come out.

Speaker 1

地球上大约有500到1000人真正理解并处于这一领域的最前沿,其中相当一部分人在中国,我认为必须密切关注这些人。

There's a list of, if on planet earth, there's 500 to a thousand people who really, really understand this and are at the cutting edge of it, and a good number of them live in China, I just think you have to follow those people closely.

Speaker 1

而且我认为这里有非常宝贵的信号。

And I think there is incredible signal.

Speaker 1

人工智能领域的一切都只是下游产物。

Everything in AI is just downstream.

Speaker 1

那些人吗?

Of those people?

Speaker 1

是的。

Yeah.

Speaker 1

安德烈·卡帕西写的每篇文章,你都得读三遍。

Everything Andre Karpathy writes, you have to read it three times.

Speaker 1

没错。

Yep.

Speaker 1

至少。

Minimum.

Speaker 0

是啊,他太厉害了。

Yeah, he's incredible.

Speaker 1

然后我想说的是,只要是那四家重要实验室——OpenAI、Gemini、Anthropic和XAI(显然是最顶尖的四家实验室)的任何动态。

And then I would say anytime at one of those labs, the four labs that matter, OpenAI, Gemini, Anthropic and XAI, which are clearly the four leading labs.

Speaker 1

每当这些实验室有人上播客节目,我都觉得必须认真听。

Anytime somebody from one of those labs goes on a podcast, I just think it's so important to listen.

Speaker 1

对我来说,AI的最佳用途之一就是能跟上所有这些信息。

For me, one of the best use cases of AI is to keep up with all of this.

Speaker 1

收听播客。

Listen to a podcast.

Speaker 1

然后如果我觉得某些部分有趣,就直接和AI讨论。

And then if there are parts that I thought were interesting, just talk about it with AI.

Speaker 1

我认为尽可能减少操作摩擦非常重要。

And I think it's really important to have as little friction as possible.

Speaker 1

我会主动提起这个话题。

I'll bring it up.

Speaker 1

我可以按这个按钮调出Grok,或者用这个。

I can either press this button and pull up Grok or I have this.

Speaker 1

这实在太棒了。

It's so amazing.

Speaker 0

你能相信我们拥有这个吗?

Can you believe we have this?

Speaker 1

我知道这就像,有人在X上说的,我们给这些石头施加了疯狂的咒语,现在我们可以通过手机在空中召唤超级智能精灵。

I know it's like, somebody said on X, you know, like we imbued these rocks with crazy spells, and now we could summon super intelligent genies on our phones over the air.

Speaker 1

你知道,这太疯狂了。

You know, it's crazy.

Speaker 0

疯狂。

Crazy.

Speaker 0

好吧。

Okay.

Speaker 0

所以,当Gemini三号发布时,公众的解读是,哦,这很有趣。

So, so something like Gemini three comes out with the public interpretation was, oh, this is interesting.

Speaker 0

它似乎提到了扩展定律和预训练内容。

It seems to say something about scaling laws and the pre training stuff.

Speaker 0

你对前沿模型整体进展的现状有什么看法?

What is your frame on like the state of general progress in frontier models in general?

Speaker 0

你最关注的是什么?

Like what are you watching most closely?

Speaker 1

是的,我认为Gemini三号非常重要,因为它向我们证明了预训练的缩放法则依然有效。

Yeah, well, do think Gemini three was very important because it showed us that scaling laws for pre training are intact.

Speaker 1

他们明确地阐述了这一点。

They stated that unequivocally.

Speaker 1

这很重要,因为地球上没人知道为什么预训练的缩放法则会起作用。

And that's important because no one on planet earth knows how or why scaling laws for pre training work.

Speaker 1

这实际上并不是一个法则。

It's actually not a law.

Speaker 1

这是一个经验观察,而且是我们极其精确测量并长期验证的经验观察。

It's an empirical observation and it's an empirical observation that we've measured extremely precisely and is held for a long time.

Speaker 1

但我们对预训练缩放规律的理解——可能这点会与20%的研究者存在分歧,但争议不会更大——某种程度上就像古英国人或古埃及人对太阳的认知。

But our understanding of scaling laws for pre training, and maybe this is a little bit controversial with 20% of researchers, but probably not more than that, is kind of like the ancient British people's understanding of the sun or the ancient Egyptians understanding of the sun.

Speaker 1

他们能如此精确地测量,以至于大金字塔的东西轴线与春秋分完美对齐,石器时代的建筑东西轴线也是如此。

They can measure it so precisely that the East West axis of the Great Pyramids are perfectly aligned with the equinoxes and so are the East West axis of stone age.

Speaker 1

完美的测量。

Perfect measurement.

Speaker 1

他们并不理解轨道力学。

They didn't understand orbital mechanics.

Speaker 1

他们完全不知道太阳为何东升西落,以及如何在地平线上移动。

They had no idea how or why it rose in the East, sat in the West and, you know, moved across the horizon.

Speaker 0

那些不了解这些的外星人在哪呢?

Where's the aliens that don't.

Speaker 1

是啊。

Yeah.

Speaker 1

是的。

Yeah.

Speaker 1

我们的,我们的,我们的神乘着战车。

Our, our, our God in a chariot.

Speaker 1

因此,每次我们得到这方面的确认都非常重要。

And so it's really important every time we get a confirmation of that.

Speaker 1

所以从这个角度来看,Gemini三号非常重要。

So Gemini three was very important in that way.

Speaker 1

但我想说的是,公众股权投资领域或更广泛的非专业群体可能对预训练扩展定律存在重大误解,实际上2024和2025年本不该有进展。

But I'd say, I think there's been a big misunderstanding of maybe in the public equity investing community or the broader more generalist community based on the scaling laws of pre training, there really should have been no progress in '24 and '25.

Speaker 1

原因在于,当XAI解决了20万个Hoppers芯片协同工作的问题后,必须等待下一代芯片问世。

And the reason for that is, is after XAI figured out how to get 200,000 hoppers coherent, you had to wait for the next generation of chips.

Speaker 1

因为你确实无法让超过20万个Hoppers保持协同运作。

Cause you really can't get more than 200,000 hoppers coherent.

Speaker 1

所谓协同,简单理解就是每块GPU都能感知其他GPU的运算状态。

And coherent just means you could just think of it as each GPU knows what every other GPU is thinking.

Speaker 1

它们某种程度上共享内存,通过扩展网络相互连接,在预训练过程中必须保持协同性。

They kind of are sharing memory, you know, they're connected, they scale up networks and scale out, and they have to be coherent during the pre training process.

Speaker 1

我们取得这些突破的原因,或许可以参考Arc AGI那张展示四年间智能水平从0%提升到8%的幻灯片。

The reason we've had all this progress, maybe we could show like the Arc AGI slide where you had zero to eight over four years, zero to 8% intelligence.

Speaker 1

而当OpenAI首个推理模型问世后,仅用三个月就从8%跃升至95%。

And then you went from 8% to 95% in three months when the first reasoning model came out from OpenAI.

Speaker 1

我们现在掌握了两条新的后训练扩展定律,其实就是带验证奖励的强化学习。

We have these two new scaling laws of post training, which is just reinforcement learning with verified rewards.

Speaker 1

在人工智能领域,'已验证'是一个极其重要的概念。

Verified is such an important concept in AI.

Speaker 1

就像Karpathy的一个伟大贡献是在软件方面。

Like one of Karpathy's great things was with software.

Speaker 1

任何你能明确指定的东西,都可以自动化。

Anything you can specify, can automate.

Speaker 1

借助AI,任何你能验证的事物,皆可实现自动化。

With AI, anything you can verify, you can automate.

Speaker 1

这是个非常重要的概念,我认为也是个关键区别。

It's such an important concept and I think an important distinction.

Speaker 1

然后是测试时间计算。

And then test time compute.

Speaker 1

因此我们取得的所有进展,自2024年10月至今的巨大进步,完全基于这两条新的扩展定律。

And so all the progress we've had, and we've had immense progress since October 24 through today, was based entirely on these two new scaling laws.

Speaker 1

而Gemini 3可以说是自Hopper问世以来对预训练扩展定律的首次验证,结果证明定律成立。

And Gemini three was arguably the first test since Hopper came out of the scaling law for pre training and it held.

Speaker 1

这很棒,因为所有这些扩展定律都是乘数效应的。

And that's great because all these scaling laws are multiplicative.

Speaker 1

所以现在我们要将这两种新的强化学习方法——带验证奖励和测试时计算——应用到更好的基础模型上。

So now we're going to apply these two new reinforcement learning with verified rewards and test time compute to much better base models.

Speaker 1

关于Gemini三号存在很多误解,我认为这非常重要。

There's a lot of misunderstanding about Gemini three that I think is really important.

Speaker 1

因此最重要的事情是要理解AI领域的一切都处于谷歌和英伟达的竞争之中。

So the most important thing to conceptualize everything in AI has a struggle between Google and Nvidia.

Speaker 1

谷歌拥有TPU,英伟达有他们的GPU,而谷歌只有TPU,他们使用一堆其他芯片做网络处理。

And Google has a TPU, and Nvidia has their GPUs and Google only has a TPU and they use a bunch of other chips for networking.

Speaker 1

要知道,英伟达拥有完整的堆栈。

Know, Nvidia has the full stack.

Speaker 1

Blackwell被推迟了。

Blackwell was delayed.

Speaker 1

Blackwell是英伟达的下一代芯片。

Blackwell was Nvidia's next generation chip.

Speaker 1

该产品的首个迭代版本是Blackwell 200。

The first iteration of that was the Blackwell 200.

Speaker 1

许多不同型号的产品都被取消了。

A lot of different SKUs were canceled.

Speaker 1

原因在于这是迄今为止我们在科技领域经历的最复杂的产品转型。

And the reason for that is it was by far the most complex product transition we've ever gone through in technology.

Speaker 1

从Hopper过渡到Blackwell,首先需要从风冷转为液冷。

Going from hopper to Blackwell, first you go from air cooled to liquid cooled.

Speaker 1

机架的重量从大约一千磅增加到了三千磅。

The rack goes from weighing round numbers, a thousand pounds to 3,000 pounds.

Speaker 1

功率从大约30千瓦(相当于30户美国家庭用电量)提升到130千瓦(相当于130户美国家庭用电量)。

It goes from round numbers, 30 kilowatts, which is 30 American homes to 130 kilowatts, which is 130 American homes.

Speaker 1

我打个比方:想象你要换新iPhone,却得先改造全屋电路为220伏、安装特斯拉Powerwall电池、配备发电机和太阳能板——这就是它的功耗需求,还得装全屋加湿系统并加固地板承重。

I analogize it to imagine if to get a new iPhone, you had to change all the outlets in your house to two twenty volt, put in a Tesla Powerwall, put in a generator, in solar panels, that's the power, you know, put in a whole home humidification system and then reinforce the floor because the floor can't handle this.

Speaker 1

因此这是一次巨大的产品迭代。

So it was a huge product transition.

Speaker 1

而且机架密度实在太高,他们很难把热量排出去。

And then just the rack was so dense, it was really hard for them to get the heat out.

Speaker 1

所以Blackwell真正开始大规模部署也就是过去三四个月的事。

So Blackwell's have only really started to be deployed in really scaled deployments over the last three or four months.

Speaker 0

你能解释为什么Blackwell的延迟如此重要吗?

Can you explain why it has been such an important thing that Blackwell was delayed?

Speaker 1

因为Blackwell太复杂了,要让这些极其精密的机架稳定运行对所有人来说都非常困难。

Because Blackwell is so complicated and it was so hard for everyone to get these exquisitely complex racks working consistently.

Speaker 1

如果没有推理能力的突破,从2024年年中到Gemini三代问世前,AI领域将不会有任何进展。

Had reasoning not come along, there would have been no AI progress from mid twenty twenty four through essentially Gemini three.

Speaker 1

根本不会有。

There would have been none.

Speaker 1

一切都会停滞不前。

Everything would have stalled.

Speaker 1

你能想象那对市场意味着什么吗?

And can you imagine what that would have meant to the markets?

Speaker 1

确实,我们本会生活在一个截然不同的环境中。

For sure, we would have lived in a very different environment.

Speaker 1

因此推理技术某种程度上填补了这十八个月的空白。

So reasoning kind of bridged this eighteen month gap.

Speaker 1

推理技术某种程度上拯救了AI,因为它让AI在缺乏Blackwell或下一代TPU的情况下仍能取得进展,而这些本是延续预训练规模法则所必需的。

Reasoning kind of saved AI because it let AI make progress without Blackwell or the next generation of TPU, which were necessary for the scaling laws for pre trading to continue.

Speaker 1

谷歌在2024年推出了TPU V6,2025年又发布了TPU V7。

Google came out with a TPU V six in 2024 and the TPU V seven in 2025.

Speaker 1

以半导体发展的时间尺度来看,想象一下Hopper就像二战时期的飞机。

In semiconductor time, imagine like Hopper, it's like a World War II era airplane.

Speaker 1

而且它绝对是当时最出色的二战时期飞机。

And it was by far the best World War II era airplane.

Speaker 1

就像是搭载梅林发动机的P-51野马战斗机。

It's a P 51 Mustang with a Merlin engine.

Speaker 1

而两年后的半导体时代,那就像是你已经拥有了F-4鬼怪战斗机。

And two years later in semiconductor time, that's like, you're an F four Phantom.

Speaker 1

明白吗?

Okay?

Speaker 1

因为Blackwell是一款极其复杂且难以快速投产的产品,谷歌当时是在用2024和2025年的TPU(相当于F4鬼怪式战机)训练Gemini 3。

Because Blackwell was such a complicated product and so hard to ramp, Google was training Gemini three on '24 and '25 era TPUs, which are like F4 Phantoms.

Speaker 1

Blackwell则像是F35战机。

Blackwell, it's like an F35.

Speaker 1

只是它花了很长时间才真正投入使用。

It just took a really long time to get it going.

Speaker 1

所以我认为从预训练的角度来看,谷歌目前确实拥有这种暂时优势。

So I think Google for sure has this temporary advantage right now from a pre training perspective.

Speaker 1

我认为同样重要的是,他们一直是生成token成本最低的生产者。

I think it's also important that they've been the lowest cost producer of tokens.

Speaker 1

这一点非常重要,因为在我作为科技投资者的职业生涯中,AI是第一次让成为低成本生产者真正具有意义。

And this is really important because AI is the first time in my career as a tech investor that being the low cost producers ever matter.

Speaker 1

苹果公司价值数万亿美元并不是因为他们是手机的低成本生产商。

Apple is not worth trillions because they're a low cost producer of phones.

Speaker 1

微软市值突破万亿并非因为他们是低成本的软件生产商。

Microsoft is not worth trillions because they're a low cost producer of software.

Speaker 1

英伟达市值突破万亿并非因为他们是低成本的AI加速器生产商。

NVIDIA is not worth trillions because they're the low cost producer of AI accelerators.

Speaker 1

这从来都不重要。

It's never mattered.

Speaker 1

这一点非常关键,因为谷歌作为低成本生产商一直在吸走AI生态系统的经济氧气,这对他们和任何低成本生产商来说都是极其理性的策略。

And this is really important because what Google has been doing as the low cost producer is they have been sucking the economic oxygen out of the AI ecosystem, which is an extremely rational strategy for them and for anyone who's the low cost producer.

Speaker 1

让我们给竞争对手制造巨大压力。

Let's make life really hard for our competitors.

Speaker 1

那么现在会发生什么?

So what happens now?

Speaker 1

我认为这将产生非常深远的影响。

I think this has pretty profound implications.

Speaker 1

第一,我们将在2026年初看到首批基于Blackwell训练的模型。

One, we will see the first models trained on Blackwell in early twenty twenty six.

Speaker 1

我认为首个Blackwell模型将出自xAI。

I think the first Blackwell model will come from xAI.

Speaker 1

原因很简单,根据黄仁勋的说法,没人比埃隆更快建成数据中心。

And the reason for that is just according to Jensen, no one builds data centers faster than Elon.

Speaker 1

黄仁勋公开说过这话。

Jensen has said this on the record.

Speaker 1

而且即便有了Blackwell芯片,还需要6到9个月才能让其性能达到Hopper的水平。

And even once you have the Blackwells, it takes six to nine months to get them performing at the level of Hopper.

Speaker 1

因为Hopper已经过精细调校。

Because Hopper's finely tuned.

Speaker 1

大家都熟悉它的使用方法。

Everybody knows how to use it.

Speaker 1

配套软件已臻完善。

The software's perfect for it.

Speaker 1

工程师们完全掌握其特性。

The engineers know all its quirks.

Speaker 1

现在大家都知道如何设计Hopper数据中心了。

Everybody knows how to architect a Hopper data center at this point.

Speaker 1

顺便说一句,Hopper刚推出时,花了六到十二个月时间才真正超越第四代的Ampere。

And by the way, when Hopper came out, it took six to twelve months for it to really outperform Ampere, which was generation four.

Speaker 1

所以如果你是NVIDIA数据中心,你需要尽可能快地在同一个数据中心部署尽可能多的GPU,形成一个连贯的集群,这样才能解决各种问题。

So if you're gin center NVIDIA, you need to get as many GPUs deployed in one data center as fast as possible in a coherent cluster so you can work out the bugs.

Speaker 1

这就是xAI为NVIDIA所做的实质贡献——他们建设数据中心最快,能最快大规模部署Blackwell,还能协助NVIDIA为其他用户解决各种技术问题。

And so this is what xAI effectively does for NVIDIA, because they build the data centers the fastest, they can deploy Blackwells at scale the fastest, and they can help work with NVIDIA to work out the bugs for everyone else.

Speaker 1

正因为他们速度最快,所以将率先推出Blackwell模型。

So because they're the fastest, they'll have the first Blackwell model.

Speaker 1

我们知道预训练的扩展法则依然有效,这意味着Blackwell模型将会非常出色。

We know that scaling laws for pre training are intact, And this means the Blackwell models are going to be amazing.

Speaker 1

Blackwell就像是...我的意思是,它并非F35对阵F4鬼怪战斗机那样的代差。

Blackwell is I mean, it's not an F35 versus an F4 Phantom.

Speaker 1

但从我的角度来看,这确实是一款更优秀的芯片。

But from my perspective, it is a better chip.

Speaker 1

要知道,这大概就像F35对阵拉斐尔战机。

You know, maybe it's like an F35 versus a Raphael.

Speaker 1

既然我们已经了解了预训练扩展法则,就知道这些Blackwell模型将会非常出色。

And so now that we know pre scaling laws we know that these Blackwell models are going to be really good.

Speaker 1

根据原始规格参数,它们的性能应该会更优越。

Based on the raw specs, they should probably be better.

Speaker 1

然后更重要的事情发生了。

And then something even more important happens.

Speaker 1

根据消息,GB200确实很难获得。

So the GB200 was really hard to get according.

Speaker 1

GB300是款非常出色的芯片。

The GB 300 is a great chip.

Speaker 1

它在各方面都与那些GB200机架完全兼容。

It is drop in compatible in every way with those GB 200 racks.

Speaker 1

现在你不会去替换那些GB200设备。

Now you're not gonna replace the GB two hundreds.

Speaker 0

是啊。

Yeah.

Speaker 0

但只是普通的Powerwalls。

But just any Powerwalls.

Speaker 1

对。

Yeah.

Speaker 1

没错。

Yeah.

Speaker 1

只要能处理这些的数据中心就行。

Just any data center that can handle those.

Speaker 1

你可以直接插入GB300。

You can slot in the GB three hundreds.

Speaker 1

现在大家都擅长制造这些机架了。

And now everybody's good at making those racks.

Speaker 1

你知道怎么散热。

You know how to get the heat out.

Speaker 1

你知道如何给它们降温。

You know how to cool them.

Speaker 1

你会把那些GB300装进去。

You're gonna put those GB300s in.

Speaker 1

而使用GB300的公司将成为代币的低成本生产者,尤其是如果你采用垂直整合模式的话。

And then the companies that use the GB300s, they're going to be the low cost producer of tokens, particularly if you're vertically integrated.

Speaker 1

如果你需要支付利润给别人来生产这些代币,那你很可能做不到低成本。

If you're paying a margin to someone else to make those tokens, you're probably not going to be.

Speaker 1

而且我认为这具有相当深远的影响。

And think this has pretty profound implications.

Speaker 1

我认为这必须改变谷歌的战略计算方式。

I think it has to change Google's strategic calculus.

Speaker 1

如果你拥有决定性成本优势,又是谷歌这样拥有搜索和其他所有业务的企业,为什么不以负30%的利润率运营AI呢?

If you have a decisive cost advantage and you're Google and you have Search and all these other businesses, why not run AI at a negative 30% margin?

Speaker 1

这显然是抽走行业经济氧气的理性决策。

It is by far the rational decision to take the economic oxygen out of the environment.

Speaker 1

你最终会让那些需要融资的竞争对手难以筹集所需资金,而你自己则无需担心。

You eventually make it hard for your competitors who need funding, unlike you, to raise the capital they need.

Speaker 1

而在另一方面,也许你能占据极其主导的市场份额。

And then on the other side of that, maybe you have an extremely dominant share position.

Speaker 1

一旦谷歌不再是低成本生产者,所有这些计算都将改变,我认为这将成为现实。

Well, all that calculus changes once Google is no longer the low cost producer, which I think will be the case.

Speaker 1

Blackwell芯片现在正被用于训练阶段。

The Blackwells are now being used for training.

Speaker 1

当模型训练完成后,你开始将Blackwell集群转向推理任务。

And then when that model is trained, you start shifting Blackwell clusters over to inference.

Speaker 1

然后所有这些成本计算和动态关系都会发生变化。

And then all these cost calculations and these dynamics change.

Speaker 1

这非常有趣,就像玩家之间的战略与经济博弈,我从未见过这样的局面。

It's very interesting, like during the strategic and economic calculations between the players, I've never seen anything like it.

Speaker 1

每个参与者都清楚自己在棋盘上的位置、争夺的奖品是什么,以及对手正在采取什么策略。

Everyone understands their position on the board, what the prize is, what play their opponents are running.

Speaker 1

观察这一过程确实非常有趣。

And it's really interesting to watch.

Speaker 1

如果谷歌改变策略,因为作为高成本生产商承受30%的负利润率将非常痛苦,这可能会开始影响其股价。

If Google changes its behavior, because it's gonna be really painful for them as a higher cost producer to run that negative 30% margin, it might start to impact their stock.

Speaker 1

这对人工智能的经济学具有相当深远的影响。

That has pretty profound implications for the economics of AI.

Speaker 1

而当鲁宾芯片面世时,这一差距将显著扩大。

And then when Rubin comes out, the gap is gonna expand significantly.

Speaker 1

与TPU相比如何?

Versus TPUs?

Speaker 1

与TPU及所有其他ASIC芯片相比。

Versus TPUs and all other ASICs.

Speaker 1

我认为Trainium第三代可能会相当不错,第四代也会表现良好。

Now I think Trainium three is probably gonna be pretty good and Trainium four are gonna be good.

Speaker 0

为什么会这样呢?

Why is that the case?

Speaker 0

为什么TPU第九代不会同样出色?

Why won't TPU v nine be every bit as good?

Speaker 1

有几个原因。

A couple of things.

Speaker 1

首先,无论出于何种原因,谷歌做出了更为保守的设计决策。

So one, for whatever reason, Google made more conservative design decisions.

Speaker 1

我认为部分原因是,以TPU为例,半导体设计分为前端和后端,还要与台积电打交道。

I think part of that is, so Google, let's say the TPU, so there's front end and back end of semiconductor design, and then there's dealing with Taiwan Semi.

Speaker 1

设计ASIC有很多种方式。

And you can make an ASIC in a lot of ways.

Speaker 1

谷歌的做法是他们主要负责TPU的前端设计。

What Google does is they do mostly the front end for the TPU.

Speaker 1

然后由博通处理后端设计并管理台积电等相关事务。

And then Broadcom does the back end and manages Taiwan Semi and everything.

Speaker 1

这个比喻可能不太准确,但前端就像是房子的建筑师。

It's a crude analogy, but the front end is like the architect of a house.

Speaker 1

他们负责设计房子。

They design the house.

Speaker 1

后端则是建造房子的人。

The back end is the person who builds the house.

Speaker 1

而台湾半导体就像莱纳公司那样批量生产这些房子。

And the Imagine Taiwan Semi is like stamping out that house like Lennar or, you know, D.

Speaker 1

R.

R.

Speaker 1

霍顿。

Horton.

Speaker 1

而完成后面这两部分工作,博通能获得50%到55%的毛利率。

And for doing those two latter parts, Broadcom earns a 50 to 55 gross margin.

Speaker 1

我们不清楚TPU的具体情况。

We don't know what on TPUs.

Speaker 1

假设到2027年,TPU的规模,我们估计可能达到300亿左右,谁知道呢?

Let's say in 2027, TPU, I think it sets us estimates maybe somewhere around 30,000,000,000 again, who knows?

Speaker 1

300亿美元,我认为是个合理的预估。

30,000,000,000, I think is a reasonable estimate.

Speaker 1

50%至55%的毛利率。

50 to 55% gross margins.

Speaker 1

所以谷歌要支付博通150亿美元,这可是一大笔钱。

So Google is paying Broadcom $15,000,000,000 That's a lot of money.

Speaker 1

从某个节点来看,将半导体项目完全内部化是明智之举。

At a certain point, it makes sense to bring a semiconductor program entirely in house.

Speaker 1

换句话说,苹果的芯片没有ASIC合作伙伴。

So in other words, Apple does not have an ASIC partner for their chips.

Speaker 1

他们自行完成前端设计、后端制造,并管理台积电的生产。

They do the front end themselves, the back end, and they manage Taiwan Semiconductor.

Speaker 1

原因在于他们不愿支付那50%的利润差价。

And the reason is they don't want pay that 50% margin.

Speaker 1

因此在特定阶段,重新协商这件事就变得合乎情理了。

So at a certain point, it becomes rational to renegotiate this.

Speaker 1

从另一个角度看,博通半导体部门的全部运营支出大约为50亿美元。

And just as perspective, the entire OpEx of Broadcom semiconductor division is round numbers, 5,000,000,000.

Speaker 1

所以这在经济上是合理的。

So it'd be economically rational.

Speaker 1

现在谷歌支付的是300亿,而我们支付给他们150亿。

Now that Google's paying, if it's 30,000,000,000, we're paying them 15.

Speaker 1

谷歌可以给博通半导体的每位员工双倍薪酬,还能额外多赚50亿。

Google can go to every person who works in Broadcom semi, double their comp and make an extra 5,000,000,000.

Speaker 1

假设到2028年业务规模达到500亿。

In 2028, let's just say it does 50,000,000,000.

Speaker 1

现在就是250亿了。

Now it's 25,000,000,000.

Speaker 1

你甚至可以给他们三倍薪酬。

You could triple their comp.

Speaker 1

顺便说一句,你并不需要他们所有人。

And by the way, you don't need them all.

Speaker 1

是的。

Yeah.

Speaker 1

当然他们不会这么做,因为存在竞争顾虑。

And of course they're not gonna do that because of competitive concerns.

Speaker 1

但随着TPU V8和V9的推出,这一切开始产生影响,因为谷歌正在引入联发科技术。

But with TPU V eight and V nine, all of this is beginning to have an impact because Google is bringing in media tech.

Speaker 1

这可能是你们向博通发出的第一个警告信号。

This is maybe the first way you send a warning shot to Broadcom.

Speaker 1

我们对支付这么多钱确实很不满意。

We're really not happy about all this money we're paying.

Speaker 1

但他们确实引入了联发科,而且台湾的ASIC公司毛利率要低得多。

But they did bring MediaTek in and the Taiwanese ASIC companies have much lower gross margins.

Speaker 1

所以这算是第一记警告。

So this is kind of the first shot against the bow.

Speaker 1

还有人们常说的那些话,但博通拥有最好的SerDes技术。

And then there's all this stuff people say, but Broadcom has the best SerDes.

Speaker 1

博通的SerDes技术确实非常出色,而SerDes是一项极其基础的技术,因为它决定了芯片之间如何通信。

Broadcom has really good SerDes and SerDes is like an extremely foundational technology because it's how the chips communicate with each other.

Speaker 1

你必须进行序列化和反序列化。

You have to serialize and deserialize.

Speaker 1

但世界上还有其他优秀的SerDes供应商。

But there are other good SerDes providers in the world.

Speaker 1

一个真正优秀的SerDes技术可能价值100到150亿美元一年,但实际价值可能在250亿美元左右。

A really good SerDes is maybe it's worth 10 or 15,000,000,000 a year, but it's probably about worth 25,000,000,000 a year.

Speaker 1

由于这种摩擦,再加上我认为谷歌方面做出的保守设计选择——也许他们采取这种保守设计的原因是采用了双轨供应策略。

So because of that friction and I think conservative design choices on the part of Google, and maybe the reason they made those conservative design choices is because they were going to a bifurcated supply.

Speaker 1

TPU的发展正在放缓,而GPU正在加速发展。

TPU is slowing down, I would say, as the GPUs are accelerating.

Speaker 1

这是Lisa和Jensen对'我们要自主研发ASIC'言论的第一个竞争性回应:'我们只会加速发展'。

This is the first competitive response of Lisa and Jensen to everybody saying we're gonna have our own ASIC is, hey, we're just gonna accelerate.

Speaker 1

我们将每年推出一款GPU,你们根本跟不上我们的步伐。

We're gonna do a GPU every year and you cannot keep up with us.

Speaker 1

然后我觉得大家都在学的是,哇,这太酷了。

And then I think what everybody is learning is like, oh wow, that's so cool.

Speaker 1

你们做了自己的加速器,有了ASIC芯片。

You made your own accelerator, has an ASIC.

Speaker 1

哇,网卡会是什么样?

Wow, what's the NIC gonna be?

Speaker 1

CPU会是什么样?

What's the CPU gonna be?

Speaker 1

扩展交换机将是什么样?

What's the scale up switch gonna be?

Speaker 1

扩展协议是什么?

What's the scale up protocol?

Speaker 1

分布式交换机是什么?

What's the scale out switch?

Speaker 1

你们打算用什么类型的光学器件?

What kind of optics are you gonna use?

Speaker 1

要让这一切协同工作的软件是什么?

What's the software that's gonna make all this work together?

Speaker 1

然后就会突然意识到:糟糕,我做了这么个小芯片——不管承认与否,GPU制造商肯定不喜欢客户自己做ASIC来和他们竞争。

And then it's like, oh shit, I made this tiny little chip and you know, like whether it's admitted or not, I'm sure the GPU makers don't love it when their customers make ASICs and try and compete with them.

Speaker 1

就像在说:哎呀,我干了什么?

And like, whoops, what did I do?

Speaker 1

我以为这很简单。

I thought this was easy.

Speaker 1

你怎么知道?

How do you know?

Speaker 1

要做出好芯片至少需要三代迭代。

It takes at least three generations to make a good chip.

Speaker 1

就像TPU第一代,能造出来已经是成就了。

Like the TPU v1, I mean, it was an achievement and that they made it.

Speaker 1

直到TPU第三代甚至第四代,TPU才开始稍微有点竞争力。

It was really not till TPU v3 or v4 that the TPU started to become like even vaguely competitive.

Speaker 0

这是不是就是典型的边做边学的过程?

Is that just a classic like learning by doing thing?

Speaker 1

百分之百是的。

A 100%.

Speaker 1

而且据我所知,即便你已经组建了团队,目前半导体公司里最好的ASIC团队其实是亚马逊的ASIC团队。

And even if you've made, from my perspective, the best ASIC team at any semiconductor company is actually the Amazon ASIC team.

Speaker 1

他们是第一个研发出Graviton处理器的团队。

They're the first one to make the Graviton CPU.

Speaker 1

他们有这个Nitro。

They have this Nitro.

Speaker 1

它叫Supernic。

It's called Supernic.

Speaker 1

他们一直极具创新力,非常聪明。

They've been extremely innovative, really clever.

Speaker 1

像Trainium和Infantry one,可能比TPUV1稍好一些,但只是稍好一点。

And like Trainium and Infantry one, maybe they're a little better than the TPUV1, but only a little.

展开剩余字幕(还有 480 条)
Speaker 1

Trainium二代,性能会稍好一些。

Trainium two, you get a little better.

Speaker 1

Trainium三代,我认为是第一次达到还不错的水平。

Trainium three, it's, I think the first time it's like, okay.

Speaker 1

然后,我觉得Trainium四代可能会很不错。

And then, you know, I think Trainium four will probably be good.

Speaker 1

如果除了Trainium和TPU之外还有很多ASIC芯片,我会感到惊讶。

I will be surprised if there are a lot of ASICs other than Trainium and TPU.

Speaker 1

顺便说一句,Trainium和TPU最终都会在客户自有的设备上运行。

And by the way, Trainium and TPU will both run on customer owned tooling at some point.

Speaker 1

我们可以争论何时会发生,但刚才描述的成功经济学意味着这是不可避免的。

We can debate when that will happen, but the economics of success that I just described mean it's inevitable.

Speaker 1

无论公司怎么说,纯粹的经济学原理和第一性原理都表明这绝对不可避免。

Like no matter what the companies say, just the economics make it and reasoning from first principles make it absolutely inevitable.

Speaker 1

如果我

If I were

Speaker 0

从宏观角度来看这一切,我发现这些细节令人难以置信地有趣,这就像是有史以来最宏大的游戏。

to zoom all the way out on this stuff, I find these details unbelievably interesting and it's like the grandest game that's ever been played.

Speaker 0

这太疯狂了,追踪这些发展太有趣了。

It's so crazy and so fun to follow.

Speaker 0

有时我会忘记退一步思考,然后问自己:那又怎样?

Sometimes I forget to zoom out and say, well, so what?

Speaker 0

比如,好吧,那么预测三代之后的情况,超越鲁本或什么的。

Like, okay, so project this forward three generations past Ruben or whatever.

Speaker 0

所有这些疯狂发展——我们不断降低预训练扩展模型的损失——给全人类带来的红利究竟是什么?

What is like the global human dividend of all this crazy development where we keep making the loss lower on these pre training scaling models?

Speaker 0

谁在乎呢?

Like who cares?

Speaker 0

我已经有段时间没向这个东西提问了,每次得到的答案都让我个人感到震撼。

Like it's been a while since I've asked this thing something that I wasn't kind of blown away by the answer for me personally.

Speaker 0

这场疯狂的基础设施竞赛将让我们解锁接下来的哪些新事物,因为它们如此成功?

What are the next couple of things that all this crazy infrastructure war allows us to unlock because they're so successful?

Speaker 1

如果要我预测发展路径,我认为Blackwell模型将会令人惊叹。

If I were to posit like an EventPath, I think the Blackwell models are going to be amazing.

Speaker 1

GB300带来的每令牌成本大幅降低,可能更多是MI4.50而非MI3.55的功劳,这将使这些模型能够进行更长时间的思考,意味着它们将能完成新任务。

The dramatic reduction in per token cost enabled by the GB 300 and probably more the MI $4.50 than the MI $3.55 will lead to these models being allowed to think for much longer, which means they're going to be able to do new things.

Speaker 1

这让我印象深刻。

I was very impressed.

Speaker 1

GMI三代帮我订了餐厅。

GMI three made me a restaurant reservation.

Speaker 1

这是它第一次为我做实事。

It's the first time it's done something for me.

Speaker 1

我是说,除了检索信息和教我知识之外。

And I mean, other than like, go research something and teach me stuff.

Speaker 1

如果能订餐厅,那离订酒店、机票和叫Uber也不远了。

If you can make a restaurant reservation, you're not that far from being able to make a hotel reservation and an airplane reservation and order me an Uber.

Speaker 1

突然间

And all of a

Speaker 0

突然间你就有了一个助手。

sudden you got an assistant.

Speaker 1

是啊。

Yeah.

Speaker 1

你可以想象大家都在讨论这个,但你可以想象它就在你的手机上。

And you could just imagine everybody talks about that, but you can just imagine it's on your phone.

Speaker 1

我认为这在近期就能实现,但一些技术领先的大公司,超过50%的客户服务已经由AI完成,这是一个价值4000亿美元的行业。

I think that's pretty near term, but some big companies that are very tech forward, 50% plus of customer support is already done by AI and that's a $400,000,000,000 industry.

Speaker 1

要知道,AI最擅长的是说服,这正是销售和客户支持的核心能力。

And then if, you know, what AI is great about is persuasion, that's sales and customer support.

Speaker 1

因此从企业职能来看,无非就是生产产品、销售产品以及服务客户这几项。

And so of the functions of a company, if you think about them, they're to make stuff, sell stuff, and then support the customers.

Speaker 1

所以到2026年底左右,AI很可能已经能出色胜任其中两项职能了。

So right now, maybe in late twenty six, you're gonna be pretty good at two of them.

Speaker 1

我确实认为AI将对媒体行业产生重大影响。

I do think it's gonna have a big impact on media.

Speaker 1

就像我之前提到的机器人技术,终于要开始真正落地应用了。

Like I think robotics, you know, talked about the last time are gonna finally start to be real.

Speaker 1

要知道,现在涌现了大量令人兴奋的机器人初创企业。

You know, there's an explosion and kind of exciting robotics startups.

Speaker 1

我仍然认为主战场将在特斯拉的乐观主义者与中国企业之间,因为你知道,制造原型很容易。

I do still think that the main battle's gonna be between Tesla's optimist and the Chinese, because you know, it's easy to make prototypes.

Speaker 1

但要大规模生产却很困难。

It's hard to mass produce them.

Speaker 1

但这又回到了Andre Carpathi所说的观点:AI可以自动化任何可被验证的事情。

But then it goes back to that, what Andre Carpathi said about AI can automate anything that can be verified.

Speaker 1

所以任何存在对错答案或结果的功能领域,你都可以应用强化学习让AI在这方面变得非常出色。

So any function where there's a right or wrong answer or a right or wrong outcome, you can apply reinforcement learning and make the AI really good at that.

Speaker 1

有哪些

What are

Speaker 0

你最喜欢的例子

your favorite examples of that

Speaker 1

是迄今为止还是理论上的?

so far or theoretically?

Speaker 1

我是说,模型能平衡吗?

I mean, just does the model balance?

Speaker 1

它们会非常擅长建模。

They'll be really good at making models.

Speaker 1

全球所有账本都能对得上吗?

Do all the books globally reconcile.

Speaker 1

它们会非常擅长会计、复式记账,而且必须平衡。

They'll be really good at accounting, double entry bookkeeping, and has to balance.

Speaker 1

这是可以验证对错的。

There's a verifiable you got it right or wrong.

Speaker 1

支持还是销售。

Support or sale.

Speaker 1

你完成销售了吗?

Did you make the sale or not?

Speaker 1

这就像AlphaGo一样。

That's just like AlphaGo.

Speaker 1

你赢了还是输了?

Did you win or you lose?

Speaker 1

那个人转化了吗?

Did the guy convert or not?

Speaker 1

客户在客服过程中要求升级了吗?

Did the customer ask for an escalation during customer support or not?

Speaker 1

其最重要的功能之所以重要,是因为它们可以被验证。

It's most important functions are important because they can be verified.

Speaker 1

所以我认为如果这一切开始发生并在2026年实现,Blackwell将会产生投资回报,然后这一切将继续推进。

So I think if all of this starts to happen and starts to happen in '26, there'll be an ROI on Blackwell, and then all this will continue.

Speaker 1

然后我们会有鲁本,那将是另一个巨大的自旋量子——鲁本与MI450以及TPU V9。

And then we'll have Ruben and then that'll be another big quantum of spin Ruben and the MI four fifty and the TPU V nine.

Speaker 1

然后我认为最有趣的问题是:人工超级智能的经济回报是什么?

And then I do think just the most interesting question is what are the economic returns to artificial super intelligence?

Speaker 1

因为在这场伟大博弈中的所有公司,都陷入了囚徒困境。

Because all of these companies in this great game, they've been in a prisoner's dilemma.

Speaker 1

他们害怕一旦减速就会永远失去机会。

They're terrified that if they slow down Gone forever.

Speaker 1

而如果竞争对手没有减速,这就是生存危机。

And their competitors don't, it's an existential risk.

Speaker 1

你知道,微软今年早些时候有大约六周时间动摇了。

And you know, Microsoft blinked for like six weeks earlier this year.

Speaker 1

是啊。

Yeah.

Speaker 1

我想他们会说对此感到后悔。

I think they would say they regret that.

Speaker 1

但有了Blackwell,尤其是Rubin,从决策和投入角度看,经济规模将完全主导囚徒困境,因为数字实在太庞大了。

But with Blackwell, and for sure with Rubin, the economics are going to dominate the prisoner's dilemma from a decision making and spending perspective, just because the numbers are so big.

Speaker 1

这又回到了关于AI投资回报率的问题。

And this goes to kind of the ROI on AI question.

Speaker 1

人工智能的投资回报率在实证、事实和明确性上都是正向的。

And the ROI on AI has empirically, factually, unambiguously been positive.

Speaker 1

我一直觉得奇怪,这有什么可争论的,因为购买GPU的最大开销方都是上市公司。

I just always find it strange that there's any debate about this because the largest spendages on GPUs are public companies.

Speaker 1

他们会公布一种叫经审计的季度财报的东西。

They report something called audited quarterly financials.

Speaker 1

你可以用这些数据计算所谓的资本回报率。

And you can use those things to calculate something called a return on invested capital.

Speaker 1

如果进行这种计算,就会发现这些大公司在GPU上的资本回报率比增加投入前还要高。

And if you do that calculation, the ROIC of the big public spenders on GPUs is higher than it was before they ramped spending.

Speaker 1

你可能会说,这其中部分是运营成本节省带来的。

Then you could say, well, part of that is, you know, OpEx savings.

Speaker 1

某种程度上,这正是你期望从AI获得的投资回报。

Well, at some level that is part of what you expect the ROI to be from AI.

Speaker 1

然后你会发现,很大部分其实只是通过TPU应用,把支撑广告和推荐系统的大型推荐引擎从CPU迁移到了GPU上。

And then you say, well, a lot of it is actually just applying TPUs, moving the big recommender systems that power the advertising and the recommendation systems from CPUs to GPUs.

Speaker 1

而且你们已经获得了巨大的效率提升。

And you've had massive efficiency gains.

Speaker 1

这就是为什么这些公司的收入增长都加速了。

And that's why all the revenue growth at these companies has accelerated.

Speaker 1

但这又怎样呢?

But like, so what?

Speaker 1

投资回报率已经摆在那里了。

The ROI has been there.

Speaker 1

这确实很有趣。

And it is interesting.

Speaker 1

就像所有大型互联网公司一样,负责营收的人对分配给研究人员的GPU数量感到非常恼火。

Like every big internet company, the people who are responsible for the revenue are intensely annoyed at the amount of GPUs that are being given to the researchers.

Speaker 1

这是个非常简单的等式。

It's a very linear equation.

Speaker 1

如果你给我更多GPU,我就能创造更多收入。

If you give me more GPUs, I will drive more revenue.

Speaker 1

是啊。

Yeah.

Speaker 1

把那些GPU也给我,我们就能创造更多收入、更多毛利,然后赚到大钱。

Give me even those GPUs, we'll have more revenue, more gross profit, and then we get good money.

Speaker 1

所以这是每家公司在持续面临的斗争。

So it's this constant fight at every company.

Speaker 1

囚徒困境的一个因素是,每个人都怀着这种宗教信仰般的信念,认为我们终将实现人工超级智能。

One of the factors in the prisoner's dilemma is everybody has this like religious belief that we're going to get to ASI.

Speaker 1

说到底,他们真正想要的是什么?

And at the end of the day, what do they all want?

Speaker 1

几乎所有人都想永生不死。

Almost all of them want to live forever.

Speaker 1

他们认为人工超级智能能帮他们实现这个目标。

And they think that ASI is going to help them with that.

Speaker 0

没错。

Right.

Speaker 0

不错的回报。

Good return.

Speaker 1

这是个不错的回报,但我们并不确定。

That's a good return, But we don't know.

Speaker 1

而且如果作为人类,我们已经突破了物理、生物和化学的界限——这些支配宇宙的自然法则——那么或许ASI带来的经济回报并没有那么高。

And if as humans, have pushed the boundaries of physics, biology, and chemistry, the natural laws that govern the universe, then maybe the economic returns to ASI aren't that high.

Speaker 0

我很好奇你有时会想到的那些泼冷水的观点。

I'm very curious about your favorite sort of throw cold water on this stuff type takes that you think about sometimes.

Speaker 0

其中一个可能是——我很好奇你的看法——哪些因素会导致对计算需求的变化,甚至改变其发展趋势。

One would be like the things that would cause, I'm curious what you think, the things that would cause this demand for compute to change or even the trajectory of it to change.

Speaker 0

有一个非常明显的看跌因素,那就是

There's one really obvious bear case and it is just

Speaker 1

边缘AI。

edge AI.

Speaker 1

而这与ASI的经济回报息息相关。

And it's connected to the economic returns to ASI.

Speaker 1

三年后,在更大更笨重的手机上,为了容纳所需容量的DRAM,电池续航可能也会缩短,你将能够运行精简版的Gemini五或Grok四、Grok 4.1或ChatGPT,每秒30到60个token。

In three years, on a bigger and bulkier phone to fit the amount of DRAM necessary, you know, and the battery won't probably last as long, you will be able to probably run like a pruned down version of something like Gemini five or Grok four, Grok 4.1 or ChatGPT at 30, 60 tokens per second.

Speaker 1

而且这还是免费的。

And then that's free.

Speaker 1

这显然是苹果的战略。

And this is clearly Apple's strategy.

Speaker 1

我们就是要成为AI的分发商,保证隐私安全,让AI在手机上运行。

It's just we're gonna be a distributor of AI and we're gonna make it privacy safe and run on the phone.

Speaker 1

然后你可以随时调用云端的大型模型,无论你有什么问题。

And then you can call one of the big models, you know, the the god models in the cloud, whatever you have a question.

Speaker 1

如果这种情况发生,如果每秒30到60个token、115的智商水平就够用了,我认为那...

And if that happens, if like thirty, sixty tokens a second at a one fifteen IQ is good enough, I think that's

Speaker 0

是个熊市案例。

a bear case.

Speaker 1

除了规模定律失效的情况。

Other than just the scaling laws break.

Speaker 1

但就假设缩放定律持续有效而言,我们现在知道它们至少还会延续一代预训练阶段,而在后训练、中期训练、RLVR(无论人们怎么称呼)以及测试时计算推理这两条新缩放定律方面,我们还处于非常早期的阶段。

But in terms of if we assume scaling laws continue and we now know they're going to continue for pre training for at least one more generation, and we're very early in the two new scaling laws for post training, mid training, RLVR, whatever people want call it, and then test time computed inference, we're so early in those and we're getting so much better at helping the models hold more and more context in their minds as they do this test time compute.

Speaker 1

这其实非常强大,因为大家都会问:模型要怎么知道这些?

And then that's really powerful because everybody's like, well, how's the model going to know this?

Speaker 1

最终你会感觉能掌握足够的上下文信息。

Well, eventually you can feel you can hold enough context.

Speaker 1

你可以把公司里每一条Slack消息、Outlook邮件和企业手册都纳入上下文。

You can just hold every Slack message and outlook message and company manual in a company in your context.

Speaker 1

然后你就能计算新任务,并将其与你对世界的认知、你的想法、模型的判断等所有上下文进行比较。

And then you can compute the new task and compare it with your knowledge of the world, what you think, what the model thinks, all this context.

Speaker 1

可能超长的上下文窗口就是解决当前诸多限制的关键方案。

And it may be that like just really, really long context windows are the solution to a lot of the current limitations.

Speaker 1

这得益于KV缓存卸载之类的巧妙技术实现。

And that's enabled by all these cool tricks, like KV cash offload and stuff.

Speaker 1

但我认为,除了缩放定律放缓或超级智能经济回报率低之外,边缘AI对我来说是目前最可信也最可怕的利空情景。

But I do think other than scaling laws slowing down, other than there being low economic returns to ASI, edge AI is to me by far the most plausible and scariest bear case.

Speaker 0

我喜欢用不同的S曲线来可视化,就像你通过iPhone投资所经历的那样。

I like to visualize like different S curves you invested through the iPhone.

Speaker 0

我特别喜欢看iPhone型号的演变过程,从最初笨重的砖块模样,逐步发展到如今的样子——每一代都只是稍作改进,显然在外形设计上我们已经达到了一个瓶颈。

And I love to like see the visual of the iPhone models as it sort of went from this clunky bricky thing up to the what we have now where like each one's like a little bit, you know, obviously we've sort of petered out on its form factor.

Speaker 0

如果你为前沿模型设想类似的演进路径,是否感觉它们正处于技术范式发展的某个特定阶段?

If you picture something similar for the Frontier models themselves, does it feel like it's at a certain part of that natural technology paradigm progression?

Speaker 1

如果你付费使用Gemini Ultra或Super Grok这些顶级AI服务,其实很难察觉到它们之间的差异。

If you're paying for Gemini Ultra or Super Grok and you're getting the good AI, it's hard to see differences.

Speaker 1

除非我深入研究某些专业问题,比如:你认为PCI Express和以太网协议哪个更适合扩展网络规模?为什么?

Like I have to go really deep on something like, do you think PCI Express or Ethernet is a better protocol for scale up networking and why?

Speaker 1

给我看看相关科学论文。

Show me the scientific papers.

Speaker 1

当你在不同模型间切换,并提出这种你非常熟悉的深度问题时,才能真正看出差异。

And if you shift between models and you ask a question like that where you know it really deeply, then you see differences.

Speaker 1

我确实玩梦幻橄榄球。

I do play fantasy football.

Speaker 1

奖金都捐给了慈善机构,但你知道吗,这些新模型在帮助我决定该派谁上场方面表现得相当不错。

Winnings are donated to charity, but it is like, you know, these new models are quite a bit better at helping who should I play?

Speaker 1

它们的思考方式要复杂得多。

They think in much more sophisticated ways.

Speaker 1

如果你是个历史上表现不错的梦幻足球玩家,但这个赛季表现不佳,这就是原因,因为你没用它。

If you're a historically good fantasy football player and you're having a bad season, this is why, this is why, because you're not using it.

Speaker 1

是啊。

Yeah.

Speaker 1

你懂吗?

You know?

Speaker 1

我认为我们会在越来越多的领域看到这种情况,但我确实认为它们已经达到了一定水平,除非你是真正的专家或拥有超乎想象的智力,否则很难看出进步。

And I think we'll see that in more and more domains, but I do think they're already at a level where unless you are a true expert or just have an intellect that is beyond mind, it's hard to see the progress.

Speaker 1

这就是为什么我认为我们需要从追求更智能转向更有用。

And that's why I do think we need to shift from getting more intelligent to more useful.

Speaker 1

除非更高的智能开始带来重大科学突破,比如我们在2026、2027年攻克癌症。

Unless more intelligence starts leading to these massive scientific breakthroughs and we're curing cancer in '26 and '27.

Speaker 1

是啊。

Yeah.

Speaker 1

我不认为我们即将攻克癌症,但从投资回报率曲线来看,我认为我们需要从追求智能转向追求实用性。

I don't know that we're going be curing cancer, but I do think from almost an ROI curve, we need to kind of hand off from intelligence to usefulness.

Speaker 1

然后实用性将不得不接力给科学突破,从而创造全新的产业。

And then usefulness will then have to hand off to scientific breakthrough just that creates whole new industries.

Speaker 0

在你看来,实用性的基础要素是什么?

What are the building blocks of usefulness in your mind?

Speaker 1

就是能够持续可靠地完成任务。

Just being able to do things consistently and reliably.

Speaker 1

这其中很大程度上是要保持所有上下文。

And a lot of that is keeping all the context.

Speaker 1

比如存在大量的上下文信息。

Like there's a lot of context.

Speaker 1

如果有人要为我规划行程,你知道,我已经形成了一些特别的偏好。

If someone wants to plan a trip for me, like, you know, I've acquired these strange preferences.

Speaker 1

比如我关注那个叫安德鲁·休伯曼的人。

Like I follow that guy, Andrew Huberman.

Speaker 1

所以我喜欢有朝东的阳台,这样我就能享受晨光,明白吗?

So I like to have an east facing balcony so I can get morning sun, you know?

Speaker 1

所以AI必须记住,这是我喜欢乘坐飞机的方式。

So the AI has to remember, here's how I like to fly.

Speaker 1

这些是我在那方面的偏好。

Here are my preferences for that.

Speaker 1

对我来说,乘坐配备星链网络的飞机很重要。

Being on a plane with Starlink is important to me.

Speaker 1

好的。

Okay.

Speaker 1

这些是我历来喜欢的度假村。

Here are the resorts I've historically liked.

Speaker 1

这些是我喜欢的区域类型。

Here are the kinds of areas I've liked.

Speaker 1

这些是我在每个地方真正想住的房间。

Here are the rooms that I would really like at each.

Speaker 1

这需要很多上下文信息。

That's a lot of context.

Speaker 1

要记住并权衡所有这些因素,这是个难题。

And to keep all of that and kind of weight those, it's a hard problem.

Speaker 1

所以我认为上下文窗口是其中很重要的一部分。

So I think context windows are a big part of it.

Speaker 1

你知道,有个叫meter task的评估体系。

You know, there's this meter task evaluation thing.

Speaker 0

它能持续多久

How long it can

Speaker 1

能持续工作多久。

work How long it can work for.

Speaker 1

你可以认为这与上下文有一定关联,虽不精确,但任务长度需要不断扩展,因为预订餐厅虽然具有经济价值,但价值并不那么大。

And you could think of that as being related in some way to context, not precisely, but that just task length needs to keep expanding because booking a restaurant and booking is economically useful, but it's not that economically useful.

Speaker 1

但为我安排整个假期,并了解我父母、姐姐、侄女和侄子的偏好,那是个困难得多的问题。

But booking me an entire vacation and knowing the preferences of my parents, my sister, my niece, and my nephew, that's a much harder problem.

Speaker 1

这种事情人类可能要花三四个小时来优化。

And that's something that like a human might spend three or four hours on optimizing that.

Speaker 1

如果你能做到这一点,那就太棒了。

And then if you can do that, that's amazing.

Speaker 1

但话说回来,我认为它必须很快在销售和客户支持方面表现出色。

But then again, I just think it has to be good at sales and customer support relatively soon.

Speaker 1

在那之后,它必须进入,我认为它已经在这里了。

And then after that, it has to be in, I think it is already here.

Speaker 1

我确实认为我们将看到各类产品卓越性的加速提升。

I do think we're going to see an kind of an acceleration in the awesomeness of various products.

Speaker 1

工程师们正在利用人工智能让产品变得更好、开发得更快。

Engineers are using AI to make products better and faster.

Speaker 0

我们都投资了Fortell这家100%公司,它简直令人惊叹。

We both invested in Fortell, the 100% company, which is just absolutely remarkable.

Speaker 0

就像我从未想过的事情。

Like something I never would've thought of.

Speaker 1

我认为我们将在每个垂直领域都看到类似的情况,那就是AI被用于任何公司最核心的功能——产品设计。

And we're gonna see, I think something like that in every vertical and that's AI being used for the most core function of any company, which is designing the product.

Speaker 1

然后,已经有很多例子显示AI被用于帮助制造产品和更高效地分销,无论是优化供应链,还是通过视觉系统监控生产线。

And then it will be, you know, there's already lots of examples of AI being used to help manufacture the product and distribute it more efficiently, whether it's optimizing a supply chain, having a vision system, watch a production line.

Speaker 1

很多事情正在发生。

A lot of stuff is happening.

Speaker 1

我认为在整个投资回报率方面真正有趣的是,财富500强企业总是最后采用新技术的。

The other thing I think is really interesting in this whole ROI part is Fortune 500 companies are always the last to adopt a new technology.

Speaker 1

它们很保守。

They're conservative.

Speaker 1

它们有很多法规限制,很多律师。

They have lots of regulations, lots of lawyers.

Speaker 1

初创企业总是第一个吃螃蟹的。

Startups are always the first.

Speaker 1

让我们想想云计算,这是企业领域上一次真正具有变革性的新技术。

So let's think about the cloud, which was the last truly transformative new technology for enterprises.

Speaker 1

能够将所有计算资源和云服务结合使用SaaS。

Being able to have all of your compute and the cloud and use SaaS.

Speaker 1

所以它总是保持最新状态。

So it's always upgraded.

Speaker 1

它总是很棒,诸如此类。

It's always great, etcetera, etcetera.

Speaker 1

你可以在任何设备上使用它。

You can get it on every device.

Speaker 1

我记得第一届AWS re:Invent大会应该是在2013年。

I think the first AWS re:Invent, I think it was in 2013.

Speaker 1

而到了2014年,地球上所有初创公司都在云上运行了。

And by 2014, every startup on planet earth ran on the cloud.

Speaker 1

那种自己购买服务器、存储设备和路由器的想法变得很荒谬。

The idea that you would buy your own server and storage box and router was ridiculous.

Speaker 1

这种情况可能发生得更早,甚至在首届Reinvent大会之前就已经出现了。

And that probably happened like even earlier that that had probably already happened before the first Reinvent.

Speaker 1

第一批大型财富500强企业开始将其标准化,大约是在五年之后。

The first big fortune 500 companies started to standardize on it, like maybe five years later.

Speaker 1

在AI领域你也能看到这种趋势。

You see that with AI.

Speaker 1

我相信你在创业公司中已经见识过这一点了。

I'm sure you've seen this in your startups.

Speaker 1

我认为风投比公开市场投资者更普遍看好AI的一个原因是,风投看到了实实在在的生产力提升。

And I think one reason VCs are more broadly bullish on AI than public market investors is VCs see very real productivity gains.

Speaker 1

有大量图表显示,在同等收入水平下,如今的公司员工数量比两年前的公司显著减少。

There's all these charts that for a given level of revenue, a company today has significantly lower employees than a company of two years ago.

Speaker 1

原因是AI正在承担大量销售、支持工作,并帮助完善产品。

And the reason is AI is doing a lot of the sales, the support and helping to make the product.

Speaker 1

我是说,你知道Iconic有些图表,a16z的David George是我的好友,他是个很棒的人。

I mean, there's, you know, Iconic has some charts, a16z, by the way, David George is a good friend, great guy.

Speaker 1

你知道,他有这个模型破坏者的理论。

You know, he has this model busters thing.

Speaker 1

所以有非常明确的数据表明这种情况正在发生。

So there's very clear data that this is happening.

Speaker 1

因此那些了解风险投资领域的人都能看到这一点。

So people who have a lens into the world of venture see this.

Speaker 1

我确实认为这在第三季度非常重要。

And I do think it was very important in the third quarter.

Speaker 1

这是首个季度我们看到科技行业外的财富500强企业给出AI驱动增长的具体量化案例。

This is the first quarter where we had fortune 500 companies outside of the tech industry give specific quantitative examples of AI driven uplift.

Speaker 1

比如C。

So C.

Speaker 1

H。

H.

Speaker 1

罗宾逊公司的盈利增长了约20%。

Robinson went up something like 20% on earnings.

Speaker 1

我该告诉大家C.

Should I tell people what C.

Speaker 1

H.

H.

Speaker 1

Robinson是做什么的吗?

Robinson does?

Speaker 1

是的。

Yeah.

Speaker 1

比如说,有辆卡车从芝加哥开往丹佛。

Like, let's just say a truck goes from Chicago to Denver.

Speaker 1

而卡车司机住在芝加哥。

And then the trucker lives in Chicago.

Speaker 1

所以它要从丹佛返回芝加哥。

So it's gonna go back from Denver to Chicago.

Speaker 1

C.那里有个空载。

There's an empty load at C.

Speaker 1

H.

H.

Speaker 1

罗宾逊公司与这些卡车司机和运输公司建立了所有关系,他们通过匹配货主需求与空载供应来提高运输效率。

Robinson has all these relationships with these truckers and trucking companies, and they match shippers demand with that empty load supply to make the trucking more efficient.

Speaker 1

你知道,他们是一家货运代理公司,实际上有很多类似的公司,但他们是规模最大、最具主导地位的。

You know, they're a freight forwarder, you know, there's actually lots of companies like this, but they're the biggest and most dominant.

Speaker 1

所以他们做的最重要的事情之一就是报价和确认可用性。

So one of the most important things they do is they quote price and availability.

Speaker 1

比如有客户打电话给他们说:'嘿,我急需三辆十八轮大卡车从芝加哥运到丹佛。'

So somebody, a customer calls them up and says, Hey, I urgently need three eighteen wheelers from Chicago to Denver.

Speaker 1

但在过去,他们需要花费大约十五到四十五分钟来处理。

But in the past, they said it would take them, you know, fifteen to forty five minutes.

Speaker 1

而且他们只对60%的进站询价请求进行了报价。

And they only quoted 60% of inbound requests.

Speaker 1

借助人工智能,他们现在能对100%的请求进行报价,并且只需几秒钟就能完成。

With AI, they're quoting a 100% and doing it in seconds.

Speaker 1

因此他们公布了一个亮眼的季度报告,股价上涨了20%,这完全得益于AI驱动的生产力提升。

And so they printed a great quarter and the stock went up 20% and it was because of AI driven productivity.

Speaker 1

这正在影响收入线、成本线等方方面面。

That's impacting the revenue line, the cost line, everything.

Speaker 1

我其实非常担心Blackwell芯片的投资回报率会出现空档期,因为我们投入了巨额资金。

I was actually very worried about the idea that we might have this Blackwell ROI air gap because we're spending so much money on Blackwell.

Speaker 1

那些Blackwell芯片仅用于训练模型,而训练本身无法产生直接收益。

Those Blackwells are being used for training and there's no ROI on training.

Speaker 1

训练就是在构建模型。

Training is you're making the model.

Speaker 1

真正的投资回报来自模型推理阶段。

The ROI comes from inference.

Speaker 1

所以我真的很担心,我们可能会面临长达三个季度的资本支出高到难以想象的时期。

So I was really worried that, you know, we're gonna have maybe this three quarter period where the CapEx is unimaginably high.

Speaker 1

那些Blackwell芯片目前仅用于训练用途。

Those Blackwells are only being used for training.

Speaker 1

对。

Right.

Speaker 0

我们的保持不变,他们的在上升。

Ours staying flat, eyes going up.

Speaker 1

是的,没错。

Yeah, exactly.

Speaker 1

所以投资回报率下降,你可以看看Meta的情况,Meta公布的财报显示,因为他们一直没能研发出前沿模型。

So ROIC goes down and you could see like Meta, Meta they printed, you know, because Meta has not been able to make a frontier model.

Speaker 1

Meta公布的季度财报显示投资回报率下降,这对股价很不利。

Meta printed a quarter where ROIC declined and that was not good for the stocks.

Speaker 1

我当时非常担心这一点。

I was really worried about that.

Speaker 1

我认为这些数据点很重要,它们表明我们或许能度过这段投资回报率的潜在空窗期。

I do think that those data points are important in terms of suggesting that maybe we'll be able to navigate this potential air gap in ROIC.

Speaker 0

是啊。

Yeah.

Speaker 0

这让我思考,在这个市场中,与其他所有公司不同,市值最高的那10家公司吸引了所有注意力。

It makes me wonder about in this market, unlike everybody else, it's the 10 companies at the top that are all the market cap, more than all of the attention.

Speaker 0

标普500指数中还有490家公司。

There's four ninety other companies, the S and P 500.

Speaker 0

你也研究过那些公司。

You studied those too.

Speaker 0

比如,你对那群公司有什么看法?

Like, what do you think about that group?

Speaker 0

现在似乎没人谈论、也没人真正关心的那群公司中,有什么让你感兴趣的点?因为它们既没有带来回报,在整体指数中的占比也越来越小。

Like what is interesting to you about the group that now nobody seems to talk about and no one really seems to care about because they haven't driven returns and they're a smaller percent of the overall index?

Speaker 1

我认为如果越来越多的公司公布这类财报,人们就会开始关注。

I think that people are going to start to care if you have more and more companies print these C.

Speaker 1

C。

H.

Speaker 1

H。罗宾逊式的季度业绩。

Robinson like quarters.

Speaker 1

我认为那些历史上经营良好的公司,之所以能保持长期的成功记录,是因为不充分利用技术就无法取得成功。

I think the companies that have historically been really well run, the reason they have a long track record of success, you cannot succeed without using technology well.

Speaker 1

因此,如果企业内部具有实验和创新的文化氛围,我认为它们会在AI领域表现出色。

And so if you have a kind of internal culture of experimentation and innovation, I think you will do well with AI.

Speaker 1

我敢打赌,最优秀的投资银行会比那些落后的银行更早、更好地采用AI技术,历史往往就是未来的序幕。

I would bet on the best investment banks to be earlier and better adopters of AI than maybe some of the trailing banks, just sometimes past his prologue.

Speaker 1

在我看来,这种情况很可能会发生——我有一个强烈的观点:所有这些风险投资公司都在建立控股公司,声称要利用AI来提升传统企业的业务表现。

And I think it's likely to be in this case, one strong opinion I have, all these VCs are setting up these holding companies and, know, we're gonna use AI to make traditional businesses better.

Speaker 1

他们是非常精明的风险投资家,有着出色的业绩记录,但这正是私募五十年来一直在做的事。

And they're really smart VCs and they're great track records, but that's what private equity has been doing for fifty years.

Speaker 1

你不可能在私募的游戏中胜过他们。

You're just not gonna be private equity at their game.

Speaker 0

这就是Vista早期所采取的策略。

This is what Vista did in the early days.

Speaker 0

对吧?

Right?

Speaker 1

是的。

Yeah.

Speaker 1

我确实认为私募股权最近经历了一段艰难时期。

And I do think this is actually private equities, maybe had a little bit of a tough run.

Speaker 1

估值倍数已经上升了。

Just multiples have gone up.

Speaker 1

现在私募资产变得更昂贵了。

Now private assets are more expensive.

Speaker 1

融资成本也增加了。

The cost of financing has gone up.

Speaker 1

让公司上市变得很困难,因为公开市场的估值比私募估值低了30%。

It's tough to take a company public because the public valuation is 30% lower than the private valuation.

Speaker 1

所以私募确实遇到了困境。

So PE's had a tough run.

Speaker 1

实际上我认为这些私募股权公司在系统化应用AI方面会做得相当不错。

I actually think these private equity firms are gonna be pretty good at systematically applying AI.

Speaker 0

我们还没怎么讨论过Meta、Anthropic或OpenAI。

We haven't spent much time talking about meta, anthropic, or open AI.

Speaker 0

我很想听听你对基础设施领域所有进展的看法,就像我们之前讨论的那样。

And I'd love your impression on everything that's going on in this infrastructure side that we talked about.

Speaker 0

在这场宏大棋局中,这三家都是举足轻重的参与者。

These are three really important players in this grand game.

Speaker 0

我们目前讨论的这些发展将如何具体影响这几家公司?

How does all of this development that we've discussed so far impact those players specifically?

Speaker 1

首先,让我从整体上谈谈前沿模型。

First thing, let me just say about frontier models broadly.

Speaker 1

是的。

Yeah.

Speaker 1

在2023和24年,我很喜欢引用埃里克·维克里亚的话,我们的朋友埃里克·维克里亚是个才华横溢的人。

In 2023 and '24, I was fond of quoting Eric Vichria and Eric Vichria's statement, our friend, brilliant man.

Speaker 1

埃里克常说,基础模型是历史上增值最快的资产。

And Eric would always say, Foundation models are the fastest appreciating assets in history.

Speaker 1

我认为他有90%是正确的。

And I would say he was 90% right.

Speaker 1

我对这个说法做了修改。

I modified the statement.

Speaker 1

我说过:'没有独特数据和互联网规模分发的基础模型,是历史上增值最快的资产'。

I said, Foundation models without unique data and internet scale distribution are the fastest appreciating assets in history.

Speaker 1

而推理从根本上深刻改变了这一局面。

And reasoning fundamentally changed that in a really profound way.

Speaker 1

用杰夫·贝索斯的话说,每个伟大的互联网公司核心都存在这样一个循环飞轮。

So there was a loop, flywheel to quote Jeff Bezos, that it was at the heart of every great internet company.

Speaker 1

这个逻辑就是:你做出了好产品,

And it was, you made a good product.

Speaker 1

获得了用户,用户使用产品产生的数据可以反馈到产品中使其变得更好。

You got users, those users using the product generated data that could be fed back into the product to make it better.

Speaker 1

这个飞轮已经在Netflix、亚马逊、Meta和谷歌运转了十多年。

And that flywheel has been spinning at Netflix, at Amazon, at Meta, at Google for over a decade.

Speaker 1

这是一个极其强大的飞轮效应。

And that's an incredibly powerful flywheel.

Speaker 1

这就是为什么这些互联网企业如此难以被竞争超越。

And it's why those internet businesses were so tough to compete with.

Speaker 1

这就是为什么它们具有规模报酬递增的特性。

It's why they're increasing returns to scale.

Speaker 1

所有人都在谈论网络效应。

Everybody talks about network effects.

Speaker 1

这对社交网络来说曾经很重要。

They were important for social networks.

Speaker 1

我不确定Meta现在在多大程度上还能算作社交网络。

I don't know to what extent Meta is a social network anymore.

Speaker 1

它更像是个内容分发平台,但正是由于这个飞轮效应,它们实现了规模报酬递增。

It's more like a content distribution, but they just had increasing returns to scale because of that flywheel.

Speaker 1

而在AI具备推理能力之前的时代,这种动态机制并不存在。

And that dynamic was not present in the pre reasoning world of AI.

Speaker 1

你预训练了一个模型,把它放到现实世界中,它就那样了。

You pre trained a model, you let it out in the world and it was what it was.

Speaker 1

这实际上相当困难。

And it was actually pretty hard.

Speaker 1

他们会通过RLHF(人类反馈强化学习)来尝试改进机器人模型。

They would do RLHF reinforcement learning with human feedback, and you try and make the bot model better.

Speaker 1

或许你能从推特氛围中感觉到人们不喜欢这样。

And maybe you'd get a sense from Twitter vibes that people didn't like this.

Speaker 1

于是你进行微调。

And so you tweak it.

Speaker 1

虽然有上下箭头的小按钮,但要把这些反馈真正融入模型其实相当困难。

There are the little up and down arrows, but it was actually pretty hard to feed that back into the model.

Speaker 1

虽然推理能力还处于早期阶段,但那个飞轮已经开始转动了。

With reasoning, it's early, but that flywheel started to spin.

Speaker 1

这对这些前沿实验室来说意义深远。

And that is really profound for these Frontier Labs.

Speaker 1

首先,推理能力从根本上改变了前沿实验室的行业格局。

So one, reasoning fundamentally changed the industry dynamics of Frontier Labs.

Speaker 0

具体解释一下为什么是这样。

Just explain why specifically that is.

Speaker 0

到底发生了什么?

Like what is going on?

Speaker 1

因为如果很多人都在问类似的问题,他们要么一直喜欢这个答案,要么一直不喜欢。

Because if a lot of people are asking a similar question, they're consistently either liking or not liking the answer.

Speaker 1

那你就可以利用这种可验证的反馈作为奖励信号。

Then you can kind of use that like that has a verifiable reward.

Speaker 1

这是个好结果。

That's a good outcome.

Speaker 1

然后你就可以把这些优质回答反馈给模型。

And then you can kind of feed those good answers back into the model.

Speaker 1

目前这个飞轮才刚刚开始转动。

And we're very early at this flywheel spinning.

Speaker 0

是的。

Yeah.

Speaker 0

明白

Got

Speaker 1

了。

it.

Speaker 1

虽然现在操作起来很困难,但你能看到它开始运转了。

Like it's hard to do now, but you can see it beginning to spin.

Speaker 1

因此,这是影响所有这些动态的首要重要事实。

So this is important fact number one for all of those dynamics.

Speaker 1

其次,我认为非常关键的是,Meta的马克·扎克伯格在今年一月初曾表示——虽然我可能会记错原话——但他高度自信地宣称2025年某个时间点他们将拥有最优秀且性能最佳的人工智能。

Second, I think it's really important that Meta, you know, Mark Zuckerberg at the beginning of this year in January said, I'm highly confident I'm gonna get the quote wrong, that at some point in 2025, we're gonna have the best and most performant AI.

Speaker 1

我不确定他是否排得进前100名。

I don't know if he's in the top 100.

Speaker 1

所以他的判断可以说是错得离谱。

So he was as wrong as it was possible to be.

Speaker 1

我认为这是一个非常重要的事实,因为这表明这四家公司所做的事情确实非常困难,因为Meta投入了大量资金却失败了。

And I think that is a really important fact because it suggests that what these four companies have done is really hard to do because Meta threw a lot of money at it and they failed.

Speaker 1

Yama Koon不得不离开。

Yama Koon had to leave.

Speaker 1

他们不得不为那些人才支付了十亿美元。

They had to have the famous billion dollar for AI researchers.

Speaker 1

顺便说一句,微软也失败了。

By the way, Microsoft also failed.

Speaker 1

他们并未做出如此明确的预测,但他们确实是翻译中。

They did not make such an unequivocal prediction, but they bought Inflection AI.

Speaker 1

而且他们有很多评论和预期表明我们的内部模型会迅速改进。

And there were a lot of comments from them that we anticipate our internal models quickly getting better.

Speaker 1

我们将在内部模型上运行越来越多的人工智能。

And we're going to run more and more of our AI on our internal models.

Speaker 1

不。

Nope.

Speaker 1

亚马逊收购了一家名为Adept AI的公司。

Amazon, they bought a company called Adept AI.

Speaker 1

他们有自己的模型叫Nova。

They have their models called Nova.

Speaker 1

我认为它们排不进前20名。

I don't think they're in the top 20.

Speaker 1

显然这比人们一年前预想的要困难得多。

So clearly it's much harder to do than people thought a year ago.

Speaker 1

这其中有许多许多原因。

And there's many, many reasons for that.

Speaker 1

比如要维持大型GPU集群的协同运作实际上非常困难。

Like it's actually really hard to keep a big cluster of GPUs coherent.

Speaker 1

很多这些公司过去习惯于运行以成本优化而非性能优化的基础设施。

A lot of these companies were used to running their infrastructure to optimize for cost instead of performance.

Speaker 0

复杂性与性能。

Complexity and performance.

Speaker 1

在大规模集群中保持GPU的高利用率及其复杂性,这实际上非常困难。

Complexity and keeping the GPUs running at high utilization rate in a big cluster, it's actually really hard.

Speaker 1

而且不同公司在GPU运行效率上存在巨大差异。

And there are wild variations in how well companies run GPUs.

Speaker 1

考虑到物理定律的限制,也许最多只能让二三十万个黑井保持一致性。

If the most anybody, because of laws of physics, you know, maybe you can get two or 300,000 black wells coherent.

Speaker 1

我们拭目以待。

We'll see.

Speaker 1

但如果你的集群运行时间只有30%,而竞争对手有90%,那你根本没法竞争。

But if you have 30% uptime on that cluster and you're competing with somebody who has 90% uptime, you're not even competing.

Speaker 1

所以第一点,人们在GPU运行效率上存在巨大差异。

So one, there's a huge spectrum in how well people run GPUs.

Speaker 1

第二,我认为这些AI研究人员喜欢谈论品味,我觉得这很有趣。

Two, then I think there is, you know, these AI researchers, they like to talk about taste.

Speaker 1

我觉得这非常滑稽。

I find it very funny.

Speaker 1

你知道,哦,为什么你能赚这么多钱?

You know, oh, why do you make so much money?

Speaker 1

我的品味非常好。

I have very good taste.

Speaker 1

明白吗?

You know?

Speaker 1

所谓品味,就是你对要进行的实验有良好的直觉判断力。

What taste means is you have a good intuitive sense for the experiments to perform.

Speaker 1

这就是为什么你要付给人们高薪——因为事实证明,随着这些模型规模扩大,你无法先在千级GPU集群上做实验,再在十万级GPU集群上复现。

This is why you pay people a lot of money because it actually turns out that as these models get bigger, you can no longer run an experiment on a thousand GPU cluster and replicate it on a 100,000 GPUs.

Speaker 1

你必须直接在五万级GPU集群上运行实验,这可能需要耗时数天。

You need to run that experiment on 50,000 GPUs and maybe it takes, you know, days.

Speaker 1

因此机会成本极高。

And so there's a very high opportunity cost.

Speaker 1

你必须拥有一支真正优秀的团队,能够正确决策该进行哪些实验。

You have to have a really good team that can make the right decisions about which experiments to run on this.

Speaker 1

然后你还需要在训练后阶段做好所有强化学习,并优化推理时的计算资源。

And then you need to do all the reinforcement learning during post training well, and the test time compute well.

Speaker 1

这真的很难做到,但所有人都以为很容易。这些事情就像...我早年做零售分析师时常说,随便选美国任何一个垂直领域。

It's really hard to do and everybody thinks it's easy, but all those things, you know, I used to have this saying, like, I was a retail analyst long ago, pick any vertical in America.

Speaker 1

如果你能在50个州运营上千家门店,保持店面整洁、照明良好、商品陈列合理、价格优惠,员工友善且不偷窃,你就能成为200亿甚至300亿美元市值的公司。

If you can just run a thousand stores in 50 states and have them clean, well lit, stocked with relevant goods at good prices and staffed by friendly employees who are not stealing from you, you're going to be a $20,000,000,000 company, a $30,000,000,000 company.

Speaker 1

但全美能做到这点的公司不超过15家。

Like 15 companies have been able to do that.

Speaker 1

这真的很难。

It's really hard.

Speaker 1

道理是相通的。

And it's the same thing.

Speaker 1

要完美做好所有这些事情真的非常困难。

Doing all of these things well is really hard.

Speaker 1

而随着这个飞轮效应持续运转,它正在形成行业准入壁垒。

And then reasoning with this flywheel, this is beginning to create barriers to entry.

Speaker 1

更重要的是,这些实验室——XAI、Gemini、OpenAI和Anthropic——每个内部都拥有更先进的模型检查点。

And what's even more important, every one of those labs, XAI, Gemini, OpenAI and Anthropic, they have a more advanced checkpoint internally of the model.

Speaker 1

检查点基本上就是持续优化这些模型,然后发布一个阶段性版本。

Checkpoint is just kind of continuously working on these models and then you release kind of a checkpoint.

Speaker 1

这就是为什么这些模型能快速迭代的原因,是的,

And then the reason these models get fast Yeah, the

Speaker 0

他们内部使用的版本是专门为

one they're using internally is for a

Speaker 1

球形优化设计的,他们正在用那个模型训练下一代模型。

spherical And they're using that model to train the next model.

Speaker 1

如果你没有那个最新检查点,追赶会变得极其困难。

And if you do not have that latest checkpoint, it's getting really hard to catch up.

Speaker 1

中国的开源项目简直是天赐给Meta的礼物,因为你可以利用中国的开源资源。

Chinese open source is a from God to meta because you can use Chinese open source.

Speaker 1

那就能成为你的检查点。

That can be your checkpoint.

Speaker 1

你可以将其作为一种启动方式。

And you can use that as a way to kind of bootstrap this.

Speaker 1

我确信他们以及其他人都在尝试这么做。

And that's what I'm sure they're trying to do and everybody else.

Speaker 1

最大的问题和关键变数在于,我认为中国在稀土问题上犯了个严重错误。

The big problem and the big, a giant swing factor, I think China's made a terrible mistake with this rare earth thing.

Speaker 1

因为中国有华为昇腾芯片,虽然性能尚可但相比已淘汰的热保存技术优势不大。

So China, because, you know, they have Huawei, Ascend, and it's a decent chip versus something like the deprecated hot preserving something that looks okay.

Speaker 1

所以他们试图强制中国开源项目使用国产自主研发芯片。

And so they're trying to force Chinese open source to use their Chinese chips, their domestically designed chips.

Speaker 1

问题是 Blackwell 即将面世,这将导致美国前沿实验室与中国开源之间的差距急剧扩大。

The problem is Blackwell's going to come out now and the gap between these American frontier labs and Chinese open source is going to blow out because of Blackwell.

Speaker 1

实际上深度求索在最新技术论文 V3.2 中坦言:'难以与美国前沿实验室竞争的原因之一是我们的算力不足'。

And actually DeepSeek in their most recent technical paper, V3.2, said one of the reasons we struggle to compete with the American frontier labs is we don't have enough compute.

Speaker 1

这是他们用非常政治正确(但仍带风险)的方式在暗示:因为中国表态拒绝 Blackwell 芯片。

That was their very politically correct, still a little bit risky way of saying, cause China said we don't want the black wells.

Speaker 1

没错。

Right.

Speaker 1

他们说的是,

And they're saying,

Speaker 0

能不能把Blackwell给我们?

won't you please give us the black wells?

Speaker 1

这可能是个重大失误。

That might be a big mistake.

Speaker 1

如果这样发展下去,美国四大实验室与中国开源之间的差距会进一步拉大,这让其他追赶者更加困难,因为差距正在扩大。

So if you just kind of play this out, these four American labs are gonna start to widen their gap versus Chinese open source, which then makes it harder for anyone else to catch up because that gap is growing.

Speaker 1

所以你无法利用中国开源来启动。

So you can't use Chinese open source to bootstrap.

Speaker 1

而从地缘政治角度看,中国本以为他们掌握了筹码。

And then geopolitically, China thought they had the leverage.

Speaker 1

他们会意识到:哦,这下搞砸了。

They're going to realize, oh, whoopsie daisy.

Speaker 1

我们确实需要Blackwell芯片

We do need the Blackwells.

Speaker 1

不幸的是这对他们来说很可能是

And unfortunately they'll probable for them.

Speaker 1

他们可能会在2026年底意识到这一点

They'll probably realize that in late twenty six.

Speaker 1

到那时,将会付出巨大的努力

And at that point, there's an enormous effort underway.

Speaker 1

DARPA有各种非常酷的项目,包括国防部的计划,旨在激励针对稀土资源的创新技术解决方案。

DARPA has, there's all sorts of really cool DARPA and DOD programs to incentivize really clever technological solutions for rare earths.

Speaker 1

而且有很多稀土矿藏位于对美国非常友好的国家,这些国家并不介意用传统方式进行提炼。

And then there's a lot of rare earth deposits in countries that are very friendly to America that don't mind actually refining it in the traditional way.

Speaker 1

所以我认为稀土问题会比任何人预想的都更快得到解决。

So I think rare earths are going to be solved way faster than anyone thinks.

Speaker 1

要知道,它们显然并不那么稀有。

Know, they're obviously not that rare.

Speaker 1

它们只是名字起错了。

They're just misnamed.

Speaker 1

它们之所以稀有是因为提炼过程非常复杂。

They're rare because they're really messy to refine.

Speaker 1

所以从地缘政治角度看,我认为Blackwell确实很重要,随着这个差距的扩大,它将给美国带来很大的筹码。

And so geopolitically, I actually think Blackwell's pretty significant and it's going to give America a lot of leverage as this gap widens.

Speaker 1

然后在这种背景下,回到这些公司之间的动态关系,xAI将率先推出首个Blackwell模型,很可能也会成为第一个大规模使用Blackwell进行推理的公司。

And then in the context of all of that, going back to the dynamics between these companies, xAI will be out with the first Blackwell model and then they'll be the first ones probably using Blackwell for inference at scale.

Speaker 1

我认为这对他们来说是个重要时刻。

And I think that's an important moment for them.

Speaker 1

顺便说一句,这很有趣。

And by the way, it is funny.

Speaker 1

比如你去open router看看,就会发现他们占据主导份额。

Like if you go on open router, you can just look, they have dominant share.

Speaker 1

现在open routers不管具体是什么,它只占API令牌的1%,但这表明他们处理了1,350,000,000,000次请求。

Now open routers, whatever it is, it's 1% of API tokens, but it's an indication they process 1,350,000,000,000.

Speaker 1

谷歌大概处理了80到9000亿次。

Google did like 8 or 900,000,000,000.

Speaker 1

这大概是过去七天或上个月的数据。

This is like whatever it is last seven days or last month.

Speaker 1

Anthropic处理了7000亿次。

Anthropic was at 700,000,000,000.

Speaker 1

xAI的表现确实非常出色。

Like xAI is doing really, really well.

Speaker 1

而且他们的模型非常棒。

And the model is fantastic.

Speaker 1

我强烈推荐,你会看到xAI推出这个。

I highly recommend it, But you'll see xAI come out with this.

Speaker 1

OpenAI会更快推出。

OpenAI will come out faster.

Speaker 1

OpenAI试图通过Stargate解决的问题是因为他们需要为算力支付额外成本。

OpenAI's issue that they're trying to solve with Stargate is because they pay a margin to people for compute.

Speaker 1

也许管理他们电脑的人并不擅长运行GPU。

And maybe the people who run their computer are not the best at running GPUs.

Speaker 1

他们是高成本的token生产者。

They're a high cost producer of tokens.

Speaker 1

我认为这在一定程度上解释了他们很多

And I think this kind of explains a lot of their

Speaker 0

最近的Code Red。

Code Red recently.

Speaker 1

是的。

Yeah.

Speaker 1

嗯,那1.4万亿美元的支出承诺。

Well, the $1,400,000,000,000 in spending commitments.

Speaker 1

我想那就像是,他们知道自己需要筹集大量资金,尤其是如果谷歌继续保持当前吸走整个行业经济氧气的策略。

And I think that was just like, hey, they know they're gonna need to raise a lot of money, particularly if Google keeps its current strategy of sucking the economic oxygen out of the room.

Speaker 1

而且,你知道,从1.4万亿美元的粗略感觉很快变成Code Red状态,速度相当快,对吧?

And, know, you go from 1,400,000,000,000.0 rough vibes code red, like pretty fast, you know?

Speaker 1

他们发布红色警报正是因为所有这些动态因素。

And the reason they have a code red is because of all these dynamics.

Speaker 1

所以他们虽然会推出新模型,但在单位token成本上仍无法与XAI、谷歌和Anthropic相抗衡。

So then they'll come out with a model, but they will not affix their per token cost disadvantage yet relative to both XAI and Google and Anthropic at that point.

Speaker 1

Anthropic是家优秀的公司。

Anthropic is a good company.

Speaker 1

要知道,他们的现金消耗速度远低于OpenAI,增长速度却更快。

You know, they're burning dramatically less cash than OpenAI and growing faster.

Speaker 1

所以我认为必须给予Anthropic高度认可。

So I think you have to give Anthropic a lot of credit.

Speaker 1

这很大程度上得益于他们与谷歌和亚马逊在TPU及Trainium芯片上的合作关系。

And a lot of that is their relationship with Google and Amazon for the TPUs and the Trainiums.

Speaker 1

因此Anthropic得以共享谷歌享有的相同发展红利。

And so Anthropic has been able to benefit from the same dynamics that Google has.

Speaker 1

我认为这盘大棋局很能说明问题。

I think it's very indicative in this great game of chess.

Speaker 1

你可以看看达里奥和詹森的公开言论,他们之间可能有过一些交锋。

You can look at Dario and Jensen, maybe have taken a few public comments, you know, that were between them.

Speaker 1

较量。

Jousting.

Speaker 1

一点小小的较量。

A little bit of jousting.

Speaker 1

Anthropic刚刚与英伟达签署了50亿美元的协议。

Well, Anthropic just signed the $5,000,000,000 deal with Nvidia.

Speaker 1

这是因为达里奥是个聪明人,他了解Blackwell和Rubid相对于TPU的这些动态。

That is because Dario is a smart man and he understands these dynamics about Blackwell and Rubid relative to TPU.

Speaker 1

所以英伟达现在从拥有xAI和OpenAI两个战士,变成了三个战士,这有助于英伟达与谷歌的对抗。

So Nvidia now goes from having two fighters, xAI and OpenAI, to three fighters that helps in this Nvidia versus Google battle.

Speaker 1

如果Meta能迎头赶上,那将非常重要。

And then if Meta can catch up, that's really important.

Speaker 1

我确信英伟达正在尽其所能帮助Meta。

I am sure NVIDIA is doing whatever they can to help Meta.

Speaker 1

你正在这样运行那些GPU。

You're running those GPUs this way.

Speaker 1

也许我们应该这样拧螺丝或者那样调节旋钮。

Maybe we should twist the screw this way or turn the dial that way.

Speaker 1

如果Blackwell回归的话——看起来它很可能会回归——那也将非常好,因为届时中国的开源生态也会回归。

And then it will be also if Blackwell comes back to which it seems like it'll probably happen, that will also be very good because then Chinese open source will be back.

Speaker 0

我总是对事物的两极充满好奇。

I'm always so curious about the poles of things.

Speaker 0

一极是你关注的其它突破性进展,比如我们之前讨论过的数据中心里非芯片类的东西就是一个例子。

One pole would be the other breakthroughs that you have your mind on, things in the data center that aren't chips that we've talked about before as one example.

Speaker 1

我认为未来三到四年内,世界上最重要的事情将是太空中的数据中心。

I think the most important thing that's going to happen in the world, in this world, in the next three to four years is data centers in space.

Speaker 1

这对地球上所有建设发电厂或数据中心的人都有着极其深远的影响。

And this has really profound implications for everyone building a power plant or a data center on planet earth.

Speaker 1

明白吗?

Okay?

Speaker 1

而且现在正掀起一股巨大的淘金热。

And there is a giant gold rush into this.

Speaker 0

我还没听说过这件事。

I haven't heard anything about this.

Speaker 0

所以请赐教。

So please educate.

Speaker 1

是啊。

Yeah.

Speaker 1

要知道,就像大家都觉得,嘿,AI有风险,但你猜怎么着?

Know, it's like everybody thinks like, Hey, AI is risky, but you know what?

Speaker 1

我要建一个数据中心。

I'm going build a data center.

Speaker 1

我要建一个能运行数据中心的发电厂。

I'm going to build a power plant that's going to do a data center.

Speaker 1

我们会需要这个的。

We will need that.

Speaker 1

但从第一性原理来思考,数据中心应该建在太空。

But if you think about it from first principles, data centers should be in space.

Speaker 1

运营数据中心的基本要素是什么?

What are the fundamental inputs to running a data center?

Speaker 1

就是电力和冷却系统。

Their power and their cooling.

Speaker 1

然后还有芯片。

And then there are the chips.

Speaker 0

如果从总成本的角度来考虑。

If you think about it from a total cost perspective.

Speaker 1

没错。

Yeah.

Speaker 1

还有让这些神奇机器产出代币所需的投入。

And just the inputs to making the tokens come out of the magic machines.

Speaker 1

在太空中,卫星可以24小时持续接受阳光照射,而且太阳光强度要高出30%。

In space, you can keep a satellite in the sun twenty four hours a day and the sun is 30% more intense.

Speaker 1

你可以让卫星持续接收光照。

You can have the satellite always kind of catching light.

Speaker 1

太阳光强度增加30%,这导致外太空的辐射量是地球表面的六倍。

The sun is 30% more intense and this results in six times more irradiance in outer space than on planet earth.

Speaker 1

所以你能获得大量太阳能。

So you're getting a lot of solar energy.

Speaker 1

第二点,由于卫星24小时处于日照中,不需要电池。

Point number two, because you're in the sun twenty four hours a day, don't need a battery.

Speaker 1

这是成本中很大一部分。

This is a giant percentage of the cost.

Speaker 1

因此太阳系中最低成本的能源就是太空中的太阳能。

So the lowest cost energy available in our solar system is solar energy and space.

Speaker 1

其次是冷却问题。

Second for cooling.

Speaker 1

在这些机架中,大部分质量和重量都来自冷却系统。

In one of these racks, a majority of the mass and the weight is cooling.

Speaker 1

这些数据中心的冷却系统极其复杂。

The cooling in these data centers is incredibly complicated.

Speaker 1

包括暖通空调、冷却分配单元、液体冷却等。

The HVAC, the CDUs, the liquid cooling.

Speaker 0

看起来非常酷。

It's very cool to see.

Speaker 1

令人惊叹的景象。

It's amazing to see.

Speaker 1

在太空中,冷却是免费的。

In space, cooling is free.

Speaker 1

只需在卫星背阴面安装散热器即可。

You just put a radiator on the dark side of the satellite.

Speaker 0

这简直太棒了。

It's fucking gold.

Speaker 0

而且

And it's

Speaker 1

几乎可以接近绝对零度。

as close to absolute zero as you can get.

Speaker 1

所以所有这些都不需要了,能省下巨额成本。

So all that goes away and that is a vast amount of cost.

Speaker 1

好的。

Okay.

Speaker 1

让我们想想,也许每个卫星就像一个机架。

Let's think about how these maybe each satellite is kind of a rack.

Speaker 1

这是一种思考方式。

It's one way to think of it.

Speaker 1

也许有些人会造更大的卫星,相当于三个机架。

Maybe some people make bigger satellites that are three racks.

Speaker 1

那么,你打算如何连接这些机架呢?

Well, how are you going to connect those racks?

Speaker 1

有趣的是在数据中心,机架之间超过一定距离就会用光纤连接。

Well, it's funny in the data center, the racks are over a certain distance connected with fiber optics.

Speaker 1

这其实就是指激光通过电缆传输。

And that just means a laser going through a cable.

Speaker 1

唯一比激光通过光纤电缆更快的是激光在绝对真空中传输。

The only thing faster than a laser going through a fiber optic cable is a laser going through absolute vacuum.

Speaker 1

因此,如果能用激光将这些太空中的卫星连接起来,实际上就拥有了比地球数据中心更快、更连贯的网络。

So if you can link these satellites in space together using lasers, you actually have a faster and more coherent network than in a data center on earth.

Speaker 1

好的。

Okay.

Speaker 1

至于训练过程,那将需要很长时间。

For training, that's going to take a long time.

Speaker 1

因为规模太庞大了。

Because it's so big.

Speaker 1

没错,正因为规模如此庞大,训练最终总会完成的。

Yeah, just because it's so big, training will eventually happen.

Speaker 1

但说到推理环节,让我们考虑下用户体验。

But then for inference, let's think about the user experience.

Speaker 1

当我向Grok询问关于你的事情,它给出了一个很好的回答时,整个过程是这样的。

When I asked Grok about you and it gave the nice answer, here's what happened.

Speaker 1

无线电波从我的手机传输到基站。

A radio wave traveled from my cell phone to a cell tower.

Speaker 1

然后信号到达基站,进入光纤电缆,传到纽约某个城域汇聚设施,大概离这里就10个街区左右。

Then it hit the base station, went into a fiber optic cable, went to some sort of metro aggregation facility in New York, probably within like, you know, 10 blocks of here.

Speaker 1

那里有个小型城域路由器,把数据包路由到某个大型XAI数据中心。

There's a small little metro router that's routed those packets to a big XAI data center somewhere.

Speaker 1

完成计算后,结果沿着原路返回。

The computation was done and it came back over the same path.

Speaker 1

如果卫星能直接与手机通信(星链已展示过直连手机技术),整个过程就变成咻-咻两下。

If the satellites can communicate directly with the phone and Starlink has demonstrated direct to cell capability, you just go boom, boom.

Speaker 1

这种用户体验更好,成本也更低。

It's a much better, lower cost user experience.

Speaker 1

所以从第一性原理来看,太空数据中心在各方面都优于地球上的数据中心。

So in every way, data centers in space, from a first principles perspective, are superior to data centers on earth.

Speaker 0

如果我们能瞬间实现这个构想,这部分我能理解。

If we could teleport that into existence, I understand that portion.

Speaker 0

为什么这不会实现呢?

Why will that not happen?

Speaker 0

是因为发射成本吗?

Is it launch cost?

Speaker 0

是因为发射可用性吗?

Is it launch availability?

Speaker 0

是因为容量问题吗?

Is it capacity?

Speaker 1

需要大量这种设备。只有星舰能经济高效地实现这个目标。

Need a lot of those Like stationary the Starships are the only ones that can economically make that happen.

Speaker 1

我们需要大量这样的星舰。

We need a lot of those Starships.

Speaker 1

也许中国或俄罗斯能够实现火箭着陆。

Maybe China or Russia will be able to land a rocket.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客