a16z Podcast - 构建AI的现实世界基础设施,携手谷歌、思科与a16z 封面

构建AI的现实世界基础设施,携手谷歌、思科与a16z

Building the Real-World Infrastructure for AI, with Google, Cisco & a16z

本集简介

人工智能不仅正在改变软件领域,更推动着现代史上最大规模的实体基础设施建设。本期节目中,a16z的Raghu Raghuram与谷歌AI及基础设施副总裁兼总经理Amin Vahdat、思科总裁兼首席产品官Jeetu Patel共同探讨这场从芯片到电网再到全球数据中心的空前建设浪潮。 他们讨论了以电力、算力和网络为新稀缺资源的"AI工业革命";地缘政治竞争如何影响芯片设计和数据中心选址;以及为何下一代AI基础设施需要硬件、软件和网络的协同设计。对话还涉及企业如何适应转型、为何我们仍处于资本支出超级周期的早期阶段,以及AI推理、强化学习和多站点计算将如何重塑系统构建与运行方式。 资源: 关注Raghu的X账号:https://x.com/RaghuRaghuram 关注Jeetu的X账号:https://x.com/jpatel41 关注Amin的领英:https://www.linkedin.com/in/vahdat/ 持续更新: 喜欢本期节目请点赞、订阅并分享给朋友! 关注a16z的X账号:https://x.com/a16z 关注a16z的领英:https://www.linkedin.com/company/a16z 在Spotify收听a16z播客:https://open.spotify.com/show/5bC65RDvs3oxnLyqqvkUYX 在Apple Podcasts收听a16z播客:https://podcasts.apple.com/us/podcast/a16z-podcast/id842818711 关注主持人:https://x.com/eriktorenberg 请注意,本内容仅供信息参考,不作为法律、商业、税务或投资建议,也不用于评估任何投资或证券,且不针对任何a16z基金的现有或潜在投资者。a16z及其关联方可能持有讨论企业的投资。详情请见a16z.com/disclosures。 持续更新: 关注a16z的X账号 关注a16z的领英 在Spotify收听a16z播客 在Apple Podcasts收听a16z播客 关注主持人:https://twitter.com/eriktorenberg 请注意,本内容仅供信息参考,不作为法律、商业、税务或投资建议,也不用于评估任何投资或证券,且不针对任何a16z基金的现有或潜在投资者。a16z及其关联方可能持有讨论企业的投资。详情请见a16z.com/disclosures。 本节目由AdsWizz旗下Simplecast托管。关于我们收集和使用个人数据用于广告的信息,请访问pcm.adswizz.com。

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

好消息是基础设施又变得性感了,这还挺酷的。

The good news is infrastructure's sexy again, so that's kinda cool.

Speaker 0

这就像是互联网建设、太空竞赛和曼哈顿计划的结合体,涉及地缘政治影响、经济影响、国家安全影响,还有速度影响——这种影响相当深远。

This is, like, the combination of the build out of the Internet, the space race, and the Manhattan Project all put into one, where there's a geopolitical implication of it, there's an economic implication, there's a national security implication, and then there's, just a speed implication that's pretty profound.

Speaker 1

我是说,我觉得这话说起来容易。

I mean, I think it's easy to say.

Speaker 1

我从未见过这样的景象。

I've seen nothing like this.

Speaker 1

我很确定没人见过类似的情况。

I'm fairly certain no one's seen anything like this.

Speaker 1

九十年代末、两千年代初的互联网很宏大,我们当时觉得,天哪,不敢相信这种建设速度和规模。

The Internet in the late nineties, early two thousands was big, and we felt like, oh my gosh, can't believe the, build out, the rate.

Speaker 1

相比之下,这次...我是说,用10倍来形容都是保守的。

This makes it I I mean, 10 x is an understatement.

Speaker 1

这是互联网规模的100倍。

It's a 100 x what the Internet was.

Speaker 2

AI热潮不仅仅在改变软件行业。

The AI boom isn't just changing software.

Speaker 2

它正在改造支撑AI运行的实体基础设施。

It's transforming the physical infrastructure that runs it.

Speaker 2

今天你将听到来自谷歌的Amin Vadat、思科的Jitu Patel以及a16z的Raghu Raghuram的对话,探讨如何构建支撑大规模AI的现实系统——从芯片、电力到数据中心和网络。

Today, you'll hear a conversation with Amin Vadat from Google, Jitu Patel from Cisco, and Raghu Raghuram from a sixteen z on what it takes to build the real world systems behind large scale AI from chips and power to data centers and networking.

Speaker 2

他们讨论了当前建设规模、算力与互连的新限制,以及硬件和架构专业化如何重塑行业格局和全球地缘政治。

They discuss the scale of the current build out, the new constraints on compute power and interconnect, and how specialization in hardware and architecture is reshaping both the industry and global geopolitics.

Speaker 2

地缘政治。

Geopolitics.

Speaker 2

这是对基础设施本身如何在AI时代被重塑,以及未来发展趋势的务实观察。

It's a grounded look at how infrastructure itself is being reinvented for the AI era and what comes next.

Speaker 2

我们开始吧。

Let's get into it.

Speaker 3

还有什么比现在更适合讨论基础设施的呢?

What better time and place to talk infrastructure?

Speaker 3

好的。

Alright.

Speaker 3

我们又回到了休息室。

We were back in the green room.

Speaker 3

就在第一个问题得到回答时,我被中断了。

And just as the first question was getting answered, I got cut off.

Speaker 3

所以据我所知,这可能是完全重复的。

So this could be an entire repeat for all I know.

Speaker 3

不过无论如何,我们继续吧。

So but, anyway, let's go.

Speaker 3

对吧?

Right?

Speaker 3

第一个问题很相似。

The first question is similar.

Speaker 3

首先欢迎两位的到来,感谢你们的参与。

So both of you, firstly, welcome and thank you for being here.

Speaker 1

非常感谢

Thank you so

Speaker 3

也祝你们度过愉快的一天半时间。

And hope you'll have a great day and a half as well.

Speaker 3

两位在行业里都已有相当资历。

Both of you have been in the industry for a while.

Speaker 3

你们都经历过多次基础设施周期。

And both of you have lived through many infrastructure cycles.

Speaker 3

对吧?

Right?

Speaker 3

那么从你的角度来看,你见过这样的周期吗?

So have you seen anything like this cycle from your vantage point?

Speaker 3

不是从投资者的角度,而是从你内部负责构建事物、规划事物等的角度来看?

Not from an investor vantage point, but from your internal vantage point where you are responsible for building things and planning for things and so on?

Speaker 3

你们谁想先开始?

Anyone of you, where do you wanna start?

Speaker 3

阿明,你想先开始吗?

You wanna start Amin?

Speaker 0

阿明,你先说吧。

Go ahead, Amin.

Speaker 1

我觉得这很容易说。

I mean, I think it's easy to say.

Speaker 1

我从未见过这样的情况。

I've seen nothing like this.

Speaker 1

我很确定没人见过类似的情况。

I'm fairly certain no one's seen anything like this.

Speaker 1

九十年代末、两千年代初的互联网很庞大,我们当时觉得,天啊,难以置信这种建设速度和规模。

The Internet in the late nineties, early two thousands was big, and we felt like, oh my gosh, can't believe the build out, the rate.

Speaker 1

相比之下这个...我是说,说10倍都是保守估计。

This makes it I I mean, 10 x is an understatement.

Speaker 1

这是互联网规模的100倍。

It's a 100 x what the Internet was.

Speaker 1

我认为其上升空间和互联网一样巨大。

I think the upside is as big as the Internet was, same thing.

Speaker 1

10倍乃至100倍。

10 x and a 100 x.

Speaker 1

是的。

Yeah.

Speaker 1

无与伦比。

Nothing like it.

Speaker 0

我同意。

I'd agree.

Speaker 0

我认为在规模、速度和范围上都是前所未有的。

I don't think there's any priors to the size the speed and scale.

Speaker 0

我想说好消息是基础设施又变得有吸引力了,这挺酷的。

I'd say the good news is infrastructure is sexy again, so that's kinda cool.

Speaker 0

有很长一段时间它都不受关注。

It was a long time where it wasn't sexy.

Speaker 0

我认为真正有趣的是,这就像是互联网建设、太空竞赛和曼哈顿计划的结合体,具有地缘政治影响、经济影响、国家安全影响,以及相当深远的速度影响。

The thing I would say that's really interesting is this is like the combination of the build out of the Internet, the space race, and the Manhattan Project all put into one, where there's a geopolitical implication of it, there's an economic implication, there's a national security implication, and then there's just a speed implication that's pretty profound.

Speaker 0

所以,是的,我们从未见过如此规模和范围的景象。

So, yeah, none of us have ever seen it at this size and scale.

Speaker 0

另一方面,我认为我们严重低估了。

On the other hand, I think we are grossly underestimating.

Speaker 0

比如,我现在最常问的问题是,是否存在泡沫?

Like, there's the most common question I ask right now is, is there a bubble?

Speaker 0

我认为我们严重低估了建设规模。

I think we're grossly underestimating the build out.

Speaker 0

我认为实际需求将远超我们当前的预测。

I think there's gonna be much more needed than what we are putting the projections towards.

Speaker 3

那么接下来的问题是,你认为我们现在处于资本支出周期的哪个阶段?

So that's the follow on question is, where are we, do you think, of the CapEx spend cycle?

Speaker 3

但更重要的是,你们内部使用哪些信号来指导决策?

But more importantly, what are the signals that you guys use internally, right, in your thinking?

Speaker 3

我是说,数据中心这类设施必须提前四五年规划。

I mean, you have to plan data centers, whatever, four, five years in advance.

Speaker 3

还得采购核反应堆之类的设备。

You have to buy nuclear reactors and whatnot.

Speaker 3

那么你如何看待需求信号和技术信号?

So how do you think about the demand signals as well as your technology signals?

Speaker 3

Jeetu,对你来说也是同样的问题,但要从企业和新云等角度考虑。

And, Jeetu, the same thing for you, but from the point of view of enterprise and neo cloud, etcetera.

Speaker 3

Amit?

Amit?

Speaker 1

我认为我们尚处于周期早期,尤其是相对于当前观察到的需求而言。

We're early in the cycle is what I would say, certainly relative to the demand that we're seeing.

Speaker 1

我们的内部用户——我们研发TPU已有十年,目前第七代产品已投入内外部使用。

Our internal users are we've been building TPUs for ten years, so we have now seven generations in production for internal and external use.

Speaker 1

我们七八年前生产的TPU仍保持100%的使用率。

Our seven and eight year old TPUs have 100% utilisation.

Speaker 1

这充分说明了市场需求。

That just shows what the demand is.

Speaker 1

当然每个人都想用最新一代产品,但能拿到什么就用什么。

Everyone would of course prefer to be on the latest generation, but whatever they can get.

Speaker 1

这说明需求非常旺盛,同时也反映出我们不得不拒绝的客户和应用场景。

So this tells me that the demand is tremendous, but also who we're turning away and the use cases that we're turning away.

Speaker 1

不是那种'哦还行吧'的程度。

It's not like, oh yeah, that's kinda cool.

Speaker 1

而是'天哪,我们真的不打算投资这个',而且因为优先级问题根本没有选择余地。

It's, oh my gosh, We're actually not going to invest in this and there's no option because that's where we are on the list.

Speaker 1

在座各位很多人也面临同样情况。

Same with many of you in the room.

Speaker 1

我们正在与在座的各位合作,许多人直接向我表达了感谢。

We're working with many of you in the room and many of yours are telling me directly and thank you.

Speaker 1

我们需要更早行动。

We need more earlier.

Speaker 1

对吧?

Right?

Speaker 1

正如你所说,当前的挑战在于我们受限于电力供应。

Now the challenge here though is, as you said, that we're limited by power.

Speaker 1

我们受限于土地改造。

We're limited by transforming land.

Speaker 1

我们受限于审批流程。

We're limited by permitting.

Speaker 1

我们还受限于供应链中许多物资的备用交付能力。

And we're limited by backup delivery of lots of things in the supply chain.

Speaker 1

所以我担心的是,供应实际上无法像我们期望的那样迅速赶上需求。

So one worry I have is that the supply isn't actually going to catch up to the demand as quickly as we'd all like.

Speaker 1

我在之前的会议中听到关于我们将花费数万亿美元的讨论,我认为这个数字是准确的。

I heard in the previous session some of the discussions of the trillions of dollars that we're gonna be spending, which I think is accurate.

Speaker 1

我不确定我们能否兑现所有这些支票。

I'm not sure that we're gonna be able to cash all those checks.

Speaker 1

换句话说,实际上你们有些钱无法按理想速度全部花出去。

In other words, literally, you all have some money you can't spend it all as fast as you want.

Speaker 1

我认为这种情况会持续三到五年。

I think that's going to extend for three, four, five years.

Speaker 3

哇。

Wow.

Speaker 3

那你们如何处理其中涉及的折旧周期问题呢?

And how do you deal with the depreciation cycles that are involved there?

Speaker 3

需求曲线与折旧周期曲线是否匹配?

Does the demand curve and the depreciation cycle curves match up?

Speaker 1

幸运的是,我们采用的是准时制采购。

Well, fortunately, we buy just in time.

Speaker 1

但好处是硬件设备也正好赶上这个时机。

But the nice thing is just in time for the hardware.

Speaker 1

太空能源设备的折旧周期大约在25到40年之间。

The depreciation cycle for the space power is more like somewhere between twenty five and forty years.

Speaker 1

所以我们享有优势

So we have benefits I

Speaker 0

我认为,如果从网络角度考虑,同时观察企业用户、超大规模运营商以及新兴云服务商,情况会大不相同。

think if you think of on the networking side and you look at both enterprise and the hyperscalers as well as neo clouds, I think the story is quite different.

Speaker 0

企业市场还相当初级,是基于真实基础设施构建的。

So the enterprise is pretty nascent, and it's built out of true infrastructure.

Speaker 0

我完全不认为数据中心会像假设的那样——所有数据中心都将在某个时间点需要重新架设机架,而且每个机架所需的电力水平将与传统数据中心有根本性不同。

I just don't think that the data centers like, if you assume that 100% of the data centers at some point in time will need to get re racked, and you will need a very different level of power requirement per rack that's gonna be there compared to what used to be there in the traditional data centers.

Speaker 0

我只是觉得企业用户的发展程度还远远不够。

I just don't think that the enterprises are far enough along.

Speaker 0

也许少数超大规模企业可能达标,但绝大多数企业还远未达到这个阶段。

Maybe the few enterprises that are at super high scale might be there, but I don't think the enterprises are far enough along.

Speaker 0

超大规模运营商和新兴云服务商则是完全不同的情况。

Hyperscalers and Neo Clouds is a completely different story.

Speaker 0

关于阿明提到的电力、计算和网络资源稀缺性这三大制约因素,我认为当前由于单一地点电力供应不足,数据中心正转向电力资源丰富的地区建设,而非将电力输送到数据中心所在地。

And to Amin's point on this notion of scarcity of power, compute, and network being the three big kind of constraints in this thing, I would say right now that because there's not enough power singularly in one location, data centers are being built where the power is available rather than power being brought to where the data centers are.

Speaker 0

这就是为什么我们看到全球各地都在开展大量建设项目。

And that's why you're seeing a lot of projects that are being built out all throughout the world.

Speaker 0

不过另一个关键点是,我们面临的大部分制约因素在未来很长时期内都将持续存在。

The other point though is the lion's share of the constraints that we're gonna have, I think are gonna be sustainable for a long period of time.

Speaker 0

随着数据中心建设得越来越分散,首先,扩展网络将面临巨大需求,以便机架能获得越来越多的网络资源进行纵向扩展。

And as you have data centers that are being built farther and farther apart, one, there's gonna be a huge demand for scale up networking so that you can have a rack that gets more and more networking for scale up.

Speaker 0

其次,横向扩展也将有很大需求,需要将多个机架和集群连接在一起。

The second is you're gonna have a lot of demand for scale out where you have multiple racks and clusters that need to get connected together.

Speaker 0

但我们刚刚推出了一款新的硅片、芯片和系统,用于跨网络扩展,可以让两个相距800-900公里的数据中心作为一个逻辑数据中心运行。

But we just launched a new piece of silicon as well as a new chip and a system for scale across networking, where you might have two data centers that act as a logical data center that could be up to 800, 900 kilometers apart.

Speaker 0

你会看到这种情况,因为单个地点的电力资源将无法满足集中需求。

And you will see that just because there's not gonna be enough concentration of power in a single location.

Speaker 0

因此必须构建不同的架构体系。

So you'll just have to have different architectures that get built out.

Speaker 3

实际上,这正好引出了我想讨论的下一个话题——系统和网络的未来发展等等。

Actually, that brings us to the next topic that I wanna discuss, the future of systems and networking and so on and so forth.

Speaker 3

谷歌最早(至少是大规模)为网络革命采购了横向扩展的商用服务器。

So Google bought the first or at least large scale, scale out commodity servers in production for the web revolution.

Speaker 3

而现在,英伟达正以不同形式让大型机回归。

And now, NVIDIA is bringing back the mainframe in a different form.

Speaker 3

那么你认为接下来会发生什么?

So what do you think happens next?

Speaker 3

这是否意味着我们需要这种新型的集群范围一致性计算模式,将实现共享内存等功能?

I mean, is this the new style of coherent cluster wide computing that we need and there's going be shared memory and all sorts of things?

Speaker 3

还是你认为这种模式会再次改变?

Or do you think the pattern changes again?

Speaker 1

我认为我们并未完全回归大型机时代,因为人们仍然在这些资源池上运行横向扩展架构。

I don't think we're quite too back to mainframes in that it is still the case that people are running on scale out architectures across these pools.

Speaker 1

换句话说,你拥有GPU或TPU时,并不是在说'那是我的GPU超级计算机'。

In other words, you have GPUs or TPUs, you're not necessarily saying, hey, that's my GPU supercomputer.

Speaker 1

而是在说'我拥有16,384个GPU'。

You're saying I've got 16,384 GPUs.

Speaker 1

是的。

Yep.

Speaker 1

或许我要去获取一些子集。

And maybe I'm going to go grab some subset.

Speaker 1

现在我在许多情况下实现了全面的统一连接,这太棒了。

Now I've got uniform all to all connectivity in many cases, which is fantastic.

Speaker 1

TPU也是如此。

Same with TPUs.

Speaker 1

并不是说我有一个9000芯片的集群,就必须让我的任务适配它。

It's not like I say I have a 9,000 chip pod and I have to make my job fit on that.

Speaker 1

也许我实际上只需要256个。

Maybe I actually only need 256.

Speaker 1

也许我需要10万个。

Maybe I need a 100,000.

Speaker 1

所以我确实认为这种软件横向扩展仍将存在。

So I do think that actually this software scale out is still going to be there.

Speaker 1

不过我要指出两点。

I'll note two things though.

Speaker 1

第一,你说得完全正确,大约二十五年前在谷歌和其他地方同时发生的,确实是计算基础设施的一次变革。

One, you're absolutely right that say about twenty five years ago at Google and other places simultaneously, was really a transformation of computing infrastructure.

Speaker 1

就像那种理念——实际上你会在商用PC上进行横向扩展,那些你可以直接买到的运行Linux系统的机器,这就是你用于磁盘、计算和网络的方式。

Like the notion that actually you would scale out on commodity PCs essentially, those same ones that you could buy off the shelf running a Linux stack and that's what you would do for disk, that's what you would do for compute, that's what you do for networking.

Speaker 1

我的意思是你们都认为这是理所当然的,但这在当时是相当激进的。

I mean you all take it for granted that this is sort of it was radical.

Speaker 1

有很多人认为这是个糟糕的主意,根本行不通。

There are many people who thought that this was a terrible idea, that wasn't going to work.

Speaker 1

我认为当前这个时刻令人兴奋的地方在于,我们将要重新发明——我不是说谷歌——我们将要重新发明计算。

I think the exciting thing about this moment right now is actually that we're going to be reinventing I'm not saying Google, we are going to be reinventing computing.

Speaker 1

五年后,无论从硬件到软件的计算堆栈会变成什么样,都将变得面目全非。

And five years from now whatever the computing stack is from the hardware to the software, it's going to be unrecognizable.

Speaker 1

顺便说下,这种协同设计的存在是有道理的——以我最熟悉的谷歌为例:Bigtable、Spanner、GFS、Borg、Colossus这些系统都与硬件集群的横向扩展架构进行了深度协同设计。

And by the way there was this co design because if you think about it, I'll use Google examples because I know those best: Bigtable, Spanner, GFS, Borg, Colossus they were hand in hand co designed with the hardware cluster scale out architecture.

Speaker 1

如果没有横向扩展软件,你们也不会去开发横向扩展硬件。

And if you wouldn't have done the scale out hardware if you didn't have the scale out software.

Speaker 1

没错。

Yep.

Speaker 1

同样的情况即将在此刻重演。

Same thing is gonna happen in this moment.

Speaker 1

所以我认为大型机的形态将会变得非常非常不同。

So I think actually the mainframe is gonna look very, very different.

Speaker 0

好的。

Okay.

Speaker 0

是的。

Yeah.

Speaker 0

我确实认为市场会对集成系统产生极大需求——目前思科很幸运,我们的业务覆盖了从物理层到语义层的全栈。

I do think there'll be, like, this extreme demand for an integrated system because right now we are very fortunate at Cisco where we do everything from the physics to the semantics.

Speaker 0

你要考虑从芯片到应用的全链路。

You think about the silicon to the application.

Speaker 0

除了功耗之外,关键约束在于这些系统的集成度如何,以及它们是否真能在全栈范围内实现最低损耗的协同工作?

And other than power, one of the constraints is how well integrated are these systems, and do they actually work with the least amount of lossiness across the entire stack?

Speaker 0

因此这种深度集成的水平将变得至关重要。

And so that level of tight integration is gonna be super important.

Speaker 0

这意味着行业必须进化到这样的状态:即使由多家公司分别负责不同组件,我们也必须像单一企业那样协同工作。

And what that means the industry will have to evolve into is we will have to work like one company even though we might actually be multiple companies that actually do these pieces.

Speaker 0

所以当我们与谷歌等超大规模企业合作时,在正式签约前就会开展长达数月的深度设计合作。

And so when we work with hyperscalers like Google or others, there's a deep design partnership that actually goes on for months and months together ahead of time before we actually even do the deal.

Speaker 0

交易一旦完成,当然会有巨大压力确保他们进展迅速。

And then once the deal is done, of course, there's a tremendous amount of pressure to make sure that they're moving pretty fast.

Speaker 0

但我认为行业的力量在于确保你在开放生态系统中运作,而不是成为封闭花园,这在技术栈的每一层都将变得至关重要。

But I think the industry's muscle of making sure that you operate in an open ecosystem and not be a walled garden is gonna get important at every layer of the stack.

Speaker 3

非常棒。

Really great.

Speaker 3

那么让我们稍微拆解一下技术栈。

And so let's talk about the disaggregate the stack a little bit.

Speaker 3

最有趣的话题之一是处理器。

One of the most interesting topic is processors.

Speaker 3

对吧?

Right?

Speaker 3

显然,有一家出色的供应商生产了目前市场份额巨大的优秀处理器。

Clearly, is an amazing vendor producing an amazing processor that has massive market share today.

Speaker 3

对吧?

Right?

Speaker 3

而且我们不断看到初创公司尝试各种处理器架构。

And we see startups all the time doing all sorts of processor architectures.

Speaker 3

你的堡垒里有一个出色的处理器。

You've got an amazing processor inside your fortress.

Speaker 3

你认为处理器领域接下来会发生什么?

What do you think happens next in processor land?

Speaker 1

是的。

Yeah.

Speaker 1

我们是NVIDIA的超级粉丝。

We're huge fans of NVIDIA.

Speaker 1

我们销售大量NVIDIA的产品和芯片。

We we sell a lot of NVIDIA products and chips.

Speaker 1

客户非常喜欢它们。

Customers love them.

Speaker 1

我们也是自己TPU的超级粉丝。

We're also huge fans of our TPUs.

Speaker 1

我认为未来其实非常令人兴奋,实际上我们并没有达到那种'好吧,有TPU、有GPU、有训练芯片或其他什么'的阶段。

I think the future is actually really exciting, and actually we're it's not that I don't think that we've hit the point of, okay, there's TPUs, there's GPUs, there's whatever, trainiums or something else.

Speaker 1

我们正亲眼见证专业化的黄金时代,这是我的观察。

We're really seeing the golden age of specialization, And that's my observation.

Speaker 1

换句话说,以TPU为例(我再次用它举例因为我最熟悉),对于某些计算任务,它的每瓦效能比CPU高出10到100倍,而瓦数才是关键指标。

In other words, if you look at it, a TPU, I'll use that example again because I know it best, for certain computation is somewhere between ten and one hundred times more efficient per watt, and it's this watt that really matters than a CPU.

Speaker 1

这种优势让人难以忽视,对吧?

That's hard to walk away from, right?

Speaker 1

10到100倍的差距。

10 to 100x.

Speaker 1

但我们知道还有其他计算任务,如果能构建更专用的系统——不只是小众计算,而是谷歌经常运行的那些计算任务。

And yet we know that there are other computations that if you built even more specialized systems for, but not just a niche computation, computations that we run a lot of at Google.

Speaker 1

比如服务型任务,或者能受益于更专用架构的代理型工作负载。

Like for example maybe for serving, maybe for agentic workloads that would benefit from an even more specialized architecture.

Speaker 1

所以我认为真正的瓶颈在于:设计和投产一个专用架构到底有多难、需要多长时间?

So I think that actually at one bottleneck is how hard is it and how long does it take to turn around a specialized architecture?

Speaker 1

目前这个过程遥遥无期。

Right now it's forever.

Speaker 1

对全球顶尖团队来说,从概念到实际投产,最快也要两年半——这已经是光速了。

For the best teams in the world, really from concept to live in production, speed of light is two and a half years.

Speaker 1

我是说在一切顺利的情况下。

I mean that's if you nail everything.

Speaker 1

对吧?

Right?

Speaker 1

确实有少数团队能做到这一点。

And there are a few teams that do.

Speaker 1

但你要如何预测两年半后的未来,来构建专用硬件呢?

But how do you predict the future two and a half years out for building specialized hardware?

Speaker 1

所以第一,我认为必须缩短这个周期。

So A, I think we have to shrink that cycle.

Speaker 1

但第二,当发展速度稍缓时(这是必然的),我们必须构建更多专用架构,因为其省电、节约成本和空间的优势实在不容忽视。

But then B, at some point when things slow down a little bit, and they will, I we're gonna have to build more specialized architectures because the power savings, the cost savings, the space savings are just too dramatic to ignore.

Speaker 0

这实际上也会对地缘政治结构产生非常有趣的影响。

And this will actually have a really interesting implication on geopolitical structures as well.

Speaker 0

因为想想中国的现状,他们其实并不生产两纳米芯片。

Because if you think about what's happening in China, China actually doesn't make two nanometer chips.

Speaker 0

他们制造的是七纳米芯片。

They make, you know, seven nanometer chips.

Speaker 0

但关键在于他们拥有无限的电力供应。

And and so if you think about what but they have unlimited amount of power.

Speaker 0

他们还拥有无限的工程人力资源。

And they have unlimited amount of engineering resource.

Speaker 0

因此他们可以在工程层面进行优化,保持七纳米芯片的同时确保无限供电。

And so what they can do is do the optimization on the engineering side, keep the seven nanometer chips, and make sure that they give people unlimited amount of power.

Speaker 0

我们可能采用不同的架构设计——虽然电力资源有限,工程师数量也不及中国充裕。

We might have a different architectural design where you have to get extremely You power don't have as many engineers as you might enjoy in China.

Speaker 0

但我们可以实现两纳米芯片的突破。

And you can actually go to two nanometer chips.

Speaker 0

这些芯片在某些方面可能更节能,但在其他方面可能存在热损耗问题。

But and those might be power efficient in some ways, but they might have thermal lossiness in other ways.

Speaker 0

需要综合考虑各种因素,未来架构将根据地域特点变得更加专业化。

Like, there's a whole bunch of things that have to get factored in, on the architecture that will get more specialized even by geo and by region.

Speaker 0

然后根据监管框架的演变情况,你知道,那个地理范围会如何扩展。

And then depending on how the regulatory frameworks evolve, you know, how that that geo then expands.

Speaker 0

就像如果中国扩展到世界不同地区,与美国扩展到世界不同地区相比,会出现非常不同的架构。

Like if China expands to different regions in the world, you will have a very different architecture that plays out than if America expands to different regions in the world.

Speaker 0

所以这是一个非常有趣的博弈论练习,可以预测未来三年科技领域的总体发展趋势。

So this is a very interesting kind of game theory exercise to go through on what happens in the next three years in in tech in general.

Speaker 0

而目前没有人知道答案。

And no one knows right now.

Speaker 3

是啊。

Yeah.

Speaker 3

这就是我们所处世界的魅力所在。

That's the beauty of the world that we live in.

Speaker 3

没错。

Yeah.

Speaker 3

是的。

Yeah.

Speaker 3

所以我们很快就要用工程师数量除以代币数来衡量系统,就像用瓦特除以代币数一样。

So we'll soon be measuring systems by engineers per token in addition to watts per token.

Speaker 3

好吧。

Alright.

Speaker 3

那么让我们转向另一个话题,这个非常

So let's turn to another topic, which very

Speaker 0

工程师每千瓦。

Engineer per kilowatt.

Speaker 0

工程师每千瓦。

Engineer per kilowatt.

Speaker 0

在美国。

In The US.

Speaker 3

网络,对吧?

Networking, right?

Speaker 3

显然,你提到了扩展规模,横向扩展。

Obviously, you alluded to it, scale up, scale out.

Speaker 3

在你的案例中,你提到了跨规模扩展。

In your case, you mentioned scale across.

Speaker 3

所以在我看来,网络也将以一种相当显著的方式被重新定义。

So it seems to me that networking is also gonna get reinvented in a fairly significant way.

Speaker 3

那么你看到的主要迹象是什么?或者网络将朝着什么方向发展?

So what are the leading signs that you're seeing that and the signals that you're seeing in or the direction networking is gonna take?

Speaker 1

是的。

Yeah.

Speaker 1

网络确实需要转型。

Networking is going to need a transformation for certain.

Speaker 1

换句话说,建筑物内部所需的带宽规模简直惊人。

In other words, the amount of bandwidth that's needed at scale within a building is just astounding.

Speaker 1

我的意思是,需求还在增长。

I mean, it's going up.

Speaker 1

网络正在成为主要瓶颈,这很可怕。

The network is becoming a primary bottleneck, which is scary.

Speaker 1

因此更高的带宽直接转化为更好的性能。

So more bandwidth translates directly to more performance.

Speaker 1

考虑到网络最终实际上是一个小型电力消费者,每瓦特提供的效用。

And then given that the network winds up actually being a small power consumer that delivered utility you get per watt.

Speaker 1

这就像是一种超线性收益。

Like it's a super linear benefit.

Speaker 1

就像在这里投入一点,就能在那里获得更多。

Like spend a little bit here, get way more there.

Speaker 1

因此我认为那一方面确实存在。

So I think that that side is absolutely there.

Speaker 1

我要在此强调,对于这些工作负载,我们实际上清楚网络通信模式是什么,这是我们的优先事项。

I'll put in a plug here in that for these workloads we actually know what the network communication patterns are, our priority.

Speaker 1

所以我认为这是一个巨大的机遇。

So I think this is a massive opportunity.

Speaker 1

换句话说,当你已经知道大致电路走向时,是否还需要数据包交换的全部功能?

In other words do you then need the full power of a packet switch when actually you know what the rough circuits are going to be.

Speaker 1

我并不是说需要构建电路交换机,但这里存在优化空间。

I'm not saying you need to build a circuit switch but there is an optimization opportunity.

Speaker 1

另一个关键点是这些工作负载具有极强的突发性。

The other aspect of this here is these workloads are just incredibly bursty.

Speaker 1

是的。

Yeah.

Speaker 1

正如我们曾撰文所述,电力公司都清楚这点——当我们在进行涉及数十至数百兆瓦规模的计算与网络通信时。

And to the point where, and we've written about this, power utilities know this when we're doing network communication relative to computation at the scale of tens and hundreds of megawatts.

Speaker 1

对吧?

Right?

Speaker 1

比如电力需求激增时突然中断,转而进行网络通信,随后又迅速切回计算模式。

Like massive demand for power, stop all of a sudden and do some network communication, and then burst back to computing.

Speaker 1

那么如何构建一个需要在极短时间内满负荷运行,随后又进入闲置状态的网络?

So how do you build a network that needs to go at 100% for a really short amount of time and then go idle.

Speaker 1

同样道理适用于跨用例场景——我们明确认为,没有人会全年12个月在所有广域数据中心站点持续进行大规模自由训练。

And then same actually for the scale across use case which we're absolutely saying, you don't run large scale free training across all your wide area data center sites twelve months out the year.

Speaker 1

这其实是我经常思考的问题:假设你在这三个数据中心站点部署了最新最强的芯片。

So and then you're gonna this is a problem I think about a lot is let's say you build the latest greatest chips in these three data center sites.

Speaker 1

在迁移到另外三个站点更新的芯片之前,这些设备能服役多久?

How long are you gonna be there before you migrate to the latest latest chips in three other sites?

Speaker 1

那么你如何处理遗留下来的网络呢?

And then what do you do with the network that you left behind?

Speaker 1

人们会在上面运行任务,但你根本不需要像大规模训练(特别是预训练)那样庞大的网络容量。

People are going to run jobs on them but you're not going to need nearly the network capacity that you did for large scale training, pre training anyway.

Speaker 1

所以这种只需要在5%时间使用大规模网络的转变,我不知道该如何构建这样的网络。

So the shift of needing massive networks for like 5% of the time, I don't know how to build a network like that.

Speaker 1

如果你们有人知道,请告诉我。

So if any of you do, let me know.

Speaker 3

我是说,你都不知道怎么构建,现在根本没人知道该怎么建

I mean, you don't know how to build this, there's nobody that knows how We're to build

Speaker 1

正在努力解决这个问题。

trying to figure it out.

Speaker 1

这确实是个引人入胜的难题。

It actually is a fascinating problem.

Speaker 0

是啊。

Yeah.

Speaker 0

我确实认为,如果把电力视为限制条件而算力作为资产,那么网络将成为力量倍增器。

I do think, like, if if you think of if power is the constraint and if compute is the asset, I think network is gonna be the force multiplier.

Speaker 1

嗯。

Mhmm.

Speaker 0

因为你看,如果数据包具有低延迟、低性能和高能耗,那么每节省一千瓦的传输功耗,就能把这一千瓦电力分配给GPU使用。

Because, you know, if a if a packet if if you have low latency and low performance and high energy inefficiency, then the pack the every kilowatt of power you save, moving the packet is a kilowatt of power you can give to the GPU Yeah.

Speaker 0

这非常关键。

Which is, you know, super important.

Speaker 0

另外,当你考虑纵向扩展、横向扩展和跨域扩展时,特别是在推理与训练之间,需要优化的目标也不同。

The the other thing is, you know, when you think about scale up versus scale out versus scale across, you also need, especially on inference versus training, there are different things that get optimized.

Speaker 0

比如在训练过程中,你可能需要更优先优化延迟。

Like, you might optimize for latency much more on training runs.

Speaker 0

你可能会在推理过程中更注重内存优化。

You might optimize much more for memory on inferencing.

Speaker 0

从架构角度来看,我认为网络的发展方向将不再是先建立训练基础设施再应用于推理,而是会逐渐构建专为推理设计的原生基础设施。

There there's there's architectural and so I I also feel like the way that networking will evolve is rather than it being a training infrastructure that then gets applied to inferencing, you might have inferencing native infrastructure that gets built over time.

Speaker 0

因此需要重点考量的是所有架构组件是如何演进的。

And so there's there's good considerations to look at on, like, how all of the architectural components are are moving.

Speaker 0

但在我看来,从战略角度而言,网络领域正在发生的最重大变化之一是:如果你只是博通的封装商,那么你将面临一个极具掠夺性的垄断局面。

But in in my mind, like, if if I were to say strategically one of the biggest things that's happening in networking from our vantage point is if you're just a wrapper around Broadcom, then you've got a monopoly that's gonna be a very predatory one.

Speaker 0

思科之所以至关重要,正是因为你不必生活在一个所有人都只是封装博通芯片的世界里。

And so one of the big reasons where Cisco is super relevant is you don't just have a Broadcom world with people just wrapping Broadcom.

Speaker 0

虽然他们的系统基于博通,但实际上你会有芯片选择权。

I mean, kind of, their systems are on Broadcom, but you will actually have a choice of silicon.

Speaker 0

这种芯片选择的多样性将变得极其重要,特别是对于高吞吐量的消费模式而言。

And that choice and diversity of silicon is gonna be super important, especially for high volume, you know, kind of consumption patterns.

Speaker 3

既然提到系统,最后问个相关问题,然后我们转向用例讨论。

So last question on the systems since you brought that up, we'll move to use cases.

Speaker 3

推理环节——你们两位都提到了,你刚才是在处理器背景下谈的。

Inference, both of you have mentioned I mean, you talked about it in the context of the processors.

Speaker 3

你刚刚开始讨论架构问题。

You just started talking about the architecture.

Speaker 3

你们目前是否在部署专门的推理架构?

Are you deploying today's specific architectures for inference, I mean?

Speaker 3

还是仍在采用共享工作负载模式?

Or is it still shared workloads?

Speaker 1

我们正在部署专为推理设计的架构。

We are deploying specialized architectures for inference.

Speaker 1

我认为软件和硬件同样重要,但硬件也会以不同配置形式部署——这是我的表述方式。

And I think as much software as hardware, but the hardware is also deployed in different configurations is the way I would say it.

Speaker 1

推理中另一个变得非常有趣的方面是强化学习,尤其是在服务关键路径上,因为延迟变得绝对关键。

And then the other aspect of inference that is becoming really interesting is reinforcement learning, especially on the critical path of serving because latency just becomes absolutely critical.

Speaker 1

我认为如何构建系统以及如何相互连接,当然网络在其中扮演关键角色,正变得越来越有趣。

And I think that so how you would build your system and how you would connect it up to one another, and of course networking plays a key role there, becomes increasingly interesting.

Speaker 3

是否存在某些单一瓶颈点,如果移除它们就能加速实现我们所需的推理成本千倍降低,还是说这只是我们正在记录的自然曲线?

And are there singular choke points that if removed would accelerate the thousand fold reduction in the cost of inference that we need, or is this just a natural curve that we are writing down?

Speaker 1

我们的规模非常庞大。

So we're massive.

Speaker 1

我是说,这里的事情。

I mean, things here.

Speaker 1

第一点,可能你们很多人都熟悉这个。

One, again, maybe many of you are familiar with this.

Speaker 1

推理中的预填充和解码阶段看起来非常非常不同。

Prefill and decode on inference look very, very different.

Speaker 1

所以实际上,理想情况下应该使用不同的硬件。

So actually, ideally, you would have different hardware, actually.

Speaker 1

平衡点是不同的。

The balance points are different.

Speaker 1

所以这是一个机会。

So that's one opportunity.

Speaker 1

它也有缺点。

It comes with downsides.

Speaker 1

我们可以讨论这个问题。

We can talk about that.

Speaker 1

不过我想说的是,可能有些人没有意识到,我们实际上正在推动推理成本的大幅降低。

What I would say though is that maybe something people don't realize is that we're actually driving massive reductions in the cost of inference.

Speaker 1

我的意思是10倍甚至100倍的降低。

I mean 10 Xs and 100 Xs.

Speaker 1

问题或机遇在于社区,用户群体不断要求更高的质量,而非更高的效率。

The problem or opportunity is the community, the user base keeps demanding higher quality, not better efficiency.

Speaker 1

因此,就在我们实现所有预期的效率提升后,新一代模型又出现了,其单位成本的智能水平大幅提升,但价格仍比上一代更高、成本更贵。

So just as soon as we deliver all the efficiency improvements we're looking for, the next generation model comes out and it is the whatever intelligence per dollar is way better, but you still pay more and it costs more relative to the previous generation.

Speaker 1

然后我们又重复这个循环。

And then we repeat the cycle.

Speaker 0

这几乎就像你的推理链条越长,市场就越没耐心。

And it's almost like the longer the reasoning that you have, the more impatient the market gets.

Speaker 0

对吧?

Right?

Speaker 0

例如,如果你有二十分钟的推理周期,比如深度研究时,可以实现约二十分钟的自主执行。

So for example, if you have a twenty minute reasoning cycle, like for example, with deep research, you could have autonomous execution for about twenty minutes.

Speaker 0

这很有趣。

That was interesting.

Speaker 0

现在大多数编程工具已经可以实现七小时到三十小时的持续自主执行。

Now you have, you know, most of the coding tools that can go up to seven hours to thirty hours of, you know, duration of autonomous execution.

Speaker 0

当这种情况发生时,市场反而更强烈要求压缩时间。

When that happens, there's actually a greater demand for saying compress the time down.

Speaker 0

于是这就形成了一个自我实现的预言:正因为你能实现更长时间的自主执行,反而需要更强的性能。

And so you lack it's it's kind of a self fulfilling prophecy where you need to have more performance because of the fact that you've been able to go out and do things for a longer autonomous amount of time.

Speaker 0

这几乎是个永无止境的循环,你将永远需要更强的推理性能。

And so it's almost a never ending loop where you'll you'll need to have more performance for inference Yeah.

Speaker 0

永无止境。

In perpetuity.

Speaker 3

是啊。

Yeah.

Speaker 3

不过,单位成本的智能水平是商业模式指标,而不仅仅是处理器能力的体现。

Though, intelligence intelligence per dollar is a business model metrics metric, so it is not just a processor capability.

Speaker 1

不。

No.

Speaker 1

这是端到端的。

It's end to end.

Speaker 1

绝对是的。

Absolutely.

Speaker 3

对。

Yeah.

Speaker 3

那好吧。

So okay.

Speaker 3

那我们换个话题,聊聊实际应用吧。

So let's change topics and talk about actual usage.

Speaker 3

对吧?

Right?

Speaker 3

你们两位都管理着庞大的组织。

So both of you have massive organizations.

Speaker 3

在运用现有AI技术方面,你们目前取得的关键性成果有哪些?

Where are the key wins that you're getting today with with applying all the AI that's available to you?

Speaker 3

然后我们再谈谈客户的应用,但我其实更想知道你们内部的实施情况。

And then we'll talk about what your customers are doing, but I'm actually curious about what you're doing internally.

Speaker 1

在团队内部吗?

Within the teams?

Speaker 1

是的。

Yeah.

Speaker 1

我是说,编码是最明显的应用领域,而且这方面的应用正在获得越来越多的关注和能力提升。

So I mean, coding is the obvious one, and that's actually picking up increasing traction and increasing capability.

Speaker 1

实际上就在前几天,我们刚发表了一篇论文,展示了如何运用AI技术实现指令集迁移。

We just actually in the last couple of days published a paper that showed how we applied AI techniques to do instruction set migration.

Speaker 1

换句话说,我们实际上经历了从x86到ARM的大规模迁移,使得我们整个代码库——在谷歌这是一个非常庞大的代码库——变得与指令集无关,并且能够适应未来可能出现的第五代风险或其他任何情况。

So in other words we actually had a fairly massive migration from x86 to ARM, making our entire code base, and at Google it's a very very large code base sort of instruction set agnostic and including to you know future risk five or whatever else might come along.

Speaker 1

数以万计,数十万的独立个体

Tens and thousands, hundreds of thousands of individual Your

Speaker 3

整个代码库,你要让它变得与平台无关?

entire code base, you're going make it agnostic?

Speaker 1

整个代码库,因为我们希望需要所有代码库都能

Entire code base because we we want want to need all of our code base to be

Speaker 3

老兄,这项目也太疯狂了。

Man, that's a crazy ass project.

Speaker 1

是啊。

Yeah.

Speaker 1

所以,我们确实这么做了。

So so we we it it was.

Speaker 1

不过这个计划的动机其实源于几年前。

And the the motivation, though, for this actually was a few years ago.

Speaker 1

我们当时有一个了不起的旧系统叫Bigtable,后来又开发了更出色的新系统Spanner。

We had this amazing legacy system called Bigtable and then a new amazing system called Spanner.

Speaker 1

于是我们决定告诉全公司:所有人都需要从Bigtable迁移到Spanner。

And we decided to tell the company, hey everyone needs to move from Bigtable to Spanner.

Speaker 1

Bigtable在其时代非常出色,但Spanner更优秀。

And by the Bigtable was amazing for its time but Spanner was better.

Speaker 1

谷歌估算完成这次迁移需要七千人年的工作量。

The estimate from doing that migration for Google was seven staffed millennia.

Speaker 3

多少?

How much?

Speaker 0

怎么

How

Speaker 1

多少?

much?

Speaker 1

七千年的员工配置。

Seven staffed millennia.

Speaker 1

当时我们有个新部门,必须实际考察情况,结果发现并不是员工偷懒编造的借口。

That that we we had a new unit that we had to actually to see what and and it was it wasn't, like, made up people being lazy.

Speaker 1

就像是,这就是我

It's like, this is what I

Speaker 0

在做的事。

was doing.

Speaker 0

不过他们能想到这点还挺可爱的。

It's endearing that they came up with that though.

Speaker 1

你猜我们最后怎么决定的?

And you know what we decided?

Speaker 1

Bigtable万岁。

Long live Bigtable.

Speaker 1

我决定什么?

I decided, what?

Speaker 1

老实说就是觉得不值得。

It just wasn't worth it, honestly.

Speaker 1

机会成本太高了。

Like the opportunity cost was too high.

Speaker 1

所以我们做了这类迁移,比如从TensorFlow转到JAX。

So and we have these sorts of migrations, TensorFlow to JAX.

展开剩余字幕(还有 80 条)
Speaker 1

实际上...这个有点内部信息,不过AI辅助让我们的内部效率提升了好几个整数倍。

We actually, I mean, again, somewhat private but not We've not too affected this internally with AI assist one, integer factors faster.

Speaker 1

当然还有些任务目前工具可能达不到标准,但技术覆盖范围正在不断扩大。

Now there are other tasks which the tools probably aren't quite yet up to the whatever standard for, but the the area under the curve is getting bigger and bigger and bigger.

Speaker 0

我们目前可能看到了三四个非常不错的应用场景,但同时也发现一些尚未奏效的用例。

So we are seeing probably like three or four really good use cases, and then we are seeing some use cases which are not working yet.

Speaker 0

就现有成果而言,代码迁移工作进展相对顺利。

And so what is working, code migrations are working relatively well.

Speaker 0

目前我们主要使用Codecs、Claude和Cursor的组合方案,也会用到一些Windsurf。

So far, we use largely a combination of Codecs, Claude, and Cursor, some Windsurf.

Speaker 0

因此,代码迁移通常都能取得很好的效果。

And so, code migrations tends to work pretty well.

Speaker 0

出乎意料的是,调试工作在这些工具支持下效率极高,特别是配合CLI使用时。

Debugging, oddly enough, has actually been very, very productive with with these tools, especially with CLIs.

Speaker 0

我们表现欠佳的领域是前端从零到一的项目,这类项目往往能取得极佳效果。

The where we've not done as good a job and then front end zero to one projects tend to do extremely well.

Speaker 0

工程师们的工作效率非常高。

Like, the engineers are super productive.

Speaker 0

但当你处理年代较久的代码,尤其是基础设施栈底层的部分时,推进起来就困难得多。

When you go to code that's older, and especially further down in the infrastructure stack, much harder to go out and get that to happen.

Speaker 0

我们面临的核心挑战是要引导工程师们认识到:这本质上是个文化重塑问题而非单纯的技术问题。如果有人使用工具后反馈存在问题,你不能把它束之高阁半年九个月不闻不问。

But the challenge that we have to orient our engineers on, this is actually much more of a cultural reset problem than it is a just a technical problem, which is if someone uses something and says this isn't working right, you can't put it back on the shelf saying this doesn't work for another six or nine months.

Speaker 0

必须在四周内重新评估其可用性。

You have to come back to it within four weeks and see if it works again.

Speaker 0

因为这些工具的进化速度实在太快了——今天我面对150位杰出工程师时强调:必须预设这些工具在六个月内会有质的飞跃。

Because the speed at which these tools are kind of advancing is so fast that you almost have to kind of get like, so I was with a 150 of our distinguished engineers today, and what I had to urge them to do is assume that these tools are gonna get infinitely better within six months.

Speaker 0

没错。

Yep.

Speaker 0

关键是要建立对未来六个月工具发展形态的认知框架,规划如何在此期间保持行业领先,而不是基于当前水平评估后就搁置半年,想当然认为工具在此期间不会进步。

And make sure that you get your mental model the way that tool is gonna be in six months and what are you gonna do to be best in class in six months rather than assessing it for where it is today and then putting it aside for six months, assuming that that's not gonna work for the next six months.

Speaker 0

我认为这是个重大的战略失误。

I think that's a big strategic error.

Speaker 0

我们拥有25000名工程师。

So and we've got 25,000 engineers.

Speaker 0

我希望我们能在明年短时间内将生产力提升至少两到三倍。

I'm hoping that we can get at least two or three x productivity within a very short amount of time within the next year.

Speaker 0

然后我们就能看到如果这种情况发生会怎样。

And we we else we we will be able to see what if if that happens.

Speaker 0

第二个方面,我们开始看到良好反馈的另一个重要领域是销售。

The second like, a couple other big areas that we are starting to see some good responses is in sales.

Speaker 0

客户拜访前的准备工作,效果非常好。

Preparation going into an account call, really good.

Speaker 0

法律合同审查,实际效果比我们预想的要好得多。

Legal contract reviews, actually much better than what we had thought.

Speaker 0

最后一个是产品营销,虽然推理量不算特别大。

And then the last one is not super high inference volume, but product marketing.

Speaker 0

我认为ChatGPT首次提出的竞品分析总是比任何产品营销人员独立提出的要好。

I think the first ChatGPT take on competitive is always better than what my any product marketing person comes up by themselves.

Speaker 0

所以我们永远不应该从零开始。

So we should never start from a white slate.

Speaker 0

直接从ChatGPT开始,然后在此基础上推进。

Just start from ChatGPT and then go from there.

Speaker 3

好的。

Okay.

Speaker 3

这个话题我们可以讨论很久,但他们给我看了两分钟倒计时提示。

Well, we could be talking about the topic for a long time, but they showed me the two minute warning.

Speaker 3

所以我想在这里聚焦最后一个问题。

So I wanna focus on one last question here.

Speaker 3

现场有很多创始人,对吧,都在创建了不起的公司。

So we got a lot of founders here, right, building amazing companies.

Speaker 3

那么在未来一个日历年度,或者说未来十二个月内,他们最应该期待的有趣发展是什么?

So what is the most interesting development they should look forward to in the next calendar year, let's call it, or the next twelve months?

Speaker 3

从你们公司(A)和整个行业(B)的角度来看,如果你要预测的话。

A, from your company and b, from the industry, if you are looking at your crystal ball.

Speaker 1

我想接着这个话题说,这些模型每个月都在变得更惊人,接下来无论哪家公司都会推出许多令人兴奋的成果,包括我们。

I I think to build on the point, these these models are getting more spectacular by the by the month, and then they'll be from whatever companies you like a bunch of really exciting, including ours.

Speaker 3

哦,天啊。

Oh, man.

Speaker 3

忘了说,你不能说模型会变得更好。

Forgot to say, you're not allowed to say models will get better.

Speaker 1

是啊。

Yeah.

Speaker 1

大家都知道。

Everybody knows.

Speaker 1

模型会变得...是啊。

The models are going to get Yeah.

Speaker 1

但我想说的是,它们正在变得好到可怕。

But I mean, they're getting scary good is the part that I would say.

Speaker 1

但我认为基于它们构建的智能体以及实现这一目标的框架,也同样在变得好到可怕。

But I think that then the agents that get built on top of them and the frameworks for making that happen are also getting scary good.

Speaker 1

因此在未来十二个月内,让事情长期保持正确运行的能力将会是变革性的。

So the ability to have things go quite right for quite long over the coming twelve months is gonna be transformative.

Speaker 3

我在想,你想透露些路线图的内容吗?

I'm gonna think, do you wanna leak any aspect of your road map?

Speaker 1

未来十二个月?

Next twelve months?

Speaker 1

现在还...现在还不行。

Not not so not right now.

Speaker 1

是的。

Yeah.

Speaker 1

好的。

Okay.

Speaker 0

你们俩呢?

Do you two?

Speaker 0

我...我想说的是...最大的转变,也是我强烈建议初创公司做的,就是不要在别人的模型基础上只做一层薄薄的封装。

I I I'd say the the the big shift and what I would urge startups to do is don't build thin wrappers around models that are other people's models.

Speaker 0

我认为模型与产品紧密结合,并且随着产品反馈不断优化,这种组合将会变得极其重要。

I think the the the combination of a model working very closely with the product and the model getting better as there's feedback in the product is gonna be super important.

Speaker 0

所以你们需要基础模型,但如果只是做一层浅封装,我认为企业的持久性会非常非常...你知道...短暂。

So you are gonna need foundation models, but if you just have a thin wrapper, I think the durability of your business will be very, very, you know, short lived.

Speaker 1

那么

So

Speaker 0

这...这就是我想...想强烈建议你们的一点。

that that would be something that I would I would urge you on.

Speaker 0

我认为某种智能路由层会很有用——它能决定哪些任务用自有模型,哪些可能用基础模型,并持续动态优化。Cursor在这方面做得就很好。

And I think that intelligent routing layer of some sort that says I'm gonna use my models for these things, I'm gonna probably use foundation models for other things, and dynamically keep optimizing will be I think Cursor does that pretty well.

Speaker 0

但这...这会成为软件开发生命周期演进的好方向。

But that that'll be a good way that the the software development life cycle will evolve.

Speaker 0

你们对思科的期待应该是...说实话,长期以来人们认为思科是家传统企业。

What you should expect from Cisco is look, truth be told, for the longest time, people thought Cisco was a legacy company.

Speaker 0

但...我认为业务已经形成了一定势头。

Like, there there has been I think there's a level of momentum in the business.

Speaker 0

员工队伍焕发着活力与干劲。

There's a spring and a step in the employee base.

Speaker 0

所以正如我所说,从物理层到语义层,从硅片到应用——在硅芯片、网络、安全、可观测性、数据平台以及应用等各个层面,你们都将会看到我们带来相当多的创新。

So, you should expect, like I said, from the physics to the semantics in every layer from silicon to the application, A fair amount of innovation in silicon, and networking, and security, and observability, and the data platform, as well as applications, you know, from us.

Speaker 0

我们非常期待与初创企业生态系统合作,如果你有意向与我们合作,请务必联系我们。

And we're excited to work with the startup ecosystem, and so if you if you ever feel like you wanna work with us, make sure that you reach out to us.

Speaker 3

阿米特,你要说点什么吗?

Are you gonna say something, Amit?

Speaker 1

我想强调模型的一个方面,就是大约两年前、三年前的文本模型,它们当时还挺有趣的。

I mean, one aspect that I I wanna highlight about the models is where we were with, let's say, text models two and a half, three years ago, they were fun.

Speaker 1

比如,让它写一首关于马丁的俳句。

Like, hey, write me a haiku about Martin.

Speaker 1

完成得很出色。

Did a great job.

Speaker 1

现在它们已经令人惊叹了。

Now they're amazing.

Speaker 1

我认为未来十二个月内,图像和视频的输入输出模型将会发生同样的进步。

I think that what's gonna happen in the next twelve months is the same thing is gonna be happening with input and output of images and video to these models.

Speaker 1

甚至对于图像,想象它们作为生产力和教育工具,而不仅仅是生成马丁内斯超人那样的酷炫图片。

And to the extent that even for images, imagine them as productivity and educational tools, not just, okay, here's Martinez Superman on a like, that's cool too.

Speaker 1

对吧?

Right?

Speaker 1

但将其用于提升生产力和学习,我认为将会带来真正革命性的变化。

Using But it for productivity gains and learning, I think, is gonna be really, really transformative.

Speaker 3

太棒了。

Awesome.

Speaker 3

那么我们就此结束本次会议。

So on that note, we're end this session.

Speaker 3

感谢这次精彩的对话,阿明。

Thanks for a great conversation, Amin.

Speaker 3

谢谢,你也是。

Thanks, you too.

Speaker 2

感谢收听本期a16z播客节目。

Thanks for listening to this episode of the a 16 z podcast.

Speaker 2

如果您喜欢本期节目,请记得点赞、评论、订阅、给我们评分或留言,并与亲朋好友分享。

If you like this episode, be sure to like, comment, subscribe, leave us a rating or a review, and share it with your friends and family.

Speaker 2

更多节目内容请前往YouTube、Apple Podcasts和Spotify平台观看。

For more episodes, go to YouTube, Apple Podcasts, and Spotify.

Speaker 2

在X平台关注我们@a16z,并订阅我们的Substack专栏a16z.substack.com。

Follow us on x at a sixteen z, and subscribe to our Substack at a16z.substack.com.

Speaker 2

再次感谢您的收听,我们下期节目再见。

Thanks again for listening, and I'll see you in the next episode.

Speaker 2

温馨提示:本节目内容仅供信息参考,不应视为法律、商业、税务或投资建议,也不用于评估任何投资或证券,且不针对任何a16z基金的现有或潜在投资者。

As a reminder, the content here is for informational purposes only, should not be taken as legal business, tax, or investment advice, or be used to evaluate any investment or security, and is not directed at any investors or potential investors in any a sixteen z fund.

Speaker 2

请注意,a16z及其关联机构可能持有本播客讨论公司的投资权益。

Please note that a sixteen z and its affiliates may also maintain investments in the companies discussed in this podcast.

Speaker 2

更多详情(包括我们的投资链接)请参见a16z.com/disclosures页面。

For more details, including a link to our investments, please see a 16z.com forward /disclosures.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客