Lenny's Podcast: Product | Career | Growth - “工程师正在变成魔法师” | OpenAI的Sherwin Wu谈软件开发的未来 封面

“工程师正在变成魔法师” | OpenAI的Sherwin Wu谈软件开发的未来

“Engineers are becoming sorcerers” | The future of software development with OpenAI’s Sherwin Wu

本集简介

吴雪雯负责领导OpenAI API平台的工程团队,该平台约95%的工程师使用Codex,通常同时运作10至20个并行AI代理。 我们探讨了以下话题: 1. OpenAI如何将代码审查时间从10-15分钟缩短至2-3分钟 2. AI如何改变管理者的角色 3. 为何AI高级用户与普通用户间的生产力差距正在扩大 4. 为什么说"模型将吞噬你的脚手架" 5. 未来12到24个月为何是工程师实现跨越式发展的关键窗口期 —— 本期节目由以下品牌赞助: DX——由顶尖研究者设计的开发者智能平台 Sentry——代码崩溃时,更快修复 Datadog——现拥有领先的实验与功能标记平台Eppo —— 完整文字稿:https://www.lennysnewsletter.com/p/engineers-are-becoming-sorcerers —— Lenny播客文字稿存档:https://www.dropbox.com/scl/fo/yxi4s2w998p1gvtpu4193/AMdNPR8AOw0lMklwtnC0TrQ?rlkey=j06x0nipoti519e0xgm23zsn9&st=ahz0fj11&dl=0 —— 吴雪雯联系方式: • X:https://x.com/sherwinwu • LinkedIn:https://www.linkedin.com/in/sherwinwu1 —— Lenny联系方式: • 电子报:https://www.lennysnewsletter.com • X:https://twitter.com/lennysan • LinkedIn:https://www.linkedin.com/in/lennyrachitsky/ —— 本期内容时间轴: (00:00) 吴雪雯介绍 (03:10) AI在OpenAI编程中的角色 (06:53) AI时代的软件工程未来 (12:26) 管理AI代理的压力 (15:07) Codex与代码审查自动化 (19:29) 工程经理角色的演变 (24:14) 单人十亿美元初创企业的可能性 (31:40) 管理经验分享 (37:28) AI部署的挑战与最佳实践 (43:56) 关于AI与用户反馈的犀利观点 (48:57) 为未来AI能力构建产品 (50:16) 未来18个月模型发展方向 (53:35) 业务流程自动化 (57:22) OpenAI生态系统与平台战略 (01:00:50) OpenAI的使命与全球影响 (01:05:21) 基于OpenAI API与工具的开发 (01:08:16) 快问快答与最终思考 —— 相关引用: • Codex:https://openai.com/codex • OpenAI CPO谈AI如何改变必备技能、护城河、编程等|Kevin Weil:https://www.lennysnewsletter.com/p/kevin-weil-open-ai • OpenClaw:https://openclaw.ai • Clawd创造者:"我发布没读过的代码":https://newsletter.pragmaticengineer.com/p/the-creator-of-clawd-i-ship-code • 《魔法师的学徒》:https://en.wikipedia.org/wiki/The_Sorcerer%27s_Apprentice_(Dukas) • Quora:https://www.quora.com • Marc Andreessen:真正的AI热潮尚未开始:https://www.lennysnewsletter.com/p/marc-andreessen-the-real-ai-boom • Sarah Friar的LinkedIn:https://www.linkedin.com/in/sarah-friar • Sam Altman的X账号:https://x.com/sama • Nicolas Bustamante的"LLMs吞噬脚手架"推文:https://x.com/nicbstme/status/2015795605524901957 • 《苦涩的教训》:http://www.incompleteideas.net/IncIdeas/BitterLesson.html • 奥弗顿之窗:https://en.wikipedia.org/wiki/Overton_window • 开发者现可向ChatGPT提交应用:https://openai.com/index/developers-can-now-submit-apps-to-chatgpt • Responses API文档:https://platform.openai.com/docs/api-reference/responses • Agents SDK指南:https://platform.openai.com/docs/guides/agents-sdk • AgentKit:https://openai.com/index/introducing-agentkit • Ubiquiti:https://ui.com • Crunchyroll上的《咒术回战》:https://www.crunchyroll.com/series/GRDV0019R/jujutsu-kaisen?srsltid=AfmBOoqvfzKQ6SZOgzyJwNQ43eceaJTQA2nUxTQfjA1Ko4OxlpUoBNRB • eero:https://eero.com • Opendoor:https://www.opendoor.com —— 推荐书籍: • 《计算机程序的构造与解释》:https://www.amazon.com/Structure-Interpretation-Computer-Programs-Engineering/dp/0262510871 • 《人月神话:软件工程论文集》:https://www.amazon.com/Mythical-Man-Month-Software-Engineering-Anniversary/dp/0201835959 • 《不存在反记忆部门》:https://www.amazon.com/There-No-Antimemetics-Division-Novel/dp/0593983750 • 《疾驰:中国塑造未来的工程之路》:https://www.amazon.com/Breakneck-Chinas-Quest-Engineer-Future/dp/1324106034 • 《苹果在中国:全球最伟大公司的陷落》:https://www.amazon.com/Apple-China-Capture-Greatest-Company/dp/1668053373 —— 节目制作与营销由https://penname.co/负责。赞助合作请联系podcast@lennyrachitsky.com。 —— Lenny可能对讨论的公司持有投资。 更多内容请访问www.lennysnewsletter.com

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

95%的工程师使用Codex。

95% of engineers use Codex.

Speaker 0

我们所有的拉取请求都由Codex审查。

100% of our PRs are reviewed by Codex.

Speaker 1

对于工程师来说,我不知道过去几年还有什么工作变化这么大。

For engineers, I don't know what job has changed more in the past couple years.

Speaker 0

工程师正在成为技术负责人。

Engineers are becoming tech leads.

Speaker 0

他们正在管理大量的代理。

They're managing fleets and fleets of agents.

Speaker 0

这真的感觉像我们是巫师,施放着各种法术,而这些法术就像是出去为你做事。

It literally feels like we're wizards casting casting all these spells, and these spells are kinda like going out and doing things for you.

Speaker 1

你认为人们还没有考虑到哪些因素?

What do you think people aren't pricing in yet?

Speaker 0

一个人打造十亿美元初创公司的第二或第三阶影响。

The second or third order effects of the one person billion dollar startup.

Speaker 0

要实现一个人的十亿美元初创公司,可能会有上百个小型初创公司正在开发定制化软件。

To enable a one person billion dollar startup, there might be a 100 other small startups building bespoke software.

Speaker 0

所以我认为我们可能会进入一个B2B SaaS的黄金时代。

So I think we might actually enter into a golden age of b to b SaaS.

Speaker 1

我越来越常听到,当人们的代理无法工作时,他们会感到压力。

I've been hearing more and more there's this stress people feel when their agents aren't working.

Speaker 0

目前有一支团队正在与OpenAI合作进行一项实验,他们维护着一个100%由Codex编写的代码库。

There's a team that's actually doing an experiment right now with an OpenAI where they are maintaining a 100% codex written code base.

Speaker 0

他们遇到了你所描述的那些确切问题。

They run into the exact problems that you're describing.

Speaker 0

所以通常,他们会说:好吧。

And so usually, they're like, alright.

Speaker 0

我卷起袖子,自己搞定它。

I'll roll up my sleeves and figure it out.

Speaker 0

这支团队没有这种退路。

This team doesn't have that escape hatch.

Speaker 1

你提到,在人工智能领域,倾听客户并不总是正确的策略。

You've shared that listening to customers is not always the right strategy in AI.

Speaker 0

这个领域以及模型本身变化得太快了。

The field and the models themselves are just changing so, so quickly.

Speaker 0

它们往往会自我颠覆。

They tend to, like, disrupt themselves.

Speaker 0

模型会把你的脚手架当早餐吃掉。

The models will eat your scaffolding for breakfast.

Speaker 1

你对那些觉得‘我不想错过这班车’的人有什么建议?

What's your advice to folks that are like, okay.

Speaker 1

我不想错过这班车?

I don't wanna miss the boat?

Speaker 0

确保你构建的是面向模型未来发展方向,而不是它们今天的状态。

Make sure you're building for where the models are going and not where they are today.

Speaker 0

我们这里的科学副总裁凯文·怀尔有一句名言。

There's a quote from Kevin Whale, our VP of Science here.

Speaker 0

他喜欢说,这是模型有史以来最差的时刻。

He likes saying this is the worst the models will ever be.

Speaker 1

今天,我的嘉宾是OpenAI API和开发者平台的工程负责人吴雪文。

Today, my guest is Sherwin Wu, head of engineering for OpenAI's API and developer platform.

Speaker 1

鉴于几乎每个AI初创公司都集成OpenAI的API,吴雪文对当前发生的事情和未来趋势有着独特而广泛的视角。

Considering that essentially every AI startup integrates with OpenAI's APIs, Sherwin has an incredibly unique and broad view into what is going on and where things are heading.

Speaker 1

在短暂介绍我们的精彩赞助商之后,我们马上开始。

Let's get into it after a short word from our wonderful sponsors.

Speaker 1

本期节目由DX赞助播出,DX是由顶尖研究人员打造的开发者智能平台。

Today's episode is brought to you by DX, the developer intelligence platform designed by leading researchers.

Speaker 1

要在AI时代取得成功,组织需要快速适应。

To thrive in the AI era, organizations need to adapt quickly.

Speaker 1

但许多组织领导者难以回答这些紧迫的问题:哪些工具正在发挥作用?

But many organization leaders struggle to answer pressing questions like: Which tools are working?

Speaker 1

它们是如何被使用的?

How are they being used?

Speaker 1

真正推动价值的是什么?

What's actually driving value?

Speaker 1

DX 提供了领导者在应对这一转变时所需的数据和洞察。

DX provides the data and insights that leaders need to navigate this shift.

Speaker 1

通过 DX,Dropbox、Booking.com、Adyen 和 Intercom 等公司能够深入了解 AI 如何为开发者创造价值,以及 AI 对工程生产力的影响。

With DX, companies like Dropbox, booking.com, Adyen, and Intercom get a deep understanding of how AI is providing value to their developers and what impact AI is having on engineering productivity.

Speaker 1

要了解更多信息,请访问 DX 的网站:getdx.com/leni。

To learn more, visit DX's web site at getdx.com/leni.

Speaker 1

网址是 getdx.com/leni。

That's getdx.com/leni.

Speaker 1

应用程序会以各种方式出问题。

Applications break in all kinds of ways.

Speaker 1

崩溃、变慢、性能倒退,以及只有真实用户使用后才会出现的问题。

Crashes, slowdowns, regressions, and the stuff that you only see once real users show up.

Speaker 1

Sentry 能捕捉到所有这些问题。

Sentry catches it all.

Speaker 1

查看错误发生的时间、地点和原因,甚至能追溯到引入错误的提交、负责发布代码的开发者以及具体的代码行,所有信息都在一个连贯的视图中呈现。

See what happened, where, and why, down to the commit that introduced the error, the developer who shipped it, and the exact line of code all in one connected view.

Speaker 1

我确实试过用五个标签页加Slack线程的方式来调试。

I've definitely tried the five tabs and Slack thread approach to debugging.

Speaker 1

这种方式更好。

This is better.

Speaker 1

Sentry 会向你展示请求的流转过程、哪些操作被执行了、哪些环节变慢了,以及用户实际看到的情况。

Sentry shows you how the request moved, what ran, what slowed down, and what users saw.

Speaker 1

SEER,Sentry 的 AI 调试代理,将在此基础上继续推进。

SEER, Sentry's AI debugging agent, takes it from there.

Speaker 1

它利用所有这些 Sentry 上下文信息,告诉你根本原因、提出修复建议,甚至为你自动生成拉取请求。

It uses all of that Sentry context to tell you the root cause, suggest a fix, and even opens a PR for you.

Speaker 1

它还会审查你的拉取请求,标记任何破坏性变更,并提供现成的修复方案。

It also reviews your PRs and flags any breaking changes with fixes ready to go.

Speaker 1

立即免费试用 Sentry 和 SEER,访问 sentry.io/lenny 并使用代码 Lenny 获得 100 美元的 Sentry 信用额度。

Try Sentry and SEER for free at sentry.io/lenny and use code Lenny for $100 in Sentry credits.

Speaker 1

那是 sentry.io/lenny。

That's sentry.io/lenny.

Speaker 1

谢尔温,非常感谢你来到这里,欢迎来到本播客。

Sherwin, thank you so much for being here, and welcome to the podcast.

Speaker 0

谢谢。

Thank you.

Speaker 0

谢谢邀请我。

Thank you for having me.

Speaker 1

我想从一个衡量AI进展的指标开始,尤其是在工程领域。

I wanna start with what's feeling like a barometer of progress in AI, especially in engineering.

Speaker 1

到目前为止,你和你的团队编写的代码中,有多少比例是由AI生成的?

What percentage of your code, if you even write code anymore, and your team's code is written by AI at this point?

Speaker 0

我现在偶尔还是会写代码。

I do write code occasionally now still.

Speaker 0

实际上,对于我这样的管理者来说,现在使用这些AI工具比手动编写代码要容易得多。

I'd actually say for managers like myself, it's way easier to use these AI tools than to manually code at this point.

Speaker 0

因此,对我自己和OpenAI的其他一些工程经理来说,我们现在所有的代码都是由Codex生成的。

And so I know for myself and some of the other EMs, engineering managers at OpenAI, all of our code is written by by Codex at this point.

Speaker 0

但更广泛地说,现在内部充满了巨大的能量。

But more broadly, there's just been this there's just so much energy.

Speaker 0

我们内部能明显感受到这些工具已经取得了多大的进步,Codex作为工具对我们来说已经变得多么出色。

There's like a tangible energy internally around just how far these tools have gotten, how good codecs as a tool has gotten for us.

Speaker 0

而且我们很难精确衡量有多少代码是写出来的,因为绝大多数代码,我想说接近100%,最初都是由AI生成的。

And it's it's a little hard for us to exactly measure how much of the code is is written because the vast majority of it, I'd say, like, close to a 100% is is usually generated by AI first.

Speaker 0

不过我们确实追踪的是,目前绝大多数工程师每天都使用Codex。

What we do track though is is, you know, at this point, the vast majority of engineers use Codecs on a daily basis.

Speaker 0

所以95%的工程师都会使用Codex。

So 95% of engineers use Codecs.

Speaker 0

我们所有的PR每天也都会由Codex进行审查。

100% of our PRs are reviewed by Codecs daily as well.

Speaker 0

基本上,任何进入生产环境并被合并的代码,Codex都会关注,并在PR中提出改进建议和修改意见。

Basically any code that goes into production that's merged in, Codex kind of has its eyes on and suggests improvements, suggests changes in the PRs.

Speaker 0

所以这就是我们内部所看到的情况,但总的来说,最令人兴奋的还是这种蓬勃的活力。

And so that's kind of what we're seeing internally, but by and large, the most exciting is just the energy that there is.

Speaker 0

我们另一个观察是,经常使用Codex的工程师会提交更多的拉取请求。

Another observation that we've had is engineers who tend to use codecs more open way more PRs.

Speaker 0

他们实际提交的拉取请求比那些较少使用Codex的工程师多出70%。

So they're actually opening 70% more PRs than than the engineers who aren't using Codex as much.

Speaker 0

而且这个差距还在不断扩大。

And the gap is widening.

Speaker 0

我觉得,那些提交更多拉取请求的人正在逐渐学会更熟练地使用这个工具,变得更加高效,这70%的差距随着时间推移还在持续扩大。

So I feel like, you know, the people who are opening more PRs are starting to, you know, learn how to use the tool more and more, get more efficient, and that 70% gap keeps growing over time.

Speaker 0

这个数字可能比我上次查看时又有所上升了。

And so might have actually increased since I last looked at the at the number.

Speaker 1

好的。

Okay.

Speaker 1

所以为了确认我理解正确,你的意思是,OpenAI这95%的工程师编写的代码都是由AI生成的。

So just to make sure we hear what you're saying, you're saying all of the code of these 95%, engineers at OpenAI is written by AI.

Speaker 1

代码是写出来的,然后他们进行审查。

It's written and then they review it.

Speaker 0

是的。

Yep.

Speaker 0

是的。

Yep.

Speaker 1

这简直太疯狂了,但奇怪的是,我们已经不再觉得这有什么疯狂了,我们正在逐渐适应这种状态。

It's it's like crazy that that's almost like not crazy anymore that we're just like getting used to this.

Speaker 0

我想,显然我们还在适应过程中。

I think there's still some getting used to, to be clear.

Speaker 0

另外,我认为有些工程师对Codex的信任度稍低一些,但几乎每天我都会遇到某个人,他们对Codex能做的事情感到震惊,他们对模型的信任度——或者说模型能独立完成多少工作的信任度——会一次次地提升。

There's also, I think some, you know, engineers who I think trust Codex a little bit less, but basically every day I talk to someone who is blown away by something that I can do and kind of like their bar of trust kind of, or like how much they trust the model to do on its own goes up over and over over time.

Speaker 0

这里有一句我们科学副总裁凯文·怀尔的名言。

And there's a quote from Kevin Whale, our VP of science here.

Speaker 0

他喜欢说,这是模型最差的时候。

And he likes saying this is the worst the models will ever be.

Speaker 0

因此,对于软件工程来说,这也是模型最差的时候。

And so this is the worst that the models ever be for software engineering as well.

Speaker 0

所以随着时间推移,我们看到人们越来越信任它,同时模型本身也会变得越来越好。

And so over time, we just see people trusting it more and more, and then we'll see the models get better and better as well.

Speaker 1

是的。

Yeah.

Speaker 1

凯文·惠尔,之前的播客嘉宾,他在这档播客里说过 exactly 这句话。

Kevin Wheal, former podcast guest, he he said exactly that line on this podcast.

Speaker 1

是的。

Yeah.

Speaker 1

是的。

Yeah.

Speaker 1

好几次。

A few times.

Speaker 1

是的。

Yeah.

Speaker 1

彼得,那个Claude Bot、Molt Bot或者现在叫Open Claw的,就是它现在的名字,是的。

Peter, the Claude Bot slash Molt Bot slash Open Claw is what it's called now Yeah.

Speaker 1

一位开发者最近分享说,他在工作中使用Codex,每次它完成任务时,他都完全相信它做对了,几乎确信可以直接将代码提交到主分支,效果也会非常好。

Developer, recently shared that he uses Codex for his work, and he feels like anytime it does things, he just trusts that it has done the right job, but he's just like almost certain he could just commit it to master and it'll be great.

Speaker 0

是的。

Yeah.

Speaker 0

对。

Yeah.

Speaker 0

他是Codex的优秀用户。

He's a great user of Codex.

Speaker 0

我知道他和团队保持密切联系,给了我们很多宝贵的反馈。

I know he's in close touch with the team, gives us great feedback.

Speaker 0

他使用它并不令人意外。

Not surprised that he uses it.

Speaker 0

我的意思是,抱歉,现在叫Open Claw。

I mean, sorry, it's called Open Claw.

Speaker 1

Open Claw。

Open Claw.

Speaker 0

是的。

Yeah.

Speaker 0

Open Claw 是一个很棒的产品。

Open Claw a great is a great product.

Speaker 0

然后我看到这个,我的意思是,这是最近的事,但今天早上,我觉得 Molt book 有点像,是的。

And then I saw that this more I mean, this is very recent, but this morning, I think Molt book kind of like Yeah.

Speaker 0

看到所有 AI 代理彼此交流,真的非常超现实。

With Sheridan, seeing all of the AI agents talk to each other is pretty pretty surreal.

Speaker 1

这基本上就是现实中正在发生的事,我是说,是的。

It's basically her is happening in real life is what I'm Yeah.

Speaker 1

所以,回到我们工程师所经历的这个疯狂时刻,我们已经从每行代码都亲手编写,变成了现在 AI 为你编写所有代码。

So just like coming back to this crazy moment we are living through for engineers in particular, we've gone from you write every line of code to now AI is writing all of your code.

Speaker 1

我不知道过去几年还有什么工作发生了如此巨大的变化。

I don't know what job has changed more in the past couple years.

Speaker 1

就像我们从未预料到会如此剧烈变化的职业,工程师的职业生涯中,这份工作已经完全不同了。

Like, job that we didn't expect to change this much, where just like the job of an engineer is so different in the entire lifespan of an engineer.

Speaker 1

在过去的几年里,现在我已经不再写任何代码了。

Like in the past couple years, it's now shifted to I don't write any more code.

Speaker 1

你想象一下,在未来几年里,工程师和软件工程师的角色会是什么样子?

How do you imagine the role of an engineer and the job of a software engineer looks in the next couple years?

Speaker 1

也就是说,这份工作到底会变成什么样?

Just like, what is that job?

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,一直都能看到这些,真的非常酷。

It's I mean, it's always been really cool to see.

Speaker 0

而这种兴奋感的一部分就在于,这份工作在未来一到两年内很可能会发生显著变化。

And it's part of where the excitement is because, like, the job is likely gonna change pretty significantly over the next one to two years.

Speaker 0

不过,我们感觉还是在摸索中前进。

It kinda feels like we're still figuring things out though.

Speaker 0

因此,我知道,尤其是来自一些软件工程师的兴奋感在于,我们现在正处于一个难得的时刻,未来十二到二十四个月内,我们将有机会自己摸索并确立我们的标准。

And so there's like this excitement I know, especially from some of the software engineers of like, we're in this rare moment, you know, maybe over the next twelve to twenty four months where we'll kind of get to figure things out ourselves and set our standards for ourselves.

Speaker 0

就我所看到的未来发展方向而言。

In terms of where I see this moving.

Speaker 0

所以我认为大家普遍提到的一个共同点是,工程师们正在逐渐转变为技术负责人。

So I think there's a common thing that everyone's saying, which is, you know, people are generally IC engineers are becoming tech leads.

Speaker 0

他们现在本质上就像是管理者。

They're basically like managers now.

Speaker 0

他们管理着大量的人工智能代理。

They're managing fleets and fleets of agents.

Speaker 0

我知道我团队中的许多工程师同时要处理十到二十个任务线程。

I know many of the engineers on my team basically have like 10 to 20 threads kind of being pulled on at the same time.

Speaker 0

当然不是在运行活跃的代码任务,而是有大量的并行线程。

Obviously not active running codex jobs, but just a lot of parallel threads.

Speaker 0

他们需要不断跟进这些任务的进展。

They're checking in on what they're doing.

Speaker 0

他们正在引导这些代理和Codex,并给予反馈。

They're steering the agents and codex and giving it feedback.

Speaker 0

因此,他们的工作已经从单纯编写代码,转变为几乎像一名管理者。

And so their job has kind of really changed from just writing the code itself into being almost like a manager.

Speaker 0

关于我预计一到两年后这种情况会如何发展。

In terms of where I think this will go one to two years from now.

Speaker 0

这里我经常回想起的一个比喻,来自我大学时读过的一本编程教材,名叫《SICP》。

So one kind of metaphor that I kind of always come back to here is actually from this programming textbook that I read back in college called SICP.

Speaker 0

我不知道你是否听说过它。

I don't know if you've heard of it.

Speaker 0

《计算机程序的构造和解释》。

Structure and Interpretation of Computer Programs.

Speaker 0

所以是S-I-C-P。

So S I S I C P.

Speaker 0

在麻省理工学院,这本书非常流行,长期以来一直被用作入门编程课程的教材。

At MIT, it was really popular and it was actually used as the introductory it was the textbook for the intro programming course for a very long time.

Speaker 0

而且它拥有一定的狂热追随者。

And it kinda has this cult following.

Speaker 0

它教你编程。

It teaches you programming.

Speaker 0

它教你一种叫 Scheme 的 Lisp 方言。

It teaches you a dialect of Lisp called Scheme.

Speaker 0

它让你接触函数式编程。

And so it like introduces you to like functional programming.

Speaker 0

这种方式非常开阔眼界。

It's like very mind mind opening that way.

Speaker 0

但让我对这本书印象深刻的是,我大学时读过它。

But the thing that was memorable for me about that book, so I kind of read it in college.

Speaker 0

书的开篇将编程描述为一门学科,并用一个类似巫术的比喻来说明。

The very beginning of it kind of describes programming as a discipline and draws this metaphor to basically like sorcery.

Speaker 0

它说软件工程师就像巫师,编程语言就像咒语,你念出这些咒语,它们就会为你执行各种任务。

Like it says like software engineers are like wizards and you're like programming languages are like incantations and you're like, you know, you're saying, you're issuing these spells and these spells are kind of like going out and doing things for you.

Speaker 0

而挑战就在于,你需要说出什么样的咒语,才能让程序按照你的意愿运行?

And the challenge is like, what incantation do you have to say to make the program do what you want?

Speaker 0

这本书写于1980年。

And this book was written in 1980.

Speaker 0

那是很久以前的事了。

So this is a while ago.

Speaker 0

我认为这个隐喻实际上一直延续至今。

And I think that metaphor has actually like kind of persisted over time.

Speaker 0

我认为随着我们进入这个新的‘氛围编程’时代,或者说是软件工程未来的样子,这个隐喻正在成为现实,因为编程语言本质上就是使用咒语。

And I think it's actually playing out as we move into this new era of vibe coding or just like what software engineering will look like because programming languages were basically using incantations.

Speaker 0

它们随着时间发生了变化。

They've changed over time.

Speaker 0

而挑战始终存在,趋势是通过编程让计算机执行你想要的操作变得越来越容易。

And the challenge has always, and the trend has been that these, it's been easier and easier to kind of get the computer to do what you want via programming.

Speaker 0

我认为当前这场人工智能浪潮,很可能是这一演进的下一个阶段。

And I think the current wave of AI is probably the next stage of that evolution.

Speaker 0

现在这简直就是咒语了,因为你只需告诉Codex,或者告诉Cursor你想要做什么,它就会全部替你完成。

It is now literally incantations because you can tell, you know, your you can tell tell Codex, you can tell Cursor exactly what you wanna do and then it'll all go do it for you.

Speaker 0

我特别喜欢巫师和术士的比喻,因为我觉得我们当前的状态正逐渐走向《幻想曲》中《魔法师的学徒》那样的情景,米奇老鼠找到了巫师的帽子,然后试图施展各种法术。

And I particularly like the wizard and like the the sorcerer analogy because I think our current state is is starting to move towards kind of like the the Sorcerer's Apprentice, you know, from Fantasia where Mickey Mouse is like, you know, he finds the Sorcerer's hat and he tries to do all these things.

Speaker 0

我认为这个比喻非常贴切,因为首先,现在这些技术真的非常强大。

And I actually think it's a really apt analogy because one, it's just, it's really powerful now.

Speaker 0

你所使用的这些咒语具有极高的杠杆效应,但你必须知道自己在做什么,对吧?

These incantations you can do can is extremely high leverage, but you kind of have to know what you're doing, right?

Speaker 0

在《魔法师的学徒》里,整个剧情就是米奇失控了,扫帚疯狂运转,到处洪水泛滥。

Like in Sorcerer's Apprentice, the whole plot is like Mickey goes wild, the brooms like go crazy and everything's flooding.

Speaker 0

我记得他 literally 给扫帚下达了任务,然后就去睡觉了。

I think he literally sets the like sets the brooms off on a task and then goes asleep.

Speaker 0

所以,这简直就是极致的氛围编程。

And so, you know, it's like vibe coding at its greatest.

Speaker 0

最后,那位老巫师回来,把一切收拾干净。

And then eventually the old sourcer comes back and like cleans everything up.

Speaker 0

当我看到工程师们同时进行二十个不同的Codex线程时,这确实需要一定的技能、资历和大量思考,因为你需要确保模型不会失控。

And, you know, when I see engineers kind of like doing these 20 different codex threads at a time, there is some skill and there's some seniority and like, a lot of thought that needs to go into this because you want to make sure that the models aren't going off the rails.

Speaker 0

你绝对不希望完全放任不管,任由它自行其是。

You definitely don't want to just like completely go away and, you know, like ignore the thing.

Speaker 0

但这也具有极高的效率。

But it's also extremely high leverage.

Speaker 0

像那些非常资深、熟练使用这些工具的工程师,现在能通过这些工具完成多得多的事情。

Like, you know, a very senior engineer who's really proficient with these tools can now just do way more things via what they're doing.

Speaker 0

我认为这也是它如此有趣的原因。

And I think this is also what makes it fun.

Speaker 0

这真的让我们感觉自己像魔法师一样。

Like, it literally feels like we're wizards now.

Speaker 0

你感觉我们越来越接近那种魔法般的体验——我们施放各种咒语,让软件为我们完成所有事情。

You know, it feels like we're closer to to to to having to to making making it feel like this, like, magical experience where we're, you know, casting all these spells and having software do all these things for you.

Speaker 1

你描述那个场景时,我正好也在想《魔法师的学徒》这个比喻,很高兴你提到了它。

I was thinking of the Sorcerer's Apprentice exactly as the metaphor as you were describing that, so I'm glad you went there.

Speaker 1

一位之前的播客嘉宾将它描述为,你拥有一个可以满足你愿望的精灵,这种说法很有用,因为你必须非常清楚自己想要什么愿望。

A previous podcast guest described it as you have a genie that you can that grants you wishes, and it's a useful frame because you have to be very clear about the wish you want.

Speaker 1

比如,你想要的规模得有多大,

Like, you wanna be big, like,

Speaker 0

有多大影响?

how big a deal?

Speaker 0

或者这可能像猴子的爪子那样,你知道,你得到了你想要的,但副作用是什么?

Or it might be like the monkey's paw type thing where, you know, it's like, you caught what you want, but what are the side Right.

Speaker 0

是的。

Yeah.

Speaker 0

对。

Yeah.

Speaker 0

我觉得这个比喻非常好。

Think that and the analogy is great.

Speaker 0

对了,让我觉得不可思议的是那本书《Sick Bee》的持久影响力。

Yeah, the crazy thing for me is just the staying power of that book, Sick Bee.

Speaker 0

它被称为《魔法师之书》。

Like, it's called the wizard book.

Speaker 0

人们称它为《魔法师之书》,因为整个书中都贯穿了这个隐喻,而我们现在基本上已经达到了这个阶段,这真的很棒。

You know, people call it the wizard book because that is the metaphor that they kind of weave throughout the the book and, we're we've basically reached that point now, which is which is which is really cool.

Speaker 1

我想在这里探讨两个方向。

There's two kind of threads I wanna follow here.

Speaker 1

一个是,我越来越常听到,当人们的智能代理无法正常工作时,会感到很大的压力。

One is, I've been hearing more and more there's this like stress that people feel when their agents aren't working.

Speaker 1

你是不是会同时启动一大堆Codex代理,然后还得时刻盯着它们?

Do you fire off all these, you know, Codex agents and then you have to keep stay on top of them?

Speaker 1

天哪。

Oh shit.

Speaker 1

有一个没在工作。

One's not working.

Speaker 1

我浪费时间了。

I'm wasting time.

Speaker 1

你有这种感觉吗?

Do you do you feel that?

Speaker 1

你们团队里也有这种感觉吗?

Do you feel that across your team at all?

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,这种情况经常发生。

I mean, it happens all the time.

Speaker 0

实际上,我认为现在这一切最有趣的部分就在这里,因为这些模型并不完美。

And I actually think like this is where the interesting part of all of this lies right now because these models aren't perfect.

Speaker 0

这些工具也不完美。

These tools aren't perfect.

Speaker 0

我们仍在探索如何最好地与Codex或这些AI代理互动,以完成工作。

And we're still trying to figure out how to best interact with these with with with Codex or with these AI agents to to get work done.

Speaker 0

我们经常看到这种情况发生。

We see this come up all the time.

Speaker 0

我们内部有一个特别有趣的团队。

There's a particularly interesting team that we have internally.

Speaker 0

目前有一个团队正在与OpenAI进行一项实验,他们基本上维护着一个完全由Codex生成的代码库。

So there's a team that's actually doing an experiment right now with an OpenAI where they are basically maintaining a 100% codex written code base.

Speaker 0

你知道,有时候AI会写代码,但你显然最终得重写很多部分,可能还需要检查和修改一些内容。

So, you know, like, you know, some, you know, you'll have the AI write code, but you'll obviously end up like rewriting a lot of it and you might need to like double check and change things.

Speaker 0

但这个团队完全依赖Codex,全身心投入其中。

But this team is just fully Codex pilled and just like leaning in entirely.

Speaker 0

他们遇到了你所描述的 exactly 问题,也就是:我想实现这个功能,但我没法让AI帮我完成。

And they run into the exact problems that you're describing, which is like, you know, their challenges, you know, I want to get this thing, this feature built, but I can't get the Asian to do it.

Speaker 0

过去,这通常是个退路,你会想:好吧,我亲自上手,自己搞定。

And so usually it was an escape hatch where, you know, then you're like, all right, I'll roll up my sleeves and like figure it out.

Speaker 0

然后,与其使用Codex,我可能会改用自动补全、Cursor之类的工具。

And then instead of using codex, I might use like tab complete and cursor and things like that.

Speaker 0

但这个团队在实验中没有这样的退路。

But this team, for the experiment, this team doesn't have that escape hatch.

Speaker 0

所以问题来了,我该如何让智能体完成这个任务?

And so then the challenge, like how do I get the agent to do this?

Speaker 0

实际上,我们打算发布一篇博客文章,分享一些从中学到的经验,许多有趣的范式和最佳实践正从中浮现出来。

And I actually think we're gonna be publishing a blog post from some of our learnings here, but a lot of fascinating paradigms and best practices are falling out of this.

Speaker 0

我们注意到一个有趣的现象,我不知道你是否也有这种感觉,但我们在这里确实经常发现,当编码智能体没有按你的预期工作时,问题通常出在上下文或你提供给它的信息上。

One interesting thing that we've noticed, I don't know if this is what you kind of feel, but we definitely feel it here is a lot of the time when the coding agent is not doing what you want, it's usually a problem with context and just like information that you've given it.

Speaker 0

要么是描述不够详细,要么就是智能体或Codex缺乏完成某项任务所需的足够信息。

It's just either under specified or there's just not enough information around how to do something available to the agent, available to Codex.

Speaker 0

因此,当你必须通过这种方式解决问题时,挑战就在于如何增加文档,绕过这一限制,将你头脑中的大量隐性知识以某种方式编码到代码库中,无论是通过代码注释、代码结构,还是通过.md文件、技能文档等仓库内的其他资源,以便模型能更好地完成任务。

And so when you have to solve it through that, the challenge is then to add documentation and actually work around this limitation and basically encode more tribal knowledge that's in your head somehow into the code base, either via code comments itself or code structure itself, or via text files like .md files, skills, any type of additional resources within the repository so that the model can better do its task.

Speaker 0

这个团队还有许多其他值得探索的发现,但确实,通过移除‘不再使用AI’这一退路,他们开始逐步拼凑出我们若想真正依赖智能体就必须解决的诸多问题。

There's a whole bunch of other learnings from this this group, which I think is fascinating to explore, but yeah, kind of giving, removing that escape hatch of, of no longer using AI has allowed them to start piecing together a lot of the problems that we'll have to solve if we really wanna lean into agents.

Speaker 1

另一个人们遇到的问题是,你提到人们在使用AI时会疯狂地提交拉取请求,PR的数量大大增加了。

Another, issue people run into, you talked about how people are shipping PRs like crazy, a lot more PRs if they're working with AI.

Speaker 1

显然,代码审查正变得越来越具有挑战性。

Obviously, code review is becoming a a bigger challenge.

Speaker 1

你们团队有没有找到什么方法来加快这个过程,使其可扩展,而不是让每个人整天坐在那里审查PR?

Is there anything you've figured out of your team to help speed that up to make that scale and not just create this terrible job for people where they're just sitting there reviewing PRs all day?

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,目前Codex已经审查了我们所有的PR。

I mean, one thing is codex reviews 100% of all of our PRs at this point.

Speaker 0

事实上,一件非常有趣的事情是,我们通常会立即交给模型处理的,都是那些让我们感到厌烦或最枯燥的软件工程任务。

And so I actually think so one really interesting thing that's happened is the things that tend to, we tend to hand to the models immediately tend to be the things that annoy us or like are the most boring parts of software engineering.

Speaker 0

这也让工作变得更有趣了,因为我们现在能做更多有趣的事情。

It's also why it's more fun now because we get to do more, you know, more of the fun things.

Speaker 0

就我个人而言,我非常讨厌代码审查。

For me, speaking more for myself, I really hated code reviews.

Speaker 0

这曾经是我最不喜欢的事情之一。

It was like one of the worst things for me.

Speaker 0

我记得刚毕业第一份工作在Quora时。

And then I remember in my first job out of college, it at Quora.

Speaker 0

我当时负责新闻动态模块。

I owned I was working on the News Feed.

Speaker 0

所以我拥有新闻动态的代码。

And so I owned the code for the News Feed.

Speaker 0

因此我是新闻动态的代码审查者,而这是每个人都经常修改的核心代码。

And so I was a reviewer for News Feed and it was just like the central piece of code that everyone would touch.

Speaker 0

所以每天早上我登录后,都会看到二三十个代码审查请求。

And so I would just every morning I'd log in and be like 20 to 30 code reviews.

Speaker 0

我当时就想,天啊,我得赶紧把这些都处理完。

I was just like, oh my goodness, I gotta like, you know, get through all of these.

Speaker 0

我会拖延,结果数量就涨到了五十个。

I would procrastinate and then it grows to like 50.

Speaker 0

所以真的有非常多的代码审查。

And so there's just like a lot of code reviews.

Speaker 0

Codex 在代码审查方面非常出色。

Codex is really good at reviewing code.

Speaker 0

所以我们发现,特别是 5.2 版本,在代码审查方面变得极其擅长,尤其是当你正确引导它的时候。

So actually one thing that we've noticed that five point two in particular has gotten extremely, strongly adept at is reviewing code and especially when you kind of steer it in the right direction.

Speaker 0

所以在代码审查方面,是的,我们创建了很多拉取请求,但 Codex 会审查所有这些请求。

And so for code reviews, yeah, we create a lot of PRs, but Codex reviews all of them.

Speaker 0

它让代码审查从原本需要十到十五分钟的任务,有时缩短到仅仅两到三分钟,因为你已经内置了许多建议。

And it makes, you know, code reviews go from a, you know, I don't know, ten, fifteen minute tasks to sometimes even just like a two to three minute task because you have a bunch of suggestions already baked in.

Speaker 0

很多时候,尤其是对于小的拉取请求,你甚至根本不需要其他人来审查。

A lot of the times people will, especially for small PRs, you actually don't even need people to review.

Speaker 0

我们在这方面相当信任 Codex。

We kind of trust Codex in this way.

Speaker 0

原始作者会查看 Codex 的建议。

The original author kind of looks at Codex.

Speaker 0

代码审查的好处在于,有第二双眼睛来确保你没有做出愚蠢的错误。

It is you know, the benefit of code review is to have a second pair of eyes to make sure that you're not doing anything dumb.

Speaker 0

到目前为止,Codex 已经是一个相当聪明的第二双眼睛了。

Codex is a pretty smart second pair of eyes at this point.

Speaker 0

因此,我们在这方面已经大力依赖它了。

And so that's something that we've heavily leaned into.

Speaker 0

目前,内部的通用 CI 流程以及推送后和部署流程也已通过 Codex 大幅自动化。

The general CI process and like the post kind of push and like deployment process has also been heavily automated via codecs internally at this point.

Speaker 0

如果你去问很多工程师,最让他们烦心的事就是,写完漂亮的代码后,怎么把它部署到生产环境?

If you talk to a lot of engineers, the thing that annoys me the most is after you've written your beautiful code, like how do you get it into production?

Speaker 0

你知道,你得跑完所有这些测试。

You know, you gotta you gotta run through all these tests.

Speaker 0

你得处理那些代码规范检查错误。

You gotta like, know, lint errors.

Speaker 0

你得完成所有的代码审查。

You gotta all the code review.

Speaker 0

用 Codex 你可以自动化很多东西。

There's a lot of automated stuff you can do with Codecs.

Speaker 0

因此,我们实际上内部开发了一些工具,帮助自动化这个过程,自动修复代码规范问题。

And so we've actually built some tools internally that that help automate that process, automate the lint.

Speaker 0

你知道,如果出现代码规范错误,用Codex来修复非常简单。

You know, if there's like a lint error, it's a very easy Codecs fix.

Speaker 0

然后它可以直接打补丁并重新启动CI流程。

And then just it could just patch it and then kind of restart the CI process.

Speaker 0

我们努力将所有这些步骤尽可能简化,让工程师的工作量降到最低,而其附带的好处是,他们现在可以更频繁地合并和推送代码。

So all of that is we're trying to collapse as as into as as little work for an engineer as possible, which and and the byproduct of which is they can they can now merge and push out a lot more peers.

Speaker 1

Codex编写代码,Codex审查自己的代码,我想知道你们是否愿意使用其他模型来审查你们模型的工作成果?

Codex writing the code, Codex reviewing its own code, I'm curious if you are open to using other models to review your model's work.

Speaker 1

这是一条可行的路径,还是说目前的方案已经足够好了?

Is that is that a path or is it just it's good enough?

Speaker 1

我们不需要其他任何东西。

We don't need anything else.

Speaker 0

我想说的是,这里确实存在一种循环关系。

So I will say there's there's definitely a circular thing here.

Speaker 0

就像回到《魔法师的学徒》那样,你得确保别让扫帚失控了。

And like going back to Sorcerer's Apprentice, like you wanna make sure you're not letting the brooms go crazy here.

Speaker 0

所以我们非常谨慎地考虑了哪些PR完全由Codex审核即可。

And so, you know, we were very thoughtful, I'd say around which PRs kind of are completely just Codex reviewed.

Speaker 0

当然,大多数人还是会查看自己的PR。

Most people still obviously take a look at their PRs.

Speaker 0

所以并不会降到零。

And so it's not like it's going to zero.

Speaker 0

更像是从100%的关注度降到30%的关注度,这样能帮助事情更快推进。

It's more like going from a 100% attention to like 30% attention, which just helps things push through.

Speaker 0

关于使用多个模型,我们当然在内部测试了很多模型。

In terms of like multiple models, so we obviously test a lot of models internally.

Speaker 0

所以我们有很多这类模型。

And so we have a lot of those.

Speaker 0

我们较少使用外部模型。

We use external models less.

Speaker 0

我们认为有必要使用我们自己的模型进行内部测试,从而获得反馈。

We think it's important to kind of dog food our own models and kind of like get feedback there.

Speaker 0

但你也可以使用许多内部版本的模型,它们能为你提供不同的视角。

But you can also, you know, there are a lot of like internal variants of models that you can use to give you a different perspective here as well.

Speaker 0

我们发现这种方式效果非常好。

We found that to work quite well.

Speaker 1

好的。

Okay.

Speaker 1

为了确保我们能准确了解今天OpenAI在AI和代码方面的现状,我想确认一下,之后我想转到另一个话题。

So just to just to make sure we get it like a barometer of today's world at OpenAI in terms of AI and code, just so I understand, and then I wanna move on to a different topic.

Speaker 1

目前,OpenAI的所有代码都是由Codex编写的。

A 100% of code across OpenAI is written by Codex at this point.

Speaker 1

这样表述对吗?

Is that the way to frame it?

Speaker 0

我不会说今天生产环境中100%的代码都是由AI编写的。

I wouldn't make the statement that a 100 of code running in production today was is written by AI.

Speaker 0

而且在那里进行归属认定确实很难。

And and it's kind of hard to to to do attribution there.

Speaker 0

但目前几乎每位工程师在所有任务中都大量使用Codex。

But the like almost every engineer heavily uses codecs in all of their tasks at this point.

Speaker 0

所以,如果我要估算的话,目前绝大多数代码很可能都是由AI编写的。

And so I you know, if I were to guesstimate just like the vast majority of code at this point is was probably authored by AI.

Speaker 1

不可思议。

Incredible.

Speaker 1

好的。

Okay.

Speaker 1

所以现在有很多讨论,我们也一直在谈IC角色,也就是IC工程师的工作。

So there's a lot of talk and we've been talking about kind of the IC role, the work of an IC engineer.

Speaker 1

但关于管理者角色的变化,尤其是工程经理的转变,讨论却比较少。

There's less talk about the changing role of a manager, especially an engineering manager.

Speaker 1

随着AI的兴起,你的管理生涯发生了哪些变化?你对管理者这个角色有什么看法?

How has your life as a manager changed with the rise of AI and just what do you where do you think of managers?

Speaker 1

未来经理的角色是什么?

What's the role of a manager in the future?

Speaker 0

经理的角色变化确实比工程师少。

It's definitely changed less than an engineer.

Speaker 0

目前还没有专门针对经理的代码规范。

There's no, you know, codex for managers just just yet.

Speaker 0

不过,我在做一些更偏向管理类的工作时,会经常使用代码辅助工具。

However, I use codex quite a bit for for some of the some of some of the, like, kind of more manager y tasks that I do.

Speaker 0

我觉得有几件事正在发生变化。

I'd say a couple things are are changing.

Speaker 0

有一些趋势正在出现。

There are like some trends.

Speaker 0

所以我认为目前变化还不算大,但我看到了一些趋势。

So I don't think it's changed that much yet, but I see trends.

Speaker 0

我认为如果往前推演,就能大致看到这些变化将走向何方。

And I think if you play it out, can kind of see where where a lot of this is going.

Speaker 0

一个越来越明显的事情是,Codex 真正赋能了顶尖人才,让他们变得高效得多。

One thing that that's becoming increasingly clear is Codex really empowers like top performers to to get a lot like to be a lot more productive.

Speaker 0

因此,我认为这可能也适用于更广泛的 AI 技术,乃至整个社会——那些真正积极投入的人,那些具有高度自主性、愿意熟练掌握这些工具的人,会获得巨大的加成。

And so it really like and I think this is maybe true for AI more broadly, like across society, which is like the people who really lean in are like the people who have high agency or like will really get get get good at these tools will kind of supercharge themselves.

Speaker 0

我现在也注意到了这一点,顶尖人才最终会变得高效得多。

And so I'm kind of noticing this now as well, which is like the top performers kind of end up being a lot more a lot more productive.

Speaker 0

因此,团队的整体生产力差距也因此被拉大了。

And so you see a broader spread in in team productivity in this way.

Speaker 0

我一直秉持的管理理念是,把大部分时间花在顶尖人才身上,确保他们没有障碍、心情愉快,并且感到自己高效且被倾听。

One so one thing that I've always done as as a management philosophy is to spend actually the majority of my time with top performers, just like make sure they're unblocked, make sure they're happy, make sure, you know, they're they feel productive and they feel heard.

Speaker 0

我认为,在 AI 时代,这一点更加重要,因为你的顶尖人才会借助这些工具飞速前进。

I think this is even more true in an AI world where, you know, your top farmers are gonna just like really be shooting ahead using these tools.

Speaker 0

举个例子,有一个团队完全使用 Codex 生成的代码,让他们自由发挥、观察结果,这种做法已经带来了显著回报。

I think one example is is the the team that's, you know, maintaining a 100% codex generated code base, like just letting them kind of rip and see what's happening there is something that's paid dividends.

Speaker 0

所以我认为,管理者花更多时间在顶尖人才身上,这可能是未来持续的趋势。

So I think that's kind of one trend that I'm seeing where spending even more time with top performers for managers, I think is likely gonna continue.

Speaker 0

另一点是,这更多是一种观察,但我感觉随着这些AI工具对管理者越来越普及。

The other thing is I so this is more an observation, but my sense is with a lot of these AI tools available to managers.

Speaker 0

不是更多地写代码,而是像ChatGPT这样能利用组织知识的工具,可以更好地进行研究和理解组织背景。

So less like writing code, but just things like ChatGPT with organizational knowledge, like being able to do research and understanding organizational context a lot better.

Speaker 0

另一个很好的例子是,我们现在正在进行绩效评估,实际上很容易将ChatGPT与内部知识库连接起来,比如GitHub、Notion文档和Google文档,从而深入了解一个人在过去十二个月里的工作表现,并生成一份深入的研究报告。

Another good example is we're doing performance reviews right now and it's actually really easy to use ChatGPT with internal knowledge hooked up to GitHub and like our Notion docs and Google Docs to give a get a really good sense of what this person has done over the last twelve twelve months in writing a little, you know, deep research report for it.

Speaker 0

我的感觉是,在这个环境中,管理者将能够管理规模大得多的团队。

My sense is I think managers will be able to manage much larger teams in this world.

Speaker 0

就像软件工程师可以管理二十到三十个Codex一样。

Kind of like how, you know, like software engineers are managing 20 to 30 codexes.

Speaker 0

我认为这些工具将使管理者更具杠杆效应,让他们能够管理远超当前最佳实践规模的团队——据我所知,软件工程领域目前的最佳实践是六到八人,对吧?

My sense of these tools will allow managers, people managers to be higher leverage and will allow them to manage, you know, teams of way more than the current best practice of, I think it's like six to eight, right, for software engineering.

Speaker 0

你可以看到这种趋势也适用于非工程领域,比如支持或运营——过去,支持团队的规模可能受到限制,但当你把更多工作交给智能代理后,不仅能完成更多工作,还能管理更多人。

You kind of see this apply to, you know, like the non engineering domains, like support or operations where it's like, you know, previously, where previously, like, the the size of the support team might be limited, but, as you can pass off more things to agents, you can actually do more work and also manage more people this way.

Speaker 0

我认为同样的情况也可能发生在人员管理上,尤其是在科技公司中。

I think the same thing might happen for people management as well, especially in tech companies.

Speaker 0

我们已经看到了这种情况。

And we're already seeing this.

Speaker 0

有些团队中,工程经理管理着相当多的人,他们借助这些工具做得相当出色,因为这些工具能让他们获得更高的效率,更好地了解团队在做什么,更深入地理解组织背景,并以这种方式运作。

There are some teams where there are EMs managing, you know, quite a few people and they're doing it pretty adeptly because of some of these tools where they can get higher leverage and understand what their team's doing, understand organizational context a little bit better and operate in that way.

Speaker 1

我喜欢你描述的这种建议:你一直倾向于关注顶尖员工,花更多时间与他们沟通,确保他们开心。

I love this advice that with the way you described it is you've always leaned into top performers and spent more time with them and block them and make sure they're happy.

Speaker 1

马克·恩德里森刚刚在播客中提到,AI能让优秀的人更优秀,让杰出的人变得非凡。

The way Mark Endrizon, he was just on the podcast, the way he phrased it is AI makes good people better and it makes great people exceptional.

Speaker 0

是的。

Yeah.

Speaker 0

对。

Yeah.

Speaker 1

你在这里说的,就是越来越多地这样做,可能是正确的方向。

And what you're saying here is just just doing this more and more is probably the right move.

Speaker 1

多花时间与团队中最优秀的人在一起,帮助他们扫清障碍,确保他们拥有所需的一切。

Spending more time with the best people on your team to unblock them, make sure they have everything they need.

Speaker 0

是的。

Yeah.

Speaker 0

一个非常好的例子是,目前有一群工程师内部正在深入研究Codex的构建,并思考与该模型交互的最佳实践。

A very good example right now is there are, I would say, like a a group of engineers internally who are really codex builds and are thinking through what the best practices are for interacting with this model.

Speaker 0

这对他们来说是一件极具杠杆效应的事情。

And that is just an extremely high leverage thing for them to do.

Speaker 0

因此,作为经理,我只是说:去探索吧。

And so just like as a manager, I'm just like, yeah, go explore this.

Speaker 0

无论从中得出什么最佳实践,我们都必须与整个组织分享。

You know, whatever best practices come out of this, you know, we we have to share with the org.

Speaker 0

我们会举办各种知识分享会议。

Well, we'll, you know, we'll we'll we do all these knowledge sharing sessions.

Speaker 0

我们会分享文档,并在各处传播最佳实践。

We'll we'll, like, share documents and, like, best practices everywhere.

Speaker 0

像这样的做法,能够提升每个人的能力。

So things like that just, you know, elevate everyone.

Speaker 0

所以,我认为这又是这一趋势的另一个例子,我们正在看到顶尖表现者变得极为出色。

And, and so I I view that as like, you know, another example of this trend, that, that we're seeing where the top performers really get exceptional.

Speaker 1

人们就是有一种感觉。

People just like have a sense.

Speaker 1

这很重要。

This is big.

Speaker 1

人工智能正在改变太多东西。

AI is changing so much.

Speaker 1

世界正在发生变化。

The world is changing.

Speaker 1

这将是一个巨大的变革。

It's gonna be a huge deal.

Speaker 1

你认为人们还没有将哪些因素纳入对未来的预期中?

What do you think people aren't pricing in yet into what will change into where things are heading?

Speaker 1

举个例子,你认为我们目前还没意识到的是什么?

Just like what's an example of something you think are like, okay, we're not realizing this yet.

Speaker 0

我最喜欢的一个关于这场AI浪潮的短语或概念,就是‘一个人的十亿美元初创公司’。

So one of my favorite kind of, like phrases or like things that have come out of this whole AI wave is the idea of the one person billion dollar startup.

Speaker 0

我觉得可能是萨姆最先提出这个说法的,或者至少是他推广开的,但这确实很引人深思,对吧?

I think I actually think Sam may have keyed it or like Sam may have been the first one to say it, but it's fascinating to think about, right?

Speaker 0

如果人们的工作效率如此之高,那么终将出现一家由一个人创立的十亿美元初创公司。

It's like, yeah, if people are so high leverage, at some point, there will likely be a one person billion dollar startup.

Speaker 0

虽然我觉得这非常酷,但人们还没有充分考虑到这一趋势的第二层、第三层影响。

And while I think that's really, really cool, I think people aren't really pricing the second or third order effects of this.

Speaker 0

事实上,‘一个人的十亿美元初创公司’意味着,一个人借助这些工具就能获得前所未有的自主权和杠杆效应,轻松完成企业运营所需的一切,最终打造出价值十亿美元的公司。

And and really what, you know because because what the one person billion dollar start startup implies is that there's, you know, one person can just have so much more agency and so much more leverage using one of these tools that it is just super easy for them to get everything done that they need to for their business to ultimately create something that's a billion dollars.

Speaker 0

但我认为这还带来了一些其他影响。

But I think there are a couple other implication of this.

Speaker 0

其中一个影响是,如果一个人能够轻松创建一家十亿美元公司,那么创建任何初创公司都会变得容易得多。

So one of them is if it's easy for a person to create a one person bill or if it's possible for a person to create a one person billion dollar startup, it also means it's way easier for people to just create startups in general.

Speaker 0

实际上,我认为这会产生一个第二层效应:将迎来一场巨大的初创企业和小型企业浪潮,任何人都能为任何需求开发软件。

Like, I actually think this will like one second order effect of this is I think there's just gonna be a huge startup boom and small SMB style boom where anyone can build software for anything.

展开剩余字幕(还有 480 条)
Speaker 0

对吧?

Right?

Speaker 0

比如,你已经开始在AI创业领域看到这种趋势了,软件正变得越来越垂直化,像为某个垂直领域开发AI工具往往效果很好,因为你深入专研了这个特定领域。

Like one, you're kind of starting to see starting to see this play out in the AI startup scene where software's became a lot more vertical oriented, where like these verticals, like creating some AI tool for some vertical tends to work quite well because, you know, you really lean into that particular domain.

Speaker 0

你真正理解了它的使用场景。

You like really understand the use case for it.

Speaker 0

因此,如果继续发展AI,没有理由不能出现100倍数量的这类创业公司。

And so if you play out AI, there's no reason why you can't have like 100X more of these startups.

Speaker 0

所以,我认为我们可能会看到一种局面:为了支持一个人打造十亿美元的初创企业,可能会有另外一百家小型创业公司,专门开发高度定制化的软件,来支持其他类型的个人十亿美元初创企业。

And so I think one world that we might end up seeing happen is in order to enable a one person billion dollar startup, there might be like a 100 other small startups building bespoke software that works extremely well to support other types of, you know, one person, you know, billion dollar startups.

Speaker 0

因此,我认为我们可能会真正进入一个B2B SaaS以及软件和创业整体的黄金时代。

And so I think we might actually enter into a golden age of like B2B SaaS and just like software and startups in general.

Speaker 0

所以,我认为这是一个非常有趣的趋势,因为随着软件开发越来越容易,运营公司也越来越简单,你可能会看到更多这样的初创企业涌现。

And so I think that's a really interesting trend to kind of see because as it gets easier and easier to build software, as it's easier and easier to run a company, you might actually just end up seeing way more of these startups.

Speaker 0

因此,我一直在想,也许会出现一个人的十亿美元初创企业,也可能出现一亿美元级别的初创企业。

And so the way I've been thinking about it is like, yeah, there might be one person billion dollar startup, or there might be like a $100,000,000 startups.

Speaker 0

可能会有数以万计的千万美元级别初创公司。

There might be tens of thousands of $10,000,000 startups.

Speaker 0

作为个人,拥有一个一千万美元的业务其实非常好。

And as an individual, it's actually pretty great to have a $10,000,000 business.

Speaker 0

到那个时候,这笔钱对你一生来说已经足够了。

Like that's like enough for yourself for life at that point.

Speaker 0

因此,我们可能会看到这种方式的爆发式增长。

And so, we might really see an explosion in that way.

Speaker 0

但感觉人们并没有真正推动这一点。

And feel like people aren't really pressing that in.

Speaker 0

这还有另一个层面的连锁效应,你知道吗?

There's another kind of like third order effect to this, you know?

Speaker 0

而且,随着预测越来越远,我认为其中充满了不确定性。

And again, all of these, like as you get to the further and further out predictions, I think there's a lot of uncertainty.

Speaker 0

我认为,如果我们最终进入这样一个世界:出现许多微型公司,它们开发的软件仅供公司所有者——一到两个人——使用和运营。

I think if we end up moving to this world where you end up with these like kind of micro companies building software that works for one or two people who own the company and are working there.

Speaker 0

我认为创业生态系统会发生变化。

I think the startup ecosystem will change.

Speaker 0

我认为风险投资生态系统也会发生变化。

I think the VC ecosystem will change.

Speaker 0

你知道吗,我们可能会进入一个世界,那里只有少数几个大玩家提供平台并支持所有这些初创公司。

You know, we might end up in a world where there's just like a handful of big players that are offering platforms and supporting all of these startups.

Speaker 0

但你知道,那些能够带来百倍或千倍投资回报的风投规模型初创公司,可能会因为大量小型的、价值在1000万到5000万美元之间的公司出现而减少——这些公司虽然不适合风投回报模式,却非常适合那些高度自主、如今借助AI为自己打造事业的个人。

But, you know, the types of venture scale return startups that can really 100 or thousand x your your investment might actually end up shrinking if you end up having a bunch of these, you know, smaller 10 to $50,000,000 companies, which are not great for venture style returns but are great for the individuals, the high agency individuals who are now, you know, really leaning to AI to to to build these businesses for themselves.

Speaker 1

我喜欢我们已经讨论了这么多层的连锁效应。

I love how many order, like, order effects we've been through.

Speaker 1

每当你听到‘第四层效应’这个词时,谢尔文,我只是开个玩笑。

Whenever you hear the fourth order effect now, Sherwin, I'm just joking.

Speaker 0

我实在没法理解,第四层效应太复杂了,超出了我的大脑容量。

I I can't I it's too fourth order is too too it's too gigabrain for me.

Speaker 0

我真的没法想那么远。

I I can't I can't think that far ahead.

Speaker 1

这就像《盗梦空间》,每深入一层,销售过程就变得越慢。

It's like inception where just everything gets slower every time you go deeper into selling.

Speaker 1

是的。

Yeah.

Speaker 1

对。

Yeah.

Speaker 1

每一层都是如此。

Every layer.

Speaker 1

好吧。

Okay.

Speaker 1

至于十亿美元的初创公司,我经常思考这个问题,因为我做的东西根本不是风险投资规模,也没有很高的杠杆效应,但我确实经常收到一些极其荒谬的客户支持请求。

So the billion dollar startup, I've been I think about this a lot because I I'm not gonna be a billion dollar startup because what I'm doing is not venture scale in any way and not super high leverage, but just could see how many support tickets I get from just, like, the most ridiculous things.

Speaker 1

我很难想象一个人能做到。

It's hard for me to imagine one person.

Speaker 1

我对这种十亿美元的初创公司持悲观态度。

Like, I'm bearish on this billion dollar startup.

Speaker 1

我只是想分享这个想法。

I just wanna share this thought.

Speaker 1

仅仅因为支持成本,即使AI在帮你运营一家十亿美元的公司,除非你的客户平均消费额非常高,

Simply because of the support costs, even if AI is helping you at a billion dollars, just like unless your ACVs are, you know,

Speaker 0

非常高

very high

Speaker 1

而且客户数量很少,光是处理支持工作就够呛。

and you have very few customers, just dealing with support.

Speaker 1

而且人们总是这样,你知道的,他们

And people are like, you know, like, they

Speaker 0

能自己解决自己的问题,

can solve their own problems,

Speaker 1

但他们还是会选择发邮件寻求支持。

but they're like, oh, email support.

Speaker 1

问一些这样的问题。

Ask about this thing.

Speaker 1

就我经验而言,单靠处理这些支持工作很难规模化。

Just dealing with that is hard to scale is in my experience.

Speaker 1

所以,依我看来,除非你有一大群外包人员——我不知道,这算不算一个人的公司?

So unless you have in my opinion, unless you have a bunch of contractors, which I don't know, does that count as a single person company?

Speaker 1

我觉得,要让一家十亿美元的初创公司规模化,却没有专人帮你至少处理支持工作,是极其困难的。

I feel like it's very difficult to scale a billion dollar startup and not have someone helping you with at least the support work.

Speaker 1

而AI,我认为只能帮你走到一定程度。

And AI, I think, will only take you so far.

Speaker 0

所以我觉得你说得对。

So I I I think that's true.

Speaker 0

实际上,我对这个问题的看法略有不同:我觉得你的Lenny播客最终可能会成长为一家十亿美元的初创公司。

And actually, I think my view on it is is is slightly different, which is I think that your you know, Lenny's podcast might end up becoming a billion dollar startup.

Speaker 0

但我认为可能发生的是,你不需要亲自去调度AI来处理和解决那些支持工单,反而可能会出现一大批专门为你的需求打造的初创公司。

But what I think might happen is instead of you kind of being the one person who has to dispatch an AI to solve and fix those support tickets, I think what might end up happening is there might be a whole smattering of other startups that are building software and super tailored towards what you might need.

Speaker 0

因此,可能会有十到二十家初创公司专门开发面向播客和通讯简报的支持软件。

And so, there might be like 10 or 20 startups that build support software for podcasts and newsletters.

Speaker 0

这可能是一个一个人的初创公司。

And that might be a one person startup.

Speaker 0

它不需要很大。

Like it doesn't need to be a big one.

Speaker 0

他们可以非常容易地编写出这个产品。

And they might be able to just code up this product very, very easily.

Speaker 0

他们能够自己构建自己的东西。

They are able to kind of like build their own thing.

Speaker 0

因为这个产品如此量身定制、独特,并且 hopefully 有用,它可能成为你作为一个人的十亿美元初创公司所购买的东西。

And because it's so tailored and unique and hopefully, you know, useful for you, it might be something that you purchase as the one person billion dollar startup.

Speaker 0

我会。

I would

Speaker 1

我会买这个。

buy that.

Speaker 1

我会买这个。

I would buy that.

Speaker 0

是的

Yeah.

Speaker 0

这涉及到哪些事情应该由内部完成,哪些应该外包出去。

There's like a question of like what you in house and what you what you like kind of outsource.

Speaker 0

我认为可能发生的情况是,由于编写软件和构建产品的成本急剧下降,你可能会将大量工作外包,从而缩小公司的规模。

And what I think might happen is because the cost of writing software and building products is collapsing so much, you might end up outsourcing a lot of this and in doing so, reducing the size of your company.

Speaker 0

因此,这大概就是我认为可能会出现的世界。

And so that's kind of the world that I think might end up happening.

Speaker 0

同样,这里存在很大的不确定性,但最终结果可能仍然是由一个人主导一个高度杠杆化的庞大公司,甚至可能达到十亿美元的估值。

Again, there's like high uncertainty in what might play out here, but the end result still might be a one like, one person driving this, like, high high massive leveraged company that might actually reach a billion dollars.

Speaker 1

我能理解这一点。

I could see that.

Speaker 1

我还会想到Clawbot/Moldbot/OpenClaw的Peter,他现在被各种请求、邮件、通知、私信和公关信息淹没,而他甚至还没从这个项目中赚到一分钱。

I also think about Peter at Clawbot slash Moldbot slash OpenClaw of just, like, how he barraged he is right now by all these asks and emails and pings and DMs and PRs just like, oh, I'm curious to and he's not even making any money off this thing.

Speaker 0

是的

Yeah.

Speaker 0

我无法想象他现在是什么感觉。

I I can't imagine what it's like to be him right now.

Speaker 0

这一定简直疯狂至极。

It's it must be, like, absolutely insane.

Speaker 0

这大概就像我们发布ChatGPT之后的那几个月,那种疯狂劲儿,是的。

It it it's probably, like, you know, like, the the the months after we launched ChatGBT, the craziness that was Yeah.

Speaker 1

就一个人。

As one as one man.

Speaker 1

是的。

Yeah.

Speaker 1

顺便说一下,他一周后就要出舱了。

He's coming out of the pod, by the way, in in a week.

Speaker 0

哦,这真令人兴奋。

Oh, that's exciting.

Speaker 0

是的。

Yeah.

Speaker 1

也许第四阶效应是分发变得越来越重要,因为有太多东西在争夺你的注意力。

Maybe the fourth order effect is distribution becomes increasingly important because there are so many freaking things trying to get your attention.

Speaker 1

所以,拥有受众和平台的人会变得越来越有价值,这很好,是的。

So people with an audience and platform, I think, become more and more valuable, which is good Yep.

Speaker 1

很棒的内容。

Good stuff.

Speaker 1

好的。

Okay.

Speaker 1

我其实想回到你之前谈的管理话题。

I wanted to come back actually to your management stuff.

Speaker 1

我非常欣赏你关于花更多时间与顶尖员工相处非常有效的见解。

So I I really loved your insight about spending more time with top performers has been really successful to you.

Speaker 1

想想你作为一支团队的管理者,这支团队正在构建一个平台,几乎整个AI经济都基于你的API运行。

Just thinking about you as a manager of a team that is building the platform that powers basically the entire AI economy, like every AI startup is building on your API.

Speaker 1

显然,你做得非常出色。

Clearly, you're doing a great job.

Speaker 1

你还学到了哪些其他重要的管理经验?

What other kind of core management lessons have you learned?

Speaker 1

作为工程师和团队的管理者,你认为什么是成功最关键的因素?

What do you find is really important and key to your success as a manager of engineers and just people?

Speaker 0

是的。

Yeah.

Speaker 0

我认为我在这里学到的很多经验,不一定只适用于OpenAI API或我们的一些企业产品。

I think a lot of the lessons that I've learned here, I don't know how specific it is to the OpenAI API or some of our enterprise products in particular.

Speaker 0

我的管理理念当然随着时间有所变化,但我觉得它保持不变的部分可能比变化的还要多。

I think my management philosophy has obviously changed over time, but I think it's probably stayed the same more than it's changed over time.

Speaker 0

其中一个原则就是我之前跟你提到过的,那就是花大量时间与顶尖员工在一起,说得具体一点,就是超过50%的时间要放在你的顶尖员工身上,可能是你前10%的优秀员工,并尽全力支持他们。

One of these principles is is kind of what I talked to you about before, which is, you know, spending a lot of time with with top performers, like actually spending and like to be very concrete, like, it's like more than 50% of your time with your top performers, with maybe your top, like, 10% performers and really, really trying your best to empower them.

Speaker 0

我看待这个问题的方式,是回到《人月神话》这本书里提到的一个类比,就是把软件工程师比作外科医生。

The way that I think about it is is is is kind of come back to this analogy of software engineer as as as a surgeon, which comes from the the mythical man month book.

Speaker 0

这其实挺有意思的。

So it is actually it is funny.

Speaker 0

所以我从这本书中引用了这个观点,但书中实际上描述了一个世界,我认为他们当时是在预测未来,因为这本书大概是七十年代写的。

So I I pull it from the book, but in the book, they actually describe this world where I think they were like predicting the future because because I think the book was written like in the seventies or something.

Speaker 0

他们说,软件工程可能会发展成一个软件工程师如同外科医生的世界,在手术室里,只有一个人在做实际工作,比如一个人负责切开或进行所有手术操作。

They said that software engineering might end up moving into a world where that software engineers are like surgeons, where in a surgery room, there's one person doing the work and there's the one person cutting or whatever and doing all the surgery.

Speaker 0

而房间里的其他所有人都只是在支持他。

And everyone else in the room is there to just support them.

Speaker 0

对吧?

Right?

Speaker 0

就像护士、助手、住院医师和实习医生一样。

It's like the nurse and like the assistant, the resident and the fellow.

Speaker 0

然后外科医生说:我需要一把手术刀。

And then the surgeon's like, I need a scalpel.

Speaker 0

他们就把手术刀递给他。

They give them scalpel.

Speaker 0

然后他说:我需要这个工具和这台机器,他们就会把东西拿过来。

And then they're like, I need this tool and this machine and they'll bring it over.

Speaker 0

所有人都在支持那位主刀医生。

Everyone's there to just like support the one surgeon.

Speaker 0

因此,《神话般的猛犸象》实际上预测了软件工程将朝这个方向发展。

And so the Mythical Mammoth actually predicted that that is kind of the direction that software engineering is going to go.

Speaker 0

我认为这并没有完全实现,因为现在的工作方式更加协作,并非只有一个人在做事,但我一直非常喜欢这个比喻。

I don't think that's exactly played out where like, you know, it's much more collaborative and like, it's not only one person doing the work, but I've always really liked that analogy.

Speaker 0

这个比喻实际上也是我管理哲学中努力效仿的:软件工程并不是像外科手术那样只有一个人在工作,但我作为管理者,希望以这种方式对待团队成员——赋予他们权力,让他们感觉自己就是主刀医生。

And that analogy is actually what I strive to kind of emulate in my own management philosophy, which is software engineering isn't really like surgery where it's not just one person doing work, but the way in which I like treating the people on my team and the way that I act as a manager is I want to empower them, make them feel like they're a surgeon.

Speaker 0

我会尽力支持他们,确保他们拥有完成工作所需的一切。

And in so far as like making sure that I'm supporting them and making sure they have everything that they need to do their work.

Speaker 0

这让他们感觉背后有一支庞大的团队在支持他们,提前预见问题,并在他们需要时提供一切,而实际上,这一切都只是我这个管理者在运作。

And it feels like they have an army of people kind of supporting them and looking around corners and giving them everything that they need when it's really just me as the manager.

Speaker 0

我举的例子是,提前预见障碍并消除阻碍,尤其是在组织层面,这极其有用。

And so like the example that I give is looking around corners and unblocking people, especially from an organizational perspective, is extremely, extremely useful.

Speaker 0

再说回到人工智能的话题,如今这显得更加重要了,对吧?

And again, going back to the AI conversation, it's even more important nowadays, right?

Speaker 0

比如,当人们不断提交拉取请求时,真正阻碍进展和交付成果的往往是组织或流程层面的问题。

Like if people are just like cranking PR after PR, the main thing, bottlenecking progress and shipping something tends to be organizational or process oriented.

Speaker 0

如果你作为管理者能提前预见并为团队扫清障碍,比如外科医生需要手术刀时,管理者已经提前准备好了一把,那就是

And if you as a manager can kind of look around corners and kind of unblock the team, if you can, like if the surgeon needs scalpel, but the manager kind of already has a scalpel ready for them, that's

Speaker 1

最理想的情况。

the best case scenario.

Speaker 0

这正是我对待管理、尤其是工程管理的方式。

That's kind of the way that I approach management and especially engineering management.

Speaker 0

因此,这种理念多年来一直深深印在我的脑海中。

And so that's something that's really, really stuck with me over time.

Speaker 0

尽管软件工程师并不是真正的外科医生,但这个比喻在我的整个职业生涯中始终萦绕在我心头。

And even though, you know, software engineers aren't exactly surgeons, that metaphor has always kind of stayed in my mind as of the rest of my career.

Speaker 1

我非常喜欢这个说法。

I love that.

Speaker 1

我觉得我在想,AI是否能帮助我们提前预见,比如某个工程师会被某个决策卡住。

And I I feel like I wonder if that's something AI can help with is look around corners and predict here, this engineer is gonna be blocked by this decision.

Speaker 1

我们需要弄清楚这个问题。

We need to figure this out.

Speaker 1

我们需要搞定。

We need to get Yeah.

Speaker 0

这其实是个非常好的观点。

That's actually a really good point.

Speaker 0

我还没试过,但我很好奇,如果我让 ChadGPT 连接公司知识库,去查询一下当前的阻碍因素,会怎么样?

I haven't tried this yet, but I wonder what would happen if I ask ChadGPT hooked up to company knowledge, you know, like, are the active blockers?

Speaker 0

查看所有 Notion 文档。

Look through all the Notion docs.

Speaker 0

看看可能的 Slack 消息。

What are maybe Slack messages.

Speaker 0

你知道,这些信息 probably 就藏在 Slack 的某个地方。

You know, it's probably in Slack somewhere.

Speaker 0

我的团队目前有哪些主要阻碍?

What are the active blockers on my team?

Speaker 0

有什么我可以帮忙的吗?

And is there something I can do to to help?

Speaker 0

这真的很有趣。

Now that's very interesting.

Speaker 0

我之前没想过这一点,但你说得对。

I have not thought about that, but you're right.

Speaker 1

我刚刚有了一个想法。

Just had an insight right here.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 1

更有趣的是,你预计这位工程师或这个团队在未来几个月会遇到什么障碍?

And it's I think even more interestingly, what do you anticipate will be a blocker for this engineer or this team in the in the coming months?

Speaker 0

是的。

Yeah.

Speaker 0

你让模型、你让AI去处理第二和第三层次的事情了。

You asked the you asked the model you asked the AI to do the second and third order things.

Speaker 0

提前想想吧,老兄。

Anticipate that, man.

Speaker 0

也要预测一下下个月博主们会做什么。

Anticipate what the vloggers will be next month too.

Speaker 0

我们很棒。

We're great.

Speaker 1

我觉得我们这里已经有了一个好想法。

I think we've got a we've got a good idea right here.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 1

本集由Datadog赞助播出,Datadog现已整合Epo,这是领先的实验与功能开关平台。

This episode is brought to you by Datadog, now home to Epo, the leading experimentation and feature flagging platform.

Speaker 1

全球顶尖公司的产品经理使用Datadog——与工程师每天依赖的同一平台——将产品洞察与产品问题(如漏洞、用户体验摩擦和业务影响)联系起来。

Product managers at the world's best companies use Datadog, the same platform their engineers rely on every day, to connect product insights to product issues like bugs, UX friction, and business impact.

Speaker 1

它从产品分析开始,产品经理可以观看回放、分析漏斗、深入研究留存率,并探索增长指标。

It starts with product analytics, where PMs can watch replays, review funnels, dive into retention, and explore their growth metrics.

Speaker 1

当其他工具止步不前时,Datadog却更进一步。

Where other tools stop, Datadog goes even further.

Speaker 1

它能帮助你真正诊断漏斗流失、漏洞和用户体验摩擦所带来的影响。

It helps you actually diagnose the impact of funnel drop offs and bugs and UX friction.

Speaker 1

一旦你知道该聚焦哪里,实验就能验证哪些方法有效。

Once you know where to focus, experiments prove what works.

Speaker 1

当我还在Airbnb时,我亲身体验过这一点,我们的实验平台对于分析哪些方法有效、哪些环节出错至关重要。

I saw this firsthand when I was at Airbnb, where our experimentation platform was critical for analyzing what worked and where things went wrong.

Speaker 1

而打造Airbnb实验平台的同一团队,也创建了Epo。

And the same team that built the experimentation at Airbnb built Epo.

Speaker 1

Datadog 还通过会话回放功能,让你超越数字本身。

Datadog then lets you go beyond the numbers with session replay.

Speaker 1

通过热力图和滚动图,精确观察用户如何与产品互动,真正理解他们的行为。

Watch exactly how users interact with heat maps and scroll maps to truly understand their behavior.

Speaker 1

所有这些功能都由与实时数据关联的功能标志驱动,使你能够安全发布、精准定位并持续学习。

And all of this is powered by feature flags that are tied to real time data so that you can roll out safely, target precisely, and learn continuously.

Speaker 1

Datadog 不仅仅是工程指标。

Datadog is more than engineering metrics.

Speaker 1

它是优秀产品团队快速学习、智能修复并自信发布的地方。

It's where great product teams learn faster, fix smarter, and ship with confidence.

Speaker 1

前往 datadoghq.com/lenny 申请演示。

Request a demo at datadoghq.com/lenny.

Speaker 1

就是 datadoghq.com/lenny。

That's datadoghq.com/lenny.

Speaker 1

好的。

Okay.

Speaker 1

我现在要转向谈谈你们所有人所构建的API和平台。

I'm gonna shift to talking about the API and the platform that you all build.

Speaker 1

所以你与许多公司合作,帮助它们基于你的工具实施你的API和平台。

Some so you work with a lot of companies implementing your API, your platform building on on your on your tools.

Speaker 1

你告诉我,许多公司实际上在AI部署上的投资回报率为负,这我觉得正是很多人阅读、感受和认为的情况,而你亲眼看到这一点,这很有趣。

You told me that you find that a lot of companies actually have negative ROI on their AI deployments, which, I think is what a lot of people read about and feel and think, and it's interesting you're actually seeing that.

Speaker 1

那里到底发生了什么?

What what's going on there?

Speaker 1

他们做错了什么?

What are they doing wrong?

Speaker 1

在AI和我们的部署方式中,世界上正在发生什么?

What do you what's happening in the world of AI and deployments in our way?

Speaker 0

是的。

Yeah.

Speaker 0

为了明确一点,我并没有明确看到这方面的量化数据。

So to be clear, I don't explicitly see quantitative numbers around this.

Speaker 0

实际上,衡量这些事情非常困难。

It's actually really hard to measure these things.

Speaker 0

但通过观察一些公司尝试使用AI,我对许多AI部署实际上具有负投资回报率这一点并不感到惊讶。

But especially from observing some companies kind of trying to do AI, I would not be surprised if a lot of AI deployments are actually, you know, negative ROI.

Speaker 0

我的意思是,这部分原因也在于,我认为全国范围内,尤其是科技圈之外的人,普遍觉得AI被强加给了他们。

I mean, part of this too is I think there's also general sentiment from folks around the country, like, basically outside of tech that AI is being forced onto them.

Speaker 0

我认为这在一定程度上是某些AI部署投资回报率为负的体现。

And I think part of this is is is probably a symptom of some negative ROI AI deployments.

Speaker 0

关于这一点,我观察到一些情况。

A couple of things I've observed around this.

Speaker 0

其中一个方面是,我想我一再回到这个问题上。

So one one thing is and I think I I come back to this again and again.

Speaker 0

我觉得我们在硅谷的人常常忘记我们生活在一个泡沫里。

Like, I think we in Silicon Valley just forget that we live in a bubble.

Speaker 0

我们太沉浸于推特了,抱歉,是X这个泡沫了。

Like, we are so like, Twitter is a bubble sorry, X is a bubble.

Speaker 0

硅谷是一个泡沫。

Silicon Valley is a bubble.

Speaker 0

软件工程是一个泡沫。

Software engineering is a bubble.

Speaker 0

世界上大多数人,美国的大多数人并不是软件工程师,也没有被AI深深影响,也不会关注每一个模型的发布。

Most people in the world, most people in The US are not software engineers, are not very AI pilled, are not following every single model release.

Speaker 0

因此,我们完全不了解如何使用这项技术。

And so we're just highly out of the loop on how to use this technology.

Speaker 0

所以我们总是谈论关于Codex的所有最佳实践,以及OpenAI内部那些沉迷于Codex的人。

And so, we always talk about all these best practices for Codex, all these Codex Pill people within OpenAI.

Speaker 0

我敢肯定,X上发帖的每个人都是这些AI工具的超级用户。

I'm sure everyone on X who posts are like crazy power users of these AI tools.

Speaker 0

他们热衷于提升技能,热衷于使用智能代理。

You know, they lean into skills, they lean into agents.

Speaker 0

MD。

MD.

Speaker 0

MCPs。

MCPs.

Speaker 0

是的。

Yes.

Speaker 0

对。

Yeah.

Speaker 0

所有这些。

All of that.

Speaker 0

当我与一些公司交谈,并与实际使用这些工具的员工交流时,发现他们试图做的都是最基础的事情。

And when I talk to some of these companies and I talk to the actual employees using these, it's like the most basic thing that they're trying to do.

Speaker 0

他们对这项技术的工作原理几乎一无所知。

And they like have very little understanding of exactly how this technology works.

Speaker 0

因此,这对我来说是一个重要的观察:他们只是在向这些工具提出非常简单的问题。

And so that's kind of like one big observation for me, which is like, they're asking very simple questions of these things.

Speaker 0

他们目前还没有真正去深入挖掘它。

They're really not pushing it just yet.

Speaker 0

因此,这又回到了我认为更多公司应该做或实际该做的事情,以及更理想的AI部署模式是什么样子。

And so that kind of goes back to, that's kind of ties into what I think more companies do or like what should do, or what a more ideal AI deployment setup looks like.

Speaker 0

这也是我们在OpenAI内部运作的方式。

And this is kind of how we've run things within OpenAI too.

Speaker 0

我认为真正取得良好成效的公司,都具备自上而下的支持。

The companies where I think it started to work really well, have a combination of both top down buy in.

Speaker 0

也就是说,高管层明确表示:我们要成为一家以AI为先的公司。

So it's like the C suite, like, we want to become an AI first company.

Speaker 0

因此,他们不仅认同并采购了这些工具,还获得了管理层的支持,同时也实现了自下而上的采纳与认同。

So there's buy in, they buy the tools, they have exec support, but it also has bottoms up adoption and buy in.

Speaker 0

我的意思是,有真正的员工在积极使用这项技术,他们对它充满热情,愿意学习、推广、建立最佳实践,并在组织内部分享知识。

And so what I mean by that is it has like actual employees doing the work who are really excited about this technology and are willing to learn, evangelize, build best practices and kind of like knowledge share within the organization.

Speaker 0

我们在内部已经多次看到这种情况。

We've seen this a lot internally.

Speaker 0

显然,OpenAI一直希望成为一家以AI为核心的公司,但真正开始蓬勃发展,是在Codex和这些工具推出之后,员工们能够亲自将它们应用到自己的工作中。

So like obviously OpenAI has always wanted to be a very AI centric company, but when it really started taking off was with the introduction of Codex and these tools where people then like actual employees themselves could start applying it to their work.

Speaker 0

我认为你真的需要这样,因为归根结底,每个人的工作都截然不同。

And I think you really need this because at the end of the day, everyone's work is very different.

Speaker 0

非常独特。

It's very unique.

Speaker 0

软件工程不同于金融,也不同于运营、市场推广和销售。

Software engineering is different than finance, is different than operations, different than go to market and sales.

Speaker 0

因此,工作中有很多最后一步的细节,必须通过自下而上的方式来完成。

And so there's like a lot of these last mile intricacies of work that needs to really be done in a bottoms up fashion.

Speaker 0

所以我的感觉是,许多AI部署都没有实现自下而上的采纳。

And so my sense is a lot of these AI deployments don't have bottoms up adoption.

Speaker 0

这就像一项强制指令,完全是自上而下的,与实际工作内容完全脱节。

It was like an exact mandate and it's extremely top down and is very divorced from what the actual work looks like.

Speaker 0

结果就是,你面对的是一个根本不懂这项技术的庞大员工群体。

And as an end result, you end up with a giant workforce that doesn't really understand the technology.

Speaker 0

就像,我知道我应该用这个,也许它还和我的绩效考核有关,但我并不确定该怎么做。

It's like, I know I'm supposed to use this and maybe it's like on my performance review too, but I'm not sure what to do.

Speaker 0

他们环顾四周,发现没人这么做。

And they look around, no one else is doing it.

Speaker 0

没有人可以学习。

There's no one else to learn from.

Speaker 0

因此,我建议那些推动这一举措的公司,找一个或甚至组建一个全职的内部团队,作为这样的内部特遣小组,去全面探索这项技术的能力,将其应用到具体的工作流程中,进行知识分享,并激发那些可能想使用这项技术的人的热情。

And so my recommendation for companies kind of pushing this is find, or maybe even staff a full time team internally that is this kind of tiger team internally that can explore the full extent of the capabilities, apply to specific workflows, do the knowledge sharing, create excitement within folks who might wanna use this technology.

Speaker 0

因为如果没有这样的团队,这项技术实际上很难上手。

Because in the absence of that, it's very difficult to it's actually very difficult to pick up.

Speaker 1

你会把谁放在这个特遣团队里?

And who would you put on this TIGR team?

Speaker 1

是工程师主导的吗?

Is it like engineer led?

Speaker 1

根据你的经验,你发现是这样吗?

Do you find in your experience?

Speaker 1

这是一个跨职能的团队吗?

Is it a cross functional sort of team?

Speaker 0

是的,这很有趣。

Yeah, it's interesting.

Speaker 0

因为很多公司根本没有软件工程师。

Because so also a lot of companies don't have software engineers.

Speaker 0

所以我看到的模式是,通常是那些与软件工程相关、本质上是技术人员但不是软件工程师的人。

And so the pattern I've seen is it tends to be these like software engineering adjacent, like basically technical people, but are not software engineers.

Speaker 0

我认为这些人最有可能对这项技术感到兴奋。

I think those are the ones who get tend to get most excited around this.

Speaker 0

就像是,你知道的,也许是那个不写代码但非常喜欢使用这些工具的支持团队运营主管,或者是个Excel高手之类的人。

It's like, you know, maybe the it's like maybe the like, you know, support team operations lead who doesn't code, but loves using these tools and is like an Excel wizard or something.

Speaker 0

所以他们是技术相关或编程相关,而且相当懂技术。

And so it's like technical adjacent or coding adjacent and pretty technical.

Speaker 0

我看到的这些公司里,正是这类人真正被点燃并对此感到兴奋。

Those are the kinds of people I've seen in these companies who just really light up and get excited around this.

Speaker 0

你通常可以围绕这样的人组建一个团队。

And you can usually build a team around that.

Speaker 0

但确实,通常并不是软件工程师。

But yeah, it's like oftentimes not software engineers.

Speaker 0

软件工程师可能会理解这一点,但并不是每家公司都有软件工程师。

Software engineers, I think will understand this, but not every company has software engineers.

Speaker 0

实际上,这种情况还挺罕见的。

It's actually kind of a rarity.

Speaker 0

很难找到他们。

They're hard to find.

Speaker 0

他们很贵。

They're expensive.

Speaker 0

所以是这些其他类型的人。

And so it's these other types of folks.

Speaker 1

我听到的是,反模式是自上而下的。

What I'm hearing is the anti pattern is top down.

Speaker 1

这是CEO直接找高管团队说:我们要全面拥抱AI。

This is very the CEO found an exec team just like, we are gonna go AI first.

Speaker 1

我们要以AI为引领。

We're gonna lead into AI.

Speaker 1

每个人的表现都将通过AI工具来评估,看你因AI而提升了多少生产力。

Everyone's gonna be judged on their performance using AI tools, how much your productivity is increasing thanks to AI.

Speaker 1

但如果只是自上而下推行,而不建立一个自下而上、传播理念的团队,你会发现这种方式行不通。

And without, with that being just top down and not creating a team that is bottom up, spreading the gospel, you find that doesn't work.

Speaker 0

是的,没错。

Yeah, exactly.

Speaker 0

没错。

Exactly.

Speaker 1

建议是:找到那些最热情的人,不要让他们各自分散在组织中,真正有效的方式是组建一个小型的AI传道者团队,他们。

And the advice is, find the people that are most excited and instead of kind of having them spread out through the organization, you're what you find works is create a little t AI kind of evangelist team that Yeah.

Speaker 1

会探索如何使用AI,并将其推广到整个工作中。

Finds ways to use it and kind of spreads it across the work.

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,这就像你把之前的话重新说给我听一样。

I mean, another it's kinda like hearing you you play back to me.

Speaker 0

另一种思考方式,跟我自己的管理理念相关,就是找到那些在AI应用上表现突出的人,并给予他们支持。

Another way to think about it, kinda tying back to my own management philosophy is find the high performers in AI adoption and empower them.

Speaker 0

你知道的。

You know?

Speaker 0

让他们组织黑客马拉松。

Let them build hackathons.

Speaker 0

让他们举办研讨会,进行知识分享,从内部培育起兴奋的种子。

Let them, you know, hold seminars, do knowledge sharing, kind of create the seeds of excitement internally.

Speaker 1

好的,太棒了。

Okay, amazing.

Speaker 1

我想听听你的一些大胆观点,一些我见过你分享过的看法。

There's a couple of hot takes I wanna hear from you, something that I've seen you talk about and share.

Speaker 1

其中一个观点是,你曾提到,与客户交谈并倾听客户的意见,并不总是AI领域的正确策略,有时反而会误导你。

One is, you've shared that talking to customers and listening to customers is not always the right strategy in AI and it might often lead you astray.

Speaker 0

我不确定这算不算一个很颠覆的观点。

I don't know if it's that hot of a take.

Speaker 0

我认为这里最关键的是,你当然应该和客户交流。

I think the main thing here is so obviously you should talk to your customers.

Speaker 0

和客户交流确实很有用。

Like, it's it's like useful to talk to customers.

Speaker 0

我只是觉得,AI领域,尤其是我过去三年在开发API并见证其不断演变的过程中,这个领域和模型本身变化得太快了。

I just think the AI field, especially what I've seen over the last kind of like three years working on the API and seeing kind of all that evolve, is the field and the models themselves are just changing so, so quickly.

Speaker 0

它们往往会自我颠覆,特别是在工具和框架层面。

They tend to like disrupt themselves, especially around the like tooling and the scaffolding space.

Speaker 0

所以这周早些时候,我读到一句话,来自一个叫尼古拉斯的人在X平台上的文章,他是初创公司Fintool的创始人,他分享了许多在为金融服务构建AI代理过程中积累的最佳实践,我想他最初就是在Fintool做的。

So there there's this quote that I read actually earlier this week from a it's from an X article by this guy named Nicholas, who's who's the founder of a a startup called Fintool, where I think he was sharing a lot of the best practices that he has learned through building AI agents for financial services, I think at a start at Fintool.

Speaker 0

他有一句话我觉得特别好,就是‘模型会把你搭建的框架当早餐吃掉’。

And he had this phrase that I thought was really good, which is the models will eat your scaffolding for breakfast.

Speaker 0

如果你回溯到2022年,也就是ChatGPT刚发布的时候,这些模型还很原始。

Like, if you look if you rewind back to 2022, right when ChatGPT launched, these models are pretty raw.

Speaker 0

当时在开发者领域,有大量产品架构和工具,主要是为了引导模型、围绕它构建架构,以让它完成你想要的任务。

And there was like all this product scaffolding and and things, especially in the developer space to basically try and steer the model and build a scaffolding around it to get it to do what you want.

Speaker 0

比如智能体框架、向量存储,那时候这些都非常流行。

Like agent frameworks, there's vector stores, think was like really popular back then.

Speaker 0

还有一大堆各种各样的工具。

And just like a whole smattering of tools here.

Speaker 0

随着你观察到这个领域的发展,模型已经发生了巨大变化,变得好得多,以至于它们确实会直接‘吃掉’一部分架构。

And as you've kind of seen the field play out that the models have just changed so much that and gotten so much better that they end up, yeah, literally eating some of the scaffolding.

Speaker 0

我认为这一点在今天也同样成立。

And I think this is even true today.

Speaker 0

所以我认为,尼古拉斯的这篇文章提到的当前流行的架构,是基于技能文件的上下文管理。

So I think the article from Nicholas actually is, you know, the current scaffolding, which is fashionable is skills files based context management.

Speaker 0

我能想象,未来某一天,这种方式可能不再有用,模型自己就能管理所有这些内容,或者可能会转向某种新范式,不再需要这种基于文件的技能系统。

I could see a world where at some point, you know, that's no longer useful, where the model can actually, you know, manage all that themselves or like, you know, or there might be, know, it's hard to predict, but like might move on to some new paradigm where you no longer need this file based skills type thing.

Speaker 0

你确实亲眼见证了这种演变,对吧?

You have literally seen this play out, right?

Speaker 0

像代理框架,我觉得现在用处小多了。

Like the agent frameworks, I think are a little less useful now.

Speaker 0

在2023年左右,我们曾认为向量存储会成为向模型注入组织性上下文的主要方式。

There was a period of time like 2023 where we thought vector stores is gonna be like the main way for you to bring organizational context into the models.

Speaker 0

你需要将整个语料库的每一部分都进行向量化和嵌入。

And you need to vectorize and then embed every bit of your corpuses.

Speaker 0

然后你还要花大量精力去研究向量搜索,优化它,以便在正确的时间提取出正确信息。

And then you do all this work to figure out the vector search, to optimize that, to pull out the right information at right time.

Speaker 0

所有这些都属于临时搭建的结构,因为当时的模型还不够强大。

All of that is scaffolding because the model was not good enough.

Speaker 0

结果发现,随着模型变得更好,更优的做法其实是去掉大量这类逻辑,信任模型本身,并给它提供一组搜索工具。

And turns out, in this case, it turns out as the models get better, a better approach is actually to take out a lot of that logic and trust the model and give it a set of tools for search.

Speaker 0

它并不需要一个向量存储。

It doesn't need to be a vector store.

Speaker 0

你完全可以把它连接到任何类型的搜索系统上。

You could actually just hook it up to any type of search.

Speaker 0

它完全可以是文件系统上的文件,比如 skills 和 agents MD,用来引导它。

It could literally be files on a file system like skills and agents MD to kind of steer it as well.

Speaker 0

当然,向量存储仍然有其用武之地。

Obviously, there's still a place for vector stores.

Speaker 0

我知道很多公司仍在使用它,但围绕它构建的整个基础设施、围绕它建立的整个生态系统,以及认为这就是唯一需要的基础设施的观念,已经发生了巨大变化。

I know a lot of companies are still using it, but the entire scaffolding around that and building an entire ecosystem around that and assuming that's the only scaffolding that you need has really changed.

Speaker 0

因此,回到这一点:你并不总是必须听从客户的意见,因为这个领域变化太快,很多人现在都处于局部最优状态。

And so tying this back to the like, you don't always have to listen to your customers Because the field is changing so much at any point in time, a lot of people are kind of in this local maximum.

Speaker 0

如果你盲目听从客户,他们会说:是的,我想要一个更好的向量存储。

And if you just blindly listen to your customers, they'll be like, yeah, I want a better vector store.

Speaker 0

比如,我想要一个更好的代理框架来解决这个问题。

Like, I want a better agent framework for this.

Speaker 0

如果你只一味追随这条路径,实际上会让你去构建一个再次陷入局部最优的东西。

And if you had just kind of only chased down that path, it actually would have led you to build something that again is the local maxima.

Speaker 0

而随着模型变得越来越好,我们必须重新发明并重新思考围绕这些模型构建的正确抽象、工具和框架。

Whereas as the models get better, we've had to reinvent and kind of rethink the right abstractions and the right tools and frameworks to build around these models.

Speaker 0

最酷、最令人兴奋却又有点烦人的是,这是一个移动的目标。

And the coolexcitingkind of crazy annoying part is it's a moving target.

Speaker 0

因此,目前这些零散的工具和框架很可能需要随着模型变得越来越智能而发生显著的演变和改变。

And so, like the current smattering of tools and frameworks right now will likely need to evolve and change pretty significantly over time as the models get smarter and better.

Speaker 0

但这就是构建这个领域的真实本质。

But that is just the nature of building this space.

Speaker 0

我认为这正是它令人兴奋的原因。

Think that's what makes it exciting.

Speaker 0

但这也意味着,当你与客户交流时,你需要平衡他们提出的具体反馈与你对模型未来走向以及未来一两年内趋势的判断。

But it also means when you talk to customers, you kind of need to balance the exact feedback that they want with where you think the models are going and where you think things will trend over the next one or two years.

Speaker 1

有趣的是,这正体现了AI和机器学习领域学到的苦涩教训:你越少过度复杂化,越少在机器学习和AI中添加人为逻辑,它就越能扩展和成长,索性全部剥离,只让系统纯粹地计算,只需赋予它更多算力即可。

It's interesting how this is, the bitter lesson is, you know, this big lesson that AI and ML folks learned, which is just like, don't the less you overcomplicate, the less logic you add to to machine learning to AI, the more it'll be able to scale and grow and just like take it all away and let it just compute basically, just give it more power to get

Speaker 0

是的,这实际上是苦涩教训在AI构建中的一个版本——我们曾经试图围绕它设计所有这些架构,结果发现模型自己就把这些都消化掉了。

Yeah, there's smarter on its literally a version of the bitter lesson applied to like building with AI where, you know, we were trying to architect all this stuff around and turns out the models have just kind of, you know, eat it all away.

Speaker 0

老实说,OpenAI API团队也犯过类似的错误,我们曾经在不该转弯的时候走了弯路。

And and and and honestly, like OpenAI API team has like been guilty of this where we kind of like took some, you know, left and right turns when we shouldn't have.

Speaker 0

但模型仍在不断进步,我们每天都在反复学习这个苦涩的教训。

But, the models still end up models get better and we're all learning the bitter lesson day in and day out.

Speaker 1

那么,对于那些正在使用API或构建智能体、目前不得不围绕这些技术做一些开发的人来说,关键启示是什么?

So what would be the the key takeaway for folks building on, say, the API or just building agents and, you know, having to build a little bit of this around for now?

Speaker 1

就是是的。

Is it just yeah.

Speaker 1

有什么建议吗?

What would be the advice?

Speaker 0

我的总体建议是,我长期以来一直这样告诉别人,而且我认为今天依然适用:确保你为模型未来的发展方向而构建,而不是为它们今天的现状而构建。

My general advice, and I've been giving this to people for a while and I think it's still true today, is make sure you're building for where the models are going and not where they are today.

Speaker 0

你知道,这明显是一个移动的目标。

You know, the the it's it's clearly a moving target.

Speaker 0

我认为我见过的许多公司,尤其是初创企业,之所以能做得特别好,是因为它们为一种理想能力构建了产品,而这种能力今天大约已经实现了80%。

And I think a lot of the companies that I've seen, startups that I've seen really, really do well, is they build a product for an ideal type of capability that is like maybe 80% of the way there today.

Speaker 0

于是,它们最终推出的产品看起来差不多能用,但就是还差那么一点点。

And it like, they end up having a product that like kind of works, but it's like just almost there.

Speaker 0

但随着模型变得越来越好,突然间一切就贯通了,他们的产品也因此变得极其出色,因为现在它真的能用了——比如在某个时刻,它突然就运行得起来了。

But then as the models get better, suddenly it might click and then their product now is incredible because it works, like like maybe with like three at some point, it suddenly works.

Speaker 0

但到了5.1、5.2版本,它突然就解锁了。

But 5.1, 5.2 suddenly it unlocks it.

Speaker 0

但他们构建这些产品时,始终考虑着模型能力的持续提升。

But they're building these products with the model capability improvements in mind.

Speaker 0

这样一来,你最终创造出的体验,远比一开始就假设模型能力固定不变要好得多。

And with that, you end up creating an experience that's way better than if you had assumed that it's it's static in the first place.

Speaker 0

因此,这就是我的总体建议:要为模型未来的发展方向而构建,而不是基于它们今天的状态。

And so that would be my general advice, which is, you know, build for where the models are going and not where they are today.

Speaker 0

你会打造出更好的产品。

You end up building a better product.

Speaker 0

你可能需要稍微等一等,但要知道,模型的进步速度实在太快了,你通常并不需要等太久。

You may need to, you know, like wait a little bit, but like, know, the models are getting so much better so quickly, you often don't need to wait that long.

Speaker 1

那么顺着这个思路,未来六到十二个月,API的发展方向会是什么?

So to follow that thread, where are like in the next six to twelve months, where is the API heading?

Speaker 1

平台的发展方向是什么?

Where's the platform heading?

Speaker 1

模型的发展方向是什么?

Where are the models heading?

Speaker 1

你能分享多少就多少吧,我知道这里有很多机密,也许你更兴奋的是那些,或者你觉得人们应该开始为哪些方面做准备?无论你能分享多少都行。

As much as you can share, I know there's a lot of secrets here that maybe you're more excited about or do you think that people should start to prepare for and however much you can share?

Speaker 0

我的意思是,很明显的一个问题是,这些模型能连贯地完成多长的任务。

I mean, so the obvious one is, how long of a task, these models can do coherently.

Speaker 0

比如有一个叫Meter的基准测试,它追踪软件工程任务,看看这些模型在50%的时间和80%的时间内能完成多长的任务。

So there's like the the meter benchmark that that I think tracks software engineering tasks and how long, you know, like, how long of a task can these models do 50% of the time, 80% of the time.

Speaker 0

我认为,目前前沿模型已经有50%的概率能够完成数小时的软件工程任务。

I think we're at something like multi hour tasks being able to be done by software engineering tasks being able to be done by these frontier models 50% of the time.

Speaker 0

而80%的概率对应的大概是不到一小时。

And then I think 80% is something like just under an hour.

Speaker 0

但关于这张图表令人清醒的一点是,它还把所有之前的模型都画在了上面。

But the the the sobering thing about that that chart is they plot all the previous models on this chart as well.

Speaker 0

所以你能清楚地看到这个趋势。

So you can really see the trend of this.

Speaker 0

这正是我非常兴奋的一点,我觉得如今的产品其实主要优化的是模型能连续执行几分钟的任务。

That's something that I'm really excited about, which is, you know, I actually think products today really optimize for tasks that the model can do for like minutes at a time.

Speaker 0

即使是代码工具和类似的东西,我觉得也主要是在命令行界面中。

Like even codecs and like the coding tools, I'd say like, you know, it's in the CLI.

Speaker 0

你正在看到它变得更具交互性。

You're kind of like seeing it be interactive.

Speaker 0

它确实非常优化,适合最多十分钟左右的任务。

It's really, you know, quite optimized well for like maybe at most ten minute type tasks.

Speaker 0

我见过有人把代码工具推到极限,执行长达数小时的任务。

I have seen people push Codecs to the limit and do like multi hour long tasks.

Speaker 0

但同样,我认为这更多是例外情况。

But again, I I I think that that's more of the exception.

Speaker 0

但如果你关注这个趋势,我觉得在未来十二到十八个月内,我们可能会看到模型能够非常连贯地执行数小时长的任务。

But I if you follow this trend, like, I think, like in the next twelve to eighteen months, we could see models that could do multi hour long tasks very, very coherently.

Speaker 0

在某个时候,它可能会达到每天六小时的任务长度,你可以将任务分配出去,让它自己独立运行一段时间。

At some point it might reach like, you know, six hours a day long task where you kind of like dispatch it and have it do things on its own for a while.

Speaker 0

围绕这一点构建的产品将会非常不同。

The types of products you build around that will look very different.

Speaker 0

你想给模型提供反馈。

You wanna give the model feedback.

Speaker 0

你显然不希望它一整天完全失控,也许你希望,但大概率你并不希望。

You obviously don't want it to completely run wild for a day, maybe you do, but you probably don't.

Speaker 0

然后,你能让模型完成的事情范围将大大扩展。

And then the universe of things you can have the model do really expand.

Speaker 0

所以,这是我非常期待看到的进展。

So that's something that I'm really excited about seeing.

Speaker 0

在未来十二到十八个月里,另一个让我非常期待的方面是多模态模型的改进。

Another thing over the next twelve to eighteen months where I think would be really cool is improvements in the multimodal models.

Speaker 0

说到多模态,我主要指的是音频方面。

So and and actually by by multimodality, I'm mostly thinking about audio.

Speaker 0

在这里,模型在音频方面已经相当不错了,我认为在未来六到十二个月内,它们在音频上的表现会大幅提升,尤其是原生的多模态模型和语音到语音的模型。

Here where the models are pretty good at audio, I think they're gonna get a lot better at audio over the next six to twelve months, especially the likes, native multimodal model, the speech to speech ones.

Speaker 0

我认为在多模态音频方面,围绕新型模型和架构的研究也在进行中。

I think there's also interesting work being done around new types of models and architectures on the multimodal audio side as well.

Speaker 0

但音频,尤其是在企业与商业环境中,仍然是一个被严重低估的领域。

But audio, especially in the enterprise and in a business setting, think is a hugely underrated domain still.

Speaker 0

每个人都谈论编程。

Like everyone talks about coding.

Speaker 0

一切都在文本中。

It's all text.

Speaker 0

但我们实际上是在用音频交流。

But we're talking in audio.

Speaker 0

世界上大量的商业活动都是通过音频完成的。

A lot of the world's business is done via audio.

Speaker 0

许多服务和运营都是通过交谈和音频进行的。

A lot of services and operations are done via talking and audio.

Speaker 0

所以我认为在未来十二到十八个月内,这个领域将会变得非常令人兴奋。

And so I think that that area is gonna look very exciting in the next twelve to eighteen months.

Speaker 0

我认为在音频模型方面,我们还能解锁更多可能性。

And I think there will be even more unlock for what we can do with with audio models there as well.

Speaker 1

太棒了。

Amazing.

Speaker 1

简单总结一下。

So quick summary.

Speaker 1

预计智能体和AI工具将沿着这一趋势持续发展,音频和语音将变得越来越重要,成为更主流、更原生、更出色且核心的体验。

Expect agents and AI tools to run longer to that that trajectory to continue to increase, and then audio and speech becoming a bigger deal, more first party and and native and better and core to the experience.

Speaker 0

是的。

Yeah.

Speaker 1

非常酷。

Extremely cool.

Speaker 1

好的。

Okay.

Speaker 1

我想回到你之前的一个大胆观点,另一个我见过你讨论过的观点。

I wanna go back to one of your hot takes, another hot take that I've seen you discuss.

Speaker 1

你对商业流程自动化在人工智能时代的机会非常乐观。

You're big you're very bullish on business process automation as an opportunity in the world of AI.

Speaker 1

谈谈这个。

Talk about that.

Speaker 0

是的。

Yeah.

Speaker 0

这其实回到我之前说过的一点:我们生活在硅谷的泡沫里。

This go this goes back to the thing that I said previously, which is we we live in a bubble in Silicon Valley.

Speaker 0

我们所从事的很多工作——比如软件工程、产品管理、开发产品——其形态与支撑整个经济运转的工作截然不同。

And a lot of the work that we do, that we're used to, software engineering, product management, building products, is very differently shaped than the work that goes on that runs our entire economy.

Speaker 0

我在与客户交流时经常看到这一点。

And I see this in and out when I talk to customers.

Speaker 0

如果你跟任何一家非科技公司交谈,就会发现他们有大量的业务流程。

If you talk to any company that's not based in, it's not a tech company, there's a lot of business processes.

Speaker 0

我所说的这意味着,一般来说,我会将其区分为:软件工程本质上是一种开放式的知识工作,对吧?

And so what I mean by this is, you know, I generally delineate it as, you know, there's like like software engineering is kind of like open ended knowledge work, right?

Speaker 0

这就是为什么我认为像Codex这样的工具表现得相当出色,因为它是在探索,而你给它的是这些开放性的问题。

It's like and this is why I think tools like Codex tend to be quite quite good because it's exploring and you're giving it these like open ended things.

Speaker 0

但软件工程本质上是非常开放的,并且不太可重复,对吧?

But software engineering is fundamentally like pretty open ended and it's not very repeatable, right?

Speaker 0

所以你开发一个功能时,并不是试图一遍又一遍地构建完全相同的功能。

So like you build a feature, you're not trying to build the exact same feature over and over again.

Speaker 0

很多科技类工作都属于这一类。

And a lot of like tech jobs are in the space.

Speaker 0

我认为数据科学也属于这一范畴。

I think like data science is kind of in this space as well.

Speaker 0

甚至一些战略财务工作也是如此。

Even some of the like strategic finance stuff.

Speaker 0

但当你越来越远离软件工程以及科技的核心领域时,许多工作其实就是业务流程。

But as you move further and further away from software engineering and like what is core and tech, a lot of jobs are just business processes.

Speaker 0

这些是可重复的事情、可重复的操作,由公司里的某个经理不断优化过。

They're like repeatable things, repeatable operations that some manager at a company has kind of like iterated on.

Speaker 0

通常都有标准操作流程,人们希望遵循它,不太愿意偏离。

There's usually a standard operating procedure that people wanna do and you don't wanna deviate from it that much.

Speaker 0

在软件工程中,创造力体现在不偏离,但世界上很多工作实际上只是在执行这些流程和操作。

There's like in software engineering, the ingenuity isn't deviating, but a lot of the work being done in the world is actually just running through these procedures and operations.

Speaker 0

比如,如果我拨打客服热线,他们就是在执行其中某一个流程。

Like if I call a support line, they're running through one of these.

Speaker 0

如果我联系公用事业公司,他们对我能做和不能做的事都有明确的流程。

If I call my utility company, there's a bunch of processes and things that they can and cannot do for me.

Speaker 0

因此,我对这一大类工作极度看好,我认为它被低估了,因为它和硅谷的思维方式截然不同。

And so I'm just extremely bullish on this general category of like, and I think it's underrated because it's so different from what we think about in Silicon Valley.

Speaker 0

人们往往忽视这一点,但如何将AI以及我们现有的工具和框架应用于这种业务流程自动化,自动化并简化那些具有高度确定性、并与企业数据、业务决策和不同系统深度集成的可重复业务流程呢?

People tend to not think about it, but how can we apply AI and some of the tools and frameworks that we have towards this business process automation, towards automating and making easier repeatable business processes with high determinism that is fully integrated with business data and business decisions and different systems within an enterprise.

Speaker 0

我们该如何真正改善这一过程?

And how can we actually make that that process better?

Speaker 0

因为我真的认为这个领域有很多机会和大量工作要做。

Because I actually think there's a lot of opportunity and a lot of work to be done in that area.

Speaker 0

我们只是不谈论它,因为它稍微超出了我们的专长范围。

And we just we just don't talk about it because it's it's a little bit less in our wheelhouse.

Speaker 1

为了确保我完全理解,你的观点是:你认为在工程之外,AI 对企业生产力以及从事这些重复性、易自动化任务的员工的工作影响会更大吗?

Your take here, just to make sure I fully understand it, is you think there's a much bigger opportunity outside of engineering for AI to impact productivity of companies and also jobs of these folks that are doing these kind of repetitive, easily automated tasks?

Speaker 0

影响工作,同时也影响工作的方式。

Impact jobs and also just impact how work is done.

Speaker 0

很多工作都是以这种方式完成的。

So much of work is done in this way.

Speaker 0

你想想,经常与客户交流,大型企业,AI 将如何改变我的公司?

You think about what a Basically, talk to customers all the time, big enterprises, how will AI transfer my company?

Speaker 0

在拥有 AI 的世界里,二十年后它会如何运行?

How will it run-in a world with AI in like twenty years?

Speaker 0

软件工程只是故事的一部分,但在业务流程方面还有更多内容。

Software engineering is part of the story, but there's so much more on the business process side.

Speaker 0

而且我认为,在业务流程方面,情况可能会显得更加不同,那里的工作量也相当大。

And I actually think it might look even more different on the business process side and the work there is pretty substantial.

Speaker 0

这其实挺有意思的。

Actually interesting.

Speaker 0

我不知道,从绝对比例或绝对基数来看,我不确定它是否比软件工程更大或更小。

I don't know, like, from an absolute percentage or absolute base, I don't know if it's bigger or smaller than software engineering.

Speaker 0

软件工程本身也非常庞大和广泛,但业务流程的规模确实非常巨大。

Like software engineering is pretty huge and pretty extensive as well, but it is pretty massive.

Speaker 0

而且它肯定比人们在X平台或推特上谈论或不谈论时所认为的要大得多。

And it's definitely bigger than, you know it's it's bigger than you would think it is based off of how how people talk about it or don't talk about it on X or Twitter.

Speaker 1

好的。

Okay.

Speaker 1

换个稍微不同的方向,既然你构建了一个平台、开发了API,让别人基于API进行开发,人们最关心的问题总是:我该如何避免被OpenAI碾压,让他们复制我的创意,然后摧毁我创造的市场?

Going in a slightly different direction, having built a platform, building the API, people building on API, the biggest question on people's minds is always just, how do I not have OpenAI squash my idea and build their own thing and then, you know, destroy this market I created?

Speaker 1

总体的政策是什么?

What's the general policy?

Speaker 1

初创公司应该如何思考OpenAI不太可能涉足的领域,其总体理念是什么?

What's the general philosophy of how startups should think about where OpenAI is unlikely to go?

Speaker 0

我的总体回答是,这个市场太大了,太庞大了。

My general answer here is is the market is so big and so massive.

Speaker 0

实际上,我认为初创公司根本不需要过度担心OpenAI或这些实验室会往哪个方向走。

Like, I actually think you know, startups should just not overly think about where OpenAI or these labs are going.

Speaker 0

我接触过很多初创公司,有的失败了,有的做得非常好。

I've talked to a lot of startups, you know, that have, you know, not worked out, startups that are doing really well.

Speaker 0

我见过的每一个最终失败的初创公司,都不是因为OpenAI、BigLab、谷歌之类的大公司来抢走它们的市场。

Every startup that I've seen that is kind of fizzled out is not because OpenAI or, you know, BigLab or Google or something has has come to swatch them.

Speaker 0

而是因为它们打造的产品根本没能引起客户的共鸣。

It's because they built something and it like really didn't resonate with with the customers.

Speaker 0

而那些成功的公司,即使在像编程这样竞争激烈的领域,比如Cursor,现在已经非常庞大了。

Whereas the ones that take off, like even in very competitive spaces like coding, like Cursor is huge at this point.

Speaker 0

这是因为它们打造了人们真正喜爱的产品。

And it's because they build something that people really love.

Speaker 0

所以我的总体建议是,别为此太过焦虑。

And so my general advice is like, don't overly stress about this.

Speaker 0

只要打造人们喜欢的东西,你就能在这个领域占有一席之地。

Just build something that people like and you will have a space in this.

Speaker 0

我再怎么强调现在的机会有多大都不为过。

I can't overstate how big of an opportunity there is right now.

Speaker 0

就像利用AI进行创业的机会空间实在太大了。

Like the opportunity space and building with AI is so big.

Speaker 0

一个很好的例子是,这个领域如此广阔,以至于风投认为什么是可接受、什么是不可接受的舆论边界已经彻底改变了。

A good example of this is like the space is so big that the Overton window of what is acceptable and not acceptable for VCs to do has completely changed here.

Speaker 0

风投们正在不断投资各种竞争性公司。

VCs are like investing in like competitive companies left and right.

Speaker 0

这整个领域之所以如此广阔,是因为机遇前所未有。

It's just like the space is so big because the opportunity is unlike anything that we've seen before.

Speaker 0

尽管这影响了风投的运作方式,但从创业者的角度来看,这是世界上最令人振奋的事,因为即使你只是打造了某些人真正、真正喜爱的产品,你也终将拥有一个价值巨大的企业。

And while that affects how VCs operate, from a startup perspective, it's the most empowering thing in the world because even if you just build something that some people really, really love, you will end up with a massively valuable business.

Speaker 0

所以这就是为什么我会告诉人们,别太在意你怎么想。

And so that's why I tell people, don't know what you think about it.

Speaker 0

另一件我认为很重要、至少从OpenAI的角度来看的是,我们始终非常重视的一点,即萨姆和格雷格也从高层不断强调:我们本质上把自己视为一个生态系统平台公司。

The other thing I also think is important to remember, at least from an OpenAI perspective, one thing that we've always held very near and dear, which both Sam and Greg help, you know, reinforce from the top as well, is we actually view ourselves fundamentally as a like ecosystem platform company.

Speaker 0

API 是我们的第一个产品。

The API was our first product.

Speaker 0

我们觉得,培育这个生态系统并持续支持它、而不是压制它,对我们来说至关重要。

We think it's really important for us to foster this ecosystem and continue to, you know, support it and and not squash it.

Speaker 0

所以如果你仔细看看我们所做的决策,这一切都是我们一贯坚持的。

And so if you kind of look at the decisions we make, it this is all we've we've through it.

Speaker 0

我们发布的每一个模型,都会通过API对外提供。

Every single model we've released in one of our products gets released in the API.

Speaker 0

比如,我们现在发布的这些Codex模型虽然更针对Codex框架做了优化,但它们最终都会进入API。

Like even, you know, we release these codex models now that are a little bit more optimized for the codex harness, but they always find their way into the API.

Speaker 0

而且,我们所有的客户最终都会把这些功能用到极致。

And like all of our, you know, customers end up abusing those.

Speaker 0

我们对此从不保留。

We don't hold back on any of that.

Speaker 0

我们认为保持平台中立非常重要。

We think it's really important to keep our platform neutral.

Speaker 0

所以,不要封锁竞争对手。

And so, don't block competitors.

Speaker 0

我们允许人们使用我们的模型。

We allow people to have access to our models.

Speaker 0

我们也希望像最近一样,更多地测试登录ChatGPT产品。

We also want, like we've recently been testing more of like the sign in with ChatGPT product as well.

Speaker 0

因此,我们希望培育这个生态系统。

And so we want to foster this ecosystem.

Speaker 0

我认为这样做非常重要。

I think it's really important that we do so.

Speaker 0

关于这一点的普遍看法是,潮水涨起,所有船只都会随之上升,而我们可能是一艘航空母舰。

The general thinking about this is a rising tide lifts all boats and we might be an aircraft carrier.

Speaker 0

我们目前规模已经相当大了,但我们认为提升整体水位很重要,因为每个人都会受益,我相信我们自己也会从中受益。

We're pretty big at this point, but we think it's important to raise the tide because everyone kind of benefits and I think we'll benefit as well.

Speaker 0

我们的API本身已经显著增长,这正是因为我们的这种做法。

Like our API itself has grown pretty significantly because we act in this way.

Speaker 0

因此,我真心鼓励大家不要把OpenAI看作一个会把别人挤出市场的存在,而应该专注于创造有价值的东西。

And so I'd really encourage people not to view OpenAI as this kind of you know, thing that'll just shove people out of the way, but instead focus on building something valuable.

Speaker 0

我们始终致力于打造一个开放的生态系统。

And we, you know, remain committed to providing an open ecosystem.

Speaker 1

为什么这对OpenAI如此重要?为什么你们如此专注于构建平台,为人们创造创业机会?

Why is that important to OpenAI, just this focus on building a platform, creating a way for people to build businesses?

Speaker 1

这从一开始就是你们的愿景吗?

Just like is that just that's been the vision from the beginning?

Speaker 1

我们希望这能成为一个平台。

We want this to be a platform.

Speaker 0

这从一开始就是我们的愿景。

It's been the vision from the beginning.

Speaker 0

这其实可以追溯到我们的章程,也就是我们的使命。

It comes goes back to our charter actually, like our mission.

Speaker 0

所以,OpenAI的使命一直以来都是第一点:开发通用人工智能。

So the the OpenAI's mission has always been to one, to build AGI.

Speaker 0

所以,我们显然正在做这件事。

So, you know, we're obviously doing that.

Speaker 0

但第二点是,要让全人类都能受益于它。

But then the second thing is to like spread the benefits of it to all of humanity.

Speaker 0

这其中的关键部分,就是‘全人类’这一点。

And there's kind of like a lot of, you know the main part there is all of humanity.

Speaker 0

比如,ChatGPT 就在努力实现这一点。

Like and obviously, ChadGPT is trying to do this.

Speaker 0

我们正试图触达尽可能多的人,整个世界。

We're trying to reach however many, the whole world.

Speaker 0

但早在早期,这就是我们为什么在2020年左右就推出API的原因。

But very early on, and this is why we launched the API back in, I think it was like 2020 or something, like really early.

Speaker 0

我们不认为作为一家公司,我们能够触及全人类,对吧?

We don't think we as a company will be able to reach all of humanity, right?

Speaker 0

世界上每一个角落都相当深远。

There's, I don't know, every corner of the world is pretty deep.

Speaker 0

因此,我们实际上认为,为了实现我们的使命,我们需要一种平台式的思维,赋能其他人去为播客主和通讯录作者构建客服机器人,因为我们自己无法做到这一点。

And so we actually feel like in order for us to fulfill our mission, we need to have some platform style think here where we can empower other people to build the customer support bot for podcasters and newsletter hosts because we're not gonna be able to do it ourselves.

Speaker 0

因此,我们已经很大程度上看到API在这方面发挥了作用。

And so we've largely seen this play out with the API.

Speaker 0

这就是为什么我们与这么多客户交流,并且非常乐于看到基于我们平台构建的多样化应用。

This is why we, know, we talk to so many of our customers and and and really, you know, love seeing the diversity of of things built on.

Speaker 0

但事实上,从第一天起我们就一直如此,因为我们将其视为一种授权的体现。

But, yeah, it it's been there since day one because it's it's it's kind of we view it as an expression of permission.

Speaker 1

你甚至还没提到你们正在推出的App Store,即ChatGPT App Store。

And you haven't even mentioned the the App Store that you guys are launching, ChatGipitsi App Store.

Speaker 1

是的。

Yeah.

Speaker 1

顺便问一下,这是在你们的管辖范围内,还是另一个俄勒冈团队?

Is is that under your umbrella, by the way, or is that a different Oregon team?

Speaker 0

这是一个不同的团队。

It's a it's a different team.

Speaker 0

所以它属于ChatGPT。

So it's under ChatGPT.

Speaker 0

我们显然与他们紧密合作,而且他们开发了一个应用SDK,是与我们的团队密切协作完成的。

We obviously collaborate very closely with them and, you know, they built like an apps SDK, which is built in close collaboration with our team.

Speaker 0

但那更多是在ChatGPT的范畴内。

But that is more within the ChatGPT umbrella.

Speaker 0

但这也是另一个类似的例子。

But that is also another like, that's another example of this.

Speaker 0

对吧?

Right?

Speaker 0

就像ChatGPT,我们有大约8亿周活跃用户,他们不断回来使用。

It's like ChatGeeBT is like we we we we we kind of like have these 800,000,000 weekly active users who are just coming over and over again.

Speaker 0

这作为一个企业来说是个巨大的优势,但话说回来,如果我们能允许其他公司也进来,利用这一点,为这个受众群体开发产品,岂不是更好?

Like, it's a great asset to have as a business, but like, man, would it be better if we could somehow allow other companies to come in and take advantage of this as well and build for this audience as well.

Speaker 0

然后,最终我们认为这也有助于我们扩大这个群体。

And and then ultimately, we think it'll help us expand that that that group as well.

Speaker 0

对吧?

Right?

Speaker 0

所以这一切其实都回归到我们的使命,我们发现作为开放平台会在这里起到积极作用。

And so it's all it all kinda comes back to the mission, and we find that being a platform being open tends to help here.

Speaker 1

就这个数字,8亿,我认为是月活跃用户吗?

Just that number, 800,000,000, I think it's m MAs?

Speaker 1

不是的。

Just like No.

Speaker 0

每周。

Weekly.

Speaker 0

每周。

Weekly

Speaker 1

每周活跃,是的。

act Weekly act yeah.

Speaker 0

这太疯狂了。

It's crazy.

Speaker 1

十亿人每周使用。

Billion people using weekly.

Speaker 1

简直难以置信,我们现在对这些数字都习以为常了,但这真的太惊人了。

Just like, it's absurd how many how these numbers we're just used to now, but that's in insane.

Speaker 1

史无前例。

Unprecedented.

Speaker 0

是的。

Yeah.

Speaker 0

从规模的角度来看,说实话,这让我觉得难以想象。

It's it's mind boggling for me to think about from a scale perspective, honestly.

Speaker 0

我是这么想的:占全球人口的10%,而且还在快速增长,这数字简直在飙升。

And the way I think about it is like 10% of the world and growing by the way, like it's just, it's it's shooting up.

Speaker 0

来使用ChatGPT,每周都用一次。

Come to chat GPT and and use it every day, or sorry, every week.

Speaker 1

在这一点上,我想再强调一下你提到的观点。

And this point, I just wanna double down on this point you're making.

Speaker 1

OpenAI的使命是让人工智能惠及全人类。

OpenAI's mission was to make AI available to all of humanity.

Speaker 1

有些人只是觉得,哦,这要花钱。

And I think some people just that, they're like, oh, know, costs money.

Speaker 1

但事实上,ChatGPT有一个免费版本,任何人都可以使用,这个版本与世界上最强的AI模型相差无几,而且免费、开放、人人可用。

And it's like, like the fact that it it's there's a free version of ChatGPT that anybody can use that is not so different from the most powerful AI model that exists in the world for free, that's not gated, that anyone can use.

Speaker 1

比如,即使你是亿万富翁,你能从AI中获得的收益,也未必比非洲一个村庄里的人多多少。

Like, if you have if you're a billionaire, there's only so much more you can get out of AI than what someone, you know, in a village in Africa can get.

Speaker 1

我知道这一直对OpenAI至关重要。

And I know that's always been really important to OpenAI.

Speaker 0

是的。

Yeah.

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,你看,这就是为什么我们认为我们专注于健康领域是正确的。

I mean, look, that's why I think we've leaned into the health work.

Speaker 0

我们还关注教育,这将是一个非常有趣的领域。

We've leaned into like education is gonna be a very interesting here.

Speaker 0

这里另一个疯狂的趋势是,免费模型随着时间的推移变得越来越智能。

The other insane kind of trend here is is the free model has gotten so smart over time.

Speaker 0

比如,2022年的免费版本虽然当时还不错,但跟今天的东西完全没法比,因为现在你能用到的是GPT-5。

Like the free model back in 2022 was, you know, like, well, it was good at the time, but it's like nothing compared to what you get today because you get GPT-five today.

Speaker 0

所以,提升全球整体水平,这正是我们真正努力在做的事情。

And so the like, you know, raising the floor across the world is kind of, you know, something that we're really trying to do.

Speaker 0

我们认为这是我们的使命的一部分。

We view it as part of our mission.

Speaker 0

顺便说一下,另一方面,就是关于那些亿万富翁之类的话题。

The other flip side of this, by the way, is like, you know, kind of talking about like the billionaires or whatever.

Speaker 0

我知道有人会说,你用的iPhone和史蒂夫——抱歉,是马克·扎克伯格那样的亿万富翁用的一样。

I know people are saying like, you're using the same iPhone that like, you know, Steve or sorry, like Mark Zuckerberg's probably using or like the billionaires are using.

Speaker 0

但只要你每月花20美元,你实际上就在使用和亿万富翁一样的AI。

But for like $20 a month, you're basically using, you know, like using the same AI that, you know, the billionaires are using.

Speaker 0

每月花200美元,你就能获得和所有亿万富翁一样的专业版,但他们可能并不会全程都用专业版。

For like $200 a month, you get the same pro model that, you know, all the billionaires are using, but they're probably not using pro for everything.

Speaker 0

他们大概只是用高级版来应付日常需求。

They're probably just using the plus tier ones for their day in and day out.

Speaker 0

因此,这种技术的民主化,以及让这种福利惠及全球,对我们来说意义重大,也是推动我们很多工作的动力。

And so, this kind of like democratization and just like spreading of this benefit, like across all of the world has seen us really meaningful to us and something that drives a lot of what we do.

Speaker 1

最后一个问题是,给那些正在考虑基于API开发,或者想着‘等等,我可以用OpenAI的模型和API做点酷炫东西’的人。

One last question, just for folks that are thinking about building on the API or just like, oh, wait, I could do cool stuff with OpenAI's models and APIs.

Speaker 1

你的API和平台能让人做些什么呢?

What what does your API and and platform allow people to do?

Speaker 1

我知道你可以在平台上构建智能代理。

Like, I know you can build agents on top of the platform.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客