The Twenty Minute VC (20VC): Venture Capital | Startup Funding | The Pitch - 20VC:Codex 对比 Claude Code 对比 Cursor:谁胜谁负?所有编程都会被自动化吗?我们还需要产品经理吗?AGI 的真正瓶颈是什么?代理的三个阶段及你需要了解的一切——对话 OpenAI Codex 负责人 Alex Embiricos 封面

20VC:Codex 对比 Claude Code 对比 Cursor:谁胜谁负?所有编程都会被自动化吗?我们还需要产品经理吗?AGI 的真正瓶颈是什么?代理的三个阶段及你需要了解的一切——对话 OpenAI Codex 负责人 Alex Embiricos

20VC: Codex vs Claude Code vs Cursor: Who Wins, Who Loses | Will All Coding Be Automated - Do We Need PMs | The Real Bottleneck to AGI | The Three Phases of Agents and What You Need to Know with Alex Embiricos, Head of Codex at OpenAI

本集简介

亚历山大·恩比里科斯是OpenAI的Codex负责人,领导公司旗舰AI编码系统的开发,这些系统赋能自动化软件生成、调试和开发者工作流。在他的带领下,Codex已成为应用最广泛的AI开发者平台之一。 议程: 05:13 编程会被自动化吗?为什么AI会创造更多工程师,而非更少 07:17 我们还需要产品经理吗?“未定义”的产品角色及其重要性 08:06 真正的AGI瓶颈:人类提示、验证与“过多努力” 13:04 代理的三个阶段:编程 → 计算机使用 → 产品化工作流 13:52 企业现实检验:安全、权限与安全的代理浏览 17:57 推理是否已成为新的销售与营销? 18:49 Codex中有多少比例是由AI编写的? 21:33 OpenAI是否使用AI进行代码审查? 23:31 AI编程工具是否有粘性? 28:22 在OpenAI,“胜利”意味着什么?使命、竞争与护城河 32:04 未来界面:聊天还是语音 34:10 代理间工作流:设计审批、合规与自动化 35:39 编程模型是否有数据护城河? 36:50 Codex如何看待数据:他们会自建Mercor和Turing吗? 37:27 Codex如何看待消费者:他们会与Lovable竞争吗? 41:56 基准测试 vs “感觉”:人们如何真正评估模型 42:43 Cursor的优势及自建模型的理由 47:37 SaaS已死吗?什么仍在捍卫价值(人类 + 记录系统) 51:28 人才战争与AI时代新工程师的职业建议 01:01:03 安全护栏、全AI管理的堆栈,以及面向每个人的十年愿景

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

欢迎来到20 Product,我是哈里·斯蒂平斯。

Welcome to 20 product with me, Harry Steppings.

Speaker 0

20 Product是一个月度节目,我们会与顶尖的产品领导者对话,揭示他们打造最佳产品和产品团队的技巧、策略和方法。

Now 20 product is the monthly show where we sit down with the best product leaders to reveal their tips, tactics, and strategies to scaling the best products and product teams.

Speaker 0

真正的问题是,谁会赢?

Now the real question is who's gonna win?

Speaker 0

是Codex吗?

Is it Codex?

Speaker 0

是ClawCode吗?

Is it ClawCode?

Speaker 0

还是Cursor?

Or is it Cursor?

Speaker 0

别走开。

Well, stay.

Speaker 0

今天我们邀请到的嘉宾是亚历山大·因比里科斯,OpenAI的Codex产品负责人。

Joining us in the hot seat, we have Alexander Imbirikos, product lead for Codex at OpenAI.

Speaker 0

这是一场令人惊叹的讨论。

This is an incredible discussion.

Speaker 0

该拿出笔记本了。

Time to get the notebook out.

Speaker 0

我想听听你的反馈。

I want your feedback.

Speaker 0

告诉我你的想法。

Let me know what you think.

Speaker 0

邮箱:harry@twentybc.com。

Harry at twenty b c dot com.

Speaker 0

但在我们深入今天的节目之前,Atlassian的早期故事很可能和你自己的经历非常相似。

But before we dive into the show today, the early story of Atlassian is probably very similar to your own.

Speaker 0

Atlassian深知初创公司每天面临的挑战,以及正确的工具对于从MVP走向IPO至关重要。

Atlassian knows firsthand the challenges that start ups face every day and that the right tools are essential to go from MVP to IPO.

Speaker 0

因此,Atlassian为初创公司提供免费名额,符合条件的公司可免费获得Jira、Confluence、Loom、Jira产品发现、Compass和Bitbucket等产品的高级版最多50个席位,让你的团队能够使用业界顶尖的工具来规划、跟踪和协作完成工作。

That's why Atlassian for start ups gives eligible companies up to 50 seats free on the premium edition for products like Jira, Confluence, Loom, Jira Product Discovery, Compass, and Bitbucket so your team can use the best in class tools to plan, track, and collaborate on work.

Speaker 0

无论你的工作是什么,如今许多最成功的初创公司,如Cloudflare、Canva和Rivian,都依赖Atlassian实现了增长,而Atlassian希望为下一代创业者和投资者提供同样的机会。

Whatever that work may be, many of today's most successful startups like Cloudflare, Canva, and Rivian relied on Atlassian for their growth trajectory, and Atlassian wants to give that same opportunity to the next generation of builders and investors.

Speaker 0

我们深知在早期阶段专注于打造正确事物的重要性。

We know how important it is to focus on building the right things early.

Speaker 0

无论你正处于便利贴阶段,还是已经踏上征程,任何阶段的团队都能更聪明地协同工作。

Whether you're in the sticky note stage or well on your journey, teams at any stage can work smarter together.

Speaker 0

尽早开始使用Atlassian永远不会太早。

It's never too early to start with Atlassian.

Speaker 0

前往atlassian.com/startups/harry了解更多信息和资格条件。

Head on over to atlassian.com/startups/harry for more details and eligibility.

Speaker 0

在Atlassian帮助你的团队构建和交付优秀产品之后,Intercom则帮助你用这些产品支持客户。

After Atlassian helps your team build and ship great products, Intercom helps you support customers using them.

Speaker 0

如果你正在寻找一种方式来革新你的客户服务,让我向你介绍Finn,宝贝。

If you're looking for a way to transform your customer service, let me introduce you to Finn, baby.

Speaker 0

Finn是客户服务领域排名第一的AI代理,能够自动解决高达93%的客户查询。

Finn is the number one AI agent for customer service, resolving up to 93% of customer queries automatically.

Speaker 0

没有其他代理能做到这一点。

There is no other agent that can do that.

Speaker 0

不是93%的客户查询。

Not 93% of customer queries.

Speaker 0

明白吗?

Okay?

Speaker 0

没有其他代理能做到这一点。

No other agent can do that.

Speaker 0

那为什么要选择Finn呢?

So why choose Finn?

Speaker 0

Finn是客户服务领域表现最出色的AI代理。

Finn is the best performing AI agent for CS.

Speaker 0

Finn不仅仅是回答问题。

Finn doesn't just answer questions.

Speaker 0

它还会采取行动。

It takes actions.

Speaker 0

它能自动处理最复杂的客户查询,例如退款、交易争议和技术故障排除,速度快且可靠。

It automates the most complex customer queries, like refunds, transaction disputes, technical troubleshooting with speed and reliability.

Speaker 0

真希望我的团队也能又快又可靠。

I wish my team was speedy and reliable.

Speaker 0

在每一场直接对比中都胜过所有竞争对手。

Beats every competitor in every head to head bake off.

Speaker 0

完全可配置,无需编写代码即可设置。

Completely configurable and code optional setup.

Speaker 0

天哪,我的意思是,好处真是源源不断。

My word, I mean, the benefits just go on and on.

Speaker 0

实施起来简单高效。

It's easy and efficient implementation.

Speaker 0

它能在任何客服系统上运行,无需繁琐的迁移。

It works on any help desk with no tedious migration needs.

Speaker 0

它已获得超过6000位客户服务负责人信赖,包括Anthropic、Lovable、Synthesia、Clay、Vanta等顶尖AI公司。

It's trusted by over 6,000 customer service leaders, including top AI companies like Anthropic, Lovable, Synthesia, Clay, Vanta.

Speaker 0

所以,如果你准备好改造你的客户服务团队,扩展你的支持能力,并让团队成员有时间专注于更高层次的战略工作,了解更多关于Finn的信息,请访问fin.ai/20vc。

So if you're ready to transform your customer service team, scale your support, and give team members time to focus on the really high level strategic work, learn more about Finn at fin.ai/20vc.

Speaker 0

当Finn在不降低速度的情况下扩展你的支持服务时,Reford将教你如何将这种扩展转化为持久的产品驱动增长。

While Finn scales your support without losing speed, Reford shows you how to translate that scale into durable product led growth.

Speaker 0

每个人都在比以往更快地发布产品。

Everyone's shipping faster than ever.

Speaker 0

Cursor、ClawCode、Codecs。

Cursor, ClawCode, Codecs.

Speaker 0

人工智能正在让编码和编写代码比以往任何时候都更快。

AI is making code and writing code faster than ever.

Speaker 0

但这里有个问题。

But here's the problem.

Speaker 0

如果没人使用你发布的产品,速度就毫无意义。

Speed means nothing if nobody uses what you ship.

Speaker 0

这就是Reforge的用武之地。

That's where Reforge comes in.

Speaker 0

Reforge 正在构建一个位于你的编码代理上游的产品发现引擎,它不是另一个原型工具、研究库或 AI 面试官,而是一个能够摄入你的客户数据、生成多种产品解决方案、在代码编写前验证这些方案,并将获胜方向交付给你的团队的产品。

Reforge is building the product discovery engine that sits upstream of your coding agents, Not another prototyping tool, research repo, or AI interviewer, but a product that will ingest your customer data, generate variations of product solutions, validate the solutions before code is written, and hand off winning directions to your team.

Speaker 0

Reforge 在产品债务产生之前就将其消除,因为每一个你发布但无人使用的功能,都不只是浪费了工程时间。

Reforge kills product debt before it starts because every unused feature you ship isn't just wasted engineering time.

Speaker 0

它还带来了维护负担、复杂性税,以及你无法缩减的攻击面。

It's a maintenance burden, complexity tax, and surface area that you cannot shrink.

Speaker 0

Reforge 已被 Toast、Vimeo、Klaviyo 等公司的产品团队使用,帮助团队发布更多实际被使用的功能。

Used by product teams at companies like Toast, Vimeo, Klaviyo, and many more, Reforge helps teams ship more features than actually get used.

Speaker 0

前往 reforge.com/build 试用 Reforge,并使用代码 two zero VC。

Try Reforge at reforge.com/build and use the code two zero VC.

Speaker 0

输入 20 VC,即可免费使用一个月专业版。

That's 20 VC for one month free of pro.

Speaker 1

您已到达目的地。

You have now arrived at your destination.

Speaker 1

亚历克斯,我太期待这次了,兄弟。

Alex, I'm so excited for this, dude.

Speaker 1

我跟你说过,我刚参加完一个私募股权会议,满脑子想的都是,幸好有亚历克斯在旁边,这会是一场超棒的对话。

I told you, I've been at a PE conference, and all I could think was, thank god I've got Alex next because this is gonna be a great one.

Speaker 1

非常感谢你来参加,老兄。

So thank you so much for joining me, man.

Speaker 2

非常高兴能来到这里。

So excited to be here.

Speaker 2

谢谢。

Thank you.

Speaker 1

这个开场有点奇怪,但请跟着我的节奏来。

Now this is a weird first start, but roll with it.

Speaker 1

你会理解我的英式表达细节的。

You'll you'll understand my British intricacies.

Speaker 1

我对人们的动机非常着迷。

I'm fascinated by people's motivations.

Speaker 1

你是更害怕失败,还是更享受胜利带来的刺激和兴奋?

Are you motivated more by the fear of losing or, like, the thrill and excitement of winning?

Speaker 2

我是个极致主义者。

I I'm a maximalist.

Speaker 2

我确实更受获胜的念头驱动,而不是害怕失败。

I'm definitely much more motivated by the idea of winning than the fear of losing.

Speaker 2

但我得向你坦白一件事。

But I'll admit to you something.

Speaker 2

在我加入OpenAI之前经营初创公司时,我最黑暗的时刻之一——而我在经营公司期间经历过很多黑暗时刻——就是意识到自己过去几个月一直在努力避免失败。

When I was running a startup before joining OpenAI, and one of my darkest moments, and there were many dark moments while I was running the startup, was recognizing that I'd spent the past few months trying to avoid losing.

Speaker 2

突然间,我意识到:天啊,这就是我如此不快乐的原因。

All of a sudden, was like, oh my god, that is why I'm so unhappy.

Speaker 2

这可能也是公司进展不顺的原因。

And that's probably why the startup isn't going well.

Speaker 2

你知道,我时不时得提醒自己,重新转向获胜的信念。

You know, I basically every now and then I have to re catch myself and like flip back into this idea of winning.

Speaker 2

但真正更激励我的,其实是我觉得我就是热爱创造事物,为人们创造东西。

But really what motivates me even more than that is I think I just love building things and building things for people.

Speaker 2

而且,天啊,我对今年充满期待,因为许多目前还不存在的惊人事物即将被创造出来并交付给很多人。

And, man, I am so excited for this year because many amazing things that don't exist yet are gonna be built and given to a lot of people.

Speaker 1

我马上就开始了。

I'm diving right in.

Speaker 1

马斯克说,编程是最早将被大规模自动化的行业之一。

Elon said that coding is one of the first professions to be largely automated.

Speaker 1

考虑到你的职位和你每天所见,你同意这个观点吗?

Do you agree, given your position and what you see day to day?

Speaker 2

当然,我同意编程是大型语言模型表现非常出色的领域之一。

For sure, would agree that coding is one of the first domains where LLMs are really good.

Speaker 2

但你知道,编程被自动化到底意味着什么?

But, you know, what does it mean for coding to be automated?

Speaker 2

这话说得有点重了。

It's like kind of a heavy statement.

Speaker 2

对吧?

Right?

Speaker 2

例如,现在我们不再写汇编语言了,当这种转变发生时,我们转向了高级语言,那时候我们说编程被自动化了吗?

For example, now that we no longer write assembly, like when that change happened, and we moved to higher level languages, did we say coding is automated?

Speaker 2

其实没有,对吧?

Not really, right?

Speaker 2

我们只是能够写出更多的代码。

We were just able to write much more code.

Speaker 2

结果,实际上对代码的需求大大增加,需要更多的软件工程师。

And then as a result, actually, there was much more demand for code, there were many more software engineers required.

Speaker 2

但没错,他们过去做的一部分工作确实被自动化了,就像你知道‘计算机’这个词的起源吗?

But yeah, part of what they used to do is automated in the same way that like, do you know the origin of the word computer?

Speaker 2

不知道。

No.

Speaker 2

我可能发音不太准,但我记得是在布莱切利园,那里有大量用于解码德国恩尼格玛密码的机器。

I might pronounce the location wrong, but I think it was at Bletchley Park, there were all these machines for like decoding German Enigma.

Speaker 2

当时有大量人类负责打孔卡片,把它们放进机器里,进行大量的表格计算。

And like, there were humans who would like punch out punch cards and like put them into the machine and do a bunch of like tabulated math.

Speaker 2

我可能把这说得一团糟。

I'm probably butchering this.

Speaker 2

但基本上,那曾经是一项极其手动的工作。

But basically, was an intensely manual part of work.

Speaker 2

甚至最早的电子表格软件,也大致基于这样一个想法:办公室里摆满成排的桌子,人们进行数据计算,然后把表格传给下一个人。

And even like the first spreadsheet software was kind of loosely based off this idea that you would have an office full of desks arranged in a grid and people doing tabulations and then passing their sheets to the next person.

Speaker 2

所以,所有这些特定的任务都已经实现了自动化。

And so all these things, like those specific tasks have become automated.

Speaker 2

但每次发生这种情况,对产出的需求都会激增。

But every time that's happened, there's been an explosion in demand for the output.

Speaker 2

因此,即使具体任务发生了变化,仍然需要更多的人来完成这类工作。

And so you need many more people actually to do that kind of work, even if the specific task has changed.

Speaker 1

所以你认为五年后工程师会更多,而不是更少?

So you think we will have more engineers in five years, not less?

Speaker 2

是的。

Yeah.

Speaker 2

而且你知道吗,有时候我们会改变术语的含义,对吧?

And I, you know, sometimes we change what terms mean, right?

Speaker 2

比如现在‘计算机’这个词指的是别的东西,但我们现在有了‘软件工程师’这个术语。

Like the term computer now refers to something else, but now we have the term software engineer.

Speaker 2

所以我确信我们会拥有更多的创造者。

And so I definitely think we'll have many more builders.

Speaker 2

而且,我现在观察到一个有趣的现象,那就是人才结构正在压缩。

And, you know, something interesting that I'm observing now is like there's this compression of the talent stack.

Speaker 2

你知道,今天你仍然需要软件工程师,仍然需要设计师。

You know, you still need software engineers today, you still need designers.

Speaker 2

我是产品经理,你需要产品经理吗?

I'm a PM, do you need PMs?

Speaker 2

你可以拿这个开些有趣的玩笑。

You know, you can have a fun fun some fun jokes about that.

Speaker 2

我认为你不需要他们。

I don't think you need them.

Speaker 2

但当你提到工程师时,也许你想到的是比以往更全栈型的人。

But maybe when you say engineer, you might be thinking of someone who's like much more full stack than has been true before.

Speaker 2

比如,即使回溯几年,那时后端工程师和前端工程师的分工要明确得多。

Like, even if you go back a few years, you had many more places where there was like the back end engineer and the front end engineer.

Speaker 2

而如今,至少以我了解的Codex团队来说,这种情况已经少了很多,工作更趋向于全栈化。

Whereas like now, at least if I think about the Codex team, like, that's much less the case and things are much more full stack.

Speaker 2

对吧?

Right?

Speaker 2

所以我认为这种人才结构会进一步压缩,但仍然会有人从事构建工作。

And so I think this comp this talent stack will compress, but we'll still have people building.

Speaker 1

你为什么觉得在这个世界里不需要产品经理?

Why do you think we don't need PMs in this world?

Speaker 1

你刚才吊足了我的胃口。

You you dangled the carrot.

Speaker 2

是的。

Yeah.

Speaker 2

是的。

Yeah.

Speaker 2

这是我的一个有趣玩笑。

It's it's my fun joke.

Speaker 2

我认为,首先,要定义什么是产品经理是非常困难的。

I think well, first of all, I think it's incredibly hard to define what a PM is, what a product manager is.

Speaker 2

我倾向于认为这个角色本质上是明确未定义的,你的目标只是适应团队或业务的需求。

I kind of think of the role as like, actually explicitly undefined, and your goal is just to adapt to whatever the team or business needs.

Speaker 2

通常,如果你有一群人试图尽快推进开发,那么产品经理可以做的就是退后几步,观察全局,预测下一步方向,与市场团队协作,并成为团队最热情的倡导者和质量把关者。

Often, if you have a bunch of people like trying to build as quickly as possible, then what a what a product manager can do is spend time like taking a few steps back and trying to look around corners and figure out what to do, you know, collaborate with the folks in go to market, and maybe be the the team's like greatest cheerleader and quality razor.

Speaker 2

但所有我刚才描述的这些职责——可能也是我当前的角色——完全可以由一位能力很强的工程负责人或非常关注产品的设计师来完成。

But like all of those things I just described, which are maybe my current role could be done by a really strong eng lead or a designer who thinks a lot about product.

Speaker 2

所以我认为,产品经理通常是有用的,但在团队规模真正变大之前,你可能并不需要太多产品经理。

And so I think it's, like, often useful to have product managers, but you probably don't want many of them until the team is really large.

Speaker 1

过去几天我一直在深入研究你的内容,包括你的文章、推文和之前的访谈,你提到人类打字速度和验证工作是通向通用人工智能的关键瓶颈,而不是模型、算力或架构。

I was stalking the shit out of you for the last few days, which was a very fun expedition into your writing, into your tweets, into your prior interviews, and you said that human typing speed and validation work is the key bottleneck to AGI, not model, compute, or architecture.

Speaker 0

然后它就留在那里了,

And it kind of left there,

Speaker 1

我当时就想,帮我理解一下,为什么人类的打字速度和验证工作是AGI的关键瓶颈,你这么说到底是什么意思?

and I was like, help me understand why human typing speed and validation work is the key bottleneck and what you really meant by that.

Speaker 2

当然。

For sure.

Speaker 2

好的。

Okay.

Speaker 2

这真是个有趣的问题。

That's a that's a fun one.

Speaker 2

我认为存在多个瓶颈,但这个可能是最吸引眼球的一个。

I think there are multiple bottlenecks, but that's maybe the most sort of click baity one.

Speaker 2

如果你不介意的话,我会稍微用苏格拉底式的方式问你:你今天会用多少次AI?

So if you don't mind, I will do this slightly Socratically, like how many times would you say use AI today?

Speaker 1

三十多次。

30 plus times a day.

Speaker 2

好的,不错。

Okay, cool.

Speaker 2

假设你不需要付出任何精力,你觉得AI一天能帮你多少次?

How many times do you think assuming it was like zero energy expenditure from you, How many times do you think AI could help you per day?

Speaker 1

我的意思是,在所有事情上,我认为AI的推理会全天候在每件事上运行。

I mean, in everything, I think we'll have inference running twenty four hours a day across every single thing.

Speaker 2

没错。

Exactly.

Speaker 2

我现在听到一些工程师,比如在OpenAI内部和外部的人告诉我,我一直在运行Codex,从不关笔记本电脑。

And like, I hear things now from engineers, like at OpenAI and also outside who were telling me like, you know, I constantly have Codex running, I never close my laptop.

Speaker 2

如果开会时它没有运行,我就觉得是在浪费时间,我必须确保Codex始终有任务在为我处理。

And if it's not running while I'm in a meeting, I'm like wasting my time, I need to make sure Codex always has work for me that it's doing.

Speaker 2

这非常酷,也非常令人兴奋。

And that's like super cool and super exciting.

Speaker 2

但管理这些代理并确保它们始终在工作,确实需要很多精力,对吧?

But that's a lot of work, right, to like manage these agents and make sure they're always working.

Speaker 2

再回到每天30次这个话题。

And going back to the 30 times per day thing.

Speaker 2

是的,当我们观察Codex用户使用Codex的频率时,大概在每天十几二十次的范围。

Yeah, like when we look at how often Codex users are using Codex, it's like kind of this like 10s of times kind of range.

Speaker 2

我认为AI应该每天帮助我们成千上万次,当然前提是计算资源允许,随着时间推移我们终将实现这一点。

And I think AI should be helping us 10s of 1000s of times per day, you know, compute budget permitting will and we'll get there over time.

Speaker 2

但问题是,至少就我自身而言,我一直在研究这些东西,我知道我应该用AI来处理所有事情。

But the problem is, at least if I think of myself, I work on this stuff, I know I should be using AI for everything.

Speaker 2

但我太懒了,懒得输入那么多提示词。

But I'm too lazy to type out that many prompts.

Speaker 2

我也缺乏创意,想不出AI能帮我做的所有事情。

And I am too uncreative to figure out all the ways that AI can help me.

Speaker 2

所以我最终用AI的次数和你差不多。

And so I end up kind of at a similar number as you.

Speaker 2

我仍然处于这样的阶段:当我用AI做些酷炫的事情,比如为这次和你的对话做准备时,我会有点为自己感到骄傲。

I still am at the point where when I use AI to do something cool, like prep for this conversation with you, I'm like kind of proud of myself.

Speaker 2

我觉得,哇,我竟然用AI做了件新事情。

I'm like, oh, cool, I managed to use AI in this new way.

Speaker 2

这对像你我这样对这个话题特别感兴趣的人来说没问题,对吧?

That's fine for people like you and me who are like, really interested in this topic, right?

Speaker 2

但我不认为,为了让大多数人从AGI中受益,他们需要花这么多精力去琢磨怎么使用这个工具,它应该对他们来说毫不费力。

But I don't think most people we should expect in order to benefit from AGI should need to like, put so much effort into how to use this tool, it should just be effortless for them.

Speaker 2

我认为我们想要达到的世界是,使用AI时,你根本不需要去琢磨该怎么写提示词。

I think the world we want to get to is one where to use AI, you don't really need to, like, figure out the right way to prompt.

Speaker 2

对你来说,它就是超级简单。

It's just super easy for you.

Speaker 2

你甚至都不需要意识到AI能帮上你。

And you don't even need to recognize that AI could help you.

Speaker 2

它只是自然而然地了解你的上下文,然后主动提供帮助。

It's just like knows you connected to your context and chimes in helpfully.

Speaker 1

我认为Claude在产品包装上做得很好,比如Claude用于法律、Claude用于Excel,你可以直接用它来构建DCF模型。

That's what I think like Claude has done well in terms of the packaging they've done, like Claude for legal, Claude for Excel where you can implement it and have a DCF model.

Speaker 1

我对模型本身不感兴趣,但比以前能做得更好。

I'm not into models, but like better than one could do before.

Speaker 1

那么,你认为你的工作是将提示和人类行为产品化,以消除这个瓶颈吗?

Do you think it is your job then to productize the prompts and the human actions to remove that bottleneck?

Speaker 2

是的。

Yeah.

Speaker 2

完全正确。

Totally.

Speaker 2

所以我认为我们的职责是确保拥有具备强大能力的模型,最终实现一个高度产品化的世界。

So I think that it is our job to make sure that we have the models with the with amazing capabilities, and then eventually to get to a world where this is, like, highly productized.

Speaker 2

你只需要一个神奇的文本框或语音输入,或者直接把AI加入你的群聊,它就会自动提供帮助。

And so you just have this like magic text box or audio input, or whatever, or you can just add AI to your like group chat, and it just starts to help.

Speaker 2

但我认为中间阶段非常有意思。

But I think there's quite an interesting in between stage.

Speaker 2

我认为目前最大的价值恰恰就存在于这个中间阶段。

And I think that that is actually where the most value lies right now.

Speaker 2

这就是我的意思。

So here's what I mean.

Speaker 2

你可以尝试为特定市场产品化AI的某个特定功能。

You could try to productize like a specific feature of AI for a specific market.

Speaker 2

我知道很多公司都在这么做,但我认为很难确定什么会真正有效,什么样的产品形态才是正确的。

And I you know that many companies are doing this, but I think it's a little bit hard to know what exactly will work, what is the right form factor.

Speaker 2

之前有人在你的播客中提到,我认为他说的一件事很有意思:在企业中采用AI离不开全职员工。

And someone was on your podcast earlier, and I they said something that I thought was quite interesting about how you cannot adopt AI at enterprise without FTEs.

Speaker 1

是的。

Yeah.

Speaker 1

那是来自Invisible AI的马特·菲茨帕特里克。

It was Matt Fitzpatrick from, Invisible AI.

Speaker 2

是的。

Yeah.

Speaker 2

尽管我确实在招聘全职员工,如果你是全职员工,欢迎申请我的职位,但我完全不同意这个观点。

So so even though I am literally hiring FTEs, and if you're an FTE, please apply for a job with me, I actually disagree with that entirely.

Speaker 2

所以我认为我们需要为人们构建工具,就像帕特里克在播客中说的那样,用全职员工来自动化工作流程,对吧?

So what I think we need to do is build tools for people like you can use FTEs, as as Patrick said on the podcast, to automate workflows, right?

Speaker 2

但随后你会受限于从上至下的视角能做什么,以及你通过全职员工配置能构建什么,对吧?

But then you're limited by like what you from your top down perspective can do and what you from your FDE staffing can can staff to be built, right?

Speaker 2

但对我来说,AI最令人兴奋的未来是每个人都能感觉自己像超人一样,被AI赋能。

But for me, the most exciting future with AI is one where everyone just feels like a superhuman, just like empowered by AI.

Speaker 2

为此,我们需要为个人用户设计的工具,让每个人都能够熟练使用。

And for that, we need tools that are for people, for individual users, and that everyone feels fluent with.

Speaker 2

我认为我们现在所处的最有趣的阶段,是为那些对探索如何使用AI感兴趣的人构建工具。

I think the the phase that's most interesting that we're at now is building for the kind of people who are interested in figuring out how to use AI.

Speaker 2

所以我们需要推出的产品,我认为Cloud Code最初发布时就做得很棒,他们真正做对的是,这个工具非常易于使用,无论你在什么环境下,只需在终端里就能操作,人们开始尝试它在各种场景中的应用。

So what we need to ship, and I think this was like the genius of like when Cloud Code first shipped, what they really got right was they had this tool that was super easy to use in whatever context you want just in your terminal, and people started experimenting with where to use it.

Speaker 2

因此,当我们思考AI在编码工作之外的应用时,我们能做的最重要的一件事,不是过度限定它——比如,这只适用于金融领域,或只适用于某个特定工作流程,而是要构建一个更开放、任何人都能创造性地用于任何任务的工具。

And so I think as we think about AI being used outside of coding work, one of the most important things we can do is not overly build it like, okay, this is AI capabilities, but only specifically for finance only for specifically for this workflow, but actually build a much more open ended tool that someone can just use for any given task creatively.

Speaker 1

是的。

Yeah.

Speaker 1

但这不是把责任或努力又推回给用户了吗?回到了你所说的瓶颈——人类行动不足的问题上?

But does that not put the onus or the effort back on the user, back to the point of your bottleneck of human action and lack of activity on them?

Speaker 1

如果你不定义任务,就把定义任务的责任推给了用户,而人类恰恰缺乏这种能力或意愿。

If you don't define the task, you put the responsibility on them for the defining the task, which humans lack the ability or inclination to do.

Speaker 2

是的。

Yeah.

Speaker 2

所以这就是我认为的瓶颈所在。

So that's why I think it's the bottleneck.

Speaker 2

所以,在我看来,大致有三个阶段。

So basically, here are the three phases in my mind.

Speaker 2

首先,让代理在软件工程和编程中表现得非常好,因为大语言模型恰好擅长这个。

First, let's have agents work really well for software engineering and coding because LLMs happen to be good at that.

Speaker 2

接下来,我们要意识到,为了让代理在更广泛的场景中发挥作用,让它能够使用计算机非常关键。

Next, let's realize that for an agent to be useful more generally, it using a computer is super valuable.

Speaker 2

而且我们还会意识到,所有代理实际上都是编程代理,因为编程是代理使用计算机的最好方式。

And also, we'll realize that all agents are actually coding agents because coding is just the best way for an agent to use a computer.

Speaker 2

让我们把这个同样高度灵活的理念推广给所有渴望探索和动手尝试的人。

So let's take that same super flexible idea, but make it available to anyone who's excited to explore and tinker.

Speaker 2

我们已经看到人们开始用类似Codex的应用程序来做这件事,Codex虽然是为软件开发者设计的,但我们现在看到开发者们用它来完成各种非编程任务。

And we're already seeing people start to do this with like the Codex app, like Codex app is built for software builders, but we're seeing builders use it for all sorts of non coding tasks.

Speaker 2

最后,一旦我们看清了哪些方法有效,就去构建你提到的产品化方案,即提供高度特定的功能,让用户开箱即用。

Then finally, once we see what's working, let's build that productization that you were talking about, where you have highly specific features that just work immediately out of the box for people.

Speaker 2

我认为在未来几个月里,我们会快速走完这个从一到三的完整过程。

And I think we're going to speed run this entire like one, two, three journey in the next months.

Speaker 1

我对你说的关于企业内部FTE和实施的挑战是数据安全、敏感性、权限管理和访问授权,这些真的非常困难,而且我认为人们,尤其是在大型企业中,远没有我们想象的那么聪明和自信,抱歉。

My challenge with what you said about kind of FTEs and implementation within enterprise is data security, sensitivity, permissioning, access provisions is really fricking hard and people are much less intelligent and confident than we give them credit for, I think, especially in large enterprise, sorry.

Speaker 1

我认为你确实需要一个FDE去深入定制各种横向解决方案,才能让它们真正发挥作用。

And I think you actually need an FDE to go in and custom fit a lot of the different horizontal solutions to make it work.

Speaker 1

我错了吗?

Am I wrong?

Speaker 2

我觉得你说得对。

I think you're right.

Speaker 2

如果你正试图从零到一完全实现,而且我刚才说的,并不是想表达负面的意思。

If you're trying to go like all the way from zero to one and you have this like, and I said, I don't mean grand negatively here.

Speaker 2

但如果你有一个宏伟的愿景,想要打造一个终极的工作流自动化系统,那么是的,你必须克服所有这些真实存在的安全和合规障碍,对吧?

But if you have like a grand vision for some like ultimate workflow automation system, then yeah, you're going to have to clear through all of these security hurdles, all these like compliance hurdles that are really real, right?

Speaker 2

连接所有这些数据系统以及记录和操作体系。

Build connections to all these data systems and like systems of record and action.

Speaker 2

是的,你需要一个FTE来完成这项工作。

Yeah, so you're going to need an FTD to do that.

Speaker 2

我所看到的是,当我们自上而下地做这些事情时,往往会严重低估AI的潜力,也无法真正帮助企业;而你其实可以并行地推进,对吧?

What I've seen is that when we do these things, top down, we end up like massively underleveraging the potential of AI and like helping that company, Whereas you can maybe do that in parallel, right?

Speaker 2

但如果你能直接把AI交给真正从事这些工作的人,他们就能开始形成对AI如何提供帮助的心理模型。

But if you can just give AI to the people like actually doing the work, they can start to like get a mental model for how AI can help.

Speaker 2

然后他们就能同时将AI融入自己的工作流程中。

And then they can start pulling AI into their workflows at the same time.

Speaker 2

举个简单的例子:假设你在客服岗位工作,AI正被引入你的工作,开始自动化你工作中相当重要的部分,但你从未听说过ChatGPT,也不被允许使用它。

Here's just like an analogy or something here is like, imagine if, you work in like a customer support role, and AI is being brought into your role and starting to automate like meaningful chunks of your work, but you've never heard of ChatGPT, nor are you allowed to use it.

Speaker 2

在这种情况下,你对这个东西完全没有直观感受。

So in that scenario, you have like no intuition for what this thing is.

Speaker 2

而在一个你同时在工作中使用ChatGPT、并且部分工作正被大语言模型自动化的世界里,你对它的运作方式会有更深刻的直观理解。

Whereas in a world where actually you've been using ChatuchPutty for work at the same time as like parts of your work are getting automated by an LLM, you have much more intuition for how this works.

Speaker 2

而且,我认为你会觉得这种加速过程让你更有掌控感,你能够一定程度上引导这些自动化功能的构建方向,而不是像一种完全外来的、令人无力的‘天降神兵’式的东西。

And, you know, I would argue you feel much more empowered about this idea that it's being accelerated, and you have some degree of control to steer like where these automations are built, as opposed to like, it's like this complete, like, ex machina kind of thing that is quite disempowering.

Speaker 2

所以回到这一点,我认为存在一种可行的方法,因为你提到的数据控制问题确实存在。

So bringing this back, like I think there is a way to do this because the data control issues you mentioned are real.

Speaker 2

但归根结底,每一个工具、每一个功能、每一个工作流程,都是为某个地方的某位员工服务的,而这位员工最终是通过浏览器或文件系统来使用这些工具的。

But at the end of the day, every tool, every feature, every workflow is for a human who is somewhere, an employee somewhere, and that employee is accessing that tooling via their browser or via their file system, like at the end of the day.

Speaker 2

因此,归根结底,所有东西最终都会落实到一个本地运行的智能代理可以交互的界面。

And so, at the end of the day, everything comes to an interface that an agent running locally on your computer can work with.

Speaker 2

我认为这相当不寻常,比如OpenAI,我们正在构建一个浏览器形态的Atlas。

And I think it's quite unusual, like an OpenAI, we're building a browser Atlas.

Speaker 2

你可能会好奇为什么。

And you might wonder why.

Speaker 2

这其中有很多原因。

And there are many reasons why.

Speaker 2

但我认为其中一个关键原因是,通过构建浏览器并端到端地严格控制它,我们可以为企业打造安全的智能代理浏览功能。

But I think one of the key reasons is that by building a browser, and by controlling it like tightly end to end, we can build like safe agentic browsing for enterprise.

Speaker 2

这是一种以代理方式访问那些尚未由全职员工开发出来的功能的方法。

That is a way to access things agentically that is that are otherwise not yet built out by FTEs.

Speaker 1

我有很多问题想问你。

There are so many questions that I have to ask you.

Speaker 1

我想先回去,免得思路断了。

I I wanna go back before I lose thread.

Speaker 1

你提到工程师们不愿意合上笔记本电脑,因为他们不想因为使用代码库而浪费生产力和时间。

You mentioned about engineers, like, not closing their laptops because they don't actually wanna lose productivity and time with with building with codecs.

Speaker 1

你与Cerrebris合作,而Cerrebris显然是目前最快的推理服务提供商。

You partnered with Cerrebris and Cerrebris is the fastest provider obviously of inference out there.

Speaker 1

这无疑是个了不起的成果,坦白说,对双方都是如此。

Amazing win, I think, for both bluntly.

Speaker 1

对于开发者在使用Codex以及未来AI编程时,速度有多重要?

How important is speed for developers when using Codex and in the future of AI code?

Speaker 2

我的意思是,这些简单的答案,真的非常重要。

I mean, these simple answers, it's it's super important.

Speaker 1

那这是否构成了一种推理垄断?

We And so is it like an inference monopoly?

Speaker 1

也就是说,现在你们拥有这项技术,而竞争对手还没有?

Like, you have it now and competitors don't?

Speaker 2

这只是我的观点,但我认为我们不会陷入这种垄断局面。

This is just my opinion, but I don't think we're gonna end up in, this kind of monopolistic world.

Speaker 2

我认为竞争压力太大了,最终会涌现出多种解决方案。

I think there's so much competitive pressure that there'll be like multiple answers to this.

Speaker 2

但我可以透露,关于我们这项合作的消息很快就会公布。

But I will say that we have like news coming out about that partnership soon.

Speaker 2

我非常期待这些成果能够落地发布。

And I'm very excited for these kinds of things to ship.

Speaker 2

这将会非常棒。

It's going to be awesome.

Speaker 2

但即便如此,像GPT 5.3这样的编码器模型,其效率相比之前的模型有了显著提升。

But even so, like, you know, with GPT 5.3 codecs, that model is like significantly more efficient than prior models.

Speaker 2

根据我们收到的反馈,人们确实觉得现在的模型比以前快得多。

And so we've in the feedback we've heard is that people actually feel like now this is like a very competitively fast model than before.

Speaker 2

因此,在模型层面,你可以做很多事情。

So there's a lot of things you can do, just in terms of the model.

Speaker 2

此外,你还可以通过改进推理方式来提升性能。

There are also things you can do with like improving how you do inference.

Speaker 2

我们最近推出了一项变更,在API中,这些模型的响应速度提升了40%,在Codex中则提升了25%。

So we recently rolled out a change where in the API, like those models are served like 40% faster and in Codex, they're served 25% faster.

Speaker 2

所以我认为速度非常重要。

So I think like speed matters a lot.

Speaker 2

我们正从多个角度入手,包括硬件、推理方式以及模型本身。

And we're kind of approaching it from all angles, like both the hardware, how you do inference and the model level.

Speaker 1

你之前提到过把主动权交给用户,我们当时谈到了推理。

You mentioned earlier about kind of putting it in the hands of users and we talked about inference there.

Speaker 1

我一位好朋友是Saster的Jason Lemkin,他说实际上推理就是新的销售和营销。

One of my dear friends is Jason Lemkin from Saster and he says that actually inference is the new sales and marketing.

Speaker 1

不再需要销售和营销团队,而是通过支付推理成本,让用户能够快速上手、轻松看到价值,最终销售和营销团队可能会被取代。

Instead of sales and marketing teams, you're paying for Inference so users can onboard quickly, easily see value, and you will actually see the removal of sales and marketing teams.

Speaker 1

这可以说是下一代产品驱动增长(PLG)。

It's kind of like next gen of PLG.

Speaker 2

我不确定。

I don't know.

Speaker 2

我觉得我对这个观点有些保留。

I think I struggle with that.

Speaker 2

在这个任何人都能构建、并且构建变得越来越容易的新世界里,什么才是困难的呢?

I think, fundamentally in this new world where anyone can build and it is increasingly easy to build things, what is hard?

Speaker 2

对吧?

Right?

Speaker 2

我认为,与客户建立良好的关系并了解他们的需求,和以前一样困难,甚至可能更难,因为市场上可供选择的东西太多了。

I think having a good relationship with the customer and knowing what they need is as hard as ever, maybe even harder as it's just like there's just more stuff in the market to choose from.

Speaker 2

还有其他更难的事情,比如打造正确的东西,做出高质量的产品。

You know, the other things that are harder, like building the right thing, having a really high quality thing.

Speaker 2

但回到销售和营销这个话题,我认为它并不会消失,因为正如我所说,随着市场上软件越来越多,任何细分市场都变得更加竞争激烈,这反而让销售和营销变得更难了。

But going back to the sales and marketing thing, like, don't think that goes away because I think that's as like I said, I think that's just gotten harder as the as the markets any given market gets more competitive with more software out there.

Speaker 1

今天,你们内部的代码有多少是通过Codex生成的?

How much of internal code for you today is produced by Codex?

Speaker 1

我记得,Boris说过,Claude for Work的代码几乎是100%由它生成的。

I remember, like, Claude for Work, Boris said, was, like, a 100% or nearly a 100%.

Speaker 1

内部使用Codex的比例是多少?

How much is internal Codex used?

Speaker 2

我先说说自己,再说说团队的情况。

So I'll speak for myself and then for the team.

Speaker 2

据我所知,大多数人基本上已经不再打开编辑器了。

Would say, like, most people that I know are basically not opening editors anymore.

Speaker 2

这是一次阶跃式的变化,虽然之前一直在逐渐发生,但我认为关键的外部市场转折点是GPD 5.2 Codecs,突然之间,模型在运行时长、端到端处理任务、管理上下文和遵循指令方面都变得强大得多。

And this was a step function change that happened in it's been happening gradually, but I'd say the key external market touchpoint for this was, g p d 5.2 codecs, where all of a sudden the model was like way better running for longer, handling tasks end to end, managing its context, and following instructions.

Speaker 2

因此,我们看到了这个拐点。

And so we kind of saw this inflection point.

Speaker 2

而这正是我们开发这款应用的部分原因。

And that's actually what part of why we built the app.

Speaker 2

所以我认为,在GPD 5.2 Codecs之前,我们用来写代码的AI功能主要是代码补全,或者你是在和模型进行结对编程。

So I think what before gpd 5.2 codecs, the kinds of AI features we were using to write code were like, tab completion, or maybe you were pair programming with the model.

Speaker 2

在我看来,你仍然需要坐在电脑前,手放在键盘上。

And in my mind, you still need it to be at your laptop with your hands on the keyboard ish.

Speaker 2

它可能会去执行一些小任务,但你还是得在场,主导整个过程,它只是帮你处理一些小事。

Like, And it might go off and do a little bit of work, but you know, you're kind of still need to be there and like drive, it's just like handling these small things for you.

Speaker 2

而在去年12月GPD 5.2 Codecs发布时,我们开始转向完全委托任务的做法。

And then at the time of g p d 5.2 codecs in December, we kind of switched to like, actually, I'm just going to fully delegate this task.

Speaker 2

也就是说,我会和它一起制定计划,确认它要执行的规格,然后就放手让它去运行。

It's like, you know, I'm going to do a plan with it, make sure we like the spec that it's going to do, and then I'm just going to let go let it cook.

Speaker 2

这是一种完全不同的工作方式。

And this is quite a different way of working.

Speaker 2

所以,它正在我们说话的时候,真正地发生变化。

So it's like it's changing, like, literally as we speak.

Speaker 2

因此,我们上周发布的Codex应用,部分原因就是为了打造一种形态或用户体验,让委托任务而非与代理协作感觉非常顺手。

And so part of why we built this Codex app that we released last week is because we wanted to build like a form factor or user experience where it felt like very ergonomic to be delegating instead of pairing with an agent.

Speaker 2

所以,同时委托给多个代理。

And so like delegating to multiple agents at once.

Speaker 2

甚至在OpenAI内部,这也正在发生巨大变化。

And so even at OpenAI, this is changing massively.

Speaker 2

我没法给你一个具体的百分比数据。

I don't have a percentage stat for you.

Speaker 2

但我可以说,绝大多数代码都是由AI编写的。

But I would say like the vast majority of code is written by AI.

Speaker 2

我认为现在,大多数人甚至根本不会打开IDE了。

And I would say that now, probably like, most people are not even like opening IDEs.

Speaker 2

也许他们打开IDE是为了掌控界面。

Maybe if they are opening IDEs to like, maybe you want to own the interface.

Speaker 2

对吧?

Right?

Speaker 2

所以你会帮忙梳理两个模块之间的接口,然后由AI来填充内容。

So you'll like help flush out like the interface between like two modules, and then like AI fills it out.

Speaker 2

或者你可能想参与规划,但让AI来完成具体实现。

Or maybe you wanna, like, collaborate on a plan, but then have AI fill it out.

Speaker 2

代码本身已经不再由人类编写了。

The code itself is not being written by humans anymore.

Speaker 1

二十四个月后,我们会把ID纳入技术栈吗?

Will we have IDs as a part of the stack in twenty four months time?

Speaker 2

这取决于你怎么定义IEE。

Depends how you define IEE.

Speaker 2

正式的定义,也就是集成开发环境。

So the the formal definition, right, integrated development environment.

Speaker 2

我的意思是,这个说法太模糊了,几乎任何东西都可以被称为IDE。

I mean, that that phrase is so squishy that, like, literally anything could be an IDE.

Speaker 2

对吧?

Right?

Speaker 2

所以我觉得这个说法没什么用。

So I don't think that's very useful.

Speaker 2

如果这就是答案的话,那确实是。

If that's the answer, then yes.

Speaker 2

你甚至可以说Codex应用就是一个IDE。

You could even argue the Codex app is an IDE.

Speaker 2

我不这么认为。

I don't think it is.

Speaker 2

对我来说,IDE就像一个功能强大的编辑器。

Like, for me, think of an IDE is like a really powerful editor.

Speaker 2

我们特意没有在Codex应用中加入编辑功能,因为我们希望用户能清楚地知道该如何使用它。

And we explicitly didn't build editing into the Codex app because we wanted it to be really clear how you're meant to use it.

Speaker 2

所以,它具备很多功能,用于管理多个代理、委派任务和审查更改。

So, know, it has a lot of affordances for managing multiple agents for delegating, for reviewing changes.

Speaker 2

它拥有非常突出的技能,这些技能是开放标准,对于非编码工作非常有用,比如任务分类、监控部署等,但它没有文本编辑功能。

It has really prominent skills, which are an open standard that are really useful for doing non coding work, stuff like, you know, triaging tasks or monitoring deploys or something, but it doesn't have text editing.

Speaker 1

如果我们假设大部分代码是由AI生成的,那么你如何进行代码审查?AI是否应对内部代码审查负责?

If we assume a large percentage is done by codecs in terms of the code produced, how do you do coding reviews and is AI responsible for internal coding reviews?

Speaker 2

这里有几个要点。

There are a few things here.

Speaker 2

首先,你想要实现的目标或计划变得比以往任何时候都更重要。

First off, the spec for what you wanna do or the plan becomes more important than ever.

Speaker 2

从架构角度思考,这段代码应该如何运作?

Think like architecturally, like how should this code work?

Speaker 2

我们最近推出了一种非常显著的计划模式,它的运作方式与其他模式略有不同:代理会自行提出如何完成某项任务的方案。

We recently shipped like a very prominent plan mode that works a little differently than others where you have the agent go off and like propose how it's going to do something.

Speaker 2

这个计划相当长。

It's like quite a long plan.

Speaker 2

它会问你是否同意它的做法,或者你是否希望提供意见。

And it asks you questions about if you agree on how it wants to do it, or if you want have input.

Speaker 2

这非常类似于新入职的员工,他们对你的代码库还不熟悉,必须在开始工作前向团队提交一份征求意见的请求。

And this is very similar to like if you had a new hire who was new to your code base, you know, they had to present a sort of request for comments to the rest of the team before they started doing the work.

Speaker 2

因此,尽管这并不属于正式的代码审查,但我认为对计划的审查正变得越来越重要,因为我们正进入与代理协作的更多委托阶段。

So even though that's not formally code review, I would say review of the plan is actually something that's becoming more important because we're entering more of this like delegation phase of working with agents.

Speaker 2

这是一个被低估的方面。

So that's an underrated thing.

Speaker 2

然后,好吧,接下来是实际的代码审查。

Then, okay, there's actual code review.

Speaker 2

我经常听到人们谈论一个问题,尤其是在开源领域:大量AI生成的低质量代码,人们只是把垃圾PR提交到开源仓库中。

I think a problem that I hear a lot of people talking about, especially in the open source world is like a lot of AI slop, like people will just be submitting PRs to these open source repos, and they're trash.

Speaker 2

而且,提交PR的人可能根本没测试过代码,更不用说审查代码了。

And like, maybe the user hasn't even the person submitting the PR hasn't even tested them or definitely hasn't reviewed the code.

Speaker 2

我认为这是一个问题。

I think this is a problem.

Speaker 2

因此,Codex 的一种常见做法是让 Codex 自己审查自己的拉取请求或更改。

And so a common practice with Codex is to have Codex like review its own PR or its own change.

Speaker 2

Codex 在这方面实际上非常出色,我们专门训练了模型使其擅长代码审查。

And Codex is actually incredibly good at this, we've explicitly trained the model to be good at code review.

Speaker 2

这包括确保它能够提供高价值的反馈意见。

And you know, that included things like making sure it's like really good at creating like high signal feedback.

Speaker 2

因此,它对批评的误报率很低,这意味着你可以非常信任它的反馈。

So it'll, like, basically have few false positives of criticism, which means you can really trust when it has feedback.

Speaker 2

因此,我们不仅鼓励团队内外的人主动让 Codex 进行审查,还可以设置为自动审查。

And so not only do we encourage people, on the team and elsewhere, like, to, like, just ask Codex to review, you can then also set it up to just, like, automatic review.

Speaker 2

实际上,OpenAI 几乎所有的代码在推送到合格的仓库时都会由 Codex 自动审查。

So like nearly all code at OpenAI is reviewed by Codex automatically whenever you push it to a good repo.

Speaker 2

对于还没试过 Codex 或最近没用过的人,有一个有趣的做法:让人让 Codex 审查其他模型的代码。

Actually, like one one fun thing for people who haven't tried Codex yet or didn't try it recently, sometimes the way that people like see how good our models are is by asking Codex to review a different models code.

Speaker 2

然后他们就会说:‘天哪。’

And basically, they're like, oh, shoot.

Speaker 2

我大概应该直接用Codex来写我的代码了。

I should probably just be using Codex to write my code in general.

Speaker 1

你提到一个非常有趣的观点,对于那些可能还没试过,或者重新回来使用的人,你怎么看待这个领域的用户留存问题?

You said something really interesting that you said for those that maybe haven't tried it yet or, you know, coming back to it, how do you think about retention with this category?

Speaker 1

我记得YC的合伙人Tom Blomfield几个月前发过一条推文,虽然很久了,但一直留在我脑海里——关于在不同提供商之间切换的便捷性,无论是Curse、CoreCode还是Codex,说实话我记不清具体是哪一个了。

I remember Tom Blomfield, who's a YC partner, tweeted months and months ago, but it stuck with me, weird brain, about the ease of transition between different providers, whether it was Curse or CoreCode or Codex, can't remember which one it was to be honest.

Speaker 1

但用户有多粘性?你怎么看待留存问题?

But how sticky are users, and how do you think about retention?

Speaker 2

我们对Codex采取了一种反直觉的做法,就是非常开放地构建它。

We've taken this, like, kind of counterintuitive approach with codex to just build it super openly.

Speaker 2

所以,Codex的核心引擎是开源的,我们一直在努力让用户更容易切换。

So, like, the codex core harness is open source, and we're always trying to make it easier for people to switch.

Speaker 2

比如,去年我们刚推出Codex时,甚至‘推出’这个词都显得太重了。

So for instance, when we first launched Codex last year, we created like created as even a heavy word.

Speaker 2

那只是个约定俗成的叫法,叫做agents.md。

It was just just established convention, which is called agents dot m d.

Speaker 2

这基本上是一个你可以放入代理指令的文件。

This is basically a file that you can put instructions for the agent in.

Speaker 2

而我们没有称之为Codex md,我们只是希望它能成为所有代理都能使用的标准。

And instead, we didn't call it Codex md, we just wanted it to be something that all agents can use.

Speaker 2

除了Claude之外,几乎每个代理都使用agents.md。

And pretty much every agent except Claude uses agents.

Speaker 2

这太棒了。

Md, which is awesome.

Speaker 2

就在上周,我们推动将技能标准化,技能是用来给代理提供指令和脚本的。

And then just last week, actually, we helped push for putting skills, which are a standard for like giving the agent instructions and scripts.

Speaker 2

我们推动将这些技能放在一个中立命名的文件夹中,称为agents,而不是放在codecs之类的文件夹里。

We push for those to be sorted in sort of a neutral named folder called agents, instead of in like codecs or something.

Speaker 2

而且,除了那个一贯的例外,所有人都采纳了这个标准。

And again, everyone has jumped on it except the usual suspect.

Speaker 2

我认为这对开发者来说拥有大量选择是非常棒的。

I think it's really great for the developers to have a lot of choice.

展开剩余字幕(还有 480 条)
Speaker 2

我们正努力让人们更容易尝试不同的东西。

And we're trying to make it even easier for people to try different things.

Speaker 2

话虽如此,这些编码任务——当你让代理编写代码时——是非常封闭的。

Now that said, these coding tasks, right, where you're asking an agent to write some code, they're quite hermetic.

Speaker 2

我所说的封闭,可以用电视节目来打个比方,就像单元剧一样。

And what I mean by this is maybe an analogy in TV would be like episodic, right?

Speaker 2

比如,你可以进来,有一个开放的、任何代理都能读取的代理文件,还有任何代理都能使用的技能,你可以让代理编写代码,它会生成一个补丁并提交到 Git 中。

Like, you can come in, and you've got this like open ended, like agents file that any agent can read from, you've got these skills that any agent can use, and you can ask the agent to write some code, and it produces a patch and that patch goes into git.

Speaker 2

因此,目前这个过程的两端都非常中立、厂商无关,切换起来非常容易。

So kind of like both ends of this are pretty neutral, vendor neutral, so very easy to move between for now.

Speaker 2

当代理开始从事的不再是编写代码,而是更通用的工作——无论是针对软件工程师还是其他构建者时,它们就需要开始与其他系统交互。

As agents start to do work that is not writing code, but more general work, again, for software engineers or beyond for any builder, they're going to need to start interfacing with other systems.

Speaker 2

所以,当代理开始与 Sentry、Google 文档或其他系统通信时,我认为这些代理会变得更具粘性,因为决定让代理连接到某个系统是一个难以逆转的决策。

So as they start, maybe your agent is talking to Sentry, right, or it's talking to your Google Docs or something, then I think these agents become much stickier because actually deciding to connect an agent to that system is a sticky decision.

Speaker 2

如果你是企业用户,真正信任代理能访问这些工具,那么确保有完善的安全防护、沙箱环境以及对代理如何与这些系统交互的控制机制,就显得至关重要。

And if you're an enterprise, really trusting that the agent is going to have access to these tools, but there are really good secure guardrails and sandbox and like controls over how the agent works with these systems, think is critically important.

Speaker 2

这可不是你愿意反复去做的事情。

And that's not something that you're going to want to to do multiple times.

Speaker 2

因此,我们在开发Codex时就已经预见到这一点。

And so, you know, we've been kind of building Codex knowing that this is coming.

Speaker 2

所以我们采用了最保守的沙箱策略。

And so we have like the most conservative sandboxing approach.

Speaker 2

沙箱本质上是一组操作系统级别的控制机制,用于限制代理的行为。

Sandboxing is kind of like a set of controls, OS level controls over what the agent can do.

Speaker 1

但我很喜欢《七种力量》这本书,它深入探讨了企业获取价值和可持续性的七种方式。

But I'm a fan of Seven Powers, this brilliant book, which talks about kind of seven ways that businesses accrue value and sustainability.

Speaker 1

而你的用户粘性或留存率就是其中之一。

And like your stickiness or your retention is one.

Speaker 1

如果我们和柯达在同一团队,该如何建立持久的模式、行为和项目,确保人们留在Codex,而不是在有更好的模型时转向Cursor或ClawCode?

If we're on the same team with Kodak, how do we create retentive patterns, behaviors, programs to ensure that people stay with Codex and they don't flip to cursor when there's a better model or ClawCode when there's a better model?

Speaker 2

是的。

Yeah.

Speaker 2

我的意思是,这很有趣,因为一方面,我们当然在经营一家公司,但我们的使命是确保安全地将AGI的好处惠及全人类。

I mean, it's interesting because I think on the one hand, like we think about this, obviously, we're running a business, but, you know, our our mission here is to, like, ensure that, like, we safely deliver the benefits of AGI to all humanity.

Speaker 2

所以,关于Codex团队,有些事情对人们来说是反直觉的。

And so something that's, like, unintuitive to people about, like, the Codex team.

Speaker 0

亚历克斯,我其实我是我

Alex, you actually I'm I'm I'm

Speaker 1

我知道,但你的职责是Codex的成功。

I know but your job is the success of Codex.

Speaker 1

我明白。

I get

Speaker 2

实际上,我们的职责是智能的分发。

actually, our job is the distribution of intelligence.

Speaker 2

因此,我们显然在构建Codex,这对许多听众来说非常反直觉。

And so we're obviously building out Codex, and this is really unintuitive to a lot of listeners.

Speaker 2

但我们投入了大量精力训练这些模型,然后却把这些模型提供给我们的竞争对手。

But, like, we put all this effort into training these models, and then we serve these models to our competitors.

Speaker 2

从我们的角度来看,

And from our perspective,

Speaker 1

作为一位风险投资家,这对我来说太难理解了。

this is so difficult for me as a venture capitalist to understand.

Speaker 1

你对此是知情的。

You are aware of this.

Speaker 2

是的。

Yeah.

Speaker 2

我完全清楚这一点。

Totally aware of it.

Speaker 2

就像我们OpenAI是一个非常有趣且非同寻常的工作场所。

It's like we're OpenAI is like a really interesting and unusual place to work.

Speaker 2

但基本上,因为我们着眼于长远,如果竞争对手变得更强,我们就能学到东西。

But basically, because we're playing such a long game for us, if the competition gets better, we learn.

Speaker 2

这实际上对我们有帮助。

It's actually helpful for us.

Speaker 2

所以我们正在大力推动Codex的发展。

And so we're pushing really hard at growing Codex.

Speaker 1

你们会学习吗?

Do you learn?

Speaker 1

因为如果他们封闭起来并取得进步,你们就学不到东西了。

Because if if if they're closed and they improve, you don't learn.

Speaker 2

我不这么认为。

I don't think so.

Speaker 2

例如,最近有一些新产品发布。

For example, there are a bunch of recent launches.

Speaker 2

比如今天早上,我刚刚转发了一条关于Warp发布的产品推文。

Like, today, I literally just, like, quote tweeted a thing this morning about a launch from Warp.

Speaker 2

并没有任何特定的关联。

No particular affiliation.

Speaker 2

对吧?

Right?

Speaker 2

里面还有很多很棒的想法,关于他们如何设计代理,使其能够同时在云端和本地运行。

And there are a bunch of cool ideas in there about how they, like, framed up the way that their agent can work in the cloud at the same time as working locally.

Speaker 2

对我来说,这非常有启发性。

And for me, that's like inspiring.

Speaker 2

我看到了来自不同公司的各种这些成果。

And I think I see all these things from various companies.

Speaker 2

我觉得这个领域最酷的一点是,我们所有人最终都会不约而同地得出相同的结论,然后各自去实现它们。

And like, one of the coolest things about the space is it's like, we're all kind of inevitably reaching the same conclusions together and then building things out.

Speaker 2

所以,在Codex团队里,我认为我们有一些巨大的优势,对吧?

And so, you know, on the Codex team, think I we have some massive advantages, right?

Speaker 2

我们拥有ChatGPT带来的巨大分发优势,拥有训练我们自己的模型并使其在我们的系统中表现优异的能力优势,同时构建出适配新模型的优秀系统,而其他人根本没有机会提前接触到这些。

We have the massive distribution advantage with ChatGPT, we have the massive like capability advantage of training our own models to be good in our harness, and building our harness to be good at the new models, and like no one else has early access to those.

Speaker 2

因此,我认为我们是在为胜利而战,拥有一个或多个巨大的优势,但同时我们也在打一场持久战——因为我们把模型提供给所有人,推动开放标准,让每个人都能使用我们所倡导的一切。

And so I think we're we're playing to win and we have a a really big advantage or a number of advantages, but we're also playing this long game where, you know, again, we serve our models to everyone, where we push for open standards so that everyone can use, like all the things that we're pushing for as well.

Speaker 1

我能问你一个问题吗?赢得胜利的决定性因素会是什么?

Can I ask you, what would be the defining factor of winning?

Speaker 1

我知道我用了风险投资的术语,而你非常出色,更自由和开放。

And I know I'm using venture language and you're brilliant and kind of much more free and open.

Speaker 1

但赢得胜利的关键因素是什么?我再问你一次,是市场推广策略吗?就是全球最大的企业是否愿意与OpenAI合作?

But it was like the defining factor of winning, again, I push you, is it like GTM, is which like the biggest enterprise in the world do want to work with OpenAI?

Speaker 1

我有很多朋友在你们的销售团队里。

I have many friends in your sales team.

Speaker 1

你们从最大品牌那里获得的主动咨询量简直不可思议。

The inbound that you get from the largest brands is incredible.

Speaker 1

所以,是凭借卓越的品牌、产品执行力,以及Codex本身是个超棒的产品,还是计算与推理速度、真正的计算优势?

So GTM because of the incredible brand, product execution and just Codex being a fricking awesome product, or compute inference, speed, actual compute advantage?

Speaker 1

哪一个才是决定性的胜利因素?

Which one is the defining winner?

Speaker 2

好吧。

Okay.

Speaker 2

所以,如果我们从OpenAI的角度来讨论这个问题,显然这超出了我的职责范围,但我会说,是计算优势和拥有最好的模型。

So I think if we're going to talk about it more from an OpenAI perspective, obviously, this is way above my pay grade, But I would say it's compute advantage and having the best models.

Speaker 2

为了实现这一点,我们需要建立能够产生收入的业务。

And in order to achieve that, we then need to build businesses to generate revenue.

Speaker 2

而且我们注意到,有了Codex团队——这个融合了研究和产品职能的团队——通过打造这些成功的产品,我们反而催生了更快改进模型的巨大压力。

And also that something we've that's really interesting we noticed with having the Codex team, which is a sort of combined team of research and product is also by building these these successful products, we create a lot of pressure to improve the model in sort of a faster way.

Speaker 2

这可能是从公司角度来说的。

That's maybe the company perspective.

Speaker 2

对吧?

Right?

Speaker 2

如果我们从产品角度出发,我认为我们能做的最重要的一件事,就是打造一个人们真正想要使用的优秀产品。

If we come to the product perspective, I think the single most important thing we can do is build a a really good product that people want to use.

Speaker 2

就像我之前说的,我们真正希望为个人打造产品,让人们对这些产品变得熟练,然后自然地融入自动化功能。

And like I was saying earlier, I think we really want to build products for individuals, and then allow, like, people to become fluent in those products, and then like, pull in automation.

Speaker 2

我认为这可能看起来反直觉,但相比单纯从企业工作流角度出发,这种方式会带来更大的影响力。

And I think that may be counterintuitive, but will result in way more impact than anyone purely approaching it from like the enterprise workflow perspective.

Speaker 2

我觉得这主要是个产品执行的问题。

You know, I think that's mostly a question of product execution.

Speaker 2

这对专业消费者来说是有效的。

And then that works for say, like prosumer.

Speaker 2

谈到企业市场,推广策略非常重要。

When it comes to enterprise, the go to market side is really important.

Speaker 2

我通过惨痛教训学到的是,如果我们去接触企业,只是说:嘿,我们在这儿,随便用我们的东西,这是行不通的。

Something that I've learned the hard way is if we go to an enterprise, and we're just like, hey, we're here, like, feel free to use the stuff that doesn't work.

Speaker 2

实际上需要做大量的教育工作。

There's actually quite a lot of education that needs to be done.

Speaker 2

而且我们需要支持很多配置,并对整个团队进行培训。

And there's a lot of, like, configuration that we need to support and sort of like education of the broader team.

Speaker 2

所以这个过程更像是:先介入,进行推介,与开发者体验负责人或其他相关人士会面,了解他们希望团队如何运作,然后提供工具,帮助他们将这种运作机制推广到整个团队。

So like, that motion looks much more like coming in, pitching, meeting the head of developer experience or whatever, understanding how they want their team to operate, and then giving them tools to like propagate that mechanism of operating to the rest of the team.

Speaker 1

你刚才提到了‘收入’这个词,这是衡量企业的一个指标。

You said the word revenue there, which is one metric to measure a business against.

Speaker 1

当你思考自己的成功指标时,你会坐下来和萨姆、布拉德或者其他人说:嘿,我们正在优化的目标是什么。

When you think about like your metric of success, which you sit down with Sam or Brad or whoever it is and say, hey, this is what we're optimizing for.

Speaker 1

你们用什么指标作为衡量进展的核心目标?

What is the metric that you use as the defining north star for your progression?

Speaker 2

其实不是收入,最主要的是活跃用户。

It's actually not revenues, the primary, the primary is active users.

Speaker 2

你们怎么

How do you

Speaker 1

你们如何衡量活跃用户?是日活跃用户吗?

measure active users, daily active users?

Speaker 2

是的。

Yeah.

Speaker 2

所以我们衡量的是周活跃用户,也就是这个人有没有真正使用过我们的产品?

So we measure weekly active users and it's, know, did this person like actually do a turn in our product?

Speaker 2

他们有没有发送过一个提示?

You know, did they send a prompt?

Speaker 1

你觉得周活跃用户这个指标足够频繁吗?

Is weekly active a frequent enough metric do you think?

Speaker 1

听起来不错。

Sounds nice.

Speaker 1

但如果这真的取代了IDE,每日活跃用户不是更好吗?

But if this is actually replacing the IDE, is daily active not better?

Speaker 2

我认为每日活跃用户很快就会更好。

I think daily active will be better soon.

Speaker 2

我们只是恰好使用每周活跃用户。

We just happen to use weekly active.

Speaker 2

这里这是一种标准。

It's like a standard here.

Speaker 2

而且我认为在我们起步时,这样做是有道理的。

And I think as we were getting started, it made sense.

Speaker 2

但我真的同意那里的批评意见。

But I I I actually agree with the the criticism there.

Speaker 2

我们应该干脆直接用每日活跃用户。

It's like, we should probably just be a daily.

Speaker 2

我的意思是,我们应当进入这样一个时代:无论你面对什么任务,第一反应都是找一个智能代理来帮忙。

Like, I think we we need to be getting to a world where for any given task that you have, your first instinct is to ask an agent to help.

Speaker 2

对吧?

Right?

Speaker 2

就像使用谷歌搜索一样,不管需要做什么,我只要打开那个文本框,就能被引导到正确的地方。

It's kinda like, you know, how, like, with Google search, it's just like, okay, anything I need to do, I just like go into this text box and I can get navigated to the right location.

Speaker 2

然后有了ChatGPT。

Then you had ChatGPT.

Speaker 2

对于任何我需要的信息,我都可以打开这个文本框,打出来,就能获得有用的答案。

It's like for any information I need, I can go into this text box, type it out and get information that helps me.

Speaker 2

我认为今年我们将看到的下一阶段是:针对任何我需要完成的任务——而不仅仅是获取信息——我只要进入这个文本框或输入界面,就会有某种帮助发生,哪怕它只完成了任务的一小部分。

And I think the next phase that we'll see this year is like for any task I need to do, as opposed to just get information, I go to this text box or this input, and something happens that helps me, even if it's not the full task, even if it's only a small part of it.

Speaker 1

你刚才提到聊天这件事,我又跑题了。

You said about kind of chat that again, I jump around.

Speaker 1

抱歉,我妈妈得陪我在伦敦散步,她得应对我这种躁动不安、跳跃式思维的大脑。

Sorry, my brain My mother has to walk with me around London and she deals with this manic episodic brain.

Speaker 1

但你刚才提到聊天和那个界面。

But you said about chat and the interface there.

Speaker 1

我对此非常着迷,因为对于忙碌的人来说,这似乎是一种极其高效的输入方式。

I'm really fascinated by this because it is a seemingly incredibly efficient input function for busy humans.

Speaker 1

但我最近和安德森的全科医生阿尼什·阿卡亚聊过,他说不不不,这东西是萨姆和马斯克创造的,只适合效率极高的人。

But I spoke to Anish Akaya, who's a GP at Andreessen, and he came out the other day, and he's like, no, no, no, this was created by Sam and Elon and it works for very efficient people.

Speaker 1

但地球上大多数人还是想要基于浏览器的发现式交互界面。

But most of the planet want browser based discovery interactions UIs.

Speaker 1

你认为聊天会成为下一代AI与人类交互中持久的界面吗?

Do you think that chat will be the enduring UI in the next wave of AI interaction with humanity?

Speaker 2

简单回答是肯定的,但其实我觉得这里有两个方面。

The simple answer is yes, but actually I think there's two components here.

Speaker 2

比如,如果我们只是想象一下未来,就像看一部科幻电影那样。

Like, if we if we just imagine the future, like, just like, let's think of some sci fi movie.

Speaker 2

对吧?

Right?

Speaker 2

那么,人工智能看起来是什么样子?

Like, what does AI look like?

Speaker 2

我相信科幻作品能很好地预测未来应有的样子。

I I believe that sci fi is a really good predictor of what the simple the future should look like.

Speaker 2

而且通常它都很简单,因为故事需要简洁,我认为简单往往才是正确的。

And usually, it's pretty simple because it's the story, and I think simple is usually right.

Speaker 2

它应该就是一个实体,我可以以任何方式、谈论任何话题和它交流,对吧?

It's gonna be some just, like, entity that I can talk to however I want about whatever I want, right?

Speaker 2

我觉得我不应该不得不切换到一个专门用于编程AI的界面。

I felt like I shouldn't have to navigate to a place where I work with like my coding AI.

Speaker 2

然后我还要去另一个地方使用我的销售AI。

And then I have this like different place for my like sales AI.

Speaker 2

我还得特意提醒自己:现在我在和销售AI对话,然后开始操作。

And I have to like be like, hey, I'm now talking to sales thing and like do that.

Speaker 2

它就该是一个我直接和它说话,它就会帮我解决问题的东西。

It's just like, I just gonna talk to a thing and it's just gonna help.

Speaker 2

所以我认为,我们未来会拥有聊天或语音形式的对话式界面,它将成为你与之交谈任何事物的核心,你可以将它添加到任何群聊或其他场景中。

So I think what we're gonna have is that we'll have chat or voice, basically conversational interface will be sort of the the pillar of everything that you can talk to about anything, and that you can add into any group chat or whatever.

Speaker 2

它能够自行学习如何帮助你。

So it can like discover how to help you.

Speaker 2

但如果你是一个高级用户,对某件事非常擅长,你可能不希望因为必须与另一个人交谈而被中间环节打断。

But then if you're like a power user and you're very good at a specific thing, you probably don't want to be disintermediated by having to talk to another person.

Speaker 2

这就像是你有一个执行助理,但你只能通过与他交谈来工作。

It'd be like if you had an executive assistant, but you can only work by talking to them.

Speaker 2

这简直太烦人了。

That's like super annoying.

Speaker 2

对吧?

Right?

Speaker 2

所以在某个时候,你希望直接查看笔记,自己浏览并编辑它们。

So So at some point, you wanna you wanna get to the show notes and, like, look at them yourself and, like, edit them yourself.

Speaker 2

对吧?

Right?

Speaker 2

你想自己编辑内容。

You wanna edit the thing yourself.

Speaker 2

所以我认为我们会把聊天功能与专门为个人需求定制的功能性图形界面结合起来。

So I think we'll pair chat with, like, functional, like, graphical interfaces that are bespoke to, like, what someone needs.

Speaker 2

就我而言,我可能会通过聊天来完成我的播客准备工作。

So, like, in my case, I will probably chat to, like, do my, you know, podcast prep.

Speaker 2

但当我需要查看产品和代码时,我更希望进入像Codex这样的应用,深入操作。

But when it comes to like, actually looking at product and code, I probably want like the Codex app that I can go into and get deep in.

Speaker 2

对吧?

Right?

Speaker 2

而如果是一个市场人员,他们可能会通过聊天来询问产品相关的问题。

Whereas maybe if we're talking to a marketer, maybe that marketer will like chats to ask questions about the product.

Speaker 2

他们不会为了问产品问题就去下载Codex应用。

They're not going to download the Codex app just to ask questions about the product.

Speaker 2

但他们可能会使用一个高度定制的图形界面,比如广告数据分析工具,来深入使用。

But maybe they'll have like a super custom GUI for like ad analytics or something that they go into.

Speaker 1

我完全理解。

Totally get that.

Speaker 1

而且这在某种程度上替我假定了一段旅程中某个时刻的消费者互动。

And it it kind of wrongly assumes on my behalf a consumer interaction at some point in that journey.

Speaker 1

我想问你,你怎么看待代理与代理之间的体验,以及如何为代理设计体验?

And I wanna ask you, how do you think about agent to agent experiences and designing experiences for agents?

Speaker 1

我们之前讨论过,比如如何帮助大型企业。

We spoke about, for example, going to large enterprises and how you can be helpful.

Speaker 1

我只是选了个最无聊的例子:费用审批。

I'm just choosing the most boring thing ever, expense approval.

Speaker 1

你可以有一个代理代表我提交我去旧金山旅行的费用,而另一边的代理则代表OpenAI的合规部门进行审批。

You could have agent submission of expenses on my behalf for my trip to San Francisco, and then the agent on the flip side doing approvals for that from OpenAI's compliance department.

Speaker 1

你怎么看待这种模式的转变?

How do you think about that and that paradigm shift?

Speaker 2

这很有趣。

That's interesting.

Speaker 2

说实话,我不确定这会是什么样子。

To be honest, I'm not sure what that's gonna look like.

Speaker 2

我最快的回答是,我们在构建编码器时发现,最适合编码器工作的界面,通常也最适合人类使用。

My quickest answer to this is that we've noticed as we build codecs that the best interfaces for codecs to do work are also tend to be the best interfaces for humans.

Speaker 2

所以当人们问,如何让我的代码库对代理更高效时?

So when people ask, oh, how can I make my code base, like, more efficient for the agent to work with?

Speaker 2

答案通常是:那你自己看过吗?

The answer is often like, well, have you looked at it yourself?

Speaker 2

它对人类来说容易使用吗?

And is it is it easy for a human to work with?

Speaker 2

一个具体的例子是,在代码库中运行测试。

So like a very specific example would be, like, running tests in a code base.

Speaker 2

如果只是简单地设置大多数测试运行器,它们会输出所有测试的全部结果。

Naively, if you just like set up most test runners, they just like in it all the outputs of all the tests.

Speaker 2

作为人类,这真的很烦人,因为你必须去找到那个失败的测试,还得阅读成千上万行内容。

And so like as a human, it's really annoying because you have to go in and like find the one that failed, and it's like you've got to read hundreds of thousands of lines.

Speaker 2

结果发现,这对AI来说也很糟糕。

Turns out that's terrible for AI as well.

Speaker 2

但如果你只输出失败的测试,对人类更好,对智能体也更好。

But if you filter it down to just only emit the failed test, better for humans, also better for agents.

Speaker 2

因此,智能体之间的交互点很可能与有人参与时非常相似。

So probably the agent to agent interaction points will be very similar to like if there was a human in the loop.

Speaker 2

这很好,因为它意味着你可以逐个替换各个系统。

And that's nice because it means you can kind of atomically replace individual systems.

Speaker 1

我提到了我们在LinkedIn上发布的节目,以及来自另一家公司的优秀投资人。

I mentioned our show on LinkedIn and a wonderful investor from a different company.

Speaker 1

那是哈利·波特里的伏地魔,你知道的?

Is that Harry Potter, you know, Voldemort?

Speaker 1

就像那个不能说名字的人。

And it's like, you know, he who shall not be named.

Speaker 1

我不希望萨姆杀了我。

I don't want Sam to kill me.

Speaker 1

但来自另一家公司,我曾经

But from another company, I was

Speaker 0

比如,你问他,你怎么看待

like, you ask him, how do you think about

Speaker 1

编码数据护城河?

a coding data moat?

Speaker 1

Anthropic现在拥有所有数据了吗?

And does Anthropic have all the data now?

Speaker 2

我认为,根据我们所看到的,当然,这方面我得听从我的研究团队的意见,但我觉得我们拥有足够多的数据来构建非常优秀的编码模型。

I think that from what we've seen, and, you know, and I'll I would defer to my research team on this, but I feel like we we feel like we have plenty enough data to build really good coding models.

Speaker 2

实际上,我认为现在获取数据更有意思的地方在于,当我们进入知识型工作任务时,这类数据在互联网上大多数地方其实并不可得。

I actually think the the place that's more interesting for getting data now is, like, as we get into, knowledge work tasks, that's kind of data that's, like, not really, like, available most places on the Internet.

Speaker 2

因此,你会开始产生一些非常有趣的头脑风暴,思考如何帮助模型擅长这些任务。

And so you start to have, like, really interesting brainstorms for, like, how to help a model be good at it.

Speaker 2

比如,你可能需要付费让人模拟执行任务,以便你能够学习这些轨迹供模型使用。

Like, maybe you have to, like, pay people to, like, simulate doing tasks so that you can, like, learn these trajectories for the model.

Speaker 2

也许你应该收购那些已经停止运营但拥有大量数据的初创公司,比如Slack之类的。

Maybe you should acquire startups, you know, that are no longer in business but have have a lot of, like, data, like, say, Slack or something.

Speaker 2

是的。

Yeah.

Speaker 2

我认为这类知识工作任务的分布比编程要难得多。

I think that that kind of knowledge work task distribution is, like, much harder than coding.

Speaker 1

你提到的那种不存在的数据,这一点非常有趣。

That's so interesting you said there about kind of the data that doesn't exist,

Speaker 0

换句话说。

so to speak.

Speaker 0

你觉得

How do you think

Speaker 1

你与数据提供者——你的Macaws、你的Turing、你的Invisibles——的互动如何?

about your interactions with the data providers, your Macaws, your Turing's, your Invisibles, your of the world?

Speaker 1

比如,你会在那里投入10倍的资源吗?

Like, will you spend 10x there?

Speaker 1

还是你会认为,我们在数据上的花费太多了,应该自己来做数据采集?

Or will you go, we are spending too much on data, we should do it ourselves and do data acquisition?

Speaker 2

是的,我的意思是,我们思考这些问题的方式就是,如何尽可能快地推进?

Yeah, I mean, I think the way that we think about these things is just like, how do we move as quickly as possible?

Speaker 2

因此,自己搭建这些系统在时间上非常昂贵,而我们是一个小团队。

And so becoming able to set these things up in house is like very expensive in time, and we're a small team.

Speaker 2

所以到目前为止,我观察到的是,如果我们需要大规模开展数据项目,通常会寻求这些公司的帮助。

So what I have observed so far is that if we need to run a data campaign at scale, we're usually gonna enlist help from one of these companies.

Speaker 1

在Codex的消费者端,我们已经讨论过如何进入企业市场,以及如何提升开发者体验和开发者关系。

On the consumer side for Codex, we've spoken about like enterprises and going into them how to engage in terms of developer experience, developer relations.

Speaker 1

在未来一两年内,你们会在低端消费者市场与Lovable和Rapid竞争吗?

Do you compete with a Lovable and a Rapid on a like low end consumer basis in a year or two's time?

Speaker 1

这是一个你们会认为,像柯达这样的产品,并不适合每个人来创建个人简介,或小企业搭建自己的网站的业务吗?

Is that a business where you're like, you know what, Kodak is not for every person to create an about me or a small business to create their own site.

Speaker 1

你们如何看待消费者端的这种定位?

How do you think about consumer in that way?

Speaker 2

是的。

Yeah.

Speaker 2

我会说,目前我们并没有感受到直接的竞争。

I would say that right now, it doesn't feel like we're competing super directly.

Speaker 2

但我不知道你有没有看过我们的超级碗广告,它的标语是‘你可以轻松构建东西’。

But I don't know if you saw our Super Bowl ad, the tagline of which is this you can just build things.

Speaker 2

通过这款应用,我们注意到许多技术能力较弱的人已经开始构建东西了。

With the app, we noticed that like many people who are less technical are starting to build things.

Speaker 2

因此,他们所构建的东西大多非常基础,像是‘Hello World’级别的。

And so the kinds of things they're building are much more hello worldy.

Speaker 2

所以我认为,我们会看到一些使用场景上的重叠,比如人们因为ChatHPT内置了这些工具而直接使用编解码器。

And so I think that we will see some overlap in use cases where you have people just pulling up codecs because they have it as part of their ChatHPT.

Speaker 2

实际上,上周有一个重大公告:我们现在甚至向免费版ChatHPT用户或Go版ChatHPT用户提供一些编解码器。

Actually, like a big announcement, last week was that we're now offering some codecs to people even on free ChatHPT plans, or on the Go ChatHPT plan.

Speaker 2

这在让更多人获得访问权限方面意义重大。

So this is this is massive just in terms of like bringing availability to everyone.

Speaker 2

所以我认为,肯定会有一些使用免费 ChatHPT 计划的人进来,构建一些原本他们可能会去使用专业工具完成的简单东西。

And so I think we're definitely going to see people with like a free ChatHPT plan coming in and just like building simple things where they otherwise might have gone to a specialized tool.

Speaker 1

你最想做但因为某种原因无法改变的是什么?

What would you most like to do differently, but for whatever reason you can't?

Speaker 2

我觉得最近几周对我们来说非常好。

I feel like it's been a very good few weeks for us.

Speaker 2

所以我们非常兴奋,对正在发生的一切都充满热情。

So we're very I'm pretty jazzed by everything that's happening.

Speaker 2

我感受到的最多的就是这种情绪。

Feeling that I have the most.

Speaker 0

是的。

Yeah.

Speaker 0

这真的很有趣。

That's really interesting.

Speaker 0

你说最近几周

You said it's been a

Speaker 1

对我们来说,这是非常棒的几周,我也感受到了。

very good few weeks for us, and I feel that.

Speaker 1

团队是否感受到正负两方面势头的变化?

Does the team feel the changing winds of momentum both in positive and negative cycles?

Speaker 2

当然。

Absolutely.

Speaker 2

我们对此非常敏感。

We are very attuned to it.

Speaker 2

对吧?

Right?

Speaker 2

比如,看看Codex的历史,我们去年推出的第一个东西,是一个让人无比兴奋的绝佳创意。

Like, you look at the the history of Codex, the first thing we launched last year was, like, this amazing idea that people were super excited about.

Speaker 2

就像是,我们给智能体在云端配备一台独立的电脑,你可以拥有任意数量的智能体并行为你完成任务。

It's like, hey, we're going give the agent its own computer in the cloud, you can have as many of them as you want work for you in parallel on tasks.

Speaker 2

超级棒的点子。

Super great idea.

Speaker 2

说实话,它没有我们后来发布的产品那么好。

To be honest, it didn't work as well as what we shipped later.

Speaker 2

它并不是最好的。

It was not the best.

Speaker 2

从八月开始,随着GPT-5的推出,我们大力推动交互式编程,这也是市场上大多数竞争的焦点。

And then since August with GPT-five, we started pushing really hard on interactive coding, which is where most of the competition in the market is.

Speaker 2

我们简直势如破竹。

You know, went on an absolute tear.

Speaker 2

我觉得我们当时的公开数据是,自八月以来,我们的增长达到了20倍。

I feel like the public metric we had was like since August, we grew by like 20 x.

Speaker 2

甚至在年底,我们从十二月到现在又翻了一倍。

And then like even like late in the year, we like doubled from December to now.

Speaker 2

那个确切的数字我记不清了。

I forget the exact number there.

Speaker 2

但那确实是与对手激烈竞争、不分上下。

But, like, that was competing neck and neck.

Speaker 2

但上周我们感受到的转变是,我们认为自己拥有最智能的模型,这一点通过五三编码器得到了巩固。

But the the shift that we feel last week is, you know, we we felt like we had the most intelligent model that was cemented with five three codecs.

Speaker 2

我们收到了一些反馈,说我们的模型运行较慢,可能不够有趣,而且在工作时与用户沟通的能力也较差。

We had feedback around our model being slower and, like, maybe less fun to work with and, like, being less good at communicating with you while it was working.

Speaker 2

我们针对这些反馈进行了改进,甚至相比那些比我们早二十分钟发布竞品模型,情况也明显更好,或许可以说它们有点逊色。

We addressed that feedback, and that's true even compared to, like, the the other competitor model that launched, like, twenty minutes before us and was like, maybe this is spicy.

Speaker 2

它只领先了二十分钟。

It was, soda for twenty minutes.

Speaker 2

Soda 意思是业界领先。

Soda means state of the art.

Speaker 2

我们一直收到很多关于 Codex 用户体验质量的反馈。

And then we'd always been getting a lot of feedback on, like, the quality of the user experience in Codex.

Speaker 2

我们最受欢迎的界面是 IDE 插件,而命令行界面(CLI)则显得不够完善。

Our most popular surface was the IDE extension, and our CLI, which is the command line interface, was less polished.

Speaker 2

但通过这款应用,市场反馈强烈表明,这是一款非常高质量的体验。

But with the app, the feedback has been like resounding from the market that this is like a really high quality experience.

Speaker 2

它简单得有点出人意料,人们都非常喜欢使用,即使是我们最大的用户也已全部转化。

It's like simple, like unintuitively simple, and people are just loving using even our biggest credits are converted.

Speaker 2

所以是的,然后我们做了超级碗广告,接着就免费了。

So yeah, and then we and then we had the Super Bowl ad, and then we went to free.

Speaker 2

回到你之前问的问题,我最想做出什么不同的改变?

And going back to your question of like, what do I most want to do differently?

Speaker 2

我有两点想跟你分享。

I have two things for you.

Speaker 2

第一点是,我其实想重新回归云服务。

The first is I actually want to get back to cloud.

Speaker 2

当我们去年将战略从专注于云代理转向交互式开发时,当时的思路非常简单。

When we pivoted our strategy from like focusing on the cloud agent last year to working interactively, the thinking was very simple.

Speaker 2

这其实有点像我之前跟你提到的FDEs的情况。

It was just and it's kind of like what I was telling you about FDEs actually.

Speaker 2

如果你在终端用户还没熟练掌握工具、无法轻松使用之前,就过早地追求工作流自动化,就会出现脱节,最终只能是一个不切实际的空想,除了最资深的用户外,其他人根本用不起来。

If you go too far ahead to workflow automation before your end user is fluent with the tooling and can get it to work simply, then there's like this disconnect and you just have this pipe dream idea that's not like effective except for the most power users.

Speaker 2

但一旦你有了一个基础,人们每天都在使用你的工具并进行配置,每次使用它时它都会变得更好,那么让其在云端独立运行就只是一个很小的跃升。

But once you have this base where people are using your tool every day and they're configuring it and every time they use it, it gets better, then, like, the step up to, like, letting it run independently in the cloud is a much smaller step up.

Speaker 2

所以我认为现在是我们重新回归云端产品开发,并使其与本地产品高度集成的时候了。

So I think it's time for us to, like, get back to, like, building out the cloud product and making it super tightly integrated with the local product.

Speaker 2

它目前已经有一定的集成度了。

It already is somewhat integrated.

Speaker 2

我想做出的另一个改变是,更多地关注瓶颈问题。

And the other thing I want to do differently is, start thinking more about the bottlenecks.

Speaker 2

比如代码生成,写代码现在基本上已经变得微不足道了。

Like code gen, writing code has become, like, basically trivial now.

Speaker 2

但困难的部分正如你提到的代码审查,对吧?

But the hard part is like what you were talking about with like code review, right?

Speaker 2

我们如何知道代码质量是好的?

Like, how do we know the code quality is good?

Speaker 2

我们如何确定自己在做正确的事情?

How do we know we're doing the right things?

Speaker 2

这些瓶颈仍然被低估,投入也不够。

And those bottlenecks, I think are under underappreciated still and under invested in.

Speaker 2

所以,我认为我们要朝着这样一个方向努力:拥有一个不受瓶颈限制、值得信赖的代理,它可以独立负责整个微系统或内部工具,并完成完整的迭代循环,包括来自用户的反馈,而无需人工审查。

So like, I think we want to get to a world where you can have an agent that is unbottlenecked that you trust to like own an entire microsystem or internal tool or whatever, and can do the full iterative loop, including feedback from users without having to go through human review.

Speaker 2

这既是一个从智能角度难以解决的问题,也是一个从安全性和控制角度难以解决的问题。

And that is a really hard problem to solve both from an intelligence perspective, but also from like a safety perspective and a controls perspective.

Speaker 1

我们应该在基准测试和评估上赋予多大权重?

How much weight should we place on benchmarks and evals?

Speaker 2

这可能是个让你觉得烦的答案。

Probably, this is an annoying answer for you.

Speaker 2

就是一些吧。

It's like some, right?

Speaker 2

在我看来,它们确实能很好地衡量智能水平。

Like they do tell you in my mind, they give you a good measure of intelligence.

Speaker 2

因此,你可以对这些基准在智能方面的表现给予重视,尤其是在评估指标尚未饱和之前,当你看到这些基准取得实质性进展时,会非常有帮助。

And so you can put weight on those for intelligence And especially before evals are saturated, I think when you see meaningful progress in those benchmarks, it's like very, very helpful.

Speaker 2

然后我认为你必须将这一点与使用模型的实际感受结合起来。

And then I think you have to pair that though with like what it feels like to use the model.

Speaker 2

这是一种感觉上的东西。

And that's that's a vibes thing.

Speaker 2

每当我与任何人交流,哪怕是内部人员,或是我们模型的客户时,我总是惊讶于人们对模型使用体验的评估竟然如此依赖感觉。

Like whenever I talk to any, even internally or even talking to, like, customers of our models, I'm always surprised by how vibes based the evaluation of how it feels to work with the model is.

Speaker 1

生活真是靠感觉啊。

How vibes based life is.

Speaker 1

人们更愿意和自己喜欢的人共事,这是我给孩子们的建议。

People wanna work with people they like is the lesson that I give to kids.

Speaker 2

没错。

Exactly.

Speaker 2

人们更愿意和自己喜欢的妈妈们共事。

People want to work with the moms they like.

Speaker 1

从市场构成的角度来看,作为投资者,我必须思考如何预测这个市场的最终状态。

In terms of like market composition, as an investor, I have to think through how do I think about the eventual state of this given market kind of a terminal state.

Speaker 1

你如何看待这一点?

How do you think about that?

Speaker 1

是像Uber和Lyft那样,大部分市场都会集中在Codex或ClawCode上吗?

Is it like Uber and Lyft and like the majority of the market will be on Codex or ClawCode?

Speaker 1

还是像AWS、Azure、Google Cloud那样,形成三足鼎立的局面?

Or is it like AWS, Azure, Google Cloud and a 30 three-thirty three-thirty three?

Speaker 2

好吧,我认为从长远来看,最终能捕获大量价值的提供商可能会更少。

Okay, so I think this might end up with fewer providers that are capturing a lot of value in the long run.

Speaker 2

原因如下。

And here's why.

Speaker 2

这可能有点尖锐,但我认为我们现在正处于一个临时阶段,即代理在编程方面表现得非常出色。

And maybe this is a bit spicy, but I think that we are kind of in this temporary phase where we have agents that are really good at coding.

Speaker 2

对吧?

Right?

Speaker 2

如果回顾去年,也许更多人认为我们会拥有在其他领域也表现优异的代理。

And if you look back last year, like maybe more people thought we would have agents that are good at other domains too.

Speaker 2

但去年并没有发生这种情况。

But that didn't happen last year.

Speaker 2

所以目前来看,整个行业只有编程代理达到了产品市场契合度,对吧?

So we only have PMF for coding agents, like in the industry overall, I would say, right?

Speaker 2

然后还有一些非常狭窄的其他应用场景,比如客户服务等。

And then there's some like very narrow, narrow other use cases, like customer support, etc.

Speaker 2

但我认为这可能是暂时的。

But I think that's probably temporary.

Speaker 2

随着时间推移,我们将拥有能够为你做任何事情的代理。

And then over time, we're going to end up with agents that kind of can do anything for you.

Speaker 2

这就像我之前说的,会有一个超级助手,你可以跟它聊任何话题。

This kind of what I was saying earlier, like there's just like a super assistant, you talk to it about anything.

Speaker 2

然后如果你恰好深入某个特定功能,也可以使用专门的界面查看。

And then there is like specific UI that you can go look at if you happen to be deep in a specific function.

Speaker 2

在这种情况下,我不认为一家公司需要配备12个不同的代理,让员工去摸索该找哪个,因为那样他们就无法达到熟练程度。

So in that world, I don't think you want like 12 agents at the company and you have to like go your employees have to go figure out the right ones to talk to, because then they won't achieve fluency.

Speaker 2

如果他们不想达到熟练程度,就不会愿意将自动化融入自己的工作中。

If they don't want to achieve fluency, then they will also won't like pull automation into their roles.

Speaker 2

但如果你有一个可以和它聊任何事情的工具,对吧?

But if you have this one thing that you can talk to about anything, right?

Speaker 2

所以在入职时,只需去和这个工具聊聊你需要的任何事,人们就会形成习惯去使用它,它将成为工作的中心,人们也会主动引入自动化。

So you're onboarding, it's just like go talk to this thing about anything you need, then people will develop muscle memory to go to it, it'll become the center of gravity of work, and people will pull an automation.

Speaker 2

所以我认为,这种未来更有意义。

So I think that that future makes much more sense.

Speaker 2

而且我觉得,像ChatGPT的构建者们,其实非常适合实现这一点。

And I think like, as the people building ChatchPT were like really well set up to deliver that.

Speaker 2

这可能有点牵强,但一个类比是,我以前在Dropbox工作过。

This this is kind of a stretch, but an analogy here is I used to work at Dropbox.

Speaker 2

在Slack还没兴起的那段时间里。

And for a while, this is before Slack was big.

Speaker 2

有一段时间,我们一直在想,人们应该是在Dropbox的文档里评论,还是应该去Slack里讨论这些文档。

And for a while, we thought we wondered if people should like go comment on like documents in Dropbox or or if they should like go talk about the documents in Slack.

Speaker 2

很明显,人们更倾向于在Dropbox中视频的正确时间戳下留言,或者直接在文档上留言。

And it was like obvious that it was like more optimal for people to like put comments on the right timestamp in the video in Dropbox or like comment on the document in Dropbox.

Speaker 2

对吧?

Right?

Speaker 2

所以这样更高效。

So it was more optimal.

Speaker 2

然而,我们看到Slack已经成为人们彼此交流的中心枢纽。

However, what we saw is that Slack is just such a center of gravity of people just like talking to each other.

Speaker 2

根本没人愿意在文档上留言。

Like nobody wants to comment on the document.

Speaker 2

我只想在Slack上给你发消息。

I just want to Slack you.

Speaker 2

因此我们发现,即使效率更低,人们也强烈倾向于在Slack上进行交流。

And so we saw that like there was this really big pull towards things happening in Slack even if it was less efficient.

Speaker 2

我认为工作中也会出现类似的情况:如果有一个能应对几乎所有需求的单一智能体,就会形成巨大的吸引力,每个人都会讨论他们如何用这个智能体处理各种事情。

And I think we're going to see something similar at work where if there is a single agent you can use for nearly anything, it there will just be this giant poll, and everyone will talk about how they use that one agent for things.

Speaker 2

团队之间会分享最佳实践。

Teams will share best practices with each other.

Speaker 2

会围绕如何最佳使用这个工具举办黑客马拉松。

There'll be hackathons around how to use that best thing.

Speaker 2

是的。

Yeah.

Speaker 2

最终你只会剩下寥寥几个这样的工具。

And you'll end up with just a handful of these.

Speaker 1

你提到过,除了编程之外,代理在其他领域的使用并没有真正扩散,而客户服务就是一个例子。

You said about kind of agents not really proliferating in terms of usage other than coding, and actually maybe this being the time and and, customer support is one of the examples.

Speaker 1

我的问题是,我今天是一名投资者。

My question to you is, I'm an investor today.

Speaker 1

我在寻找那些能够随着时间积累价值、为客户提供卓越产品的企业。

I'm looking for companies which would accrue value over time and provide incredible products to customers.

Speaker 1

有一种观点认为,如今大型SaaS公司的收入持久性为零,SaaS已经死亡,因为像你、Anthropic这样的模型提供商将前来分走我们的蛋糕。

There is a belief that the durability of revenue of large SaaS companies today is zero and that SaaS is dead because the model providers, you, Anthropic, others are going to come for our lunch, so to speak.

Speaker 1

你建议我怎么做?

What would you advise me?

Speaker 2

像这些工具都是为人类设计的,否则意义何在?

Like things are built for humans, like otherwise, what's the point?

Speaker 2

即使是SaaS工具也是为人类设计的。

Even SaaS tools are built for humans.

Speaker 2

所以对我来说,我想问的是,这家SaaS公司是否与终端的用户建立了关系?

So for me, I think my question is like, does this SaaS company own a relationship with a human on the other end of things?

Speaker 2

如果建立了关系,那我怀疑它不会消失?

And if it does, then I suspect it's not going away?

Speaker 2

还是说这家SaaS公司掌握着某种至关重要的核心数据系统?

Or does the SaaS company own some like really important system of record?

Speaker 2

它大概也不会消失。

It's probably not going away.

Speaker 2

也许这两点——与人的互动和核心数据系统——实际上比以往任何时候都更重要。

Maybe both of those two things, the interaction with the human and the system of record are like more important than ever actually.

Speaker 2

另一方面,这家SaaS公司是否只是一种粘合层,却并不拥有上述任何一项?

On the other hand, is the SaaS company like a kind of a glue layer, but it doesn't own either of those two things?

Speaker 2

我不是这方面的专家,但我对这类公司更感到担忧。

I'm not the expert here, but I'm more nervous about that kind of company.

Speaker 1

如果我们持这种观点,Salesforce和ServiceNow的股价已经下跌了20%、30%、40%。

If we take that stance, Salesforce and ServiceNow, they're down 20%, 30%, 40%.

Speaker 1

我认为这种下跌被严重夸大了。

I think it's massively exaggerated.

Speaker 1

我认为确实有一些公司理应如此。

I think there are some companies that legitimately should be.

Speaker 1

恕我直言,我认为Dropbox正处于非常艰难的境地。

Respectfully, I think Dropbox is in a very difficult position.

Speaker 1

但像monday.com这样的公司,对于绝大多数使用它的中小企业和消费者而言——这实际上是它们的主要市场——难道用户不能自己用代码写一个待办事项清单吗?

And I think your monday.coms of the world though, for the majority of SMBs and consumers who use it, which is the large majority of their market, actually, could they vibe code a to do list?

Speaker 1

当然可以。

Yes.

Speaker 1

这样做会更经济高效吗?

Would it be cost efficient to do so?

Speaker 1

其实不会,一旦你开始定制并完善它,成本就上去了。

Not really actually by the time you customize it and perfect it.

Speaker 1

说实话,这个待办事项列表在功能上通常都很平淡,无非就是添加任务、完成任务、显示分配给新成员的历史任务。

And to be honest, this to do list is generally pretty bland in terms of what you need to do, add task, complete task, show historical tasks assigned to new members.

Speaker 1

这并不难。

It's not very difficult.

Speaker 1

所以实际上,我觉得你就直接保留它好了。

And so actually, I think you just keep it.

Speaker 1

所以我觉得这被严重夸大了。

And so I think it's massively overblown.

Speaker 1

我认为这是市场典型的本能反应。

And I think that's the classic knee jerk reaction from markets.

Speaker 1

我觉得你最终还是会转向客户支持,而我不希望自己属于这一类。

I do think you're going to come for customer support and I wouldn't want to be in that category.

Speaker 2

我觉得这可能会改变你投资的创业者类型。

I think this maybe changes what kind of founder you invest in.

Speaker 2

对吧?

Right?

Speaker 2

比如,曾经有一段短暂的时期,作为产品构建者,我个人很喜欢那种专注于能做出好产品的创业者。

Like, think there was this maybe temporary phase where that I liked personally as a product builder, there was this phase where you would invest in like the the person who can just like build good product.

Speaker 2

当时你可以忽略他们是否对客户、市场推广或分销有清晰的见解,因为做出好产品实在太难了。

And you could kind of ignore if they had a good thesis around a customer or go to market or distribution or anything like that, because it was so hard to build good product.

Speaker 2

对吧?

Right?

Speaker 2

我认为那只是一个异常现象。

And I think that was a that was an anomaly.

Speaker 2

如果我们看看现在的情况,也许这种类型的创业者不再是你应该投资的对象了,因为现在做出好产品相对容易了,你需要重新回归到投资那些深入思考过分销渠道、对特定客户群体有深刻领域专长的创业者。

If we look at where we are now, like, maybe that kind of founder is not the founder you should invest in because it's, like, kind of relatively easier to build good product, and you need to go back to, like, investing in the founder who's, like, thought through distribution, who has a good good domain expertise of what to build for a specific customer, etcetera.

Speaker 1

所以,再次假设你是我团队中的一名投资人,你会如何思考我们值得关注的领域,投资那些能够积累价值、不会被模型提供商威胁的公司?

So, again, if you were on my team as an investor, how would you think about interesting areas for us to invest in companies that will accrue value and not be threatened by model providers?

Speaker 1

因为again,你进入医疗领域,进入代码领域,显然Codex非常明确,你进入客户服务领域,那你不去哪里呢?

Because again, like you're going to health, you go into code, obviously, Codex is very clear, you go into customer support, Where are you not going?

Speaker 1

那为什么Claude Code不去呢?

And why is Claude Code not going?

Speaker 2

我真想直接说,我不知道。

I'm tempted to just say, like, I don't know.

Speaker 2

我觉得现在当投资人真不容易。

I couldn't I I think it's a hard time to be an investor.

Speaker 2

市场太动态了。

It's a the market is so dynamic.

Speaker 2

很难说清楚。

It's hard to say.

Speaker 1

如今投资真的非常困难。

It's a really tough time to be investing today.

Speaker 1

我的回答其实是两方面的:第一,我寻找具有物理基础设施的领域。

My my answer is kind of twofold actually, which is like, number one, I look for things with physical infrastructure.

Speaker 1

我不认为你会进入能源供应领域。

I don't think you're going into energy supply.

Speaker 1

第二点是金融科技和银行整合,以及复杂的金融产品。

And then two is like the fintech and banking integrations, gnarly financial products.

Speaker 1

我不认为OpenAI会去东南亚的500家银行建立合作关系。

I don't think OpenAI is going to go into building 500 relationships with banks in Southeast Asia.

Speaker 2

我倾向于同意。

I tend to agree.

Speaker 2

我再说一遍,这又回到一个问题:你是否要进入一个极其复杂、客户关系和市场知识至关重要的市场?

I I I again, it comes back to, are you going into, like, a gnarly, complicated market where customer relationships and, like, knowledge of the market are everything?

Speaker 2

这看起来仍然很不错。

That still seems great.

Speaker 1

人才争夺战有多激烈?

How bad is the war for talent?

Speaker 1

从英国来看,我们观察旧金山,我会告诉企业,最好在欧洲发展,因为根本无法吸引和留住人才。

From The UK, we look at SF, and I say to companies it's better to build in Europe because it's impossible to acquire talent and it's impossible to retain it.

Speaker 1

我错了吗?

Am I wrong?

Speaker 2

我认为现在的人才争夺战极其激烈。

I think that the war for talent is incredibly fierce right now.

Speaker 2

你知道,显然在OpenAI,我们拥有非常强大的品牌,因此能够吸引大量人才。

You know, obviously, at OpenAI, we have an incredibly strong brand, and so we're able to attract a lot of talent.

Speaker 2

但即便如此,我们仍投入大量精力去争取那些我们非常看好的候选人。

But even so, we put a ton of effort into, like, closing candidates that we're really excited about.

Speaker 2

甚至可以说,我们也不觉得可以免费随意招到任何想要的人。

Even, like, even we feel that it's not like you don't just get whoever you want for free.

Speaker 1

我可以问一下,你们提供股票的入门价格,对顶尖人才来说仍然有吸引力吗?

Can I ask, at the entry price that you get a stock at, is it still attractive for the best talent?

Speaker 2

还没有人告诉我相反的情况。

I haven't had anyone tell me anything to the contrary.

Speaker 1

你在多大程度上会考虑寻找完美匹配的人,而不是找一个足够好的人?

To what extent do you think about, like, finding the perfect fit versus finding someone who's good enough?

Speaker 2

所以,你知道吗,我之前开过一个玩笑,说产品经理其实是可有可无的。

So, you know, earlier I made my joke about, like, PMs kinda being optional.

Speaker 2

是的。

Yeah.

Speaker 2

但我认为这其实并不对。

I think that's not actually true.

Speaker 2

你仍然需要产品方面的人才。

You still need product people.

Speaker 2

但我确实认为他们必须是完美匹配的人选。

But I do think that they have to be the perfect fit.

Speaker 2

如果你找的人不是完美匹配,反而可能弊大于利。

And if you have someone who's like not the perfect fit, might just do more harm than good.

Speaker 2

这意味着我们在选拔上比我在其他职位时要严格得多。

It's kind of means that like we're way more selective than I might have been in other roles.

Speaker 1

我是一名计算机科学专业的学生。

I'm a CS student.

Speaker 1

好吧?

Okay?

Speaker 1

我在斯坦福。

And I'm at Stanford.

Speaker 1

我是帝国理工的。

I'm an Imperial.

Speaker 1

我在剑桥。

I'm at Cambridge.

Speaker 1

我在ETH任何地方。

I'm wherever ETH.

Speaker 1

顶尖学府。

Great institution.

Speaker 1

基于你现在的所有认知,你对我有什么建议,能帮助我规划未来五年的职业发展?

What would you advise me knowing all that you know now that would help me navigate the next five years of my career?

Speaker 1

我想成为一名工程师,在未来一年进入职场,并为AI生态系统创造价值。

I want to be valuable to the AI ecosystem environment as an engineer entering the workforce in the next year.

Speaker 2

基本上,现在是工程师最好的时代,因为你拥有令人难以置信的工具,可以完成海量工作。

Basically, there's actually never been a better time to be an engineer because you have incredible tooling available to you to get incredible amount done.

Speaker 2

而且,你快速上手一个可能被雇佣进入的复杂代码库的能力从未如此之快,因为你随时可以向AI询问大量关于代码库的问题,还能让它帮你规划那些原本可能需要你花几天时间研究的改动。

And your ability to like ramp into like a complex code base that you might be hired into has never been faster because you can go ask AI like a ton of questions about the code base, and you can ask it to plan out changes that would otherwise take you like days to research maybe.

Speaker 2

我认为首先,你应该保持非常乐观的态度。

I think first off, I would say like you should be like very optimistic.

Speaker 2

但当然,你也要考虑自己入职后希望提升哪些能力。

But then of course, like about you want your abilities once you're at the job.

Speaker 2

那么接下来的问题就是:你该如何找到这份工作?

Then now the question is how do you get the job?

Speaker 2

因为现在构建东西从未如此简单。

Because it's never been like easier to build things.

Speaker 2

真正变得稀缺的是主动性、品味和质量。

The thing that becomes scarcer is like agency, taste, and like quality.

Speaker 2

我建议你直接动手去构建一些东西,展示你的主动性和对作品的品味,打造高质量的作品,然后分享出去。

I would urge you to like, just build things and demonstrate your agency and your taste around what you build and like build things that are of high quality and then share those things.

Speaker 2

你知道吗,我们收到了很多来自申请职位的人的主动联系,他们有的通过招聘页面申请,有的则通过社交媒体联系。

You know, we get a lot of inbound for from folks both applying for jobs through the careers page or also on social.

Speaker 2

这只是我个人的看法,但当有人给我写信,分享一些有趣的想法,并附上一个有趣项目的链接时,这比一份普通的简历更能引起我的注意。

And this is just me, but when someone writes to me with like some interesting thoughts and, like, a link to an interesting project, that gets my attention much more than, a normal resume does.

Speaker 1

在进入快速问答环节之前,最后几个问题。

Final questions before we do a quick fire.

Speaker 1

你之前提到了Dropbox。

You mentioned Dropbox earlier.

Speaker 1

Dropbox的校友群体非常出色。

The alumni from Dropbox is incredible.

Speaker 1

我的意思是,看到从Dropbox走出来的这么多人才,真的令人惊叹。

I mean, really, like, amazing to see the talent that's come out of Dropbox.

Speaker 1

你在Dropbox学到的最重要的一个经验是什么?这个经验如何影响了你现在在OpenAI的思维方式?

What was your single biggest lesson from Dropbox that has shaped some of your thinking now with OpenAI?

Speaker 2

哦,这个问题我都不用想。

Oh, I I don't need to think about that one.

Speaker 2

这正是我之前跟你说的那件事。

That that's kind of the thing I was telling you about earlier.

Speaker 2

对吧?

Right?

Speaker 2

当你为人们、为终端用户构建工具时,你必须把这种工具视为一种参与系统,对吧?

Like, when you're building tooling for people, like, for end users, you have to think about, like, that tooling as a system of engagement, right?

Speaker 2

如果人们不想用你的工具,如果它不能自然地成为完成某件事最简单的方式,那么人们就不会使用它。

If people don't want to use your tool, if it doesn't like naturally feel like the easiest way to get something done, then people just won't use it.

Speaker 2

我从观察Slack如何迅速火爆中学到了这一点。

Again, I learned that from watching how Slack just absolutely took off.

Speaker 2

所以现在我们在构建这些智能体时,经常会想到这一点:如果我们只是把我们的智能体做成工作流自动化,那每次想让它启动都会像拔牙一样困难,对吧?

And so I think about that a lot now, when we're building these agents, I'm like, if we build our agent surely is like, workflow automation, then it's always going to be like pulling teeth to get that thing started, right?

Speaker 2

你得雇咨询公司比如埃森哲之类的人来介入,他们还得部署全职人员,这会非常困难。

You're going to need to hire Accenture or someone to come in, they're going to deploy FTEs, it's going to be tough.

Speaker 2

但如果你能构建一个让人爱不释手的系统,即使他们只用它来完成部分任务,随着时间推移,他们会越来越熟练,也会逐渐将它与你希望集成的工具连接起来,然后你就能逐步引入自动化。

But if you can build a system that, like, people just love using, even if they only use it for partial tasks, over time, they'll get better and better at using it, and then that you'll get connected to the tools you want over time, and then you can start laddering in automation.

Speaker 2

显然,这些并不是互斥的。

Obviously, these aren't mutually exclusive.

Speaker 1

你如今该如何重振Dropbox的增长?

How on earth do you reinvigorate growth at Dropbox today?

Speaker 2

至少在我还在Dropbox的时候,我们最擅长的是桌面软件。

At least from when I was at Dropbox, the thing we were uniquely good at was desktop software.

Speaker 2

而桌面软件,说来有趣,它从未真正退出过。

And desktop software, it's funny, it was never not back.

Speaker 2

但不管怎样,它现在又火了。

But anyways, it's so back.

Speaker 2

基本上,因为如果你在解决生产力和知识工作的问题,确实,到处都有你需要连接的数据系统。

Basically, because if you're solving for productivity and knowledge work, yes, there are systems of record everywhere that you need to connect with.

Speaker 2

但归根结底,所有工作最终都发生在用户的电脑上,要么在浏览器里,要么就是本地的应用程序。

But everything at the end of the day happens on the user's computer, either in their browser or just like locally and apps on their computer.

Speaker 2

我认为,我们看到AI代理在工作中带来生产力提升的最快方式,首先是让它们直接在用户的电脑上与他们已有的工具协同工作,而无需部署FDE来设置任何东西。

I do think that the fastest way we're going to see productivity gains from agents at work is going to be at first meeting users on their computer, working with the stuff that they have available to them, you know, without having deployed FDEs to set anything up.

Speaker 2

随着时间推移,你会逐步接入这些不同的系统。

And then over time, you'll connect in these various systems.

Speaker 2

所以如果我是Dropbox,我会思考如何利用我们在构建优秀桌面软件以及计算机上协作层方面的独特专业能力,来赋能生产力代理工具。

And so if I was Dropbox, I'd be thinking about how do we leverage our unique domain expertise in like building really good like desktop software, and the sort of collaborative layer on top of your computer, how do we leverage that to enable productivity agents?

Speaker 2

这有点宽泛,但我认为这就是你要切入的角度。

It's a bit broad, but I think that's the angle you go.

Speaker 1

不,我非常喜欢这个回答,也非常感谢你的分享。

No, I love it and I really appreciate the response.

Speaker 1

在我们进入快速问答之前,最后一个问题是。

Final one before we do a quick fire, I promise.

Speaker 1

我成长于一个重视利润率的世界。

I've been brought up in a world where margin matters.

Speaker 1

软件的利润率非常可观,这也是软件成为绝佳投资领域的原因。

Software margins are wonderful and it's what makes software a brilliant category to invest in.

Speaker 1

我们正看到,在以推理为主的业务中,利润率状况变得非常不同。

We're seeing margin profiles that are very different in inference heavy plays in particular.

Speaker 1

我该多大程度上忽略这一点,相信成本会下降,令牌成本会下降,真正重要的是使用量和用户喜爱,利润自然会来,还是说利润其实极其重要?

To what extent should I put that out of mind and appreciate that costs will come down, cost of tokens will come down and actually it's about usage and customer love, margins will come or no, margins are actually freaking important.

Speaker 1

保持这个焦点。

Keep that focus.

Speaker 2

我认为两者成本都会大幅下降。

I think both costs are gonna come down significantly.

Speaker 2

而且我认为,如果今年是代理在工作中广泛部署的一年,那么今年也是它们必须连接到所有这些系统的年份。

And I also think that if this is the year of agents being deployed like broadly at work, then this is also the year where they're going to have to be connected to all these various systems.

Speaker 2

我认为这会非常具有粘性。

And I think that's going to be very sticky.

Speaker 2

所以我认为今年是一场竞赛。

And so I view this year as a race.

Speaker 2

因此,你希望赢得这场竞赛,同时暂时承受一些利润损失也是可以接受的。

And so I think you want to win that race, and you should be okay taking some hit to margin in the meantime.

Speaker 1

老兄,快问快答环节。

Dude, quick fire round.

Speaker 1

所以我先说一个简短的陈述。

So I say a short statement.

Speaker 1

你告诉我你最直接的想法。

You give me your immediate thoughts.

Speaker 1

这样可以吗?

Does that sound okay?

Speaker 1

是的。

Yeah.

Speaker 1

在过去十二个月里,你最改变主意的是什么?

What have you changed your mind on most in the last twelve months?

Speaker 2

我加入OpenAI的时候,我认为——这比十二个月前还要早一点。

When I joined OpenAI, I thought that and this is a little longer than twelve months ago.

Speaker 2

但我加入OpenAI时,以为我们都会只是和电脑屏幕共享内容。

But when I joined OpenAI, I thought that we would all just be hanging out with our computer screen sharing.

Speaker 2

但就在那之后的一年内,我们会拥有一个可以和我们对话的智能代理。

But within a year from there, you know, we'd have this agent that we're just talking to.

Speaker 2

那完全错了。

That was completely wrong.

Speaker 2

我认为多模态模型的进展速度比我预期的要慢。

I think the rate of like progress in like multimodal models was like slower than I expected.

Speaker 2

多模态指的是,比如能够处理视频和音频的模型。

Multimodal means, you know, like models that work with like video and audio.

Speaker 2

但事实上,我们发现通过代码与你的电脑交互的智能体才是正确的方向。

So instead, what happened was that we saw that like agents that work with your computer through code are the way.

Speaker 2

因此,对我来说,这彻底改变了我对如何将人工智能的好处普及给大众的看法。

And so for me, that's been a complete rethink in terms of like how we bring the benefits of AI to like just people generally.

Speaker 2

主要并不是通过视频和音频。

It's not not through video and audio primarily.

Speaker 1

你最尊重哪个不太为人所知的竞争对手?为什么?

Which lesser known competitor do you respect most and why?

Speaker 2

我第一个想到的是AMP。

The first one that came to mind was AMP.

Speaker 1

AMP。

AMP.

Speaker 2

我认为他们正在构建中。

I think they're building yeah.

Speaker 2

AMP。

AMP.

Speaker 2

它来自Sourcegraph的团队。

It's out of out of the folks at Sourcegraph.

Speaker 2

他们的产品口碑非常好,真正做到了以小博大。

Their product has a great reputation of just being, like, you know, punching way above its weight.

Speaker 2

但我真正钦佩的另一点是,他们推动了围绕智能体的整个标准化进程。

But I think the other thing that I really respect is that they helped initiate this whole like standardization around like agents.

Speaker 2

Md 和 dot agents/skills,这正是我前面提到的,让用户更容易管理他们正在尝试的各种智能体。

Md and like dot agents slash skills, which are what I was saying earlier about like, making it so that it's easier for users to manage all these different agents that they're trying.

Speaker 2

显然,我们发布了智能体。

Obviously We put out agents.

Speaker 2

M d,但他们发布了代理。

M d, but they put out agent.

Speaker 2

M d。

M d.

Speaker 2

基本上,昆恩通过发布一条推文开启了这一切,内容是:嘿。

And, basically, Quinn started this all by putting out a tweet that said, hey.

Speaker 2

如果你们购买了 agents 这个域名。

If you guys buy the domain agents.

Speaker 2

M d,我们就按照你们的拼写方式标准化。

M d, we'll standardize to your your spelling.

Speaker 2

尽管这件事很小,但它引发了这场我認為對社區來說非常棒的标准化进程。

And as small as that was, that initiated this whole standardization that I think has been awesome in the community.

Speaker 1

你认为对Anthropic广告的回应是正确的吗?

Do you think the response to Anthropix ads was the right response?

Speaker 2

我的意思是,当时有太多不同的回应了。

I mean, there were so many different responses.

Speaker 2

我听到的那个,显然觉得是正确的。

The one that I heard, obviously, think was right.

Speaker 2

我听到的说法是,一家公司对未来的看法相当消极,而另一家公司,也就是我们OpenAI,则非常积极,告诉人们可以构建属于自己的梦想。

The one that I heard was, well, one company is being pretty negative about the future and the other company, us, OpenAI, is being really positive and just telling people they can build things into Dream.

Speaker 2

我觉得这个回应太棒了。

I I thought that response was brilliant.

Speaker 1

自从加入Codex以来,你做出过最难的产品决策是什么?

What's the hardest product decision you've had to make since being at Codex?

Speaker 2

我可以告诉你我们曾经做出的最痛苦的产品决策。

Well, I can tell you the most painful product decision we had to make.

Speaker 2

有一段时间,Codex Cloud实际上是无限制的,虽然不是免费的——你需要为Traci PT付费,但使用量是无限的。

For a while, Codex Cloud was, like, effectively unlimited, not free, like you needed to pay for Traci PT, but then you had unlimited usage.

Speaker 2

我们每天维持这种无限使用状态,都知道将来要收回这个政策会越来越难。

Every day that we left it that way, we knew that would be harder to wind back at being like unlimited.

Speaker 2

但我们当时太专注于其他更具产品市场契合度的竞争点,所以就把这个决策推迟了。

But we were just so focused on competing on our other things that had more PMF that we kind of punted that decision out.

Speaker 2

当我们把无限使用限制为更合理的额度时,用户们产生了大量反弹。

When we wound back that unlimited use to some like more reasonable limit, there was a lot of blowback from users.

Speaker 2

但只有极少数用户认为所有东西都该永远免费。

And it was a very small minority of users who like thought everything should be kind of like sudo free forever.

Speaker 2

但这种反弹影响了我们方方面面,因为社交媒体上的讨论并不会区分这些细节。

But that blowback affected us everywhere because like the social chatter doesn't really distinguish between these things.

Speaker 2

我认为我从中艰难学到的教训是:你不能让东西无限免费太久。

I think the lesson I learned the hard way there is, like, you can't make things unlimited for too long.

Speaker 1

数据网站的定价和祖父条款定价真的非常棘手。

Data site pricing, grandfathering pricing is just it's such a hard thing.

Speaker 1

今天我们工程或产品上做的哪些事情,五年后你会回过头来说:天啊,我们居然做过这种事?

What do we do today in engineering or product that in five years time, you'll look back on and go, oh my God, can you believe that we did that?

Speaker 2

其中之一就是手动编辑代码。

Well, one is just editing code by hand.

Speaker 2

我想另一个可能更尖锐的是:手动管理系统的部署和监控。

I think probably another one, this is maybe spicier, but another one might even be like actually managing the deployment and monitoring of systems by hand.

Speaker 2

我觉得大型公司可能需要很长时间才能部署这项技术,但许多初创公司可能会直接基于一个全新的、完全由AI管理的栈来构建。

Like, I basically think that probably big companies will take a long time to like deploy this, but many startups might actually kind of start building on a completely new stack that's like fully AI managed.

Speaker 2

需要明确的是,这个栈目前还不存在,但一个完全由AI管理的栈将会被设计成对代理的行为提供强大的确定性约束,并能轻松回滚部署等操作。

To be clear, the stack doesn't exist yet, but a fully managed AI stack where because the basically it's been built to give you really strong deterministic guardrails over what the agent can do and like control of to like rollback deploys and everything like that.

Speaker 2

因此,未来你创办一家公司的方式可能是:先获取一个代理,然后直接让它帮你构建产品。

And so we'll get to a world where the way you start a company is you start by getting an agent and just asking it to build things.

Speaker 2

然后你会拥有更多的代理。

And then you get more agents than that.

Speaker 2

最终,你可能会把联合创始人也加入到你用来与代理协作的服务中。

And then maybe eventually you add your co founders to this service that you use to work with agents.

Speaker 2

于是,你最终可能会发现,你的主要沟通工具其实是代理沟通工具,你不再需要亲自处理那些繁琐、痛苦的CI和部署流程,而是让代理去完成这些工作。

And so you end up like, maybe your main communication tool is actually your agent communication tool, and then maybe you're not actually handholding this, like, very point painful CI and deploy process, but you're just, like, having agents do things.

Speaker 1

有个奇怪的问题,但我很好奇。

Weird question, but I'm intrigued.

Speaker 1

你是提供代理约束机制的那个人吗?

Are you the one providing agent guardrails?

Speaker 1

我的意思是,你的代理可以在企业内部任意访问。

And what I mean by that is your agents can go anywhere within an enterprise.

Speaker 1

那么,提供这些安全限制的是你,还是有第三方服务商说,‘亚历克斯,你不能进入那个区域,那是人力资源部’?

Are you responsible providing those guardrails or is there a third party matter provider who is saying, hey, Alex, you can't go into that, that's human resources.

Speaker 1

或者你不能进入那个区域,那是市场部。

Or you can't go into that, that's marketing.

Speaker 1

你怎么看待安全限制的提供方式?

How do you think about guardrail provisioning?

Speaker 1

这是代理提供商的责任,还是第三方提供商的责任?

And is that the role of the agent provider or a third party provider?

Speaker 2

我认为我们可能会看到两种方式并存。

I think we'll probably see both.

Speaker 2

就像我说的,我们正在为代理安全限制投入大量精力。

Like we are putting a lot of effort into agent guardrails.

Speaker 2

就像我说的,我们几乎是唯一一家重视为编码代理提供操作系统级沙箱隔离的公司。

Like I said, we have, we're basically the only company that cares about OS level sandboxing for coding agents.

Speaker 2

例如,Windows 上目前没有任何这样的方案。

For instance, there's none that exists on Windows.

Speaker 2

我们正在开发这个功能。

We're the ones building that.

Speaker 2

而且我们是以开源的方式进行的,希望其他人也能使用。

And we're doing it in open source, hopefully other people can use it.

Speaker 2

我们对此进行了大量思考。

We think about that a lot.

Speaker 2

ChatHPT 支持连接器。

ChatHPT supports connectors.

Speaker 2

所以,你可以与你的 Google 文档之类的东西进行交互。

So, you know, you can talk to your, like, Google Docs or something.

Speaker 2

我们投入了大量精力来设置代理在操作你的 Google 文档时的权限限制。

And we put a lot of effort into guardrails around what the agent can do with your Google Docs.

Speaker 2

这只是两个例子。

Those are just two examples.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客