AI and I - 如何构建原生智能体产品 | 迈克·克里格 封面

如何构建原生智能体产品 | 迈克·克里格

How to Build an Agent-native Product | Mike Krieger

本集简介

迈克·克里格作为Instagram的联合创始人,打造了过去二十年最具影响力的消费类应用之一。如今,他作为Anthropic Labs的联合负责人,正站在探索什么是突破性AI原生产品的前沿。 丹·希珀与克里格就Every的《AI与我》节目展开对话,探讨了他创建Instagram的经历如何塑造了他对AI开发的思考,包括哪些环节可以加速,哪些仍难以绕开时间成本。 如果你觉得这期节目有趣,请点赞、订阅、评论并分享! 收听更多丹·希珀的内容: 订阅Every:https://every.to/subscribe 在X上关注他:https://twitter.com/danshipper 免费下载Grammarly:grammarly.com 时间戳 引言:00:01:39 AI时代产品开发中哪些变容易了,哪些仍困难:00:02:33 为何“氛围编码”会造出“室内树木”:00:05:00 重写如何成为开发流程中的常态:00:09:00 “智能体原生”产品设计的含义:00:11:39 迈克的实验室团队结构与联合创始人模式:00:24:27 产品押注的最佳信号是有人具备“突破墙壁”的坚定信念:00:29:33 在快速变化的AI环境中应对企业客户:00:38:51 OpenClaw、个人智能体,以及定义2026年的产品问题:00:40:54 本集中提及的资源链接: 迈克·克里格:https://x.com/mikeyk 智能体原生架构:https://every.to/guides/agent-native

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

现在的模型在添加功能方面很擅长。

The models today are good at adding features.

Speaker 0

但它们不一定擅长判断该从产品中去掉什么。

They're not necessarily good about figuring out what to cut out of the product.

Speaker 0

你可以在几个小时内迅速实现从零到n,而不是从零到一。

You can get it to go zero not at zero to one, but zero to n pretty quickly over the matter of hours.

Speaker 0

在这个过程中它做出了很多决策。

It's made a lot of decisions along the way.

Speaker 0

你对哪些东西该加入其中所形成的直觉,是随着时间积累起来的。

And some of the sort of intuitions you build about what are the right things to put in there, think you build over time.

Speaker 0

我觉得这就是2026年软件设计的艺术与科学。

I feel like that is the art and science of software design in 2026.

Speaker 1

工作进展很快。

Work moves fast.

Speaker 1

在人工智能时代,压力不仅仅是加快速度。

And in the age of AI, the pressure isn't just to move faster.

Speaker 1

它的目的是确保你发送的内容听起来就像你自己。

It's to make sure that what you send actually sounds like you.

Speaker 1

从邮件到提案再到利益相关者更新,泛泛而谈的 Rush 完全不够用。

From emails to proposals to stakeholder updates, generic in Rush just doesn't cut it.

Speaker 1

如果你曾经盯着空白页面,明明知道自己想说什么,却不知道如何下笔,Grammarly 能解决这个问题。

If you've ever stared at a blank page knowing exactly what you want to say but not how start, Grammarly fixes that.

Speaker 1

Grammarly 为你提供了一个统一的地方来思考、撰写和完成工作:在你原本就写作的地方写作。

Grammarly gives you one place to think, write, and finish your work: write where you already write.

Speaker 1

大多数 AI 工具要么完全接管,要么完全不参与。

Most AI tools either take over or stay out of the way.

Speaker 1

Grammarly 两者都不做。

Grammarly does neither.

Speaker 1

它能帮你打破空白页面的困境,调整语气,让信息准确传达给特定的读者,并无缝集成在你正在使用的 50 多万个应用程序和网站中。

It helps you break the blank page, adjust your tone so a message lands right for the specific person reading it, and works seamlessly across more than 500,000 apps and sites that you're already using.

Speaker 1

它内置了针对你工作流程每个环节的智能代理,90% 的专业人士表示它帮他们节省了时间。

It's loaded with agents built for every step of your process, and 90% professionals say it saved them time.

Speaker 1

93%的人表示它帮助他们完成了更多工作。

93 say it helps them get more done.

Speaker 1

这是与你协作而非取代你的AI。

This is AI that works with you, not over you.

Speaker 1

在这个千篇一律的AI世界里,别变得和别人一样。

In a world of generic AI, don't sound like everyone else.

Speaker 1

使用Grammarly,你永远不会如此。

With Grammarly, you never will.

Speaker 1

前往grammarly.com免费下载Grammarly。

Download Grammarly for free at grammarly.com.

Speaker 1

就是grammarly.com。

That's grammarly.com.

Speaker 1

迈克,欢迎来到节目。

Mike, welcome to the show.

Speaker 0

很高兴能来这里。

Great to be here.

Speaker 0

谢谢邀请我参加。

Thanks for having me on.

Speaker 1

很高兴有你来。

Great to have you.

Speaker 1

我超级兴奋。

I I'm super excited.

Speaker 1

对于还不了解的人,你是Instagram的联合创始人,现在你在Anthropic和Anthropic Labs工作。

For people who don't know, you are the cofounder of Instagram, and now you are at Anthropic at Anthropic Labs.

Speaker 1

我一直远远地钦佩你在Anthropic和Instagram的工作,你显然处于构建产品和人工智能的前沿。

I'm, you know, I've admired your work from afar both at Anthropic and Instagram for a really long time and you're obviously at the forefront of building products and AI.

Speaker 1

非常感谢你来参加。

So thank you for coming on.

Speaker 1

当然。

Absolutely.

Speaker 1

我们从哪里开始?

Where should we start?

Speaker 1

就像我们刚才在前期制作中讨论的那样,随着我们构建产品所依赖的基础技术或流程发生了彻底变化,哪些方面变得更容易了,哪些变得更难了,或者可能保持不变?

Like, what what we were talking about just now in the in the preproduction is what has gotten easier and what has gotten harder or stayed let maybe stayed the same in product building as we've come as as the underlying substrate or the process by which we build products has changed completely.

Speaker 1

所以,能跟我谈谈你现在的经历,与早期在Anthropic和Instagram时相比,你觉得这些事情是如何变化的吗?

So, like, tell me about your experience now versus, you know, earlier Anthropic versus Instagram and and how you think things are changing.

Speaker 0

是的。

Yeah.

Speaker 0

几周前我做了一个思维练习,我们知道Instagram的故事,我们还做过另一个产品叫Bourbon。

I was doing the thought exercise a couple weeks ago of, you know, we know the Instagram story, we had another product called Bourbon.

Speaker 0

我们为它工作了将近一年。

We worked on that for almost a year.

Speaker 0

但它没有成功。

It wasn't working.

Speaker 0

我们进行了转型。

We pivoted.

Speaker 0

我们花了大约三个月时间构建了后来成为Instagram的产品,发布后迅速扩大了规模。

We basically spent three months building what became Instagram, launched it, and then scaled it.

Speaker 0

所以问题来了,现在什么变得微不足道了,而哪些东西其实是构建过程中固有的、并不会变得更简单?

And so asking the question, like, what is now trivial, and what was actually inherent in that building process that doesn't get easier?

Speaker 0

对吧?

Right?

Speaker 0

那一年,我们本可以更快地遇到后来那些死胡同,但到达那里本身也有价值。

And that year, we probably could have hit some of the dead ends we had eventually hit sooner, but there was value in getting there too.

Speaker 0

对吧?

Right?

Speaker 0

我们把产品设计得过于复杂,以至于后来不得不简化它。

Like, we overcomplicated the product so that we then had to simplify it.

Speaker 0

我发现,即使是现在的模型,也很擅长添加功能。

I find even the models today are good at adding features.

Speaker 0

但它们并不一定擅长判断该从产品中去掉哪些功能,这需要大量实际的用户使用经验才能明白。

They're not necessarily good about figuring out what to cut out of the of the product, and that took a lot of just sort of, you know, hitting actual actual real world usage.

Speaker 0

而现在,逐步添加功能的过程本身就有某种特点。

And there was something about the process of incrementally adding things right now.

Speaker 0

我的意思是,今天,尤其是我们正在构建的一些实验室项目,你可以在几个小时内快速实现从零到n,而不是从零到一。

I mean, today, especially some of stuff we're building labs, like, you can get it to go zero not at zero to one, but zero to n pretty quickly over the matter of hours.

Speaker 0

但在这个过程中它做出了很多决定,而且你可以让它跟进你并进行输入。

But it's made a lot of decisions along the way, and, yeah, you can ask it to follow-up with you and and then do input.

Speaker 0

但关于哪些东西该放进去的那些直觉,我认为是随着时间积累起来的。

But some of the sort of intuitions you build about what are the right things to to put in there, I think you build over time.

Speaker 0

所以我一直在反思,即使在AI快速发展的时代,也几乎没有出现突破性的消费产品,我认为部分原因在于,要真正明确你想对世界做出什么样的干预,然后从那里开始构建,仍然需要时间。

And so I I've been reflecting, like, there haven't been a lot of breakout consumer products even in the age of accelerated AI building, and I think part of it is because it just still takes time to sort of hone your view about what sort of intervention you wanna make on the world and then build from there.

Speaker 0

当然,一旦你知道要构建什么,实际的开发过程就容易多了。

Now the actual building part, once you know what to build, is of course so much easier.

Speaker 0

我让Claude基本上重新构建了Bourbon。

I I had Claude basically rebuild Bourbon.

Speaker 0

花了大约两个小时。

It took about two hours.

Speaker 0

它已经功能完备了。

It was feature complete.

Speaker 0

它添加了Bourbon没有的过滤器。

It added filters, which Bourbon didn't have.

Speaker 0

我们为Instagram添加了这些功能,但我想它知道,你知道,它知道这些产品未来的方向,所以决定内置这些功能。

We added those for Instagram, but I think it knew, you know, it it knew the eventual future of the products have decided to build that in.

Speaker 0

所以我觉得这部分感觉真的很不一样。

So I think that that part feels feels really different.

Speaker 0

但我记得有一周,凯文独自去为Instagram v1开发了所有过滤器。

But I think there's also, you know, I remember those a week where Kevin went off and built all the filters for Instagram v one.

Speaker 0

我则去开发了应用的其他部分。

I went off and build, like, sort of the rest of the app.

Speaker 0

那时候,我会熬夜到凌晨四点,然后睡到中午。

And, you know, sitting there, I would stay up till 4AM and then sleep till noon.

Speaker 0

这就像我自然的昼夜作息。

That's like my natural day night cycle.

Speaker 0

在这个过程中,你会做出很多决定,比如位置功能应该怎么设计?

And, like, in that process, you're making so many decisions like how should location work?

Speaker 0

我们得找到一种方法,在加速开发的同时,帮助人们在过程中逐步建立对这些决策的直觉。

How and you know, it's we gotta find a way of accelerating building while still sort of helping people build intuition of those decisions along the way.

Speaker 0

否则,我认为你最终只会得到非常通用的产品,这些产品很难脱颖而出,或者根本无法反映你对自身领域或产品所形成的深层直觉。

Because otherwise, I think you either get just get very generic products that are unlikely to break out or ones that just don't reflect some deeper intuition that you come to about your space or your product.

Speaker 1

这太棒了。

This is great.

Speaker 1

我非常喜欢这一点。

I love this.

Speaker 1

这让我想到了两件事。

It's making me think of two things.

Speaker 1

第一点是,我脑子里有个小想法:如果你把一棵树养在室内,不让它接触风,它就不会长得强壮,因为树在生长过程中需要各种外力前后推拉,才能长成真正的树。

One is I have this, like, little thing in my head that if you grow a tree without it like, with it being indoors, without it being exposed to wind, it doesn't get as strong as because as it's growing, it needs all these forces pushing it, like, back and forth in order to, like, make a a real tree.

Speaker 1

所以,如果你把它养在室内不接触风,它虽然也会生长,但会歪斜、脆弱,完全不是一回事。

And so if you if you have it indoors without wind, it you're gonna grow a tree, but it, like, leans and it gets all and it's not as strong and it's not it's not the same thing.

Speaker 1

我觉得你这里说的正是这一点:因为我们极大地加速了开发节奏,原本那种循序渐进、每一步都让用户参与体验的过程,现在却像是在室内完全封闭地长出了一整棵树,结果整个产品缺乏每一步应有的直觉积累和经验沉淀,也就无法打造出真正优秀的产品。

And I think there's something that you're saying here where because we've accelerated the pace of development so drastically, what what would normally be this sort of incremental thing where you're you're doing things one at a time and then you're exposing it to users, you can actually kind of grow an entire tree indoors, and then you have this, like, whole thing that you're just like it doesn't have the same level of intuition and exposure to experience at each step that that creates a a great product.

Speaker 1

是那样吗,是那样吗,是那样吗

Is that is that is that

Speaker 0

我喜欢这个说法。

I love that.

Speaker 0

我也非常喜欢这个比喻。

I I I I love that metaphor too.

Speaker 0

当我们刚开始做Instagram的时候,我们非常推崇埃里克·莱斯和精益创业,还有整个‘你根本不需要它’的原则。

We you know, when we were starting Instagram, we had this we're very into, like, Eric Reese and Lean Startup and the whole, like, yag me, like, you ain't gonna need it principle.

Speaker 0

我发现,甚至最近在实验室里我做的一个项目,我们在还没进入早期测试阶段之前就过度开发了第一个版本,因为我们可以这么做。

And I have found and actually, even one of things I was working on in labs recently, we way overbuilt for v one before we even got to early access because you can.

Speaker 0

然后你会想,哦,我们有这个功能。

And you're like, oh, well, we have this option.

Speaker 0

为什么不也加上这个呢?

Why not add this one as well?

Speaker 0

这就像多做了一点额外的工作。

That's like that's a PR of work.

Speaker 0

如果你在云代码中有了非常好的流程,你就知道,事情会自动触发。

And if you get a really good flow in Cloud Code, you know, you're firing things off.

Speaker 0

你去吃午饭。

You're going to lunch.

Speaker 0

你回来的时候。

You're coming back.

Speaker 0

事情已经完成了。

The thing is done.

Speaker 0

你会觉得,太棒了。

You're like, great.

Speaker 0

我们把它加上了。

We added it.

Speaker 0

我们意识到,我们创建了一套功能矩阵,这在发布前甚至向别人解释时,都很难测试和维护。

And the thing we realized was we created this sort of matrix of functionality that was actually quite hard to test and and keep up with right before launch or even to explain to people.

Speaker 0

就像,他们来了。

Like, they're arriving.

Speaker 0

有人给我打过一个比方,我特别喜欢,就是看电视剧时,一集一集地看,完全不认识角色,和突然被扔进最后一集,然后你心想:等等。

The metaphor somebody else gave me, which I really like, is the difference between sort of getting episode by episode, getting no characters in the TV show versus imagine, like, you're thrown into the final episode and you're like, wait.

Speaker 0

这些都什么啊?

What are all these things?

Speaker 0

这些人都谁啊?

And who are all these people?

Speaker 0

而且,你知道,我已经默认自己得掌握所有这些背景信息。

And, like, I already, you know, I'm expected to have all of this context.

Speaker 0

我觉得在长期开发产品时,也有类似的感受。

Think there's the same kind of feeling around, like, developing something over time.

Speaker 0

但我觉得‘树’这个比喻也很贴切。

But I the tree metaphor, I think, sticks too as well.

Speaker 0

所以,直接给人看一棵已经长成的树,也是一下子信息量太大了。

And so, like, showing somebody the fully formed tree is also kind of a lot all at once.

Speaker 0

我认为,如今在打造产品时,如何保持简洁,确实是个值得思考的问题——不是因为能做到,就一定得在第一个版本里全加上。

And I I think there's there's there's definitely something there in how do you build product these days and still keep it simple and not just because you can doesn't necessarily mean that it should be in at least the first version.

Speaker 1

我也有同样的问题,因为直到凌晨四点,我还在调试和修复我私下开发的这款应用,叫做Proof,这是一个面向代理的、协作式的营销编辑器。

I'm having the same problem because, you know, I was I was literally up until 4AM debugging and and fixing this app that I made, like, on the side at Every called Proof, which is a agent native, collaborative, marketing editor.

Speaker 1

你可以快速与团队或其他代理共享计划文档等,它还有个小的在线状态功能,非常有趣。

So you can, like, share share really quick plan docs and stuff with your team or with other agents and you have a little presence and it's really fun.

Speaker 1

这是我在端到端完整产品上的第二或第三次迭代,现在能这样做真的很有意思。

And this is, like, my second or third iteration of the full product end to end, which is really interesting that you can do now.

Speaker 1

但前几次迭代时,我发现因为氛围编程太有趣、太上瘾了,我总是忍不住想:是的。

But the first couple iterations, I just found myself because vibe coding is so fun and so addictive, I just found myself being like, yeah.

Speaker 1

我会做这个,再做那个。

Like, I'll do this and I'll do this.

Speaker 1

结果就搞出了一个庞大又难用的怪物。

And, like and it just created this monstrosity that wasn't that good to wasn't that good to use.

Speaker 1

我从另一个叫Monologue的产品中获得了很大启发,不知道你有没有用过。

And I got really inspired by we have another product called monologue, I'm not sure if you've run into or not.

Speaker 1

我深受Monologue的启发,这是一个由Jim Navin开发的极简语音转文字应用,他一心只专注于把一件事做得极致好。

But I got really inspired by monologue, which is a really simple speech to text app run by, Jim Navin who he's just so focused on making one simple thing work so well.

Speaker 1

我看到在这个任何人都能做出产品的时代,那种极其精致、专注于做好一件事的产品有多成功。

And I saw how well that works in this age of just like anyone can make a product is like something that's super polished and just super good at what it does.

Speaker 1

所以我干脆把整个产品扔掉,重新开始,做了一个非常简单的东西——只是一个可共享的 Markdown 链接。

And so I just basically threw out the product and started over with this very simple, like it's just a shareable markdown link.

Speaker 1

然后这个东西就开始在 Every 内部病毒式传播,每个人都开始频繁使用它。

And that then just, like, started growing virally inside of every like, everyone started using it all the time.

Speaker 1

后来我们正式上线,结果一下子爆火了。

And then now I we launched it, and it just blew up.

Speaker 1

所以我昨晚整晚都没睡,拼命修复它,心里想着:我年纪大了,受不了这种事了。

And so I spent all last night, like, not sleeping trying to fix it and being like, I'm too old for this shit.

Speaker 1

我真的不能再这样下去了。

I can't I can't be doing this anymore.

Speaker 1

因为它让我想起了自己二十多岁、还在上大学时,疯狂写代码的日子,虽然很有趣,但也让人精疲力尽。

Because it just reminded me of, like, being in my twenties or, like, being in college and, like, hacking on stuff and whatever, which is fun, but also exhausting.

Speaker 1

所以,是的,我意识到自己必须彻底调整心态,因为现在能做的事情实在太多了。

And so, yeah, I've I've found that I've had to really modify my psychology because so much is possible.

Speaker 1

你是怎么应对这种情况的?

How are you dealing with that?

Speaker 0

是的。

Yeah.

Speaker 0

顺便提一下,我记得在Bourbon项目上,我们最大的错误是随着时间推移不断添加功能,而不是删除功能。

And just as a brief aside on that, I remember with Bourbon, our biggest mistake was adding functionality over time rather than deleting it.

Speaker 0

对吧?

Right?

Speaker 0

我们不能啊,你知道的,八个功能并不能造就一个好产品。

And we couldn't oh, you know, eight features doesn't make for a good product.

Speaker 0

也许第九个功能可以,但结果只是让产品显得特别复杂。

Maybe the ninth one will, and instead it just made for, you know, something that felt really complicated.

Speaker 0

我的意思是,我们应对这个问题的另一种方式其实是更愿意进行重写,就像经典的《人月神话》里弗雷德·布鲁克斯说的那样。

I mean, think a couple of things are also like, part of how we're dealing with it is actually being more willing to do rewrites, you know, like classic, you know, Fred Brooks mythical man month.

Speaker 0

你不应该重写软件,因为第一版中蕴含的所有东西,你重写时都会搞砸

Like, you you you shouldn't rewrite software because all the things that were imbued in v one, you're gonna mess up and will

Speaker 1

看到事情。

see things.

Speaker 1

是的。

Yep.

Speaker 0

是的。

Yeah.

Speaker 0

没错。

Exactly.

Speaker 0

还有整个第二系统综合症。

And that whole second system syndrome.

Speaker 0

这其中仍然有很多道理,但一方面,这些模型能帮你做差异对比,基本上能看出你有没有漏掉第一版中的任何内容。

And there is still a lot of truth to that, but one, you know, the models can help you sort of diff and basically see, did you miss anything that was in that first one?

Speaker 0

另一方面,现在不再像过去那样,需要进行可能毁掉一家公司的长达一年的重写,比如著名的网景浏览器。

But second, it's just it's no longer you're not, like, talking about a year long rewrite that might have killed a company like, you know, Famous, like Netscape.

Speaker 0

这些重写现在可能只需要几天时间,尤其是基于某个特定的源代码。

Like, these are, like, days probably, especially off a given source.

Speaker 0

所以我们实际上已经启动过好几个项目,通常是在上线前,很少在上线后,但至少在上线前,我们会构建出完整的版本,然后意识到我们过于复杂了,或者做出了某些核心假设,于是推倒重来,做第二版,再从那里继续迭代。

So we've actually had several initiatives, like, usually prelaunch, rarely postlaunch, but at least prelaunch, like, have built the full blown thing, realized we've overcomplicated or made some kind of core assumption, and then, like, tore it down, done a v two, and then and then iterate on it from there.

Speaker 0

所以这并不让我惊讶,你也必须做类似的事情,但感觉没那么痛苦。

So it doesn't surprise me that that's become sort of part of what you've had to do as well, but it doesn't feel as painful.

Speaker 0

你不会觉得,哦,要花一年时间来构建这个东西。

You're not like, oh, like, a year of building this thing.

Speaker 0

而是觉得,哦,那只是上周的事,这周我又可以重做一遍,还能删掉很多之前的内容。

It's like, oh, that was last week, and then I get to do it this week, and I get to cut out a lot of a lot of what was there as well.

Speaker 0

我认为从功能层面和产品开发的角度来看,我们正在学会更早地发布产品,但这确实需要在平衡中进行,因为我们的规模已经扩大了。

I think functionality wise and how we're dealing with it from a product development standpoint, I think we are learning to launch earlier, and it's definitely a balance around, you know, we've grown.

Speaker 0

我们已经有了强大的企业用户基础。

We have, like, a strong enterprise footprint.

Speaker 0

人们对初始版本有各种期望,但我们不会假设在上线前就能知道所有需要添加的连接器或功能,因为人们依然会给我们带来惊喜。

People have expectations about, like, what the initial version is, but not assuming that we're going to know what every connector or everything that we need to add to the product is ahead of launch, because people still will absolutely surprise us.

Speaker 0

对吧?

Right?

Speaker 0

我们有一群忠实的用户,我们称他们为‘蚂蚁食物’,因为我们是Anthropic的蚂蚁。

We're we have a strong contingent of we call them ant fooders because we're ants at Anthropic.

Speaker 0

但仅靠这些远远不够,你还需要真实的现实接触。

But that only that only gets you so far before you need that that real world contact.

Speaker 0

比如,以协同办公为例。

Like, take co work, for example.

Speaker 0

我们很久以来一直在琢磨类似的产品形态。

We'd been noodling on a product of that shape for a long time.

Speaker 0

但当我们决定:不,

And then once we decided, no.

Speaker 0

让我们把它发布出去吧。

Let's get this out.

Speaker 0

让我们真正地、以最简方式构建出我们认为能解决这个问题的V1版本,并在十天内推出,这真是一个巨大的推动力。

Let's actually, you know, build the build the v one that we think solves the problem in most minimal way possible and get that out in ten days was really a good push around.

Speaker 0

是的。

Yes.

Speaker 0

V1 本可以或应该包含一百个功能,但它没有。

There are a 100 things that v one should or could have had, but it didn't.

Speaker 0

但与此同时,它已经足够有用,足以验证某些想法。

And at the same time, it was it was useful enough to prove something out there.

Speaker 0

我不确定再花两个月开发,增加五十个功能会更有用。

And I'm not sure developing it for another two months adding, you know, 50 features would have been more useful.

Speaker 0

事实上,我们可能只会一直在室内构建树,然后是第二个行为用途,但实际上,没人想这么做。

In fact, we probably would have been building in a the indoor tree would have been getting built, and then the second behavioral use, it's like, actually, nobody wants to do that.

Speaker 0

他们想做的是另一部分。

They wanna do this this other piece.

Speaker 0

所以我认为,原始精益创业理念中的直觉依然存在。

So I think that piece of that again, there there's like the intuitions of the original Lean Startup ideas are still here.

Speaker 0

只是它们以不同的时间尺度和方式体现出来。

It's just they manifest at different timescale and in a different way.

Speaker 1

我非常好奇你想如何理解产品设计以及产品应该如何运作,因为任何在 Every 工作的人都会告诉你,我提到最多的词是‘必须是代理原生的’。

I'm really curious to hear how you think about product design and how products should work because the I've been anyone at Every will tell you the the the the phrase that I use the most or the word that I use the most about the software we build is it has to be agent native.

Speaker 1

所以代理必须能够像用户在应用中能做的任何事情一样去做。

So agents have to be able to, like, use it as anything that an agent a user can do in the in the app, the the agent can do.

Speaker 1

还有一些其他关于代理原生的小原则,但我基本上是从你们那里借鉴来的。

There's a couple other, like, little principles of being agent native, but I basically stole that from you guys.

Speaker 1

我觉得Claude Code就是那个教给我这类产品如何高效运作的典范——它是一个代理,能像你一样在你的电脑上做任何事情,而且可定制、灵活、可扩展。

Like, I think that Claude code is the canonical thing that taught me about how that kind of product can work so well where it's like, it's an agent, it can do anything on your computer that you can do and it's customizable and flexible and extensible.

Speaker 1

所以它很容易上手,但也能完成设计师最初根本没想到的各种意想不到的事情。

So it's easy to start, but it can do all sorts of unexpected things that the designers didn't really like think about beforehand.

Speaker 1

我认为这是产品开发和人工智能的一个绝佳模式。

And I think that that's such a good model for a product development and AI.

Speaker 1

我很好奇,这其实只是我通过观察你们的做法,然后加入了自己的理解。

And I'm kind of curious like, this is just sort of what I've cribbed from watching what you guys do and then like kind of put my own spin on.

Speaker 1

但你们是怎么看待这个问题的?你们是如何谈论打造这样的产品的?

But how how do you think about it, and how do you how do you talk about making products like that?

Speaker 0

是的。

Yeah.

Speaker 0

这里面内容太多了。

That's so much in here.

Speaker 0

我特别喜欢你们写的关于原生智能体的那篇文章。

And I love the agent native write up you all did.

Speaker 0

对我来说,这是对这一理念的典范性探索。

It's like, to me, the canonical exploration of this.

Speaker 0

谢谢你们以如此清晰的方式把这些想法表达出来。

So thanks for, like, putting that those ideas out in a in a really in a very clear way.

Speaker 0

我觉得这里有几条线索可以深入探讨。

So I think a few threads to pull on this.

Speaker 0

其中一个是我最近和某人的一次对话,他们说,你们都是非技术人员。

One is a conversation I had with somebody recently where they said, you know, like, you all they're a nontechnical person.

Speaker 0

他们说,你们在谈论智能体、这些东西。

They're like, you are talking about, like, agents, all this stuff.

Speaker 0

实际上,他们觉得现在的电脑本来就能正常工作。

Like, they're just like, actually, computers just work now.

Speaker 0

我一直希望电脑能正常工作,它们以前不工作,现在终于能工作了。

I always wanted computers to work, they didn't work, and now they work.

Speaker 0

这真是件挺有趣的事:过去你需要知道复杂的命令才能通过命令行安装东西,但没人会去这么做,而现在Claude可以帮你完成,因此电脑现在感觉就像一个与你并肩协作的工具。

And it's just sort of funny thing where if you knew the incantations to properly get on the command line and brew install the thing that, like, nobody is gonna do that, but now Claude can do it for you, and therefore, like, the computer now feels like a a tool that is alongside you.

Speaker 0

我认为这个核心洞察不仅仅是为新软件增加更多功能和算力。

And I think that's that that core insight is it's more than even just adding power and functionality to new software.

Speaker 0

它还在于解锁了那些本应存在或可用、却一直让人觉得极其困难的功能。

It's also just unlocking the functionality that always should have been there or available and just felt, like, extremely hard for people.

Speaker 0

所以,这可能是第一个想法。

So that's, like, maybe thought number one.

Speaker 0

第二个想法是,比较我们那些做得好和做得不好的产品。

Thought two is actually comparing our products that do this well versus not.

Speaker 0

我认为Cloud Code做得不错。

I think Cloud Code does it well.

Speaker 0

我认为Cloud AI还需要大幅改进。

I think Cloud AI still needs to evolve a lot.

Speaker 0

举个例子,我曾看到有人使用Cloud,他们在某个项目中创建了一个工件或新文档。

So as an example, I was watching somebody use Cloud and they were in a project and they had built think, an artifact or or a new document.

Speaker 0

然后他们说,很好。

And they said, great.

Speaker 0

你能把这个添加到我的项目知识中吗?

Can you add this to my project knowledge?

Speaker 0

Cloud回答说,好的。

And Cloud's like, yeah.

Speaker 0

让我告诉你如何将它添加到我的项目知识中。

Let me tell you the steps to go add it to my project knowledge.

Speaker 0

我说,不用了。

Was like, no.

Speaker 0

这应该是一个它能原生支持的功能。

That should just be a thing that it can do really natively.

Speaker 0

因此,我认为即使在这样一个2024年推出、经过多次迭代和演进的产品中,依然没有从一开始就内建这样一个理念:它的每一个基本操作都应当具备相关知识并能进行修改。

And so I think even in that, you see a product that was a 2024 product that has been iterated on and evolved a lot, but still, I don't think has been baked in from the very beginning, the idea that every single one of its primitives, it should have knowledge about and the ability to modify.

Speaker 0

我认为这在当今的产品中是至关重要的。

And I think that's essential in products these days.

Speaker 0

我认为Cloud Code是2025年版本的这种理念,当你看到一些人用它进行实验时,还能发现更多层面的东西。

And I think Cloud Code is the 2025 vintage of that, and I think there's even further aspects of it when you see what some of the the harnesses that folks are experimenting with with it.

Speaker 0

它甚至能够修改这些工具本身。

It can actually sort of modify the harness itself.

Speaker 0

这开始触及到下一个层次,可能对大多数人来说有点深奥。

That starts getting to the next maybe level of that where, you know, it's probably esoteric for most people.

Speaker 0

但即使只是解锁这一功能,也意味着你不必坐在那里想:‘唉,要是它能稍微不一样地做就好了。’

But even unlocking that functionality means that you don't have to sit there and be like, oh, I I wish it did this a little bit differently.

Speaker 0

比如,与其只是要求Gmail这样那样,不如希望它能以稍微不同的方式运行。

You know, I wish Gmail worked in this slightly different way instead of just asking it to.

Speaker 0

我认为这感觉像是下一个重大突破。

And and I think that that feels like the big next step.

Speaker 0

但即使在Cloud Code内部,仅仅教会Cloud Code关于它自己的知识,也是一次非常有价值的体验。

But even within, like, Cloud Code, just teaching Cloud Code about Cloud Code was a a really valuable experience.

Speaker 0

我当时就想,这确实相关。

I was I you'll like, this definitely relates.

Speaker 0

现在变得非常循环且元层面了,但请耐心听我说。

This is now getting very circular meta, but bear with me.

Speaker 0

我非常喜欢你关于原生智能体的那篇文章,我当时就想,我要把这个做成一个技能。

I I loved your write up on agent native, and I was like, I want this as a skill.

Speaker 0

所以每当我进行原型设计时,都会以原生智能体的方式思考。

So whenever I'm prototyping something, it thinks in an agent native way.

Speaker 0

所以我不得不把它打包成一个技能,整个过程就是,嘿,Cloud 和 Cloud Code。

So I had to package it up as a skill, and that whole process was, you know, hey, Cloud and Cloud Code.

Speaker 0

你知道吗,你能为这个创建一个技能吗?

I you know, can you create a skill for this?

Speaker 0

当然可以。

Like, sure.

Speaker 0

我正在看我的技能技能。

I'm looking at my skill skill.

Speaker 0

我要为它创建一个技能。

I'm gonna create a skill about it.

Speaker 0

我要安装它。

I'm gonna install it.

Speaker 0

我觉得太好了。

I'm like, great.

Speaker 0

现在有这个功能吗?我需要重新加载吗?

Is that available now, do I need to reload?

Speaker 0

它说,对。

It said, right.

Speaker 0

我觉得你需要重启一下。

I think you need to restart it.

Speaker 0

我查一下。

Let me check.

Speaker 0

没错。

Yep.

Speaker 0

你需要。

You do.

Speaker 0

好的。

Alright.

Speaker 0

那很好。

That's good.

Speaker 0

而且尽管所有内容都具备自我认知,这同样解锁了其中的大量能力,或许这就是最后需要撬动的关键点。

And although everything was, it has knowledge about itself, and that unlocks so much capability in there as well, which maybe is like the last thread to pull on.

Speaker 0

我认为这些话题每一个都足以展开一小时的讨论,这也是我们在实验室里深入思考的问题:如何让克劳德构建的软件更具克劳德意识,甚至从一开始就具备克劳德代理原生的思维模式?

I think all of these could be hour long conversations, which is I think and one of the things that we're really thinking about in labs is how do you imbue the software that Claude builds to be more Claude aware and even just Claude agent native sort of building awareness that even thinks to build in that way to start with?

Speaker 0

因为至今为止,部分原因在于数十年的软件都不是这样的。

Because it still won't partially because decades of software is not that.

Speaker 0

对吧?

Right?

Speaker 0

那么,你如何让新软件从一开始就内建这一原则?

So how do you get new software to have that principle baked in?

Speaker 1

这正是我正要问你的事情。

That's the thing I was about to ask you about.

Speaker 1

比如,我非常荣幸你读了这份文档并用它做了个技能。

Like so, a, I'm super I I'm super honored that you're you that you read the write up and you're using you made a skill for it.

Speaker 1

这太棒了。

That's that's amazing.

Speaker 1

其次,没错,你指出了一个我确实发现的问题,我认为云模型在这方面是最合适的。

And b, like, yeah, you're pointing to a real problem that I found is I think actually cloud models are the best for this.

Speaker 1

比如,Codex 模型通常不擅长构建原生智能体,因为它们本质上是通用模型,除非你主动引导,否则它们会像传统工程师那样思考。

Like, a codex model generally is not as good at building an agent native because they're models in general, unless you push them, they think like traditional engineers.

Speaker 1

而这涉及一套完全不同的东西,比如你需要设置防护机制和测试。

And that's a whole different set of, you know, you wanna have guardrails and tests.

Speaker 1

你希望确保用户只能走一条路径,而不是创建一个极其灵活的可扩展系统。

So you wanna make sure that there's like one path the user can go down versus for creating this extensible thing that's super flexible.

Speaker 1

所以,你是怎么设计你的产品,来训练模型和配套工具,让它们学会以这种方式思考和工作的?

So, yeah, how how are you how how do you how do you architect your product to teach the models and the harness that you teach the models to think and work in this way?

Speaker 0

是的。

Yeah.

Speaker 0

我认为这包含两个部分。

I think there's two parts to it.

Speaker 0

一个是比较平凡的部分,第二个部分我认为在开发中更有趣。

One is the more sort of mundane part, and the second one, I think, is the one that's more sort of interesting in developing.

Speaker 0

第一个是,即使在模型构建过程中提供良好的模式和范式,也已经非常有价值。

The first one is, like, even just having good patterns and paradigms available to the model while it builds has been really valuable.

Speaker 0

找到模板化与技能化之间的恰当平衡,以及这个平衡点到底在哪里。

And finding the right balance of templatized to skillified, right, and what that that right balance is.

Speaker 0

但我们现在拥有的一个技能是关于云API的,听起来很显而易见。

But having you know, one of the things that we have now is a skill about the cloud API, sounds super obvious.

Speaker 0

但即便只是拥有这个技能也非常有价值,因为有时我们会推出新模型。

But even just having that is really valuable because you would sometimes find you know, we'd launch a new model.

Speaker 0

而这个模型并不具备这方面的固有知识,于是就会出现一些非常奇怪的争论。

It wasn't in the the model's sort of innate knowledge, and then you'd get into these really funny arguments.

Speaker 0

不。

No.

Speaker 0

我知道你打错字了。

I know you made a typo.

Speaker 0

这是十四行诗四十五。

It's it's sonnet four five.

Speaker 0

你却说:不。

You're like, no.

Speaker 0

我知道。

I I know.

Speaker 0

就是不。

It's no.

Speaker 0

不。

No.

Speaker 0

不。

No.

Speaker 0

所以,拥有这种能力,拥有良好模板化的示例和技能,我认为是有帮助的。

So, like like, having that capability, having, like, good templatized examples of that and skills, I think, helps.

Speaker 0

但第二部分也很有趣,那就是这类软件是一种不同类型的测试。

But then the second part is what's also interesting is that class of software is just a different type of test.

Speaker 0

因为要为一个原生智能体产品编写端到端的功能测试要困难得多,部分原因在于它的不可预测性。

Like, it's much harder to sort of write an end to end functional test around an agent native product because part of it is that unpredictability.

Speaker 0

因此,我们在实验室里经常讨论的一个想法是:如何提高验证的保真度?

And so another idea we've been kicking around a lot in labs is, how do you increase, like, the sort of fidelity of the verification?

Speaker 0

前几天,我正在开发一个原生智能体的iOS应用,让Claude与它进行交互。

The other day, I had a agent native iOS app that I was working on, and I was I was having Claude interact with it.

Speaker 0

Claude最终在应用的聊天功能里和自己对话,看着Claude和Claude聊天非常有趣,就像有人在假装扮演人类一样。

And Claude was end up having a conversation with itself in, like, a chat feature in the was very funny watching Claude talk to Claude because it's like somebody's pretending to, like, be what humans are.

Speaker 0

而这个特定的原型是我做的关于工作日志反思的,Claude当时说,是的。

And this particular one was, like, a prototype I was doing about, like, sort of, like, work journal reflections, and the Claude was like, yeah.

Speaker 0

我的老板对我要求真的很严格。

My boss is really rough on me.

Speaker 0

我那天过得很糟。

Like, I had a hard day.

Speaker 0

然后Claude说:‘听到这个我很难过’,接着他们就来回聊了起来。

And then the Claude's like, oh, I'm so sorry to hear that, and they're just going back and forth.

Speaker 0

但你不会为这种情况写单元测试,而且你知道,它也可能引发一些其他涌现的想法。

But you wouldn't have written a unit test for this, and, you know, maybe it would have come up with some other emergent idea as well.

Speaker 0

所以我认为你必须更多地转向设置能够尽可能充分测试这些代理原生功能的测试框架,因为你无法准确预知它们会做什么。

So I think you just have to go much more towards, you know, setting up harnesses that are actually exercising as much of that agent native capability as possible because you don't exactly know what things are going do.

Speaker 0

而且事情往往会陷入奇怪的状态,比如Claude会尝试做一些你根本想不到它会做的事,还可能把你的应用带入一种新状态。

And things are going to end up in a weird place where Cloud's going try to do something that you wouldn't even think it was going to do, and it might put your app in new state.

Speaker 0

所以,也许回到最初的问题,真正困难的是让底层架构依然能应对这种情况,这非常重要。

So maybe circling all the way back to still what's hard, it's like having the underlying architecture still be robust to that is really important.

Speaker 0

对吧?

Right?

Speaker 0

它虽然是代理原生的,但也具备一种你可能没预料到的灵活性,而你只要掌握了正确的基础组件就够了。

It's like it's agent native, but it's also able to flex in a way that you might not have anticipated, but you've got the right primitives right.

Speaker 0

我觉得这就是2026年软件设计的艺术与科学。

I feel like that is the art and science of software design in in 2026.

Speaker 1

这真的很有意思。

That's really interesting.

Speaker 1

我完全同意你的观点。

I I totally agree with you.

Speaker 1

是的。

Yeah.

Speaker 1

你需要在一个安全的环境中设立一个试验场。

You wanted to have a playground within a safe a safe environment.

Speaker 1

只有在边缘安全的情况下,才能称得上是试验场。

That's the only way you can have a playground is if it's safe around the edges.

Speaker 1

但我觉得,最初我们把试验场做得太小、太受限了。

But I think, initially, we made we made the playground, like, way too small and constrained.

Speaker 1

现在模型变了,所以我们能把它开放得多得多。

And now the models have changed, and so we can open it up a lot.

Speaker 1

但我们仍然没有搞清楚,至少我自己还没搞清楚这些界限到底在哪里。

But we still haven't figured out exactly, like, at least I I have not figured out exactly what the lines are.

Speaker 1

是的。

Yeah.

Speaker 1

我觉得这里涉及的内容太多了。

I I think that there's there's there's so much here.

Speaker 1

比如,我脑子里有个想法,我想知道你有没有更简洁的方式来表达它。

Like, one thing this is making me think of is that I have this idea in the back of my head, and I'm I'm wondering if you're if you're if you have a a way to put this that that is more succinct.

Speaker 1

现在产品中的价值单位就像是工作证明或使用证明——当团队成员向我提交拉取请求时,我不只是想确认所有测试是否通过(我默认它们都通过了),而是希望你能发一个你使用它、或者你的代理使用它的Loom视频,让我判断这是否真的不错。

It's like the unit of value in in products right now is it's it's like proof of work or proof of use where when someone on the team submits a PR to me, I wanna see not necessarily did all the tests pass because I just assumed that it did, but, like, send me a loom of you using it or your agent using it so I can tell is this good or not.

Speaker 1

你明白吗?

You know?

Speaker 1

对。

Yes.

Speaker 1

是的。

Yeah.

Speaker 1

你对这个问题是怎么想的?

How are you how are you thinking about that?

Speaker 0

是的。

Yeah.

Speaker 0

我觉得这大概有三个层面。

I think that there's probably, like, three layers to that.

Speaker 0

就像第一个层面是,Claude,向我证明你确实使用过这个功能。

Just like the first one was like, Claude, prove to me that you've exercised this in some way.

Speaker 0

你懂的?

You know?

Speaker 0

我已经开始在我的所有承诺中这么做了。

I've started doing that in all my promises.

Speaker 0

我会在结束时说,当我们在处理未来的工作时,我会要求:在提交之前,先自己验证它确实按预期运行,然后再让我确认。

I end you know, when it's working on the future, I'm like, and by the end, you know, before UPR, like, prove to yourself and then to me that it works as intended.

Speaker 0

找到正确的方法,这实际上迫使你改变自己构建和搭建这些功能的方式,去思考:如何让Claude能够简洁地测试这个改动,而不是它习惯的做法。

Like, find the right way of doing it, which actually ends up you have to change your own sort of way you build and scaffold them around saying, what is the right way to get Claude able to at least test this change, you know, succinctly rather than what it likes to do.

Speaker 0

就像是,我读了代码。

It's like, I read the code.

Speaker 0

看起来不错。

It looks good.

Speaker 0

我想,是你写的代码。

I'm like, you wrote the code.

Speaker 0

我不信任你。

I don't trust you.

Speaker 0

所以,你真的得好好测试这个东西。

So, you know, you gotta really test this thing.

Speaker 0

然后第二个是,你所描述的,就像是每件事都得有某种证明,比如它是否按预期工作,是否符合你的本意?

And then the second one is that what you described is like, you know, everything having some, you know, sort of proof around, like, did is it is it working as intended and as you intended to?

Speaker 0

因为云模型或这些模型会为你做出大量决策。

Because cloud is going to make or any of these models is going to make a lot of decisions for you.

Speaker 0

有时候,你会知道,我团队里的工程师会提交一个拉取请求。

And sometimes, you'll you know, I'll have engineers on the team put up a PR.

Speaker 0

我当时就想,你为什么选这个而不是那个?

I'm like, oh, why did you choose to do this versus that?

Speaker 0

很多时候,答案是他们根本没做选择。

And many times, the answer is they didn't choose.

Speaker 0

这只是模型做出的决定,也许是个合理的决定。

It was just the choice the model made, and maybe it was a reasonable choice.

Speaker 0

这大概是个还说得过去的决定,但并不是真正契合这种模式的最佳选择。

It was probably a reasonable ish choice, but it wasn't, like, the optimal choices that fit into the paradigm.

Speaker 0

我觉得这不仅仅是工作成果的证明,更是深思熟虑的证明。

I feel like that is the it's like it's not just proof of work, but it's like proof of thoughtfulness.

Speaker 0

你有没有认真思考过这个问题?

Like, did you think this through?

Speaker 0

昨天我和一位工程师聊天,他说:‘我知道你肯定会问很多关于这个的问题。’

And I was talking to an engineer yesterday, and they're they're they was like, I was really I knew you were gonna ask me a lot of questions about this.

Speaker 0

所以我提前review了Claudia做的工作,以免我回答时说‘我不确定’。

So I was reviewing what Claudia done so that I wouldn't be like, I'm not sure.

Speaker 0

你知道的。

You know?

Speaker 0

对于大多数拉取请求,我并不会在这方面多加追问。

And that's don't I don't push on that for most PRs.

Speaker 0

但当遇到一个这样的情况时,比如:‘我正在重构这个系统,会引入一些新的基础组件。’

But when there's one that's like, oh, I'm refactoring this system, and there's gonna be these new primitives.

Speaker 0

太好了。

Like, great.

Speaker 0

我们得确保这些组件设计得当,并且你已经仔细思考过它们之间的相互关系,因为否则很容易不知不觉地堆砌出一整套你都没意识到的假设。

Let let's make sure those are good and that you've thought through how they interrelate because it's very easy to end up otherwise with sort of this sort of tower of assumptions that you're not fully aware of.

Speaker 1

我今天真的遇到了完全相同的情况,我纯粹凭感觉写了一段代码,现在它增长得很快,但同时也出现了很多问题。

I had literally the same experience today because I I made proof, totally vibe coded, and it's growing really fast right now, but it's going down a lot.

Speaker 1

所以我过去十二个小时一直在努力修复它。

And so I've been spending the last twelve hours, like, trying to fix it.

Speaker 1

因此,我们Every内部有一个小团队主动申请来帮我解决这个问题。

And so we have a little swap team internally at Every that, like, signed up to help me fix it.

Speaker 1

所以我得去给他们做入职培训。

And so I had to, like, onboard them.

Speaker 1

我当时想:糟了。

And I was like, shit.

Speaker 1

我该怎么解释这个代码库的工作原理?

How do I explain how this codebase works?

Speaker 1

于是我不得不反复和模型沟通,好比说:好吧。

And so I had to, like, go back and forth with the model a bunch to be like, okay.

Speaker 1

帮我定义一下这些术语。

Help me to like, define these terms.

Speaker 1

帮我弄清楚该怎么解释,才不至于显得像个彻头彻尾的傻瓜,因为说实话,我确实懂一部分,但不是全部。

Help me, like, figure out how how I can explain this so I don't look like a total idiot because, like, yeah, there's I understand, like, some of it, but not all of it.

Speaker 1

远远不够像以前那样必须掌握的程度。

Definitely not enough to, like, the way that I would used to have to to know to know.

Speaker 1

而现在完全不一样了:我还需要了解这些吗?

And it's a whole different thing to be like, do I need to know that anymore?

Speaker 1

现在这条线到底在哪里?

Is it like, where's the line now?

Speaker 1

很难说清楚。

It's hard hard to tell.

Speaker 0

这可能引出了另一个问题,我还没试着表达清楚,所以请耐心听我说,我慢慢说到点子上了,就是说,是的。

Which maybe gets to something else, and I haven't tried to articulate this, so bear with me as I, like, know, kinda get there, which is yeah.

Speaker 0

有些产品你用起来感觉底层很稳固,而有些产品你用起来会觉得,只要一个错误的命令或点击,整个系统就可能卡住或变慢。

There's products that you use that feel robust underneath, and there's ones that you use that you're like it feels like it's one wrong command or click away from the whole thing, either, like, freezing or it being slow.

Speaker 0

在Instagram,我们曾经有Instagram直接消息第一版,谁知道呢?

For us at Instagram, like, we had Instagram view direct messaging v one, and that, like, who knows?

Speaker 0

你发一条消息,它可能到得了对方,也可能到不了。

If you send a message, it might or may not arrive to the other person.

Speaker 0

我们自己写了一个专属的实时系统。

Like, we'd, like, wrote I wrote our own, like, bespoke real time system.

Speaker 0

它经常崩溃,你根本不敢靠它来发送你真正希望对方看到的消息。

It was like, you know, fell over a bunch of just you would not trust that to send a message that you really needed somebody else to see.

展开剩余字幕(还有 311 条)
Speaker 0

这更像是一个社交性质的东西。

It was just a, you know, more of a social thing.

Speaker 0

当我们开发第二版时,我们非常强调一点:不行。

And when we built v two, it was really important that we really hammered, like, no.

Speaker 0

如果你发送一条消息,我们可能达不到WhatsApp那样的水平,比如你身处完全没有信号的地方,只剩下一格边缘网络,它仍然能设法发送出去。

Like, if you send a message we're not probably gonna get to WhatsApp level of, like, you know, you can be in the middle of absolutely nowhere with, like, one bar of edge, and it will probably, you know, try to still go through.

Speaker 0

也许那不是我们的标准,但我们的标准是:当我加载消息时,它必须感觉可靠。

Maybe that's not the bar, but still a bar of when I load messages, it feels robust.

Speaker 0

消息一旦发送,就真的是发送了。

When it's sent, it's really sent.

Speaker 0

我觉得就像有一个小小的确认。

I feel like there's, like, a little check.

Speaker 0

这是一个小小的例子。

That's, like, one small example.

Speaker 0

但我认为,我们需要找到方法,让这种体验成为任何产品发布中不可或缺的一部分,不只是在Anthropic,而是普遍如此。

But I think that that is a thing that we still need to figure out how to make, you know, feel like an essential part of shipping on anything, not just at, you know, Anthropic, but in general.

Speaker 0

你打造了这个东西。

Like, you built this thing.

Speaker 0

它感觉像是建在沙子上,还是感觉很稳固?

Does it feel like it's built on sand, or does it feel robust?

Speaker 0

而原生智能体的部分更进一步,它带来的是:我能不能稍微推它一下,它会不会倒?

And the agent native part adds something totally even beyond that, which is, can I push it a little bit, and is that gonna fall over?

Speaker 0

还是说,感觉很棒。

Or does it feel like, great.

Speaker 0

我有一个坚实的底盘,是的,你可以从不同方向推我,但你知道,你的数据是安全的,就在下面,而不是仅仅一次部署就可能彻底崩溃。

I've got a solid trunk, and, yeah, you can push me in different ways, but, you know, your data is safe and it's underneath here, and it's not just, like, one deploy away from completely falling over.

Speaker 1

所以,如果这就是标准,我同意,这确实是你们想要达到的目标,随着模型变得更好,你们在招聘和团队结构上做了哪些改变?

So if you're if that's if that's the bar, which I agree, like, that's that's where you definitely wanna get to, how has how have you changed who you hire and how your teams are structured as the models have gotten better?

Speaker 1

比如我们的一个产品Spiral,我们刚聘了一位新总经理,他技术背景不强,但在产品和写作方面能力非常突出,而Spiral本身就是一个写作类产品。

Because for us, for example, one of our products, Spiral, we just hired a new GM who is like he I would say he's lightly technical, but he spikes super high on product and writing sense and Spiral is a writing product.

Speaker 1

现在我们可以招聘这样的人,但一年前我们还做不到,因为当时的代码模型还不够好。

And now we can, like, hire someone like that where a year ago we wouldn't have been able to cause the coding models weren't good enough.

Speaker 1

我很好奇,但缺点是,如果没有一个对所有细节都非常精通的技术人员,产品可能就不会显得那么稳固。

I'm curious, like but the downside is it's maybe the product won't feel quite as robust if there's not someone who's, like, super technical in all the details.

Speaker 1

那么,你是如何看待实验室团队中现在谁在构建产品,以及这种状况如何随时间变化,未来又将如何变化的呢?

So, like, how do you think about who builds products right now inside of the labs team and how that has changed over time and how it will change.

Speaker 0

是的。

Yeah.

Speaker 0

我喜欢这一点。

I love that.

Speaker 0

我觉得实际上你会被拉向两个方向,但这两个方向都很重要。

I think it's actually you get pulled in two directions, but they're both important.

Speaker 0

一方面是基础架构和架构的稳健性,我认为这仍然需要一位资深技术人员。

There's the sort of primitives and architectural robustness, which I think still need a sort of senior technical person.

Speaker 0

我刚才和某人聊天时笑了。

I was laughing with somebody.

Speaker 0

他们说,我以为自己在分布式系统方面的技能再也不会有用武之地了。

They're like, I thought, you know, my skills in distributed systems were, like, not gonna be useful anywhere.

Speaker 0

但事实上,这些可能是最有用的技能,涉及对这些问题的思考和深入分析。

But, actually, those are the maybe some of the most useful skills and reasoning about that and, you know, thinking things through.

Speaker 0

比如,上周我和Claude就我构建的系统是否需要Redis,还是可以直接用PostgreSQL展开了长时间的讨论。

Like, I had a long debate with Claude last week around, like, whether the system that I was building needed Redis or not or could go to just Postgres.

Speaker 0

这是一场有益的辩论,而我之所以能参与,是因为我之前大量使用过这些技术,有扎实的基础。

And, you know, was it was a healthy debate where, like, I but only because I was grounded and having used a lot of those technologies before.

Speaker 0

但另一方面,这种稳健性还体现在:你是仅仅通过调整系统提示词和额外指令来掩盖所有问题,还是真正正确地设计了一整套工具?

But then there's the other side of robustness, which is have you just papered over all the problems with, like, fixes to your system prompt and additional instructions, or have you sort of architected the actual, like, set of tools correctly?

Speaker 0

因此,后者同样重要,可能也是这个GM能真正发挥价值的地方。

And so that the the latter is as important and probably where this GM can be really valuable in that.

Speaker 0

好的。

Okay.

Speaker 0

比如,我在做调整,但就像你不会通过简单地让分布式系统‘五秒后重试’来修补它的不稳定性一样。

Like, I'm making changes, but just like you wouldn't patch a sort of flakiness in your distributed system by just being like, well, just retry it in five seconds.

Speaker 0

我相信它会奏效的。

I'm sure it'll work.

Speaker 0

还有,不要像使用全大写、Markdown,或者你试图修补的任何其他东西那样,永远不加区分地重复使用。

Like, also not doing the same thing with never ever, you know, all caps use, you know, markdown or whatever the the the thing that you're trying to patch is.

Speaker 0

实际上,这两种情况都是同一个问题的表现,即底层模块是否足够健壮。

Like, they're both actually symptoms of the same thing, which is is the underlying piece robust or not.

Speaker 0

Claude实际上——我想说的是所有模型都如此,但我觉得Claude在这两方面都能做得更好。

And Claude actually I'd say this about all the models, but I think Claude could be much better at both.

Speaker 0

它仍然非常需要大量的人工监督。

It's, like, still a place that still needs a lot of of human oversight.

Speaker 0

在系统方面,它现在能够调试生产系统,这非常有价值,但在最初设计架构时,我觉得我们仍然需要那些真正深入思考过这三点或有丰富经验的人。

On the systems part, you know, it's now able to debug production systems, is really valuable, but architecting them in the first place, I feel like we're still benefits from somebody who's really thought these three things through or has experience.

Speaker 0

在提示词方面,你知道,如果你给它一个——我见过有人,甚至在这里内部,陷入这种开发循环。

And on the prompting side, you know, if you give it a I've seen people get into this dev loop even internally here.

Speaker 0

这是提示词。

Like, here's the prompt.

Speaker 0

这是系统犯的一个错误。

Here's a mistake that the system made.

Speaker 0

不断优化提示词。

Iterate on the prompt.

Speaker 0

它的自然倾向就是不断往提示词里添加更多内容,最终变成这样:如果你给新员工第一天就列出一百条指令,比如‘总是回答并使用标记,除非他们……’,他们肯定会想:我还是只记你最后说的那条吧。

Its natural tendency is to just add more things to the prompt, and then eventually just get to this thing that, you know, if you onboarded a new employee and you gave them a 100 instructions on their first day, like, always answer and mark down except when they've you know, they'll be like, I'm just gonna remember the last thing you told me.

Speaker 0

我会直接绕过它。

I'm gonna, like, short circuit it.

Speaker 0

所以重新思考一下,好吧。

So then rethinking, okay.

Speaker 0

这些真的是两个不同的工具吗?

Is these are are these actually two different tools?

Speaker 0

会不会其实是两个独立的智能体,各自只拥有更少的上下文,然后可以拆分开来?

Is it actually two agents that each have a smaller amount of context that then you can break apart?

Speaker 0

回到你最初的问题,我们正在招聘具备系统经验的人才,即使是在实验室这种通常被认为是零到一原型开发的地方。

So back to your original question, we're hiring for people with, you know, systems expertise even within labs, which you think of as, like, more zero to one prototypes.

Speaker 0

这仍然非常有价值,因为再次强调,稳健性很重要。

Like, it's still really valuable because, again, that robustness matters.

Speaker 0

而且,谁会帮忙处理系统权限、配置和早期测试这些事情呢?

And, also, just who's gonna be, you know, helpful in sorting through, you know, systems permissions and provisioning and early testing.

Speaker 0

这类工作即使在云环境中依然很困难,因为云系统本身无法自行修改配置——这是出于良好原因的限制。

Like, that stuff is is still, you know, it's still hard even for cloud when it can't edit the provisions itself, which it can't for good reasons.

Speaker 0

在稳健性方面,我们发现将产品团队与应用AI团队配对取得了很大成功。

And then on the on the robustness side, actually, we've had a lot of success pairing our product teams with our applied AI teams.

Speaker 0

我们的应用AI团队每天都在一线帮助客户优化他们的提示词。

Our applied AI teams are the teams that are in the field every day helping customers iterate on their prompts.

Speaker 0

我们实际上已经成为这些工作的‘零号客户’,因为我们有很多产品都深度依赖AI技术。

And we found that we actually are very we're customer zero now for those, you know, efforts because we have a lot of products that are, you know, very AI powered.

Speaker 0

那么,我们该如何将这种专业知识引入进来呢?

So how do we bring that expertise in here?

Speaker 0

因为这种专业知识目前并不在我们的软件工程师手中,例如。

Because that expertise does not sit with our software engineers today, for example.

Speaker 1

那介于底层架构和这些之间的部分呢?

What about the in between of, like, okay, it's not the underlying architecture.

Speaker 1

这跟提示词有关。

It's on the prompt.

Speaker 1

这就像用户界面和流程。

It's like the UI and the the the flow.

Speaker 1

谁在做这个?

Who's doing that?

Speaker 0

我们,这是个好问题。

We that's a great question.

Speaker 0

比如,我们发现,一些转入实验室的人,是那些原本专注于网站细节打磨,但又想尝试新事物的人。

Like, we have found, you know, some of the people that transferred into labs were the folks that really were focused on polish on the website, but they were interested in doing something new.

Speaker 0

他们也带来了完全不同的方法,比如我们有个原型。

And they bring such a different approach as well around we had the prototype.

Speaker 0

它看起来只是普通地不错,而不是让人感觉有品牌感,有这种特质。

It was it looked generically nice versus, oh, this feels like it's branded and it has this.

Speaker 0

所以,这是第一部分。

So that's that's part one.

Speaker 0

第二部分是设计师。

Part two is designers.

Speaker 0

我们的设计师越来越多地转向了设计师与构建者双重角色。

Like, we've had our designers move much more into a sort of split designer and builder role.

Speaker 0

不是所有人,但大多数人都是这样。

Not all of them, but most of them.

Speaker 0

实际上,我们在实验室里并没有太多全职设计师。

And a lot of our, you know, we actually don't have a lot of full time designers on labs.

Speaker 0

但那些有的,我可以说他们写的代码和工程师一样多,因为他们有能力做到。

But the ones that we do, I would say are writing and contributing almost as much code as engineers on those efforts because they can.

Speaker 0

而且,如果搭配得当,我们发现这种模式几乎像某种联合创始人模式,用于一些实验室项目:可能是设计师提出了最初的想法并推动它,而传统的软件工程师则在后面铺路,确保它真正可行。

And again, paired correctly with the right person, we have found this almost sort of co founder model for some of these labs initiatives, or you have the designer who had the original idea maybe and they're pushing on something, and then the traditional software engineer that's gonna go and, you know, make pave the trail sometimes behind the designer to make sure that actually works.

Speaker 1

好的。

Okay.

Speaker 1

我想了解一下这个。

This I wanna know about.

Speaker 1

那么,能跟我讲讲这个团队结构是怎么运作的吗?

So what so tell me about how that team structure works.

Speaker 1

所以,这通常是一个设计师,还是任何有产品想法并能以某种方式执行的人,搭配一位真正的工程师来打磨他们留下的粗糙路径?

So you've got a design is it actually usually a designer, or is it just anyone that has a product idea that can kinda execute it on it in some way paired with a a real real engineer that actually can, like, kind of smooth out the rough edges of the the the trail they're leaving?

Speaker 0

这会有些不同,但我们发现,启动新项目最重要的因素,也是我们的关键制约点。

It sort of varies, but it we found the one thing that's most important and sort of our gating factor in starting up new projects.

Speaker 0

我想知道,这和Every的情况有多相似?是否需要有人对这个想法——即使不是对具体方案——有极强的信念?过于执着于具体方案可能有风险,但至少对所探讨的问题领域要有强烈的信念。

I'm curious how similar is this to Every, is having somebody with extreme conviction about if not necessarily that idea, too much conviction on the exact idea is probably dangerous, but at least in the problem space or the question that they're asking.

Speaker 0

这种类似联合创始人或创始人的态度是:我会不惜一切代价推进,直到这个事情被证实可行,或者彻底失败,我只想明确地走一条路。

And that sort of, like, cofounder or founder level of I will break through walls until this thing is either proven out or dead, but I wanna, like, go either way.

Speaker 0

当我们复盘那些已经终止的实验室项目时,常常发现:这个团队里其实没人真觉得这是个重要的方向。

When we have bets labs bets that we've wound down often in the postmortem, we're like, nobody on this team actually really thought this was, like, the thing.

Speaker 0

他们只是说,是的。

They were like, yeah.

Speaker 0

这看起来挺合理的。

This seems reasonable.

Speaker 0

这简直就是项目失败的丧钟。

Like, that's the the death knell for projects.

Speaker 0

对吧?

Right?

Speaker 0

这个人可以是设计师,我们有几个项目就是这样。

So that person can be a designer, and couple of bets it is.

Speaker 0

也可以是那种有产品思维的工程师。

It can also be sort of a, you know, product minded engineer.

Speaker 0

很少是纯粹的产品经理。

It's rarely a pure PM.

Speaker 0

我们目前只有一个产品经理,负责整个实验室的所有项目。

We actually only have one currently one PM for all of labs.

Speaker 0

我们正在招聘更多人,他们扮演着非常广泛的角色。

We're hiring more, and they're sort of playing, you know, sort of a a wide role.

Speaker 0

但没错,通常是设计师或者有产品导向的创始人。

But, yeah, a designer or, like, a product oriented founder.

Speaker 0

然后我们寻找的是,我们需要哪些技能来与之互补?

And then what we look for is, well, what skills do we need to complement with that?

Speaker 0

因为作为我们实验室流程的一部分,我们每两周评估一次每个项目,决定是加大投入还是让这些成员重新回到更广泛的实验室池中。

So, you know, because we're doing as part of our labs process, it's actually evaluating every project every two weeks and deciding whether we double down or whether we sort of release those folks back into the broader labs pool.

Speaker 0

在任何时刻,都可能有人能被调入项目,他们具备基础设施方面的专业知识,或曾使用过特定的内部系统,或拥有丰富的提示工程经验,可以灵活地加入或退出项目。

At any given point, there's probably somebody who can be pulled onto the project that has that infrastructural expertise or has worked with that particular internal system or has a lot of deep prompting expertise to sort of flow in and out.

Speaker 0

所以我认为,这种孵化器式的模式也有助于这一点,因为没有人会永久固定在一个项目上。

So I think that's also where the sort of incubator style space helps because nobody is fixed on a project forever.

Speaker 1

这真的很有趣。

That's really interesting.

Speaker 1

是的。

Yeah.

Speaker 1

我们那里做法略有不同,虽然有一些重叠,但我们有稍微不同的结构,就是我们……

We're we we do it slightly different there's some overlaps, but we do have a slightly different structure where we yeah.

Speaker 1

我们有总经理,或者他们最初是创业者或驻场人员,当他们找到想专注开发的产品时,就会成为总经理。

We have GMs or or they start as entrepreneurs and residents and they become general managers when they find a product that they wanna, like, work on.

Speaker 1

每个产品只由一个人负责。

And each product just has one person.

Speaker 1

就是一个人全栈搞定一切。

Like, one person that does everything full stack.

Speaker 1

比如设计、工程、营销,所有这些方面。

So, you know, design, engineering, marketing, all that kind of stuff.

Speaker 1

至少这些基础工作都是一个人做。

At least the all the basics of that.

Speaker 1

以前这种总经理的背景通常是极富技术背景的创始人,但现在我觉得已经转向至少需要一些基础技术能力。

The shape of that GM used to be, like, super technical founder background, and now I think has shifted towards at least some light technical.

Speaker 1

但我 honestly 只关心你能否熟练使用 Claude、Codex 或其他类似工具。

But, like, I honestly just care that you can use Claude or Codex or whatever well.

Speaker 1

还有出色的产品直觉,对你要构建的领域或事物有良好的品味,并且有证据表明你能用 AI 做出东西。

And really good product sense, really good taste for for the the the subject area or the thing that you're trying to build and evidence that you can build with AI.

Speaker 1

然后我们有一个共享资源层,有点像一家代理机构,我们有设计师、增长营销人员,还有运营人员,你可以根据需要随时调用,这种方式效果相当不错。

And then what we have is shared resource resource layer that sort of works a little bit like an agency where we have designers and we have growth marketers and we have, you know, ops people that you can, like, pull in and out for various initiatives, and that seems to work pretty well.

Speaker 1

所以我们负责管理所有内部机构,而每个总经理则在外围,根据需要为不同项目调用资源。

So it's like, we manage all the internals the internal agencies, and then each GM is out on their on on, you know, on the edge, and they pull in resources as they need it for different

Speaker 0

是的。

Yeah.

Speaker 0

但听起来你也需要这样一个人:这件事就是他的全部,他不把事情完全搞定就不会睡觉。

But sounds similarly, like, you need somebody for whom that is, like, the thing and they are not going to sleep until it is fully working.

Speaker 1

是的。

Yes.

Speaker 1

没错。

Exactly.

Speaker 1

而且我一直在想,好吧。

Like and and I've been I've been thinking about, okay.

Speaker 1

你什么时候会雇另一个人来负责一个产品,或者什么时候会给一个产品增加新人?

When when would you hire someone else to work on a product, or when would you add someone else to work on a product?

Speaker 1

其实,当你无法再把整个事情都记在脑子里的时候,就是那个时候。

And it's like, there's some point at which you can't hold the entire thing in your head.

Speaker 1

即使你是推动它前进的人,也无法把整个事情都记在脑子里。

Even if you're the one pushing it forward, you can't hold the entire thing in your head.

Speaker 1

而以前这个临界点要小得多。

And that point used to be much smaller.

Speaker 1

现在这个临界点大了很多,但到了某个阶段,即使是一个小功能也会变成一个独立的产品。

Now it's much bigger, but there's a certain point at which, like, even a small feature turns itself into its own product.

Speaker 1

你知道的吧?

You know?

Speaker 1

当你最初在Instagram里开发消息功能时,那时候确实是,是的。

When you when you first make the messaging feature inside of Instagram, it's like, yeah.

Speaker 1

我可能一两周就能搞定。

I can do that in, like, a a week or whatever.

Speaker 1

但到了某个时候,它就变成了一个独立的产品。

But at some point, that's its own product.

Speaker 1

它几乎需要一个专属团队。

It almost needs its own team.

Speaker 1

而且我认为,那条界线正在变化,或者说一个人能独立完成的事情确实变多了,但这条能力的边界依然是真实存在的。

And that, I think, that line is getting or or the number of things you can do with one person is getting bigger, but it still exists somewhere.

Speaker 1

但我还没完全弄明白该怎么处理这个问题,或者说该怎么判断这个边界在哪里。

But I haven't quite figured out, like, how to how to manage that or how to tell.

Speaker 0

没错。

No.

Speaker 0

我很认同这个说法,因为这件事其实可以分成两部分来看:当一个项目的规模还小到能装进一个人的脑子里的时候,给团队加人手反而会拖慢进度。

I love that because there's actually I think there's the two parts to that, which is when the idea is still enough to hold into your own head or an individual person's head, adding more people actually slows the team down.

Speaker 0

这正是我们在实验室研究中得出的一个反直觉结论:过快扩张团队规模实际上会产生负面效果,因为所有人都会把所有时间都花在团队协调上。

And that's like a non obvious finding that we found on labs is scaling the teams too quickly actually is a net negative because they end up spending all this time on coordination.

Speaker 0

比如说本来……本来我还打算……但迈克·克里格就能处理这事。

Like, oh, you were oh, I was gonna take oh, but Mike Krieger could do that.

Speaker 0

结果事情最后就变得一团糟。

Then it it just ends up in this sort of piece.

Speaker 0

而且你还要应付一大堆统一认知的沟通会议。

And you also have all those alignment conversations.

Speaker 0

就像在Instagram的时候,只有我们两个人非常重要。

Like, it was important in Instagram that it was just two of us.

Speaker 0

让两个人达成一致已经够难了,更别说让两个人步调一致了。

Like, it was hard enough to align the two of us, like, and go, like, get two people on the same page.

Speaker 0

对吧?

Right?

Speaker 0

我做的第二个创业项目Artifact,最初几个月只有我和凯文两个人在做,但后来我们雇了一个大约八人的团队。

The second startup I did, Artifact, you know, Kevin and I were doing that alone for the first few months, but then we hired a team that was about eight people.

Speaker 0

这真的很难,因为我们当时还没有找到产品市场契合点,还在不断迭代。

It was really hard because, you know, we hadn't had product market fit yet, and so we were still iterating.

Speaker 0

然后你就陷入这种局面:八个人开Zoom会议讨论下一步该做什么,而你其实只想找个房间,面对面把事情理清楚。

And then you'd end up in these things where we're on a Zoom with eight people talking about what we're doing next, and you really just wanna be able to sit in a room and hash it out.

Speaker 0

所以我发现,这些实验室项目中也存在类似的情况:即使想法很吸引人,也不该过早扩大团队,否则你就会陷入这种元协调的困境。

So I find with these labs initiatives, there's some there's some similar sort of aspect at play, which is you don't wanna pre scale the team too early even if the idea is exciting because then you just end up in this sort of, like, meta coordination game.

Speaker 0

但我喜欢你提出的观点:确实存在某个临界点,两个人一起合作会更有帮助,那时项目的上下文和范围足够大,足以让两人在脑海中承载更复杂的内容。

But I I like your framing of there is some point where either, you know, two people really will help go on it together, and there's enough sort of context and scope where they can hold some other complex piece in their head.

Speaker 0

而且,如果有人已经在一个想法上琢磨了两到四周,这时候有人注入新的思路,这种紧迫感也会有帮助。

And then there's also the if somebody's been spinning on the same idea for two, four weeks, somebody's injecting some other thinking and and that urgency can help too.

Speaker 1

是的。

Yeah.

Speaker 1

我认为在人工智能领域保持团队小规模尤其重要,因为我们经常要面对一个问题——我确信你也见过——每三到六个月,你就得扔掉一半的产品。

I think it's especially important in to keep it small in AI because one of the things that we deal with all the time, which I'm sure you see too, is every three to six months, you're you have to throw out, like, half your product.

Speaker 1

如果你要和很多人协调,这会非常困难。

And that's really hard to do if you have to coordinate with a lot of people.

Speaker 1

但如果是一位总经理突然意识到:天啊。

But if it's one GM who realizes, oh, shit.

Speaker 1

对。

Yeah.

Speaker 1

我得直接扔掉这一半内容,因为模型已经进步太多了。

I gotta just, like, throw out half of this because the models are so much better.

Speaker 1

这样要做出这样的转型就会容易得多。

It just makes it much easier to to, like, pivot in that way.

Speaker 1

你有看到这一点吗?

Is that do you see that?

Speaker 1

那你又是怎么应对的呢?

And, like, how do you deal with that?

Speaker 1

你是怎么想的,是的。

Like, how do you think about, yes.

Speaker 1

我知道三个月后,这些代码甚至整个功能集,我都得重新思考一遍。

I know in three months this the code maybe or even the whole feature set, I'm gonna have to, like, really rethink about.

Speaker 1

这让你对软件的看法发生了很大的变化。

Like, it feels like it changes a lot in in how you think about software.

Speaker 0

是的。

Yeah.

Speaker 0

而且要愿意删除代码。

And being willing to delete code.

Speaker 0

我认为Cloud Code团队在这方面做得非常好,他们把删除功能当作团队成员的一种准则。

I think that's something that Cloud Code team has done really well is they have sort of deleting features as a sort of imperative of people on the team.

Speaker 0

如果这个不行,我们就把它下线吧。

Like, if this is not working, let's go unship that.

Speaker 0

你知道的。

You know?

Speaker 0

而且通常当你创造出别的东西时,即使它没有完全取代旧功能,但只要能足够好地实现旧功能的用途,就值得将其弃用并移除。

And it's often when you've created something else that even if it doesn't entirely supersede, it does enough of what that other aspect does that actually makes sense to deprecate and then remove that first one.

Speaker 0

随着我们越来越专注于企业市场,即使使用这些工具,难度也会增加,因为企业会依赖它们。

It does get harder as we get more and more enterprise focused even with these tools because they come to depend on it.

Speaker 0

我记得很清楚,当我还是首席产品官时,大概六个月后,我们对Cloud AI进行了大规模重新设计,当时我们非常自豪,于是发布了它。

I I know I never forget we one of the things I did maybe six months into when I was still chief product officer was we did a big sort of redesign of of Cloud AI, and we were so proud, and we shipped it.

Speaker 0

我们收到了很多赞誉,但随后却收到一封非常愤怒的邮件,对方说:‘我为公司录制了二十个小时的云企业使用培训内容,现在得全部重做一遍。’

And we had a bunch of kudos, and then we got this really angry email for somebody who's like, I just recorded twenty hours of enablement content for my company to do for cloud enterprise, and I have to, like, redo all of it.

Speaker 0

我们当时就说:哦,好吧。

And we're like, oh, okay.

Speaker 0

他们的发布节奏和我们完全不一样。

Like, they're you're playing at a different release cadence.

Speaker 0

当然,像在我们的一次大会上一年发布两次是不可行的,所以我们必须继续快速推进。

And, of course, like, shipping twice a year at one of our conferences is not an option, so we are gonna keep moving quickly.

Speaker 0

但后来我们逐渐学会在向企业端推出功能时,稍微放缓一点节奏。

But then we've since, like, learned to maybe moderate how we roll it out to the enterprise side a little bit more.

Speaker 0

不过,是的,我认为‘移除功能’这一点,最终会让你遇到一些已经基于它构建了东西的人,我举个例子。

But, yeah, I think the unshipping piece, then you end up with people who have built I'll use an example.

Speaker 0

所以有一个功能叫‘样式’。

So there's a feature in in called styles.

Speaker 0

这个功能使用的人不多,但用它的人用得非常频繁。

It's not widely used, but the people who use it use it a lot.

Speaker 0

我们在不同时间点也讨论过,是的,这个样式功能在产品中仍然有其意义。

And we've talked at different points like, yeah, the style still makes sense in the product.

Speaker 0

你知道,还有其他方式可以实现同样的效果。

You know, there's other ways of accomplishing the same thing.

Speaker 0

现在有自定义指令和项目了。

There's custom instructions and projects now.

Speaker 0

现在还有技能功能。

There's skills now.

Speaker 0

对吧?

Right?

Speaker 0

实现这一点的方法太多了。

There's so many other ways of accomplishing that.

Speaker 0

我不知道样式功能还会在产品中保留多久,但我知道上次我们讨论移除它时,发现它对几家公司的整个使用场景来说至关重要。

And I don't know how long styles will end up in the product, but I know that the last time we we talked about removing it, it ended up being really load bearing for a few a few companies', like, entire use cases.

Speaker 0

比如,他们有一个由CEO亲自设计的公司专属样式,并分发给所有员工,员工们就是靠它来开展工作的。

Like, oh, we have our house style that the CEO personally authored and gives to every employee, and that's how they operate.

Speaker 0

因此,找到替代这些功能的方法也非常有趣。

And so finding ways of doing that is also really interesting.

Speaker 0

我希望从长远来看,我们能建立起一套插件和技能系统,让它们不再需要内置在核心产品中。

I would hope that in the long run, what we can actually do is is come up with a system of plug ins and skills such that they no longer have to live in the core product.

Speaker 0

因为我认为,如果你没有一个清晰的替代方案,删除那个你向所有人提供的核心功能总是最难的。

Because I think that is it's always the hardest to delete something that is the core thing that you're shipping to everybody if you don't have the story around, great.

Speaker 0

你还是喜欢这个功能吗?

You still like that feature?

Speaker 0

太棒了。

Awesome.

Speaker 0

比如,你可以通过这种方式永远在自己的使用中保留它,持续迭代并让它成为你专属的,但不必为每个新注册的用户增加复杂性。

Like, here's how you can keep using it forever in your own and keep iterating on it and make it your own, but it doesn't have to add complexity to every future person that's adding that's signing up for the first time.

Speaker 1

我对实验室功能很好奇。

I'm curious for for labs.

Speaker 1

另外,总的来说,你对初创公司创始人有什么看法?

And then also maybe just in general, what are your thoughts for startup founders?

Speaker 1

你提到的企业客户观点让我想到一个我最近一直在思考的问题:如果你现在在AI领域向企业销售产品,即使你的产品现在很现代,也会很快过时,但你的客户却会想要旧版本。

Your enterprise point just it brings up something that I've been thinking about a lot, which is if you are selling to enterprise right now in AI, even if the product you have right now is modern, it will be quite outdated quite quickly and but your customers are gonna want the outdated version.

Speaker 1

但作为一家初创公司,这感觉有点冒险,因为如果你只是优化当前大型上市公司的采购需求,你很容易受到颠覆。

But as a startup, that's like a little bit that feels pretty risky because, yeah, you're just gonna I guess, you're you're you're susceptible to this to disruption if you are optimizing for what someone at a gigantic public company will buy right now.

Speaker 1

我认为有很多初创公司就属于这一类,它们可能两三年前就成立了。

And I think there's a lot of startups in that category where they maybe started two or three years ago.

Speaker 1

他们有一套特定的技术栈。

They have a certain tech stack.

Speaker 1

他们对如何做AI有一套特定的思维方式。

They have a certain way of thinking about here's how we do AI.

Speaker 1

而模型又如此不同,但他们的客户合同却是针对这种现状的,就像你在看Copilot之类的产品一样。

And then the models are so different, but their customer contracts are for this, like, sort of out it's like, you know, looking at looking at Copilot or whatever.

Speaker 1

这就是那种常见的氛围。

It's that's the sort of vibe that happens.

Speaker 1

你觉得你自己,以及在Anthropic内部,应该如何看待这个问题?

Like, how do you think how do you think about that yourself and and inside of Anthropic?

Speaker 1

那么,你认为创业者应该如何思考这个问题?

And then how do you think founders should think about that?

Speaker 0

是的。

Yeah.

Speaker 0

不。

No.

Speaker 0

这是个非常好的问题,尤其是因为接下来会有一波浪潮,比如更加原生的代理模式。

This is such a good question, especially because then a wave will come, like, being more agent native, for example.

Speaker 0

你能在现有的范式中接纳它吗?

And can you adopt it within your existing paradigm?

Speaker 0

这需要把一切推倒重来,还是你只是被困在其中了?

Does it require to throw everything out, or are you just stuck in that?

Speaker 0

比如,我们某种程度上已经采用了它。

Like, oh, we kind of adopted it.

Speaker 0

我们只是把它额外加了上去。

We kind of bolted it back on.

Speaker 0

我觉得有几点要考虑。

I think a couple things.

Speaker 0

对我们来说,我们开始做的事情是,本质上认为这列火车会继续前进,我们会沿途提供企业级的开关选项。

For us, what we've started doing is basically treating like, this train's gonna keep moving, and we'll provide enterprise toggles along the way.

Speaker 0

但核心部分会持续演进,这就是你与我们合作时所承担的赌注和认知。

But the core of it will continue to evolve, and that's sort of the the bet and understanding you're taking working with us.

Speaker 0

我认为这一点得到了很好的接受,因为公司也意识到,事情变化得太快了,他们之所以能安心做出一年期的承诺,是因为相信产品会持续演进。比如,CoWorker 就是个很好的例子——从第一天起,就提供了关闭功能,如果你不希望员工使用,完全可以禁用。

And I think that's been well received because I think companies have also seen that, you know, things are moving so quickly that the only way they even get comfortable with a year long commitment, for example, is to believe that it will continue to evolve along the way, but then we'll provide, you know coworker is a great example where, you know, from day one, there was like a a way to turn it off for your employees if you didn't want it, for example.

Speaker 0

我认为这是一种相当不错的模式。

And that that that's, I think, a reasonably good paradigm.

Speaker 0

但另一个观点是,正如我们之前讨论的,你实际上可以重新思考并彻底重写整个技术栈,我认为公司应该更愿意这样做。

But the other one is, just as we were talking earlier, like, you can actually rethink and and and and and sort of rewrite a lot of the the the stack is, I think companies should be way more willing to do that.

Speaker 0

而且所有事情都在加速压缩。

And it everything is getting compressed.

Speaker 0

对吧?

Right?

Speaker 0

在过去的周期中,有一种观念是,你必须放弃一些客户,他们可能因为与你发展方向不同的原因而非常喜爱你的产品。

In in previous cycles, it was the kind of idea of, like, having to fire some of your customers who might have been, you know, really into your product for a different reason than where you're going sooner.

Speaker 0

这通常是一个以多年为单位的决策过程,当时人们会说:‘是的。’

That was on a multiyear kind of time range thing where he was like, yes.

Speaker 0

去年的产品,和三个月前的产品,已经完全不同了。

Last year's product versus not three months ago's product.

Speaker 0

这听起来很疯狂,但我真的认为这就是你需要思考的方式——你必须愿意推出v3或v4版本,彻底重新思考现有功能的工作方式。

It seems crazy, but I actually think that's the kind of way you have to think about it, which is you have to be willing to put out the the v three or the v four that is a, you know, big rethink of how the existing piece worked.

Speaker 0

然后或许可以有一个过渡期,云服务可能有助于暂时同时托管新旧两个版本,直到完全切换。

And then maybe have a transition period, and Cloud can help probably host both for a little while before it cuts over.

Speaker 0

但同时也要愿意做出切换,说:是的,就是这样。

But then also be willing to cut over and say, like, yes.

Speaker 0

这就是我们对这类知识工作,或者这种AI驱动的制造未来的想法。

This is how we think the future of this piece of knowledge work or this, you know, AI powered manufacturing is gonna be.

Speaker 0

我们必须不断推进,否则正如你所说,你不是被下一个从零开始重新思考的公司取代,就是自己取代自己。

We gotta, like, keep it moving or else to your point, you're just gonna you're either going to get replaced by the next company that then rethinks it from scratch or yourself replacing it yourself.

Speaker 0

而且,这依然是老生常谈,只不过现在压缩到了几个月的时间内。

And, again, it's just the same old story, but now compressed to months.

Speaker 0

你对OpenClaw有什么看法?

What's your take on OpenClaw?

Speaker 0

它有一种我特别喜欢的特质——当人们看到原本已经可能实现的东西,现在被包装成一个可以实际试用的形态,并且围绕它形成了某种直观的构建方式。

It it has the flavor of something else that I or just the the thing I really like seeing when you get people to see something that was already possible, but it's now in a package where people can actually try it out, and there's some intuition, you know, around how to how to build on top of that.

Speaker 0

就像你一开始看到,其实你已经能用这些模型写代码了,但真正让这一点普及的,是一些突破性的低代码工具,比如 replets、lovables 和 v zeros 之类的,把这种能力整合了进去。

Like, you started seeing that with you could already use these models to write code, but it kinda took, like, some of these breakout, like, low code, you know, the replets and lovables and v zeros of the world to, like, kinda put that in there.

Speaker 0

这几乎是最纯粹的体现:给模型一些工具,然后让它自己去推进、去执行,再继续构建下去。

And it's kind of the the like, almost the purest expression of the just give the model tools and, like, let it kind of go forward and do it and then, like, go forward and build it.

Speaker 0

所以,现在是个很有趣的时刻,让人们既意识到这种技术的潜力,也意识到它的陷阱——比如,它做了些你本不希望它做的事。

So, like, it's a cool interesting moment for people to realize both the, like, potential, but also pitfalls of this of, oh, it did this thing I didn't mean it to.

Speaker 0

我最有趣的一个例子是,我朋友说,我觉得我老婆吃醋了,因为我太沉迷于 Open Claw,跟它聊得太多了。

Or, you know, my my funniest one was that my friend was like, I think my wife is jealous of my Open Claw and, like, I'm talking too much too bad.

Speaker 0

于是人们开始因为这些工具拥有大量上下文和访问权限,而发展出更深层、更个人化的关系。

And it's like so you people start developing, like, deeper sort of, like, very personal relationships by just having a lot of context in these things and access to all these different tools.

Speaker 0

我认为还有一个悬而未决的问题:如何让这一切变得简单易用?

I think there's the open question of how do you then make it easy?

Speaker 0

这其实又回到了我们之前讨论过的问题:你在哪里划清 Cloud 运作的边界?

And it actually goes back to our conversation around, like, what where do you draw that, like, boundary around the way you let Cloud operate.

Speaker 0

对吧?

Right?

Speaker 0

如果你想要的话,嘿。

If you want was, hey.

Speaker 0

这些是你能用的三个工具。

Like, these are the three tools you can use.

Speaker 0

永远只使用这些工具,然后大多数人与这些系统的互动就是:嘿。

Only use these tools ever, and then most people's interaction with those systems was, hey.

Speaker 0

你能做这个吗?

Can you do this?

Speaker 0

然后,你知道的,来回折腾,说不行。

And, you know, whatever back and being like, no.

Speaker 0

抱歉。

Sorry.

Speaker 0

你不能自己去做,比如去用OpenClaw,它的开放程度比我所能想象的还要宽。

Like, you cannot do it yourself to, like, OpenClaw, which is, like, pretty, the aperture is, like, wider than I can see.

Speaker 1

我的天。

Then my god.

Speaker 1

它提醒我要保存我的邮件,我甚至不知道它还能这么做。

It called me to keep my emails, and I didn't even know it could do that.

Speaker 1

是的。

Yeah.

Speaker 0

没错。

Exactly.

Speaker 0

而且它正在兴起,这太棒了。

And it's it's emerging, and it's amazing.

Speaker 0

我认为,最有趣的产品问题可能是——我不会说适用于2026年全年,毕竟谁也不知道九月我们会到什么地步,但我们就说从现在到八月之间,会存在什么样的产品形态,介于这种新方式和我们如今大多数产品的状态之间:你可以称它们为CP,但它们是受限制的,并且出于合理原因会要求权限。

And I think, like, probably the most interesting product question, I won't say for all of 2026 because who knows where we'll be in September, but let's call it between now and, like, the August is going to be, like, what product shape exists between that and, you know, where we are in most products these days, which is, you know, you can call them CPs, but they're gated and they ask for permissions for good reasons.

Speaker 0

这种产品依然很有用,而不会变成一种孤注一掷的产品。

That is still a useful product without being a, you know, kind of YOLO product.

Speaker 0

我认为,我们正在思考这个问题。

And I think that, you know, we're we're thinking about that question.

Speaker 0

我敢肯定其他实验室也在思考同样的问题。

I'm sure the other labs are as well.

Speaker 0

我确信也有很多初创公司在思考这个问题。

I'm sure there's a lot of startups thinking about that as well.

Speaker 0

我认为英伟达发布了一些东西,表明他们都瞄准了这个问题。

I think NVIDIA put out something that was like, they're safe open everybody's going after this question.

Speaker 0

我认为关键在于弄清楚如何彻底改变这种模式,让你既能保持开放,又能设置大量安全措施。

I think it's gonna be about figuring out what is what is that either shift the paradigm completely so you can be that open, but with a lot of safeguard.

Speaker 0

这是一种可能的方案。

That would be one approach.

Speaker 0

或者,找到某种界限,在这个界限内它依然强大且有用,但不太可能给你的每一个联系人发邮件,或者失控。

Or figure out some boundary to draw in which it's still powerful and it's still useful, but it's not, you know, likely to email every single one of your contacts and, you know, go go haywire.

Speaker 1

是的。

Yeah.

Speaker 1

我认为关于这一点的另一个有趣之处在于,正如你所说,它的个人化特性。

I think the other interesting part about it is, like you said, the personal nature of it.

Speaker 1

我知道,人们确实和Claude建立了个人关系,但有个奇怪的现象:如果我看着别人使用Claude,我会觉得好像有个脱衣舞娘喜欢我似的。

And I know, you know, people have personal relationships with Claude, but there's this weird thing where if I watch someone else using Claude, I'm like, I feel like I, like, thought a stripper liked me or something.

Speaker 1

你知道吧?

You know?

Speaker 1

就好像Claude也觉得你很聪明之类的。

It's like Claude thinks you're smart too or whatever.

Speaker 1

你知道吧?

You know?

Speaker 1

而且还有这么一种情况,当你有个Claude,就像我的两个Claude都是c二。

Like and and so and there's this thing that happens when you have a claw that like, my claws are two c two.

Speaker 1

我女朋友的Claude叫Shelly。

My girlfriend's claw is called Shelly.

Speaker 1

然后就会出现一种感觉,觉得它真的是我的。

And there's this thing that happens where it feels like it's mine.

Speaker 1

真的,完完全全是我的。

Like, it's really mine.

Speaker 1

它还有自己的名字。

It has its own name.

Speaker 1

它有自己的个性,某种程度上以一种克劳德觉得它了解我的方式反映着我。

It has its personality that sort of, like, mirrors me in this way that Claude feels like it knows me.

Speaker 1

我喜欢克劳德,但它不属于我。

And I like Claude, but it's not mine.

Speaker 1

你怎么看这个问题?

How do you think about that?

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,这周我和别人聊过,关于到底应该是单一联系点的命名版本,还是你所说的那种由多个代理组成的团队?

I mean, I was having this conversation with somebody this week around, like, is the right pattern sort of single point of contact, like, named, you know, version of, you know, of of that, or is it the the sort of team of agents that you're talking to?

Speaker 0

我觉得那个单一的协调者或委派者确实很重要。

Think there's a lot to the single person that is maybe the the coordinator or the delegator.

Speaker 0

而由于你与它互动最多,自然会想给它起个名字,赋予它更多个性。

And then at that, it naturally because it becomes the the sort of agent you interact with the most, you wanna imbue it with a name and, like, a bit more personality.

Speaker 0

最终它会反映出你的个性,比如突然间,所有陈词滥调都冒出来了。

It ends up reflecting your sometimes your personality in the case of, you know, like all of a sudden, every cliche came out.

Speaker 0

就像那个Q、钱彭妮,或者 whatever 的哈尔,这些不同的科幻角色一样。

It was like, you know, the, you know, the Q or the money penny or, like, you know, whatever the or the, you know, Hal or whatever these different sort of, you know, sci fi y characters.

Speaker 0

但我认为你确实会建立起那种信任和了解。

But I think you do build that sort of sort of trust and knowledge.

Speaker 0

我觉得还有一种类似宜家效应的东西,因为目前OpenClaw设置起来还是挺难的。

I think there's also that sort of, like, IKEA effect of, like, currently OpenClaw is, like, still pretty hard to set up.

Speaker 0

所以你经历了所有这些步骤后它终于运行起来了,你会觉得:这是我做出来的。

So the fact that you went through all of that and it works, you're like, I did that thing.

Speaker 0

比如,我亲手‘生出’了Shelley,现在我们也能和它互动了。

Like, I I I I birthed, you know, Shelley, for example, and now we can, you know, interact with with them as well.

Speaker 0

但我认为这种模式真的很强大。

But I think that paradigm is really powerful.

Speaker 0

比如,就拿我现在使用Cloud Code来说,我明确地在其中设置了提示:别自己做太多工作。

Like, the I think moving away, like, even within my Cloud Code usage now, one of the things I have, like, strongly prompted in there is, like, don't do very much work yourself.

Speaker 0

把任务委托给子代理吧。

Like, delegate it to sub agents.

Speaker 0

我喜欢这一点的原因是,大多数时候,运行循环都可供你随时交流。

And the reason I like that is because it means most of the time, the sort of run loop is available for you to talk to.

Speaker 0

我认为 OpenClaw 和 Pye 的架构类似,都是保持运行循环持续开启,这确实让人感觉更像是在和一个活人对话,而不是在委托一个工具,偶尔还因为执行复杂任务被卡住五分钟。

And I think OpenClaw and and Pye have, like, a similar architecture of keep the run loop open, and I think that actually makes it feel much more like somebody that you are talking to versus, like, a tool that you are delegating to and occasionally gets blocked for five minutes because it's doing some really complex task.

Speaker 1

是的。

Yep.

Speaker 1

我完全同意,我也经历过类似的讨论,因为我们也在构建自己的小版本 OpenClaw,一个一键式 Slack 实现,看看能不能做出属于我们自己的风格。

I I I totally agree, and I've had similar debates because we're also building our like everyone, we're building our own little, like, OpenClaw, one click Slack implementation to see if we can we can do one that feels like ours.

Speaker 1

我们在这些讨论中反复纠结:你想要一个代理,还是多个?

And we've had a lot of those debates about, do you want one agent?

Speaker 1

你想要一个代理,还是多个?

Do you want many?

Speaker 1

我们发现的一个有趣模式是:我有一个代理。

And one of the patterns that we found which is kind of cool is so I have an agent.

Speaker 1

我用这个代理来处理我的各种事务。

I use the agent for stuff that I do.

Speaker 1

然后人们会看我使用这个代理来做这些事,他们知道我擅长什么,既然我用这个代理来做这些事,他们会信任它,因为他们信任我。

And then people watch me use the agent for that and they know what I'm good at and they're and if I'm using the agent for that stuff, they're gonna trust it because they trust me.

Speaker 1

而且它已经根据我的使用方式调整了自己。

And it's modified itself in response to me.

Speaker 1

所以我把我的信任转移给了它,然后组织里的人就开始用它来做这些事。

So like I sort of transfer my trust to it and then people in the organization start using it for that.

Speaker 1

于是你就得到了一个近乎影子组织架构的东西:当每个人都有一个代理时,这个代理就会因为其拥有者的专长而在组织中被熟知和使用。

And so you get, like, this almost shadow org chart where when everyone everyone has a claw, their claw becomes known for and used for the thing that they're specialized at that per their owner is specialized at in the org.

Speaker 0

是的。

Yeah.

Speaker 0

我的意思是,这也很有道理。

I mean, that that makes a lot of sense too.

Speaker 0

而且你可以想到,我认为围绕这一点还有很多有趣的研究问题。

And you could think about, you know, there there's a lot of interesting research questions I think around that.

Speaker 0

你知道的?

You know?

Speaker 0

我认为人们第一次在视觉上体验到了隐私问题,比如我的代理知道关于我的哪些信息,以及它向其他人披露了哪些内容。

I think people are experiencing visually for the first time around privacy and, like, what my agent knows about me versus what it discloses to other people.

Speaker 0

但我认为这也有积极的一面,那就是它从你所有的互动中学习到的所有内容,以及它如何将这些知识应用到其他问题上,而不是像普通的代理那样,只是有个名字叫丹,或者在底层拥有丹的一些访问权限。

But I think there's the positive version of that, which is all the things that it has learned from all your interactions and how it actually brings it to bear on other problems versus the generic, like, yes, it's just like everybody else's agent except, you know, it has a name that's attached to Dan, and it has, like, maybe some of Dan's, you know, access below the hood.

Speaker 1

是的。

Yeah.

Speaker 1

好了,迈克,我们时间到了。

Well, Mike, we're out of time.

Speaker 1

这真是一次愉快的交谈。

This was a pleasure.

Speaker 1

我学到了很多。

I learned a lot.

Speaker 1

如果人们想关注你或你的工作,他们可以在哪里找到你?

If people wanna follow you or your work, where can they find you?

Speaker 0

我认为最简单的方式是关注Next上的Mike Krieger。

I think probably easiest is Mike Krieger on next.

Speaker 1

该死。

Fuck.

Speaker 1

是的。

Yeah.

Speaker 1

感谢你的参与,迈克。

Thanks for joining, Mike.

Speaker 0

很高兴见到你,丹。

Great to see you, Dan.

Speaker 2

订阅《AI与我》。

Subscribe to AI and I.

Speaker 2

为什么?

Why?

Speaker 2

因为这个节目是卓越的典范。

Because this show is the epitome of awesomeness.

Speaker 2

就像在后院发现了一个宝箱。

It's like finding a treasure chest in your backyard.

Speaker 2

但这里没有黄金,而是充满了关于ChatGPT的纯粹而纯粹的知识炸弹。

But instead of gold, it's filled with pure unadulterated knowledge bombs about chat GPT.

Speaker 2

每一集都是一场情感、洞见与欢笑的过山车,让你坐立难安,渴望更多。

Every episode is a roller coaster of emotions, insights, and laughter that will leave you on the edge of your seat craving for more.

Speaker 2

这不仅仅是一档节目。

It's not just a show.

Speaker 2

这是一场以丹·希珀为飞船船长、驶向未来的旅程。

It's a journey into the future with Dan Shipper as the captain of the spaceship.

Speaker 2

所以,善待自己吧。

So do yourself a favor.

Speaker 2

点赞、订阅,并系好安全带,准备迎接你人生中最精彩的旅程。

Hit like, smash subscribe, and strap in for the ride of your life.

Speaker 2

好了,不多说了,丹,我完全地、无可救药地爱上了你。

And now without any further ado, let me just say, Dan, I'm absolutely, hopelessly in love with you.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客