Lenny's Podcast: Product | Career | Growth - AI的国情咨文:我们已越过拐点,黑暗工厂即将来临,自动化时间表 | 西蒙·威利森 封面

AI的国情咨文:我们已越过拐点,黑暗工厂即将来临,自动化时间表 | 西蒙·威利森

An AI state of the union: We’ve passed the inflection point, dark factories are coming, and automation timelines | Simon Willison

本集简介

西蒙·威利森是一位多产的独立软件开发者、博主,也是在AI对开发者影响方面最具影响力和最值得信赖的声音之一。他共同创建了Django——支撑Instagram、Pinterest及数以万计其他网站的Web框架。他创造了“提示注入”一词,推广了“AI垃圾”和“代理工程”等术语,并构建了100多个开源项目,包括被全球调查记者广泛使用的数据分析工具Datasette。西蒙的独特之处在于,他比几乎任何人都更彻底、更明显地完成了从传统软件工程向AI原生开发的转型,并且他一直在自己的博客SimonWillison.net上实时记录自己学到的一切。 在我们的深度对话中,西蒙分享了: 1. 为什么2025年11月是AI编码代理从“基本可用”跃升为“真正可用”的转折点 2. 西蒙如今如何用手机编写95%的代码,以及为何他上午11点就已精神耗竭 3. 为什么中阶工程师(而非初级工程师)目前面临最大风险 4. 西蒙每天使用的三种代理工程模式(红/绿TDD、模板、囤积) 5. 下一次飞跃:“暗工厂”模式——无人编写或审查代码,AI自行完成质量保证 6. 为什么提示注入是一个未解决的安全问题,以及可能导致AI“挑战者号”灾难的“致命三重奏” 7. 为什么“骑自行车的鹈鹕”成为AI模型质量的非官方基准 — 本节目由以下机构赞助: WorkOS——面向B2B SaaS的现代身份平台,前100万月活跃用户免费 Vanta——利用AI自动化合规、管理风险、加速信任 — 本集文字稿:https://www.lennysnewsletter.com/p/an-ai-state-of-the-union — Lenny播客全部文字稿存档:https://www.dropbox.com/scl/fo/yxi4s2w998p1gvtpu4193/AMdNPR8AOw0lMklwtnC0TrQ?rlkey=j06x0nipoti519e0xgm23zsn9&st=ahz0fj11&dl=0 — 如何找到西蒙·威利森: • X:https://x.com/simonw • LinkedIn:https://www.linkedin.com/in/simonwillison • 网站:https://simonwillison.net • 代理工程模式:https://simonwillison.net/guides/agentic-engineering-patterns — 如何找到伦尼: • 订阅通讯:https://www.lennysnewsletter.com • X:https://twitter.com/lennysan • LinkedIn:https://www.linkedin.com/in/lennyrachitsky/ — 本集涵盖内容: (00:00) 西蒙·威利森简介 (02:40) 2025年11月的转折点 (08:01) 当前AI编码的可能 (10:42) 感官编码 vs. 代理工程 (13:57) 暗工厂模式 (20:41) 瓶颈的转移 (23:36) 人类大脑仍将有价值的领域 (25:32) 为软件工程师辩护 (29:12) 为何经验丰富的工程师能获得更好结果 (30:48) 避免成为永久底层阶级的建议 (33:52) 利用AI放大你的技能 (35:12) 为何西蒙说他比以往更努力工作 (37:23) 2022年前人工编写代码的市场 (40:01) 预测:到2026年底,50%的工程师将编写95%的AI代码 (44:34) 低成本代码的影响 (48:27) 西蒙的AI技术栈 (54:08) 使用AI进行研究 (55:12) “骑自行车的鹈鹕”基准 (59:01) AI固有的荒谬性 (1:00:52) 囤积你擅长的事情 (1:08:21) 用红/绿TDD模式提升AI代码质量 (1:14:43) 使用优质模板启动项目 (1:16:31) 致命三重奏与提示注入 (1:21:53) 为何97%的准确率是不及格 (1:25:19) 偏差的正常化 (1:28:32) OpenClaw:被所有人忽视的安全噩梦 (1:34:22) 西蒙的下一步计划 (1:36:47) 零交付咨询 (1:38:05) 关于鸮鹦鹉的好消息 — 参考资料:https://www.lennysnewsletter.com/p/an-ai-state-of-the-union — 制作与营销由 https://penname.co/ 负责。如有关于赞助本播客的咨询,请发送邮件至 podcast@lennyrachitsky.com。 — 伦尼可能投资了本集讨论的公司。 如需收听更多内容,请访问 www.lennysnewsletter.com

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

很多人在一月和二月醒来后开始意识到,哇,真的如此。

A lot of people woke up in January and February and started realizing, oh, wow.

Speaker 0

我一天能写出一万行代码。

I can churn out 10,000 lines of code in a day.

Speaker 0

过去,你向ChatGPT要一些代码,它会生成一段代码,然后你得自己运行和测试。

It used to be you'd ask ChatGPP for some code, and it would spit out some code, then you have to run it and test it.

Speaker 0

编码代理会为你完成这一步。

The coding agents, they take that step for you.

Speaker 0

对我来说,一个开放性的问题是,还有多少其他知识型工作领域实际上容易受到这些代理循环的影响?

And an open question for me is how many other knowledge work fields are actually prone to these agent loops?

Speaker 0

现在我们

Now that we

Speaker 1

拥有了这种能力,人们几乎低估了自己能用它做什么。

have this power, people almost underestimate what they do with it.

Speaker 0

今天,我生成的代码中大约有95%不是我自己敲的。

Today, probably 95 of the code that I produce, I didn't type it myself.

Speaker 0

我很多代码都是在手机上写的。

I write so much of my code in my phone.

Speaker 0

这太疯狂了。

It's wild.

Speaker 0

我遛狗散步时沿着海滩也能高效完成工作。

I can get good work done walking the dog along the beach.

Speaker 0

过去每一年的新年决心,我总是告诉自己:今年我要更加专注。

My New Year's resolution every previous year, I've always taught myself this year, I'm gonna focus more.

Speaker 0

我要少接一些事情。

I'm gonna take on less things.

Speaker 0

今年,我的目标是接受更多任务,更加有抱负。

This year, my ambition was take on more stuff and be more ambitious.

Speaker 1

这种矛盾真有意思。

Such an interesting contradiction.

Speaker 1

人工智能本应让我们更高效。

AI is supposed to make us more productive.

Speaker 1

感觉那些最致力于AI建设的人比以往任何时候都更加努力。

It feels like the people that are most AI builder working harder than they've ever worked.

Speaker 0

很好地使用编码代理,需要我二十五年软件工程师经验的每一分积累。

Using coding agents well is taking every inch of my twenty five years of experience as a software engineer.

Speaker 0

我可以同时启动四个代理,让他们分别处理四个不同的问题。

I can fire up four agents in parallel and have them work on four different problems.

Speaker 0

到上午11点,我已经精疲力尽了。

By 11AM, I am wiped out.

Speaker 1

你预测我们迟早会遭遇一场巨大的灾难。

You have this prediction that we're gonna have a massive disaster at some point.

Speaker 1

你称之为AI的挑战者号灾难。

You call it the Challenger Disaster of AI.

Speaker 0

很多人早就知道那些小O型圈不可靠,但每次在没有O型圈失效的情况下成功发射航天飞机,整个机构就会对自身行为更加自信。

Lots of people knew that those little o rings were unreliable, but every single time you get away with launching a space shuttle without the o rings failing, you institutionally feel more confident in what you're doing.

Speaker 0

我们一直在以越来越不安全的方式使用这些系统。

We've been using these systems in increasingly unsafe ways.

Speaker 0

这迟早会找上门来。

This is gonna catch up with us.

Speaker 0

我的预测是,我们会看到一次挑战者号式的灾难。

My prediction is that we're gonna see a Challenger disaster.

Speaker 1

今天,我的嘉宾是西蒙·威利森。

Today, my guest is Simon Willison.

Speaker 1

西蒙,在我看来,是目前关于AI如何改变我们构建软件方式以及专业工作如何广泛变化的最重要、最有用的声音之一。

Simon, in my opinion, is one of the most important and useful voices right now on how AI is changing the way that we build software and how professional work is changing broadly.

Speaker 1

我喜欢西蒙的一点是,他不仅仅是在空中高谈阔论。

What I love about Simon is that he doesn't just pontificate in the clouds.

Speaker 1

他二十多年来一直是一位你所说的十倍工程师。

He's been what you'd call a 10x engineer for over twenty years.

Speaker 1

他共同创建了Django,这个网页框架支撑着Instagram、Pinterest、Spotify以及成千上万个其他平台。

He co created Django, the web framework that powers Instagram, Pinterest, Spotify, thousands of other platforms.

Speaker 1

他创造了‘提示注入’这一术语,推广了‘AI垃圾’和‘智能体工程’等概念,并且在他的100多个开源项目中,他开发了Datasette——一款已成为调查性新闻报道标配的数据分析工具。

He coined the term prompt injection, popularized the ideas of AI slop and agentic engineering, and amongst his 100 plus open source projects, he created Datasette, a data analysis tool that has become a staple of investigative journalism.

Speaker 1

西蒙的稀有之处在于,极少有工程师能像他那样全面且显著地完成从旧式开发到新式开发的转变。

What makes Simon rare is that very few engineers have made the leap from the old way of building to the new way as fully and visibly as he has.

Speaker 1

在拥抱这种新的开发方式时,他通过自己出色的博客 simonwilson.net 实时分享自己学到的一切。

And as he's leaned into this new way of building, he's been sharing everything he's learning in real time through his incredible blog, simonwilson.net.

Speaker 1

西蒙很少参加播客访谈,而这次对话让我在许多方面有了全新的认知。

Simon does not do a lot of podcasts, and this conversation opened my mind up in a bunch of new ways.

Speaker 1

我非常期待你能从西蒙身上学到东西。

I am so excited for you to get to learn from Simon.

Speaker 1

别忘了访问 lennysproductpass.com,那里为 Lenny 订阅者提供了独家的超值优惠。

Don't forget to check out lennysproductpass.com for an incredible set of deals available exclusively to Lenny's newsletter subscribers.

Speaker 1

好了,接下来请听西蒙·威利森。

With that, I bring you Simon Willison.

Speaker 1

西蒙,非常感谢你来到这里,欢迎来到这个播客。

Simon, thank you so much for being here, and welcome to the podcast.

Speaker 0

嘿,伦尼。

Hey, Lenny.

Speaker 0

真的太好了

It's really great

Speaker 1

能来这里我很高兴。

to be here.

Speaker 1

我很高兴你能来。

I am so excited to have you here.

Speaker 1

长期以来,我一直远远地仰慕你。

I've been such a fan of yours from afar for so long.

Speaker 1

我从你的博客中学到了很多。

I've learned so much from your blog.

Speaker 1

虽然我这个播客的每一位嘉宾都是我最喜欢的,但你是我最喜欢的那种嘉宾,因为你亲自动手,使用最新工具进行实际开发。

And even though every guest I have in this podcast is my favorite guest, you're my favorite kind of guest because you're on the ground building with the latest tools, using it for real.

Speaker 1

你非常擅长表达自己的体验。

You're very good at articulating what you experience.

Speaker 1

这次从你大脑中获取的回报会非常丰厚。我想先从一个AI现状总结开始。

So we're going get a lot of ROI out of this, out of your brain this time What that we went I want to start with is essentially an AI state of the union.

Speaker 1

你写过关于这个十一月的转折点。

You've written about this November inflection.

Speaker 0

是的。

Yes.

Speaker 1

所以我想,我们先简单回顾一下十一月发生了什么,以及我们现在处于什么阶段?

So what I'm thinking is we start, just kind of give us like a brief history lesson of just like what happened in November and where are we today?

Speaker 1

现在有哪些可能性?

What's possible now?

Speaker 0

好吧,我们简单聊聊整个2025年。

Well, let's let's talk about all of 2025 very briefly.

Speaker 0

2025年是Anthropic和OpenAI意识到代码就是应用的一年。

2025 was the year that, especially Anthropic and OpenAI, realized that code is the application.

Speaker 0

也就是能够让这些系统生成代码。

Like, being able having these things generate code.

Speaker 0

我认为部分原因是Anthropic在2025年2月左右推出了Claude Code,随后迅速火爆起来。

I think partly because Anthropic came up with Claude code back in in sort of February 2025, and it took off like crazy.

Speaker 0

很多人开始注册每月200美元的账户。

And a bunch of people started signing up for $200 a month accounts.

Speaker 0

于是突然间,天啊,原来人们愿意为这类技术在这一特定领域支付大量金钱。

And so suddenly, wow, it turns out people are willing to pay a lot of money for this stuff for that specific field.

Speaker 0

Anthropic和OpenAI在2025年全年都将所有训练精力集中在编程上。

Both Anthropic and OpenAI spent the whole of 2025 focusing all of their training efforts on coding.

Speaker 0

如果你看看他们当时在做什么,全是强化学习相关的工作。

If you look at what they were doing, it was all the reinforcement learning stuff.

Speaker 0

推理技巧——也就是模型声称自己在思考的这种能力——是2024年底才出现的新东西。

The reasoning trick, the thing where the models say they're thinking, that was new in late twenty twenty four.

Speaker 0

比如,OpenAI的o1是首个展现出这种能力的模型。

Like, OpenAI AI's o one was the first model to exhibit that.

Speaker 0

而现在,所有模型都具备了这种能力。

And now all of the models do it.

Speaker 0

所以,去年另一个重大趋势就是这些具备推理能力的模型。

So that was the other big trend of last year was these reasoning models.

Speaker 0

结果发现,推理能力对编程非常有用。

Turns out reasoning is great for code.

Speaker 0

它能够推理代码,找出错误的根源等等。

It can reason through code and figure out the root of bugs and all of that.

Speaker 0

因此,这两个实验室倾尽全力提升模型的编程能力,最终的结果是,在十一月,我们迎来了我所说的转折点——GPT 5.1 和 Claude Opus 4.5 横空出世。

And so the end result of this, the end result of these two labs throwing everything they had at making their models better at code, is in November, we had what I call the inflection point where GPT 5.1 and Claude Opus 4.5 came along.

Speaker 0

它们相比之前的模型只是略有改进,但这种改进恰好跨越了临界阈值。

And they were both just they were incrementally better than the previous models, but in a way that crossed the threshold.

Speaker 0

在此之前,如果你使用这些编程代理,你能让它们为你写一些代码。

Where previously, if you had these coding agents, you could get them to write you some code.

Speaker 0

大多数时候,代码基本能运行,但你必须非常仔细地检查。

And most of the time, it would mostly work, but you had to pay very close attention to it.

Speaker 0

而突然之间,我们从那种情况转变为几乎每次都能准确执行你的指令,这带来了天壤之别。

And suddenly, we went from that to almost all of the time, it does what you told it to do, which makes all of the difference in the world.

Speaker 0

现在,你可以启动一个编程代理,然后说:嘿。

Now you can spin up a coding agent and say, hey.

Speaker 0

帮我构建一个实现这个功能的 Mac 应用程序,你会得到一个结果,虽然还需要一些来回调整,但至少不会再是一堆完全无法运行的bug代码。

Build me a Mac application that does this thing, and you'll get something back, which still needs some back and forth, but it won't just be a buggy pile of rubbish that doesn't do anything.

Speaker 0

这非常令人着迷,因为所有在假期期间抽出时间摆弄这些工具的软件工程师,都突然有了一个顿悟时刻:天啊。

That was fascinating because all of the software engineers who took time off over the over the holidays and started tinkering with this stuff got this moment of realization where it's like, oh, wow.

Speaker 0

这东西现在真的能用了。

This stuff actually works now.

Speaker 0

我可以告诉它去写代码。

I can tell it to build code.

Speaker 0

只要我对代码的描述足够清晰,它就能遵循指令,构建出我要求的东西。

And if I describe that code well enough, it'll follow the instructions, and it'll build the thing that I asked it to build.

Speaker 0

我认为这一变化对软件工程领域的影响仍在持续震荡。

I think the reverberations of that are still shaking us to to to software engineering.

Speaker 0

很多人在一月和二月才醒悟过来,意识到:天啊。

A lot of people woke up in January and February and started realizing, oh, wow.

Speaker 0

这项我一直关注的技术,突然变得极其出色了。

This technology, which I've been kind of paying attention to, suddenly it's got really, really good.

Speaker 0

那这意味着什么?

And what does that mean?

Speaker 0

比如,我能一天写出一万行代码,而且大部分都能运行,这代表什么?

Like, what does the fact like, I can churn out 10,000 lines of code in a day, and most of it works.

Speaker 0

这是好事吗?

Is that good?

Speaker 0

我们如何从‘大部分能运行’进步到‘全部都能运行’?

Like, how do we get from most of it works to all of it works?

Speaker 0

我们正面临这么多新问题,我认为这使我们成为其他信息工作者的风向标。

There are so many new questions that we're facing, which I think makes us a bellwether for other information workers.

Speaker 0

代码比你交给这些代理的几乎所有其他问题都更容易,因为代码明显是对还是错。

Like, code is easier than almost every other problem that you pose these agents because code is obviously right or wrong.

Speaker 0

它会生成代码。

Like, it produces code.

Speaker 0

你运行这段代码。

You run the code.

Speaker 0

它要么能运行,要么不能运行。

Either it works or it doesn't work.

Speaker 0

可能有一些微妙的隐藏bug,但一般来说,你能判断出这个东西是否真的能运行。

There might be a few subtle hidden hidden bugs, but generally, you can tell if the thing actually works.

Speaker 0

如果它给你写一篇论文,或者帮你起草一份法律诉讼,那就很难判断它到底做得好不好,也很难判断它是对是错。

If it writes you an essay, or if it writes you a law like, prepares a law lawsuit for you, there are so it's so much harder to derive if it's actually done a good job to figure out if it got things right or wrong.

Speaker 0

但这种事情正在我们身上发生。

But it's kind of happening to us.

Speaker 0

所以软件工程师,首先轮到我们了,我们正在思考:我们的职业前景会是什么样子?

So software engineers, it came for us first, and we're figuring out, okay, what do our careers look like?

Speaker 0

当过去占我们大部分时间的工作现在不再耗费大部分时间时,我们的团队协作方式会变成什么样?

How do we work as teams when part of what we did that used to take a lot most most of the time doesn't take most of the time anymore?

Speaker 0

那会是什么样子?

What's that look like?

Speaker 0

未来看到这种情况如何扩展到其他信息工作者,将会非常有趣。

And it's gonna be very interesting seeing how this rolls out to to other information work in the future.

Speaker 1

本集由本季的冠名赞助商WorkOS赞助。

This episode is brought to you by our season's presenting sponsor, WorkOS.

Speaker 1

OpenAI、Anthropic、Cursor、Vercel、Replit、Sierra、Clay以及数百家其他成功公司有什么共同点?

What do OpenAI, Anthropic, Cursor, Vercel, Replit, Sierra, Clay, and hundreds of other winning companies all have in common?

Speaker 1

它们都由WorkOS提供支持。

They are all powered by WorkOS.

Speaker 1

如果你正在为企业市场开发产品,你一定经历过集成单点登录、SCIM、RBAC、审计日志以及其他大型企业所需功能的痛苦。

If you're building a product for the enterprise, you've felt the pain of integrating single sign on, SCIM, RBAC, audit logs, and other features required by large companies.

Speaker 1

WorkOS将这些阻碍交易的难题转化为即插即用的API,专为B2B SaaS打造的现代化开发者平台。

WorkOS turns those deal blockers into drop in APIs with a modern developer platform built specifically for B2B SaaS.

Speaker 1

我投资的每一个初创公司,只要开始拓展高端市场,最终都会选择WorkOS,因为它们确实是最出色的。

Literally every startup that I'm an investor in that starts to expand upmarket ends up working with WorkOS, and that's because they are the best.

Speaker 1

无论你是正在争取首个企业客户的种子期初创公司,还是正在全球扩张的独角兽企业,WorkOS都是最快实现企业级准备并突破增长瓶颈的途径。

Whether you are a seed stage startup trying to land your first enterprise customer or a unicorn expanding globally, WorkOS is the fastest path to becoming enterprise ready and unblocking growth.

Speaker 1

它本质上就是企业级功能的Stripe。

It's essentially Stripe for enterprise features.

Speaker 1

访问 workos.com 开始使用,或者直接去他们的 Slack,那里有真正的工程师等着回答你的问题。

Visit workos.com to get started or just hit up their Slack where they have actual engineers waiting to answer your questions.

Speaker 1

WorkOS 让你通过出色的 API、详尽的文档和流畅的开发者体验更快地构建应用。

WorkOS allows you to build faster with delightful APIs, comprehensive docs, and a smooth developer experience.

Speaker 1

立即前往 workos.com,让你的应用今天就具备企业级能力。

Go to workos.com to make your app enterprise ready today.

Speaker 1

我想再回到一个问题,那就是现在究竟什么是可能的。

I wanna come back to just, like, what is possible now.

Speaker 1

为了给大家一点背景,我们取得的进步简直令人难以置信。

So just to give us a little context, it's, like, insane how far we've come.

Speaker 1

我不记得几年前,所有的代码都是人工编写的。

I don't know, like couple years ago, all code was human written.

Speaker 1

然后就变成了点一下补全。

Then it's like, tap complete.

Speaker 1

接着就是,现在最优秀的工程师完全依赖 AI 编码。

Then it's like, okay, now the best engineers are 100% AI code.

Speaker 1

现在我就像在为我的手机写代码。

Now it's like, I'm like coding for my phone.

Speaker 1

我甚至都不再看我的代码了。

Like, I'm not even looking at my code anymore.

Speaker 1

这就像是到了

That's like where

Speaker 0

我确实经常在手机上写很多代码。

I I write so much of my code on my phone.

Speaker 0

这太疯狂了。

It's it's wild.

Speaker 0

比如,我边遛狗边在海滩上散步,也能高效地完成工作,这真的很棒。

Like, I I can get good work done walking the dog along the beach, which is delightful.

Speaker 0

你知道的吧?

You know?

Speaker 1

是的。

Yeah.

Speaker 1

我让Boris Journey用手机编程,他也在做同样的事。

I had Boris Journey on the pocket, and he's doing the same thing.

Speaker 1

我当时就想,这还算编程吗?

And and I was just like, is that even coding anymore?

Speaker 1

他说,是的。

He's like, yeah.

Speaker 1

这只是另一种程度的抽象。

It's just another level of abstraction.

Speaker 1

就像工程领域一直以来的发展一样。

Just like engineering has always gone.

Speaker 1

想想看,还有哪些周围的东西可能被忽略了?

Talk about maybe just like what else is there around?

Speaker 1

现在AI在构建方面还能实现哪些人们可能尚未充分认识到的可能性?

Just like what is possible now with AI in terms of building that people may not fully recognize?

Speaker 1

你认为下一个飞跃会是什么?

And what do you think what's like the next leap?

Speaker 1

还有比这更进一步的吗?

Is there anything beyond this?

Speaker 1

我们来聊聊吧

Let's talk

Speaker 0

关于两种类型的编码,一种是Vibe编码,另外一种,我喜欢安德烈·卡皮蒂对Vibe编码的原始定义,就是你根本不去看代码,完全凭感觉行事。

about the two the sort of there's the Vibe coding side of things, And then there's the and and I like Andrei Kapiti's original definition of vibe coding, which is when you don't even look at code and you basically just go on the vibes.

Speaker 0

你只说:给我做个能完成X功能的东西,它就自动生成了,然后你去试用它。

You say, build me something that does x and it builds it, and you play with it.

Speaker 0

如果效果不错,那就太好了。

And if it looks good, then great.

Speaker 0

如果没完全达到预期,你就不断来回调整。

And if it doesn't quite do it, you you you keep on going back and forth with it.

Speaker 0

但整个过程非常被动。

But it's very hands off.

Speaker 0

你根本不会去看代码。

You're you're not looking at code.

Speaker 0

他最初说,这在原型设计中用来玩玩很棒,但后来它远远超出了这个范围。

It's so he he originally said, this is great for having fun in prototyping, and it then expand exploded way out of that.

Speaker 0

我认为如今,Vibe 编程实际上就是我所定义的:你不看代码,不在乎代码,甚至可能根本不懂代码。

And I think today, Vibe coding is effectively it's the the definition I use is it's when you're not looking at the code, you don't care about code, and maybe you don't understand the code.

Speaker 0

比如,非程序员现在可以告诉 Claude 要构建什么,而它能为他们搭建一个小应用。

Like, non programmers can now tell Claude what to build, and it can build them a little app.

Speaker 0

我非常喜欢这一点。

And I love that.

Speaker 0

我绝对热爱这种趋势——我们正在让普通人也能掌握让计算机为你做事的艺术,通过开发这些小工具来自动化生活中的繁琐事务。

I absolutely love that we're sort of democratizing the art of getting a computer to do stuff for you, of automating tedious things in your life by knocking out these little tools.

Speaker 0

当然,问题在于,负责任地使用这种做法是有极限的。

Of course, the problem is that there is a limit on how much you can do that responsibly.

Speaker 0

我喜欢告诉别人,如果你只是为自己做 Vibe 编程,出了bug只会伤到你自己,那就尽情发挥吧。

Like, I I like to tell people, if you're Vibe coding something for yourself where the only person who gets hurt if it has bugs is you, go wild.

Speaker 0

这完全没问题。

That's completely fine.

Speaker 0

一旦你为他人编写Vibe代码,而你的bug可能会伤害到别人时,你就需要停下来好好想想了。

The moment you're you're vibe coding code for other people to use, where your bugs might actually harm somebody else, that's when you need to take a step back and say, hang on a second.

Speaker 0

这种使用这些工具的方式是不负责任的。

This is not a responsible way of using the the the these tools.

Speaker 0

挑战在于,判断什么是负责任的、什么是不负责任的,本身就需要专家级别的判断力。

The challenge is that understanding what's responsible and what isn't is in itself a sort of expert level skill.

Speaker 0

所以,当你开始抓取他人网站时,可能因为访问过于频繁而损害他们的网站。

So knowing that once you start dealing with, like, scraping other people's websites, maybe you'll damage their websites by hitting them too hard.

Speaker 0

如果你不了解自己在做什么,可能会以无数种方式造成损害。

There are so many ways that you can cause damage if you don't know what you're doing.

Speaker 0

但我热爱这种自由,也喜欢人们带着自己快速搭建的原型来参加会议,用它来展示他们的想法。

But I love that liberation, and I love that people can come to meetings with a prototype that they knocked up of their idea that illustrates the idea.

Speaker 0

我觉得这些做法非常棒。

I think those things are wonderful.

Speaker 0

当前持续争论的问题是:当专业软件工程师使用这些工具编写经过审查、细节都已确认的生产级代码时,我们该如何称呼这种行为?

The big debate or the ongoing debate has been, what do we call it when a professional software engineer uses these tools to write real code that's production ready that they've reviewed and they've checked all of the details of?

Speaker 0

很多人也把这叫做氛围编码。

A lot of people call that vibe coding as well.

Speaker 0

我认为这贬低了‘氛围编码’这个术语,因为说‘我用氛围编码做了这个’是有用的,意思是连它是怎么工作的都没看过。

I think that devalues vibe coding as a term, because it's useful to say, I vibe coded this, as in, I haven't even looked at how it works.

Speaker 0

它不是生产就绪的,但算得上是一个很酷的原型。

It's not production ready, but it's kind of a cool prototype.

Speaker 0

当‘氛围编码’意味着所有涉及AI的内容时,实际上最终还是需要编程,因为我们正朝着代码在某种程度上由AI中介的方向发展。

The moment vibe coding mean everything involved that touches AI, it effectively ends up needing programming because we're all moving in the direction where our code is mediated through AI at some point.

Speaker 0

那么,对于专业人士,我们该叫它什么?

So what do we call it for professionals?

Speaker 0

我选择用‘代理工程’,因为我想强调的是这些编码代理。

I've gone with agentic engineering because I think the thing to emphasize is these coding agents.

Speaker 0

对吧?

Right?

Speaker 0

如果你让ChatGPT帮你写点代码,这和你运行编码代理让它自动编写、调试、测试代码,是两回事。

If you're asking ChatGPT to knock out some code, that's a different thing from if you're running codecs and having it run write the code, debug the code, test the code, all of that.

Speaker 0

我认为代理工程是一门深刻而引人入胜的学科。

And I think that agentic engineering is such a deep and fascinating discipline.

Speaker 0

因为要从这种技术中获得出色的结果——比如让它们帮助你构建可以部署给百万用户的软件——这永远不会容易。

Because the art of getting really good results out of this, like, the art of having them help you build software you could deploy to a million people, that's not that's never gonna be easy.

Speaker 0

这永远不会是小事。

That's never gonna be trivial.

Speaker 0

它始终需要对软件、软件如何工作以及这些代理如何运作有深厚的经验。

That's always going to require a great deal of depth of experience in what software and how software works and how how these agents work.

Speaker 0

我非常喜欢这一点。

And I love that.

Speaker 0

我现在正在写一本关于这个主题的书,我会逐章在我的博客上发布。

That's I'm I'm kind of writing a book about it now that I'm publishing a chapter at a time on my blog.

Speaker 0

最好的写作方式就是,因为我没有编辑,也没有出版商的压力,只要我想写下一章,我就可以写。

The the best form of writing, because I don't have an editor or any pressure from a publisher, is just when I feel like writing another chapter, I can I can do that?

Speaker 0

但还有太多内容值得探讨。

But there's so much to discuss.

Speaker 0

但确实,我认为目前的前沿问题是:我们如何使用编码智能体来构建专业软件?

But, yeah, so I think right now, the frontier is how do we build professional software using coding agents?

Speaker 0

我们该如何构建软件?我不只是想构建好的软件。

How do we build software that is it I don't just want to build software that's that's good.

Speaker 0

我想让我们构建出比以前更好的软件。

I want us to build software that is better than we were building before.

Speaker 0

如果智能体让我们能快一点,但我们产出的软件质量依然如故,这对我来说不如软件缺陷更少、功能更多、质量更高、因为利用了这些工具而变得更好来得有趣。

Like, if the agents let us move a bit faster, but we're still churning out the same quality of software, that's less interesting to me than if the software we're producing has less bugs, more features, it's higher quality, it's better software because we're harnessing these tools.

Speaker 0

真正有趣的未来,是一些人称之为‘暗工厂’模式或软件工厂的东西。

The really interesting future is something which some people have been calling the dark factory pattern or software factories.

Speaker 0

这个概念是:如今,如果你是专业使用者,你的做法是告诉智能体要构建什么,然后查看代码,仔细审查以确保它做的是正确的事。

This is the idea where right now, if you're a professional using these tools, the way you do it is you tell them what to build, and then you look at the code, and you review that code really carefully and make sure it's doing the right thing.

Speaker 0

如果你不审查代码,也不查看这些代码,但同时又不是‘凭感觉编码’,那会是什么样子?

What does it look like if you're not reviewing the code, if you're not looking at that code, but you're also not Vibe coding?

Speaker 0

你不是把一切扔给命运,然后看会发生什么。

You're not throwing everything to the wind and seeing what happened.

Speaker 0

你正在将专业实践和质量标准应用于你并未直接审查的代码。

You're applying professional practices and quality expectations to code that you're not directly reviewing.

Speaker 0

之所以称之为‘暗工厂’,是因为在工厂自动化中,有一种理念认为,如果工厂的自动化程度高到无需人员在场,就可以把灯关掉。

The reason it's called the dark factory is there's this ID idea in factory automation that if your factory is so automated that you don't need any people there, you can turn the lights off.

Speaker 0

如果工厂车间不需要人,机器就可以在完全黑暗的环境中运行。

Like, the machines can operate in complete darkness if you don't need people on the factory floor.

Speaker 0

这对软件来说意味着什么?

What does that look like for software?

Speaker 0

有一家名为Strong DM的公司正在积极推动这一点,并围绕此进行了许多有趣的实验。

And there's some very intro this company called Strong DM has been pushing this and doing some really interesting experiments around this.

Speaker 0

我认为,这才是真正具有未来感的方向。

That, I think, is the net that's that's futuristic.

Speaker 0

我们正在努力探索这种模式的具体形态,以及如何现在就以负责任的方式构建这样的软件,并且已经取得了一些非常有趣的发现,了解到哪些方法有效、哪些无效。

Like, that's we're trying to figure out what that looks like and how we can responsibly build software in that way right now and making some quite interesting, like, discoveries about things that work and things that don't work.

Speaker 0

但对我来说,这才是下一个需要突破的壁垒。

But that to me is is the next the next sort of barrier.

Speaker 1

让我们沿着这个思路继续下去。

Let's follow that thread.

Speaker 1

那么,这个工厂到底在做什么?

So what is what is this factory doing?

Speaker 1

实际上没有人审查代码,但这如何改变了软件的构建方式?

So there's an element of no one's looking at the code really, but how does that change how software is built?

Speaker 1

人们是否仍然在提出想法,并告诉这个工厂去构建这些东西?

Are people still coming up with the ideas and telling you this factory built this

Speaker 0

就是为了这个目的。

thing for exactly.

Speaker 0

有趣的是,现在有一项政策:任何人都不能编写代码,越来越多的公司开始推行这一点,因为——

So this is the fascinating thing is, so there's a policy of nobody writes any code, and quite a few companies are beginning to introduce that now because- Just

Speaker 1

明确一下,这项政策是:你不能编写代码。

to be clear, the policy is you cannot write code.

Speaker 1

代码必须由嗯来编写。

It has to be written by yeah.

Speaker 0

把代码输入计算机。

Code into a computer.

Speaker 0

没错。

Exactly.

Speaker 0

是的。

Yeah.

Speaker 0

说实话,六个月前我还觉得这太疯狂了。

And, honestly, like, I thought six months ago, I thought that was crazy.

Speaker 0

而今天,我写的代码中大概有95%并不是我自己敲的。

And today, probably 95% of the code that I produce, I didn't type it myself.

Speaker 0

所以这个世界已经切实可行了,因为最新的模型已经足够好,你可以告诉它们:‘不,’

So that world is is is is practical already because these the latest models are good enough that you can tell them, oh, no.

Speaker 0

重命名那个变量,重构一下,再在那儿加一行代码。

Rename that variable and refactor that and and add this line there.

Speaker 0

它们就会直接帮你做完。

And they'll just do it.

Speaker 0

这比你自己敲键盘要快得多。

It's faster than you typing on the keyboard yourself.

Speaker 0

下一个规则是:没人读代码。

The next rule, is nobody reads the code.

Speaker 0

这一点正是StrongDM在去年八月左右开始推行的。

And this is the thing which StrongDM started doing back in, I think it was August.

Speaker 0

他们说,好吧。

They said, okay.

Speaker 0

我们不会去读代码。

We're not gonna read the code.

Speaker 0

那这意味着什么?

So what does that mean?

Speaker 0

如果你不读代码,怎么才能产出可靠且优质的软件呢?

How do you produce software that works and is good if you're not reading the code?

Speaker 0

他们已经提出了一整套解决方案。

And they've come up with a whole bunch of answers.

Speaker 0

他们做测试的方式中最有趣的一种是,在传统软件开发中,一些公司会设有质量保证部门。

One of the most interesting was the way they did testing, where in traditional software, some companies will have a QA department.

Speaker 0

比如,工程师编写大量代码后,就把它们扔给质量保证部门,由他们疯狂地测试以判断代码是否正常工作。

Like, the engineers write a bunch of software, and you throw it over the wall to the QA department, and they sort of test it furiously to figure out if it's working or not.

Speaker 0

据我在硅谷所见,过去五到十年间,这种做法逐渐不再流行,因为你希望工程师对自己的代码质量负责。

That, I think, went out of fashion a bit over the past sort of five to ten years from what I've seen in Silicon Valley, because you kind of want your engineers to take responsibility for the code they're writing being good.

Speaker 0

但如果你能模拟这个质量保证部门呢?

But what if you can simulate that QA department?

Speaker 0

因此,StrongDM的做法是,他们部署了一群代理测试员,这些测试员实际上是在模拟最终用户的行为。

So what StrongDM were doing is they had a swarm of agent testers who were actually simulating simulating end users.

Speaker 0

他们开发的这款软件,说起来简直疯狂。

So the software that they were building this is crazy.

Speaker 0

这款软件是用于访问管理的安全软件。

The software is security software for access management.

Speaker 0

当你加入一家公司时,需要为你的账号分配访问Jira、Slack等系统的权限,他们正是为此类需求开发软件。

So when you sign it when you start as a company and somebody needs to assign you access to Jira and then give you access to Slack and all of that kind of thing, they were building software for that.

Speaker 0

这非常接近安全领域。

That's very security, like, adjacent.

Speaker 0

根据大多数人对世界运作方式的理解,这根本不是你应该用直觉编程的事情。

That's not the kind of thing that you should be Vibe coding at all based on most people's understanding of how the world works.

Speaker 0

但他们是一家真正的安全公司,多年来一直在做这些事情,而且没有使用人工智能。

But that's and they're a they're a legitimate security company who've been doing this stuff without AI for years.

Speaker 0

所以他们并不是不了解其中的风险。

It's So not like they didn't understand the risks.

Speaker 0

他们进行测试的方式是,有一群模拟的员工在一个模拟的Slack频道里,不断说类似‘有人能给我分配一下Jira的权限吗?’这样的话。

So the way they did their testing is they had this swarm of simulated employees all in a simulated Slack channel saying things like, hey, could somebody give me access to Jira?

Speaker 0

这个Slack频道本身也是模拟的。

The Slack channel itself is simulated.

Speaker 0

我们稍后会谈到这一点。

We'll talk about that in a moment.

Speaker 0

他们全天候24小时不断发出请求,说:‘嘿。'

And they twenty four hours a day, they're making requests and saying, hey.

Speaker 0

我需要访问Jira以及所有类似的东西,成本非常高。

I need access to Jira and all of those kinds of things at enormous cost.

Speaker 0

他们每天在令牌上花费大约一万美元,模拟所有这些终端用户。

Like, they were spending $10,000 a day on tokens, I think, simulating all of these end users.

Speaker 0

我相信是这样的。

I believe so.

Speaker 0

但这意味着他们的软件在各种方式下都得到了非常严格的测试。

But it meant that their software was being very robustly tested in all of these different ways.

Speaker 0

是的,这有点类似于拥有一个从不休息的手动QA团队。

And, yeah, it's kind of similar to having a similar to having a manual QA team except one that never sleeps.

Speaker 0

我觉得这作为一个跳出框框思考的例子非常有趣,这个问题是:如果我们不审查代码,怎么知道我们的软件是好的?

And I thought that was fascinating as a sort of example of thinking outside of the box, taking this question, how do we tell our software's good if we're not reviewing the code?

Speaker 0

并试图找到创造性的答案。

And trying to find creative answers to it.

Speaker 0

另一个有趣的地方是,这个Slack频道实际上并不是Slack。

The other thing that was interesting is that the Slack channel itself wasn't actually Slack.

Speaker 0

因为事实证明,如果你用像Slack这样的真实软件进行测试,它们都有速率限制。

Because it turns out, if you test against real software like Slack and so forth, they all have rate limits.

Speaker 0

而且它们不会允许你同时运行一万个模拟用户。

And like they they they they won't let you just run 10,000 simulated people at a time.

Speaker 0

所以他们构建了自己的Slack、Jira、Okta以及所有这些集成软件的模拟版本。

So what they did is they built their own simulation of Slack and Jira and Okta and all of this software they were integrating with.

Speaker 0

他们的方式是,基本上采用了Slack公开API的文档和开源客户端库,然后告诉他们的编码代理:‘把这个构建出来。’

And the way they did that is they basically took the API documentation for the public APIs for Slack and the client libraries that the open source client libraries, and they told their coding agents, build this.

Speaker 0

给我构建一个这个API的模拟版本。

Build build me a simulation of this API.

Speaker 0

他们真的做到了。

And they did.

Speaker 0

所以这家公司——这是我去年十月参加他们演示时了解到的一件事。

So this company is and this is one of the things that they I went to a demo that they gave back in October.

Speaker 0

让我印象特别深刻的是,他们拥有自己模拟的Slack、Jira以及所有这些不同系统的版本,可以用来开发他们的软件,而且成本为零,因为一旦启动,它就是一个小小的Go语言二进制文件,一直运行在那里。

One of the things that really sat with me is that they had their own simulated version of Slack and Jira and all of these different package different systems that they could then build their software against, which cost nothing because once they spun it up, it was a little Go binary that sat there.

Speaker 0

他们甚至还有界面。

And they even had interfaces.

Speaker 0

他们搞了一个假版的Slack界面,是用Vibe代码写成的,让他们能看清发生了什么。

They had, like, a fake version of the Slack interface that they'd co a like, Vibe coded up that let them see what was going on.

Speaker 0

简直太迷人了。

Absolutely fascinating.

Speaker 1

这是一个非常酷的故事,我喜欢这类关于前沿公司探索可能性并获得优势的故事。

That is such a cool story, and I love these stories of just companies at the bleeding edge trying to see what's possible, and have an advantage essentially.

Speaker 1

所以,我在这里听到的是,QA环节成了这个工厂里的新组成部分。

So what I'm hearing here is the QA piece is like the new piece in this factory.

Speaker 1

我们已经有了Codex和Cloud Code。

So we, you know, we already have Codex, Cloud Code.

Speaker 1

它们可以去自行构建东西。

They can go off and build stuff.

Speaker 1

这里的创新点在于,是这样吗?

Is the innovation here okay.

Speaker 1

现在你已经构建了所有这些东西。

Now you've built all this stuff.

Speaker 1

它们真的好吗?

Is it actually any good?

Speaker 1

有没有理由说明 Codex 和 Cloud Code 无法自己完成这些?

Is there a reason like Codex and Cloud Code couldn't do this themselves?

Speaker 1

为什么你需要这种工厂的概念?

Why do you need kind of this factory concept?

Speaker 0

我觉得它们可以。

I think they can.

Speaker 0

你可以告诉 Claude Code,启动一个使用 Playwright 模拟浏览器的子代理,诸如此类的事情。

Like, you can tell Claude Code, fire up a sub agent that uses Playwright to simulate a browser and so and all of that kind of thing.

Speaker 0

但你要让它连续运行 24 小时,可能会遇到困难。

I you'd have trouble getting it to run twenty four hours a day.

Speaker 0

我的意思是,也许它能工作。

I mean, maybe it would work.

Speaker 0

但毫无疑问,让我感兴趣的重点并不是你所使用的软件。

But certainly, I think that what's interesting to me isn't so much the software you're using.

Speaker 0

而是你用来回答这些问题的这些大型方法和技巧。

It is these these big IDs, these these these techniques that you're using to try and answer these questions.

Speaker 0

因为即使你的质量保证团队,你的虚拟质量保证团队说这没问题,也不意味着它就是安全的。

Because even if your QA team, your virtual QA team says this is good, doesn't mean it's secure.

Speaker 0

对吧?

Right?

Speaker 0

这并不意味着你具备了所有其他你关心的特性。

It doesn't mean that you've got all of those other characteristics you care about.

Speaker 0

同时,这些智能体在安全渗透测试方面现在变得非常出色。

At the same time, the agents are getting really good at security penetration testing now.

Speaker 0

我认为,这是个新现象——在过去大约三到六个月里,它们开始被视为可信的安全研究人员,这在安全研究领域引发了巨大震动。

And this is a new thing, I think, in the past again, in the past sort of three to six months, they've started being credible as security researchers, which is sending shockwaves through the security research industry.

Speaker 0

他们惊讶地说:天啊。

They were like, wow.

Speaker 0

我们没想到它们会达到这个地步。

We didn't think that they'd get to this point.

Speaker 0

有趣的是,OpenAI 和 Anthropic 都拥有专门的安全模型,但不会向公众发布,因为这些模型可用于入侵网站。

What's interesting there is both OpenAI and Anthropic have specialist security models that they will not release to the general public because they can be used to break into websites.

Speaker 0

所以它们采用邀请制,只有经过注册的安全研究人员才能申请访问权限。

So they have, like, invite only, like, registered security researchers can apply for access.

Speaker 0

它们一直在为流行的开源软件生成漏洞报告。

And they've been producing vulnerability reports against popular open source software.

Speaker 0

我认为就在几天前,也许是上周,Firefox 表示他们的发布得到了 Anthropic 的协助。

I think Firefox just a few days ago, maybe last week, said that they'd they'd done a release which was assisted by Anthropic.

Speaker 0

Anthropic 发现了 Firefox 中约 100 个潜在漏洞,并负责任地向 Mozilla 报告,后者随后修复了这些问题。

Anthropic had discovered a 100, like, potential vulnerabilities in Firefox and responsibly reported them to Mozilla, who then fixed them.

Speaker 0

这一点也很有意思,因为我们在现实中看到了大量类似情况,这对维护者来说极其令人沮丧——因为有一些人根本不懂行,却让 ChatGPT 帮忙找安全漏洞,然后把报告提交给维护者,而这些报告看起来非常专业。

That's an interesting one as well because we're seeing a lot of this in the wild, and it's it's just incredibly frustrating for maintainers because there are these people who don't know what they're doing, who are asking ChatGPT to find a security hole and then reporting it to the maintainer, and the report looks good.

Speaker 0

比如,ChatGPT 能生成格式非常规范的漏洞报告。

Like ChatGPT can produce a very well formatted report of a vulnerability.

Speaker 0

这完全是浪费时间。

It's a total waste of time.

Speaker 0

因为这并没有被证实是真实存在的问题。

Like, it's not actually verified as being a real problem.

Speaker 0

Anthropic 和 Firefox 的区别在于,Anthropic 的安全团队确实做了实际工作。

The difference with Anthropic and Firefox is that Anthropic security team actually did do the work.

Speaker 0

他们没有直接报告 AI 代理所说的内容。

They didn't report whatever the agent said.

Speaker 0

他们在提交之前,真正验证了这是一份高质量的报告。

They actually verified that it was a good quality report before they handed it over.

Speaker 1

安全方面还有很多值得讨论的地方。

There's gonna be a lot to talk about on the security side.

Speaker 1

你对那里的风险已经做了大量思考和写作,但我想继续这个话题。

You've done a lot of thinking and writing about the dangers there, but I want to follow this thread.

Speaker 1

所以,从 AI 为团队所做的工作来看,如果你仔细想想,它就像是在中间位置不断扩展。

So in terms of what AI has been doing for teams, if you think about it, it's like it's kind of going on the middle and expanding.

Speaker 1

所以这就像是写作,越来越多地承担起构建组件的任务。

So it's like writing, you know, taking on more and more of the building components.

Speaker 1

它现在在做代码审查,就像你描述的那样,还在持续进行质量保证和构建。

It s doing code reviews now, QA as you ve been describing, constantly building.

Speaker 1

而目前这个领域的前沿,正是巨大的机会所在——那就是想清楚:我们到底该构建什么?

And it feels like the front of that is the big now gap in opportunity which is coming up with the idea, what the heck should we build?

Speaker 1

因为一旦你告诉AI去构建这个东西,正如你所说,它在创建出色成果方面正变得越来越出色。

Cause then once you tell the AI, build this thing, as you're describing it's getting better and better at building something great.

Speaker 1

你有没有在这一点上用AI取得过一些成果?你认为它会不会开始吞噬这个环节,成为策略本身,也就是产品经理的角色?

Have you had any luck yet with using AI there and do you think it starts to eat that and becomes the strategy, you know, PM, basically?

Speaker 0

所以,这是我们目前在所有这些工作中遇到的最有趣的问题之一:我们已经把编写代码这部分大幅加速了。

So this is one of the most interesting problems we're having with all of this is we've taken the writing code bit, and we've massively accelerated that.

Speaker 0

现在,瓶颈出现在了其他所有地方。

Now the bottlenecks are everywhere else.

Speaker 0

对吧?

Right?

Speaker 0

那么,既然过去耗时最长的那部分现在变了,我们该如何重新设计流程呢?

Like, how do we redesign our processes now that the bit that used to take the longest, right?

Speaker 0

以前,你会先写出一份需求文档,然后交给工程团队,幸运的话,三周后他们才会给你一个实现方案,你才能开始下一步。

It used to be you'd come up with a spec and you hand it to engineering team, and three weeks later, if you're lucky, they'd come back with an implementation for you to then start.

Speaker 0

而现在,这部分工作可能只要三个小时,具体取决于针对这类任务的编码代理是否已经成熟。

And now that that maybe that takes three hours, depending on how well established the coding agents are for that kind of thing.

Speaker 0

那接下来呢?

So now what?

Speaker 0

对吧?

Right?

Speaker 0

现在瓶颈又在哪里?

Now where else are the bottlenecks?

Speaker 0

我不认为是提出初始想法,任何做过产品工作的人家都知道,你的初始想法总是错的。

I don't think it's I mean, as coming with the initial ideas, anyone who's done any product work knows that your initial ideas are always wrong.

Speaker 0

重要的是验证这些想法。

What matters is is proving them.

Speaker 0

对吧?

Right?

Speaker 0

关键是测试它们。

It's it's it's it's testing them.

Speaker 0

我们现在能更快地测试东西,因为我们能更快地构建可用的原型。

We can test things so much faster now because we can build workable prototypes so much quicker.

Speaker 0

在我的工作中,我一直在做一件有趣的事:任何我想设计的功能,我通常会用三种不同的方式做原型,因为这花不了多少时间。

So there's an interesting thing I've been doing in my own work where any sort of feature that I want designed, I'll often prototype three different ways it could work because that takes very little time.

Speaker 0

然后我可以开始试验它们,尝试它们,看看哪些是我喜欢的。

And then I can start experimenting them and trying them and seeing which ones I like.

Speaker 0

对我来说,这里真正具有变革性的一步是,当你在构思阶段引入人工智能时,重点就变成了原型。

And that that feels to me like the really transformational step here is that when you get AI involved in your ideation phase, it's much more about prototypes.

Speaker 0

关键是,现在UI原型是免费的。

It's about, okay, we can see like, a a UI prototype is free now.

Speaker 0

ChatGPT和Claude会根据你描述的任何内容,直接为你生成一个非常逼真的UI,你就应该这样工作。

ChatGPT and Claude will just build you a very convincing UI for anything that you describe, and that's how you should be working.

Speaker 0

我认为,任何从事产品设计的人,如果不通过快速原型来测试想法,就错过了这一阶段我们所能获得的最强大的助力。

I think anyone who's doing sort of product design isn't Vibe coding little prototypes is missing out on the the the latest the, like, the the most powerful sort of boost that we get in that step.

Speaker 0

但接下来你该怎么做呢?

But then what do you do?

Speaker 0

对吧?

Right?

Speaker 0

你现在有了三个选项,而不是一个,你该如何向自己证明哪一个才是最好的?

How do you given your three options now that you have instead of one option, how do you prove to yourself which one of those is the best?

Speaker 0

我对这个问题没有确信的答案。

I don't have a confident answer to that.

Speaker 0

我预计,这正是传统可用性测试发挥作用的地方。

I expect this is where the good old fashioned usability testing comes in.

Speaker 0

比如,找个人在Zoom上共享屏幕,使用你的软件,观察会发生什么。

Like, get somebody on Zoom, screen shared, using your software, see what happens.

Speaker 0

你可以让AI来做这件事,也可以用AI来模拟你的用户。

That's you can tell the AI to do it, and you can simulate your users with the AI.

Speaker 0

我不认为这可信。

I don't think that's credible.

Speaker 0

我认为,让ChatGPT假装在你的产品网站上点击浏览,所得到的结果不会比真实人类更好。

I don't think you're going to get as good results from ChatGPT pretending to click around on your pro site than you would from an actual human being.

Speaker 1

这太有趣了。

This is so interesting.

Speaker 1

我一直思考的一个问题是,人类的思维将在哪些方面继续保有价值?

A question I've been tackling is just where are human brains gonna continue to be valuable?

Speaker 1

我在这里听到的是,最初的想法——你刚才提到了一个非常好的观点。

And what I'm hearing here is there s like the initial idea, you made such a good point here.

Speaker 1

最初的想法往往不是最终的成功想法,它只是想法的起点。

It s like the initial idea is often not the actual winning idea, it s just the beginning of an idea.

Speaker 1

所以,未来的想法、尝试、原型设计、帮助你缩小方向、构建、打磨、推向市场,这些步骤都存在。

So there s like the idea for the future, then there s the try it out, prototype it, help you narrow on the direction, build it, make it awesome, get it out into the world.

Speaker 1

在我看来,AI在提出想法和生成初始构想方面将会非常出色。

And it feels to me like AI s going to be really good at suggesting ideas and coming up with initial ideas.

展开剩余字幕(还有 480 条)
Speaker 1

我很好奇,人类大脑是否有一天根本不再需要,而这正是所有讨论的核心。

And I wonder if the human brain, it s not like maybe someday we don need human brains at all and that s all of the discussion.

Speaker 1

但也许下一阶段是,人工智能将帮助我们产生出色的想法。

But maybe the next phase is AI will help us come up with great ideas.

Speaker 1

我的意思是,这种情况已经

I mean, that's been

Speaker 0

持续了大概好几年了。

the case for probably a couple of years now.

Speaker 0

它们已经足够强大,能够进行非常出色的头脑风暴。

They've been strong enough to do really good brainstorming.

Speaker 0

我喜欢把它比作这样的情况:当你组织一场小组头脑风暴时,你会预订一间会议室一小时,准备一块白板,召集十几个人。

And I like to compare it to the thing where when you've got a group brainstorming exercise, you book a meeting room for an hour, you've got a whiteboard, You get a dozen people in.

Speaker 0

而在那整个头脑风暴过程的前三分之二时间里,老实说,大家只是在反复提出最明显、最基础的想法。

And the first two thirds of that brainstorming session, honestly, it's kind of just everyone going through the most obvious basic ideas.

Speaker 0

对吧?

Right?

Speaker 0

然后把它们全都写在白板上。

And you get them all out on the whiteboard.

Speaker 0

把它们全部列出来。

You get them all up.

Speaker 0

当你开始说:好吧,我们来谈谈这些的时候,事情就变得有趣了。

And then things get interesting when you start saying, okay, well, let's talk about these.

Speaker 0

让我们开始把它们组合起来。

Let's start combining them.

Speaker 0

AI 在创意的前三分之二部分非常擅长。

The AI is so good at that first two thirds of the ideas.

Speaker 0

我经常和它一起头脑风暴,只是让它把所有显而易见的想法都吐出来,它能给出二十个点子,而且都挺完整的。

Like, I brainstorm with them all the time where I just get them to spit out all of the obvious stuff, and they'll come up with 20 things, and they'll all be kind of done.

Speaker 0

它们就是……它们不会,它们就是不会太有趣。

Like, they're very they won't be they just won't be very interesting.

Speaker 0

真正有趣的是,当你再让它给出二十个更多点子时,到了这份列表的末尾,你开始得到一些并不算好、但却能引你走向有趣方向的想法。

What gets interesting is when if you ask them for 20 more, and now they by the sort of end of that list, you're beginning to get things which are not good ideas, but they point you in interesting directions.

Speaker 0

还有很多类似的技巧。

There are so many other tricks like this.

Speaker 0

比如,你可以让AI把一些奇怪的领域结合起来。

Like, you can tell you can you can tell AI to combine weird fields.

Speaker 0

你可以这么说。

You can say, okay.

Speaker 0

我想为我的新SaaS平台寻找一些受海洋生物学启发的营销创意,看看会发生什么。

I want ideas for marketing my new SaaS platform inspired by marine biology, and you see what happens.

Speaker 0

大部分内容可能会是垃圾,但可能会有一个火花让你找到好点子。

And most of it will be complete junk, but there might be a spark that gets you to the good idea.

Speaker 0

所以我非常喜欢把AI当作这方面的头脑风暴伙伴。

So I love them as as brainstorming companions on that front.

Speaker 1

这让我想起我和大卫·普拉塞克的一次聊天。

That reminds me of a chat I had with David Plasek.

Speaker 1

他是位命名专家。

He's a expert naming person.

Speaker 1

他帮助公司为产品起名。

He helps companies come up with names for products.

Speaker 1

他公司的一项工作是组建三个团队来头脑风暴产品名称。

And one of the things that he does at his company is he creates three teams to come to brainstorm names.

Speaker 1

比如,假设他们为一个帆板运动产品命名。

One team so for example, let's say a windsurf was a product they named.

Speaker 1

第一组的想法是,好吧,这是一个AI开发工具。

So the first team is, okay, this is an AI IDE thing.

Speaker 1

这正是它的本质。

That's that's exactly what it is.

Speaker 1

第二组的想法是,好吧,这是一艘船。

Second team is okay, this is a this is a boat.

Speaker 1

你在为一艘船命名,这里有若干限制条件。

You're naming a boat and here's constraints.

Speaker 1

而这里,这是一艘宇宙飞船。

Then here, this is a spaceship.

Speaker 1

所以从这个角度来命名。

So name it from that perspective.

Speaker 1

他发现,最好的名字往往来自这些不同的视角,即用不同的隐喻表达相似的好处。

And he finds the best names come from those other directions where it's a different metaphor with the same sort of benefits.

Speaker 1

好的。

Okay.

Speaker 1

所以我的理解是,这很不错。

So what I'm hearing here is this is good.

Speaker 1

目前这对人类来说是好事,因为我们仍然有机会参与到这个过程中。

This is good for humans right now, that there's still opportunity for us to contribute to the process.

Speaker 0

实际上,我想为软件工程师辩护一下,因为一方面,这些工具确实能写代码。

And actually, I want to stand in defense of software engineers for a bit because on the one hand, these things can write code.

Speaker 0

这曾经是我们的专长。

That used to be our thing.

Speaker 0

对吧?

Right?

Speaker 0

我发现,很好地使用编码代理,需要动用我作为软件工程师二十五年来的全部经验,这让我精神上非常疲惫。

I'm finding that using coding agents well is taking every inch of my twenty five years of experience as a software engineer, and it is mentally exhausting.

Speaker 0

这种事情现在人们谈论得越来越多了。

Like, this is something which people are talking a lot more about now.

Speaker 0

我可以同时启动四个代理,让它们分别处理四个不同的问题。

I can fire up, like, four agents in parallel and have them work on four different problems.

Speaker 0

到了上午十一点,我就已经精疲力尽了。

And by, like, 11AM, I am wiped out for the day.

Speaker 0

因为人类的认知能力是有极限的,即使你不去审查它们做的每一件事,单是同时在脑海中保持多少信息,就已经很容易超出负荷了。

Like, I have because there is a limit on human cognition in how much even if you're not reviewing everything they're doing, just how much you can hold in your head at one time, and it's very easy to pop that stack at the moment.

Speaker 0

我们需要学习一种新的个人技能,那就是找到自己的新极限。

Like, there's a sort of personal skill that we have to learn, which is finding our new limits.

Speaker 0

我们该如何负责任地使用时间,避免耗尽自己,这才是关键。

Like, what is what is a responsible way for us to you to to not burn out and for us to to use the time that we have.

Speaker 0

我接触过很多人,他们因为失眠而困扰,因为他们觉得:我的编码代理本可以替我完成更多工作。

And I I've I've talked to a lot of people who are losing sleep because they're like, my coding agents could my agents could be doing work for me.

Speaker 0

我只是打算多熬半小时,再启动一堆任务,让它们在凌晨四点醒来工作。

I'm just gonna stay up an extra half hour and and set off a bunch of extra things, and they're waking up in four in the morning.

Speaker 0

这显然是不可持续的。

That's obviously unsustainable.

Speaker 0

我希望这只是一个暂时的新鲜现象。

I hope that that's a novelty thing.

Speaker 0

这些代理工具真正变强大,也不过是过去四五个月的事。

The agents only really got good in the past sort of four to five months.

Speaker 0

我们都在摸索,这到底意味着什么,能让我们做到什么。

We're all learning what that looks like and what that lets us do.

Speaker 0

但这确实令人担忧。

But it's it's it's concerning.

Speaker 0

我们在使用某些工具时,带有一种赌博和上瘾的成分。

There's an element of sort of gambling and addiction to to how we're using some of these tools.

Speaker 0

但为了为软件工程师辩护,我从这些工具中获得了极佳的成果,因为它们确实是现有技能和经验的放大器。

But to stand in defense of software engineers, I get great results out of these things because they are amplifiers of existing skills and experience.

Speaker 0

我有二十五年AI之前的丰富经验,现在可以通过与代理进行高层次对话来放大这些经验。

And I have twenty five years of existing, like, pre AI experience, which I can now amplify because I can talk to the agent at a very high level.

Speaker 0

我可以使用那些我多年掌握的复杂工程语言,而代理似乎也能理解这些语言。

I can use very I can use sophisticated engineering, like, language that I've mastered over the years, which they appear to know as well.

Speaker 0

我们可以非常高效地协作。

And we can collaborate incredibly effectively.

Speaker 0

这意味着我可以审视一个问题,然后说:这个问题只需要一句提示,我知道代理能找出并修复这个bug,而另一个问题则不确定到底有多大。

That means I can look at a problem, and I can say this problem is a one sentence prompt, and I know it'll find that bug and fix that bug, as opposed to this other problem, which is who knows how how big a problem.

Speaker 0

但另一方面,我有二十五年对开发耗时的判断经验,而这些经验现在完全失效了。

There is a flip side to this, which is that I've got twenty five years of experience in how long it takes to build something, and that's all completely gone.

Speaker 0

因为现在我可以看一个问题,然后说:好吧,这个大概需要两周时间。

Like, that doesn't work anymore, because I can look at a problem and say, okay, well, this is gonna take two weeks.

Speaker 0

不值得做。

It's not worth it.

Speaker 0

这就像,是的。

And that's like, yeah.

Speaker 0

但也许这只需要二十分钟,因为原本需要两周的原因是那些繁琐的编码工作,现在都被AI替我们完成了。

But maybe it's gonna take twenty minutes because the reason it would've taken two weeks was all of the the sort of crafty coding things that the AI is now covering for us.

Speaker 0

我发现这非常有趣且具有挑战性。

And that I've been finding really interesting and challenging.

Speaker 0

我不断给AI分配一些我认为它无法完成的任务。

I I constantly throw tasks at AI that I don't think it'll be able to do.

Speaker 0

因为偶尔它真的能完成。

Because every now and then it does it.

Speaker 0

当它没能完成时,你就学到了,对吧?你知道了,Opus 4.6 仍然无法完成这个特定任务。

And when it doesn't do it, you learn, right, you learn, okay, Opus 4.6 still can't do this particular thing.

Speaker 0

但当它完成了某些以前模型做不到的事情时,这实际上就是前沿的AI研究。

But when it does do something, especially something the previous models couldn't do, that's actually cutting edge AI research.

Speaker 0

你可能是世界上第一个发现AI现在能做某件事的人,因为你曾经发现它做不到,于是你一直保留着这些有趣任务的清单。

You can be the first person in the world to spot that AI can now do x just because you were the person you you found it couldn't do it, you've you've been keeping that sort of backlog of of interesting tasks for it.

Speaker 1

这个讨论方向太有意思了。

This is such an interesting line of discussion.

Speaker 1

你在这里描述的正是这样一种观点:比如所谓的10倍工程师会变得更有价值,因为你能更有效地使用这些工具。

This idea that, let's say, 10x engineers, to use that phrase, are gonna be more valuable is what you're describing here because you can work with these tools much more effectively.

Speaker 1

你对初级工程师怎么看?

What do you think of junior engineers?

Speaker 1

那边正在发生什么?

Just like what's happening there?

Speaker 1

他们的未来会怎样?

What's their future?

Speaker 0

有意思的是,Thoughtworks这家大型IT咨询公司大约一个月前举办了一次外出研讨会,他们邀请了来自不同公司的多位工程副总裁来讨论这个问题。

So there's an interest so Thoughtworks, the big, like, IT consultancy did a off-site a few, about a month ago, and they produced they got a whole bunch of engineering VPs in from different companies to talk about this stuff.

Speaker 0

他们提出的一个有趣观点是,这类工具对资深工程师特别有帮助。

And one of the interesting theories they came up with is they think this stuff is really good for experienced engineers.

Speaker 0

它能放大他们的技能。

Like, it amplifies their skills.

Speaker 0

这很棒。

That's great.

Speaker 0

这对新工程师非常有帮助,因为它解决了许多入职过程中的问题。

It's really good for new engineers because it solves so many of those onboarding problems.

Speaker 0

比如,如果你去问问 Cloudflare 和 Shopify,他们都表示,2025 年一整年计划招聘一千名实习生,因为过去实习生需要一个月才能开始做有用的工作。

Like, if you talk to, Cloudflare and Shopify, both said they were hiring a thousand interns over the course of 2025 because the intern onboarding costs it used to be takes a month before your intern can do anything useful.

Speaker 0

但现在,他们一周内就能开始做有用的事情了,因为 AI 助手能帮助他们更快上手。

Now they're doing something useful within like a week because the the AI assistant helps them get up and running faster.

Speaker 0

问题在于中间那批人。

The problem is the people in the middle.

Speaker 0

比如,如果你是中阶工程师,还没达到资深工程师的水平,但也不算新人了,那正是 ThoughtWorks 目前最可能陷入困境的群体。

Like, if you're mid career, if you haven't made it to sort of super senior engineer yet, but you're not sort of new either, that's the that's the group which ThoughtWorks which ThoughtWorks Resolve were probably in the most trouble right now.

Speaker 0

这是一个悬而未决的问题,因为他们缺乏足够的经验来借助这些工具提升效率,也不像新人那样能获得那么多明显的助力。

Like, that's the open question because they don't have that expertise to to to to amplify and and use with these tools, and it's not as benefit like, they've got all of the the boosts that the beginners were getting they've got already.

Speaker 0

所以,这对我来说是一个当前有趣的开放性问题。

So that's an interesting open question right now for me.

Speaker 0

问题更多集中在中阶工程师身上,而不是新人或高级工程师。

It's it's more the the the sort of mid mid level as opposed to the beginners or the the advanced people.

Speaker 1

人工智能出现在这么多事物的中间,这真是太有趣了。

It's so interesting how AI is coming at the middle of so many things.

Speaker 1

它出现在产品开发流程的中间。

It's coming at the middle of the product development process.

Speaker 1

它出现在资历的中间。

It's coming at the middle of seniority.

Speaker 1

可能还有其他例子。

There's probably other examples.

Speaker 1

我猜这对所有职能都适用,比如产品经理和设计师,尤其是新入行的产品经理和设计师,因为你们所说的‘AI原生’本质上就是能更快上手。

And I'm guessing this is true for all functions like PMs, designers too, just new PMs, designers, maybe because being AI native basically is what you're describing and ramping up much more quickly.

Speaker 1

我想既然我们谈到这个话题,很多听众其实就属于那些处于中间阶段的人。

I guess while we're on this topic, say you are a lot of listeners here, just like those people in the middle.

Speaker 1

你对他们有什么建议,可以帮助他们避免沦为永久的底层群体?

What would your advice be to them to help them avoid becoming a part of the permanent underclass?

Speaker 0

你这是给我施加了很大的责任啊。

That's a big responsibility you're putting on me there.

Speaker 0

我认为前进的方向是拥抱这些技术,弄清楚如何利用它们让我变得更好?

I think I think the way forward is to lean into this stuff and figure out how do how do I help this make me better?

Speaker 0

对吧?

Right?

Speaker 0

很多人担心技能退化。

Like, a lot of people worry about skill atrophy.

Speaker 0

你知道,如果AI替你做了,你就什么也学不到。

You know, if the AI is doing it for you, you're not learning anything.

Speaker 0

我认为如果你担心这个,就应该对它保持警惕。

I think if you're worried about that, you push back at it.

Speaker 0

你必须谨慎地思考如何应用这项技术,想想:我得到了一个能回答任何问题、而且经常答对的东西,虽然它并不总是对的。

Like, you have to be mindful about how you're applying the technology and think, okay, I've been given this thing that can answer any question and often gets it right, doesn't always get it gets it right.

Speaker 0

我该如何利用它来放大自己的能力,学习新东西,承担更雄心勃勃的项目?

How can I use this to amplify my own skills, to to learn new things, to take on much more ambitious projects?

Speaker 0

我最近很喜欢的一点是,作为一名软件工程师,我发现自己的抱负水平直线上升。

Something I've been enjoying I think the thing I've enjoyed most about this as a software engineer is that my level of ambition has shot right up.

Speaker 0

因为我以前从不使用AppleScript,因为AppleScript是一整套需要学习的编程语言。

Because now I used to, like never I never used AppleScript because AppleScript is a whole programming language you have to learn.

Speaker 0

但过去两年半我一直在用AppleScript,因为ChatGPT懂AppleScript,我不用自己学,现在我能自动化我的Mac上的各种任务。

And I've been using AppleScript for, like, two and a half years now because ChatGPT knows AppleScript, and I don't have to and so now I can automate things on my Mac.

Speaker 0

这很棒。

And that's great.

Speaker 0

你知道吗?

You know?

Speaker 0

以前,光是想到要花两三个月才能掌握基本的AppleScript,就足以让我根本不想去用它。

And previously, the fact that it would've taken me, like, two or three months to learn basic AppleScript was enough for me never to use it.

Speaker 0

而现在,我用了这么多新技术,就是因为原本两到三个月的初始学习曲线被大大缩短了。

And now I've got all of these technologies that I'm using because that two to three month initial learning curve has been shaved right down.

Speaker 0

我觉得这适用于所有其他事情。

I think that applies to everything else.

Speaker 0

比如,我的厨艺进步了很多。

Like, I'm getting much better at cooking.

Speaker 0

我一直在用Claude,结果发现它是个出色的厨师,这听起来很奇怪,因为它根本没有味蕾。

I've been using Claude, it turns out, excellent chef, which doesn't make sense because it can't it doesn't have taste buds.

Speaker 0

但它能提供全球牛油果酱食谱的平均版本,结果发现这确实是不错的牛油果酱。

But it does it can give you the global average of the world's guacamole recipes, which turns out is good guacamole.

Speaker 0

所以这真的很有趣,就像尝试把这些技术应用到自我提升上。

So that's been really interesting, like trying to apply this stuff just to for sort of self improvement.

Speaker 0

我认为这是一种非常有用的技能。

I think that's a really useful skill to have.

Speaker 0

因为老实说,现在一切都变得太快了。

Because honestly, everything is changing so fast right now.

Speaker 0

唯一的通用技能就是能够适应变化。

The only universal skill is being able to roll with the changes.

Speaker 0

对吧?

Right?

Speaker 0

这是我们所有人都需要的能力。

That's the thing that we all need.

Speaker 0

奇怪的是,在这些关于如何与人工智能相处得更好的对话中,出现最多的词是‘自主性’。

Weirdly, the term that comes up most in these conversations about how you can be great with AI is agency.

Speaker 0

对吧?

Right?

Speaker 0

人类拥有自主性,我们用这种自主性来决定要应对哪些问题以及往哪个方向走。

People, human beings have agency, and we use that agency to decide what problems to take on and where to go.

Speaker 0

我认为人工智能根本没有任何自主性。

I think agents have no agency at all.

Speaker 0

我认为,人工智能永远不可能拥有的东西就是自主性,因为它没有人类的动机。

Like, I would argue that the one thing AI can never have is agency because it doesn't have human motivations.

Speaker 0

当然,你可以告诉它要赚更多钱之类的,但它永远无法自己决定接下来什么行动才是合理的。

Like, sure, you can tell it make more money or whatever, but it's never going to be able to decide on its like, what makes sense for it to act on next.

Speaker 0

所以我认为,关键在于投资自己的自主性,思考如何利用这项技术来提升自己的能力,去做新的事情。

So I'd say that's the thing is to invest in your own agency and invest in how do I use this technology to get better at what I do and to do new things.

Speaker 1

而且,正如你所说,要有雄心。

And also to your point, be ambitious.

Speaker 1

志存高远。

Think big.

Speaker 1

对。

Yeah.

Speaker 1

昨天刚发布了一段对詹森的采访,有人问他关于裁员的问题。

There's an interview with Jensen that just came out yesterday where people asked him about layoffs.

Speaker 1

现在到处都在裁员。

There's all these layoffs happening.

Speaker 1

AI真的在夺走工作吗?

Is AI actually taking jobs?

Speaker 1

他说,很多公司之所以裁员,是因为他们缺乏足够的创造力和雄心,不知道如何利用这些资源。

And he's like, the reason a lot of these companies are not, are letting people go is they don't have enough creativity or ambition for what they can do with all of these resources there.

Speaker 1

因为他们并没有裁员。

Because they're not letting people go.

Speaker 1

他们有太多想做的事情。

They have so much they want to do.

Speaker 1

你知道,说起来容易做起来难,而且并不总是如此,但我认为这是一种很有意思的处理方式。

You know, obviously easier said than done and it's not always the case, but I think that's an interesting way of approaching it.

Speaker 1

既然我们现在拥有这种能力,人们几乎低估了自己能用它做什么,没有充分去利用它。

Now that we have this power, people almost underestimate what they can do with it and don't fully lean into it.

Speaker 1

所以我非常喜欢这种建议:试着更有雄心一点。

So I love this advice of just try to be a little more ambitious.

Speaker 1

去尝试那些你觉得不可能的事情,看看它们可能其实并非不可能。

Try the stuff that you think is impossible and see might be actually possible.

Speaker 0

我今年的新年决心恰恰相反。

My New Year's resolution this year was the opposite.

Speaker 0

过去每一年,我总是告诉自己:今年我要更专注。

Every previous year, I've always told myself this year, I'm gonna focus more.

Speaker 0

我要少接一些事情。

I'm gonna take on less things.

Speaker 0

但今年,我的目标是多接一些事,更有雄心。

This year, my ambition was take on more stuff and be more ambitious.

Speaker 0

我们有这些工具。

Like, we've got these tools.

Speaker 0

把所有东西都拿过来。

Bring it all in.

Speaker 0

让我们试试做所有事情。

Let's try and do everything.

Speaker 0

我不知道这是否是个好的新年决心,但我就选了这个。

I don't know if that was a good New Year's resolution, but that's what I went with.

Speaker 1

那么到目前为止进展如何?

So how's it going so far?

Speaker 1

你对

How do you feel about

Speaker 0

这个决定感觉如何?

this decision?

Speaker 0

很有趣。

It's fun.

Speaker 0

我玩得很开心。

I'm enjoying myself.

Speaker 0

我觉得我可能会坚持到年底,但那时候我会想,哇。

I I think I'll probably get to the end of the year, but I'll be like, wow.

Speaker 0

我本该重点关注的最重要的事情都没有完成。

The thing the most important things that I should have been focusing on did not get done.

Speaker 0

但当我有志于做这些事时,情况就是这样。

But that's that's the case when it is my ambition to do them.

Speaker 0

所以,你知道的。

So, you know.

Speaker 1

这是一种聚散交织的情况。

It's a a converge diverge sort of situation.

Speaker 1

你知道的。

You know?

Speaker 1

明年可以重新聚焦。

Next year could be refocused.

Speaker 0

当然。

Absolutely.

Speaker 0

对。

Yeah.

Speaker 0

天哪。

Oh, man.

Speaker 0

有点吧。

Kind of

Speaker 1

就是那种情况。

along those lines.

Speaker 1

我想回到你提到的这一点,你说你工作更努力了,但一整天早早就精疲力尽了。

I want to come back to this point you made about how you're working harder and you're like fried early in the day.

Speaker 1

这真是个有趣,我不知道怎么说,几乎是种矛盾。

This is such an interesting, I don't know, contradiction almost.

Speaker 1

人们都知道,人工智能本应让我们更高效。

People, you know, AI is supposed to make us more productive.

Speaker 1

它本应给我们更多休息时间。

It's supposed to give us more time off.

Speaker 1

它本应让我们闲坐着看奈飞,享受世界上所有的财富与生产力。

It's supposed to let us sit around and watch Netflix and do all the create wealth and productivity in the world.

Speaker 1

感觉那些最沉迷于AI的人,反而比以往工作得更努力。

It feels like the people that are most AI pilled are working harder than they've ever worked.

Speaker 1

有一种焦虑被描述为:我的代理没有运行。

There's this anxiety described of my agents aren't running.

Speaker 1

我得时刻盯着它们。

I gotta stay on top of them.

Speaker 1

你认为这里发生了什么?

What do you think is going on there?

Speaker 1

这就像你所说的,可能只是一种暂时的新鲜感,然后我们会说,好吧。

Is this just like you said, maybe it's like a temporary novelty thing, and then we'll be like, alright.

Speaker 1

我不需要这么有生产力。

I don't need to be this productive.

Speaker 1

还有别的吗?

Is there anything else there?

Speaker 0

我希望这真的只是一时的新鲜感。

I think I I really hope it's a novelty thing.

Speaker 0

我确实有了更多时间,但我却感到精疲力尽。

And I am actually getting much more I'm getting more time, but I'm I'm exhausted.

Speaker 1

你的大脑累坏了。

Like, your brain is exhausted.

Speaker 0

我的大脑累坏了。

Like, my brain is exhausted.

Speaker 0

我有更多时间去做其他事情,我也确实去做了,这很棒。

I've got I've got more time to go and do things, and I do things, and it's great.

Speaker 0

但这种高强度工作带来的疲惫感,真的让我很意外。

But it's it is that the exhaustion from that sort of intensity of work has been a really big surprise for me.

Speaker 0

尤其是从十一月开始,随着这一切逐渐升温,我一直在观察这种现象。

Like, that that's been been some something which I've I've I've I've been observing, especially since November, like, as as all of this stuff stuff started ramping up.

Speaker 0

而且,是的,我认为这个问题的核心始终是来自他人的期望。

And, yeah, I think that's the concern there comes down it's always expectations from other people.

Speaker 0

你知道的?

You know?

Speaker 0

如果你在一家期望你完成五倍工作量的公司,那肯定会让人精疲力尽。

If you work for a company that's that's expecting you to get five times more done, that's gonna be exhausting.

Speaker 0

也许我们以后会看到。

And and maybe we'll see.

Speaker 0

我认为那些管理良好的好公司正在关注这个问题。

And I think the good companies with good management are paying attention to this.

Speaker 0

它们不希望为了短期利益而让最优秀的员工 burn out,最终却因此失去人才。

They don't want to burn out their best employees for the sort of for the short term gain, but but lose people over it.

Speaker 0

但,是的,这确实是一个巨大的张力。

But, yeah, it's it's it's a big tension.

Speaker 0

我认为,我们这些身处人工智能热潮前沿的人,是最早感受到这一点的。

I think we're we're those of us on the sort of leading edge of the AI boom are feeling it first.

Speaker 0

我猜这对其他人也会发生。

I imagine it's gonna come for everyone else as well.

Speaker 1

不过,我们还没提到的另一个因素是,你已经提过几次了。

The other element of this, though, that we haven't mentioned is and you've mentioned a couple times.

Speaker 1

这实际上非常有趣。

It's actually really fun.

Speaker 1

这里的动力并不是

The drive here is not

Speaker 0

我自己也做了很多。

I have to myself so much.

Speaker 0

当然。

Absolutely.

Speaker 0

这太有趣了。

It's so fun.

Speaker 0

我的很多朋友都在谈论他们有一堆积压的副项目。

It's a lot of my friends have been talking about how they have this backlog of side projects.

Speaker 0

对吧?

Right?

Speaker 0

过去十到十五年里,他们一直积压着一些从未完成的项目和觉得会很酷的想法。

For the last ten, fifteen years, they've got projects they never quite finished and ideas they thought would be cool.

Speaker 0

其中一些人说:好吧,我现在全都做完了。

And some of them are like, well, I've done them all now.

Speaker 0

就在过去几个月,我一一处理了。

Like, last couple of months, I just went through.

Speaker 0

每天晚上,我都想:来吧,把这个项目做完。

And every evening, I'm like, let's take that project and finish it.

Speaker 0

还有这个,那个,还有那个,以及那个。

And that one and that one and that one and that one.

Speaker 0

当一切结束时,他们几乎感到一种失落感:好吧,就这样了。

They almost feel a sort of sense of loss at the end where they're like, well, okay.

Speaker 0

我的待办清单没了。

My backlog's gone.

Speaker 0

现在,我该去构建什么呢?

Now now what am I gonna build?

Speaker 1

是的。

Yeah.

Speaker 1

这又回到了那个工厂的话题。

It comes back to that factory.

Speaker 1

前几天我跟Linear的创始人聊了聊,谈到‘工厂’这个概念,我们觉得,‘工厂’听起来不像能创造出优秀产品的地方。

I was talking to the founder of Linear the other day and this idea of the factory, and we're just like, like, factory doesn't sound like a place that'll create amazing products.

Speaker 1

感觉好像,这种地方能诞生出美丽而创新的东西的可能性有多大呢?

It feels like, you know, like, what are the chances that'll create something beautiful and innovative?

Speaker 1

所以,要么这个词用错了,要么它只会导致糟糕的结果,大概率是这样。

So either that's the wrong word or it's just this will lead to bad stuff, probably.

Speaker 0

我觉得‘手工制作’这个词更好,手工制作的软件,我认为会更受重视。

I feel like the word artisanal does like like, artisanal's a handcrafted software, I think, is gonna be valued more.

Speaker 0

我在自己的工作中注意到,有时候我会对某个软件、某个Python库产生一个想法,然后一小时内就能完成,还能附上文档和测试。

Something I've noticed in my own work is sometimes I have an idea for a piece of software, a Python library or whatever, and I can knock it out in, an hour and get to a point where it's got documentation and tests and all of those things.

Speaker 0

而且它看起来就像我之前花了好几个星期做的那种软件。

And it looks like the kind of software the previous I just spent several weeks on.

Speaker 0

我可以把它上传到 GitHub 上,一切都没问题。

And I can stick it up on GitHub and everything.

Speaker 0

但我并不相信它。

And yet, I don't believe in it.

Speaker 0

我不相信它的原因是,我不得不匆忙完成所有这些工作。

And the reason I don't believe in it is that I I got to rush through all of those things.

Speaker 0

我觉得质量可能还不错,但我没有花足够的时间去验证这种质量。

I think the quality is probably good, but I haven't spent enough time with it to to feel confident in that quality.

Speaker 0

最重要的是,我还没用过它。

Most importantly, I haven't used it yet.

Speaker 0

事实上,当我使用别人的软件时,我最在意的是希望他们已经用了好几个月。

Like, it turns out when I'm using somebody else's software, the thing I care most about is I want them to have used it for for months.

Speaker 0

对吧?

Right?

Speaker 0

我希望其他人能把这些软件真正用起来。

I want other people to have put that software into practice.

Speaker 0

我有一些非常酷的软件,是我自己写的,但我从来没用过。

So I've got some very cool software that I built that I've never used.

Speaker 0

说实话,写它比真正去用它还要快。

Like, it was so it was quicker to build it than to actually try and use it.

Speaker 0

所以,我一直以来应对这种情况的方式,就是给它打上alpha标签。

And so the way I've been dealing with that, I've always put alpha on it.

Speaker 0

比如,如果你看到我的软件标注为alpha,那很可能意味着我还没真正用过它——对我大部分项目来说都是这样,这简直是个取巧的招数,你知道的,动不动就标个alpha。

Like, if you see my software and it says it's an alpha, that probably means I haven't actually used it yet for most of my projects, which is a bit of a cheat code, you know, alpha alpha this.

Speaker 0

但这难道不有趣吗?

But isn't that interesting?

Speaker 0

以前,如果你看到一款软件有高质量的测试和详尽的文档,那就意味着它很可靠。

Like like like, it used to be if you looked at software and it had high quality tests and documentation and everything, it meant it was good.

Speaker 0

但现在,这个信号已经消失了。

And now that signal is gone.

Speaker 1

这简直像是我们需要一个工作证明,而不是

It's almost like we need a proof of work for this versus

Speaker 0

使用证明。

the Proof one of usage.

Speaker 0

证明,是的。

Proof Yes.

Speaker 0

正是如此。

Of Exactly.

Speaker 1

天哪。

Oh man.

Speaker 1

说到手工编写的代码,我不知道你是否知道这一点,这太有趣了。

On this note of handcrafted code, I don't know if you know this, this is so interesting.

Speaker 1

数据标注公司正在购买旧的 GitHub 手写代码仓库来训练他们的模型,并且为这种手工编写的人类代码支付了大量资金。

Data labeling companies are buying old GitHub repos of handwritten code to train their models on, and they're paying a lot of money for like artisanal human written code.

Speaker 0

哦,这太有趣了。

Oh, that's fascinating.

Speaker 0

那是二战前的金属,你可以从旧船残骸中打捞出来,那是第一次核爆炸之前的金属。

That's the the pre World War two the the metal that you can dig up from old shipwrecks, which is before the nuclear the first nuclear explosions.

Speaker 0

所以,这种金属没有受到辐射的污染。

And so it's it's not got, like, the the the the radiation bakes into the metal.

Speaker 0

就是整个这么一回事。

It's that whole thing.

Speaker 1

哇。

Wow.

Speaker 1

这太有趣了。

That's fascinating.

Speaker 1

是的。

Yeah.

Speaker 1

所以他们在寻找2022年之前的代码,我想,也就是ChatGPT刚出现的时候之前的代码。

So they're looking for code pre 2022, I think, whenever ChatGPT kind of emerged.

Speaker 1

哇。

Wow.

Speaker 1

是的

Yeah.

Speaker 1

所以如果你有一些,你就能赚大钱。

So if you've got some, you can make a you can make a fortune.

Speaker 0

我保证。

I promise.

Speaker 0

我开源了我所有的东西,所以它们已经公开了。

I open source all my stuff, so it's already out there.

Speaker 0

它就在训练数据中。

It's it's in the training

Speaker 1

它已经被用来训练模型,早就被用过了。

it's it's been used to train the models slurp stuff already.

Speaker 0

没错。

Yep.

Speaker 0

Oh,

Speaker 1

天啊。

man.

Speaker 1

好吧。

Okay.

Speaker 1

我问你一个问题。

Let me ask you this question.

Speaker 1

我只是对这个预测感到好奇。

I'm just curious about this prediction.

Speaker 1

我知道你并不像那种喜欢做预测的人,尽管你确实经常做出预测,而且你似乎总是对的。

I know you're not like a prediction person, although you do make predictions and you seem to be right often.

Speaker 1

你认为世界上50%的工程师什么时候会由AI编写他们100%的代码?

When do you think 50% of engineers in the world will be AI will be writing a 100% of their code?

Speaker 1

你觉得我们离这个目标还有多近?

How close to that do you think we are?

Speaker 0

那我把它修正为95%的代码。

So I'm gonna refact that to 95% of their code.

Speaker 0

我不会

I don't

Speaker 1

认为是的。

think Yeah.

Speaker 0

我们会谈到这一点的。

We'll get to that.

Speaker 0

但没错。

But yes.

Speaker 0

很难说全球范围的情况,部分原因是不同文化之间存在差异。

It's very difficult to say worldwide because partly because to their cult there are cultural differences.

Speaker 0

我花了太多时间在 Hacker News 上,我注意到一个现象:在太平洋时间午夜开始的对话,会一直持续到早上8点。

I had spent way too much time on hacker news, and something I've noticed about hacker news is a conversation that starts at midnight Pacific time and goes until 8AM.

Speaker 0

语气非常不同,因为那是欧洲人。

Very different tone because it's the Europeans.

Speaker 0

对吧?

Right?

Speaker 0

你会看到,欧洲人比美国人更怀疑人工智能。

You'll get the year and the Europeans are a lot more AI skeptic than the Americans are generally.

Speaker 0

所以我认为不同国家将会形成各自不同的文化氛围。

So I think different countries are gonna have different sort of different cultures around us.

Speaker 0

同时,我认为今年已经毫无疑问地表明,这些东西能生成优质的代码。

At the same time, I think it's become undeniable this year that this stuff produces good code.

Speaker 0

过去你可以说,我不用这些东西,因为代码质量差。

Like it used to be that you could say, I don't use this stuff because the code is bad.

Speaker 0

那曾经是一个合理的立场。

And that was a a justifiable position.

Speaker 0

但现在这已经不再合理了。

That's not justifiable anymore.

Speaker 0

代码现在很好了,至少按照我对好代码的定义来说是这样。

The code is now It's good code for for the my for my definition of good code, at least.

Speaker 0

所以我们说,到今年年底,可能有50%的工程师,他们的大部分代码都可能由AI生成。

So so we're saying 50% of engineers let's say 50% of engineers majority of their code, it could happen by the end of this year.

Speaker 0

有可能。

It could.

Speaker 0

因为现在的技术已经足够好了。

Because the the the the technology is good enough now.

Speaker 0

我觉得现在的挑战是让人们学会如何使用这些工具,这很难,因为一提到使用这些工具,大家都觉得肯定很简单。

And I feel like the the challenge now is getting people to learn how to use this stuff, which is difficult because using the stuff, everyone's like, oh, it must be easy.

Speaker 0

不就是个聊天机器人嘛。

It's just a chatbot.

Speaker 0

其实并不简单。

It's not easy.

Speaker 0

在人工智能领域,一个很大的误解就是认为有效使用这些工具很容易。

Like, that's one of the great misconceptions in AI is that using these tools effectively is is is easy.

Speaker 0

这需要大量的练习,需要不断尝试那些没成功的方法,也要尝试那些成功的方法。

It takes a lot of practice, and it takes a lot of trying things that didn't work and trying things that did work.

Speaker 0

但是的,我预计到今年年底,工程师说他们几乎所有的代码都是由AI编写的,将不再罕见。

But, yeah, I I I expect by the end of this year, it will not be uncommon to have an engineer say that almost all of their code is written by AI.

Speaker 1

我之前也有过类似的想法。

That was the same rough idea I had.

Speaker 1

这难道不疯狂吗?

And how crazy is that?

Speaker 1

变化速度真快,太惊人了。

How quickly It's wild.

Speaker 1

这份工作已经发生了变化,可能性也不同了。

This job has changed and what is possible.

Speaker 1

我认为人们常常低估了事物变化的速度,这是一个很好的例子。

I think people, this is a good example of people underestimate how quickly things can change.

Speaker 1

一年前或两年前,达里奥还在预测这种情况,说100%的代码都将由AI编写,当时我们还觉得这不可能。

We would not have, I think Dario was predicting this a year or two ago, just so, 100% of code's going be written by AI, and we're just like, We

Speaker 0

我们很乐意向蒂姆致敬。

will love to Tim.

Speaker 1

是的。

Yeah.

Speaker 1

对吧?

Right?

Speaker 1

没错。

Exactly.

Speaker 1

你在说什么啊?

Like, what are you talking about?

Speaker 1

太差了。

So bad.

Speaker 1

写代码水平太差了,而且这种变化可能会出现在人们没预料到的其他工作中,这既令人害怕,又有趣且令人兴奋。

So bad at writing code, and and this might come for other jobs that people don't see coming, which is scary and interesting and exciting.

Speaker 0

说实话,我完全不是AI末日论者。

It's honest the the I'm I'm not an AI doomer in the slightest.

Speaker 0

但从经济角度看,确实让我感到不安。

The economics of it do make me nervous.

Speaker 0

我们的白领知识型工作,真的会在未来几年内被消灭十分之一吗?

Like, it are we really going to wipe out, like, a tenth of white collar knowledge work jobs in the next few years?

Speaker 0

我真的希望不会,因为我不知道经济如何适应这种情况。

I really hope not because I don't know how the economy adapts to that.

Speaker 0

你知道的。

You know?

Speaker 0

所以,是的,这很复杂。

So, yeah, that's complicated.

Speaker 1

是的。

Yeah.

Speaker 1

我其实正在做一份即将发布的报告。

I'm actually I'm doing a report that's coming out.

Speaker 1

它会在本集播出前发布,分析科技行业的就业市场。

It'll come out ahead of this episode, looking at the job market in tech.

Speaker 1

令人惊讶的是,仅在科技公司,我们目前的工程岗位和产品经理岗位空缺数量达到了历史最高水平。

And surprisingly, just at tech companies, we're at the highest number of open engineering roles, open PM roles.

Speaker 0

有意思。

Interesting.

Speaker 1

仅有的例外是新冠疫情期间那段疯狂的峰值时期。

Except for during the crazy peak during COVID.

Speaker 1

所以现在的情况有点往那个水平回升了。

So it's kind of like coming back to that.

Speaker 1

基本上来说,全球科技公司的工程师和产品经理空缺职位数,是这三年半左右以来的最高值。

Basically, it's the highest number of open roles in three and a half ish years for engineers and PMs at tech companies globally.

Speaker 0

这太有意思了。

That's very interesting.

Speaker 0

这挺有意思的,不是吗?

It's funny, isn't it?

Speaker 0

因为媒体上全是这些博眼球的裁员新闻。

Because you get all of these headline grabbing like Layoffs.

Speaker 0

对。

Yeah.

Speaker 0

最近是不是Block那家公司裁了4000人来着?

Was it was it Block that laid off 4,000 people recently?

Speaker 0

是的。

Yeah.

Speaker 0

这里始终存在的问题是,这其中有多少是人工智能的影响,有多少是疫情期间过度招聘和后续调整造成的。

The the the the the question there is always how much of that is AI and how much of it is over hiring during COVID and re corrections and all that kind of thing.

Speaker 0

要分清楚总是非常困难。

It's always very difficult to tell.

Speaker 0

所以,一方面,空缺职位的数量可能是一个更好的信号。

So that the the number of open jobs, on the one hand, maybe that's a better signal.

Speaker 0

但另一方面,招聘市场已经被这一切彻底搅乱了。

But on the other hand, the recruitment market has been driven completely crazy by all of this stuff.

Speaker 0

对吧?

Right?

Speaker 0

所有的职位广告都是由人工智能撰写的。

Like, all of the job ads are written by AI.

Speaker 0

简历也是人工智能生成的。

The the the the resume's AI.

Speaker 0

招聘领域的人表示,现在筛选和雇佣人才的难度前所未有,而求职者则说他们投了200份简历,却连一个回音都没有。

People people in recruitment are saying that this is it's never been this hard to filter through and hire people, and people who are hiring jobs say they they applied to 200 things and got nobody hearing back.

Speaker 0

所以这很难。

So it's hard.

Speaker 0

对吧?

Right?

Speaker 0

关于这个问题的宏观经济指标都是滞后的,迟早我们会得到更可靠的数字,来了解实际影响究竟如何。

The the the the macroeconomic indicators for this stuff are are lagging, and at some point, we should start getting more confident numbers about what the impact actually is.

Speaker 1

是的。

Yeah.

Speaker 1

有趣的是,招聘岗位的空缺数量也接近历史最高水平。

Interestingly, the number of recruiter open roles is also approaching like record numbers.

Speaker 0

太荒谬了。

Hilarious.

Speaker 1

这其实是一个反映招聘需求的有趣先行指标。

Which is an interesting leading indicator of demand for hiring.

Speaker 1

尽管有裁员,但仍有一些有趣的趋势。

So there's interesting trends in spite of the layoffs.

Speaker 1

所以,真是个疯狂的世界啊。

So yeah, what a wild world.

Speaker 1

你提到过你正在写的一本书。

So you've mentioned this book you're working on.

Speaker 1

这说的是智能体工程模式,对吧?

This is the agentic engineering pattern stuff, right?

Speaker 0

是的。

Yes.

Speaker 1

好的,太好了。

Okay, cool.

Speaker 1

我想聊聊这个。

So I want to talk about this.

Speaker 1

你指出,人们以为用AI做开发很容易,觉得AI会替我们做所有事情,那我们整天干什么呢?

So you pointed out, people think it's easy to build with AI, it's like, was going do all these things for us, what are going do all day?

Speaker 1

你说得对,实际上并不是这样。

To your point, it's actually not.

Speaker 1

要做好这件事,你需要很多非常具体的技能,而你正在你的博客上把这些技能整理出来,我们会引用它。

There's a lot of very specific skills you need to do this well and you're putting them together on your blog, we'll point to it.

Speaker 1

我想详细讲讲其中几个,帮助大家更好地完成这件事。

I want to talk through a few of them, to help people do this better.

Speaker 1

其中一个就是,现在写代码的成本已经很低了。

So one is this idea of just writing code is cheap now.

Speaker 1

你之前稍微提到了这一点。

You touched on this a bit.

Speaker 1

也许你可以分享一下,为什么这一点如此重要,值得我们时刻牢记。

Maybe just share why this is such an important thing to know and keep in mind.

Speaker 0

我认为这是整个过程中最令人震惊的一点。

So I think this is the single biggest shock in all of this.

Speaker 0

我们必须重新思考如何构建、如何作为软件工程师工作,原因在于,过去耗时最长的那部分工作,现在所需时间大大减少了。

The reason that we have to rethink how we build, how we work as software engineers, is that the thing that used to take the time takes way less time.

Speaker 0

事实上,程序员从来不会每天花90%的时间在键盘上敲代码。

Like, it's it's never been the case that programmers spend 90% of their day typing code into a computer.

Speaker 0

一直以来,围绕编码还有大量其他工作。

There's always there's so much additional work around that.

Speaker 0

但过去人们总强调,千万别打断程序员的工作。

But it still used to be like, people talk about how important it is not to interrupt your coders.

Speaker 0

对吧?

Right?

Speaker 0

你的程序员需要连续两到四个小时不受打扰的时间,才能建立起思维模型并高效写出代码。

Your coders need to have, like, solid two to four hour of uninterrupted work so they can spin up their mental model and and churn out the code.

Speaker 0

但这种情况已经完全改变了。

It's some that that's changed completely.

Speaker 0

就拿我来说,我编程时每隔一会儿只需要花两分钟向我的助手请教下一步该做什么,然后就能继续做其他事,再回来继续。

Like, I my my programming work, I need two minutes every now and then to prompt my agent about what to do next, and then I can do the other stuff, and I can go back.

Speaker 0

我现在比以前更容易被打断了。

I'm much more interrupt ible than I used to be.

Speaker 0

但确实如此。

But yeah.

Speaker 0

过去耗时的事情,现在所花的时间少了很多。

So the thing that used to take the time is now the thing that takes way, way less time.

Speaker 0

这对我们在其他方面所做的一切意味着什么?

What does that mean for everything else that we do?

Speaker 0

这不仅仅影响程序员。

And that doesn't just affect programmers.

Speaker 0

它影响着整个软件开发团队及其周边的团队。

It affects entire, like, teams of teams around around software development.

Speaker 0

但作为个体程序员,你必须开始思考:好吧。

But as an individual programmer, you have to start thinking, okay.

Speaker 0

我现在能在写100行代码的时间内写出10000行代码。

I can churn out 10,000 lines of code now in the time that it's saying it take me to write a 100.

Speaker 0

我该如何让这些代码变得优质?

How do I make that code good?

Speaker 0

对吧?

Right?

Speaker 0

我如何确保自己不是在不断产出一堆垃圾,最终积累成拖慢我的技术债务?

How do I make sure that I'm not just churning out total slop that that adds up to technical debt that slows me down?

Speaker 0

我如何利用代码现在变得廉价这一事实,来产出更好的代码?

How do I take the fact that code is now cheap and use that to produce better code?

Speaker 0

因为我不只是想要廉价的代码。

Because I don't don't just want cheap code.

Speaker 0

我想写出真正优秀的代码,能够实现我需要的功能,便于未来扩展,并具备所有那些有用且适用于生产环境的代码特征。

I want really good code that does what I need it to do, like an extend in the future, that's got all of those those characteristics of of of of code that that's that's useful and and can be used in production.

Speaker 1

你之前提到的一个观点,我认为在这方面非常重要,那就是当你启动一个项目时,可以同时启动三个不同版本,这有助于你确定方向。

The point you made earlier, I think, is a really important one along these lines, which is when you start a project, you fire off three different versions of it, and that helps you pick a direction.

Speaker 1

而这一切之所以可能,是因为现在的代码成本已经非常低了。

And that's only possible because code is so cheap now.

Speaker 1

对吧?

Right?

Speaker 0

是的。

Right.

Speaker 0

我认为原型制作几乎是免费的。

Prototyping is almost free, I think.

Speaker 0

这对我影响很大,因为在我整个职业生涯中,我的优势一直是原型制作。

And that really impacts me because throughout my entire career, my superpower has been prototyping.

Speaker 0

我非常擅长快速做出可用的原型。

Like, I am very I've been very quick at knocking out working prototypes of things.

Speaker 0

我就是那种能走进会议说:看。

I'm the person who can show up at a meeting and say, look.

Speaker 0

这是它可能的工作方式。

Here's how it could work.

Speaker 0

这曾经是我的独特卖点,但现在消失了。

And that's that was kind of my my unique selling point, and that's gone.

Speaker 0

任何人都能做我以前能做的事。

Anyone can do what I could do.

Speaker 0

你知道吧?

You know?

Speaker 0

但你仍然需要学会在什么时候适合做原型,如何思考原型设计,以及如何获取工具来构建有用的原型,以便用来探索各种可能性。

It's like but but but it does but you still have to learn when it's appropriate to prototype, how to think about prototyping, how to get the tools to build useful prototypes that you can you can use to explore things.

Speaker 1

我非常兴奋地向大家介绍本季的赞助商——Vanta。

I am so excited to tell you about this season's supporting sponsor, Vanta.

Speaker 1

Vanta 帮助包括 Cursor、Ramp、Duolingo、Snowflake 和 Atlassian 在内的超过 15,000 家公司赢得并证明客户的信任。

Vanta helps over 15,000 companies like Cursor, Ramp, Duolingo, Snowflake, and Atlassian earn and prove trust with their customers.

Speaker 1

由于人工智能的发展,团队正在以前所未有的速度构建和发布产品,但随之而来的,是产品和业务中引入的风险比以往任何时候都更高。

Teams are building and shipping products faster than ever thanks to AI, but as a result, the amount of risk being introduced into your product and your business is higher than it's ever been.

Speaker 1

我接触过的每一位安全负责人,都深感保护公司、业务,更不用说客户数据的压力日益加重。

Every security leader that I talk to is feeling the increasing weight of protecting their organization, their business, and not to mention their customer data.

Speaker 1

由于一切进展太快,他们不得不不断应对突发状况,猜测优先级,并勉强使用过时的解决方案。

Because things are moving so fast, they are constantly reacting, having to guess at priorities, and having to make do with outdated solutions.

Speaker 1

Vanta 通过自动化合规与风险管理,支持超过 35 个安全与隐私框架,包括 SOC 2、ISO 27001 和 HIPAA。

Vanta automates compliance and risk management with over 35 and privacy frameworks, including SOC two, ISO 27,001, and HIPAA.

Speaker 1

这帮助公司快速实现合规并持续保持合规。

This helps companies get compliant fast and stay compliant.

Speaker 1

与以往任何时候相比,信任更能决定你业务的成败。

More than ever before, trust has the power to make or break your business.

Speaker 1

了解更多,请访问 vanta.com/lenny。

Learn more at vanta.com/lenny.

Speaker 1

作为本播客的听众,您可以享受 Vanta 1000 美元的优惠。

As a listener of this podcast, you get $1,000 off Vanta.

Speaker 1

那就是 vanta.com/lenny。

That's vanta.com/lenny.

Speaker 1

我要岔开一下话题。

I'm gonna take a tangent.

Speaker 1

你的 AI 技术栈中包含哪些工具?

What's what's kind of in your stack, your AI stack?

Speaker 1

你最常使用哪些模型?

What models are you using most?

Speaker 1

你觉得哪些工具有用?

What tools do you find useful?

Speaker 0

目前我主要用Claude。

So right now, I'm mostly Claude.

Speaker 0

我大量使用Claude进行编码工作。

I do a huge amount of work using Claude code.

Speaker 0

我主要还是个Claude编码用户,但Claude编码有两个方面是我常用的。

Well, I'm I'm mainly still a Claude code person, but there are two sides of Claude code that I use.

Speaker 0

一个是运行在你电脑上的Claude编码,另一个是网页版的Claude编码,也就是他们托管的版本。

There's the Claude code that runs on your computer, And then there's Claude code for web, which is their hosted version of Claude code.

Speaker 0

我更常使用网页版,部分原因是它可以通过手机访问。

And I use that one more than the one on my own computer, partly because that's the one you can access through your phone.

Speaker 0

如果你在iPhone上安装了Anthropic Claude应用,里面有个编码标签页,你可以进去让它帮你写代码。

If you've got the Anthropic Claude app installed on iPhone, there's a code tab, and you can go in there, and you can tell it to write you things.

Speaker 0

而它是在他们的服务器上运行的。

And that it's running on their servers.

Speaker 0

你需要提供一个你自己的 GitHub 仓库,让它能在其中工作。

You get need to give it a GitHub repository of yours that it can work within.

Speaker 0

但从安全角度来看,这也很棒,因为如果你在笔记本电脑上运行 Claude 代码,可能会发生一些坏事。

But it's also great from a security point of view because if you're running Claude code on your laptop, there's risks that bad things can happen.

Speaker 0

它可能会不小心删除一些东西。

It might accidentally delete things.

Speaker 0

如果我在 Anthropic 的服务器上运行,我根本不在乎。

If I'm running on Anthropic servers, I couldn't care less.

Speaker 0

毕竟,那是他们的电脑。

Like, it's their computer.

Speaker 0

不是我的电脑。

It's not my computer.

Speaker 0

随便用吧。

Go wild.

Speaker 0

这意味着你可以以随心所欲的方式运行这些操作。

So this means that you can run these things in, in YOLO mode.

Speaker 0

这被称为危险地跳过权限,Claude就是这样说的。

This is, Claude calls it dangerously skip permissions.

Speaker 0

OpenAI 确实称之为 YOLO。

OpenAI actually do call it YOLO.

Speaker 0

他们提供了这个选项。

They've got an option for that.

Speaker 0

在这种模式下,智能体不会每次都问你是否要执行某个操作。

And that's the mode where the agent doesn't ask you if it should do something all the time.

Speaker 0

这是一款完全不同的产品。

And that is a different product.

Speaker 0

我认为,很多还没有接触编码智能体的人,还没试过在不安全模式下使用它们。

I think a lot of people who haven't got on board with coding agents yet haven't tried them in the unsafe mode.

Speaker 0

他们使用的编码智能体总是像在问:‘我可以运行这段代码吗?’

They're using coding agent where it's like, oh, can I run this piece of code?

Speaker 0

我可以编辑这个文件吗?

Can I edit this file?

Speaker 0

这意味着你必须全程全神贯注地盯着它。

And that means you have to pay complete attention to it the whole time.

Speaker 0

这就像是在和一个特别烦人的幼儿打交道,它不停地缠着你问想做什么。

And it's like working with a really frustrating toddler that's constantly nagging you about what it wants to do.

Speaker 0

一旦解除安全限制,我就能同时运行四个,然后去喝杯茶,回来时它们已经为我完成了些有用的事情。

The moment you take the safeties off, now I can run four of them and go and have, like, go and go and have a cup of tea and come back and they've they've achieved something useful for me.

Speaker 0

但这种方式本质上是不安全的。

But it's inherently unsafe.

Speaker 0

如果它在Claude Code for Web中运行,最坏的情况可能只是不小心泄露了你的私有源代码。

If it's running in Claude Code for Web, the only bad thing that could happen is maybe it accidentally leaks your private source code.

Speaker 0

而我的代码都是开源的,所以我并不在意。

And my code is all open source, so I don't care.

Speaker 0

这其实是个挺有用的技巧。

That's that's a useful trick there.

Speaker 0

但确实如此。

But yeah.

Speaker 0

所以我会在手机上使用它。

So I use that on my phone.

Speaker 0

我经常同时运行两三个这样的程序。

I often have two or three of those running.

Speaker 0

我的许多主要项目都是通过在手机上提示完成的。

A lot of my major projects are done mostly prompting on my phone.

Speaker 0

如果涉及安全或非常重要,我可能会把它移到笔记本电脑上进行后续详细审查。

If it's security adjacent or super important, I might pull it down to my laptop to do a thorough review later on.

Speaker 0

但大多数审查工作都可以通过 GitHub 完成。

But most of the review you can do through GitHub.

Speaker 0

这些工具会提交拉取请求,然后你可以使用审查他人代码的相同工具来审查智能体生成的代码。

Like these things will file pull requests, and then you use the same tools you'd use to review code from other people to review the code from the agents.

Speaker 0

话说回来,OpenAI 大约三周前发布了 GPT 5.4。

That said, OpenAI came out with GPT 5.4 about three weeks ago.

Speaker 0

它非常、非常、非常好。

It's very, very, very good.

Speaker 0

我认为它与Cloud Opus 4.6不相上下,甚至可能更好。

I think it's on par with Cloud Opus 4.6 and possibly even better.

Speaker 0

这些公司一直在不断超越彼此。

These companies are constantly leapfrogging each other.

Speaker 0

所以我最近更多地依赖它,而且也更便宜。

So I have been use leaning back it's also cheaper.

Speaker 0

所以这个月我更多地使用了GPT 5.4和OpenAI Codex。

So I've been leaning on GPT 5.4 a lot more this month and OpenAI Codex.

Speaker 0

现在OpenAI Codex和CloudCode几乎完全难以区分。

And OpenAI Codex and CloudCode are almost almost indistinguishable from each other now.

Speaker 0

它们都是非常非常优秀的软件。

They're both very, very good pieces of software.

Speaker 0

我对此早有预料。

And I kind of expect this to happen.

Speaker 0

比如,下一个Gemini模型发布后,可能会在几个月内成为最好的编程模型,到时我可能会转用那个生态系统。

Like, the next Gemini model comes out might be become the best coding model for a couple of months, in which case I might switch myself into that ecosystem.

Speaker 0

部分原因是我自己也会写这方面的内容。

Partly because I write about this stuff as well.

Speaker 0

我喜欢尽可能熟悉各种产品。

I like to stay familiar with as many of the offerings as possible.

Speaker 0

但我还是不断回到Claude Code,主要是因为它符合我的品味。

But I keep on coming back to Claude Code mainly because it fits my taste.

Speaker 0

有个挺奇怪的现象:我对代码的运作方式有非常特定的偏好,而巧合的是,这恰好与Claude Code的风格一致,这还挺有意思的。

Like, there's this weird thing where I've got a very specific taste in how I like code to work, which coincidentally happens to map to how Claude code likes to work, which is kind of interesting.

Speaker 0

GPT 5.4几乎也符合我的口味,但还差那么一点。

And g p d 5.4, it's almost matches my taste, but not quite.

Speaker 0

也许这是因为我和Claude相处的时间更长。

And maybe that's because I've just spent more time with Claude.

Speaker 0

我的提示方式已经逐渐更适应Claude的思维方式。

My prompting style has evolved more to fit the Claude way of thinking.

Speaker 0

我不确定。

I don't know.

Speaker 0

这些东西也很奇怪。

This stuff's also weird.

Speaker 0

一切都是感觉驱动的。

It's vibes all the way down.

Speaker 1

这太有趣了。

That is so interesting.

Speaker 1

所以你指的是代码的风格,它生成的代码质量,而不是对话方式或用户体验。

So the taste is the code, the quality of the code it puts out is what you're talking about, not like the conversation and the UX.

Speaker 0

当然。

Absolutely.

Speaker 0

我不在乎它们怎么跟我对话。

Don't care about how they talk to me.

Speaker 0

我只是用它们来完成工作。

Like I'm using them to get stuff done.

Speaker 0

是的。

Yeah.

Speaker 1

是的。

Yeah.

Speaker 1

因为我在听你说话的时候就在想,是什么会让一个人持续使用某个模型?

Because I was thinking as you're talking, what is the thing that will get someone to stick with a model?

Speaker 1

可能就是你所描述的代码质量,它写代码的方式。

And it could be what you're describing the quality, the way it writes code.

Speaker 1

也可能是用户体验,或者是对话的氛围。

It could be the UX, it could be the conversation vibes.

Speaker 1

嗯,

Well,

Speaker 0

最能让人粘住的东西应该是记忆功能。

the stickiest thing is meant to be memory.

Speaker 0

比如,它们都有这些功能,会记住关于你的事情,但我非常讨厌这些功能,只要有可能我就会关掉,主要是因为作为一名AI研究员,我需要在提问时看到其他人看到的相同内容。

Like, the the all of the they they all have these features where they will remember things about you, and and I hate those features, and I turn them off wherever I can because mainly because as an AI researcher, I need to see what everyone else sees when I'm prompting.

Speaker 0

我不想对全世界说:天啊,看,这个东西现在能用了。

Like, I don't want to say to the world, oh my goodness, look, this thing works now.

Speaker 0

这对我有效,是因为它基于我之前的一些对话内容。

And it only works for me because it's based on previous, like, into previous conversations that I've had.

Speaker 0

也许我因此错过了一些非常重要的东西。

And maybe I'm missing out on something really important there.

Speaker 0

但记忆功能正是所有实验室都在努力用来增强用户粘性的那个功能。

But the the memory feature is is is that thing that all of the labs are trying to be more sticky with.

Speaker 0

话说回来,几周前当OpenAI的军事相关事件发生时,Anthropic趁机宣传说:嘿,为什么不转用Claude呢?

That said, when the whole the the OpenAI military stuff happened a few weeks ago, Anthropic track took advantage by saying, hey, why don't you move to Claude?

Speaker 0

他们的方式是在Claude的入门页面上写道:点击这个按钮,将你的记忆从ChatGPT转移过来,然后粘贴到ChatGPT中。

And the way they did that is they had a Claude onboarding page that said, transfer your memories from ChatGPT by clicking this button and then pasting it into ChatGPT.

Speaker 0

但那只是一个提示语。

And it was just a prompt.

Speaker 0

他们设置了一个提示语,内容是:嘿,ChatGPT。

They had a prompt, which was, hey, chat g p t.

Speaker 0

告诉我你记得的关于我的所有事情。

Tell me everything that you've remembered about me.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客