人工智能之母谈工作、机器人与为何世界模型是下一个前沿 | 费菲菲李博士 | Lenny's Podcast: Product | Career

本集简介

李飞飞博士被誉为“AI教母”。二十多年来，她始终处于人工智能重大突破的核心位置。她主导的ImageNet数据集点燃了我们正在经历的深度学习革命，曾任谷歌云首席AI科学家，执掌斯坦福人工智能实验室，并共同创立了斯坦福以人为本人工智能研究所。本次对话中，李飞飞将揭秘鲜为人知的人工智能发展史——包括九年前自称AI公司等于自寻死路的惊人事实。我们探讨： 1. ImageNet如何引爆当前AI革命 2. 为何世界模型与空间智能代表超越大语言模型的下一前沿 3. 李飞飞坚信AI不会取代人类但要求人类自我负责的原因 4. Marble从电影制作到心理研究的意外应用 5. 机器人技术相比语言模型的独特挑战及突破之道 6. 不同岗位参与AI发展的方式 —— 本期赞助商： Figma Make——将创意转化为代码的工具 Justworks——一站式中小企业HR解决方案 Sinch——为产品集成通讯、邮件与通话功能 —— 完整文字稿：https://www.lennysnewsletter.com/p/the-godmother-of-ai —— 付费用户专属要点总结： https://www.lennysnewsletter.com/i/178223233/my-biggest-takeaways-from-this-conversation —— 李飞飞博士联系方式： • 推特：https://x.com/drfeifei • 领英：https://www.linkedin.com/in/fei-fei-li-4541247 • World Labs：https://www.worldlabs.ai —— 主持人Lenny联系方式： • 通讯：https://www.lennysnewsletter.com • 推特：https://twitter.com/lennysan • 领英：https://www.linkedin.com/in/lennyrachitsky/ —— 本期时间轴： (00:00) 李飞飞博士介绍 (05:31) AI演进史 (09:37) ImageNet诞生记 (17:25) 深度学习崛起 (23:53) AI与AGI未来展望 (29:51) 世界模型解析 (40:45) AI与机器人领域的惨痛教训 (48:02) 革命性产品Marble发布 (51:00) Marble应用场景 (01:01:01) 创始人历程与洞见 (01:10:05) 斯坦福以人为本AI研究 (01:14:24) AI对各行业的影响 (01:18:16) 总结陈词 —— 参考文献：https://www.lennysnewsletter.com/p/the-godmother-of-ai —— 节目制作营销由Penname.co负责，赞助请联系podcast@lennyrachitsky.com —— 注：Lenny可能持有讨论企业的投资头寸更多内容请访问：www.lennysnewsletter.com

Dr. Fei-Fei Li is known as the “godmother of AI.” She’s been at the center of AI’s biggest breakthroughs for over two decades. She spearheaded ImageNet, the dataset that sparked the deep-learning revolution we’re living right now, served as Google Cloud’s Chief AI Scientist, directed Stanford’s Artificial Intelligence Lab, and co-founded Stanford’s Institute for Human-Centered AI. In this conversation, Fei-Fei shares the rarely told history of how we got here—including the wild fact that just nine years ago, calling yourself an AI company was basically a death sentence. We discuss: 1. How ImageNet helped spark the AI explosion we’re living through 2. Why world models and spatial intelligence represent the next frontier in AI, beyond large language models 3. Why Fei-Fei believes AI won’t replace humans but will require us to take responsibility for ourselves 4. The surprising applications of Marble, from movie production to psychological research 5. Why robotics faces unique challenges compared with language models and what’s needed to overcome them 6. How to participate in AI regardless of your role — Brought to you by: Figma Make—A prompt-to-code tool for making ideas real Justworks—The all-in-one HR solution for managing your small business with confidence Sinch—Build messaging, email, and calling into your product — Transcript: https://www.lennysnewsletter.com/p/the-godmother-of-ai — My biggest takeaways (for paid newsletter subscribers): https://www.lennysnewsletter.com/i/178223233/my-biggest-takeaways-from-this-conversation — Where to find Dr. Fei-Fei Li • X: https://x.com/drfeifei • LinkedIn: https://www.linkedin.com/in/fei-fei-li-4541247 • World Labs: https://www.worldlabs.ai — Where to find Lenny: • Newsletter: https://www.lennysnewsletter.com • X: https://twitter.com/lennysan • LinkedIn: https://www.linkedin.com/in/lennyrachitsky/ — In this episode, we cover: (00:00) Introduction to Dr. Fei-Fei Li (05:31) The evolution of AI (09:37) The birth of ImageNet (17:25) The rise of deep learning (23:53) The future of AI and AGI (29:51) Introduction to world models (40:45) The bitter lesson in AI and robotics (48:02) Introducing Marble, a revolutionary product (51:00) Applications and use cases of Marble (01:01:01) The founder’s journey and insights (01:10:05) Human-centered AI at Stanford (01:14:24) The role of AI in various professions (01:18:16) Conclusion and final thoughts — References: https://www.lennysnewsletter.com/p/the-godmother-of-ai — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email podcast@lennyrachitsky.com. — Lenny may be an investor in the companies discussed. To hear more, visit www.lennysnewsletter.com

双语字幕

仅展示文本字幕，不包含中文音频；想边听边看，请使用 Bayt 播客 App。

Speaker 0

很多人都称你为人工智能之母。

A lot of people call you the godmother of AI.

Speaker 0

你做的研究确实是引领我们走出AI寒冬的火种。

The work you did actually was the spark that brought us out of AI winter.

Speaker 1

在2015、2016年时，有些科技公司避免使用这个词，因为他们不确定AI是否是个负面词汇。

In the 2015, 2016, some tech companies avoid using the word because they were not sure if AI was a dirty word.

Speaker 1

2017年左右开始，企业纷纷自称AI公司。

Twenty seventeen ish was the beginning of companies calling themselves AI companies.

Speaker 0

有这么一句话，我记得是你向国会作证时说的。

There's this line, I think this was when you were presenting to congress.

Speaker 0

人工智能一点都不'人工'。

There's nothing artificial about AI.

Speaker 0

它的灵感来源于人类。

It's inspired by people.

Speaker 0

它由人类创造。

It's created by people.

Speaker 0

最重要的是，它影响着人类。

And most importantly, it impacts people.

Speaker 1

我并非认为AI不会对就业或人类产生影响。

It's not like I think AI will have no impact on jobs or people.

Speaker 1

事实上我相信，AI现在及未来能做什么完全取决于我们。

In fact, I believe that whatever AI does currently or in the future is up to us.

Speaker 1

取决于人类。

It's up to the people.

Speaker 1

我确实相信科技对人类总体是积极的，但每种技术都是一把双刃剑。

I do believe technology is a net positive for humanity, but I think every technology is a double edged sword.

Speaker 1

如果我们作为社会整体和个人都不做正确的事，同样可能把事情搞砸。

If we're not doing the right thing as a society, as individuals, we can screw this up as well.

Speaker 0

你获得了突破性的洞见：我们可以训练机器像人类一样思考，但它缺失了人类在童年时期必须学习的数据基础。

You had this breakthrough insight of just, okay, we can train machines to think like humans, but it's just missing the data that humans have to learn as a child.

Speaker 1

我选择从视觉智能的角度研究人工智能，因为人类本质上是高度视觉化的动物。

I chose to look at artificial intelligence through the lens of visual intelligence because humans are deeply visual animals.

Speaker 1

我们需要用尽可能多的物体图像信息来训练机器。

We need to train machines with as much information as possible on images of objects.

Speaker 1

但物体的学习过程非常非常困难。

But objects are very, very difficult to learn.

Speaker 1

单个物体在图像中可能呈现出无限种形态。

A single object can have infinite possibilities that is shown on an image.

Speaker 1

要让计算机掌握成千上万个物体概念，实际上需要展示数百万个示例。

In order to train computers with tens and thousands of object concepts, you really need to show it millions of examples.

Speaker 0

今天我的嘉宾是李飞飞博士，她被尊称为人工智能教母。

Today, my guest is doctor Fei Fei Li, who's known as the godmother of AI.

Speaker 0

飞飞主导并参与了许多引发当前AI革命的关键突破。

Fei Fei has been responsible for and at the center of many of the biggest breakthroughs that sparked the AI revolution that we are currently living through.

Speaker 0

她牵头创建了ImageNet项目，这个创举源于她认识到AI需要海量经过标注的干净数据才能变得更聪明。

She spearheaded the creation of ImageNet, which was basically her realizing that AI needed a ton of clean labeled data to get smarter.

Speaker 0

这个数据集最终成为推动当前AI模型构建与规模化方法的关键突破。

And that dataset became the breakthrough that led to the current approach to building and scaling AI models.

Speaker 0

她曾担任谷歌云的首席AI科学家，早期许多重大技术突破都源于此。

She was chief AI scientist at Google Cloud, which is where some of the biggest early technology breakthroughs emerged from.

Speaker 0

她曾任斯坦福人工智能实验室（SAIL）主任，培养出众多顶尖AI人才。

She was director at SAIL, Stanford's artificial intelligence lab, where many of the biggest AI minds came out of.

Speaker 0

她还是斯坦福以人为本AI研究所的联合创始人，该机构正引领AI发展方向。

She's also co creator of Stanford's Human Centered AI Institute, which is playing a vital role in a direction that AI is taking.

Speaker 0

她曾担任推特董事会成员。

She's also been on the board of Twitter.

Speaker 0

她被《时代》杂志评为AI领域最具影响力的100人之一。

She was named one of Time's 100 most influential people in AI.

Speaker 0

她还担任联合国顾问委员会委员。

She's also on the United Nations Advisory Board.

Speaker 0

我还能继续列举。

I could go on.

Speaker 0

在我们的对话中，李飞飞分享了AI发展简史，包括这个令人震撼的事实：十年前自称AI公司等于品牌自杀，因为当时没人相信AI能成功。

In our conversation, Fei Fei shares a brief history of how we got to today in the world of AI, including this mind blowing reminder that nine to ten years ago, calling yourself an AI company was basically a death knell for your brand because no one believed that AI was actually gonna work.

Speaker 0

如今情况已完全不同。

Today, it's completely different.

Speaker 0

每家公司都成了AI公司。

Every company is an AI company.

Speaker 0

我们还聊到她如何看待AI对未来人类的影响、现有技术能走多远、为何执着构建世界模型、世界模型究竟是什么，以及最激动人心的——全球首个大型世界模型Marble的发布，这个模型与本期节目同步面世。

We also chat about her take on how she sees AI impacting humanity in the future, how far current technologies will take us, why she's so passionate about building a world model, and what exactly world models are, and most exciting of all, the launch of the world's first large world model, Marble, which just came out as this podcast comes out.

Speaker 0

大家都可以通过marble.worldlabs.ai体验这个模型。

Anyone can go play with this at marble.worldlabs.ai.

Speaker 0

这太疯狂了。

It's insane.

Speaker 0

一定要去看看。

Definitely check it out.

Speaker 0

李飞飞的影响力如此深远却鲜为人知，她真的令人难以置信。能邀请到她并传播她的智慧让我非常兴奋。

Fei Fei is incredible and way too under the radar for the impact that she's had on the world, so I am really excited to have her on and to spread her wisdom with more people.

Speaker 0

特别感谢本·霍洛维茨和康多莉扎·赖斯为这次对话提供的建议话题。

A huge thank you to Ben Horowitz and Condoleezza Rice for suggesting topics for this conversation.

Speaker 0

如果你喜欢这期播客，别忘了在你常用的播客应用或YouTube上订阅关注。

If enjoy you this podcast, don't forget to subscribe and follow it in your favorite podcasting app or YouTube.

Speaker 0

接下来有请李飞飞博士，先插播一段赞助商信息。

With that, I bring you doctor Fei Fei Li after a short word from our sponsors.

Speaker 0

本期节目由Figma旗下产品Figma Make赞助播出。

This episode is brought to you by Figma, makers of Figma make.

Speaker 0

我在Airbnb担任产品经理时，还记得Figma问世后极大提升了我们的团队协作效率。

When I was a PM at Airbnb, I still remember when Figma came out and how much it improved how we operated as a team.

Speaker 0

突然间我就能让整个团队参与设计流程，快速反馈设计概念，这让整个产品开发过程变得有趣多了。

Suddenly, I could involve my whole team in the design process, give feedback on design concepts really quickly, and it just made the whole product development process so much more fun.

Speaker 0

但Figma总让我感觉不是为我量身打造的。

But Figma never felt like it was for me.

Speaker 0

它很适合提供设计反馈，但作为创造者，我更想动手制作东西。

It was great for giving feedback and designs, but as a builder, I wanted to make stuff.

Speaker 0

所以Figma开发了Figma Make。

That's why Figma built Figma Make.

Speaker 0

仅需几个提示，你就能将任何想法或设计转化为功能完备的原型或应用，任何人都可以在此基础上迭代并与客户验证。

With just a few prompts, you can make any idea or design into a fully functional prototype or app that anyone can iterate on and validate with customers.

Speaker 0

Figma Make 是一款与众不同的氛围编程工具。

Figma Make is a different kind of vibe coding tool.

Speaker 0

由于一切都在 Figma 中完成，你可以直接使用团队现有的设计组件库，轻松创建美观真实、且与团队构建方式无缝衔接的产出物。

Because it's all in Figma, you can use your team's existing design building blocks, making it easy to create outputs that look good and feel real and are connected to how your team builds.

Speaker 0

别再耗费大量时间向人们描述产品愿景，直接展示给他们看。

Stop spending so much time telling people about your product vision and instead show it to them.

Speaker 0

用 Figma 快速制作可逆向生成代码的原型和应用，访问 figma.com/leni 了解更多。

Make code back prototypes and apps fast with Figma Check it out at figma.com/leni.

Speaker 0

你知道吗？我有一个完整团队协助我制作播客和新闻简报。

Did you know that I have a whole team that helps me with my podcast and with my newsletter?

Speaker 0

我希望团队每位成员都能在工作中获得幸福感和成就感。

I want everyone on that team to be super happy and thrive in their roles.

Speaker 0

Justworks 明白你的员工远不止是雇员那么简单。

Justworks knows that your employees are more than just your employees.

Speaker 0

他们是你的伙伴。

They're your people.

Speaker 0

我的团队成员分布在科罗拉多、澳大利亚、尼泊尔、西非和旧金山。

My team is spread out across Colorado, Australia, Nepal, West Africa, and San Francisco.

Speaker 0

如果没有专业支持，跨国招聘、按时支付当地货币薪资以及7×24小时解答人力资源问题会让我的生活变得极其复杂。

My life would be so incredibly complicated to hire people internationally, to pay people on time and in their local currencies, and to answer their HR questions twenty four seven.

Speaker 0

但有了 Justworks，一切都变得非常简单。

But with Justworks, it's super easy.

Speaker 0

无论您是在设置自己的自动化薪资系统、提供优质福利，还是进行国际招聘，Justworks都能为您和您的团队提供简洁的软件解决方案，以及由中小企业专家提供的全天候人工支持。

Whether you're setting up your own automated payroll, offering premium benefits, or hiring internationally, Justworks offers simple software and twenty four seven human support from small business experts for you and your people.

Speaker 0

他们妥善处理您的人力资源事务，让您能更好地照顾您的团队。

They do your human resources right so that you can do right by your people.

Speaker 0

Justworks，为您的团队而生。

Justworks for your people.

Speaker 0

菲菲，非常感谢你能来参加节目，欢迎来到我们的播客。

Fei Fei, thank you so much for being here, welcome to the podcast.

Speaker 1

能来这里我很兴奋，Lenny。

I'm excited to be here, Lenny.

Speaker 0

你能来我更加兴奋。

I'm even more excited to have you here.

Speaker 0

能和你聊天真是件令人愉快的事。

It is such a treat to get to chat with you.

Speaker 0

我有太多话题想和你探讨了。

There's so much that I wanna talk about.

Speaker 0

你长期处于当前这场AI革命的核心位置。

You've been at the center of this AI explosion that we're seeing right now for so long.

Speaker 0

我们要聊很多不为人知的历史——关于这一切是如何开始的。

We're going to talk about a bunch of the history that I think a lot of people don't even know about how this whole thing started.

Speaker 0

但首先让我读一段《连线》杂志对你的评价，好让听众有个概念。开场时我会介绍你其他所有了不起的成就，但这段话很适合奠定基调。

But let me first read a quote from Wired about you just so people get a sense, and in the intro I'll share all of the other epic things you've done, but I think this is a good way to just set context.

Speaker 0

菲菲是为数不多的科学家之一——这个群体小到或许只需一张厨房餐桌就能坐下——正是他们造就了AI近期这些非凡的进步。

Fei Fei is one of the tiny group of scientists, a group perhaps small enough to fit around a kitchen table, who are responsible for AI s recent remarkable advances.

Speaker 0

很多人都称你为人工智能教母。

A lot of people call you the godmother of AI.

Speaker 0

与许多AI领域的领导者不同，你是个AI乐观主义者。

And unlike a lot of AI leaders, you're an AI optimist.

Speaker 0

你不认为AI会取代我们，不认为它会抢走所有工作，也不认为它会毁灭人类。

You don't think AI is going to replace us, don't think it's going take all our jobs, don't think it's going to kill us.

Speaker 0

所以我觉得从这里开始讨论会很有趣。

So I thought it would be fun to start there.

Speaker 0

你如何看待AI将如何随时间推移影响人类？

Just what's your perspective on how AI is going to impact humanity over time?

Speaker 1

是的。

Yeah.

Speaker 1

好的。

Okay.

Speaker 1

那么莱尼，让我说清楚。

So Lenny, let me be very clear.

Speaker 1

我不是个乌托邦主义者。

I'm not a utopian.

Speaker 1

所以我并非认为AI不会对工作和人类产生影响。

So it's not like I think AI will have no impact on jobs or people.

Speaker 1

事实上，我是个人文主义者。

In fact, I'm a humanist.

Speaker 1

我相信AI现在或将来所做的一切都取决于我们。

I believe that whatever AI does currently or in the future is up to us.

Speaker 1

这取决于人民。

It's up to the people.

Speaker 1

因此我坚信，从文明发展的长远历程来看，技术对人类整体是利大于弊的。

So I do believe technology is a net positive for humanity if you look at the long course of civilization.

Speaker 1

我认为人类本质上就是一个不断创新的物种——从数千年前有文字记录至今，人类始终在自我革新，也在持续改进工具。

I think we are an fundamentally, we're an innovative species that we you know, if you look at from, you know, written record thousands of years ago to now, humans just kept innovating ourselves and innovating our tools.

Speaker 1

正因如此，我们才能不断改善生活。

And with that, we make lives better.

Speaker 1

我们优化工作方式。

We make work better.

Speaker 1

我们建设文明。

We build civilization.

Speaker 1

而我确信人工智能也是这一进程的一部分。

And I do believe AI is part of that.

Speaker 1

这就是乐观主义的根源所在。

So that's where the optimism comes from.

Speaker 1

但每种技术都是一把双刃剑。

But I think every technology is a double edged sword.

Speaker 1

如果我们作为物种、作为社会、作为群体、作为个体未能做出正确抉择，同样可能搞砸这一切。

And if we're not doing the right thing as a species, as a society, as communities, as individuals, we can screw this up as well.

Speaker 0

有这么一句话。

There's this line.

Speaker 0

我记得这是你在国会作证时说的。

I think this was when you were presenting to Congress.

Speaker 0

人工智能并不虚假。

There's nothing artificial about AI.

Speaker 0

它源于人类的灵感。

It's inspired by people.

Speaker 0

它由人类创造。

It's created by people.

Speaker 0

最重要的是，它影响着人类。

And most importantly, it impacts people.

Speaker 0

我对此没有疑问，但这句话说得真好。

I don't have a question there, but what a great line.

Speaker 1

是啊。

Yeah.

Speaker 1

我深有感触。

I feel pretty deeply.

Speaker 1

我二十五年前开始从事AI工作，过去二十年一直在培养学生。

I started working in AI two and a half decades ago, and I've been having students for the past two decades.

Speaker 1

几乎每个毕业的学生，我都会提醒他们，当他们从我的实验室毕业时，你们的领域虽然叫人工智能，但它一点都不虚假。

And almost every student who graduates, I remind them, you know, when they graduate from my lab, that your field is called artificial intelligence, but there's nothing artificial about it.

Speaker 0

回到你刚才提到的观点，关于这一切的发展方向其实取决于我们，你认为我们需要做好哪些关键点？

Coming back to the point you just made about how it's kind of up to us about where this all goes, what is it you think we need to get right?

Speaker 0

我们该如何规划发展路径？

How do we set things on a path?

Speaker 0

我知道这个问题很难回答，但你的建议是什么？

I know this is a very difficult question to answer, but just what should what's your advice?

Speaker 0

你认为我们应该

What do you think we should

Speaker 1

保留是的。

be keeping Yeah.

Speaker 1

比如我们还有多少小时？

Like in how many hours do we have?

Speaker 0

我们如何对齐人工智能？

How do we align AI?

Speaker 0

这就对了。

There we go.

Speaker 1

让我们是的。

Let's Yes.

Speaker 1

解决所以我认为无论做什么，人们都应该成为负责任的个体。

Solve So I think people should be responsible individuals no matter what we do.

Speaker 1

这是我们教育孩子的理念，也是我们作为成年人需要做到的，无论参与人工智能的哪个环节——开发、部署或其他方面。很可能我们许多人，尤其是技术人员，会同时涉足多个环节。

This is what we teach our children, and this is what we need to do as grown ups as well, no matter which part of the AI development or AI deployment or participating in, and most likely many of us, especially as technologists, we're in multiple points.

Speaker 1

我们应当以负责任的个体行事，并真正重视这件事，非常重视。

We should act like responsible individuals and care about this, actually care a lot about this.

Speaker 1

我认为当今每个人都应该关注人工智能，因为它将影响你的个人生活。

I think everybody today should care about AI because it is going to impact your individual life.

Speaker 1

它将影响你所在的社区。

It is going to impact your community.

Speaker 1

它还将影响社会和未来世代。

It's gonna impact the the society and the future generation.

Speaker 1

作为一个负责任的人去关心它，这是第一步，也是最重要的一步。

And caring about it as a responsible person is the first, but also the most important step.

Speaker 0

好的。

Okay.

Speaker 0

那么让我先退一步，从人工智能的起源说起。

So let me actually take a step back and kind of go to the beginning of AI.

Speaker 0

大多数人开始听说并关注如今被称为AI的东西，大概就是几年前ChatGPT问世的时候，可能是三年前吧。

Most people started hearing and caring about AI, is what it's called today, just like, I don't know, a few years ago when ChatGPT came out, maybe it was like three years ago.

Speaker 1

三年前，准确说再过一个月就满三年了。

Three years ago, almost one more month three years ago.

Speaker 0

哇。

Wow.

Speaker 0

好的。

Okay.

Speaker 0

那时候是JatGPT发布吗？

And that was JatGPT coming out?

Speaker 0

这是你心中的里程碑事件吗？

Is that the milestone you have in mind?

Speaker 0

好的。

Okay.

Speaker 0

酷。

Cool.

Speaker 0

这正是我的看法。

That's exactly how I saw it.

Speaker 0

但很少有人知道，人们研究这个领域已有很长的历史，当时被称为机器学习，还有其他术语，而现在一切都叫AI。

But very few people know there was a long, long history of people working on it was called machine learning back then, there's other terms, and now it's just everything's AI.

Speaker 0

曾有很长一段时间很多人投身其中，后来出现了所谓的'AI寒冬'，那时人们几乎放弃了，大多数人都觉得这个理念不会有进展。

And there was kind of like a long period of just a lot of people working on it, and then there's this, what people refer to as the AI winter where people just gave up almost, most people did and just, okay, idea isn't going anywhere.

Speaker 0

而你做的工作实际上成为了带领我们走出AI寒冬的火种，直接造就了当下这个言必谈AI的世界——正如你所说，它将影响我们的一切行为。

And then the work you did actually was essentially the spark that brought us out of AI winter and is directly responsible for the world we're in now of just AI is all we talk about as you just said, it's going to impact everything we do.

Speaker 0

所以我想听你讲讲这段历史会很有趣：ImageNet之前的世界是怎样的？你创建ImageNet的工作为何如此重要？以及之后发生了什么？

So I thought it be really interesting to hear from you, just kind of like the brief history of what the world was like before ImageNet, then just the work you did to create ImageNet, why that was so important, and then just what happened after.

Speaker 1

对我来说很难意识到AI对所有人来说都是新事物。

It is for me hard to keep in mind that AI is so new for everybody.

Speaker 1

当我将整个职业生涯奉献给AI时，看到自己刚成年时萌生的好奇心如今成为文明的变革力量，有种难以言喻的满足感。

When I lived my entire professional life in AI, there's a part of me that is just so satisfying to see a personal curiosity that I started barely out of teenage hood and now has become a transformative force of our civilization.

Speaker 1

它本质上属于文明级技术。

It generally is a civilizational level technology.

Speaker 1

这段旅程大约三十年，或者说二十多年，确实令人欣慰。

That journey is about thirty years or twenty something, twenty plus years, and it's just very satisfying.

Speaker 1

那么这一切始于何处？

So where did it all start?

Speaker 1

其实我甚至不是第一代AI研究者。

Well, I'm not even the first generation AI researcher.

Speaker 1

第一代真正要追溯到50到60年代。

The first generation really dates back to the 50s and 60s.

Speaker 1

要知道，艾伦·图灵在40年代就超前地提出了大胆诘问：'是否存在会思考的机器？'

And, you know, Alan Turing was ahead of his time in the 40s by asking daring humanity with the question, is there thinking machines?

Speaker 1

对吧？

Right?

Speaker 1

当然，他有一套独特的方法来测试这种思维机器的概念——通过对话式聊天机器人，按照他的标准，我们现在确实拥有了一台思维机器。

And of course, he has a specific way of testing this concept of thinking machine, which is a conversational chatbot, which to his standard, we now have a thinking machine.

Speaker 1

但那更多只是轶事性的启发。

But that was just a more anecdotal inspiration.

Speaker 1

这一领域真正始于50年代，当时计算机科学家们齐聚一堂，研究如何利用计算机程序和算法构建那些原本仅靠人类认知才能完成的任务。

The field really began in the 50s when computer scientists came together and looked at how we can use computer programs and algorithms to build these programs that can do things that have been only incapable by human cognition.

Speaker 1

这就是开端。

And that was the beginning.

Speaker 1

1956年达特茅斯会议的创始人们奠定了基础。

And the founding fathers, the Dartmouth Workshop in 1956.

Speaker 1

后来加入斯坦福的约翰·麦卡锡教授创造了'人工智能'这个术语。

You know, we have Professor John McCarthy who later came to Stanford who coined the term artificial intelligence.

Speaker 1

在50至80年代间，是人工智能探索的早期阶段。

And between the 50s, 60s, 70s, and 80s, it was the early days of AI exploration.

Speaker 1

那时我们有逻辑系统。

And we had logic systems.

Speaker 1

我们有专家系统。

We had expert systems.

Speaker 1

还进行了神经网络的早期探索。

We also had early exploration of neural network.

Speaker 1

随后发展到80年代末、90年代直至21世纪初期。

And then it came to around the late eighties, the nineties, and the the very beginning of the twenty first century.

Speaker 1

那大约二十年的时间跨度，实际上是机器学习的开端。

That stretch about twenty years is actually the beginning of machine learning.

Speaker 1

这是计算机编程与统计学习的结合。

It's the marriage between computer programming and statistical learning.

Speaker 1

这种结合为人工智能带来了一个极其关键的概念：纯粹基于规则的程序无法实现我们想象中计算机应具备的广泛认知能力。

And that marriage brought a very, very critical concept into AI, which is that purely rule based program is not going to account for the vast amount of cognitive capabilities that we imagine computers can do.

Speaker 1

因此我们必须让机器学会识别模式。

So we have to use machines to learn the patterns.

Speaker 1

一旦机器能学会这些模式，就有希望做更多事情。

Once the machines can learn the patterns, it has the hope to do more things.

Speaker 1

例如，如果你给机器看三只猫，希望不仅是让它识别这三只猫。

For example, if you give it three cats, the hope is not just for the machines to recognize these three cats.

Speaker 1

而是希望机器能识别第四只、第五只、第六只以及所有其他的猫。

The hope is the machines can recognize the fourth cat, the fifth cat, the sixth cat, and all the other cats.

Speaker 1

这种学习能力是人类和许多动物的基本能力。

And that's a learning ability that is fundamental to humans and many animals.

Speaker 1

而我们这个领域意识到，我们需要机器学习。

And we, as a field, realize we need machine learning.

Speaker 1

这种情况一直持续到二十一世纪初。

So that was up till the beginning of the twenty first century.

Speaker 1

我正是在2000年进入人工智能领域的。

I entered the field of AI literally in the year of 2000.

Speaker 1

那一年我开始在加州理工学院攻读博士学位。

That's when my PhD began at Caltech.

Speaker 1

因此，我是第一代机器学习研究者之一，当时我们已经在研究机器学习这一概念，尤其是神经网络。

And so I was one of the first generation machine learning researchers, and we were already studying this concept of machine learning, especially the neural network.

Speaker 1

我记得在加州理工学院的第一批课程中，就有一门叫‘神经网络’。

I remember that was one of my first courses in at Caltech is called neural network.

Speaker 1

但那段时期非常艰难。

But it was very painful.

Speaker 1

当时正值所谓的‘AI寒冬’，意味着公众对此并不太关注。

It was still smack in the middle of the so called AI winter, meaning the public didn't look at this too much.

Speaker 1

资金并不充裕，但各种想法却在不断涌现。

There wasn't that much funding, but there was also a lot of ideas flowing around.

Speaker 1

我认为有两件事让我个人的职业生涯与现代AI的诞生如此紧密相连：一是选择通过视觉智能的视角来研究人工智能，因为人类本质上是高度视觉化的动物。

And I think two things happened to myself that brought my own career so close to the birth of modern AI is that I chose to look at artificial intelligence through the lens of visual intelligence because humans are deeply visual animals.

Speaker 1

我们稍后可以再详谈，但人类智能的很大一部分建立在视觉感知和空间理解上，而不仅仅是语言本身。

We can talk a little more later, but so much of our intelligence is built upon visual perceptual, spatial understanding, not just language per se.

Speaker 1

我认为它们是互补的。

I think they're complementary.

Speaker 1

所以我选择研究视觉智能。

So I chose to look at visual intelligence.

Speaker 1

在我的博士和早期教授生涯中，我和我的学生们始终致力于一个核心问题——解决物体识别问题，因为这是感知世界的基石。

And my PhD and my early professor years, I my students and I are very committed to a North Star problem, which is solving the problem of object recognition because it's a building block for the perceptual world.

Speaker 1

对吧？

Right?

Speaker 1

我们通过物体层面的解读、推理和互动来认知这个世界。

We go around the world interpreting, reasoning, and interacting with it more or less at the object level.

Speaker 1

我们并不在分子层面上与世界互动。

We don't interact with the world at the molecular level.

Speaker 1

我们与世界互动的方式并非如偶尔那般细致，比如当你提起茶壶时，不会先分解它由100片瓷片组成，再逐一处理这些部件。

We don't interact with the world as we sometimes do, but we rarely, for example, if you want to lift a teapot, you don't say, okay, the teapot is made of a 100 pieces of porcelain and let me work on this a 100 pieces.

Speaker 1

你会将其视为一个整体对象来与之互动。

You look at this as one object and interact with it.

Speaker 1

因此对象概念至关重要。

So object is really important.

Speaker 1

所以我成为最早将这一问题定位为'北极星难题'的研究者之一。

So I was among the first researchers to identify this as a North Star problem.

Speaker 1

但作为AI领域的学生继而成为研究者，我那时正致力于各种数学模型研究——神经网络、贝叶斯网络以及众多其他模型。

But I think what happened is that as a student of AI and then a researcher of AI, I was working on all kinds of mathematical models, including neural network, including Bayesian network, including many, many models.

Speaker 1

而最突出的痛点在于：这些模型缺乏训练数据。

And there was one singular pain point, is that these models don't have data to be trained on.

Speaker 1

整个领域过度聚焦模型本身，直到我意识到人类学习与生物进化本质上都是大数据学习过程。

And as a field, we were so focusing on these models, but it dawned on me that human learning, as well as evolution is actually a big data learning process.

Speaker 1

人类通过持续不断的丰富经验进行学习。

Humans learn with so much experience, you know, constantly.

Speaker 1

纵观进化史，生物正是通过感知世界逐步演化的。

And evolution, if you look at time, animals evolve with just experiencing the world.

Speaker 1

因此我和学生提出假设：实现人工智能最被忽视的关键要素正是大数据。

So I think my student and and I conjectured that a very critically overlooked ingredient of bringing AI to life is big data.

Speaker 1

于是我们在2006至2007年间启动了ImageDev项目。

And then we began this ImageDev project in 2006, 2007.

Speaker 1

我们当时非常有野心。

We were very ambitious.

Speaker 1

我们想要获取互联网上所有物体的图像数据。

We wanna get the entire Internet's image data on objects.

Speaker 1

当然，那时的互联网规模比现在小得多。

Now granted, Internet was a lot smaller than today.

Speaker 1

所以我觉得这个目标至少不算太疯狂。

So I felt like that ambition was at least not too crazy.

Speaker 1

现在想想，几个研究生和一位教授能完成这事简直是痴人说梦。

Now it's totally delusional to think a couple of graduate students and a professor can do this.

Speaker 1

但这就是我们当时做的事。

But and that's what we did.

Speaker 1

我们精心筛选了互联网上1500万张图片，建立了包含22000个概念的分类体系，借鉴了语言学家在WordNet等其他研究者的成果——这是一种特殊的词典编纂方式。

We curated very carefully 15,000,000 images on the Internet, created a taxonomy of 22,000 concepts, borrowing other researchers' work like linguists' work on WordNet, and it's a particular way of dictionarying words.

Speaker 1

我们将这些整合成ImageNet，并向研究社区开源。

And we combine that into ImageNet, and we open source that to the research community.

Speaker 1

我们每年举办ImageNet挑战赛来鼓励大家参与。

We held an annual ImageNet challenge to encourage everybody to participate in this.

Speaker 1

我们继续着自己的研究，但2012年被许多人视为深度学习元年或现代AI的诞生时刻——当时多伦多大学Jeff Hinton教授带领的团队参加ImageNet挑战赛，利用ImageNet大数据和英伟达的两块GPU，首次成功开发出神经网络算法。虽然未能彻底解决，但在物体识别问题上取得了重大突破。

We continue to do our own research, but 2012 was the moment that many people think was the beginning of the deep learning or birth of modern AI because a group of Toronto researchers led by Professor Jeff Hinton participated in ImageNet challenge, used the ImageNet big data and two GPUs from NVIDIA and created successfully the first neural network algorithm that can it didn't fundamentally it didn't totally solve, but made a huge progress towards solving the problem of object recognition.

Speaker 1

这种结合了TRIO技术、大数据、神经网络和GPU的配方，堪称现代AI的黄金配方。

And that combination of the TRIO technology, big data, neural network, and GPU was kind of the golden recipe for modern AI.

Speaker 1

之后快进到AI的公众时刻——也就是ChatGPT问世之时。

And then fast forward the the the public moment of AI, which is the ChatGPT moment.

Speaker 1

如果你审视ChatGPT诞生的关键要素，从技术层面看，它依然依赖于这三个核心成分。

If you look at the ingredients of what brought ChatGPT to to the to the world, technically, it still use these three ingredients.

Speaker 1

如今的数据规模已达互联网级别，主要是文本数据，神经网络架构也比2012年复杂得多，但本质上仍是神经网络。

Now it's Internet scale data, mostly texts, is a much more complex neural network architecture than 20 '12, but it's still neural network.

Speaker 1

GPU数量大幅增加，但核心仍是GPU。

And a lot more GPUs, but it's still GPUs.

Speaker 1

因此这三个要素依然是现代人工智能的核心基础。

So these three ingredients are still to at the core of modern AI.

Speaker 0

太不可思议了。

Incredible.

Speaker 0

我之前从未听过完整的故事版本。

I have never heard that full story before.

Speaker 0

我最喜欢最初只用两块GPU这个细节。

I love that it was two GPUs was the first.

Speaker 0

这点特别打动我。

I love that.

Speaker 0

是啊。

Yeah.

Speaker 0

而现在规模已经达到——我不确定具体数字——几十万块了吧？

And now it's, I don't know, hundreds of thousands, right?

Speaker 0

性能更是呈指数级提升。

That are orders of magnitudes more powerful.

Speaker 0

没错。

Yep.

Speaker 0

他们刚买的那两块显卡，就像是游戏显卡。

And those two GPUs where they just bought, they were like gaming GPUs.

Speaker 0

他们就直接去了游戏商店，对吧？

They just went to the like the game store, right?

Speaker 0

就是人们用来玩游戏的那种。

That people use for playing games.

Speaker 0

正如你所说，这仍然是模型变得更智能的主要方式。

As you said, this continues to be in a large way the way models get smarter.

Speaker 0

目前世界上一些增长最快的公司，我几乎都邀请过他们上播客，比如Merkor、Surge和Scale。

Some of the fastest growing companies in the world right now, I've had them all mostly on the podcast, Merkor and Surge and Scale.

Speaker 0

他们就是这样做的，持续为实验室提供这种服务。

Like they do this, they continue to do this for labs.

Speaker 0

只需要给他们越来越多他们最感兴趣的标注数据。

Just give them more and more labeled data of the things they're most excited about.

Speaker 1

是啊。

Yeah.

Speaker 1

我记得Scale的Alex Wong，很早的时候。

I remember Alex Wong from scale, very early days.

Speaker 1

可能还保留着他创办Scale时的邮件。

Probably still has his emails when he was starting scale.

Speaker 1

他非常友善。

He he was very kind.

Speaker 1

他不断给我发邮件讲述ImageNet如何启发了Scale。

He keeps sending me emails about how ImageNet inspired scale.

Speaker 1

看到这个我非常高兴。

I was very pleased to see that.

Speaker 0

你刚才分享的内容中，我最喜欢的另一个要点就是这种高度自主性和执行力的事例。

One of my other favorite takeaways from what you just shared is just such an example of high agency and just doing things.

Speaker 0

这在推特上有点像是个梗。

That's kind of a meme on Twitter.

Speaker 0

就是你可以直接去做事。

Just you can just do things.

Speaker 0

你就会觉得，好吧。

You're just like, okay.

Speaker 0

这对推动AI发展可能是必要的。

This is probably necessary to move AI.

Speaker 0

那时候它被称为机器学习。

And it's called machine learning back then.

Speaker 0

对吧？

Right?

Speaker 0

这是大多数人用的术语吗？

Was that the term most people used?

Speaker 1

我觉得这两个词是混用的。

I think it was interchangeably.

Speaker 1

确实如此。

It's true.

Speaker 1

比如，我确实记得那些科技公司。

Like, I do remember the companies, the tech companies.

Speaker 1

我不打算点名，但记得早期有次谈话，大概是在2015、2016年的时候。

I 'm not going to name names, but I was in a conversation in one of the early days, I think it is in the 2015, 2016.

Speaker 1

有些科技公司避免使用'AI'这个词，因为他们不确定这是否是个负面词汇。

Some tech companies avoid using the word AI because they were not sure if AI was a dirty word.

Speaker 1

记得我当时其实在鼓励大家使用'AI'这个词，因为在我看来，这是人类在科技探索中提出的最大胆的问题之一。

And I remember I was actually encouraging everybody to use the word AI because to me, that is one of the most audacious questions humanity has ever asked in our quest for science and technology.

Speaker 1

我对这个术语感到非常自豪。

And I feel very proud of this term.

Speaker 1

不过确实，最初有些人持怀疑态度。

But yes, at the beginning, some people were not sure.

Speaker 0

那大概是什么年份，当AI还——

What year was that roughly when AI was the

Speaker 1

就在2016年。

only 2016.

Speaker 1

我觉得那是——

I Less think that was

Speaker 0

不到十年前。

than ten years ago.

Speaker 1

那是个转折点。

That was the changing.

Speaker 1

比如，有些人开始称它为AI。

Like, some people start calling it AI.

Speaker 1

但如果你观察硅谷科技公司的营销术语，我认为2017年左右是企业开始自称'AI公司'的起点。

But I think if you look at the Silicon Valley tech company companies, if you trace their marketing term, I think twenty seventeen ish was the beginning of companies calling themselves AI companies.

Speaker 0

这太不可思议了。

That's incredible.

Speaker 0

世界变化真大啊。

Just how the world has changed.

Speaker 0

是的。

Yes.

Speaker 0

现在你不得不自称是一家AI公司了。

Now you can't not call yourself an AI company.

Speaker 1

我知道。

I know.

Speaker 0

大约九年后。

Nine ish years later.

Speaker 0

是啊。

Yeah.

Speaker 0

哦，天哪。

Oh, man.

Speaker 0

好的。

Okay.

Speaker 0

在我们讨论你认为工作未来发展方向之前，关于那段早期历史，还有什么你认为人们不了解但很重要的事情吗？

Is there anything else around the history, that early history that you think people don't know that you think is important before we chat about where you think things are going in the work that you're doing?

Speaker 1

我认为就像所有历史一样，我很清楚自己因参与这段历史而受到认可，但还有太多英雄和研究者。

I think as all histories, you know, I'm keenly aware that I am recognized for being part of the history, but there are so many heroes and so many researchers.

Speaker 1

我们谈论的是几代研究者的努力。

We're talking about generations of researchers there.

Speaker 1

你知道，在我的世界里，有很多人启发了我，这些我在书里都提到过。

You know, in my own world, there are so many people who have inspired me, which I talked about in my book.

Speaker 1

但我确实觉得我们的文化，尤其是硅谷，倾向于将成就归功于个人，虽然我认为这也有其价值。

But I do feel our culture, especially Silicon Valley, tends to assign achievements to a single person while I think it has value.

Speaker 1

但要记住，AI这个领域至今已有70年历史，我们经历了多代人的努力。

But it's just to be remembered, AI is a field of, at this point, 70 years old, and we have gone through many generations.

Speaker 1

没有人能单凭一己之力走到今天这一步。

Nobody, no one could have gotten here by themselves.

Speaker 0

好的。

Okay.

Speaker 0

那么让我问你这个问题。

So let me ask you this question.

Speaker 0

感觉我们总是处于AGI（人工通用智能）的边缘，这个人们经常随意提及的模糊概念。

It feels like we're always on this precipice of AGI, this kind of vague term people throw around.

Speaker 0

AGI即将到来。

AGI is coming.

Speaker 0

它将接管一切。

It's gonna take over everything.

Speaker 0

你认为我们距离实现AGI还有多远？

What's your take on how far you think we might be from AGI?

Speaker 0

你觉得按照当前的发展轨迹，我们能够实现吗？

Do you think we're gonna get there on the current trajectory we're on?

Speaker 0

你认为我们需要更多突破吗？

Do think we need more breakthroughs?

Speaker 0

你认为当前的方法能让我们实现目标吗？

Do you think the current approach will get us there?

Speaker 1

是啊，这是个非常有趣的术语，Lenny。

Yeah, this is a very interesting term, Lenny.

Speaker 1

我不知道是否有人曾定义过AGI（人工通用智能）。

I don't know if anyone has ever defined AGI.

Speaker 1

要知道，存在许多不同的定义，从赋予机器某种超能力，到机器能否成为社会经济活动中具有经济可行性的主体。

You know, there are many different definitions, including some kind of superpower for machines all the way to can machines become economically viable agent in the society.

Speaker 1

换句话说，能靠工资养活自己。

In other words, making salaries to live.

Speaker 1

这就是AGI的定义吗？

Is that the definition of AGI?

Speaker 1

作为科学家，我对待科学非常严肃，进入这个领域是因为受到这个大胆问题的启发：机器能否像人类一样思考和行动？

As a scientist, I take science very seriously, and I enter the field because I was inspired by this audacious question of can machines think and do things in the way that human humans can do?

Speaker 1

对我而言，这一直是人工智能的北极星。

For me, that's always the north star of AI.

Speaker 1

从这个角度看，我不知道AI和AGI有什么区别。

And from that point of view, I don't know what's the difference between AI and AGI.

Speaker 1

我认为我们在实现部分目标方面做得很好，包括对话式AI。

I think we've done very well in achieving parts of the goal, including conversational AI.

Speaker 1

但我不认为我们已经完全征服了AI的所有目标。

But I don't think we have completely conquered all the goals of AI.

Speaker 1

我在想，如果我们的奠基人艾伦·图灵今天还在世，你让他对比AI和AGI，他可能只会耸耸肩说：我在1940年代就问过同样的问题。

And I think our founding fathers, Alan Turing I wonder if Alan Turing is around today and you ask him to contrast AI versus AGI, he might just shrug and said, well, I asked the same question back in 1940s.

Speaker 1

所以我不想陷入定义AI与AGI的无底洞讨论。

So I don't want to get onto a rabbit hole of defining AI versus AGI.

Speaker 1

我觉得AGI更像营销术语而非科学术语。

I feel AGI is more a marketing term than a scientific term.

Speaker 1

作为科学家和技术专家，AI是我的北极星，是我们领域的指路明灯，至于人们怎么称呼它我都欣然接受。

As a scientist and technologist, AI is my North Star, is my field's North Star, and I'm happy people call it whatever name they want to call it.

Speaker 0

那让我换个方式问您。

So let me ask you maybe this way.

Speaker 0

就像您描述的，从ImageNet和AlexNet开始，这些组件引领我们走到今天。

Like you described, there's kind of these components that from ImageNet and AlexNet kind of took us to where we're today.

Speaker 0

本质上就是GPU、数据、标注数据，以及模型算法。

GPUs essentially, data, label data, just like the algorithm of the model.

Speaker 0

而Transformer模型似乎也是这条发展轨迹上的重要里程碑。

There's also just the transformer feels like an important step in that trajectory.

Speaker 0

您认为这些相同要素能让我们达到...比如智能水平提升十倍的模型吗？那种能改变全人类生活的突破？

Do you feel like those are the same components that'll get us to, I don't know, 10 times smarter model, something that's like life changing for the entire world?

Speaker 0

还是说我们需要更多技术突破？

Or do you think we need more breakthroughs?

Speaker 0

我知道我们会讨论世界模型——我认为这也是组成部分之一。

I know we're gonna talk about world models, I think is component of this.

Speaker 0

但您是否认为某些方面会遭遇瓶颈？或者说我们只需要更多数据、算力和GPU就能继续前进？

But is there anything else that you think is like, oh, this will plateau or, okay, this will take us just need more data, more compute, more GPUs.

Speaker 1

噢，不是的。

Oh, no.

Speaker 1

我坚信我们需要更多的创新。

I definitely think we need more innovations.

Speaker 1

我认为，尽管在更多数据、更多GPU和更大模型架构的扩展法则方面仍有大量工作要做，但我绝对认为我们需要更多创新。

I I think scaling laws of more data, more GPUs, and bigger current model architecture is there's still a lot to be done there, but I absolutely think we need to innovate more.

Speaker 1

人类历史上没有任何一门科学学科曾达到过'我们已经完成创新'的阶段。

There's not a single deeply scientific discipline in human history that has arrived at a place that says, we're done, we're done innovating.

Speaker 1

而人工智能，即便不是最年轻的学科，也是人类科技文明中最年轻的领域之一，我们仍处于探索的初级阶段。

And AI is one of the, if not the youngest discipline in human civilization in terms of science and technology, we're still scratching the surface.

Speaker 1

比如我刚才说的，我们将过渡到世界模型这个话题。

For example, like I said, we're going to segue into world models.

Speaker 1

如今，你给模型输入一段办公室场景的视频，要求它数出椅子的数量。

Today, you take a model and and and run it through a a video of a couple of office rooms and ask the the model to count the number of chairs.

Speaker 1

这是连学龄前儿童或小学生都能完成的任务。

And this is something a toddler could do, or maybe maybe a a elementary school kid could do.

Speaker 1

但AI却做不到。

And AI could not do that.

Speaker 1

对吧？

Right?

Speaker 1

所以当今AI还有太多无法完成的事情。

So there's just so much AI today could not do.

Speaker 1

更不用说像牛顿那样通过观察天体运动就能推导出支配所有物体运动的方程式了。

Then let alone thinking about how did, you know, someone like Isaac Newton look at the movements of the celestial bodies and derive an equation or a set of equations that governs the movement of all bodies.

Speaker 1

那种层次的创造力、推演能力和抽象思维，以目前技术还无法让AI实现。

That level of creativity, extrapolation, abstraction, we have no way of enabling AI to do that today.

Speaker 1

接下来我们看看情商。

And then let's look at emotional intelligence.

Speaker 1

想象一个学生走进教师办公室，讨论关于学习动力、热情、该学什么、以及真正困扰他的问题。这样的对话，尽管当今的对话机器人已经很强大，但现有AI仍无法达到这种情感认知智能的水平。

If you look at a student coming to a teacher's office and have a conversation about motivation, passion, what to learn, what's the problem that's really bothering you, That conversation, as powerful as today's conversational bots are, you don't get that level of emotional cognitive intelligence from today's AI.

Speaker 1

所以我们还有很多可以改进的地方。

So there's a lot we can do better.

Speaker 1

而且我认为创新远未结束。

And I do not believe we're done innovating.

Speaker 0

Demis最近在DeepMindGoogle接受了一个非常有趣的采访，有人问他：你怎么看？

Demis had this really interesting interview recently from DeepMindGoogle, where someone asked him just like, what do you think?

Speaker 0

我们距离通用人工智能还有多远？

How far are we from AGI?

Speaker 0

当它实现时会是什么样子？

What does it look like when it's through there?

Speaker 0

他提出了一个非常有趣的思考方式：如果我们给最前沿的模型提供直到二十世纪末的所有信息，看它是否能提出爱因斯坦所有的突破性理论。而目前，我们离这个目标还差得很远。

He had a really interesting way of approaching it is, if we were to give the most cutting edge model all the information until the end of the twentieth century, see if it could come up with all the breakthroughs Einstein had, And so far, we're nowhere near that that they

展开剩余字幕（还有 480 条）

Speaker 1

不能。

can No.

Speaker 1

确实没有。

We're not.

Speaker 1

事实上情况更糟。

Fact, it's even worse.

Speaker 1

让我们给AI所有数据，包括牛顿时代没有的现代天体仪器数据，然后要求AI创建十七世纪第六套关于物体运动定律的方程组。

Let's give AI all the data, including modern instruments data of celestial bodies, which Newton did not have, and give it to them and just ask AI to create the sixth seventeenth century set of equations on the laws of bodily movements.

Speaker 1

现在的AI还做不到这一点。

Today's AI cannot do that.

Speaker 0

好吧。

All right.

Speaker 0

我们还有很长的路要走，这就是我的

We're ways away is what I'm

Speaker 1

看法。

hearing.

Speaker 0

好的。

Okay.

Speaker 0

那我们谈谈世界模型吧。

So let's talk about world models.

Speaker 0

在我看来，这又是一个你领先于大众的绝佳例证。

This is to me, this is just another really amazing example of you being ahead of where people end up.

Speaker 0

你早就提出过，我们只需要大量干净的数据供AI和神经网络学习。

So you were way ahead on, okay, we just need a lot of clean data for AI and neural networks to learn.

Speaker 0

你谈论世界模型这个概念已经很久了。

You've been talking about this idea of world models for a long time.

Speaker 0

你还为此创办了一家公司。

You started a company to build.

Speaker 0

本质上，现在有的是语言模型。

Essentially, there's language models.

Speaker 0

这是完全不同的东西。

This is a different thing.

Speaker 0

这是一个世界模型。

This is a world model.

Speaker 0

我们会讨论它是什么。

We'll talk about what that is.

Speaker 0

就在我准备这个的时候，埃隆也在谈论世界模型。

And now, as I was preparing for this, Elon's like talking about world models.

Speaker 0

黄仁勋也在谈论世界模型。

Jensen's talking about world models.

Speaker 0

我知道谷歌正在研究这些东西。

I know Google's working on this stuff.

Speaker 0

你们已经研究这个很久了，而且实际上刚刚发布了一些内容，我们正好要在这期播客播出前讨论。

You've been at this for a long time, and you actually just launched something that's going to we're going to talk about right before this podcast airs.

Speaker 0

谈谈什么是世界模型？

Talk about what is a world model?

Speaker 0

为什么它如此重要？

Why is it so important?

Speaker 1

看到越来越多人像埃隆、黄仁勋那样讨论世界模型，我非常兴奋。

I'm very excited to see that more and more people are talking about world models like Elon, like Jensen.

Speaker 1

我一生都在思考如何真正推动人工智能发展。

I have been thinking about really how to push AI forward all my life.

Speaker 1

对吧？

Right?

Speaker 1

过去几年从研究界诞生的大型语言模型，以及OpenAI等等，都极具启发性，即使对我这样的研究者来说也是如此。

And the large language models that came out of the research world and then OpenAI and and all this for the past few years were extremely inspiring, even for researcher like me.

Speaker 1

我记得GPT-2发布的时候，大概是在2020年底。

I remembered when GPT two came out, and that was in, I think, late two thousand twenty.

Speaker 1

我当时是联合主任。

I was co director.

Speaker 1

现在依然是，但那时我是斯坦福以人为本人工智能研究所的全职联合主任。

I still am, but I was at that time full time co director of Stanford's Human Centered AI Institute.

Speaker 1

我记得当时公众还不了解大语言模型的能力，但作为研究者，我们已经看到了。

And I I remember it was you know, the public was not aware of the power of the large language model yet, but as researchers, we were seeing it.

Speaker 1

我们看到了未来。

We're seeing the future.

Speaker 1

我和自然语言处理领域的同事如Percy Liang、Chris Batting进行了相当长时间的讨论。

And I had pretty long conversations with my natural language processing colleagues like Percy Liang and Chris Batting.

Speaker 1

我们讨论这项技术将变得多么关键。

We were talking about how critical this technology is going to be.

Speaker 1

斯坦福人工智能研究所（以人为本人工智能研究所HAI）是第一个建立基础模型研究中心的机构。

And the Stanford AI Institute, Human Centered AI Institute, HAI, was the first one to establish a full research center foundation model.

Speaker 1

Percy Liang和许多研究人员领导发表了第一篇关于基础模型的学术论文。

We were Percy Liang and many researchers led the first academic paper foundation model.

Speaker 1

这让我深受启发。

So it was just very inspiring for me.

Speaker 1

当然，我来自视觉智能领域，当时就在思考我们可以在语言之外推进多少事情，因为人类运用空间智能和对世界的理解完成了许多超越语言范畴的事情。

So, of course, I come from the world of visual intelligence, and I was just thinking there's so much we can push forward beyond language because humans humans have used our sense of spatial intelligence, a world understanding to do so many things, and they are beyond language.

Speaker 1

想象一个非常混乱的急救现场，无论是火灾、交通事故还是自然灾害。

Think about a very chaotic first responder scene, whether it's fire or some traffic accident or some natural disaster.

Speaker 1

当你沉浸于场景中，思考人们如何组织救援、阻止灾难蔓延、扑灭火灾时，会发现大部分行动都源于对物体、环境和态势的本能理解与空间感知。

And if you immerse yourself in the scene and think about how people organize themselves to rescue people, to stop further disasters, to put down fires, to a lot of that is movements, spontaneous understanding of objects, worlds, situational awareness.

Speaker 1

语言虽是其中一环，但在许多紧急情况下，语言无法帮你扑灭火焰。

Language is part of that, but a lot of those situations, language cannot get you to put on the fire.

Speaker 1

那么这究竟意味着什么？

So that is what is that?

Speaker 1

我思考了很多。

I was thinking a lot.

Speaker 1

与此同时，我进行了大量机器人研究，并逐渐意识到：连接语言之外附加智能的关键，在于将具身AI（即机器人）、视觉智能与理解世界的空间智能相结合。

And in the meantime, was doing a lot of robotics research and it dawned on me that the linchpin of connecting the additional intelligence in addition to language and connecting embodied AI, which are robotics, connecting visual intelligence is the sense of spatial intelligence about understanding the world.

Speaker 1

就在那时——我记得是2024年——我在TED演讲中探讨了空间智能与世界模型。

And that's when, I think I, it was 2024, I gave a TED talk about spatial intelligence and world models.

Speaker 1

这个构想最早可追溯至2022年，基于我的机器人与计算机视觉研究逐渐成形。

And I start formulating this idea back in 2022 based on my robotics and computer vision research.

Speaker 1

我明确意识到：必须集结最杰出的技术专家，以最快速度将这项技术变为现实。

And then one thing that was really clear to me is that I really want to work with the brightest technologists and move as fast as possible to bring this technology to life.

Speaker 1

于是我们创立了名为World Labs的公司。

And that's when we founded this company called World Labs.

Speaker 1

公司名称中的'World'一词正体现了我们对世界建模与空间智能的坚定信念。

And you can see the word world is in the title of our company because we believe so much in world modeling and spatial intelligence.

Speaker 0

人们已习惯将聊天机器人简单等同于大语言模型。

People are so used to just chatbots, and that's a large language model.

Speaker 0

理解世界模型的简易方式是：你描述一个场景，它就能生成无限可探索的虚拟世界。

A simple way to understand a world model is you basically describe a scene and it generates an infinitely explorable world.

Speaker 0

我们会链接到你发布的内容，稍后会讨论，但这是否是一种简单的理解方式？

We'll link to the thing you launched, which we'll talk about, but just is that a simple way to understand it?

Speaker 1

这是其中一部分，Lenny。

That's part of it, Lenny.

Speaker 1

我认为理解世界模型的一种简单方式是：这个模型能让任何人通过提示（无论是图像还是句子）在脑海中创造任何世界，并能在这个世界中互动——无论是浏览行走、拾取物品还是改变事物，同时还能在这个世界中进行推理。

I think a simple way to understand a world model is that this model can allow anyone to create any worlds in their mind's eye by prompting, whether it's an image or a sentence, and also be able to interact in this world, whether you're browsing and walking or picking objects up or changing things, as well as to reason within this world.

Speaker 1

例如，如果使用这个世界模型输出的代理是个机器人，它应该能够规划路径并帮助整理厨房等任务。

For example, if if the person consuming if the agent consuming this output of the world model is a robot, It should be able to plan its path and help to tidy the kitchen, for example.

Speaker 1

因此世界模型是一个基础，你可以用它来推理、互动和创造世界。

So world model is a foundation that that you can use to reason, to interact, and to create worlds.

Speaker 0

很棒。

Great.

Speaker 0

是的。

Yeah.

Speaker 0

所以机器人领域感觉可能是AI研究者的下一个重点方向，就像对世界的影响一样。

So robots feels like that's potentially the next big focus for AI researchers and just like the impact on the world.

Speaker 0

你这里说的是，这是让机器人真正在现实世界中工作的关键缺失环节——理解世界如何运作。

What you're saying here is, this is a key missing piece of making robots actually work in the real world, understanding how the world works.

Speaker 1

是的。

Yeah.

Speaker 1

首先，我认为这不仅仅局限于机器人领域。

Well, first of all, I do think there's more than robots.

Speaker 1

这很令人兴奋。

That's exciting.

Speaker 1

不过，我完全同意你刚才说的所有观点。

So but I agree with everything you just said.

Speaker 1

我认为世界建模和空间智能是具身人工智能缺失的关键部分。

I think world modeling and spatial intelligence is a key missing piece of embodied AI.

Speaker 1

我还认为，我们不应低估人类作为具身主体的能力，以及人类如何能被人工智能的智能所增强。

I also think let's not underestimate that humans are embodied agents and humans can be augmented by AI's intelligence.

Speaker 1

就像今天，人类虽然是语言动物，但在执行语言任务（包括软件工程）时，我们很大程度上被人工智能增强了能力。

Just like today, humans are language animals, but we're very much augmented by AI when helping us to, you know, do language tasks, including software engineering.

Speaker 1

我认为我们不应低估——或者说我们往往忽略了——人类作为具身主体实际上能从世界模型和空间智能模型中获益良多，机器人同样可以。

I think that we shouldn't underestimate, or maybe we tend not to talk about how humans as an embodied agent can actually benefit so much from world models and spatial intelligent models, as well as robots can.

Speaker 0

所以这里的关键突破在于机器人技术，这可是大事。

So the big unlocks here, robots, which a huge deal.

Speaker 0

如果这能实现，想象一下我们每个人都能拥有机器人帮我们处理各种事务，比如在灾难中提供援助等等。

If this works out, imagine each of Us has robots doing a bunch of stuff for us, goes into, you know, they help us with disasters, things like that.

Speaker 0

游戏显然是个很酷的例子，就像那些能无限畅玩、完全由你想象创造的游戏。

Games, obviously, is a really cool example, just like infinitely playable games that you just invent out of your head.

Speaker 0

还有创意领域，感觉就像纯粹为了乐趣，发挥创意，构想各种奇妙的新世界和环境。

And then creativity feels like just like being fun, having fun, being creative, thinking of wild new worlds and environments.

Speaker 1

当然还有设计领域。

And also design.

Speaker 1

人类设计从机器到建筑到住宅，还包括科学发现，对吧？

Humans design from machines to buildings to homes and also scientific discovery, right?

Speaker 1

可能性实在太多了。

There is so much.

Speaker 1

我喜欢用DNA结构发现的例子来说明。

I like to use the example of the discovery of the structure of DNA.

Speaker 1

如果你观察DNA发现史上最重要的部分之一，就是罗莎琳德·富兰克林拍摄的X射线衍射照片。

If you look at one of the most important piece in DNA's discovery history is the X-ray diffraction photo that was captured by Rosalind Franklin.

Speaker 1

那是一张二维平面照片，结构看起来像一个带有衍射图案的十字架。

And it was a flat two d photo of a structure that looks like it looks like a cross with diffractions.

Speaker 1

你可以谷歌搜索这些照片。

You can Google those photos.

Speaker 1

但凭借那张二维平面照片，人类——特别是两位重要人物詹姆斯·沃森和弗朗西斯·克里克，结合其他信息，能够在三维空间中进行推理，推导出DNA高度立体的双螺旋结构。

But with that two d flat photo, humans, especially two important humans, James Watson and Francis Crick, in addition to their other information, was able to reason in three d space and deduce a highly three-dimensional double helix structure of the DNA.

Speaker 1

而这种结构绝不可能是二维的。

And that structure cannot possibly be two d.

Speaker 1

你无法通过二维思维推导出这种结构。

You cannot think in two d and deduce that structure.

Speaker 1

你必须运用三维空间思维，利用人类的空间智能。

You have to think in three d spatial, use the human spatial intelligence.

Speaker 1

因此我认为即使在科学发现中，空间智能或AI辅助的空间智能也至关重要。

So I think even in scientific discovery, spatial intelligence or AI assisted spatial intelligence is critical.

Speaker 0

这让我想起克里斯·迪克森说过的话：下一件大事最初总会让人觉得像个玩具。

This is such an example of, I think it was Chris Dixon that had this line that the next big thing is gonna start off feeling like a toy.

Speaker 0

当ChatGPT刚推出时，我记得萨尔·莫明发推说：'我们在玩一个很酷的东西，来看看'。

When ChatGPT just came out of like, I remember Sal Momin just tweeted, it's like, here's a cool thing we're playing with, check it out.

Speaker 0

现在它已成为史上增长最快的产品，改变了世界。

Now it's the fastest growing product in all of history, changed the world.

Speaker 0

是啊。

Yeah.

Speaker 0

往往正是那些看起来‘嗯，这个挺酷’、玩起来有趣的东西，最终最能改变世界。

And it's oftentimes the things that just look like, okay, this is cool, that it's fun to play with and end up changing the world most.

Speaker 0

没错。

Yeah.

Speaker 0

本节目由客户通讯云平台Cinch赞助播出。

This episode is brought to you by Cinch, the customer communications cloud.

Speaker 0

关于数字化客户通讯，有件事你需要知道。

Here's the thing about digital customer communications.

Speaker 0

无论是发送营销活动、验证码还是账户提醒，你都需要确保信息能可靠触达用户。

Whether you're sending marketing campaigns, verification codes, or account alerts, you need them to reach users reliably.

Speaker 0

这就是Cinch的用武之地。

That's where Cinch comes in.

Speaker 0

全球超过15万家企业（包括十大科技巨头中的八家）使用Cinch的API，将消息、邮件和通话功能集成到产品中。

Over 150,000 businesses, including eight of the top 10 largest tech companies globally, use Cinch's API to build messaging, email, and calling into their products.

Speaker 0

产品团队需要关注通讯领域的重要变革：富媒体通讯服务（RCS）。

And there's something big happening in messaging that product teams need to know about: rich communication services, or RCS.

Speaker 0

你可以把RCS理解为短信2.0版本。

Think of RCS as SMS two point zero.

Speaker 0

用户不再收到来自陌生号码的短信，而是直接看到经过认证的企业名称和标识——无需下载任何新应用。

Instead of getting texts from a random number, your users will see your verified company name and logo without needing to download anything new.

Speaker 0

这种体验更安全且品牌感更强，还支持交互式轮播图和智能回复建议等功能。

It's a more secure and branded experience, plus you get features like interactive carousels and suggested replies.

Speaker 0

这就是为什么这很重要。

And here's why this matters.

Speaker 0

美国运营商开始采用RCS。

US carriers are starting to adopt RCS.

Speaker 0

Cynch已在帮助各大品牌全球发送RCS消息，并让Lenny的播客听众在美国市场热潮前优先注册。

Cynch is already helping major brands send RCS messages around the world, and they're helping Lenny's podcast listeners get registered first before the rush hits The US market.

Speaker 0

了解更多并开始使用请访问cinch.com/lenny。

Learn more and get started at cinch.com/lenny.

Speaker 0

网址是sinch.com/lenny。

That's sinch.com/lenny.

Speaker 0

我联系了本·霍洛维茨，他非常欣赏你的事业，是你的忠实粉丝。

I reached out to Ben Horowitz who loves what you're doing, a big fan of yours.

Speaker 0

他们是投资人，我很信任他们。

They're investors, I believe in.

Speaker 1

是的。

Yeah.

Speaker 1

我们认识很多年了。

We we've known each other for for many years.

Speaker 1

不过现在他们确实是Warlabs的投资人。

But, yes, right now, they are investors of Warlabs.

Speaker 0

太棒了。

Amazing.

Speaker 0

好的。

Okay.

Speaker 0

于是我问他应该问你些什么，他建议我问你，为什么苦涩的教训单靠它自己不太可能对机器人奏效？

So I asked him what I should ask you about, and he suggested ask you, why is the bitter why is the bitter lesson alone not likely to work for robots?

Speaker 0

首先，请解释一下AI历史上的苦涩教训是什么，然后说明为什么这无法让我们在机器人领域达到预期目标。

So first of all, just explain what the bitter lesson was in the history of AI, and then just why that won't get us to where we want to be with robots.

Speaker 1

首先，其实存在很多苦涩的教训。

So, well, first of all, there are many bitter lessons.

Speaker 1

但大家常提到的苦涩教训，是最近获得图灵奖的理查德·萨顿写的一篇论文。

But the bitter lessons everybody refers to is a paper written by Richard Sutton, who won the Turing Award recently.

Speaker 1

他主要从事强化学习研究。

And he does a lot of reinforcement learning.

Speaker 1

理查德说过，纵观历史，特别是AI算法发展历程，最终胜出的总是拥有海量数据的简单模型，而非数据量较少的复杂模型。

And Richard has said, right, if you look at the history, especially the algorithmic development of AI, it turns out simpler model with a ton of data always win at the end of the day instead of the more complex model with less data.

Speaker 1

其实这篇论文是在ImageNet问世多年后才发表的。

I mean, that was actually This paper came years after ImageNet.

Speaker 1

对我来说那并不苦涩。

That to me was not bitter.

Speaker 1

那是个甜蜜的教训。

It was a sweet lesson.

Speaker 1

正因如此我才创建了ImageNet，因为我坚信大数据能发挥这种作用。

That's why I built ImageNet, because I believe that big data plays that role.

Speaker 1

那么为什么苦涩的教训无法单独在机器人领域见效？

So why can a bitter lesson work in robotics alone?

Speaker 1

首先我认为需要肯定我们当前的成就。

Well, first of all, I think we need to give credit to where we are today.

Speaker 1

机器人技术仍处于实验的早期阶段。

Robotics is very much in the early days of experimentation.

Speaker 1

这项研究远不如语言模型那样成熟。

The research is not nearly as mature as, say, language models.

Speaker 1

许多人仍在尝试不同的算法，其中部分算法由大数据驱动。

So many people are still experimenting with different algorithms, and some of those algorithms are driven by big data.

Speaker 1

因此我认为大数据将继续在机器人技术中发挥作用。

So I do think big data will continue to play a role in robotics.

Speaker 1

但机器人技术的难点是什么？

But what is hard for robotics?

Speaker 1

主要有几个方面。

There are a couple of things.

Speaker 1

一是数据获取更为困难。

One is that it's harder to get data.

Speaker 1

数据获取要困难得多。

It's a lot harder to get data.

Speaker 1

你可能会说，网上有现成的数据。

You can say, well, there's web data.

Speaker 1

这正是当前机器人研究使用网络视频的原因。

This is where the latest robotics research is using web videos.

Speaker 1

我认为网络视频确实发挥了作用。

And I think web videos do play a role.

Speaker 1

但作为从事计算机视觉、空间智能和机器人技术的研究者，当我思考语言模型成功的原因时，实在很羡慕语言领域的同行——他们拥有完美的数据闭环：训练数据是文字（最终转化为词元），模型输出的也是文字。

But if you think about what made language model worth a very as someone who does computer vision and spatial intelligence and robotics, I'm very jealous of my colleagues language because they had this perfect setup where their training data are in words, eventually tokens, and then they produce a model that outputs words.

Speaker 1

因此，你期望获得的结果（我们称之为目标函数）与训练数据之间存在着完美的匹配关系。

So you have this perfect alignment between what you hope to get, which we call objective function, and what your training data looks like.

Speaker 1

但机器人技术则截然不同。

But robotics is different.

Speaker 1

甚至空间智能也大相径庭。

Even spatial intelligence is different.

Speaker 1

你希望机器人输出动作，但训练数据中缺乏三维世界里的动作样本。

You hope to get actions out of robots, but your training data lacks actions in three d worlds.

Speaker 1

而这正是机器人必须实现的功能，对吧？

And that's what robots have to do, right?

Speaker 1

在三维世界里执行动作。

Actions in three d worlds.

Speaker 1

所以你必须另辟蹊径，用他们常说的'方枘圆凿'的方式解决问题。

So you have to find different ways to fit a what do they call a a a a square in a round hole.

Speaker 1

我们拥有的是海量的网络视频资源。

What we have is tons of web videos.

Speaker 1

因此我们开始讨论补充数据，比如遥操作数据或合成数据，让机器人通过大量数据来验证'苦涩教训'假说。

So then we have to start talking about adding, supplementing data such as teleoperation data or synthetic data so that the robots are trained with this hypothesis of bitter lesson, which is large amount of data.

Speaker 1

我认为仍有希望，因为我们在世界建模方面的进展终将为机器人解锁这些信息。

I think there is still hope because even what we are doing in world modeling will really unlock a lot of this information for robots.

Speaker 1

但我们必须保持谨慎，毕竟这项研究刚起步，'苦涩教训'尚未完全验证，因为我们还未彻底解决数据问题。

But I think we have to be careful because we're at the early days of this, and bitter lesson is still to be tested because we haven't fully figured out the data for.

Speaker 1

关于机器人学'苦涩教训'的另一个现实是——相较于语言模型甚至空间模型，机器人本质上是物理系统。

Another part of the bitter lesson of robotics I think we should be so realistic about is, again, compared to language models or even spatial models, robots are physical systems.

Speaker 1

因此，机器人更接近自动驾驶汽车，而非大型语言模型。

So robots are closer to self driving cars than a large language model.

Speaker 1

认识到这一点非常重要。

And that's very important to recognize.

Speaker 1

这意味着要让机器人工作，我们不仅需要大脑，还需要物理躯体。

That means that in order for robots to work, we not only need brains, we also need the physical body.

Speaker 1

我们还需要应用场景。

We also need application scenarios.

Speaker 1

如果你回顾自动驾驶汽车的历史，我的同事塞巴斯蒂安·特龙在2005或2006年用斯坦福的汽车赢得了第一届DARPA挑战赛。

If you look at the history of self driving car, my colleague, Sebastian Thrum, took Stanford's car to win the first DARPA challenge in 2006 or 2005.

Speaker 1

从那个自动驾驶汽车原型至今已有二十年。

It's twenty years since that prototype of a self driving car.

Speaker 1

从能在内华达沙漠行驶130英里，到今天Waymo出现在旧金山街头。

Being able to drive 130 miles in the Nevada Desert to today's Waymo and on the street of San Francisco.

Speaker 1

而我们甚至还没有完成。

And we're not even done yet.

Speaker 1

仍有很长的路要走。

There's still a lot.

Speaker 1

所以这是一段二十年的旅程。

So that's a twenty year journey.

Speaker 1

而自动驾驶汽车是简单得多的机器人。

And self driving cars are much simpler robots.

Speaker 1

它们只是金属盒子在二维平面上移动，目标是不触碰任何东西。

They're just metal boxes running on two d surfaces, And the goal is not to touch anything.

Speaker 1

机器人是在三维世界中运行的三维物体，其目标是接触物体。

Robot is three d things running in three d world, and the goal is to touch things.

Speaker 1

因此这段旅程将会，你知道的，包含许多方面和元素。

So the journey is gonna be, you know, there's many aspects, elements.

Speaker 1

当然可以说，自动驾驶汽车的早期算法属于深度学习前时代。

And of course, one could say, well, the self driving car early algorithm were pre deep learning era.

Speaker 1

所以深度学习正在加速大脑的进化。

So deep learning is accelerating the brains.

Speaker 1

我认为确实如此。

And I think that's true.

Speaker 1

这就是我投身机器人领域的原因。

That's why I'm in robotics.

Speaker 1

这就是我专注空间智能并为此感到兴奋的原因。

That's why I'm in spatial intelligence and I'm excited by it.

Speaker 1

但与此同时，汽车工业非常成熟，产品化还涉及成熟用例、供应链和硬件。

But in the meantime, the car industry is very mature and productizing also involves the mature use cases, supply chains, the hardware.

Speaker 1

所以我认为现在研究这些问题非常有趣。

So I think it's a very interesting time to work in these problems.

Speaker 1

但本说得没错。

But it's true Ben is right.

Speaker 1

我们可能仍需经历一些惨痛教训。

We might still be subject to a number of bitter lessons.

Speaker 0

从事这项工作时，你是否会对大脑的运作方式及其为我们完成这一切的能力感到敬畏？

Doing this work, do you ever just feel awe for the way the brain works and is able to do all of this for us?

Speaker 0

仅仅为了让机器能行走而不撞到东西或摔倒的这种复杂性，是否让你对我们已有的能力更加敬畏？

Just the complexity, to get a machine to just walk around and not hit things and fall, does just give you more respect for what we've already got?

Speaker 1

完全同意。

Totally.

Speaker 1

我们仅需约20瓦的功率运行。

We operate on about 20 watts.

Speaker 1

这比我此刻房间里任何灯泡的亮度都要低。

That's dimmer than any light bulb in the room I'm in right now.

Speaker 1

然而我们能做这么多事情。

And yet we can do so much.

Speaker 1

所以实际上，我越从事AI工作，就越对人类产生敬意。

So I think actually the more I work in AI, the more I respect humans.

Speaker 0

让我们聊聊你们刚推出的产品Marble，这个名字很可爱。

Let's talk about this product you just launched called Marble, a very cute name.

Speaker 0

关于它是什么，为什么它很重要。

About what this is, why this is important.

Speaker 0

我一直在试用它。

I've been playing with it.

Speaker 0

它太不可思议了。

It's incredible.

Speaker 0

我们会附上链接方便大家查看。

We'll link to it and for folks to check it out.

Speaker 0

Marble是什么？

What is Marble?

Speaker 1

是的。

Yeah.

Speaker 1

我非常兴奋。

I'm very excited.

Speaker 1

首先，Marble是Warlabs推出的首批产品之一。

So first of all, Marble is one of the first product that Warlabs has rolled out.

Speaker 1

World Labs是一家前沿基础模型公司。

World Labs is a foundation frontier model company.

Speaker 1

我们由四位技术背景深厚的联合创始人资助。

We are funded by four co founders who have deep technical history.

Speaker 1

我的联合创始人Justin Johnson、Christoph Lassner和Ben Meldenhall都来自AI、计算机图形学和计算机视觉的研究领域。

My co founders, Justin Johnson, Christoph Lassner, and Ben Meldenhall, we all come from the research field of AI, computer graphics, computer vision.

Speaker 1

我们相信空间智能和世界建模至少与语言模型同等重要，并且能与语言模型形成互补。

And we believe that spatial intelligence and world modeling is as important, if not more, to language models and complementary to language models.

Speaker 1

因此我们希望抓住这个机会，建立一个能将前沿模型与产品连接起来的深度技术研究实验室。

So we wanted to seize this opportunity to create deep tech research lab that can connect the dots between frontier models with products.

Speaker 1

Marble是一款基于我们前沿模型构建的应用。

So Marble is an app that's built upon our frontier models.

Speaker 1

我们花了一年多时间构建世界上首个能输出真正三维世界的生成模型。

We've spent a year and plus building the world's first generative model that can output genuinely three d worlds.

Speaker 1

这是个非常非常困难的问题。

That's a very, very hard problem.

Speaker 1

而且这个过程非常艰难。

And it was a very hard process.

Speaker 1

拥有一支非凡的团队，创始团队由来自顶尖团队的技术专家组成。

Have a team of incredible, founding team of incredible technologists from, you know, incredible teams.

Speaker 1

就在一两个月前，我们首次见证仅需输入一句话和图像（或多张图像）就能创建出可自由探索的世界。

And then around just a month or two ago, we saw the first time that we we can just prompt with a sentence and the image and multiple images and create worlds that we can just navigate in.

Speaker 1

如果使用我们的谷歌眼镜选项（我们提供该功能），你甚至能四处走动。

If you put it on goggle, which we have an option to let you do that, you can even walk around.

Speaker 1

对吧？

Right?

Speaker 1

尽管我们已开发很长时间，这依然令人震撼不已。

So it was even though we've been building this for quite a while, it was still just awe inspiring.

Speaker 1

我们希望能让需要的人用上它。

And we wanted to get into the hands of people who need it.

Speaker 1

我们知道众多创作者、设计师、考虑机器人模拟的人、探索可导航交互沉浸世界应用场景的人，以及游戏开发者都会发现它的价值。

And then we know that so many creators, designers, people who are thinking about robotic simulation, people who are thinking about different use cases of navigable, interactable, immersive worlds, game developers will find this useful.

Speaker 1

因此我们开发了Marble作为第一步。

So we develop developed Marble as a first step.

Speaker 1

虽然仍处于早期阶段，但这是全球首个实现该功能的模型，也是首个支持纯提示生成的产品。

It's it's, again, still very early, but it's the world's first model doing this, and it's the world's first product that allows people to just prompt.

Speaker 1

我们称之为‘提示即世界’。

We call it prompt to worlds.

Speaker 0

我一直在试用。

Well, I've been playing around.

Speaker 0

这太疯狂了。

It is insane.

Speaker 0

就像，你可以拥有一个更羞涩的世界，在那里你基本上可以无限漫步在中土世界，虽然那里还没有人，但这太疯狂了。

Like, you could just have a little shyer world where you just infinitely walk around Middle Earth, basically, and there's no there's no one there yet, but it's insane.

Speaker 0

你可以去任何地方。

You just go anywhere.

Speaker 0

这就像是个反乌托邦世界。

There's like dystopian world.

Speaker 0

我只是在浏览所有这些例子。

I'm just looking at all these examples.

Speaker 1

是的。

Yes.

Speaker 0

其实我最喜欢的部分是，我不知道。

And my favorite part actually, I don't know.

Speaker 0

我不确定是否存在功能缺陷。

I don't know if there's a feature bug.

Speaker 0

你能在世界完全渲染出所有纹理前看到那些小点，我特别喜欢窥探这个模型的运作机制。

You can see like the dots of the world before it actually renders with all the textures and I just love to get a glimpse into what is going on with this model.

Speaker 0

基本上，

Basically,

Speaker 1

听起来真是太酷了。

That I is like so cool to hear.

Speaker 1

因为作为研究者，这正是我学习的地方——引导你进入世界的那些点其实是特意设计的可视化功能。

Because this is where, as a researcher, I'm learning because the dots that lead you into the world was an intentional feature visualization.

Speaker 1

它并不属于模型本身。

It is not part of the model.

Speaker 1

这个模型实际上只是生成了世界。

The model actually just generates the world.

Speaker 1

但我们试图找到一种方法引导人们进入这个世界。

But we were trying to find a way to guide people into the world.

Speaker 1

许多工程师开发了不同版本，但我们最终聚焦于那个点。

And a number of engineers worked on different versions, but we converged on the dot.

Speaker 1

很多人——你不是唯一一个——告诉我们这种体验多么令人愉悦。

And so many people, you're not the only one, told us how delightful that experience is.

Speaker 1

听到这个刻意设计的可视化功能（不仅仅是硬核大模型）确实取悦了用户，我们感到非常满足。

And it was really satisfying for us to hear that this intentional visualization feature that's not just the big hardcore model actually has delighted our users.

Speaker 0

哇。

Wow.

Speaker 0

所以你们添加这个功能是为了让人类更容易理解发生了什么，获得更多乐趣。

So you add that to make it more to have humans understand what's going on more, getting more delightful.

Speaker 0

哇，这太有趣了。

Wow, that is hilarious.

Speaker 0

让我联想到语言模型——虽然不完全相同——但它们会谈论自己在想什么。是的。

Makes me think about LMs in the way they it's not the same thing, but they talk about what they're thinking and what they're Yes.

Speaker 1

确实如此。

It is.

Speaker 1

确实如此。

It is.

Speaker 0

这也让我联想到《黑客帝国》。

It also makes me think about just the matrix.

Speaker 0

这简直就是《黑客帝国》般的体验。

Like, it's exactly the matrix experience.

Speaker 0

我不确定那是否是你们的灵感来源。

I don't know if that was your inspiration.

Speaker 1

嗯，就像我说的，有很多工程师参与了这个项目。

Well, like I said, a number of engineers worked on that.

Speaker 1

那可能是他们的灵感来源。

It could be their inspiration.

Speaker 0

这已经刻在他们的潜意识里了。

It's in their subconscious.

Speaker 0

是啊。

Yeah.

Speaker 0

好的。

Okay.

Speaker 0

那么对于想要尝试使用这个技术的人，目前有哪些实际应用可以立即上手？

So just for folks that may want to play around with this, maybe use it, what's like, what are some applications today that folks can start using today?

Speaker 0

你们这次发布的主要目标是什么？

What's what's your goal with this launch?

Speaker 1

没错。

Yeah.

Speaker 1

我们确实认为世界建模技术具有广泛适用性，而且已经看到了一些非常激动人心的应用场景。

So we do believe that world modeling is very horizontal, but we're already seeing some really exciting use cases.

Speaker 1

比如电影虚拟制作，因为他们需要能与摄像机同步的三维世界。

Virtual production for movies, because what they need are three d worlds that they can align with the camera.

Speaker 1

所以当演员在上面表演时，他们可以调整摄像机位置，把各个片段拍得很好。

So when the actors are acting on it, they can, you know, they can position the camera and shoot the the segments really well.

Speaker 1

我们已经看到了惊人的应用效果。

And we're already seeing incredible use.

Speaker 1

其实我不知道你是否看过我们展示Marble的发布视频。

In fact, I don't know if you have seen our launch video showing Marble.

Speaker 1

那是由一家虚拟制作公司制作的。

It was produced by a virtual production company.

Speaker 1

我们与索尼合作，他们使用Marble场景来拍摄那些视频。

We we collaborated with Sony, and they use Marble scenes to shoot those videos.

Speaker 1

我们当时与那些技术美术师和导演合作，他们说这使制作时间缩短了40倍。

So our we were collaborating with those technical artists and directors, and they were saying this has cut our production time by 40x.

Speaker 1

实际上，必须

In fact, it has to

Speaker 0

是40倍？

be 40x?

Speaker 1

实际上必须如此，因为我们只有一个月时间做这个项目，而他们需要拍摄的内容太多了。

In fact, it has to because we only had one month to work on this project, and there were so many things they were trying to shoot.

Speaker 1

使用Marble确实极大地加速了VFX和电影的虚拟制作流程。

So using Marble really, really significantly accelerated the production of virtual virtual production for VFX and movies.

Speaker 1

这只是其中一个应用场景。

That's one use case.

Speaker 1

我们已经看到用户将我们的Marble场景导出网格后用于游戏开发，无论是VR游戏还是他们开发的休闲游戏。

We are already seeing our users putting taking our marble scene and taking the mesh export and putting games, you know, whether it's games on VR or games, just fun games that they have developed.

Speaker 1

我们刚才展示了一个机器人仿真实例，因为我现在仍是一名从事机器人训练的研究员。

We have had we were showing an example of robotic simulation because when I was, I mean, I still am a researcher doing robotic training.

Speaker 1

最大的痛点之一就是为训练机器人创建合成数据。

One of the biggest pain point is to create synthetic data for training robots.

Speaker 1

而这些合成数据需要具备高度多样性。

And this synthetic data needs to be very diverse.

Speaker 1

它们需要来自不同环境，包含可供操作的不同物体。

They need to come from different environments with different objects to manipulate.

Speaker 1

实现途径之一就是让计算机进行模拟。

And one path to it is to ask computers to simulate.

Speaker 1

否则人类就得——你知道的——为机器人逐个构建所有素材。

Otherwise, humans have to, you know, build every single asset for robots.

Speaker 1

那样会耗费更长时间。

That's just gonna take a lot longer.

Speaker 1

已经有研究人员联系我们，希望使用Marble来创建这些合成环境。

So we already have researchers reaching out and wanting to use marble to create those synthetic environments.

Speaker 1

我们还收到了关于如何使用Marble的意外用户反馈。

We also have unexpected user outreach in terms of how they want to use Marble.

Speaker 1

例如，有个心理学团队联系我们想用Marble进行心理学研究。

For example, a psychologist team called us to use Marble to do psychology research.

Speaker 1

原来他们研究的某些精神病患者，需要了解其大脑对不同特征沉浸场景的反应。

It turned out some of the psychiatric patients they study, they need to understand how their brain respond to different immersive scenes of different features.

Speaker 1

比如混乱的场景、整洁的场景，或任何你能命名的场景。

For example, messy scenes or clean scenes or whatever you name it.

Speaker 1

研究人员很难接触到这类沉浸式场景，而且他们需要花费太多时间和预算来创建。

And it's very hard for researchers to get their hands on these kind of immersive scenes, and it will take them too long and too much budget to create.

Speaker 1

而Marble几乎能瞬间将大量这类实验环境交到他们手中。

And Marble is a really almost instantaneous way of getting so many of these experimental environments into their hands.

Speaker 1

所以我们目前看到了多种应用场景，视觉特效师、游戏开发者、模拟开发者和设计师都非常兴奋。

So we're seeing multiple use cases at this point, but the VFX, the game developers, the simulation developers, as well as designers are very excited.

Speaker 0

这正是AI领域的工作方式。

This is very much the way things work in AI.

Speaker 0

我在播客中采访过其他AI领袖，他们总是说：尽早把产品放出去，这样才能发现最重要的应用场景在哪里。

I've had other AI leaders on the podcast, and it's always like, put things out there early as soon as you can to discover where the big use cases are.

Speaker 0

ChatGPT的负责人告诉我，他们刚推出ChatGPT时，他整天刷TikTok看人们怎么使用它、讨论什么话题，这些观察帮助他们确定了重点发展方向。

The head of ChatGPT told me how when they first put out ChatGPT, he was just scanning TikTok to see how people were using it and all the things they were talking about, and that's what convinced them where to lean in and help them see how people actually want to use it.

Speaker 0

我特别喜欢最后一个关于治疗的应用场景。

I love this last use case of like for therapy.

Speaker 0

想象一下，比如恐高症患者面对高处，或是面对蛇、蜘蛛之类的场景——

Just imagining like, like heights, people seeing, dealing with heights or snakes or spiders that which

Speaker 1

太神奇了。

It's amazing.

Speaker 1

昨晚我朋友真的打电话给我，聊他的恐高症，还问我是否该用marble治疗。

A friend of mine last night literally called me and talked about his height scare and asked me if marble should be used.

Speaker 1

你直接想到这个应用真是太棒了。

That's amazing you went straight there.

Speaker 0

因为我在想象所有暴露疗法的场景，这个工具会非常适用。

That's, you know, because I'm imagining all the, like the exposure therapy stuff, like this could be so good for that.

Speaker 0

这太酷了。

That is so cool.

Speaker 0

好的，让我——我本该早点问这个的，但我觉得这里有个关键问题：它与VO3这类视频生成模型究竟有何不同？

Okay, so let me, I should have asked you this before, but I think there's a, there's gonna be a question of just how does this differ from things like VO3 and other video generation models?

Speaker 0

对我来说区别很明显，不过解释一下它与现有视频AI工具的不同之处或许会有所帮助。

It's pretty clear to me, but I think it might be helpful just to explain how this is different from all the video AI tools people have seen.

Speaker 1

WordLab的核心观点是：空间智能至关重要，且空间智能不仅关乎视频。

WordLab's thesis is that spatial intelligence is fundamentally very important, and spatial intelligence is not just about videos.

Speaker 1

事实上，世界并非被动观看视频流逝的过程，对吧？

In fact, the world is not passively watching videos passing by, right?

Speaker 1

柏拉图用洞穴寓言来描述视觉认知。

Love Plato has the allegory of the cave analogy to describe vision.

Speaker 1

他说想象一个被绑在椅子上的囚犯——不太人道——在洞穴中观看面前的人生剧场。

He said that imagine a prisoner tied on his chair, not very humane, but in a cave watching a full life theater in front of him.

Speaker 1

但真实的演员表演其实发生在他背后。

But the actual live theater that actors are acting is behind his back.

Speaker 1

只是通过光线将动作投射在洞穴墙壁上。

It was just lit so that the projection of the action is on a wall of the cave.

Speaker 1

这个囚犯的任务就是理解眼前发生的真相。

And then the task of this prisoner is to figure out what's going on.

Speaker 1

这个极端例子生动展现了视觉的本质：从二维信息解读三维（或四维）世界。

It's a pretty extreme example, but it really shows, it describes what vision is about, is that to make sense of the three d world or four d world out of two d.

Speaker 1

因此在我看来，空间智能远比创造扁平二维世界更为深刻。

So spatial intelligence to me is deeper than only creating that flat two d world.

Speaker 1

对我来说，空间智能是创造、推理、互动、理解深度空间世界的能力，无论是二维、三维还是四维，包括动态变化等一切。

Spatial intelligence to me is the ability to create, reason, interact, make sense of deeply spatial world, whether it's two d or three d or four d, including dynamics and all that.

Speaker 1

所以WorldLab正专注于这一领域。

So so WorldLab is focusing on that.

Speaker 1

当然，单纯创作视频的能力本身也可能是其中的一部分。

And, of course, the ability to create videos per se could be part of this.

Speaker 1

事实上，就在几周前，我们推出了全球首个可在单块H100 GPU上实时演示的视频生成技术。

And in fact, just a couple of weeks ago, we rolled out the world's first real time demoable real time video generation on a single h 100 GPU.

Speaker 1

因此我们的技术体系包含了这一部分。

So we we we part of our technology includes that.

Speaker 1

但我觉得Marble截然不同，因为我们真正希望创作者、设计师和开发者能掌握一个能生成具有三维结构世界的模型，用于他们的工作。

But I think Marble is very different because we really want creators, designers, developers to have in their hands a model that can give them worlds with three d structure so they can use it for for their work.

Speaker 1

这正是Marble如此独特的原因。

And that's where that's why Marble is so different.

Speaker 0

在我看来，这是一个充满无限可能性的平台。

The way I see it is it's a it's a platform for a ton of opportunity to do stuff.

Speaker 0

正如你所说，视频就像是——这里有个一次性视频，很有趣很酷，然后就这样了。

As you describe, videos are just like, here's a one off video that's very fun and cool, and you could and that's it.

Speaker 0

就这样。

That's it.

Speaker 0

然后你就继续前进。

And then you move on.

Speaker 1

顺便说，在Marble里我们允许用户以视频形式导出内容。

By the way, we could in Marble, we could allow people to export in video form.

Speaker 1

就像你说的那样，你可以进入一个世界。

So you could actually like you said, you go into a world.

Speaker 1

比如说这是一个霍比特人的洞穴。

So so let's say it's a hobbit cave.

Speaker 1

特别是作为创作者，你会有一种非常具体的方式，按照导演脑海中的轨迹来移动摄像机。

You can actually especially as a creator, you have such a specific way of moving the camera in a trajectory in the director's mind.

Speaker 1

对吧？

Right?

Speaker 1

然后你可以将其从Marble导出为视频。

And then you can export that from Marble into video.

Speaker 0

要创造出这样的东西需要什么条件？

What does it take to create something like this?

Speaker 0

团队规模有多大？

Just like how big is the team?

Speaker 0

你们使用多少GPU？

How GPUs you work in?

Speaker 0

任何可以分享的信息都可以。

Anything you can share there.

Speaker 0

我不知道这些信息有多少是保密的，但创造并推出你们这样的产品需要什么条件？

I don't know how much of this is private information, but just what does it take to create something like this that you've launched here?

Speaker 1

这需要大量的脑力。

It takes a lot of brain power.

Speaker 1

我们刚才说到每个大脑20瓦的功率。

We just talk about 20 watts per brain.

Speaker 1

从这个角度看，数字虽小，但确实令人难以置信。

So from that point of view, it's a small number, but it's actually incredible.

Speaker 1

要知道，这是五亿年进化才赋予我们的力量。

You know, it's half billion years of evolution to give us those power.

Speaker 1

我们

Speaker 0

拥有

have

Speaker 1

目前团队约有30人，主要是研究员或研究工程师。

a team of 30 ish people now, and we are predominantly researchers or research engineers.

Speaker 1

但我们也有设计师和产品人员。

And but we also have designers and and product.

Speaker 1

我们真心希望创建一家以空间智能深度技术为核心的公司，同时也在认真打造产品。

We we actually really believe that we wanna create a company that's anchored in the deep tech of spatial intelligence, but we we we are actually building serious products.

Speaker 1

因此我们实现了研发与产品化的融合。

So we have this integration of R and D and productization.

Speaker 1

当然，我们使用了大量GPU。

And of course, we use, you know, a ton of GPUs.

Speaker 0

啊这

A that's

Speaker 1

技术层面是这样的。

the technical to hear.

Speaker 0

恭喜产品发布。

Well, congrats on the launch.

Speaker 0

我知道这是一个重要的里程碑。

I know this is a huge milestone.

Speaker 0

知道这需要付出巨大的努力。

Know this took a ton of work.

Speaker 1

谢谢苏

Thank So

Speaker 0

我只想对你和你的团队表示祝贺。

I just want to say congrats to you and your team.

Speaker 0

让我简单谈谈你的创始人历程。

Let me talk about your founder journey for a moment.

Speaker 0

所以你是这家公司的创始人。

So you're a founder of this company.

Speaker 0

你是多少年前创办的？

You started how many years ago?

Speaker 0

几年前？

A couple years ago?

Speaker 0

两、三年前？

Two, three years ago?

Speaker 1

一年前。

A year ago.

Speaker 1

一年前。

A year ago.

Speaker 1

一年一年？

A year year?

Speaker 1

好的。

Okay.

Speaker 1

整整十八个月。

Very eighteen month.

Speaker 1

是啊。

Yeah.

Speaker 0

好的。

Okay.

Speaker 0

在你开始之前，有什么是你希望自己早知道、能对十八个月前的自己耳语的事情吗？

What's something you wish you knew before you started this that you wish you could, like, whisper into the ear of eighteen months ago?

Speaker 1

嗯，我一直希望能预知科技的未来。

Well, I continue to wish I know the future of technology.

Speaker 1

我认为这实际上是我们创始优势之一——我们通常比大多数人更早看到未来。

I think actually that's one of our founding advantage is that we see the future earlier in general than most people.

Speaker 1

但即便如此，伙计，未知和即将到来的事物还是如此令人兴奋和惊叹。

But still, man, this is so exciting and so amazing that what's unknown and what's coming.

Speaker 1

但我知道你问这个问题的重点不在于科技的未来。

But I know the reason you're asking me this question is not about the future of technology.

Speaker 1

你大概更想说的是——听着，我20岁时可没创办过这种规模的公司。

You're probably more, you know, look, I did not start a company of this scale at 20 year old.

Speaker 1

所以我19岁时开了家干洗店，但那规模小得多。

So, you know, I started a dry cleaner when I was 19, but that's a little smaller scale.

Speaker 0

我们得谈谈

We gotta talk

Speaker 1

关于那件事。

about that.

Speaker 1

后来我创立了谷歌云AI部门，又在斯坦福大学建立了一个研究所，但这两者性质截然不同。

And then I, you know, founded Google Cloud AI, and then I founded an institute at Stanford, but those are different beasts.

Speaker 1

确实，相比那些20岁的创业者，作为经历过磨砺的创始人，我感觉自己准备得更充分些。

I did feel I was a little more prepared as a a founder of the grinding journey I compared to maybe the 20 year old founders.

Speaker 1

但我仍时常感到震惊，甚至陷入焦虑——AI领域的技术竞争和人才争夺竟如此激烈。

But I still I'm surprised and it puts me into paranoia sometimes that how intensely competitive AI landscape is from the technology itself, as well as talents.

Speaker 1

要知道当初我创业时，根本想象不到某些顶尖人才的薪酬会达到如此惊人的数字。

And, you know, when I founded the company, we did not have these incredible stories of how much certain talents would cost, you know?

Speaker 1

这些情况不断刷新我的认知，我必须时刻保持警觉。

So these are things that continue to surprise me, and I have to be very alert about.

Speaker 0

所以你提到的竞争，主要是人才争夺战，以及整个领域令人目眩的发展速度。

So the competition you're talking about is, yeah, the competition for talent, the speed at which just how things are moving.

Speaker 0

确实。

Yeah.

Speaker 0

没错。

Yeah.

Speaker 0

我想回到你之前提到的观点：纵观你的职业生涯，你几乎参与了所有孕育当今重大突破的核心人类集群。

You mentioned this point that I want to come back to that you, if you just look over the course of your career, you were like at all of the major collections of humans that led to so many of the breakthroughs that are happening today.

Speaker 0

我们讨论过ImageNet，还有斯坦福SAIL实验室——许多突破性工作诞生地，谷歌云也是重大突破的温床。

Obviously we talk about ImageNet, also just SAIL at Stanford is where a lot of the work happened, Google Cloud, which a lot of the breakthroughs happened.

Speaker 0

是什么吸引你投身这些地方的？

What brought you to those places?

Speaker 0

对于那些寻求职业发展、想要站在未来中心的人们来说，能否分享一下是什么力量推动你不断转换环境、加入那些可能对他人有启发的团队？

For people looking for how to advance in their career, be at the center of the future, just like, is there a through line there of just what pulled you from place to place and pulled you into those groups that might be helpful for people to hear?

Speaker 1

这确实是个好问题，兰尼，因为我确实经常思考这一点。

This is actually a great question, Lanny, because I do think about it.

Speaker 1

显然，我们之前讨论过是好奇心和热情将我引向了人工智能领域。

And obviously, we talked about its curiosity and passion that brought me to AI.

Speaker 1

这更像是一颗科学北极星，对吧？

That is more a scientific north star, right?

Speaker 1

我当时并不在乎AI是否真的能成气候。

I did not care if AI was a thing or not.

Speaker 1

这是其中一部分原因。

So that was one part.

Speaker 1

但关于我如何最终选择了包括创立World Labs在内的特定工作场所，我想我很感谢自己——或许该感谢父母的基因。

But how did I end up choosing in the particular places I work in, including starting World Labs, is I think I'm very grateful to myself or maybe to my parents' genes.

Speaker 1

我在知识探索上是个非常无所畏惧的人。

I'm an intellectually very fearless person.

Speaker 1

必须说，当我招聘年轻人时，我特别看重这种特质。因为想要有所作为，就必须接受你正在创造新事物或探索未知领域——这种品质至关重要。

And I have to say, when I hire young people, I look for that because I think that's a very important quality if one wants to make a difference, is that when you want to make a difference, you have to accept that you're creating something new or you're diving into something new.

Speaker 1

前人都没做过这些事。

People haven't done that.

Speaker 1

如果你具备这种自我认知，就几乎必须允许自己保持无畏和勇敢。

And if you have that self awareness, you almost have to allow yourself to be fearless and to be courageous.

Speaker 1

比如当我来到斯坦福时，在学术圈里，我离那个叫做'终身教职'的东西很近——就是能在普林斯顿获得永久职位。

So when I, for example, came to Stanford, you know, in the world of academia, I was very close to this thing called tenure, which is, you know, have the job forever at Princeton.

Speaker 1

但我选择来斯坦福是因为我热爱普林斯顿。

But I chose to come to Stanford because I love Princeton.

Speaker 1

那是我的母校。

It's my alma mater.

Speaker 1

就在那一刻，斯坦福有那么多杰出人才，硅谷的生态系统如此出色，让我愿意冒险重新开始我的终身教职计时。

It's just at that moment, there are people who are so amazing at Stanford and the Silicon Valley ecosystem was so amazing that I was okay to take a risk of restarting my tenure clock.

Speaker 1

成为SAIL首位女性主任时，实际上相对而言我当时还是一名非常年轻的教员。

Going to becoming the first female director of SAIL, I was actually, relatively speaking, a very young faculty at that time.

Speaker 1

我想这么做是因为我在乎那个群体。

And I wanted to do that because I care about that community.

Speaker 1

我没有花太多时间考虑所有可能的失败情况。

I didn't spend too much time thinking about all the failure cases.

Speaker 1

显然我很幸运得到了资深教员们的支持，但我只是想有所作为。

Obviously, I was very lucky that the more senior faculty supported me, but I just wanted to make a difference.

Speaker 1

后来去谷歌也是类似的情况。

And then going to Google was similar.

Speaker 1

我想与杰夫·迪恩、杰夫·辛顿这样的杰出学者，以及所有那些了不起的人共事。

I wanted to work with people like Jeff Dean, Jeff Hinton, and all these incredible academics, the incredible people.

Speaker 1

你知道吗？

You know?

Speaker 1

世界实验室的情况也是如此。

So so the same with World Labs.

Speaker 1

我怀有这份热情，也相信志同道合的人能创造奇迹。

I I have this passion, and I also believe that people with the same mission can do incredible things.

Speaker 1

这就是它如何指引我度过人生的。

So that's how it guided my through through life.

Speaker 1

我不会过度思考所有可能出错的事情，因为实在太多了。

I don't overthink of all possible things that can go wrong because that's too many.

Speaker 0

我觉得这是个重要因素。

I feel like that's an important element.

Speaker 0

这不是在关注消极面，而是更多地关注人、使命和让你兴奋的事物，你觉得呢？

This is not focusing on the downside, focusing more on the people, the mission, what gets you excited, what do think?

Speaker 1

我很好奇。

I curiosity.

Speaker 1

是的。

Yeah.

Speaker 1

我想对所有AI领域的年轻人才、工程师和研究人员说一句，因为你们中有些人申请了WorldLabs。

I do wanna say one thing to all the young talents in AI, the engineers, the researchers out there, because some of you apply to WorldLabs.

Speaker 1

我感到非常荣幸你们考虑过WorldLabs。

I I feel very privileged you considered WorldLabs.

Speaker 1

我发现现在很多年轻人在决定工作时会考虑方程式的每一个细节。

I do find many of the young people today think about every single aspect of an equation when they decide on jobs.

Speaker 1

某种程度上，也许这就是他们想要的方式。

At some point, maybe, you know, maybe that's the way they want to do it.

Speaker 1

但有时我确实想鼓励年轻人关注重要的事情，因为我发现自己与求职者交谈时总是不自觉地进入导师模式，不一定是在招聘或不招聘，而只是处于指导状态。

But sometimes I do want to encourage young people to focus on what's important because I find myself constantly in mentoring mode when I talk to job candidates, not necessarily recruiting or not recruiting, but just in mentoring mode.

Speaker 1

当我看到一位才华横溢的年轻人过度纠结于工作考虑的每个微小维度时，或许最重要的问题是：你的热情在哪里？

When I see an incredible young talent who is over focusing on every minute dimension and aspect of considering a job, when maybe the most important thing is where's your passion?

Speaker 1

你认同这个使命吗？

Do you align with the mission?

Speaker 1

你相信并对这个团队有信心吗？

Do you believe and have faith in this team?

Speaker 1

只需专注于你能产生的影响，以及你能参与的工作和合作的团队。

And just focus on the impact and you can make and the kind of work and team you can work with.

Speaker 0

是啊，这很艰难。

Yeah, it's tough.

Speaker 0

现在对AI领域的人来说很艰难。

It's tough for people in the AI space now.

Speaker 0

他们面临太多压力，太多新闻，太多变化，太多错失恐惧症。

There's just so much at them, so much news, so much happening, so much FOMO.

Speaker 1

确实如此。

That's true.

Speaker 0

我能看出这种压力。

I could see the stress.

Speaker 0

所以我认为这个建议非常重要，就像什么能真正让你对所做的事情感到满足，而不仅仅是哪家公司增长最快，谁会赢，我不知道。

So I think that advice is really important, just like what will actually make you feel fulfilled in what you're doing, not just where's the fastest growing company, where's the, who's going to win, I don't know.

Speaker 0

我想确保我问到你今天在斯坦福大学HCI的工作。

I want to make sure I ask you about the work you're doing today at Stanford at the HCI.

Speaker 1

我认为这是

I think it's the

Speaker 0

HAI，以人为中心的人工智能研究所。

HAI, Human Centered AI Institute.

Speaker 0

你在那里做什么？

What you what are you doing there?

Speaker 0

我知道你现在还在做这件事。

I know this is a thing you do on the side still.

Speaker 1

是的，HAI（以人为本的人工智能研究所）是由我和包括John H教授在内的一群学者共同创立的。

So, yes, I HAI, Human Centered AI Institute, was co founded by me and a group of faculty like Professor John H.

Speaker 1

Mendy教授、James Landy教授、Chris Manning教授等人早在2018年就参与了。

Mendy, Professor James Landy, Professor Chris Manning back in 2018.

Speaker 1

当时我其实刚结束在谷歌的学术休假。

I was actually finishing my sabbatical at Google.

Speaker 1

这对我来说是个极其重要的决定——我本可以留在工业界，但在谷歌的经历让我明白：AI将成为改变人类文明的技术。

And it was a very, very important decision for me because I could have stayed in industry, but my time at Google taught me one thing is AI is gonna be a civilizational technology.

Speaker 1

我逐渐意识到这对人类有多重要，甚至促使我在2018年为《纽约时报》撰文，探讨建立AI开发和应用的指导框架的必要性。

And they it's it dawned on me how important this is to humanity to the point that I actually wrote a piece in New York Times that year, 2018, to talk about the need for a guiding framework to develop and to to apply AI.

Speaker 1

这个框架必须以人类福祉为根基，以人为核心。

And that framework has to be anchored in human benevolence, is human centeredness.

Speaker 1

我认为斯坦福——这所位于硅谷心脏地带、孕育了从英伟达到谷歌等伟大企业的世界顶尖学府——应当成为创建这种以人为本AI框架的思想领袖，并将其贯彻到我们的研究、教育、政策及生态建设中。

And I felt that Stanford, one of the world's top university in the heart of Silicon Valley that gave birth to important companies from NVIDIA to Google, should be a thought leader to create this human centered AI framework and to actually embody that in our research, education, and policy and ecosystem work.

Speaker 1

于是我创立了HAI。

So I founded HAI.

Speaker 1

转眼六七年过去，它已成为全球最大的人工智能研究机构，开展以人为本的研究、教育、生态拓展和政策影响工作。

You know, after fast forward, after six, seven years, it has become the world's largest AI institute that does human centered research, education, ecosystem outreach, and policy impact.

Speaker 1

它汇聚了斯坦福八大学院——从医学院、教育学院到可持续发展学院、商学院、工程学院、人文学院、法学院——数百名教职员工。

It involves hundreds of faculty across all eight schools at Stanford, from medicine to education, to sustainability, to business, to engineering, to humanities, to law.

Speaker 1

我们支持研究人员，特别是在数字经济、法律研究、政治学、新药研发、新算法乃至超越Transformer模型等跨学科领域。

And we support researchers, especially at the interdisciplinary area from digital economy to legal studies, to political science, to discovery of new drugs, to to new algorithms, to that's beyond transformers.

Speaker 1

我们还特别重视政策研究，因为当我们成立HAI时，我意识到硅谷与华盛顿、布鲁塞尔乃至世界其他地区之间缺乏对话。

We also actually put a very strong focus on on policy because when we started HAI, I realized that Silicon Valley did not talk to Washington, DC and or Brussels or other parts of the world.

Speaker 1

鉴于这项技术的重要性，我们需要让各方都参与进来。

And given how important this technology is, we need to bring everybody on board.

Speaker 1

因此我们创建了多项计划，从国会速成班到AI指数报告，再到政策简报会。

So we created multiple programs from congressional bootcamp to AI index report to policy briefing.

Speaker 1

我们尤其参与了政策制定工作，包括推动在前特朗普政府时期通过的国家AI研究云法案，并参与州级AI监管讨论。

And we especially participated in policymaking, including advocating for a national AI research cloud bill that was passed in the first Trump administration and participate participating in state level regulatory AI discussions.

Speaker 1

我们做了很多工作。

So there's a lot we did.

Speaker 1

我继续担任领导者之一，尽管在运营层面参与较少，因为我不仅关心创造这项技术，更关注如何正确使用它。

And I continue to be one of the leaders, even though I'm much less involved operationally, because I care not only we create this technology, but we use it in the right way.

Speaker 0

哇。

Wow.

Speaker 0

我之前不知道你还做了这么多其他工作。

I was not aware of all that other work you were doing.

Speaker 0

听你讲述时，我想起查理·芒格的名言：'选择一个简单的理念，并非常认真地对待它'。

As you're talking, was reminded Charlie Munger had this quote, Take a simple idea and take it very seriously.

Speaker 0

感觉你在诸多领域都践行着这个理念并持之以恒，多年来产生的全方位影响令人难以置信。

I feel like you've done that in so many different ways and stayed with it, and it's unbelievable the impact that you've had in so many ways over the years.

Speaker 0

我跳过快速问答环节，直接问你最后一个问题。

I'm gonna skip the lightning round, and I'm just gonna ask you one last question.

Speaker 0

你还有什么想分享的吗？还有什么想留给听众的？

Is there anything else that you wanted to share, anything else you wanna leave listeners with?

Speaker 1

我对AI Lenny感到非常兴奋。

I'm very excited by AI Lenny.

Speaker 1

我想回答一个我在世界各地旅行时总被问到的问题：无论我是音乐家、中学教师、护士、会计、农民，我在AI中能扮演什么角色？AI是会接管我的生活和工作吗？

I wanna answer one question that when I travel around the world, everybody asks me is that if I'm a musician, if I'm a teacher, middle school teacher, if I'm a nurse, if I'm an accountant, if I'm a farmer, do I have a role in AI, or is AI just gonna take over my life or my work?

Speaker 1

我认为这是关于AI最重要的问题。

And I think this is the most important question of AI.

Speaker 1

我发现硅谷往往不会与人们——无论是像我们这样的还是不像我们这样的，而是指所有人——进行心与心的对话。

And I find that in Silicon Valley, we tend not to speak heart to heart with people, with people like us and and not like us in Silicon Valley, but like all of us.

Speaker 1

我们总是随意抛出诸如无限生产力、无限休闲时间或无限力量之类的词汇。

We tend to just toss around words like infinite productivity or infinite leisure time or infinite power or whatever.

Speaker 1

但归根结底，AI是关于人的。

But at the end of the day, AI is about people.

Speaker 1

当人们问我这个问题时，我的回答是响亮的肯定。

And when people ask me that question, it's a resounding yes.

Speaker 1

每个人在AI中都有自己的角色。

Everybody has a role in AI.

Speaker 1

这取决于你的职业和追求，但任何技术都不应剥夺人类的尊严。

It depends on what what you do and what you want, but no technology should take away human dignity.

Speaker 1

人类尊严和自主权应成为每项技术开发、部署及治理的核心。

And the human dignity and the agency should be at the heart of the development, the deployment, as well as the governance of every technology.

Speaker 1

如果你是个年轻艺术家，热爱讲故事，就把AI当作工具来拥抱吧。

So if you are a young artist and your passion is storytelling, embrace AI as a tool.

Speaker 1

事实上，拥抱漫威吧。

In fact, embrace Marvel.

Speaker 1

我希望它能成为你的工具，因为你讲述故事的方式独一无二，这个世界依然需要它。

I hope it becomes a tool for you because the way you tell your story is unique and the world still needs it.

Speaker 1

但你要如何讲述你的故事呢？

But how you tell your story?

Speaker 1

如何用最不可思议的工具以最独特的方式讲述你的故事，这很重要。

How do you use the most incredible tool to tell your story in the most unique way is important.

Speaker 1

而这个声音需要被听见。

And that voice needs to be heard.

Speaker 1

即使你是临近退休的农民，AI依然重要，因为你也是公民。

If you're a farmer near retirement, AI still matters because you're a citizen.

Speaker 1

你可以参与社区事务。

You can participate in your community.

Speaker 1

你应该对AI的使用方式和应用领域有发言权。

You should have a voice in how AI is used, how AI is applied.

Speaker 1

你可以与志同道合的人合作，鼓励大家运用AI让生活更轻松。

You you work with people that you can, you know, encourage all of all of you to use AI to make life easier for you.

Speaker 1

如果你是护士，我希望你知道——至少在我的职业生涯中，我深耕医疗研究，因为我坚信AI技术应该大力辅助医护人员，无论是通过智能摄像头提供更多信息，还是机器人协助，毕竟我们的护士们工作过度、疲惫不堪。

If you're a nurse, I hope you know that at least in my career, I have worked so much in health care research because I feel our health care workers should be greatly augmented and helped by AI technology, whether it's smart cameras to feed more information or robotic assistance, because our nurses are overworked, overfatigued.

Speaker 1

随着社会老龄化，我们需要更多帮助来照料人们。

And as our society ages, we need more help for people to be taken care of.

Speaker 1

因此AI可以扮演这个角色。

So AI can play that role.

Speaker 1

所以我想说的是，这非常重要，即使像我这样的技术专家也真诚地认为每个人在人工智能中都有自己的角色。

So I just wanna say that it's so important that even a technologist like me are sincere about that everybody has a role in AI.

Speaker 0

多么完美的结束方式啊。

What a beautiful way to end it.