本集简介
双语字幕
仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。
你好。
Hello.
我是安德鲁·梅恩,这是OpenAI的播客。
I'm Andrew Main, and this is the OpenAI podcast.
今天,我们的嘉宾是OpenAI的首席财务官莎拉·弗莱尔,以及知名投资者、Khosla Ventures的维诺德·科斯拉。
Today, our guests are Sarah Fryer, CFO of OpenAI, and legendary investor Vinod Khosla of Khosla Ventures.
在本次讨论中,我们将探讨人工智能生态系统的现状,我们是否处于泡沫之中,以及在人工智能发展过程中初创公司和投资者如何取得成功。
In this discussion, we're going to talk about the state of the AI ecosystem, whether or not we're in a bubble, and how startups and investors can succeed as AI progresses.
与Netflix这样的平台不同,它们每天播放大量内容,我认为人工智能更像基础设施,比如电力。
Unlike something like Netflix, where they're running so many hours in the day, I think of it much more like infrastructure, electricity.
今天,需求的限制因素不是别的,而是计算资源的可用性。
Demand is limited, not by anything other than availability of compute today.
我认为我们需要讨论的问题是:人们将会做什么?
I think the conversation we need to have is what will people do?
2025年将是关于智能代理和氛围编程的一年。
2025 was about agents and vibe coding.
现在是2026年。
Now it's 2026.
2026年的故事是什么?
What's the story of 2026?
我认为我们在2025年已经让氛围编码更加成熟了。
I think we matured in vibe coding in 2025.
我认为我们在智能体方面还没有成熟。
I don't think we've matured in agents.
所以智能体,尤其是多智能体系统,将会成熟到产生切实可见的影响。
So agents, especially multi agentic systems, will mature to the point of having real visible impact.
无论你是企业,拥有一个多功能智能体系统为你完成完整任务,比如每天为你运行ERP系统,处理所有对账、计提和合同跟踪。
Whether you're an enterprise and you have a multi agent systems doing full task, like running an ERP system for you, you know, doing all the reconciliation every day, accruals every day, tracking contracts every day.
我认为这在企业层面是如此。
I think that on the enterprise side.
但今天,在消费者层面,计划一次旅行仍然很麻烦。
But today, on the consumer side, you know, it's still a hassle to plan a trip.
这种多智能体系统会综合考虑你的饮食偏好、餐厅预订、航班时刻表以及个人日程等多个方面,我认为一年后就会开始成熟。
That's a multi agentic thing that looks across a lot of different things from your food preferences, to the restaurant preservation, to airline schedules, to your personal calendar, those will start to mature, I think a year from now.
所以我对这一点非常期待。
So I'm pretty excited about that.
我认为,机器人领域以及超越机器人本身的现实世界模型,比如通用直觉,都将在明年开始出现。
I think models in robotics and real world models that go beyond well beyond robotics, like general intuition will all start to happen in the next year.
所以我认为这些是值得关注的领域。
So I think that those are areas to look for.
像LLM中的记忆功能、持续学习、减少幻觉影响等,都是常见的功能。
There's usual functions like memory in LLMs, continual learning in LLMs, reduction of the impact of hallucinations.
这些都是AI目前表现不佳的领域,我还能列举更多,大约有六七个方面,未来将开始得到改善。
Those are all areas, I could go on, there's half a dozen areas in which AI doesn't do as well today that will start to be addressed.
是的。
Yeah.
我认为,从根本上说,维诺德的意思是:2026年将是缩小这一能力差距的起点。
And I think at its baseline, what Vinod is saying is '26 is the beginning of closing this capability gap.
所以我们知道,我们已经赋予了人们巨大的智能。
So what we know is we've handed people massive intelligence.
对吧?
Right?
我们把法拉利的钥匙交到了他们手里,但他们才刚开始学习如何上路驾驶。
We've handed them the keys to the Ferrari, but they are only learning how to take it out on the road for the first time.
我们需要为消费者提供越来越多简单易用的方式,让ChatGPT不再只是一个聊天问答工具。
We need to give consumers more and more easy ask, easy ways to go from ChatGPT is just a chatbot call and response.
今天大多数人只是用它来提问。
Most people use it today just to ask questions.
但我们要如何让它演变成真正的任务助手,帮他们预订旅行、为他们刚从医生那里听到的信息提供第二意见,或帮他们为患有糖尿病的孩子设计菜单呢?
But how do we take it towards being a true task worker that that books that trip for them or helps them get a second opinion on what they just heard from their doctor or enables them to create a menu for their diabetic child?
对吧?
Right?
我们该如何帮助他们从简单的提问,转向真正改善生活的实际成果?
How do we help them really move from simple questions into actual outcomes that make my life better?
在企业端,情况也是同样的连续过程。
And then on the enterprise side, it's that same continuum.
我们如何缩小能力差距?
How do we close the capability gap?
我们从首席经济学家去年年底发布的《企业人工智能现状报告》中了解到,前沿企业与普通企业之间的差距。
One of the things we know from our state of the enterprise AI and the enterprise report that our chief economist put out at the end of last year is on the frontier versus just even the median corporation.
平均消息量或中位数约为六倍,这表明前沿企业的使用量是普通企业的六倍。
The the average number of messages or the median is about six x, which will tell you that six x the usage from a company that's already on the frontier.
我们知道,前沿企业甚至还没有达到其最大潜力。
And we know that frontier isn't even pushed to its max.
因此,对我们而言,重点是如何帮助消费者沿着这一连续过程,迈向真正的智能任务执行?
So for us, it's this focus of how do we help consumers move along that continuum to true agentic task working?
而对于企业,我们如何创建更复杂、垂直专业化的成果,使他们能够从简单的ChatGPT应用,逐步发展到彻底改变其核心业务的系统?
And then for enterprises, how do we create a much more sophisticated, vertically specialized outcome for enterprises that allows them to go from maybe a very simple chat GPT implementation the whole way to something that's transforming the most important part of their business.
对于医疗提供商来说,这可能是他们的药物研发流程。
For a healthcare provider, it might be their drug discovery process.
对于一家医院来说,可能是缩短患者入院时间,帮助患者更快回归社区。
For a hospital, it might be the time to admit a patient to get that patient back into the community.
对于一家大型零售商来说,可能是增加购物车规模、提高转化率,以及让客户更满意。
For a really large retailer, it might be just larger basket sizes, higher conversion rates, and much happier customers.
因此,这本质上是缩小能力差距的基础。
So it's the basics of closing that capability gap.
我可能会补充另一个视角。
So I might add one other perspective.
我们已经讨论过技术将取得进展的多个领域,以及能力将如何提升。
We've talked about the number of areas in which the technology will advance and capability will advance.
我猜测,如今使用人工智能的人,无论是个人还是企业,只有个位数百分比的人甚至用到了人工智能30%的能力。
I would venture to guess today, of the people using AI, whether it's personal or enterprise, some single digit percentage are even using 30% of the capability of the AI.
因此,使用人工智能30%或50%能力,更不用说80%能力的人群比例将持续增长。
So this percentage of people who are using 30% or 50%, let alone 80% of the AI's capabilities will keep increasing.
我认为,人们要掌握如何使用人工智能,还需要十年时间。
I think that's a ten year journey before people learn to use AI.
我见过这种情况,有些人,一些评论家把采用曲线和能力曲线搞混了。
I've seen this, some people kind of pundits confuse adoption curves for capability curves.
这确实出现过,你已经看到了。
And that's that's come up where you've seen.
这正是我想表达的观点。
So that's the point I'm making.
这是一个倍增器,因为今天已经有超过8亿人每周在使用ChatGPT。
And it's a force multiplier because today we have over 800,000,000 using ChatGPT today, 800,000,000 consumers weekly using.
但你知道,这个数字应该达到数十亿。
But, you know, that number should be in the billions.
那么,他们使用它的比例是多少呢?
And then what percentage use are they using it for?
这就像我们刚刚在家里通上了电。
It's like we've just turned electricity on in the home.
我们已经布好了电线,人们打开了灯,但他们根本不知道现在还能用它来给家里供暖。
We've wired up the home and they've turned on the lights, but they have no idea that they could now heat their home.
他们可以用它做饭,也可以用它卷头发,对吧?
They could cook, they could curl their hair, right?
现在你能做的事情太多了。
There's so many things you now can do.
我用过一个类比:从1990年到2000年,电子邮件并没有明显改进,手机也是如此,但使用人数却大幅增长,问题不在于‘我们需要更好的电子邮件,需要更先进的手机’。
An analogy I've used is that email didn't really get much better between 1990 and the year 2000, neither did mobile, but usage went way up and the problem wasn't like, well, we need better email, we need more better mobile.
而是人们需要学会所有能用它来做的事情。
It's like people need to learn all the things they could use it for.
没错。
Right.
是的。
Yeah.
更复杂一点来说,手机就是一个很有趣的例子,当手机普及时,人们只是把桌面网站直接搬到手机上,结果页面很难滚动。
And in a more sophisticated way, like mobile is always one that's interesting to me because when mobile took off, people just took their desktop websites and turned them into mobile and they were really hard to scroll.
但至少你把它放进了口袋里。
But I guess you at least had them in your pocket.
但后来你意识到你有了GPS。
But then you realized you had a GPS.
所以现在你可以使用Uber,也可以利用位置功能,或者随时用上相机。
So now you could have Uber, and now you could do things with location or you had a camera at your fingertips.
好吧。
Okay.
所以现在,是的,我可以给所有朋友拍照,但我也能拍下支票并存入银行账户,尽管我们该解决一下纸质支票的问题。
So now, yeah, I can take photographs of all my friends, but I can also snap, you know, a check and deposit it into my bank account, although we should fix the whole paper check thing.
但那是题外话。
But that's on the side.
它看起来仍然如此。
It still seems like it.
但我可以
But I can
只需拍下这张照片
just take a photo of this
现在我的银行账户里有钱了。
and now I get money in my bank account.
是的。
Yeah.
但你知道,这些功能在手机普及的那一刻就已经存在了。
But, you know, those those that all existed in in the minute mobile was available to us.
但更重要的是,人类的创造力得以发挥作用。
But just the, you know, the the the ability for human ingenuity to come to work on it.
所以我认为你是对的。
So I think you're right.
我甚至不知道,我们是否需要比今天更多的智能,就能大幅提高成果。
I don't even know if we need more intelligence than we have today to vastly increase outcomes.
但当然,这些模型也会继续变得更加智能。
But of course, the models are going to keep getting more intelligent as well.
你提到了健康,这是我们思考时最看重的领域之一,可能也是最重要的事情。
You mentioned health and that's one of the really kind of high stakes things we think about when it comes to just probably the most important thing.
想想几年前我们刚得到ChatGPT时,只用它来做一些非常简单的应用,而现在我们却开始让它处理符合HIPAA标准的敏感数据,这确实很有趣。
And it's kind of fascinating to think about the gist, you know, a few years ago we got ChatGPT and we're using it for very simple applications and now we're trusting with HIPAA compliant data.
你是否认为这是衡量技术加速发展速度或程度的一个标志?
Do you look at that as sort of a marker of how fast or how well things have been accelerating?
你还能想到其他类似的例子,来说明我们已经进入了一个全新的阶段吗?
Are there other ones like that you think about to say, okay, now we know we're some new level.
健康显然是其中一个领域,我长期以来一直相信,AI将通过让专业知识变得普及来彻底改变医疗行业。
Health is clearly one of those areas I've long believed it'll revolutionize health by making expertise be a commodity in all areas of health.
但医疗领域的问题在于监管。
The problem with health is regulatory.
首先,AI能做的事情受到诸多限制。
So first, there's constraints on what AI can do.
即使AI开处方的能力比人类更强,它也无权合法开具处方。
And AI can't legally write a prescription, even if it's better than human beings at writing a prescription.
这不仅是FDA的限制,更是由美国医学会等机构在制度层面上控制着这一职能。
That is not only the FDA, but it's actually beyond the FDA into the American Medical Association institutionally controls that function.
因此,在许多领域都会存在既得利益者的抵制。
So they will be incumbent resistance in a lot of areas.
如果你愿意,我们可以聊聊这个。
I think we can talk about it if you like.
但诊断仍然是一个限制,因为FDA对此有控制权。
But diagnosing is still a constraint because the FDA controls that.
目前还没有任何AI被批准为医疗设备。
There's no AI approved as a medical device yet.
幸运的是,这一届政府正在迅速行动,并采取了适当的风险水平。
That all, fortunately, this administration is doing a very good job of moving quickly and taking the appropriate level of risk.
因此,我对那里发生的事情感到非常满意。
So I'm pretty pleased to see what's happening there.
在医疗领域,我们的数据显示,每周有两亿三千万人向ChatGPT询问健康问题。
On the health front, we see in our data, two thirty million people every week ask CHAT GPT a health question.
66%的U。
Sixty six percent of U.
S.
S.
医生表示他们在日常工作中使用CHAT GPT。
Physicians say they use CHAT GPT in their daily work.
我跟你讲个个人经历,我哥哥是英国重症监护室的医生,他的工作就是你急诊室一到,没人知道怎么分诊你,就把你推给他。
I'll tell you at a personal level, my brother is an HDU doctor in The UK, so his job is you hit the ER, They don't know how to triage you.
所以他们把你送到他那里。
So they send you to him.
你其实不太想去找他。
You kind of don't want to show up to him.
但他被期望要
He's expected to
不过他非常出色。
have He's very good, though.
他做他这份工作非常在行
He's very good at what he
但他意味着你情况不太妙。
does, but it means you're not in good shape.
但他被期望拥有几乎百科全书般的知识,了解所有存在过的疾病。
But he's expected to have an almost an encyclopedic knowledge of every disease that ever existed.
所以我总是举这个例子。
So I always give the example.
他在苏格兰的阿伯丁工作。
He works in Aberdeen in Scotland.
如果你得了疟疾去找他,他根本不会往那方面想。
If you showed up with malaria, he will not think of that.
这不在他的模式识别范围内。
That is not in his pattern recognition.
然而这种情况是有可能发生的。
And yet that could have happened.
我不知道。
I don't know.
你夏天去度假,被蚊子叮了。
You went on vacation summer, you got bitten by mosquito.
突然间,你出现在阿伯丁的急诊室。
Boom, you're showing up in an ER room in Aberdeen.
ChatGPT 或模型能做的,其实是为医生提供强大的辅助,这就是为什么我认为有百分之六十六的医生在使用它。
What chat GPT can do or what the model can do is really act as a great augmentation to the doctor, which is why I think sixty six percent of them are using it.
而且这个数字还在持续增长。
And that number is only growing.
你知道,实际比例可能已经高得多。
You know, it's probably already, much higher.
因此,我认为这很好地说明了在医疗领域,我们的医生能够随时接触到最新的研究成果,了解最新的药物方案与患者个体生活和体验之间的相互作用。
And so I think it's just a great example of where something like health, we're getting the benefit of our doctors being able to have always the latest research in front of them, always the latest known, interactions, say between someone's drug regime and what they're living through and experiencing as individuals.
但它也把一部分自主权重新交到了消费者手中。
But it also puts some independence back into consumers' hands.
因此,我现在有机会提前研究一下我的症状可能意味着什么,以便能与医生进行更有依据的对话。
So now I get the opportunity to, ahead of time, do some research on what my symptoms might be saying so I can have a much more educated conversation with my doctor.
它让我能够获得第二意见,或者知道我应该去寻求第二意见。
It allows me to maybe get a second opinion or know that I want to go ask for a second opinion.
我们虽然很容易走向极端,但即使是像‘我每天只有二十分钟锻炼’这样的小事也一样。
It also we go very fast to these extreme places, but just to even things like, hey, I've got twenty minutes a day to exercise.
我知道我患有1型糖尿病。
I know I'm suffering from type one diabetes.
那我二十分钟能做些什么呢?
What what could I do in twenty minutes?
或者我女儿在饮食上有一些特殊的问题。
Or my daughter has an interesting issue with the food she eats.
以前去餐厅是件特别令人沮丧的事,因为我们得向服务员问一大堆问题。
And so it used to be a super frustrating thing to go to a restaurant even because we'd have to almost ask the server so many questions.
现在我们可以拍下菜单,聊天助手会推荐最适合她点的菜品。
And now we can photograph a menu, chat suggests what are likely the best dishes for her to order.
然后我们可以进行更有成效的对话,更好地决定哪些食物适合她。
And then we can have a bit more of a of a tercer conversation, but a bit more productive on what's going to work.
它彻底改变了我们对饮食的看法,让我们不再只关注食物本身,而是更注重为什么我们要一起外出用餐。
And it has just changed how we think about just eating, takes it away from all about the food to why we're going out for dinner together.
所以我认为,这些都是健康领域的一些典型案例。
And so I think they are all these just examples of something like health.
这种事情已经在发生,并且会变得越来越好。
It's already happening and it's going to keep getting better and better.
至于法诺瓦特的观点,我认为监管环境必须跟上步伐。
And then to Fanowat's point, I think regulatory environment is going to have to catch up.
无论你处于什么样的医疗体系下,医疗成本的增长速度都已超过每个国家的GDP。
It's no matter what kind of system you're under, the cost of medical care is exceeding the GDP of every country, the rate at which increases.
我们似乎急需人工智能,现在就需要它,而且它确实能提供帮助,正如你所指出的,这是医疗智能成本首次出现逐年下降的情况。
And it seems like we needed AI, we needed it now and, you know, it can be helpful and as you pointed out, it's the first time the cost of medical intelligence has dropped year over year.
但这带来了对计算资源的巨大需求,我们还有更多问题希望得到解答。
But that comes with a lot of demand for compute and we have a lot more questions, you know, that we want to have answered.
当然,人们都能看到对更多计算资源的需求,但OpenAI在计算资源上的投入规模和范围实在惊人。
And certainly people can see the need for more compute but the scale and scope at which OpenAI is investing in compute is incredibly huge.
你知道吗,我们谈论的这些数字真的很难想象。
You know, we're talking, you know, numbers that are just really hard to fathom.
OpenAI是如何确定这种需求的?
How does OpenAI determine that need?
你们会看哪些指标来判断,是的,我们需要花这么多钱?
You know, what are the metrics you're looking at to think that like, yes, we need to spend this much?
首先,我们努力确保在算力上的投入与我们的收入增速保持一致。
So first of all, we are trying to make sure we stay investing in compute to match the pace of our revenue.
我们发现,当期算力与当期收入之间存在很强的相关性。
And we've seen a really strong correlation between in period compute, in period revenue.
我举个例子。
I'll give you an example.
如果你回看2023年、2024年和2025年,我们的算力从200兆瓦增长到去年底的2吉瓦。
If you just go back in '23, '24 and '25, our compute was 200 megawatts, 600 megawatts when it ended last year at two gigawatts.
与此对应的是,数字非常吻合,我们在2023年底的年经常性收入(ARR)为20亿美元。
Against that, and it's really easy because the numbers match up, we exited '23 at 2,000,000,000 in ARR.
所以是200兆瓦,20亿美元。
So 200 megawatts, 2,000,000,000.
我们在2024年结束时达到了60亿美元。
We exited '24 at 6,000,000,000.
所以是60亿美元,600兆瓦。
So 6,000,000,000, 600 megawatts.
去年我们结束时营收略高于200亿美元,200亿美元,对应两吉瓦。
And we exited last year a little over 20,000,000,000, 20,000,000,000, two gigawatts.
实际上,增长速度正在加快。
Actually, it's been accelerating.
所以即使你看这条线的斜率,也能看出更多的算力带来更多的收入。
So that's just even if you look at the slope of the line, it says more compute, more revenue.
当然,这里存在时间上的不匹配,因为我今天就必须做出决策,确保我们不仅为2026年或2027年,而是为2028、2029、2030年准备好算力。
Now there is definitely a timing mismatch because I have to make decisions today about making sure we have compute in not even '26 or '27, but '28, 2930.
因为如果今天我不下订单,不发出建设数据中心的信号,到时候就来不及了。
Because if I don't put in orders today and don't give the signal to create data centers, it won't be there.
今天,我们在算力方面感到绝对受限。
Today we feel absolutely constrained on compute.
如果我们今天拥有更多算力,我们可以推出更多产品,训练更多模型,探索更多多模态应用。
There are many more products that we could launch, many more models that we would train, many more multimodality things we would explore if we had more compute today.
所以,例如在去年,全球整体硬件投资增长了大约2200亿美元,这只是实际支出的增长幅度。
So for example, even in the last year, I think the overall hardware investments globally has gone up by something like $220,000,000,000 That's just how much actual spending has gone up.
如果你看芯片领域,芯片预测增长也类似,达到了约3340亿美元。
If you look at chips, chip forecasts have gone up similarly about $334,000,000,000.
这不仅仅是OpenAI的问题。
So it's not just OpenAI.
整个环境发出的信号是:人工智能是真实的。
The signal from the whole environment is AI is real.
我们正处于一场范式转变之中。
We are in a paradigm shift.
我们需要投资,以提供人们实现我们刚才所谈所有事情所需的智能。
We need to invest to give people the intelligence they need to do all the things we just talked about, for example.
因此,在OpenAI内部,我们花了很多时间深入研究消费者、企业及开发者层面的需求信号。
So back inside of OpenAI, we do spend a lot of time going very deep on what is our demand signal in consumer and enterprise and developers.
我们首先从基础设施层出发,思考如何构建一个最基础的拼图,以实现最大的灵活性。
We think about what's the mosaic first at the base, like on an infrastructure layer, how do we create max optionality?
因此,我们希望实现多云、多芯片架构,这为我们提供了基础设施层面的有趣优势。
So we want to be multi cloud, multi chip, and that gives us an interesting layer at the infrastructure layer.
再往上一层,是产品层面。
One tick up at the product layer.
我们还希望变得更加多维。
We also want to become more multidimensional.
过去我们只有一个产品:ChatGPT。
So we used to just be one product chat GPT.
如今,我们为消费者提供了集成了各种功能的ChatGPT,涵盖医疗健康等领域。
Today we are chat GPT for consumer with all of the blades inside it, health care and so on.
面向工作的ChatGPT。
Chat GPT for work.
但我们还有一个新的平台Sora。
But we also have Sora as a new platform.
我们还有一些变革性的研究项目。
We have some of our transformational research projects.
再往上一层。
One tick up.
我们还有一个正在变得越来越多元化的商业模式生态系统。
We also then have a business model ecosystem that's becoming much more multidimensional.
最初我们只有一个订阅服务,因为我们推出了ChatGPT,需要一种方式来支付计算成本。
Began with a single subscription because we'd launched ChatGPT and we needed a way to pay for the compute.
我们现在有多个价格
We now have multiple price
顺便说一下,ChatGPT的订阅用户。
ChatGPT subscriber, by the way.
我
I
谢谢你这么做。
love you for that.
订阅服务。
Subscriptions.
我们进入了企业市场,采用了基于SaaS的定价模式。
We went to the enterprise and had SaaS based pricing.
现在,对于那些高价值场景,我们采用了基于积分的定价方式。
We have credit based pricing now for places where the high value is being, found.
人们愿意多付钱来获得更多的服务。
People want to pay more to get more.
我们开始考虑诸如电商和广告之类的事情。
We're beginning to think about things like commerce and ads.
当然,从长远来看,我喜欢像许可模式这样的方式——比如在药物研发中,如果我们授权我们的技术,而你取得了突破,药物大获成功,我们就能从其全部销售额中获得一部分许可收益。
And then of course, longer term, I like models like, for example, would we do, licensing models to really align let's say in drug discovery, if we licensed our technology, you have a breakthrough, that drug takes off and we get a licensed portion of all its sales.
这对我们和客户来说都是绝佳的协同方式。
It's great alignment for us with our customer.
所以,如果你思考一下这三个层级,我实际上把它想象成一个魔方。
So kind of if you think about those three tiers, I actually think of it like a Rubik's cube.
好的。
Okay.
我们从单一的模块——比如一个云服务提供商、一个芯片、一个产品、一种商业模式——发展到了现在这个三维的立方体。
So we went from a single block, you know, one CSP, Microsoft, one chip, one product, one business model to now a whole three-dimensional cube.
我喜欢魔方的一点是,虽然我不确定数字是否完全准确,但我记得它有大约430亿亿种不同的状态。
And one of the things I love about a Rubik's cube, I'm probably not getting the number exactly right, but I think it has 43 quintillion different states it can be in.
当我
It always blew my mind when I
在
was in
现在就想象一下这个立方体在旋转。
so now just think about that cube spinning.
因此,当我们选择一个低延迟芯片,配合编码速度达到人们预期五倍的功能时,我们可以为此收取高端订阅费用。
So we pick a low latency chip going alongside something like coding that's five x the pace that people expect, we can charge a high end subscription for that.
所以这就像你把魔方对齐,让一面呈现出三种颜色。
So it's almost like you line up the cube and you get three colors on one side.
我们可以再次转动魔方,比如使用低延迟芯片、更快的图像生成,吸引更多免费用户,但这会为最终可能的广告平台创造更多库存。
We could spin the cube again and say low latency chip, faster image gen, more free users come in, but that creates more inventory for ultimately perhaps an ads platform.
因此,你可以看到过去十二个月的目标一直是创造越来越多的战略选择,让我能够持续支付所需的计算资源,以真正实现造福人类的AGI使命。
So you can start to see how the goal in the last twelve months has been creating more and more strategic options that allow me to keep paying for the compute we need to really achieve our mission, AGI for the benefit of humanity.
所以,简化来说,需求的限制并非来自其他任何因素,而是取决于计算资源的可用性,无论是Sora还是更广泛的意义上。
So, you know, the way to simplify that is demand is limited not by anything other than availability of compute Whether it's Sora or more broadly.
然后是价格弹性,计算资源的需求是无限的。
And then there's price elasticity where demand is infinite for compute.
所以我认为这就是看待这个问题的方式。
So I think that's the way to think about it.
是的。
Yeah.
我们甚至还没有开始利用价格弹性的杠杆。
It just, we haven't even started to exercise the price elasticity lever.
我们根本无法满足需求。
It's just we can't fulfill demand.
没错。
Right.
这受到算力的限制。
And it's limited by compute.
所以所有谈论泡沫之类的人,我认为都走错了方向。
So all the people talking about bubbles and things, I think are on the wrong track.
他们完全不了解这一变革的规模,以及对API调用需求弹性的巨大需求。
They have no sense of how large this changes and how much more demand elasticity there's a need for API calls.
作为OpenEye最早的投资者之一,你早早地押注了,你看到了这个方向的发展,但你也经历过互联网泡沫。
As one of OpenEye's earliest investors, you made a bet early on, you saw where this was headed, but you've saw the .com bubble.
你目睹了当时发生的一切,但也见证了移动革命。
You watched what happened there, but you've also seen other things, the mobile revolution.
你已经看到其他领域也发生过类似的情况。
You've seen this happen with other areas.
你提到了‘广泛’这个词,这是否就是你信心的来源?
And you mentioned the term broad and is that sort of where your conviction comes from?
只是因为它涉及了这么多不同的领域吗?
It's just how many different areas it touches?
是的。
Yeah.
当我们投资时,我们只有一个简单的衡量标准。
Look, when we invested, we had one simple metric.
当时没有可参考的预测,没有产品计划,也没有ChatGPT可以参考。
There was no projections to look at, no product plans to look at, no chat GPT to look at.
很简单,如果我们能开发出接近甚至超越人类智能的东西,其影响将是巨大的。
It was very simply the idea, if we develop anywhere near close to human intelligence, let alone supersede human intelligence, its impact is going to be huge.
所以我们采取了一种‘婴儿学步’的方式:如果成功,后果将极其深远,那为什么不试试呢?
So it was this hand baby approach, like the consequences of success are really going to be consequential, so why not try that?
还有一种有趣的说法叫‘泡沫’。
There's also this funny notion of bubble.
人们把泡沫等同于股价,而这与投资者的恐惧和贪婪毫无关系。
People equate bubble to stock prices, which has nothing to do with anything other than fear and greed among investors.
所以我总是认为,泡沫应该通过API调用次数来衡量。
So I always look at bubbles should be measured by the number of API calls.
或者在人们所提及的互联网泡沫中,应该看的是互联网流量,而不是股价的涨跌,因为有人过度兴奋或过度悲观。
Or in the .com bubble, which people refer to, it should be amount of internet traffic, not by what happened to stock prices because somebody got overexcited or underexcited.
而在一天之内,他们可能从热爱英伟达突然转为憎恨英伟达,只因为觉得它被高估了。
And in one day they can go from loving Nvidia to hating NVIDIA because it's overvalued.
这些剧烈波动并不是现实。
Those gyrations aren't reality.
真正的现实是底层的API调用次数。
The reality is the underlying number of API calls.
如果你查看互联网泡沫时期的互联网流量,股价可能剧烈上涨又剧烈下跌。
If you look at internet traffic during the .com bubble, prices may have gone up violently and gone down violently.
但在互联网流量中,并未发现泡沫的迹象。
There's no bubble detected in internet traffic.
我几乎可以肯定,你在API调用数量上看不到泡沫。
I would almost guarantee you, you won't see the bubble in number of API calls.
如果你把API调用作为衡量AI真正用途、AI价值和AI需求的基本指标,那么你在API调用上根本看不到泡沫。
And if that's your fundamental metric of what's the real use of your AI, usefulness of AI, demand for AI, you're not going to see a bubble in API calls.
华尔街通常怎么对待它,我其实并不关心。
What Wall Street tends to do with it, I don't really care.
我觉得这基本无关紧要。
I think it's mostly irrelevant.
这对新闻报道来说很有用,因为媒体需要填满版面,但这并不是现实。
Great for press articles because press has to fill their column inches, but it's not reality.
所以事物的价格,或者股票价格、私营公司估值,都不是现实。
So prices of things aren't reality, or stock prices, private company valuations.
现实是AI的实际需求,也就是API调用的数量。
The reality is what's the actual demand for AI, which is the number of API calls.
对。
Right.
如果我回想起1999年那个时刻,当时人们从互联网中获得的价值其实非常有限,因为互联网还太年轻、太初期,你根本看不出它如何真正改变人们的生活。
And I think if I hark back to that moment where you were looking at 1999, the value people were getting from the Internet at the time was actually very it was so young, so nascent that you couldn't really see how it was changing their lives.
我认为,对于人工智能来说,这种变化发生得实在太快了。
I do think that with AI, it's happened so fast.
是的。
Yeah.
这种变化是真实存在的。
That change, it's very real.
比如,作为一个CFO,别说是OpenAI的CFO,单就作为一个CFO而言,我在我的组织里看到的变化,确实是把那些过去我不得不不断增派人手来处理的、相当枯燥的任务取代了。
Like, as a CFO, forget about being the CFO of OpenAI, but as a CFO, what I see happening in my organization is truly taking tasks that previously I would have kept having to add more and more people doing fairly mundane things.
比如,拿收入管理来说。
Like, let's take something like revenue management.
在负责收入管理的团队中,他们每天的一项工作就是下载前一天或本周内签署的所有合同,并逐份阅读这些合同,确保其中没有隐藏任何意外的、非标准的条款。
So in in a team that does revenue management, they have one of the things they do every day is they have to download all the contracts that we signed the day before or through the week, and they have to read all of those contracts to make sure there's no terms sitting in it that are unexpected, that are effectively nonstandard terms.
因为非标准条款意味着可能需要调整收入确认方式。
Because a nonstandard term means that there could be a revenue recognition change that has to happen.
这对财务团队来说是非常重要的事情。
And that's a very big deal for a finance team.
这通常是审计师来审计你的首要事项。
That's the number one thing usually your auditors come in to audit you on.
我们增长的速度,没错,每天的合同数量正在成倍增加。
The pace at which we are growing, right, that number of contracts every day is going up in multiples.
所以在没有AI的时代,我唯一的选择就是雇更多人。
So my only choice in a pre AI world would have been hire more people.
想象一下那些人的工作是什么样子。
Imagine what those people's jobs are like.
你每天上班,读一份合同,然后读下一份,再下一份。
You come to work every day and you read a contract and then you read the next one and the next one.
这太枯燥乏味了。
It is so mundane and such drudgery.
这并不是人们上学学习会计或立志成为财务专业人士的原因,但我们却把这种工作当作入门级职位交给他们。
It's not why people went to school and learned about the accounting field or thought about being a finance professional, but that's kind of the job we hand them as an entry level job.
今天,借助我们在OpenAI内部开发的工具,我可以在一夜之间将所有这些合同从系统中提取出来。
Today, using our own tools here at OpenAI, I now have overnight, all of those contracts are pulled out of a system.
它们被导入到一个表格数据库中,对我们来说就是Databricks数据库。
They are put into a tabular database, the Databricks database in our case.
这个智能代理能够逐一分析这些合同。
The the agent or the intelligence is able to go through.
它能准确告诉我哪些条款是非标准的,以及原因是什么。
It shows me exactly what is nonstandard and why.
它不仅会建议相应的收入确认方式,还会提供深入洞察,比如:这个条款真的有必要存在吗?
It suggests what therefore the rev rec is, but it also suggests the insight, which is, you know, should this term even be here?
销售代表是不是擅自让步了本不该让步的内容?
Did the salesperson just give away something they shouldn't have?
如果是这样,我就会去指导他们。
In which case, you know, I go and I coach them.
它是否实际上在向我揭示我们业务中正在发生的变化?
Is it actually telling me something about my business that's starting to shift?
在这种情况下,这个非标准条款实际上应该成为标准条款。
In which case, this nonstandard term is actually should become a standard term.
而我实际感受到的是我的商业模式正在发生变化,这可能是一件好事。
And I'm actually what I'm experiencing is a shift in my business model, which might actually be a good thing.
或者,也许我想找到一种不同的方式,既帮助客户获得他们想要的,也帮助销售人员实现他们的目标,同时保持我的收入确认和现有商业模式。
Or perhaps I wanna find a different way to help get the customer what they're looking for, the salesperson what they're looking for, but maintain my revenue recognition, my current business model.
对吧?
Right?
我知道我那些资历较浅的初级人员在讨论的右侧,他们正在重新找到自己热爱的工作。
So I know my more junior entry level people are over on the right of that discussion, and they're kind of refinding the job they loved.
对我来说,这正是它不是一个泡沫的原因,因为价值是真实且切实的。
That to me is why it's not a bubble because the value is real and tangible.
这意味着我可能只需要一个更小的团队。
I get also means I probably can have a smaller team.
我可以拥有一个绩效更高、士气更旺盛、留存率更高的团队。
I can have a much more high performing team, a much higher morale on my team, better retention rates.
对吧?
Right?
所有这些我都可以量化,说明我的业务现在更健康了。
All of these I can put into like numbers to say my business is now healthier.
我认为,当媒体试图主导关于泡沫的讨论时,他们忽略了这一点。
I think that's the piece when the press is trying to lead with the bubble conversation or whatever.
他们只是没注意到,我们目前的投资是基于需求的,甚至可以说是跟随需求。
They just miss that we are investing with demand, if anything behind demand at the moment.
对我来说,泡沫意味着你是在需求之前投资,最终会出现缺口。
A bubble to me suggests you're investing ahead of demand and there's going to be a gap.
如果你看一下生产率数据,那些正在采用人工智能的公司,尤其是新兴的科技公司,生产率正在上升。
And you look at productivity numbers, they're going up in the companies that are adapting AI, especially the newer set of tech oriented companies.
这些数据简直惊人。
The numbers are just absolutely amazing.
我最喜欢的一个例子是一家叫Slash的小公司,年经常性收入约为1.5亿美元。
So one of my favorites is a little company called Slash, about 150,000,000 ARR.
他们只有一个会计人员,仅有一位财务主管,因为他们采用了人工智能导向的ERP系统。
They have one person in accounting, only a controller, because they adapted an AI oriented ERP system.
他们用这个系统替换了NetSuite。
They replaced NetSuite with it.
他们能做到的事情简直令人惊叹。
And it's just amazing what they can do.
公司的首席执行官向我道歉,说他可能不得不雇用第二个人。
And the CEO was apologizing to me he might have to hire a second person.
而且他们的进展非常迅速。
And they're moving really rapidly.
我刚看到一个故事,有人用一个SDR加上AI取代了十个SDR,剩下的那名SDR主要负责监督AI的工作。
I just saw a story, somebody replaced 10 SDRs with one SDR and AI essentially that the one SDR remaining supervises.
我听到了两个故事,说的是公司不再招聘那些不直接推动增长的岗位人员,现在招聘时,他们更倾向于雇佣那些曾为公司创造过显著增长的人。
I've been hearing two stories about where instead of hiring somebody that's in an area that doesn't create growth, they can now then when they hire, hire people that have created a lot more growth for the company.
这就是为什么你看到这么多科技公司能够如此迅速地扩张。
And that's why you're seeing a lot of these tech companies just build so fast.
你知道,那句老话,未来已经到来,只是分布不均。
You know, that old phrase, the future is here now, but it's not evenly distributed.
是的。
Yes.
我看到所有这些单一环节带来了巨大的生产力提升、效率提升或敏捷性提升,能够更快地行动。
I see all these single points of huge productivity gains and efficiency gains or agility gains, the ability to move faster.
但世界上,无论是美国还是全球,只有极少数人采用了这些技术,甚至都不知道它们的存在。
But very small percentage of the people in the world, in The US or worldwide, have adapted these or even know they exist.
对。
Right.
所以回到需求问题,我认为这些理念,这些例子,最终会随着时间推移普及到每个人。
And so this issue back to demand, I think these ideas, some of these examples will spread to everybody over time.
你会看到这些技术的采用呈指数级增长。
And you'll see an exponential growth of adoption of these technologies.
这就是为什么我不认为需求是个问题。
That's why I don't think demand is the question.
是的,这个笔记完全切中要点。
Yeah, the note is absolutely spot on.
我认为麦肯锡做过一项研究,显示那些处于顶尖四分位的公司,其生产力——无论用哪种财务指标衡量——都提升了百分之二十七到三十三。
I think McKinsey did a study that showed for companies that are more in the top quartile, their productivity as measured by any kind of financial metric you would pull is up you know, in the twenty seven to thirty three percent.
这可是个非常显著的提升。
Like, that's a really meaningful jump.
我想你想要表达的是,这并不意味着整体员工数量减少。
I think where you were going is it doesn't just mean fewer employees overall.
确实存在将人员转向更具增长性岗位的空间。
There's definitely a place to kind of shift people over into more growth oriented jobs.
这个周末我徒步时遇到一个人,他经营着一家你们都熟知的大型咨询公司。
I was hiking this weekend with someone who runs a very large consulting company that you all would know of.
他谈到自己公司,或者说他更关注的后台系统。
And he was talking about how his and his what he thinks of more his back end systems.
那里的负责人现在把她的团队描述为‘人加智能代理’。
The leader there is now talking about her organization as people plus agents.
她的人机比例是1比5,即一个人对应五个智能代理。
And she has a one to five ratio, one person to five agents.
但在前端,他们实际上正在重新招聘以扩大规模,因为客户现在需要更多帮助来思考如何部署人工智能。
But on the front end, they're actually back out rehiring to grow because clients need more help now to think about deploying AI.
所以,我认为这实际上是在回归到人们真正想做的工作,而不是那些仅仅因为世界信息量过大、人们不得不处理这些信息而被迫接受的工作。
So it's actually shifting back, I would say, to the jobs people want to do, not the jobs that maybe were just open to them because more and more of the world had become this kind of, you know, so much information that people were parsing it.
现在,我们终于回到了机器和智能代理来处理这些信息的时代。
Now we're finally back to a machine and agent intelligence parsing it.
我想再回到消费者层面谈谈。
I want to I want to touch back on the consumer side.
你提到了广告,而我们之前提出的观点是,通过广告可以增加对用户的益处。
You mentioned ads and certainly the argument we made that with ads you can increase the benefits to people.
你可以提供更多的服务、更多的AI功能,帮助支付计算成本,用户在这些高级套餐中也能获得更多的价值。
You can provide more services, more AI, you can help pay for the compute and people get more out of those tiers with that.
但这引出了一个信任问题:当人们最初想到人工智能并提出问题时,他们会担心聊天机器人会如何处理他们的信息。
But that brings up the question though of trust and when people think about AI initially even asking questions, people worried about what does chat should be to do with my information.
一旦引入广告,人们就会担心,因为这通常会引发一个大问题:广告如何影响产品的其他部分和整个组织?
Once you have ads in play, people worry about that because it's often just a big question of how does that affect the rest of the product and the org?
是的。
Yeah.
所以我认为你从正确的角度切入了:如今,我们95%的用户在消费者端免费使用我们的平台。
So I think you started in the right place, which is today 95% of our users use our platform for free on the consumer side.
而这正是我们的使命所在。
And that's absolutely where our mission is.
对吧?
Right?
AJI的目标是为全人类服务,而不是只为付得起钱的人服务。
AJI, for the benefit of humanity, not the benefit of humanity who can pay.
对吧?
Right?
因此,可及性非常重要。
So access is very important.
从广告的角度来看,我认为最重要的是,我们必须确保每个人都明白,你始终会得到模型能提供的最佳答案,而不是付费得到的答案。
From an ads perspective, I think number one, we have to just make sure everyone understands you're always going to get the best answer the model can provide you, not the paid for answer.
我认为其他平台在这方面已经倒退了,你无法确定这个链接是赞助的,还是真正最好的结果。
And I think other platforms have fallen back into that where you're not sure, this is a sponsored link or is this truly the best outcome.
我们有一个北极星目标,那就是模型始终会给你最佳答案。
We have a North Star, which is that the model will always give you the best answer.
我认为第二点需要理解的是,广告其实可以带来很多价值。
I think the second thing to understand is that there can be a lot of utility in ads.
所以我们希望确保人们清楚地知道什么时候他们正在与广告互动。
So we want to make sure people know when it is an ad that they're working with.
但例如,如果我搜索一个周末度假目的地,比如我最喜欢的圣地亚哥,Airbnb的广告实际上可能会非常有帮助。
But for example, if I do a search for a weekend getaway to pick your favorite city, I don't know, San Diego, an ad for Airbnb might actually be very helpful.
在这种情况下,你甚至可能想在ChatGPT这样的丰富对话环境中与广告或广告主进行交流,但你必须清楚这属于广告场景。
And you might even want to have a discussion with the ad or with the advertiser in that case in a chat GPT setting that's very rich, but you're clear that it's in an advertising setting.
我认为,这正是需要更多创新的地方——要创造出真正契合平台特性的广告形式,而不是简单照搬传统的横幅广告等旧模式。
And I think this is where there's there has to be more innovation on what feels endemic to the platform, not just kind of the old world of stick, you know, banner ads on things.
展开剩余字幕(还有 227 条)
对我来说,第三点也是最后一点是,必须始终存在一个没有广告的层级。
And I think the third and final thing for me is, again, there always has to be a tier where advertising doesn't exist.
所以我们给用户一些选择和控制权。
So we give the user some choice and some control.
但我们非常重视您的数据。
But we're very mindful of your data.
当我们推出健康功能时,我们明确表示您的数据是单独存放的。
When we released health, we were very clear your data is off to one side.
它不会被用于训练等用途。
It's not being used to train on and so on.
我认为我们只需持续给予用户这种信任感,因为信任对OpenAI至关重要,即使在广告等问题上,我们也会坚持这些原则。
I think we just need to keep giving users that kind of, that trust is everything for OpenAI and that we're going to stand by those principles even when it comes to things like ads.
在消费者层面,未来会不会是一个你需要订阅多种AI服务的世界?
On the consumer side, is it going to be a world where you're going have
会有大量针对不同AI服务的订阅吗?
a lot of subscriptions to different AI services?
我认为你会拥有每一个模型。
I think you'll have every model.
大多数人会订阅多个服务。
Most people will have more than one subscription.
媒体是一个很好的例子。
Media is a good example.
大多数人在媒体领域都会订阅多个服务。
Most people have more than one subscription in media.
因此,这是一个反映消费者行为的很好参照。
And so that's a good proxy for consumer behavior.
不同的人会选择不同的选项,包括免费选项,也就是广告支持的媒体。
Different people will pick different choices, including free choices, which is ad supported media, too.
所以即使是相同的服务,你也可以选择付费或免费使用。
So even the same services you can get for pay or for free.
我认为你会看到广泛的多样性。
I think you'd see a wide range of diversity.
不过,你怎么看待切换到另一个平台的费用呢?
How do you think about, though, the expense of going to a different platform?
我喜欢ChatGPT的记忆功能。
So I like ChatGPT memory.
我发现它越来越有帮助了,因为当我问一个问题时,它会记得我们几周前、甚至几个月前聊过的内容,这种功能现在还没广泛普及,但对我来说,它已经成了我每天早晨醒来后的第一件事。
I'm finding it more and more helpful because as I ask about one thing, remembers something we talked about maybe weeks ago, months ago, pulse, which is today not widely distributed, but it's the more it's the way I wake up in the morning now.
这太棒了。
It's so amazing.
当你把它和日历之类的东西连接起来时,它就不仅仅是说你对AI数据中心非常感兴趣——显然它一定觉得我是地球上最无聊的人,因为我总是看到这些内容。
And when you start connecting it to things like your calendar, so it's not just saying, you you say are very interested in AI data centers, which clearly it must think I'm the most boring person on earth because this what I see a lot of.
但它还会提醒你:嘿,你的日历显示你今天要和Node见面。
But it also says, hey, on your calendar, you're going to be sitting down with the node today.
记住这些事情。
You know, remember a couple of these things.
很有帮助。
Helpful.
但如果我同时使用多个平台,我就失去了这种优势,这和我订阅《华尔街日报》、《经济学人》和《纽约时报》是不一样的。
But if I am multi homing, I'm losing the benefit, which is not the same as if I subscribe to The Wall Street Journal, The Economist in The New York Times.
我到其他地方阅读时,它们并没有真正受损,或者说我也没有损失什么。
They're not really losing out if I go read in other places in the same way, or I'm not losing out.
是的。
Yeah.
所以我认为记忆是一个重要的问题,是否每个设备都只有一个记忆,或者每个设备有多个记忆。
So I do think memory is an important question, whether there'll be one per wear or more than one per wear of the models.
在每个模型上,都会有多个服务提供不同的权衡。
On each model, there'll be multiple services that may offer different trade offs.
即使你谈论的是医疗或媒体领域,在OpenAI的模型上,也有许多人提供不同的服务。
So even whether you're talking health or media, even on the OpenAI models, there's multiple people providing services.
这就是我所说的多平台使用。
So that's what I was thinking of multi homing.
但显然,我不认为OpenAI会占据100%的市场。
But obviously I don't think OpenAI will be 100% of the market.
我希望如此。
I hope so.
我也希望如此。
I'm going say, I hope so too.
我对此没有意见。
I'm okay with that.
这是一个有趣的商业模式。
It's an interesting business model.
我觉得人们很难理解,因为奈飞是个很棒的公司,但世界上人们能看奈飞的时间毕竟有限,对吧?
Think it's hard for people to wrap their heads around because like Netflix is a great company but there's only so many hours on the planet that people can watch Netflix, right?
移动设备也很棒,对吧?
And mobile's great, right?
我每周需要的移动使用时间也就那么多,不管怎样。
I only need so many minutes of mobile per week or whatever to do that.
在人工智能和智能方面,你可以拥有更多的智能。
With AI and intelligence, you can have more intelligence.
我可以购买更多服务,获得更好的答案,做这些事情,但我还在努力理解这些会走向何方。
I can buy more and get better answers and do this and I think that's, I think I'm still trying to wrap my head around about where where that goes.
这个想法是,你一开始使用免费版本,可以免费使用。
The idea that like you start at like, you know, one level of free, use it for free.
然后你升级到一个更小的付费层级,随着它变得更有用,你开始逐步提升。
Then you go to a smaller tier and then as it becomes more useful, you start increasing that.
这会走向哪里呢?
Where does it go?
我认为这和Netflix不一样,Netflix每天播放那么多小时,我觉得它更像基础设施,比如电力。
So I think unlike something like Netflix where they're running so many hours in the day, I think of it much more like infrastructure, like electricity.
你一天用多少电?
How much electricity do you use in the day?
我不知道。
I don't know.
今天我走进一个房间,发现里面有个风扇在吹。
I walked into a room today and there was a fan blowing.
它真的很舒服。
It was really nice.
它让房间凉快了。
It cooled it down.
我们现在周围都是亮着的灯。
There are lights on around us right now.
我昨晚给手机充电了,它一整天都正常工作。
Either there's so many I I charged my phone overnight and it worked for me all day.
所以我认为,我们今天所处的状态是更多地依赖调用聊天GPT,而不是让智能直接内嵌在系统里。
So I think that the state we live in today is much more I call on chat, GPTI, invoke it as opposed to intelligence just being baked in.
我觉得,这将在未来几年带来巨大的变化。
Like, think this will be the big change over the next couple of years.
你会回过头来看,觉得我们以前做的那些事简直像玩具一样。
You'll kind of look back almost it'll feel a little toy like that we used to do this thing.
而如今,智能已经无处不在。
And instead, it just is everywhere around us.
所以它并没有直接回答你提出的问题,但我的意思是,我不会太纠结于人每天只有那么多小时来做事情,因为我感觉生活中几乎每件事都需要智慧,而我希望自己头脑里能带着一些智慧。
And so it's not really quite asking answering the question you're asking, but it's that I don't get so caught up that there's only so many hours for people to do things because I feel like almost everything I do in life requires intelligence because I'm walking around hopefully with some intelligence up here.
如果我能获得这种增强,我认为这会让我们感到惊喜。
And if I can get that augmented, I think it's going to surprise us.
就像我们刚开始聊天前谈到的,你提到在手机上突然发现有了手电筒和相机。
Like as we were talking before we got started, you said about on your phone when you suddenly discovered you had a flashlight and a camera.
就像你说的,这太明显了。
Is like you say that I'm so obvious.
但每次我用聊天工具发现一个几乎有点可爱的应用场景时,我都感到非常震撼。
And yet with Chatty, time I discover kind of a what feels like almost a slightly cute use case, I'm so blown away by it.
昨天早上,我非常喜欢《经济学人》。
Yesterday morning, I do love The Economist.
我想读一下社论。
I wanted to read the editorial.
但由于要跑上楼准备,我其实没太多时间。
I didn't really have a ton of time because of running upstairs to get ready.
所以我拍下了这篇社论,因为它们写得非常好。
So I took a photograph of the editorial because they're very good.
它们把内容浓缩在一页上,我让Chatty读给我听。
They put it on one page and I asked Chatty to read it to me.
它真的做到了。
And it did it.
我当时想,天哪,这太棒了。
And I was like, Oh my God, this is awesome.
所以我只是觉得,我们才刚刚开始遇到这些时刻。
So I just think there are all these moments where we're just getting started.
而多模态,我认为可能是最重要的,因为手机教会了我们用拇指交流。
And multimodal, I think, is probably the biggest because phones taught us to talk with our thumbs.
我认为我们正进入的新世界,将会出现新的硬件,真正帮助我们意识到:我们可以说话、倾听、观看、书写,以非常人性化的方式完成这一切,而我们才刚刚触及皮毛。
And I think this new world we're moving into, there's going to be new hardware that just really help us understand that we can talk, we can listen, we can see, we can write and do all of these things in a very human way that we're just scratching the surface of.
让我换一个角度来说明这一点。
Let me give you a different frame on that.
我同意所有这些观点。
I agree with all of that.
如果你回顾我们之前讨论的互联网以及与之相关的泡沫,互联网的作用是让你接触到更多内容,无论是媒体、YouTube视频、TikTok,还是任何其他类型的信息。
If you look at what we talked about the internet earlier and the bubble associated with it, but what the internet did is give you access to a lot more stuff, whether it was media, YouTube videos, or TikTok, or you name it, information of any sort.
但它扩展到了一个程度,以至于没有任何人能完全利用互联网。
But it expanded it to the point where no human can actually use the internet fully.
我认为人工智能的作用是,考虑到你每天只有大约八千小时,其中一部分还要用于睡觉,它会让你的时间变得高效得多。
I think of AI as given you're limited to eight thousand some hours a day, some of which is meant for sleeping, it'll make your time much more efficient.
所以互联网将你能获取的信息量爆炸式增长,多到你根本用不过来。
So the internet exploded information available to you to the point where you couldn't use it.
我认为人工智能会过滤这些信息,让你的每一小时都变得最有效率,前提是你知道怎么使用它。
And I think what AI will do is filter it to make your every hour the most effective hour, if you know how to use it.
因此,智能会将世界缩减为你个人最相关的内容。
So intelligence will reduce the world to what is most relevant to you personally.
而我的优先事项可能和莎拉的不同。
And I may have a different set of priorities than Sarah.
所以我认为,智能就是将世界总结为对我而言最相关的内容,以及对她而言最相关的内容,而这两者是不同的。
So I think of intelligence as summarizing the world to the most relevant things for me, and the most relevant things to her, which are different.
因此,我认为这正是智能大有可为的地方——当互联网让信息爆炸式增长时,智能可以用来过滤和缩减信息。
So I think that's where there's almost unlimited capacity for intelligence to be used to reduce information when the internet exploded information.
是的,没错。
Yeah, yeah.
我们已经讨论了很多关于消费者端的内容,感觉OpenAI在消费者领域已经遥遥领先。
We've talked a lot about consumer side and it feels like OpenAI is very much winning the consumer side.
有人会问到企业端,OpenAI将如何在这一领域竞争并取得胜利?
Question comes up about enterprise and how is OpenAI going to compete and win in that area?
所以我认为,我们已经在这一领域取得领先。
So I think we're already winning in this area.
我看到的情况是,90%的企业表示,他们要么已经在使用OpenAI,要么计划在未来十二个月内使用。
What I see is, you know, 90% of corporations are saying they either are using OpenAI or intend to use over the next twelve months.
对。
Right.
我认为第二点是微软正在使用我们的技术。
I think the second is Microsoft and Microsoft using our technology.
所以我认为,消费者端实际上是企业飞轮中一个非常强大的部分。
So I actually think we have this is where the consumer is a really potent part of the enterprise flywheel.
正如我之前所说,回想过去,当你刚开始把iPhone带到公司时,企业并不希望你这么做,但你发现你无法拒绝消费者偏好带来的滔天浪潮。
So as I said earlier, when someone know, you back in the day when you first started bringing your iPhone to work and corporates didn't want you to do that, you just discovered you can't say no to the tidal wave that is consumer preference.
所以我已经在使用的东西,已经装在我口袋里,带到公司了。
So something I'm already using that I've already got in my pocket and I get to work.
我的期望是,工作场景下的体验至少要一样好,甚至更好。
My expectation is work is at least as good, if not better.
因此,正是这一点推动了我们的企业业务,使我们成为历史上最快达到一百万家企业用户的平台。
And so that's what's helped drive our actual enterprise business, the fastest company ever to get to 1,000,000 businesses on a platform.
我们只用了一年半就做到了。
And we did that in about a year and a half.
但接下来该往哪里走?
But where to from here?
因为我们显然才刚刚触及表面。
Because clearly we're just scratching the surface.
所以其中一部分确实是根据客户的行业需求与他们沟通,用他们熟悉的语言交流。
So some of it is certainly meeting customers in terms of their vertical so that we talk to them in their language.
我们学会了企业销售的艺术:不要一上来就向你介绍我的所有产品,而是先理解你的问题。
And we learn this art of enterprise selling, which is let me not tell you all about my products, but let me understand your problem.
比如,董事会给董事长和首席执行官们施加了哪些压力?
Like, what is your board forcing on you, mister and missus CEO?
你的客户最想要但你却无法提供的东西是什么?
What is the thing your customers most want that you can't deliver?
好吧,让我们开始针对这些问题引入智能解决方案。
Okay, let's start putting intelligence against that.
我们可以将这种智能逐步从轻度行业专业化扩展到深度行业专业化,比如针对特定用例的RL模型。
We can then drop that down into some light vertical specialization to quite heavy vertical specialization, things like RL ing models that are very pertinent to a use case.
比如,在能源公司中,可能是深入理解某个特定油井或他们所有的地震数据,从而判断这个气田的开采潜力有多大?
Like, let's say in an energy company, it might be really understanding that particular oil well or all the seismic data they have to say, what's the recovery we're going to get out of this gas field?
这就是深度专业化。
Like that is deep specialization.
然后,我认为这会逐步演变为我们已经开始的一些重大变革性研究项目,在这些项目中,我们几乎接管了整个企业的运营,帮助他们以更智能、更快、更好的方式重新思考业务,最终推动其关键业务指标。
And then I think it gets the whole way to some of these big transformational research projects that we have begun, where we're actually almost taking over someone's whole business and helping them rethink it in a smarter, faster, better way that ultimately drives their key business metrics.
这是一段旅程。
So it's a journey.
我认为大多数企业最初都是从全面使用ChatGPT开始的。
I think most corporates have started with wall to wall chat GPT.
这是一个简单的起点。
That's an easy starting point.
他们已经进行了一些编程,在很多情况下,还进行了大量编程。
They've done some coding and in many cases, a lot of coding.
当我与企业交流时,他们的CEO们开始说,比如,我60%的生产代码都是由某个智能体构建的。
Like when I talk to corporates, they're not CEOs are starting to say things like 60% of all my production code was built by, you know, an agent.
我心想,十二个月前你甚至都不知道什么是生产代码。
I'm like, you didn't even know what, you know, production code meant twelve months ago.
但你现在说这其实是好事,因为这意味着你在跟踪它。
But now you're saying that that's good because it means you're tracking it.
但代理工具才刚刚起步,我们目前看到的只有大约14%的客户——当你对美国企业进行调查时,只有14%的企业在使用某种代理工具。
But agents, it's just starting like we only see about 14% of all kind of customers when you go out and just survey US corporates are using something agentic today, 14%.
当我向你解释我财务部门正在发生的变化时。
When I just explained what's happening in my finance organization.
所以我认为我们才刚刚开始,但我对这个机会感到无比兴奋。
So I think we are just getting going, but I couldn't be more excited about the opportunity.
这规模巨大。
It's huge.
好吧,如果我是一家初创公司,看到OpenEye所做的一切,我可能会问:还有我的容身之地吗?
Okay, but if I'm a startup and I look at everything Open Eye is doing, I might be asking, is there room for me?
我能做什么?
What do I get to do?
听好了,模型会变得越来越好,承担越来越多的任务。
Look, models will keep getting better and do more and more.
但我确实相信,在此基础上还有很多可以构建的空间。
But I do believe there's lots of room to build on top.
你知道,没有一家公司能包揽地球上所有事情。
You know, no one company can do everything on the planet.
有数十亿人在工作,他们的工作可以通过AI来协助。
There's billions of people who are working that whose job AI can help with.
我不认为OpenAI会专精于每一个领域。
I don't think OpenAI will specialize in every one of those.
所以我认为明智的做法是明确模型将走向何方,是OpenAI还是其他公司,以及它们能做什么。
So I think the careful thing to do is be clear where the models will go, OpenAI or others, and what they will be able to do.
你该如何最好地利用这一点,进而专精于更有趣的世界?
And how do you use that best to then specialize into more interesting world?
比如某种专业化方式,在基础模型之上增加额外的价值。
Like some sort of specialization where you add something that's additional to the base models.
坦率地说,智能并不是提供解决方案的唯一要素。
And frankly, just intelligence isn't the only thing to provide a solution.
除了智能之外,还有很多其他因素围绕着解决方案。
There's lots of other stuff that goes around solution beyond intelligence.
所以我认为在这些模型之上还有很多机会可以拓展。
So I think there's lots of opportunity to build on top of these models.
它们越强大,能够在此基础上增加的功能机会就会大幅增加。
And the more powerful they get, the number of opportunities to add to it dramatically increases.
你怎么看待那些已经积累了大量数据的使用场景呢?这些数据可能是由初创公司或企业聚合的,你知道,实际上当今世界95%的信息都藏在企业防火墙、大学防火墙之后。
How do you think about so I think a lot about use cases where there's already a lot of data that's being aggregated perhaps by that startup, by that company that, you know, today, I think 95% of the world's information actually sits behind corporate firewalls, university firewalls and so on.
所以尽管我们谈论的是已经进行的海量训练,但其实我们才刚刚开始。
So there's even though we talk about the vast training that's occurred, again, we're just getting going.
但我认为,那些已经建立起业务并聚合了这些数据的公司,拥有对这些数据的访问权。
But I think companies that have already built businesses that have aggregated that data have access to it.
而且它们还在这些数据之上管理着复杂的流程。
And then on top of that have managed complex workflows.
所以我经常举的例子是我们的采购系统,采购系统本身并不复杂。
So I often give the example of our procurement system procurement system per se, not that complicated.
但它非常擅长理解诸如授权委托之类的事情。
But what it does very well is it understands things like delegation of authority.
因此它知道董事会在审批限额方面批准了哪些内容。
So it knows what the board has approved in terms of approval limits.
所以它知道当这份软件合同到来时,金额超过了某个阈值。
So it knows that when this software contract comes in, it's x over x amount.
只有我才能批准,或者如果金额低于这个阈值,但我知道副总裁可以批准。
So only I can approve it or if it's beneath that, but it knows a VP can approve it.
它不知道Andrew是副总裁,但它知道要去HRS系统查询他的职级。
It doesn't know that Andrew's a VP, but it knows to touch the HRS system and check what's his level.
因此,整个采购流程可以以一种确保合规性和治理的方式进行, hopefully 让整个公司运行得更快。
And so the whole procurement flow can happen in a way where I have compliance and governance and hopefully makes just the whole company run faster.
这些正是我对初创公司感兴趣的地方。
Those are places I get interested for startups.
那么,你在哪些地方拥有独特数据和复杂工作流程的访问权限?
So where have you got access to unique data with a complex workflow?
感觉在这方面有一道更宽的护城河,我们希望与你携手合作。
It feels like there's more of a moat around that, that we want to work alongside you.
但通用模型本身并不能完成所有这些工作。
But the general purpose model is not going to do all of that itself.
是的,我完全认同这一点。
Yeah, no, I completely buy that.
这里有很多机会。
There's lots of opportunity.
我见过不少初创公司专注于权限管理,比如谁可以访问哪些信息。
I've seen quite a few startups around just permissioning around Like who can do access to what information.
例如,我看到很多初创公司根据每家公司的历史和优先事项来定制模型。
For example, I've seen a whole bunch of startups around customizing to each company the models for their history and their priorities.
还有智能体的身份方面,我认为我们才刚刚开始理解智能体之间相互对话可能带来的风险,以及如何进行权限管理,进而开始思考智能体商业的复杂性,这种即将到来的复杂性也非常巨大。
And the agent, the whole identity side of agents, I think we're just starting to understand both the risk that can happen when you have agents talking to agents, talking to agents, but then also how are you going to permission that and then start to think about it like agentic commerce, like the the the complexity that's coming is also quite big.
所以,如果说初创公司已经没有更多机会了,我认为这可能是有史以来最有趣、最令人兴奋的创业时代。
So to suggest there's no more opportunity as a startup, I think it's never been probably more interesting or fun to be a startup.
是的,我认为现在的机遇比以往任何时候都更多。
Yeah, I think there's more opportunities than they've ever been.
你现在在寻找什么?
What are you looking for now?
当你和一家公司交谈时,是什么让你感到兴奋?
What gets you excited when you talk to a company?
最难的是优秀的人才,一直都是这样。
Well, the hardest thing is great people, always.
但我觉得另一个短缺的要素是自主性,即人们能够主动推动事情发生。
But I think the other thing that has been in short supply is agency, where people sort of have the agency to make things happen.
这 again,归根结底还是人才问题。
That's again, comes down to people.
但机遇太多了。
But there's so much opportunity.
我认为像了解某个领域或拥有该领域经验这样的传统因素,现在重要性低多了。
I think traditional things like knowing a space or experience in space is much less relevant now.
更多的是自主性。
It's more agency.
我们还没谈到机器人和现实世界模型等整个新领域。
We've not talked about the whole new world of robotics and real world models and all that.
这是一个独立的领域,我们可能没时间深入讨论。
That's a whole space by itself that we probably don't have time for.
哇,我们有吗?
Woah, do we?
我们还有
We've got
plenty of time.
plenty of time.
我非常想
I'd love
它。
it.
我想去那里。
I want to go there.
是的,因为我们讨论过我们未来的方向,你曾经 famously 谈到过2050年的世界,事情正在快速变化,模型变得越来越快、越来越强大。
Yeah, because we talked about where we're headed here and you famously talked about kind of the world of 2050 and things are moving fast, models are getting faster and more capable.
你认为机器人技术会走向何方?
And where do you see things like robotics headed?
我认为两年前我在TED演讲时说过,无论是双足机器人还是其他类型的机器人,十五年后的机器人产业规模将超过今天的汽车行业。
Well, I think two years ago when I gave a talk at TED, I said the robotics business, both bipedal and other robots, will be a larger business in fifteen years than the auto industry is today.
我们通常认为汽车行业是全球最大的产业之一。
We think of auto industry as one of the larger businesses on the planet.
而这个新兴领域将会更大。
And this other thing will be larger.
我认为几乎没有几家汽车公司会这样看待这个世界。
I don't think there's very many automotive companies who are thinking of the world that way.
他们只想着如何在生产线上使用机器人,而不是意识到机器人驱动的这个新产业规模将超越他们当前的业务。
They're thinking about how to use a robot in their assembly line, not that that business is larger than their current business, all driven by the intelligence of robots.
因此,那里对初创公司来说有巨大的机会。
So massive opportunities for startups there.
而且我们看到了很多活动。
And we are seeing a lot of activities.
是的。
Yeah.
而且我认为有时候我们低估了。
And I think sometimes we underestimate.
所以当你想到家中的机器人时,对吧?
So when you think about robots in the home, right?
这是一个非常有潜力的领域。
People, very fertile area.
不过,真正取得突破的还很少。
One's really had a breakthrough, though.
围绕复杂性存在太多不同的问题。
There's so many different issues around the complexity.
实际上,我花越多时间去思考,就越是对人类的处境心生敬意,因为我们的能力是能够在世界上自由行动;如果你观察机器人领域的人们,看到他们对机器人叠衣服如此兴奋,你可能会想,我18岁的孩子也会为这个感到兴奋,但对普通人来说,我假设他们本来就会叠衣服。
Actually, sometimes the more time I spend in respect I have for the human condition in a way, because our ability to move around the world and do you know, if you watch, like, the people in robotics getting so excited about a robot folding clothes, you know, perhaps my 18 year old, I'd be just as excited about, but for the average human, I assume they can fold clothes.
但我
But I
我觉得现在机器人领域的‘Hello World’就是
think It's like the hello world of robotics now is
叠衣服。
folding clothes.
是的。
Yeah.
但
But
你会不自觉地陷入一种想法,认为机器人必须某种程度上像人一样。
you do get a little stuck in your head that they have to somehow be a human.
但事实证明,可能只是会出现一些突破性的时刻,比如家庭中的陪伴。
But it turns out there may just be these breakthrough moments like, for example, companionship in the home.
对吧?
Right?
我们正面临人口老龄化。
We have an aging population.
世界上最大的问题之一是什么?我们正在谈论的全球性流行病。
What's one of the biggest, you know, we're talking about epidemics in the world.
孤独,可能是最大的流行病之一。
Loneliness, probably one of the biggest epidemic epidemics.
一个独自生活、刚刚失去配偶的人,最需要的是什么?也许只是一个能以自然、人性化方式与之交谈的人。
What does someone living alone, maybe has just lost a spouse value most, just someone to converse with in a way that feels intuitive and human?
我们看到越来越多的人使用ChatGPT来进行这种对话,但有没有突破性进展?
We see people using chat GBT more and more for this conversation, but is there a breakthrough?
或者事实是,你并不需要机器人来煮咖啡、叠衣服或洗碗,尽管这些也很有用,但也许真正重要的是某种更简单的东西,它依然能带来巨大价值,这正是文德所谈论的未来——从爬行到行走再到奔跑的第一步,这种整体复杂性所带来的价值,远超我们曾经在汽车行业看到的。
Or it turns out you don't need it to make coffee or full clothes or do the dishes, although that would be good too, but it might just be something a little bit more simple that still adds a lot of value and is just the the first crawl of crawl, walk, run of this kind of future that Vinod is talking about where that whole complex is x times more valuable ever than we saw in automotives.
我认为这很有趣,因为我们可以在当下将机器人置于各种场景中,去做一些这样的事情。
I think that it's it's interesting because we can sort of think of kind of like our present and put robots in places and do things like that.
当你拥有极低成本的劳动力和制造业时,很难想象从那里能构建出怎样的世界。因为你知道,我们目前可能认为这是个不错的解决方案,但当建造一个先进的养老设施、将许多人集中在一起的成本大幅下降时,情况就会不同。
It's really hard to think of when you really have extremely low cost labor manufacturing, etcetera And then the world you can build from there because, you know, we can look at that's a good solution for now but when the cost of building a wonderful state of the art assisted living facility where you can put a bunch of people together, the cost drops.
我认为我最难理解的问题是,真正降低成本究竟意味着什么。
I think that's the thing I have my the hardest problem is for me is to really think like, what does it really mean when you lower the cost?
我们正在降低智能的成本。
We're lowering the cost of intelligence.
当我们真正降低成本时,这究竟意味着什么?
What does it mean we really lower the cost
我个人认为,大概在下一个十年末,你会看到一个大规模通缩的经济,因为劳动力将接近免费,专业知识也将接近免费,大多数功能的成本几乎为零。
of Well, my personal view, sometime probably towards the end of the next decade, you'll see a massively deflationary economy because labor will be near free, expertise will be near free, most functions will be almost zero cost.
具体会如何发展,很难准确预测。
How it exactly plays out, a little hard to tell.
购买力与商品和服务的生产之间将如何互动。
How purchasing power versus production of goods and services plays out.
但我预计,我们将迎来一个远超人们预期的、极度通缩的经济。
But I expect we'll see a hugely deflationary economy at a level people aren't planning on.
因此,人工智能的普及还涉及一些尚未解决的社会层面问题。
So there's social aspects of adoption of AI that hasn't been handled yet.
我认为我们需要讨论的问题是:人们将会做什么?
I think the conversation we need to have is what will people do?
我经常被问到这个问题。
I get asked that a lot.
人们将如何谋生?
How will people make a living?
我认为,政府能够保障人们的最低生活标准将大大提高,而无需人们赚取收入。
I think the minimum standard of living governments can assure people is going to be much, much higher without needing to earn an income.
我的意思是,我无法想象,每月一美元就能获得比现在多十倍的初级医疗服务。
I mean, I can't imagine much better primary care, like 10x more primary care than today, doesn't happen for a dollar a month.
我很难想象这种情况如何实现。
I have a hard time imagining how that happens.
这将是事实:提供免费的初级医疗、免费的教育,以及为每个孩子配备几乎一对一的私人导师,成本几乎为零。
It will be true, it costs almost nothing to have free primary care, free education, almost personal tutors for every child.
这已经在发生了。
That's already happening.
因此会有一系列免费的服务。
So there's a set of services that'll be free.
有些难题需要解决。
There's some hard nuts to crack.
住房是最大的难题。
Housing is the hard one.
你知道,对于美国人口中收入最低的一半人来说,他们将收入的40%以上花在住房和食物上。
You know, for people in the bottom half of The US population, they spend 40 some percent of their income on housing and food.
确实有些难题,但我相信机器人技术和更优的方法都能解决这些问题。
So there's some hard nuts, but I do think both are addressable by robotics and better approaches.
嗯,这场对话非常有趣。
Well, this has been a very interesting conversation.
我很期待未来的发展方向。
I'm excited to see where things are headed.
感谢你们两位参加我们的播客节目。
Thank you both for joining us here on the podcast.
谢谢。
Thank you.
谢谢。
Thank you.
关于 Bayt 播客
Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。