Limitless Podcast - 多模态AI如何改变搜索 | 谷歌产品副总裁罗比·斯坦 封面

多模态AI如何改变搜索 | 谷歌产品副总裁罗比·斯坦

How Multimodal AI Is Changing Search | Google’s VP of Product Robby Stein

本集简介

谷歌搜索曾以"输入几个词,返回十条蓝色链接"著称。在本期《无限可能》节目中,产品副总裁罗比·斯坦与乔什和埃贾兹共同解密人工智能如何悄然颠覆这一认知模式,将搜索转变为能看、能听甚至能代你行事的跨模态对话系统。罗比阐释了为何AI具有"扩张性"而非"吞噬性",Gemini驱动模型如何自主发起谷歌搜索并实时查询后台数据图谱,以及为何通过Lens进行的视觉搜索正成为人们使用网络增长最快的方式之一。过程中他分享了发布Instagram故事的经验,深入探讨能为你预订餐厅或研究文件保险箱的智能代理,并坦率谈及广告、出版商流量,以及把守互联网入口所肩负的责任。 ------ 🌌 无限可能总部:收听与关注⬇️ https://limitless.bankless.com/ https://x.com/LimitlessFT ------ 时间轴 0:00 开场 0:56 搜索的AI驱动演进 3:26 从蓝色链接到AI概览 7:58 对话式搜索成为新认知模式 10:58 代理程序:代订餐厅与代办事务 14:15 AI是在蚕食还是拓展搜索? 16:52 硬件、眼镜与搜索载体 18:40 Gemini驱动搜索的技术内幕 21:46 Instagram故事、产品品味与用户体验 26:15 谷歌相较其他AI的独特优势 28:14 AI搜索的盈利与广告 30:25 广告原则与实验 32:56 Gemini与搜索的整合 36:04 Gemini与谷歌搜索的实际协作 39:58 个性化推荐与透明度 43:40 跨设备搜索的未来愿景 46:21 结束语 ------ 相关资源 罗比·斯坦 https://x.com/rmstein ------ 非财务或税务建议。投资披露详见: https://www.bankless.com/disclosures⁠

双语字幕

仅展示文本字幕,不包含中文音频;想边听边看,请使用 Bayt 播客 App。

Speaker 0

自远古以来,搜索就以谷歌及其辉煌的十条蓝色链接形式存在。

Since seemingly the beginning of time, search has come in the form of Google and their glorious 10 blue links.

Speaker 0

无论你提出什么问题,搜索引擎都会迅速整合全球知识并几乎即时呈现给你。

Ask any question in the world, and the search engine goes off aggregating the world's knowledge and presenting it to you nearly instantly.

Speaker 0

这简直只能用魔法来形容。

It can only really be described as magic.

Speaker 0

但现在交互方式正在转变。

But now the interface is shifting.

Speaker 0

它正从这种黑箱模式转向多模态,比如谷歌镜头——现在有了能与现实世界互动的传感器,能在语音、音频、视觉等多维度提供搜索功能。

It's shifting from this black box to multi modality, and things like Google Lens where you now have sensors that engage with the real world and can now give you search in many dimensions voice, audio, visual, all of the things.

Speaker 0

埃杰斯,我一直对此很好奇。

And I've been curious about this, Ejes.

Speaker 0

我知道你也对此非常好奇,幸运的是还有个人对此深感兴趣,他就是今天来到节目的罗比·斯坦。

I know you've been really curious about this, and thankfully, another person has been really interested in this, and this is Robbie Stein, who is now on the show.

Speaker 0

欢迎你,罗比。

Welcome, Robbie.

Speaker 0

Robbie是谷歌搜索的产品副总裁,看来你是与我们讨论这个话题的绝佳人选。

Robbie is the VP of product at Google Search, so you seem like the absolute perfect person to discuss this topic with us.

Speaker 0

我想从这个问题开始:新版TenBlueLinx会是什么样子?

I think the place I wanna start is, what does the new version of TenBlueLinx look like?

Speaker 0

搜索是否会保持不变,在AI时代,谷歌搜索未来的计划是什么?

Is is search going to continue to be the same, and kind of what is the plan for Google Search moving forward in a world of AI?

Speaker 1

首先感谢邀请我参加节目。

First of thanks for having me on the show.

Speaker 1

很高兴能来到这里进行这次对话。

Really fun to be here and to have this conversation.

Speaker 1

我认为,在这个AI时代,它本质上是在扩展的。

I think, ultimately, in this AI moment, it truly is expansionary.

Speaker 1

我们讨论的是它比以往任何时候都能为人们做更多事情。

We talk about the fact that it's doing more for people than ever before.

Speaker 1

当你思考搜索的未来时,实际上要从它现在的样子开始。

When you think about what the future of search looks like, it actually starts from what it is today.

Speaker 1

因为人们日常需要快速获取电话号码、在线支付账单或找到某个直接链接的需求并不会消失,但谷歌搜索能做的远不止这些。

Because the everyday need of someone grabbing a quick phone number, being able to pay a bill online, find a direct link somewhere, it doesn't really go away, but it's that Google search can do so much more.

Speaker 1

所以我认为你会发现搜索体验已经提升,变得更为强大。

So, I think what you're finding is that the search experience has enhanced, and it's become, a lot more powerful.

Speaker 1

而现在,AI体验也开始显现。

And now there's AI experiences that start to show up.

Speaker 1

在页面顶部会有一个小预览,展示AI可能在哪些方面提供帮助。

It's a little preview at the top of the page where AI could be helpful.

Speaker 1

你可以想象这样一个场景:当有人提出一个非常困难的问题时,这类问题会越来越多,通过点击那里,你现在就能体验到这种更具对话性的搜索版本。

You can imagine a world that someone asks a really hard question, you get more and more of those, and by tapping into there, you're now having this more conversational version of search.

Speaker 1

因此,第一个重要主题是你可以获得AI驱动的体验,而且是双向互动的。

So, the first big theme is that you can go and have an AI driven experience, and one that's a back and forth.

Speaker 1

你可以进行优化、提出后续问题、满足更多好奇心,这是一种不同的模式。

You have refinement, you have follow-up questions, you have more curiosity, and that's a different paradigm.

Speaker 1

但这确实只适用于特定类型的问题。

But it really makes sense for a specific kind of question.

Speaker 1

你只是在随意浏览,并不清楚自己具体想问什么。

You're just kind of browsing, you don't really know what you're trying to ask.

Speaker 1

实际上,核心搜索体验很大程度上就是为这类场景优化的,这可能就是最佳方案,因此并非每个查询都需要AI介入。

Actually, lot of the core search experience is really optimized for that kind of experience, and that might be the best, which is why you don't see AI for every single query.

Speaker 1

但我认为第一个显著变化将是这种对话式交互。

But I think the first big change you're gonna observe is that that conversationality.

Speaker 1

我认为第二个变化是多模态能力。

I think the second is multimodal.

Speaker 1

过去AI模式中,人工智能和搜索总体上更局限于类似网页的体验形式。

I think in the past AI mode, AI and search in general has been more constrained to a web experience that feels more like a web page.

Speaker 1

你输入文字,返回给你一个页面。

You type text in, you get you get a page back.

Speaker 1

但逐渐地,AI将演变为一种无处不在的智能存在——想象一个被编码来理解尽可能多的世界信息、网络内容,并能针对每个问题匹配最有价值网络资源的知识大脑。

But increasingly, it's just this kind of ambient knowledgeable AI that imagine something that has been encoded to understand as much of the world's information as possible, as much of the web and what parts of the web are most helpful for every single given question, and it's in this brain.

Speaker 1

你可以直接与它对话。

You can talk to it.

Speaker 1

一种方式是通过文本,通过我刚才提到的网页界面,你可以进行完整的对话。

One way to do it is through text, through the web interface I just mentioned, and you can have a whole conversation.

Speaker 1

另一种方式可以是实时的。

The other way could be live.

Speaker 1

所以现在在我们的应用中,你可以在开车时进行实时对话,使用完全相同的模型,它开始了解这个世界并与你交谈。

So in our apps right now, you can have a live conversation while you're driving with the exact same model, just start learning about the world and just talk to you.

Speaker 1

这有点难以置信。

It's kind of unbelievable.

Speaker 1

另一个是图像。

The other one is is is images.

Speaker 1

我们看到谷歌上视觉搜索量每年增长70%。

And and we see a 70% increase in the amount of visual searching year over year on Google.

Speaker 1

这实际上是人们搜索速度最快的方式之一。

It's actually one of the fastest ways people are searching.

Speaker 1

人们只想拿出相机,询问他们看到的东西或设备上的内容。

People just wanna take their camera out and ask questions about what they're seeing or ask questions about what's on their device.

Speaker 1

所以你基本上是从网页世界转向一个可以直接询问所见事物的世界,它能适应你生活中所需的各种形式。

So you kinda move from this world of a page to a world where you can just ask about what you're observing and it can kind of fit the form factor that you need given your life.

Speaker 1

无论你是在散步、开车,还是躺在沙发上玩手机,Google都与你同在。

And whether you're going for a walk, you're driving, or you're just on your couch on your phone, Google's there with you.

Speaker 2

我喜欢这种描述,它就像一个试图突破被束缚媒介限制的环绕式大脑。

I love this description of it's kind of like this ambient brain that's trying to break through the confines of whatever medium it's been chained to.

Speaker 2

最初,你知道的,就是那10个链接,Google主页的搜索排名,这对我来说充满了怀旧感。

And originally, you know, it's this 10 links, you know, the the homepage of Google, the search ranking, this is something that's pretty nostalgic to me.

Speaker 2

对吧?

Right?

Speaker 2

因此,如果它是互联网的门户,那么随之演变的任何事物都至关重要。

And so if it's the doorstep to the Internet, anything that evolves with that is is super important.

Speaker 2

我一直在思考的是,随着AI通过大语言模型、ChatGPT和Google Gemini流行起来,看到搜索页面演变至今,我很想了解推动我们或你们达到最终媒介的时间线和决策点。

And one thing I've been asking myself is, as AI has become popular through LLMs, ChatGPT, through Google Gemini, seeing the search page evolve and get to where it is today, I would love to understand the timeline and decision points that made us or made you get to that final medium.

Speaker 2

我的理解是,我们从搜索引擎起步,获得一个按从最热门到最不热门排序的10个蓝色链接的搜索页面。

So the way I think about it is we've started with the search engine and we get a ranked search page of 10 blue links, right, ranked from most popular to least popular.

Speaker 2

接着我们迎来了Google Gemini的出现,这个大型语言模型聊天界面。

And then we had Google Gemini appear, this LLM chat interface.

Speaker 2

然后这个大型语言模型聊天界面接入了网络搜索功能。

And then this LLM chat interface got access to web search.

Speaker 2

再后来我们有了由Gemini驱动的Google搜索,开始看到更多这类AI概览功能。

And then we had Google Search powered by Gemini where we started seeing more of these AI overviews.

Speaker 2

现在我们甚至有了能在后台自主工作的Google智能体。

And now we even have, like, Google agents that are doing autonomous work behind the scenes.

Speaker 2

你能简要介绍一下这个发展历程和关键决策点吗?

Can you briefly walk us through this timeline and decision points of how we got here?

Speaker 1

好的。

Yeah.

Speaker 1

我认为中间其实有几个关键环节需要说明清楚。

Mean, I think that there's a couple of pieces in between actually that are helpful to clarify.

Speaker 1

其实Google很早以前就以这个经典搜索页面闻名,大家都熟悉并喜爱那些蓝色链接,但此后它已经经历了多次演变。

I mean, I think Google a long time ago had, you know, blue links as this prototypical search page that everyone knows and loves, But actually it's evolved a lot over the years since then.

Speaker 1

我们考虑过通用搜索,比如当你查询本地信息时,地图信息就会显示出来。

We think about universal search and how if you ask for local information, maps information shows up.

Speaker 1

或者当你提出视觉相关问题时,会得到通用图像搜索结果,看到相关图片。

Or you ask about visual questions, you get a universal for, image search and you'll see like related images.

Speaker 1

又或者查询热门信息时,顶部会出现一个新闻头条单元,展示相关文章。

Or if you ask about trending information, you'll get a top stories unit with, articles at the top.

Speaker 1

对吧?

Right?

Speaker 1

此外还有些非常具体的问题,我们会在页面顶部特别展示一个带有超大摘要的网页结果,比如问‘有多少...’这类问题。

So and then you also have very specific questions someone might ask where we would feature one web result with an extra large snippet at the top of the page that might be like, hey, how many?

Speaker 1

我不知道。

I don't know.

Speaker 1

你得提出一个很多人都会问的具体问题,我们就会特别突出显示它。

You gotta give a very specific question that lots of people ask and we just kinda highlight that.

Speaker 1

所以我认为上述每个变化都是搜索体验的重大演进。

So each of those I think have been fairly large evolutions to the experience.

Speaker 1

我认为AI概览功能正是基于这一背景应运而生的。

And I think off the back of that is where AI overviews came.

Speaker 1

因为,你知道,我们已经具备了这些丰富的体验。

Because, you know, we had these rich experiences.

Speaker 1

当你查询天气时,页面顶部就会显示天气信息,对吧。

You asked about weather, you get weather information, right, at the top of the page.

Speaker 1

在页面最顶端,你可以针对具体问题获取更精确的信息。

Where right at the top, you could get more specific information when you had a specific question.

Speaker 1

因此很自然地,AI能进一步解锁用自然语言提出更复杂、更冗长问题的能力。

And so a natural extension of that is how AI could unlock the ability to really ask these harder, longer questions with natural language.

Speaker 1

这样即便没有单独网页包含该信息,你也能通过提出具体问题获得AI生成的概览。

So you could ask a very specific question, and even if there's not one specific web page that has that information, you could provide this AI overview.

Speaker 1

这就是我们所见证的第一次重大演进。

And so that was the first big move in evolution of what we saw.

Speaker 1

接下来,一旦实现这点,我们实际观察到用户正试图更频繁地触发和使用该功能。

And the next one was, okay, well once you had that, we actually observed that people were trying to get it to trigger more and show up.

Speaker 1

所以人们实际上会在搜索框里输入‘AI’这个词。

So people would actually put the word AI in the search box.

Speaker 1

然后,另一件事是他们并不满足于此。

And so, and then the other thing was they weren't done there.

Speaker 1

他们想继续追问,但没有简单的方法实现这一点。

They wanted to ask follow-up questions and there was no easy way to do that.

Speaker 1

这最终催生了AI模式的概念,即通过搜索实现对话式交互体验。

And so, really, that led to this idea around AI mode, which is a way to have this conversational experience with search.

Speaker 1

而且从AI概览进入这个模式变得越来越容易。

And it's increasingly been, you know, easy to get to from AI overview.

Speaker 1

你搜索后会在顶部看到AI预览,展开后就能进入AI模式深入探讨。

So you search, you get the little AI preview at the top, you expand it, and you can go deeper in AI mode and have a poll.

Speaker 1

现在你可以和谷歌进行十到二十轮的对话交流。

You can have ten, twenty back and forth conversation with Google now.

Speaker 1

以上就是整个发展历程的主要脉络。

And so, all of that was kind of the main arc of how it's evolved.

Speaker 1

然后我认为它只是沿着这种复杂性和用户需求的曲线逐步上升。

And then I think it just kind of ascends this kind of curve of complexity and and and and kinda user need.

Speaker 1

我们就在想,好吧,用户接下来想做什么?

We're like, okay, what's the next thing the user's trying to do?

Speaker 1

你不断提出这个问题,思考我们还能如何提供帮助?

You just keep asking that question, and how else we can be helpful?

Speaker 1

举个例子,假设你想在旧金山这周五晚上7点找个约会餐厅。

So, for example, let's say you want to find a restaurant for date night in San Francisco this Friday at 07:00, specifically.

Speaker 1

你可以把这个输入谷歌,我们会给出AI概览,然后你可以在AI模式下反复交流,直到得到一份不错的餐厅列表。

Like, it's possible that you could put that into Google and we get an AI overview and you would go back and forth with AI mode until you, you know, like, got to some list of great restaurants.

Speaker 1

但你上谷歌不只是为了找餐厅。

But then you're not going to Google just to figure out a restaurant.

Speaker 1

你真正想要的是把预订作为这段旅程的最终目标。

You actually want the reservation as the the ultimate end to that journey.

Speaker 2

行动。

Actions.

Speaker 1

是的。

Yep.

Speaker 1

所以你的下一个问题是,好吧,我们能做些什么来代理性地为你预订那张桌子或提供帮助。实际上,我们现在已经实现了一个功能,让代理可以浏览并与OpenTable及网络上的预订系统交互,然后带回一个列表,不仅包括通过分析和推理为你呈现的优秀餐厅,还在列表中显示可用的餐桌时间,这确实非常神奇。

And so your next question is, okay, well, what could we do agentically to actually book that table for you or help, you know, we actually implemented now in his live, a way for an agent to browse talk and OpenTable and the web for reservations, and then bring back in the list, not only great restaurants that use, you know, analysis and reasoning to present that list for you, but then in that list, times at the table's available, which is really really magical.

Speaker 1

希望这能稍微说明我们是如何将体验从感觉像一页纸转变为感觉像一场互动式AI对话,并且能够越来越多地为你和与你一起完成事情的。

And so hopefully that tells a little bit of the story of how we went how how kind of kind of evolved the experience from one that felt like a page to one that felt like an interactive AI conversation and one that could increasingly do things for you and with you.

Speaker 2

这种互动对话正是我认为这个新世界的样子,但我有点难以理解这种思维跳跃具体是怎样的。

That interactive conversation is exactly what I think this new world looks like, but I'm kind of struggling to get my head around what that mental leap looks like.

Speaker 2

对吧?

Right?

Speaker 2

我习惯了搜索。

So I'm used to search.

Speaker 2

我习惯了在键盘上敲击按键,输入文字,获取我想要的信息,然后滚动、点击,最终完成一个操作。

I'm used to tapping keys on my keyboard, typing words, getting the kind of information that I want, and then kind of scrolling, clicking, and getting to an action.

Speaker 2

这种多模态版本会是什么样子呢?

What does the multimodal version of this look like?

Speaker 0

或许我们甚至可以用一种心智模型的形式来实现。

And maybe we can even do it in the form of, like, a mental model.

Speaker 0

那么,如果传统搜索像是‘输入文字获取链接’,当我的相机、语音、截图和电视成为交互方式时,这会变成什么样?

So, like, if classic search was kind of, like, type to links, then what is it when my camera, voice, screenshots, and TV are kind of, like, the the way you interface?

Speaker 0

那么,这个类比看起来是怎样的?

So, like, what does that analogy look like?

Speaker 1

是的。

Yeah.

Speaker 1

我想首先要说的是,我不认为人们想要彻底改变他们对谷歌的心智模型。

So I guess the first thing to say is I don't think people wanna dramatically change their mental model of Google.

Speaker 1

他们基本上希望——这是我个人的观点,尤其作为用户时的感受——通常你只想说‘我有个问题’。

They wanna basically think and and this is my personal view, and particularly as a user, how I feel, is in general, you kinda wanna just say, I have a question.

Speaker 1

我想把这个问题抛给谷歌。

I wanna put I wanna ask Google this question.

Speaker 1

你把它输入谷歌。

You put it into Google.

Speaker 1

目前,主要的方式还是通过文本,但它正迅速向多模态、视觉、语音等方式扩展。

And right now, the the way main way that happens is text, but it's very quickly growing to be multimodal, visual, voice, etcetera.

Speaker 1

但它们都源于同一个根本需求——我有个问题要问。

But they're all just different modes of the same root need, which is I have a question.

Speaker 1

可能是关于你正在看的东西的问题,比如在社交媒体上截图问'我想知道这套衣服是什么,怎么买'。

Could be a question about something you're looking at, could take a screenshot on social media and say, I wonder what this outfit is and how I could buy it.

Speaker 1

可能是你路过一棵树时拍照想知道那是什么树。

Could be a tree you're walking past, take a picture of, wanna know what that tree is.

Speaker 1

但无论如何,核心都是提出一个问题。

But it's a question, never you know, nevertheless.

Speaker 1

所以你带着问题,把它交给谷歌,然后选择最便捷的提问方式。

And so you take your question, you put it into Google, and then you you kinda decide what's the easiest way to do it.

Speaker 1

如果你在电脑前,可能是复制粘贴作业问题,或直接打字提问。

Well, if you're on your computer on your desktop, maybe you have a homework question, you'd copy and paste it, or you'd ask a question, or you just type it.

Speaker 1

如果你在外用手机,就会选择拍照提问。

If you're out and about on your phone, and you'd take a picture.

Speaker 1

如果你正使用安卓设备查看手机,可以使用圈选搜索功能。

If you happen to be on an Android device and you're looking on your phone, you use circle to search.

Speaker 1

人们已经在以巨大规模进行这种操作。

People are already doing this at enormous scale.

Speaker 1

每月约有十亿人使用镜头功能,使用这些可以拍照的产品。

It's like a billion people a month using lens, using these products where you can take a photo.

Speaker 1

所以我认为这已经是一种广为人知的模式了。

So I think this is a pretty commonly understood pattern at this point.

Speaker 1

然后它就直接跳转到谷歌。

And then it just goes to Google.

Speaker 1

从用户角度来说,我认为他们并不希望被打扰太多。

And to from the user's perspective, they don't really want, I think, to be bothered too much more than that.

Speaker 1

他们只想把问题输入谷歌,然后谷歌应该负责判断并为你提供最佳信息。

They just wanna get their question into Google, and then Google should do the judge do the work to give you the best possible information.

Speaker 1

所以如果你问一个非常基础的问题并想浏览相关信息,比如有个你从未听说过的新首发四分卫,你输入这个人的名字。

And so if you ask a question that's really basic and you wanna browse about it, like, let's say there's a new starting quarterback you've never heard of, you put the person's name in.

Speaker 1

当然,你可能想要一个快速描述,但实际上你通常想浏览的是那种针对人物的搜索。

Like, sure, maybe you want a quick description, but you actually wanna browse typically that a per a people search like that.

Speaker 1

你想看到这个人的照片,快速的知识面板。

You wanna see photos of the person, quick knowledge panel.

Speaker 1

你想看看他们在社交媒体上的近期帖子。

You wanna see their, like, recent posts on social media.

Speaker 1

你知道的,你想看看关于他们的报道文章。

You know, you wanna see articles that have written about them.

Speaker 1

在很多情况下,对于这种需要浏览类的问题,AI的回答其实并不理想。

In many cases, an AI response is actually not great for that kind of a question that's like a browser kind of need.

Speaker 1

但如果你问一个非常具体的问题,砰,你会在顶部直接获得AI提供的概览,并有深入探讨和来回交流的方式。

But if you ask a really specific question, boom, you're gonna get AI right at the top with an overview and a way to dive in and have them back and forth.

Speaker 1

所以你某种程度上依赖谷歌总能根据问题给你最佳的呈现形式。

And so you kind of rely on Google to always give you the best format given the question.

Speaker 1

如果这是可预测的,那么你大概知道,好吧,如果你要求一些鼓舞人心的图像类内容,你就会得到图片。

And if it's predictable, then you kinda know, okay, if you you ask for something inspirational and image y, you get images.

Speaker 1

对吧?

Right?

Speaker 1

你会得到视觉内容。

You get visual stuff.

Speaker 1

如果你问一个具体的知识性问题,就会得到AI的回复。

If you ask a specific question about knowledge, you get AI.

Speaker 1

如果你查询某个人物,就能看到他的照片和人物简介。

If you ask a question about a person, you get see photos of that person, you get his description of people.

Speaker 1

这就是我的理解,也是我们从用户那里听到的反馈。

So that's how I think about it, and I think what we hear from users.

Speaker 0

你一直在讨论形式因素,这让我很感兴趣,因为我觉得包括我们在内,当人们谈论AI时,往往只想到聊天机器人,在搜索场景下就是一个聊天框加文本框。

So you're talking about the the form factor a lot, which is something I'm interested in because I think a lot of the times when people talk about it, ourselves included of AI, we kind of imagine, well, there's there's a chatbot, or in terms of search, there's a chatbot, and then there's a a text box.

Speaker 0

中间形态很少,我好奇的是你如何看待这类产品最终形态的演变方向。

And there's not really much in between, and I guess what I'm curious about is the way you see the the final form of this kind of evolving over time.

Speaker 0

因为你描述的是谷歌的文本框,而我们还有带摄像头的多模态交互。

Because you you describe a text box with Google, and then we have the multimodality with cameras.

Speaker 0

当我听到这个,它让我想起了早期的计算机,那时我们刚开始只有基于文本的东西,这比打孔卡要好,然后我们发展到了图形用户界面,接着又进化到了网页浏览器。

When And I hear this, it it kind of reminds me of the early computer where we kind of started with just text based stuff, which was better than punch cards, and then we evolved to a graphical user interface, and then we evolved to web browsers.

Speaker 0

这种自然的演进过程在当时并不明显,但事后看来却如此清晰。

And there was this kind of natural evolution that wasn't obvious, but in hindsight, it was so clear.

Speaker 0

我很好奇你对这种演进接下来会走向何方有什么看法,因为我知道你是Google Lens的先驱,这非常了不起。

And And I'm curious if you have any ideas on where that evolution goes from here, because I know you pioneered Google Lens, which is amazing.

Speaker 0

我一直在使用这个产品。

I use the product all the time.

Speaker 0

这是我最喜欢的与世界互动的方式,因为它就像一根魔法棒,我可以看着任何东西并从中获得答案。

It's my favorite way of engaging with the world because it just feels like this magical wand that I could look at anything and get answers from.

Speaker 0

我很好奇你认为这种技术未来会如何发展。

I'm And curious where you see that kind of progressing to over time.

Speaker 1

是的。

Yeah.

Speaker 1

嗯,我是说,你之前问过我关于模式的问题,比如,心理模型是什么。

Well, I mean, you were talking I mean, earlier you asked me about what's the pattern, like, what's the mental model.

Speaker 1

听你重新阐述后,我想最简洁的说法是:这是一种对话的心智模型。

And I guess in hearing you rearticulate it, the the the most succinct way of saying it is it's the mental model of of conversation.

Speaker 1

所以如果你想想,当你与某物或某人对话时,会有文本输入框、照片上传功能,还能实时连线看到对方。

And so if you think about, you know, when you have a conversation with something or a person, then you have a text field, you have a way to upload photos, you can go live with that person and see them right now.

Speaker 1

对吧?

Right?

Speaker 1

你可以和那个人进行实时视频聊天。

You can have, like, a live video chat with that person.

Speaker 1

这些都只是接触那个人的不同方式而已。

They're all just different modes of accessing that person.

Speaker 1

我认为技术正朝着这个方向发展:与人交流会变得像和真人对话一样简单。

And I do think that technology is moving in a direction where it's as simple to communicate with as talking to a person.

Speaker 1

它能接受所有交流模式和讨论方式。

Mean, it accepts all of the modes and all the ways of discussing with the person too.

Speaker 1

不过现在的情况是,对大多数人来说,大多数需求并不需要长时间对话。

Now it just turns out that most people, for most needs, don't want need to have a long conversation.

Speaker 1

就像,它们更像是时间线上的单次瞬间。

Like, they're kinda single shot moments in time.

Speaker 1

就像你坐在电脑前,突然想起来:哦,我完全忘了。

It's like you're sitting at your computer, you're like, oh, I just totally forgot.

Speaker 1

我得给这个地方打个电话。

I need to call this place.

Speaker 1

你快速谷歌一下电话号码,随便哪个。

You just Google really quick phone number, whatever.

Speaker 1

打给他们。

Call them.

Speaker 1

对吧?

Right?

Speaker 1

所以你不需要——我不需要和它进行长达二十分钟的对话。

And so you don't need I don't need to have like a twenty minute conversation with the thing.

Speaker 1

但如果你想想那个小瞬间,你其实是在与谷歌交流,表达需求并获取反馈信息。

But if you think about in that little moment, you were kinda communicating with Google, you're expressing a need, you're getting information back.

Speaker 1

因此,我认为从心智模型的角度来看,这越来越是正确的方式。

And so I think increasingly that is the the right way to think about it from a mental model perspective.

Speaker 1

所以如果你思考事物的发展方向,可以大致表达出,至少对谷歌而言,我们非常关注信息需求。

And so if you think about where things are going, you can kind of just articulate, you know, what else, at least for Google, we think a lot about informational needs.

Speaker 1

那么,比如,我们还能提供哪些新型信息任务来帮助你?可能是新模式,如果有用的新模式出现,也可能是在特定问题上为你做更多。

So, like, what other kind of informational task that we could do to be helpful for you could be new modes, like if new modes come up that could be useful, but it could also be doing more for you given a given a certain question.

Speaker 1

就像你可能会发短信找人帮忙说:嘿,能帮我个忙扫描这些东西并告诉我需要了解的内容吗?

The same way you might text someone to help you with something and say, hey, can you do me a favor and scan all this stuff and tell me what I should know about it?

Speaker 1

或者,比如,你能帮我预订这个活动吗?

Or, you know, can you, you know, book this event?

Speaker 1

我正在想办法买这场比赛的票,但过程很复杂。

I'm trying to get tickets to this game and it's complicated.

Speaker 1

在某些情况下,你甚至可能雇人来帮你,比如为超级碗比赛或其他难买的票找到最佳位置。

You know, you might even hire, like, someone to help you, in some cases, get the perfect ticket for a Super Bowl game or some, you know, hard to get ticket.

Speaker 1

或许可以请个代理帮你完成这项工作。

You know, maybe an agent could go do that work for you.

Speaker 1

在后台,这就像发短信给一个人,对方回复说,给我几分钟时间。

And on the back end, it's almost like getting texting a person, the person getting back to you saying, give me, like, a few minutes.

Speaker 1

我会查一下,然后他们会给你一些结果。

I'm gonna look into this, and then they give you, you know, some result.

Speaker 1

实际上,这两张票是有的,它们相邻,价格是这个,但AI能做到大多数人无法付费或对人来说非常困难的事情,因为这需要你搜索数百个东西才能回复你。

Like, actually, these two tickets are available, they're next to each other, and they're at this price point, except that the AI can do things that, you know, most people, you know, couldn't pay for or would be really hard for a person to go do because it would require you to search, like, hundreds of things to get back to you.

Speaker 1

所以我认为这是一种有帮助的思考方式。

So I think that's that's kind of a a helpful way of thinking about it.

Speaker 0

关于最后一点,搜索数百个东西,我们询问了观众他们想问你的问题,其中一个问题是,当人们认为AI正在蚕食搜索时,谷歌将如何生存,一个有广告模式,一个没有,我想问你这个问题,因为我觉得你会说什么,我想让你帮我验证这个零拇指理论,或者说零和理论,关于聊天机器人,我听过你说,也听过谷歌整体说,AI概览和AI整体让人们搜索更多,某种程度上我们看到杰文斯悖论,随着数据增多,人们更有动力去搜索更多,创建更多查询,我想让你帮我们理解这个现象,在有AI的情况下,实际上有更多的搜索发生。

To that last point, the the searching hundreds of things, we we asked our audience what they wanted to ask you, and one of the questions is, how how is Google going to survive when the perceived thing is that, well, AI is kinda cannibalizing search, and one has an ad model, one doesn't, and I kind of wanna ask you, because I have a feeling what you're gonna say, I want you to help me falsify this zero thumb theory, or zero sum theory, as it relates to chatbots, and kind of like, I know I've heard you say, and I've heard just Google as a whole say, that AI overviews and AI as a whole, it kind of makes people search more, and in a way we're seeing Jevan's paradox where as there becomes more data, there becomes more of an inkling to search more, to create more queries, and I kinda want you to help help us understand that phenomenon, I guess, where in the case that there is AI, well, there's actually a lot more search happening.

Speaker 0

搜索数量并没有下降,实际上是在上升。

The the number of searches don't go down, they actually go up.

Speaker 1

是的。

Yeah.

Speaker 1

所以我认为AI是一种扩张性的东西。

So I think AI is is an expansionary thing.

Speaker 1

就像人们有各种各样的问题,他们本可以问更多,但由于种种限制而没有问。

It's like people had all these questions and they could ask so much more, but they they didn't because there were limitations.

Speaker 1

因此我认为最好的例子就是类似Lens结合AI这样的应用,这是最佳方式。

And so I think the best example of that is something like Lens and with AI, which is the best way.

Speaker 1

对吧?

Right?

Speaker 1

现在你可以拍一张书架的照片然后说,我已经读过这些书了。

So you can now take a picture of your bookshelf and say, I've given these books.

Speaker 1

接下来我该读什么?

What should I read next?

Speaker 1

或者你可以截屏某个名人的穿搭然后问,哪里能买到这件夹克之类的东西?

Or you can take a screenshot of, you know, a celebrity outfit or something and say, where could I buy this jacket or something?

Speaker 1

十年前可能也有人尝试在谷歌上输入这类问题,但当时要做到这点相当困难。

It's possible someone would try to put that kind of a question into Google ten years ago plus, but it'd be pretty hard to do that.

Speaker 1

同理,要让人输入一个长达20个句子的问题也是相当困难的。

In the same way, it would be pretty hard for someone to type in like a 20 sentence question.

Speaker 1

就像说,我要去旅行了。

That's like, I'm going on this trip.

Speaker 1

我有个孩子。

I have a kid.

Speaker 1

这孩子有这种过敏症。

The kid has this allergy.

Speaker 1

他们有这样的需求。

They're like, this need.

Speaker 1

我得去住酒店。

I have to go to a hotel.

Speaker 1

酒店需要满足这些条件——以前根本没法这样查询。

Hotel needs to be far you just like couldn't do that.

Speaker 1

所以人们往往就会放弃或者不去尝试。

And So people would just kind of give up or not do it.

Speaker 1

当你为用户解锁新需求时就会带来增长,这正是我们当前看到的情况。

So you get growth when you're unlocking new needs for people, and that's what we're seeing.

Speaker 1

我认为最好的总结方式是:人们通过搜索获取快速高效信息的基本需求并未改变,但现在可以向技术提出更多问题,这正是增长来源所在。

So I think the best way I can summarize it is the everyday need I just mentioned for people getting fast, efficient information from search isn't really changing, but now you can ask technology so many more questions, and that's where the growth's coming from.

Speaker 1

要知道,我们讨论过这个,比如AI概览功能,我们确实看到了增长。

Know, and we talked about this, like, you know, AI overviews, you know, we're seeing growth.

Speaker 1

我们讨论过AI概览功能出现的场景。

You know, we we talk about where AI overview shows up.

Speaker 1

它通常出现在用户提出更具体问题时,尤其是那些更详细、更具体的查询。

It's around when people have a more specific question, but they put like a longer, more specific question.

Speaker 1

当用户把这些输入谷歌时,就会获得AI生成的回答。

They put that into Google, they get this AI response.

Speaker 1

这类查询在美国等主要市场增长了约10%,以谷歌的规模而言,这样的年增长率相当可观。

Those kinds of questions are up about 10% in in large markets like The US, which at Google scale is a pretty enormous kind of growth number year over year.

Speaker 1

而我提到的这类视觉搜索功能,年增长率更是达到了70%。

And then these kind of visual searches I mentioned are up 70% year over year.

Speaker 1

因此在这些市场扩张最快的领域出现了巨大增长,这正是我们观察到的现象。

So huge growth in these areas where the market's expanding the fastest, and that's exactly what we're seeing.

Speaker 1

但显然,市场蛋糕正在扩大,人们有大量机会从多方获取信息,这令人振奋。

But obviously, the pie is growing, there's lots and lots of opportunities for people to get information from lots of people, and that's exciting.

Speaker 2

我很好奇。

I'm curious.

Speaker 2

在构建这一愿景时,你们对消费级硬件设备有多重视?

How much importance do you place on consumer hardware devices when you're kind of thinking about building out this vision?

Speaker 2

你知道,Meta一直在尝试通过眼镜重塑设备形态。

You know, Meta's been attempting to, you know, revamp the form factor with glasses.

Speaker 2

有传言称OpenAI正在打造自己的消费级设备。

OpenAI is rumored to be building their own consumer device.

Speaker 2

天知道那会是什么样子。

Goodness knows what that looks like.

Speaker 2

当然,谷歌很重视与苹果的合作关系,通过Safari提供主要搜索功能。

And, you know, obviously, Google values their partnership with Apple being the primary search function through Safari.

Speaker 2

你们有考虑过这方面吗?

Is this something you consider at all?

Speaker 2

这是谷歌想要自己掌控并塑造的东西,还是说更像是‘让我们静观其变,让搜索功能自然融入各处’?

Is this something that Google kinda wants to own and kind of, like, form themselves, or is it just kind of, like, let's see what happens and search will be integrated everywhere?

Speaker 1

我认为搜索已经无处不在,我们视其为一种服务,应当能在人们需要帮助的任何设备或场景中触手可及。

I mean, think search is so ubiquitous that we think of it as a service that should just be accessible to any device or context that someone needs help in.

Speaker 1

无论你是用手机时对所见内容产生疑问,还是在拍摄某物时——谷歌希望人们能在任何情境下获得谷歌的价值。

You know, whether you're on your phone and you have a question on what you're looking at, when you're taking a picture of something, like Google wants to we want people to get the value of Google in whatever that context looks like.

Speaker 1

我个人非常认同‘最好的相机就是你口袋里的那部’这种说法。

I personally am a big fan of the the kind of saying the best camera is the one in your pocket.

Speaker 1

嗯。

Mhmm.

Speaker 1

我觉得这是个非常贴切的比喻。

I think it's just like a really apt point.

Speaker 1

因此,未来可能会涌现许多酷炫的新事物。

And so, like, there's lots of cool new things that will probably be created.

Speaker 1

但这可能需要时间。

They'll probably take time.

Speaker 1

但真正重要的是人们日常使用的便利性。

But really, there's just the convenience of what people use every day is important.

Speaker 1

懂吗?

Know?

Speaker 1

人们依赖某些技术,依赖特定的提问方式。

People are people rely on certain technologies, certain ways of asking questions.

Speaker 1

就像他们的相机一样。

Similar to their camera.

Speaker 1

比如,他们有相机,随身携带,并且经常拍照。

Like, they have a camera, they have it in their pocket, and they're constantly taking photos.

Speaker 1

所以如果你能贴近这种体验,那就太好了。

And so if you can be close to that experience, great.

Speaker 1

这意味着你只需轻点一下就能把照片发送给谷歌,比如说,然后询问你看到的是什么。

That means that you can be one tap away from sending your photo to Google, let's say, and asking a question what you're looking at.

Speaker 1

我认为这就是很多人想要与技术互动的方式。

And I think that's how a lot of people wanna interact with with technology.

Speaker 1

显然,我认为搜索领域一直渴望融入每一个令人兴奋的新技术交互方式,而现在确实出现了一个激动人心的突破性硬件类别。

And I've some obviously, there's an exciting breakout hardware category, you know, that would be something that I think search historically has always wanted to be a part of every new exciting new way that people are interacting with technology.

Speaker 2

罗比,我们的观众和听众最常问的一个问题是:这听起来太神奇了。

Robbie, the the number one question we get from viewers of our and listeners of our show is this sounds magical.

Speaker 2

不过,这是怎么运作的呢?

How does this work, though?

Speaker 2

帮我理解一下,背后的原理是什么?

Help me understand, like, what's happening under the hood.

Speaker 2

我想把这个问题指向Gemini搜索和谷歌搜索,包括你刚才解释的所有AI概览功能。

I would like to point that question towards Gemini search and Google search in general with the AI overviews and everything you've explained so far.

Speaker 2

能否给我们一个高层次的解析,说明背后的运作机制?

Can you give us maybe a high level breakdown as to what's happening under the hood?

Speaker 1

在谷歌,我们在AI领域的努力方向是打造世界上知识最渊博的人工智能。

So at at Google, what we've done with AI is we've really tried to create the world's most knowledgeable AI.

Speaker 1

这个AI不仅要深度连接谷歌的海量信息,还要整合整个网络上的知识。

So one that really is connected to the vast information that obviously, like, sits at Google, but also around the web.

Speaker 1

对吧?

Right?

Speaker 1

就像,关键在于连接和理解这个世界,这是独一无二的。

Like, it's about connecting and understanding the world, and that's unique.

Speaker 1

因此我们有机会打造一个AI,要知道,购物目录中有数十亿商品,每天有数亿商家在更新本地信息。

And so we have an opportunity to have an AI that, you know, obviously, there's there's, say, billions of products in the shopping catalog, hundreds of millions of places that businesses are updating with local information every day.

Speaker 1

知识库中实时更新着数万亿条事实数据。

There's trillion facts in the knowledge base updated all the time.

Speaker 1

还有实时金融数据、体育赛事、旅行信息、航班动态价格等各类资讯。

There's information about live finance data, sports, travel information, live prices for flights, all that stuff.

Speaker 1

我们希望让每个人都能轻松快捷地获取这些信息。

Like, we want to be able to make that easily and quickly accessible to anyone.

Speaker 1

当然还有我们想为你连接的浩瀚网络资源。

And then you obviously have the vastness of the web that we wanna connect you to.

Speaker 1

所以我们构建的模型——基于Gemini基础架构的AI模型,本质上是个能理解自然语言和多模态问题、生成回答,并能消化所有知识的大型语言模型。

And so the models that we've built, there's an AI model that is kind of based on Gemini's foundational model, which is the one that kind of under is a large language model that understands, you know, natural language and multimodal questions and is able to generate responses, is able to also understand all that knowledge.

Speaker 1

因此,当你提出一个问题时,后台的AI模型会开始生成谷歌搜索来进行相关研究。

And so you can ask a question, and what'll happen on the back end is the AI model will start to actually generate Google searches to start researching that.

Speaker 1

根据问题的复杂程度,它可能会花时间进行思考、推理和研究。

And given the complexity of the question, may actually spend time thinking and reasoning and doing research.

Speaker 1

比如你询问应该选择哪种太阳镜,想了解偏光与非偏光镜片的区别及其优点,可能会衍生出数十个相关问题。

And so if you ask a question about what kinds of sunglasses you should get and learning more about polarized versus not and its benefits, there might be dozens of questions connected to that question.

Speaker 1

在底层机制中,模型实际上正在进行搜索。

And what would happen under the hood is that the model's actually searching.

Speaker 1

它就像人类一样发起大量谷歌搜索,并可能同时使用购物图谱等API进行研究。

Mean, it's super it's issuing a bunch of Google searches as if a person would, and it's potentially using APIs like the shopping graph to do research as well.

Speaker 1

随后它会检索所有相关信息,借助搜索领域的知识和信号,对特定问题所需的最佳信息形成准确判断。

It would then retrieve all of the relevant information and then because of, you know, all of the search knowledge and signals that that are available in search, kind of a good understanding of what information is great for a given question.

Speaker 1

所有这些信息都会被反馈给模型,模型经过推理后生成包含深入链接的回应,你可以通过这些链接了解更多信息,甚至购买所需商品,继续探索之旅。

All of that is is brought back into the model and the model reasons about it and and generates a response with links to dive in, learn more, potentially buy the thing you're looking for, and continue your journey.

Speaker 1

这一切都是通过这个AI体验实现的。

And and that all happens, you know, through this AI experience.

Speaker 1

这基本上就是本次AI概览中展示的主要内容。

And so this is this is largely previewed in this AI overview.

Speaker 1

如果你现在输入一个难题,比如‘如何去除浅色白沙发上的番茄酱污渍?’

So if you just put a hard question right now, you go, how do I get ketchup stains out of a light white couch?

Speaker 1

在谷歌搜索这个问题,你很可能会在顶部看到一个小型的AI预览——AI概览。

Put that into Google, you're very likely to get a little AI preview at the top, AI overviews.

Speaker 1

如果展开并点击AI模式,你就能进行完整的对话交流。

And if you were to expand it and click AI mode, you can have a whole back and forth with that.

Speaker 1

而实现这一切的正是这个AI模型。

And and that's the model that's doing that.

Speaker 2

立即吸干多余番茄酱,罗比,然后涂抹冷

Block the excess ketchup immediately, Robbie, and apply solution of cold

Speaker 1

水和洗洁精的混合溶液。

water the mold dish soap.

Speaker 1

我现在真的说得太久了。

I literally am now too long.

Speaker 0

罗比,我得坦白一件事,其实我有点像是你的粉丝,直到最近才意识到这一点,因为你曾在Instagram工作过,而且你刚才提到'最好的相机就是你口袋里的那部'。

Robbie, I have to I have to confess something, which is that I I'm a bit of a fanboy and I didn't quite realize until recently because you had a stint at Instagram, and you just mentioned a little earlier the best camera is the one in your pocket.

Speaker 0

我热爱摄影,喜欢拍照,尤其钟爱Instagram故事功能,因为这是我最喜欢的分享内容方式

I love photography, I love taking photos, and particularly I love Instagram stories because it's my favorite way of sharing content

Speaker 1

在世界上

in the world

Speaker 0

在互联网上。

on the internet.

Speaker 0

后来我发现,你就是Instagram上实现这一功能的关键人物,这太了不起了。

What I found out is that you are the guy responsible for implementing that at Instagram, which was amazing.

Speaker 0

这是个绝妙的产品应用,可能也是我至今仍频繁使用Instagram的主要原因

It's such a great product application, and probably the single reason why I still use Instagram for a lot

Speaker 1

多年。

of years.

Speaker 1

有很多人共同努力才让这个功能取得成功,但我也很荣幸能参与其中。

Lot of people worked on making that successful, but I was very privileged to have a chance to work on that as well.

Speaker 1

是的,它

Yes, it

Speaker 0

确实是个非常酷的功能。

was Oh, really a really cool feature.

Speaker 0

所以我想请教你的是,在涉及搜索功能时,你是如何考虑实现这类功能的?因为我觉得这并非原创概念。

So what I kind of wanted to ask you about that is how you think about implementing features like that kinda as it relates to search, because I imagine this wasn't an original idea.

Speaker 0

众所周知,Snapchat最早推出类似功能,但Instagram的版本我认为做得更好,这也是我现在使用的版本。

Famously, Snapchat kind of had it first, but then Instagram implemented it, what I believe to be much better, and that's the one that I use.

Speaker 0

我记得在另一次采访中你解释过思考过程,比如Snapchat不允许上传已有照片,必须使用内置相机拍摄,体验很糟糕。

And I remember hearing you on another interview kind of describing the way you reasoned through, which is like, on Snapchat I couldn't upload my own photos, had to use the inline camera, and that was just a poor experience.

Speaker 0

但在Instagram上可以上传我自己觉得更精美的照片,我更喜欢这种方式。

But on Instagram, could upload my own photos that I thought were much more beautiful, and I much prefer that.

Speaker 0

所以我很好奇这背后的思考过程,以及你如何将这种思路应用到像谷歌这样的公司,现在你正在为这项新AI技术开发产品。

So I guess I'm kind of wondering the thought process that was behind that, and how you apply that to companies like Google, where now you're developing product for this new AI technology.

Speaker 1

是的。

Yeah.

Speaker 1

有趣的是,我觉得从那次经历中能学到很多关于产品构建的相似之处。

Interestingly, I feel like there's a lot of similarities in terms of what to learn about product building from that experience.

Speaker 1

我认为最主要的教训是,如果一个产品深受大众喜爱并被广泛使用,你就不应该彻底颠覆它,因为用户已经形成了一种自然的、习以为常的使用路径,每天都有数十亿次这样的行为。

And I think, you know, the main one is is that if you have a product that's beloved and used by lots and lots of people, you don't wanna dramatically upend that on people, because there's a there's a natural, well worn path that people are traveling every day, billions of times a day.

Speaker 1

你不能某天突然出现,让用户感觉世界都颠倒了。

And you don't wanna just show up one day and just have it feel like upside down world.

Speaker 1

那样对任何人都没有好处,只会制造一堆问题。

Like, that's just not a service to anyone and will create a bunch of problems.

Speaker 1

话虽如此,如果你正在开发的领域与用户原本的需求直接相关——比如谷歌的信息服务,但用户希望能做得更多——

Now that said, if if you're building in a space where your need for your product is directly connected to what people already come for, in this case, information at Google, but people just wanna do more for it.

Speaker 1

那就存在一个自然的机会来扩展服务范围,就像人们来到Instagram是为了分享照片,而后来发现通过'故事'功能与朋友互动可能是更好的方式,因为这种轻松短暂的格式允许你收到私信、与朋友愉快交流并保持联系,整个系统运作良好,但它并没有取代Instagram原有的功能。

There's a natural opportunity to expand what you can do for people in the way that people came to Instagram to share through photos, and there's turns out there was a potentially even better way to do that for friends through stories because it was this low pressure kind of ephemeral format and allowed for this you to get a DM and have a fun conversation with your friends and feel connected and that whole system really worked well, but it didn't replace Instagram.

Speaker 1

它成为了Instagram帮助用户的另一种方式。

You know, it it became an additional way that Instagram could help you.

Speaker 1

我认为AI与搜索的关系也是如此。

And I think of AI and search in the same way.

Speaker 1

人们每天数十亿次地使用谷歌获取信息,实际上在AI出现之前,人们就已经尝试在搜索框中输入各种奇怪的内容了。

People come to Google every day for information, billions and billions of times a day, And actually people have tried typing in crazy stuff into search even before AI existed really.

Speaker 1

而以前你确实做不了什么,我甚至可能翻到搜索结果页的底部,告诉你我们找不到任何相关信息。

And before you couldn't really do much, you know, I might even get to the end of the search results page and say we couldn't find anything.

Speaker 1

但现在你几乎可以帮忙解决任何问题。

But now you can really help with almost anything.

Speaker 1

所以这感觉是件自然而然的事。

And so that feels like a natural thing to do.

Speaker 1

但基于同样的经验教训,你必须真正为用户需求而设计。

But from the same learning, you have to really design for the needs of your user.

Speaker 1

就像搜索领域一样,还记得你问今天发生了什么时,模型过去会回答'我不知道'吗?

And so in the same way for search, like, you know, remember when you asked the question about what was happening today and models used to be like, oh, I don't know.

Speaker 1

它可能会说'我的知识截止到一年前'之类的话。

I was trained up until a year ago or something.

Speaker 1

这对我来说总是显得很荒谬——仅仅因为技术发展路径,你竟然无法实时获取世界正在发生的事,哪怕只差100毫秒。

It's like, it just like, like, it always seemed crazy to me that you couldn't get information within a 100 milliseconds of what was happening in the world just because of the way technology evolved.

Speaker 1

但现在,你知道,这已经发展得非常不同了,特别是在谷歌,它能在近乎实时的基础上从我们所有知识库中查找信息。

But now, you know, this has evolved to be very different, but particularly in Google, it's finding information in near real time basis across all of our knowledge.

Speaker 1

所以我们认为这是我们能做得特别出色的地方。

So that's something that we think we can do uniquely well.

Speaker 1

另一个例子是关于视觉和灵感的。

And, you know, another example is around visual and inspiration.

Speaker 1

人们总是来搜索。

People come to search all the time.

Speaker 1

他们会搜索图片。

They search for images.

Speaker 1

他们会使用图片搜索,这本身就是一个巨大的搜索引擎。

They go to Image Search, which is a huge search engine in of itself.

Speaker 1

但人们也会寻找设计。

But people look for design.

Speaker 1

他们想要壁纸。

They want wallpapers.

Speaker 1

他们想要灯光设计的创意灵感。

They want, uplighting ideas.

Speaker 1

他们想重新装修孩子的卧室,于是浏览这些图片。

They want to redecorate their kids' bedrooms, and they browse for these images.

Speaker 1

如果你问AI这类问题,它会用文字描述如何设计卧室,我一直觉得这很奇怪。

And if you ask AI these kinds of questions, it'll, like, describe in text how to design a bedroom, which I always thought was really weird.

Speaker 1

现在有了视觉AI模式,你可以询问'帮我设计我女儿的卧室',寻找灵感和创意。

And so now with Visual AI mode, you can ask, help me design, you know, at my daughter's bedroom, looking for ideas, looking for inspiration.

Speaker 1

可以是关于任何主题的。

Could be about, you know, anything.

Speaker 1

你可以在选购时尚连衣裙时,AI模式会找到启发性的图片,然后你可以进行多轮对话。

You could be shopping for fashion dresses, and the AI mode will actually go find inspirational images, and then you can have a multi turn conversation.

Speaker 1

你可以说'实际上,我想要极繁主义的暗色调和超级忧郁的主题'。

You could say, actually, I want, you know, maximalist dark tones and super brooding theme.

Speaker 1

它能理解这种需求,并运用我们大量的镜头和图像处理技术。

It will like know what that means and using a lot of our lens and and technology for imagery.

Speaker 1

将整个网格从加州那种通透轻盈的风格转变为这种阴暗木屋氛围,它能在视觉上理解这种转变的含义。

Go change the whole grid from something airy and light in California to like this dark lodge vibe, and like it knows what that means visually.

Speaker 1

我认为这些方式能让谷歌根据用户需求,为世界增添独特价值,而不是仅仅试图打造另一个通用聊天机器人,这并非我们的初衷。

And I think these are ways that Google, based on what Google users need, can add needs, can add unique value to the world, you know, versus just trying to implement another kinda general purpose chatbot, which isn't what our intention is.

Speaker 0

我很好奇谷歌的独特优势是什么,以虚拟背景为例,我用它是因为墙上什么都没有。

I'm curious to understand what the advantage that is uniquely Google's because to that example, the reason I'm using a virtual background, I have nothing on my walls.

Speaker 0

很希望能在这方面得到些帮助。

Love some help getting some assistance on that.

Speaker 0

我知道谷歌擅长这个,但我想了解为什么谷歌在这方面具有独特优势。

And I understand Google's good for that, but I'd love to kind of understand why Google's uniquely good to that.

Speaker 0

因为如果我询问其他模型,比如x AI或Grok,它们实际上会去搜索互联网——而我认为大部分内容都是谷歌索引的。

Because if I ask another model, if I ask x AI or Grok, they'll they'll actually go and search the Internet, which I assume is mostly indexed by Google.

Speaker 0

谷歌作为谷歌本身,相比需要向谷歌发起查询,是否存在独特优势?

Is there a unique advantage to Google being Google versus having to query against Google?

Speaker 0

比如独特的数据集,谷歌特有的用户画像索引方式,这些是否让你们在同领域公司中脱颖而出?

Like, the unique dataset, the unique kind of profile indexing that Google does that separates you from a lot of other companies in the same space?

Speaker 1

是的。

Yeah.

Speaker 1

我是说,我认为有很多因素让我们在这些情况下能提供真正独特的帮助。

I mean, there's a bunch of things that I think allow us to be really uniquely helpful in these cases.

Speaker 1

比如,我认为其中之一就在于技术本身及其输入方式。

I mean, I think one is just in the technology and the inputs itself.

Speaker 1

我们在图像识别和视觉理解的多模态能力建设上已经投入了很多很多年。

Like, there's been many, many, many years of building multimodal capabilities for image recognition and visual understanding.

Speaker 1

所以我们的模型能够分割你体验中的背景,将注意力集中在物体的正确部分。

So our models are able to segment the background of your experience, put attention on the correct parts of the object.

Speaker 1

比如你说'嘿,我身后架子上的那棵小树或者那株小植物怎么样'。

If you were to say, like, hey, like, what's up with the you know, I want, like, a the the little tree behind me on the ledge or the the little, you know, plant.

Speaker 1

'架子是什么?'

Like, well, what's a ledge?

Speaker 1

明白吗?

You know?

展开剩余字幕(还有 236 条)
Speaker 1

‘在你后面’具体指的是什么位置?

And what does behind you mean?

Speaker 1

还有,底层架子与中层架子的区别是什么?

And and what what is the bottom shelf versus the middle shelf?

Speaker 1

模型如何知道该查看架子的哪个部分?

How do how does the model know which part of the shelf to look at?

Speaker 1

我们的模型对此理解得极为透彻且独具优势。

Our models really understand that really well and uniquely well.

Speaker 1

一旦你选定那个区域,你可能会说,嘿,把那株植物换掉。

Then once you select that region, now you might say, hey, replace that plant.

Speaker 1

我想要一株更好的。

I want a better one.

Speaker 1

好的。

Okay.

Speaker 1

那么,其他用户曾点击并使用过哪些视觉素材?这些素材在他们的探索过程中提供了怎样的灵感和帮助?

Well, what visual imagery have other people clicked on and used that are and and what has been inspirational and helpful for those people in those journeys.

Speaker 1

而你知道,我可能在网上给你找到一株勉强算植物的劣质盆栽,但如果没有排名或历史数据证明人们过去在其他植物搜索中觉得它有用,你可能不会知道这其实是株广受好评的植物——许多人在搜索办公室装饰植物时都点击并喜欢过它。我认为谷歌凭借其用户群体特性,能更直观地提供这类推荐。

Whereas, you know, I could probably find you like a janky plant that technically is a plant on the web, but without ranking or understanding if people found that useful in the past for other plant searches, you might not know that this is actually a really helpful plant that lots of people have found and clicked on and enjoyed when looking for office decor plants, which is something that I think Google might more intuitively be able to offer given the people that, come to Google to search for these kinds of things.

Speaker 2

我想把话题转向广告和盈利模式,因为在我看来,当每个人都拥有代表自己的AI代理在互联网上活动时——这些代理能替主人完成所有购物、拥有钱包权限并支配消费——现有模式将彻底崩溃。

I wanna shift the conversation to to ads, the monetization model, because it it in my mind, this breaks completely when everyone has an AI agent that represents them on the Internet, that does all their shopping for them, that has access to their wallet, spends everything for them.

Speaker 2

这个思维模型,或者说商业模式,在谷歌搜索中会如何崩溃?

How how does this mental model or, I guess, business model break in Google search?

Speaker 2

如果广告不是推送给人类眼球并试图吸引注意力,那如何与AI代理协作?

If you're not pitching adverts to human eyeballs and trying to get their attention, how does it work with AI agents?

Speaker 1

确实。

Yeah.

Speaker 1

我认为这里存在许多未知数。

I mean, I think there's a lot of unknowns here.

Speaker 1

这是个发展非常迅猛的领域。

This is a very kind of fast moving space.

Speaker 1

但有件事值得提一下(我想之前也说过),就是人们仍在大量进行着他们惯常的搜索行为。

But I think one thing to mention, and think I mentioned before, is that, you know, people are still doing at scale the kinds of questions that they're doing.

Speaker 1

因此,我认为目前人们利用智能代理能做更多事情。

And so I view this as, you know, right now, people's ability to do more with agents.

Speaker 1

所以感觉像是,你没法花上一个小时,比如说。

And so this feels like, you know, you can't spend an hour, let's say.

Speaker 1

比如,我最近试图搞清楚。

Like, I was recently trying to find out.

Speaker 1

比如,我在考虑买一个保险箱。

Like, I was looking at buying a safe.

Speaker 1

比如,我有一些文件。

Like, I have some documents.

Speaker 1

我的银行关闭了我的保管箱服务,他们不再提供这项业务了。

I that my bank closed my safety deposit box, and so they don't offer that anymore.

Speaker 1

就觉得,这太糟糕了。

Was like, that sucks.

Speaker 1

我或许该把这些都放进某个保险箱里。

I probably should put all these in a safe somewhere.

Speaker 1

对吧?

Right?

Speaker 1

而且去银行还挺烦人的。

And it was just kind of annoying to go to the bank.

Speaker 1

信息太多了,但这就是事情经过。

Too much information, but this is the story.

Speaker 1

所以买保险箱其实真的很复杂。

So it's actually really complicated to buy a safe.

Speaker 1

我用我们的深度研究产品(或深度搜索)查看了成百上千个不同地方,它生成了一份超详细的保险箱选购指南。

There's, like, all like, so I used our deep research product or deep search, and it's it looked at, like, hundreds and hundreds of various places, and it created this incredible guide to safes.

Speaker 1

里面涉及防潮性能差异、对保险的不同影响等等。

And it was like, there's different things around moisture, different implications on insurance.

Speaker 1

换作平时我根本不会花时间研究这些。

Like, I would have never spent time doing that.

Speaker 1

但现在做完之后,它保存了所有这些相关链接。

But now that I did it, it has all of these links.

Speaker 1

它包含了我已经阅读过的评论。

It has reviews I'm gonna I've read.

Speaker 1

理论上它提供了让我去购买这些保险箱的途径,否则我可能会无限期拖延这项琐事,永远不去做。

It has opportunities theoretically for me to go buy those safes in ways that I probably would just put this chore off indefinitely and, like, never do.

Speaker 1

这些不仅创造了新的发现机会,还为未来的变现和其他可能性铺路。

And those all create new opportunities, not just for discovery, but for monetization and other things down the road.

Speaker 1

显然,如果是代理任务——比如在某个无限时间线上模型足够了解我,理论上保险箱可能会直接出现在我家里——你甚至不需要向用户展示任何东西。

And then obviously, if there's if you're talking about agentic tasks where you never need to show anything to the user, like, theoretically, I don't know, on some infinite timeline and a model knows me so well, a safe would just show up in my house.

Speaker 1

这简直完美得不可思议。

That's, like, perfect somehow.

Speaker 1

虽然不确定我是否完全相信这种情况会发生,但假设它成立吧。

Don't know if I totally believe that that's ever true, but let's say it is.

Speaker 1

我认为事情会以我们无法完全预料的方式发展。

I mean, I think we'll things will just evolve in ways we don't totally

Speaker 2

是啊。

Yeah.

Speaker 2

听起来购物体验实际上变得更丰富了。

It it sounds like the shopping experience actually becomes richer.

Speaker 2

因此我们更深入地了解你的需求,无论是购买保险箱还是为即将到来的场合买新衬衫,这些都为谷歌提供了额外的信息。

And so we delve more into knowing what you want, whether it's purchasing a safe or buying a new shirt for an occasion that's coming up, and it feeds Google kind of like this additional information.

Speaker 2

你认为广告模式会以任何方式演变,还是保持现状?

Do you see the ad model evolving in any way or kind of staying where it is right now?

Speaker 1

我认为广告模式肯定会演变,因为形式在演变。

I think the ad model is definitely gonna evolve because the format evolves.

Speaker 1

通常来说,如果历史重演,广告本质上就是信息。

And typically, if history repeats itself, you know, ads and and is is information.

Speaker 1

它实际上是很有帮助的信息和内容,也是人们发现新业务和服务的一种方式。

And it's actually really helpful information and content, and it's also a way for people to discover, you know, new businesses and services.

Speaker 1

所以当转向移动端时,出现了一套新的移动广告形式。

And so when there was a shift to mobile, there's a new set of formats that came up for mobile.

Speaker 1

当转向视频和短视频时,出现了新型的视频广告形式,它们更显眼、更真实,人们在谈论产品,感觉很好。

When there's a shift to video and short form video, there's a new type of ad formats for videos, and they're taller, and they're more authentic, and there are people talking about products, and it feels great.

Speaker 1

因此我认为在AI领域,我们会看到类似的情况。

And so I think in the AI world, we'll see something similar.

Speaker 1

在一个代理主导的世界里,你可能再次看到类似的情况,它会感觉更自然,就像在对话中顺便提供一些信息。

And in an agentic world, you might see something similar again, and it'll feel more natural to the format of, hey, they're like, you're just kinda talking and and here's some information.

Speaker 1

顺便说一句,这里可能有个优惠信息你想了解一下,我觉得你已经看到一些相关的实验了。

And by the way, hey, here's maybe another here's a deal you might wanna know about, which, you know, I think you're starting to see some experiments around.

Speaker 2

但我们必须直面这个敏感问题——谷歌掌握着巨大的权力。

But, I mean, we have to address the elephant in the room, which is like, this is a lot of power for Google to hold.

Speaker 2

对吧?

Right?

Speaker 2

那么你认为该如何平衡这两者——既要对用户提问给出有帮助且真实的回应,又要在回答中嵌入赞助内容或产品?

So, like, how do you think about treading that line behind, like, you know, responding to a user prompt in a way that's helpful and factual with also kind of like giving sponsored content or a sponsored product embedded into that response.

Speaker 1

是的。

Yeah.

Speaker 1

我认为这是谷歌在大部分发展历程中都必须面对的问题。

I I think this is something Google's we've had to do for most of the existence of Google.

Speaker 1

我是说,人们已经因为这类任务而使用谷歌,而且页面上也有广告,你知道的,在结果旁边。

I mean, people already come to Google for these kinds of tasks, and there's ads, you know, and, you know, on a page with results as well.

Speaker 1

我认为原则保持不变。

I think the principles stay the same.

Speaker 1

就像,第一,我们有一个诚实的结果政策。

It's like, one, we have an honest results policy.

Speaker 1

广告不会影响你所看到的核心体验。

Like, ads will not affect the core experience of anything you see.

Speaker 1

所以在人工智能领域,这没什么不同。

So in AI, it's no different.

Speaker 1

它不会影响排名。

Like, it will not affect ranking.

Speaker 1

广告商无法改变AI向你推荐的有机回复。

An advertiser can't change, you know, the organic reply of what the AI is recommending you.

Speaker 1

这并不意味着你不能插入发现新事物的机会,但这些内容需要非常透明地标注,向用户展示,嘿,这是你可能想知道的东西。

And now it doesn't mean you can't insert opportunities to discover new things, but then those things need to be labeled really transparently to show the user, hey, this is something that you might wanna know about.

Speaker 1

这是一则广告。

This is an advertisement.

Speaker 1

就像如今在搜索中的运作方式完全一样。

The same way that exact way that it works today on search.

Speaker 1

所以我认为原则不会改变。

So I think the principles don't change.

Speaker 1

每当人们获取信息的方式发生重大变革时,你都需要从根本上重新思考这些原则。

You just you have to kinda rethink them foundationally every time there's a major move in how people consume information.

Speaker 1

我认为你们在视频时代、移动时代都见证过这种转变,而如今在这种更偏向对话的新范式下,人们会再次看到这种情况。

I think you saw that with video, saw that with mobile, and I think people will see that again in this kind of more conversational based paradigm that we're seeing.

Speaker 0

你对于这种形式要素或其存在方式有什么想法吗?

Do you have any ideas of of what that form factor is or how that exists?

Speaker 0

因为据推测,如果这些AI概览导致某些发布商的点击量减少,那么显然需要某种变现或控制机制。

Because I guess the perception is that if these AI overviews trigger fewer clicks for some of the publishers, then surely there needs to be some sort of monetization or controls.

Speaker 0

你们是否正在考虑实施某些应该存在的方案?有什么具体想法吗?

Do you do you have any ideas that you're considering implementing of of things that should exist?

Speaker 1

我是说,我认为我们目前正在进行实验。

I mean, think we're running experiments right now.

Speaker 1

我不认为任何人...我觉得这是一个学习过程,但原则是相似的,即如果你搜索信息,你应该能够找到它、深入了解,并对所看到的内容拥有控制和透明度。

I don't think anyone I think this is a learning exercise, but the principles are are similar, which is that if you search for information, you should be able to be able to find it, go deeper, and have control and transparency over what you're seeing.

Speaker 1

我认为我们已经在不同的AI界面上开始进行实验。

And I think we've started to experiment particularly in different AI surfaces.

Speaker 1

因此,在AI概览和AI模式下,现在有在这些体验中投放广告的实验,以了解哪些方式可能效果良好。

So with AI overviews and AI mode, there's experiments now with advertising across those experiences to learn about what could work well there.

Speaker 1

但我想说这些仍处于早期阶段。

But I'd say those are still in the earlier days.

Speaker 1

然后我认为最终我们会发现,如果你搜索并看到AI概览,那些页面的变现方式与不含AI的页面大体相似。

And then I think ultimately what we find is that if you search and you see an AI overviews, you know, those pages, you know, largely monetize very similarly to ones that don't have AI in them.

Speaker 1

所以你差不多会达到这样一个点:你在搜索某样东西,一旦你进入漏斗深处,比如我在找这个去除番茄酱的方法,我只想知道如何以这种方式擦拭。

And so you kinda, like, get to a point where you're searching for something, and once you're kinda down the funnel of, like, I'm looking for this ketchup removal thing, and I just wanna know I just need to know how to blot it in this way.

Speaker 1

事实证明,在那个时刻我可能根本不想购买任何产品。

Like, turns out I was very unlikely to probably want to go buy a product in that moment.

Speaker 1

我当时可能只是想立刻知道接下来两秒该怎么处理,因为沙发上正有个紧急状况等着我解决。

I probably just wanted to, like, know how to deal with this in the next two seconds because I have, like, I've got an active situation on the couch that I need to deal with.

Speaker 1

因此你会逐渐明白哪些时刻最适合帮助人们发现新事物,并从中学习。

And so you you kinda learn you'll also learn what the moments are that are gonna be most helpful for people to discover new things and from there.

Speaker 1

另外我想总体说的是,你提到了链接以及如何鼓励和帮助人们发现网络。

The other thing I'll just say overall is that you mentioned links and and how how to encourage and understand how people can discover the web.

Speaker 1

这绝对至关重要,是我们所有工作的基础设计原则。

This is absolutely essential and something that we take is like a foundational design principle to everything we do.

Speaker 1

我认为谷歌和搜索引擎对网络的重视程度,可以说超过任何其他公司或产品。

I think Google and Search, you know, care more about the web than arguably any any any company, any product out there.

Speaker 1

所以我们模型的独特之处在于,它们能利用并理解所有搜索信号,知道哪些网站对特定问题真正有用。

And so one thing our models do uniquely is they actually are using and understand all these search signals so they know for a given question what websites are really useful.

Speaker 1

因此你会看到,我们的方法不仅是提供有用的相关链接,还会将它们嵌入内容中。

And so you see, you know, what our approach here is not only to provide helpful links alongside, but also to embed them.

Speaker 1

这样当你在阅读时,可以随时点击感兴趣的内容深入了解更多。

So as you're reading, you can click and go deeper for anything that you see.

Speaker 1

我们发现人们确实会点击,并且他们确实想要深入了解。

And what we're finding is that people do click and they indeed wanna go deeper.

Speaker 1

只是模式正在改变,人们现在更倾向于先获取背景信息。

Just the paradigm's kinda changing where people want context first.

Speaker 1

他们想先大致了解情况,然后再深入点击查看。

They kinda wanna get a sense, a gist of things, and then they wanna click in.

Speaker 1

比如我想办信用卡或买床垫,最终我可能想看看专家怎么说,读完整篇文章,但我会先获取一些表层信息,然后再去阅读。

So say I'm trying to get a credit card or buy a mattress, like, I ultimately probably wanna read what the experts are gonna say about something and read a whole article, but I'm gonna get a little bit of superficial information first, and then I'm gonna go read.

Speaker 1

比如说,我可能会先看看社交媒体上的讨论。

Let's say there's I'm gonna read what people say probably, like on some social media threads.

Speaker 1

我会阅读专业人士的观点,那些花大量时间分析这些内容的付费专家,然后我才会做出购买决定。

I'm gonna read what experts say, like who are paid professionals, who spend a lot of time analyzing this stuff, and then I'm gonna make and then I'm gonna purchase.

Speaker 1

这就是我们观察到的现象。

And that's what we see.

Speaker 1

总之,我认为我们的工作是让这些连接成为可能。

But anyway, think our job is to make those connections possible.

Speaker 1

我们的希望是,由于AI在很大程度上是对现有搜索功能的渐进式提升,它能为你连接更多新服务和新网站,这些是你原本可能无法发现的,因为AI的搜索范围比你个人能进行的更为广泛。

And our hope is that AI because it's incremental largely to to what we see in search, there are new opportunities to connect you to new services and new websites that you wouldn't have found because the AI is also doing broader searching than what you would do.

Speaker 1

因此,长期来看,AI也有望促进更多新发现的产生。

And and so the hope is that you also can promote discovery long term as well.

Speaker 0

好的。

Okay.

Speaker 0

这正体现了正和博弈的理念——当满足好奇心的答案变得更容易获取时,实际的搜索行为和好奇心总量会远超多数人的想象。

And then, yeah, that that gets to the the positive sum pie, where there's just a significant more search and curiosity than I think a lot of people perceive as it becomes easier to unlock the answers to that curiosity.

Speaker 0

我有个问题想问。

So there's one question that I had.

Speaker 0

我们节目之前邀请了Gemini团队的洛根·基尔帕特里克做客。

We actually, we had Logan Kilpatrick on the show, who is part of the Gemini team.

Speaker 0

非常出色的嘉宾,非常精彩的节目。

Amazing guest, amazing episode.

Speaker 0

建议大家听完本期后,都去找来那期节目收听。

I suggest everyone go find that and listen to it after you're done with this one.

Speaker 0

但这让我得以一窥那个团队——谷歌Gemini团队的工作内幕,这让我联想到曼哈顿计划那种模式:一小群人在研发这种高度智能的AI,但它又有些独立运作,同时与谷歌搜索保持着接口联系。

But what it unlocked for me was kind of the behind the scenes look at what that team does, what the Gemini team does at Google, and it kind of it reminds me of this Manhattan Project type thing where there's this small subset of people working on this really intelligent AI, but there's a slight disconnect where it's it's kind of under a separate thing, and it interfaces with Google and Search.

Speaker 0

所以既有谷歌,也有Gemini。

So there's Google, but there's also Gemini.

Speaker 0

我好奇的是谷歌与Gemini团队之间的关系是怎样的,你们是如何协同工作将这些产品整合在一起的。

And I guess what I'm curious is is kind of what that relationship is like between Google and the Gemini team, and how you guys kind of work to integrate these products together.

Speaker 0

因为既有Gemini AI Studio,又有谷歌搜索。

Because there's the Gemini AI Studio, and then there's Google Search.

Speaker 0

那么对于我和听众来说,该如何划分并理解这两个实体之间的协同效应呢?

So kinda what's a what's a good way for myself and for people who are listening to kind of compartmentalize and see where the synergies lie between those two entities?

Speaker 1

是的。

Yeah.

Speaker 1

我们与谷歌DeepMind的Gemini团队合作非常紧密。

I mean, we work incredibly closely with the kinda Google DeepMind Gemini teams.

Speaker 1

可以这样理解:这些基础模型越来越擅长理解任何问题,并帮助查找或生成相关信息。

The way to think about it is there's these foundational models that are increasingly able to understand any question and help find information about it or generate information about it.

Speaker 1

有许多人正从多个维度探索这一领域的前沿,他们中的许多人正是从事这方面的工作。

And there's people working on the frontier of what that looks like in many ways, and then those are many of those folks.

Speaker 1

而如何将这些技术转化为人们日常使用并喜爱的产品,则主要依赖于产品团队的努力。

But then how that is brought to life with products people use every day and love is really around the product teams.

Speaker 1

显然,谷歌搜索是目前人们与AI互动最广泛(虽非最大)的途径之一。

And so Google Search obviously is one of the largest, not the largest, you know, ways that people interact with AI, even today.

Speaker 1

我们保持着极其紧密的合作,将其视为推动前沿发展的助力,特别是在模型如何利用信息这一领域。

And and we work extremely closely and kinda think of it as helping, you know, really push the frontier, particularly for how models are using information.

Speaker 1

我们将密切协作,将这些模型整合到搜索中,进行定制优化,使其完美适配我们讨论过的所有应用场景。

And and we'll work closely to bring those models into search and customize them and make them work really well for all the things we just talked about.

Speaker 1

人们会用它获取卧室装修灵感。

People are kinda get bedroom inspiration.

Speaker 1

他们会拍摄物品照片进行查询。

They're taking photos of things.

Speaker 1

他们会咨询营业结束时间。

They're asking about closing times.

Speaker 1

建模团队会更多地从能力角度来思考这个问题。

And and what'll happen is the modeling team will think about it more as capabilities.

Speaker 1

比如说,你可能希望模型具备使用工具的能力,或者拥有推理能力,让模型能进行更深层次的思考。

Like, let's say you want the capability for the model to use a tool because or to have reasoning so that the model can think a little bit more.

Speaker 1

于是就会有人去研究这个方向,并添加这种能力。

So someone will go research that and and add that capability.

Speaker 1

从搜索的角度来看,它使用的工具类似于金融工具。

And then, you know, from a search perspective, the tools that it's using is something like finance.

Speaker 1

因此它可以实时查询金融信息。

And so it can make a real time query to look up financial information.

Speaker 1

比如你现在询问任意两支股票的情况,进入AI模式后输入'比较这两支股票过去六个月的表现'。

If you right now, if you ask about any two stocks, you say like, you go to AI mode and you say compare, you know, last six months of these name two stocks, put them in there.

Speaker 1

它会调用Google Finance接口获取实时行情和历史数据,然后将这些信息可视化呈现。

It'll actually use Google Finance at the pool and make a request for live information and stock historical data, and then it'll plot that information in the ground.

Speaker 1

这使用了Gemini作为基础模型,它既能理解你的问题本质,又能调用各类搜索工具——这种结合非常精妙。

And that uses Gemini as a model, so it has this foundational ability to, like, understand what the question is that you were asking, but then it has the search ability to, like, use all of these search tools, which is really cool.

Speaker 1

然后它就能为你生成那种类型的回复。

And then it can generate that kind of a response for you.

Speaker 2

我能就这个问个后续问题吗?

Can you just a follow-up question on that.

Speaker 2

你能带我们了解一下当初将Google搜索与MLM整合的过程是怎样的吗?

Can you can you walk us through what that was like integrating Google Search as it was before with an MLM?

Speaker 2

比如,想必你们在整合数据集等方面遇到过一些摩擦吧?

Like, presumably, there was, like, some friction that you ran up against where there was, like, combining datasets and stuff.

Speaker 2

我很好奇当时的具体情况。

I'm I'm curious what that looked like.

Speaker 1

我认为本质上并没有太多摩擦。

I don't think there was too much friction per se.

Speaker 1

我的意思是,关键在于当你把模型引入这个组合时,必须以非常特定的方式操作,因为模型对信息的响应方式与既有的搜索架构有着不同的倾向性。

I mean, I think the main thing is when you're adding a model into the mix, it has to just be done in a very specific way because models have different tendencies in terms of how it responds to information than kind of the the the otherwise, the search stack that has been built out.

Speaker 1

但它们现在已经在搜索环境中和谐共处了。

But they work together now, you know, harmoniously in the search environment.

Speaker 1

因此,有些问题会产生这些AI生成的回答,而其他问题则有不同类型的AI增强功能。

And so some questions kind of produce these AI responses, and then other ones are kind of have different AI AI enhancements.

Speaker 1

但我不确定你指的是否是某种更具体的矛盾点。

But I'm not sure if there was a more specific kind of tension you were alluding to.

Speaker 2

没有。

No.

Speaker 2

没有。

No.

Speaker 2

我只是好奇是否存在显著的提升,无论是开发层面还是用户偏好方面遇到的情况,但听起来似乎并不多。

I was just curious whether there was massive kind of uplift, whether that was developmental or on kind of, like, taste making for users that you kind of, like, ran up against, but it sounds like there wasn't much.

Speaker 2

罗比,我还有个突发奇想的问题,我很喜欢谷歌搜索的个性化功能。

Another kind of wildcard question that I had, Robbie, is I like the personalization of using Google Search.

Speaker 2

有个我甚至没意识到的例子是,当我输入某人的名字,比如一位名人,它就会显示出他们的净资产之类的信息。

One that I wasn't even aware of was if I would type in someone's name, for example, a celebrity, and it would come up with their net worth or something.

Speaker 2

我会向朋友展示,说,看,这是谷歌上搜索最多的内容。

And I would show my friends, I would say, see, this is the most searched thing on Google.

Speaker 2

然后他们会说,嗯,不完全是这样。

And they would be like, well, not quite.

Speaker 2

就像,它某种程度上是在查看你之前搜索的cookie记录,可能是在根据你可能想搜索的内容调整它的偏好。

Like, it's kind of looking at the cookies that you've searched before, and maybe it's kind of like adjusting its preferences to what you might wanna search.

Speaker 2

关于这个话题,当涉及到为某人个性化Google AI搜索时,我们能深入到什么程度呢?

On that theme, how far down the rabbit hole can we go when it comes to personalization of Google AI search for someone, right?

Speaker 2

比如,是否存在一个世界,在那里我不仅仅是发现新的网站、应用或互联网体验,而是这些体验高度定制化,并且针对其他与我个人独特相关的数据集?

Like, is there a world where I'm not just kind of like discovering new websites or apps or experiences on the internet, but it is highly tailored and personalized to other data sets that are kind of like unique to me, right?

Speaker 2

它知道我的购物偏好,某种程度上知道我银行里有多少钱。

It knows my shopping preferences, kind of knows how much money I have in the bank.

Speaker 2

它大概知道我几周后有个活动要参加。

It kind of knows that I have an event coming up in a couple of weeks time.

Speaker 2

对你来说,那样的世界会是什么样子?

How does that world look for you?

Speaker 1

是的,我是说,我认为首先有整个Google,然后是人们通常用于AI的问题。

Yeah, mean, I think first of all, there's Google overall, and then there's the questions that people typically use for AI.

Speaker 1

我认为,对于谷歌整体而言,存在许多并不适合个性化处理的问题。

I think for Google overall, there's plenty of questions out there that aren't great to be personalized.

Speaker 1

因此我们经常思考个性化带来的差异化价值。

And so we think about a lot the differential value in being personalized.

Speaker 1

如果你问帝国大厦有多高。

If you ask about how tall the, Empire State Building is.

Speaker 1

这基本上就是个事实性信息。

It's kind of just like a factual piece of information.

Speaker 1

可能在信息来源上有些个性化设置,比如你特别偏好某些特定提供方的事实数据,但很多内容并不适合个性化。

Maybe there's some personalization on the source, if you really like certain facts or something from specific providers, but many things are not great.

Speaker 1

我认为存在这样一种价值:拥有一个通用平台,可以看到针对特定问题呈现的内容。

And I think that there's a value in just having this kind of universal place you go to seeing what things are showing up for a given question.

Speaker 1

话虽如此,我认为很多问题恰恰相反。

And that said, I think many questions, it's the opposite.

Speaker 1

不进行个性化反而显得奇怪。

It's almost weird not to personalize it.

Speaker 1

比如,你问'我该买哪种牛仔裤?'

Like, you say like, what kind of jeans should I get?

Speaker 1

这就像,嗯,我不知道

It's like, well, I don't know.

Speaker 1

比如,你想要什么样的牛仔裤,对我来说,这和随便找世界上一个人给他选牛仔裤会非常不同

Like, what what what kind of jeans would you like, for me, that's gonna be super different than, you know, some other if I were to grab a random person in the world and and try to get them some jeans.

Speaker 1

对吧?

Right?

Speaker 1

所以我认为在这些AI体验中,比如AI概览和AI模式,人们更多会提出这类寻求建议的推荐性问题

And so and I think it turns out that in these AI experiences, like AI overviews and AI mode, people have a lot more questions that are these kinds of advice seeking recommendations questions.

Speaker 1

他们想知道晚餐该去哪里吃

They wanna know where to eat eat eat eat for dinner.

Speaker 1

他们想知道该带家人去哪里旅行

They wanna know where to travel with their family.

Speaker 1

他们想知道这类更主观的问题

They wanna know it's kind of in this more subjective camp.

Speaker 1

这要看情况。

It kinda depends.

Speaker 1

因此,我们认为AI有巨大机会通过更了解你来提供独特帮助。

And so we think there's a huge opportunity for our AI to know you better and then to be uniquely helpful because of that knowledge.

Speaker 1

我们在IO大会上讨论过,AI如何通过Gmail等关联服务更懂你,从而逐渐掌握你的品牌偏好。

And one of the things we talked about at IO was how the AI can get a better understanding of you through connected services like Gmail so that over time it could really know, wow, like, you tend to like these kinds of products or brands.

Speaker 1

他们刚发布了另一个新功能。

Here's another one that just came out from them.

Speaker 1

这比泛泛推荐要有用得多。

And how much more useful that would be than just generically showing you.

Speaker 1

比如,我不确定现在销量前十的牛仔裤品牌通用列表对你有多大参考价值。

Like, I I don't know if you just whatever the top 10 selling gene brands are right now, generically, a list of that, and how you would how how helpful that would be.

Speaker 1

所以我们的愿景是打造真正懂你的个性化服务,但要注意信息定制的分寸。

And so that is, I think, much the vision of building something that can be really knowledgeable for you specifically, But I think there's a nuance there in in in what you what you personalize.

Speaker 1

我们的理念是让用户清楚看到:哪些推荐基于你的兴趣、历史行为,哪些只是通用建议。

And I think our our thought on this is that the user's always seeing, like, what parts of information are being shown because of your interests or or purchases or things that you've done in the past or things that it it might think you might like versus things that it's just suggesting generally.

Speaker 1

因此我认为人们希望能直观地理解何时信息是为他们个性化定制的,何时是所有人提问都会看到的通用内容。

So I think people wanna intuitively understand when they're being personalized, when information is made for them versus when in something that everyone would see if they were to ask this question.

Speaker 1

这某种程度上体现了群体智慧,可以说,就像谷歌搜索结果页呈现的那样。

And this is kind of the wisdom of the crowds, you know, so to speak, represented in a Google search page.

Speaker 0

随着节目接近尾声,我喜欢以更乐观的基调结束。

As we get closer to the end of the show, I love ending on a more optimistic note.

Speaker 0

所以我想聊聊未来愿景,作为为数十亿用户打造产品的设计师,你期待怎样的未来。

So I wanna talk about the future vision, the future that you're excited about as someone who builds products that billions of people use.

Speaker 0

有趣的是,今天我刚收到通知,吉姆和我推出的功能现在出现在我的电视、汽车、手机,还有谷歌手表和耳机上——那汽车呢?

So it was funny, just today I saw a notification that Jim and I rolled out on my TV, and now I have it on my car or on my phone, and then I have it, like, on some Google watches and some earbuds, And what about cars?

Speaker 0

我桌上的那些设备又该如何?

What about things that are sitting on my tabletop?

Speaker 0

我好奇的是,作为产品设计师,当搜索无处不在却很少需要手动输入时,你理想中产品融入生活的场景是怎样的?

What I'm curious to ask you is, like, what does the ideal day for you as a product designer where your product slots into this look like for someone when search is kind of everywhere and rarely typed?

Speaker 0

最终形态会是什么样子?这将如何改变人们的日常生活方式?

Like, what does the final form look like, and how does that change the way people go about their day to day lives?

Speaker 1

是的。

Yeah.

Speaker 1

我认为这并不是最终形态。

I think that really it's not a final form.

Speaker 1

更像是多种形态并存。

It's more of like a multi form.

Speaker 1

我不确定这个词是否准确,但这就是我

I don't know if that's a word, but that's kind of how I

Speaker 0

的想法。

think about it.

Speaker 0

明白了。

Understood.

Speaker 0

对。

Yeah.

Speaker 1

可以说并不存在单一形态。

It's kinda like there's not a single form.

Speaker 1

我认为它是具有适应性的。

I think that the it's adaptive.

Speaker 1

因此,你思考它的方式是这样的——我非常喜欢思考那些持续多天的旅程,人们有各种待满足的需求,他们会在过程中逐步解决这些问题。

And so the way that you think about it is I I love thinking about these journeys where they're multi day, people have needs that are kinda like pending and they're kinda working on them over time.

Speaker 1

你知道,就像很多人会为公寓选购沙发这样的事。

You know, like I think about a lot people like buying a couch for their apartment, let's say.

Speaker 1

这并不像看起来那么简单。

And it's not just like an easy thing.

Speaker 1

比如你可能正用电脑做些研究,想为纽约的公寓找些很酷的沙发。

Like you might be on your computer, you're doing some research, and you wanna just find cool couches for like apartment in New York, let's say.

Speaker 1

如果你今天使用AI模式询问,你会看到一个沙发视觉网格,可能会点击其中几个,它还会推荐你可能喜欢的款式。

You if you use AI mode today and you ask about it, you'll see a visual grid of couches, and you might actually click on a few of those, and it might recommend you some things that you might like.

Speaker 1

好的。

Okay.

Speaker 1

很好。

Great.

Speaker 1

你会觉得,哦,这些还挺酷的。

You kinda like, oh, Those are kinda cool.

Speaker 1

我得考虑考虑。

I'm gonna think about it.

Speaker 1

然后你出门散步,路过一家家具店,或者在朋友家看到让你心动的东西。

Then you go in for a walk, and you walk by, you know, like a furniture store, and you see something that strikes you at a friend's house.

Speaker 1

接着你打开应用,回到之前的对话,上传一张照片说:'哦,其实这个才是我真正想要的超棒的东西'。

And then you go to your app, you go back to your thread, you upload a photo of it, and you're like, oh, actually this is the thing that's super awesome.

Speaker 1

就像,我真正想要的是类似这样的款式。

Like, I actually wanted this is like stuff that's like this.

Speaker 1

太好了。

Great.

Speaker 1

谢谢。

Thanks.

Speaker 1

这里有些类似的东西。

Here's some things like that.

Speaker 1

好的。

Alright.

Speaker 1

先把这个放一边,等等等等。

Put that away another time, blah, blah, blah, blah, blah.

Speaker 1

假设你开车时突然想到,其实你真正想要的是另一种颜色。

And let's say you're driving and you like pop into your head that you really actually realize that this other color was one you wanted.

Speaker 1

你上线说,嘿,记得我之前提过沙发的事吗?

You go live and you say, hey, remember I was kind of talking about couches?

Speaker 1

其实我喜欢这个颜色,而且主要考虑这两种。

I actually like this color and I'm mostly focused on these two colors.

Speaker 1

给我推荐些这类的款式吧。

Send me some recommendations for those.

Speaker 1

太好了。

Great.

Speaker 1

明白了。

Got it.

Speaker 1

砰。

Boom.

Speaker 1

然后它就会那样做。

And then it does that.

Speaker 1

几天后,你可能会收到一条推送提醒,说:'你之前考虑过的那款正在特价促销,正好符合你的需求。'

And then a few days after that, maybe you get a push alert that's like, there's a deal where one of the ones you were considering in the exact way has that.

Speaker 1

而且,你知道的,就像是在打折促销。

And, you know, and there's like a sale.

Speaker 1

我不确定。

I don't know.

Speaker 1

可能是网络星期一快到了,或者是黑色星期五,总之是大型购物季。

It's a, what, Cyber Monday coming up, a Black Friday, all of the above, big shopping season.

Speaker 1

也许你之前喜欢的那款商品有货了。

Maybe one one of the things you were loving was available.

Speaker 1

你看,谷歌现在就是通过这些方式,跨越不同模式和你生活的各个方面,在这个需求上为你提供难以置信的帮助。

Like, these are all these ways that Google now, across modes, across kinda different aspects of your life, being incredibly helpful to you for this need.

Speaker 1

我认为这就是我对搜索未来的更多思考,而非任何单一具体功能或某种固定形态。

And I think that's more of how I think of the future of search than any any one specific feature or or kind of, you know, single form factor.

Speaker 2

好的,我们即将接近整点时间。

Well, we are just about coming to the top of the hour.

Speaker 2

罗比,非常感谢你百忙之中抽空与我们交谈。

Robbie, thank you so much for taking time out of your busy day to chat with us.

Speaker 2

这真是一次引人入胜的对话。

That was a fascinating conversation.

Speaker 2

我觉得生活中很多人往往把谷歌搜索带来的便利视为理所当然。

I think a lot of people in life just kind of take for granted what Google Search has brought for them.

Speaker 2

而这项技术如何演变,以及它如何渗透到我们生活的每个角落——尤其是当它完全实现多模态时——是非常值得深入理解的。

And kind of like the way that this thing evolves and the way that it kind of like permeates every facet of our life, especially when it goes fully multimodal, is super important to understand.

Speaker 2

感谢你带领我们完成这段探索之旅。

So thank you for taking us through that journey.

Speaker 2

《无限》的听众们,如果喜欢本期节目,请点赞支持。

Limitless listeners, if you enjoyed this show, please give it a thumbs up.

Speaker 2

我们知道很多听众还没订阅我们,所以请大家务必点击那个按钮,集中注意力完成订阅。

We know that a bunch of you aren't subscribed to us, so we need you to get on that button and and and kind of focus on that, please.

Speaker 2

如果你喜欢本期节目或本周收听过的任何一期节目,请给我们五星好评,我们下期再见。

If you enjoyed this show and if you enjoyed any of the episodes that you've listened to so far this week, please give us a five star rating, and we will see you on the next one.

Speaker 2

罗比,再次感谢你的参与。

Robbie, thank you again for joining us.

Speaker 1

谢谢邀请我。

Thank you for having me.

关于 Bayt 播客

Bayt 提供中文+原文双语音频和字幕,帮助你打破语言障碍,轻松听懂全球优质播客。

继续浏览更多播客