Microsoft: The economic growth formula has never been so clear—more electricity, more efficient grids, and more computing power!

Hard AI · May 29 18:42

Source: Hard AI

Note: This article is an interview by well-known tech analyst Ben Thompson with Microsoft CEO Satya Nadella and CTO Kevin Scott.

The interview covers Microsoft's newly released AI PC hardware and the strategy underpinning the current wave of artificial intelligence. The two Microsoft executives also explain how the partnership with OpenAI works and how Microsoft is reorienting the whole company around AI, and they discuss the competitive landscape and strategic thinking in cloud computing.

Core views:

1. Energy is everything. With more renewable energy, better power grids, and stronger computing power, every other area of the economy can benefit.

2. Microsoft's core competency lies in building platforms on which others can build their businesses and products.

3. AI is a once-in-a-decade, perhaps once-in-a-generation opportunity.

4. If you can't invest 50 to 60 billion dollars in capital expenditure every year, you don't get a ticket to the hyperscale cloud computing market.

5. The partnership with OpenAI is like the earlier partnership with Intel: it defined both the industry and Microsoft. Long-term, stable cooperation comes from a continuous win-win.

6. In the next five years, what the world needs most is ubiquitous hyperscale computing power.

7. Populations in many countries will shrink in the coming decades, and maintaining living standards will require a significant increase in productivity. AI can let people get what they need fairly, so they can live with dignity and live good lives.

The following is an abridged interview with Microsoft CEO Satya Nadella.

Topics: Microsoft's Adjustments | Copilot+ PC | Artificial Intelligence Platform | OpenAI Partnerships | Integration and Modularization of Artificial Intelligence | Capital Expenses and the Future

Microsoft's AI transformation

Ben Thompson: I used to work at Microsoft. In fact, I attended the first Build conference here in 2012, and it was a very unusual event; I'm not sure if you remember how it competed with another, consumer-centric event hosted by the Windows team in New York. Today, Microsoft seems to be a much more unified company. What cultural and organizational changes do you think have taken place since then?

Satya Nadella: Let me put it this way, Ben. My memories go back to the PDC (Professional Developers Conference) in '91, when we first discussed Win32. I was actually still working at Sun at the time; I hadn't even joined Microsoft. But the future was very clear to me: what the PC wanted to be, and the server architecture, were already evident in '91.

I think this is super important for any tech company: having what I call a complete thought. For me, a complete thought starts with "What is the systems innovation?", whether it's a chip, an operating system, or an application platform. Then: why is this desirable for a consumer or an app developer? So for me, whether on Azure or Windows, or even on this morning's Copilot+ PC, we need that kind of innovation. We hadn't had a complete thought like that for a long time; now, with Arm, I think we have it. We have an app platform, Microsoft is developing its own apps, and third parties are developing apps too. So culturally, being able to build a complete product is something I feel I must work toward.

Ben Thompson: You mentioned consumers and developers. You didn't mention the word enterprise. Do you think that complete thinking is working across all three of those levels today?

Satya Nadella: That's a great question. In my opinion, businesses are end users too. In fact, at Microsoft, when we're at our best, if you remember, we've always been a company of knowledge workers.

Ben Thompson: Exactly. But I think the split between a company's buyers and its users is sometimes a tricky problem.

Satya Nadella: Honestly, the company I joined in the early 90s wasn't like that.

Ben Thompson: Because back then there were only developers.

Satya Nadella: There were also end users. So one thing I always ask is whether we're really thinking about the end user. For me, Excel is a product for end users.

Ben Thompson: Definitely in the 80s.

Satya Nadella: Long before it became an IT product. Looking back, that doesn't mean I don't also want to meet IT needs now. In fact, Paul Maritz once said the magic lies in end users, developers, and IT, and to be a good enterprise company you must be good at coordinating the relationships among the three. If you can equate developers with end users, you can be a great consumer company, but these are all different categories, right? "Consumer" is now a very broad concept; it could mean many, many things. We're not going to make a great Hollywood movie or many other things, but when it comes to games and to what I call productivity, we want to do great things.

Ben Thompson: Yes, people have forgotten that Microsoft was disruptive not only in the consumer and PC fields but also in servers and hardware. Has the order of those priorities shifted over time? There was probably a time when Microsoft had high hopes for the consumer sector: people use PCs at work and then want to use PCs at home. Phones are clearly a bit different in this regard, but is that still Microsoft's sweet spot?

Satya Nadella: Yes, I think so. Take Windows: I want us to build great Windows PCs for both work and home users, even in terms of form factor. If the wave of artificial intelligence is here, let's redesign the operating system and the hardware for it. That's what I hope we can do well, and today is a good step in that direction.

Copilot+ PC

Ben Thompson: Yes, I'm really impressed; I found it very persuasive. It finally makes the case for why you should buy Windows, and that case hasn't really existed since the browser arrived and applications moved off the desktop. How do you capitalize on this? Will there be a major shift in how you go to market? Do you just rely on the OEMs, or do you think it will sell itself? Are you investing in ways you didn't before, or have you been investing all along?

Satya Nadella: We're always investing, but in tech, timing is everything, right? We've been working on Arm and we've been talking about NPU for a long time.

Ben Thompson: Yes, 10 years ago, Microsoft introduced the Arm PC, and I was there.

Satya Nadella: So the key point is that it's all coming together. Think about what just happened: with all of these models, we can now have models built in, whether from a privacy perspective, a latency perspective, or a COGS (cost) perspective, because when you use...

Ben Thompson: Honestly, on stage you sounded a bit like Apple. For one thing, you mentioned "MacBook" more times than Apple did in the Jony Ive era, which is clearly a deliberate comparison. There was a lot of talk about local privacy, but the local models you mentioned are also critical for COGS: from your point of view, if the computation runs on the customer's own electricity, it's essentially free. How far do you think you can push this? For example, with AI PCs I've been waiting for the specifications. You did say that 16GB of RAM is not bad for Windows, but it's still too small for AI.

Satya Nadella: Now, with 45 TOPS, I think we've taken the right first steps. I loved our first steps on the AI PC, but with this Copilot+ PC I think we've succeeded. By the way, I'm pretty sure distributed computing will remain distributed, so it's all consistent. Take Recall: I think it's a killer feature, and we've been working on it for a long time.

Ben Thompson: I've been using Rewind on my Mac, and it's definitely a superpower.

Satya Nadella: Yes. The point is that now that I have a Semantic Index, I can type natural-language queries and recall things, even facts we've forgotten. I memorize through sight and remember through association, and now I can recall without learning search syntax, just by typing my intent. The interesting thing about Recall, if you notice, is that it doesn't just take you back to a previous page: because of the semantic index, I can call up the content of that moment, and that capability requires a lot of edge computing.

Another interesting point, and one of my favorite demos, is that you can use the NPU (all 45 TOPS of it) while playing Call of Duty without killing the battery. The operating system knows how to properly use all the chips in the system, and I think this will be a real breakthrough for us.
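
To make the Semantic Index idea concrete, the sketch below builds a tiny Recall-style index in Python, embedding snapshot text and answering a natural-language query by vector similarity. The embed() function is a toy stand-in for a real embedding model (it only matches shared words, not meaning); Recall's actual implementation is not public.

```python
# Minimal sketch of a Recall-style semantic index (illustrative only).
import numpy as np

def embed(text: str) -> np.ndarray:
    # Toy bag-of-words hashing embedding. Real systems use learned models,
    # which also capture synonymy; this only matches shared words.
    v = np.zeros(384)
    for word in text.lower().split():
        v[hash(word) % 384] += 1.0
    return v / (np.linalg.norm(v) or 1.0)

class SemanticIndex:
    def __init__(self) -> None:
        self.snapshots: list[str] = []        # e.g. OCR'd text of screen captures
        self.vectors: list[np.ndarray] = []

    def add(self, snapshot_text: str) -> None:
        self.snapshots.append(snapshot_text)
        self.vectors.append(embed(snapshot_text))

    def recall(self, query: str, k: int = 3) -> list[str]:
        """Return the k snapshots most similar to a natural-language query."""
        q = embed(query)
        scores = np.array([v @ q for v in self.vectors])  # cosine on unit vectors
        return [self.snapshots[i] for i in np.argsort(-scores)[:k]]

index = SemanticIndex()
index.add("Slide deck: FY25 datacenter capacity planning")
index.add("Browser: recipe for mushroom risotto")
print(index.recall("mushroom recipe page", k=1))
```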

Ben Thompson: It's easy to demo; the drawing and image-enhancement features you showed on the Surface PC are really cool. But here's the eternal question: Android has the same problem Windows used to have. Yes, you've defined the specs for these AI PCs, but how do you deliver a consistent experience?

Satya Nadella: That's a great question, Ben. This has always been one of the challenges of our ecosystem. Frankly, I think we're all still learning how to do a really good job of the operating system together with the silicon, because frankly, it is a silicon issue.

For example, when we work with Qualcomm, it takes a lot of work to make sure their chips show up at their best. Intel and AMD are now doing similar things, which is great. If I use the cloud as an analogy: I'm happy to see some of the best people who know silicon deeply putting their energy into building great products.

Ben Thompson: When you're in the position of a challenger rather than a leader, do you find it easier to push all parties in the right direction?

Satya Nadella: I think so. This is all the result of competition. Competition makes us stronger and more disciplined, and when there's something to win, we execute more rigorously. So this is great, and we get the best chip innovations out of it. I'm not sure if you've noticed the OEM innovations...

Ben Thompson: I'll check it out later.

Satya Nadella: You should check them out. Dell, HP, Samsung, Acer, and Lenovo are all involved. Surface set the tone, but that's not to say no one followed. Frankly, we can pull all of it together, which is proof of the ecosystem and proof of leadership: the other hardware vendors think, let's try it. This is it, the once-every-ten-years, once-in-a-generation moment.

But even going back 30 years, this is where we launched Windows 95. You could say, "Oh, that was the heyday of Windows," but you know, even in the heyday of Windows we forgot one thing: the internet. That December, we shipped SR-1 with the browser. In the AI era, though, I feel we're structurally more complete, we have more chances to win, and the entire ecosystem is innovating with us.

Artificial intelligence platform

Ben Thompson: Here's a question. You've always called artificial intelligence a platform opportunity, and my question, which I was even preparing for your introduction, is this: to what extent can a platform opportunity be hardware-independent? Without a paradigm shift in the hardware, can you go to market and fully surface these capabilities?

In that regard, this talk was interesting because it was very concrete: "Look, this makes Windows better, and you get these features by buying a device." I once wrote that one of your great wins was essentially removing Windows from the center of Microsoft. Sure, it's important to you, but it's no longer the center of everything. Can you now go all-in in a way you couldn't before? How important is Windows to your future? Is it critical to realizing this platform opportunity, or could you still seize that opportunity on iOS or Android?

Satya Nadella: First, my approach is grounded in the world as it is, not as I imagine it. Second, I want us to integrate every layer of our stack into a cohesive architecture that serves the interests of developers and end users.

Ben Thompson: When I left Microsoft, I wanted to write a memo about Windows, and the timing was right, so I wrote that post. I thought it was crazy at the time, but now it seems even more meaningful.

Satya Nadella: Because to some extent, I honestly think we have to make sure we do the best work for the 200 million devices that have been sold. But that doesn't mean the other billion devices out there aren't important. For that other billion we need great innovation too, and I'll get to that, but first: for 200 million Windows users, what amazing things can we do with this platform shift? That runs from the chip to the experience to third-party developers. And by the way, this isn't isolated; Windows doesn't stand alone.

I don't know if you've noticed, but one thing is super critical today: adopting artificial intelligence at the edge. There are two challenges: how do you ensure privacy, and how do you ensure security? If your classifier isn't continuously learning from all the adversarial attacks of the past hour, it can't provide security at the cutting edge or on the latest model, and that work happens in the cloud. So I want to be able to call cloud services. It's like Windows Defender: how could you have Windows Defender without connecting to the cloud? The same goes for AI security. You want the cloud to do what it's good at and the client to do what it's good at; I think that's the key.

Another thing I'm really excited about is the Copilot Runtime. I wanted a real namespace. And by the way, the WebNN thing is so cool: I can write some JavaScript, use WebNN to fetch a model, and run it locally on the NPU. I can go to Gap.com or any website and start adding AI features, offloading the AI to run locally. I think the combination of cloud, web, and edge is one cohesive idea.
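
WebNN itself is a JavaScript API, but the local-first pattern Nadella describes can be sketched in Python too. The sketch below, an illustration rather than Microsoft's implementation, tries to run an ONNX model on a local NPU via onnxruntime and otherwise falls back to a hypothetical cloud endpoint; the provider names and model path are assumptions that vary by hardware and onnxruntime build.

```python
# Local-first inference with cloud fallback (illustrative sketch).
import numpy as np
import onnxruntime as ort

def load_local_session(model_path: str):
    try:
        # QNNExecutionProvider targets Qualcomm NPUs in onnxruntime-qnn builds;
        # CPUExecutionProvider is the portable fallback.
        return ort.InferenceSession(
            model_path,
            providers=["QNNExecutionProvider", "CPUExecutionProvider"],
        )
    except Exception:
        return None  # no usable local runtime or model

def call_cloud_model(inputs: dict) -> list:
    # Hypothetical: POST to your hosted model endpoint (e.g. an Azure AI deployment).
    raise NotImplementedError

def run(session, inputs: dict[str, np.ndarray]) -> list:
    if session is not None:
        return session.run(None, inputs)  # local: private, low latency, zero COGS
    return call_cloud_model(inputs)       # cloud: when the device can't do it

session = load_local_session("model.onnx")  # assumed model file
```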

In fact, it gave us a head start when building for Android too. At tomorrow's Build sessions you'll hear us talk about something I'm excited about: Phi. As a developer, you can now use Phi as a hosted model service on the Azure AI cloud, you can use the silicon that's already in Windows machines, or you can package it into your app and run it on Android and iOS. That's the direction we intend to take.

Ben Thompson: We weren't going to talk this much about Windows; I have some bigger questions. But this demo really was compelling enough to make me rethink the field from a fresh perspective.

Satya Nadella: We'll get you back on Windows, Ben.

Ben Thompson: I have nothing against Windows, I just hate change!

OpenAI partnerships

Ben Thompson: Microsoft seems to be a more unified company, as I mentioned before. When you visit customers, how important is it to them to know they have a unified company behind them?

Satya Nadella: You mean within Microsoft?

Ben Thompson: No, I mean external customers, the really big companies.

Satya Nadella: I think what our customers expect of us is, first of all, something I care about very, very much: because we're one company, all the parts fit together. Integration matters, but every layer must also stand on its own.

So my view of Microsoft is: no, we're not a conglomerate in the end; we have to have a real argument, and that argument is the cohesion of the architecture. Our customers care about the integrated value we bring, but they also care whether every piece is competitive on its own. So yes, customers care about that, and we have to hold ourselves to it internally. Our best state isn't just integration; it's integration plus competitiveness at every layer of the stack.

Ben Thompson: So when you talk about "one Microsoft" integration, how do you square the partnership with OpenAI? Have customer concerns about it increased? For example: "Microsoft is great, you're moving in the right direction, but there seems to be a dependency here that you can't control, which means we can't control it either." How do those conversations go?

Satya Nadella: For us, I think the partnership with OpenAI is like the partnership with Intel back in the day, or the partnership with SAP when we built SQL, because it defines both the industry and Microsoft. So we're deeply invested in this partnership. The logic is simple.

Ben Thompson: Whoever has computing power rules the world?

Satya Nadella: Exactly. Back in 2019, we made an unconventional bet. We decided we should put in a lot of computing power, because OpenAI had more conviction than any other company (more, even, than Microsoft insiders), and that bet has paid off over the past five years. I've been focused on securing the partnership for the next five years, which, as you know, is a critical period for both sides. At least, that's what I've done.

I think, for them, we provide infrastructure and they are the model builders. They build apps, we build apps, third parties build apps, and the cycle reinforces itself. There will be competition, including from players that are completely vertically integrated, and vertical integration can work very well. You have to keep an open mind, and sometimes collaboration is the only way forward.

Artificial intelligence integration and modularity

Ben Thompson: You mentioned OpenAI's conviction about compute, which Microsoft clearly relies on. Considering Google's lead in models and especially in infrastructure, is there, or should there be, an anti-Google alliance in artificial intelligence? Are we seeing such an alliance emerge, involving not just Microsoft and OpenAI but possibly Apple?

Satya Nadella: I look at it and say: listen, there will always be players who can vertically integrate. Looking back, there's the Gates/Grove model, the Apple model, and maybe now the new Google model; that's the vertical integration model. I think both models have merit.

In the long run, though, I believe more in horizontal specialization. Take chips: Nvidia CEO Jensen Huang is sitting there, very actively executing an incredible roadmap. And today, guess what? He needs to ensure that leading artificial intelligence models are trained on Nvidia chips.

And guess what? Google's models were not trained on Nvidia chips. Google sells Nvidia-powered computing, but its own models were trained on TPUs. I think Jensen notices this too. AMD CEO Lisa Su is innovating. We are making our own chips.

So everyone will want to innovate on chips and innovate on models. OpenAI, Meta with Llama, Mistral: they and all of these small language models are doing a great deal of work.

In any case, any of our applications, such as Copilot, will definitely use GPT-4o and mix it with Phi and others. I think what businesses are most interested in is model-as-a-service (MaaS). So I think this will be a more diverse market. At least, my lesson from history is that winner-take-all outcomes are very rare; we need to be very aware of that and be prepared where it does happen, but everywhere else we need a broad platform strategy.
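
Nadella doesn't spell out how such mixing works, but the routing idea can be sketched simply: send short, easy requests to a small model like Phi and escalate harder ones to the frontier model. The heuristic, threshold, and model names below are stand-ins, not Microsoft's actual logic.

```python
# Toy model router in the spirit of "mix GPT-4o with Phi" (illustrative only).
def complexity(prompt: str) -> float:
    # Crude stand-in: longer prompts and multi-step asks count as harder.
    steps = sum(prompt.lower().count(w) for w in ("then", "also", "and"))
    return len(prompt) / 1000 + 0.3 * steps

def pick_model(prompt: str) -> str:
    return "phi-3-mini" if complexity(prompt) < 0.5 else "gpt-4o"

print(pick_model("Summarize this paragraph."))                 # phi-3-mini
print(pick_model("Read these ten docs, then draft a memo and "
                 "also propose next steps for each team."))    # gpt-4o
```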

Ben Thompson: Of course that makes sense; you're talking about models being commoditized. Microsoft has hired a lot of talent from Inflection AI, and it seems you want to make sure the models you offer are diverse. But if models are commoditized, why would the dynamics of cloud computing be any different from the past 12 or 15 years? Does this change anything?

Satya Nadella: I think that's a very good point. I think hyperscalers have a fundamental structural advantage here. In a sense, if you ask what the world will need more of in five years, I'd say hyperscale computing utilities everywhere. And if you think about it, the new formula for economic growth has never been clearer: you need more renewable energy, better grids, and more computing power. If you have those, every other area of the economy can really benefit. Any country, any community, that reaches the efficient frontier here can take the lead in economic growth and gain momentum. If you accept that high-level premise, then this is definitely the case.

Ben Thompson: But what about the competitive landscape? Amazon was first into cloud computing and basically locked in all the SaaS companies as customers. Microsoft moved its enterprise customer base to the cloud. Google, meanwhile, says, "Come on, our cloud is the best, try our service," but they're a distant third. Will the landscape evolve the same way? Will data gravity dominate? AI may be a new thing, but is the actual competitive landscape still...

Satya Nadella: I don't think I've met a single enterprise customer that uses only one cloud. I remember when I first got involved in cloud computing, everyone talked about it as if it were a winner-take-all market, and I never really understood that. Even in servers, where people said Microsoft had won, every category, whether operating systems, databases, web servers, or all that middle-tier stuff, had two or three players.

So fundamentally, I think this market definitely has room for two or even three companies. And as former Microsoft CEO Steve Ballmer often told me, revenue share and market share are two different things in a multi-player market.

Still, I think all three companies have room to grow. Amazon AWS had six or seven years without competitors; now the competitors have shown up, and we've reached this stage. I'm very, very optimistic about the next phase. We're not starting from scratch; if anything, we have a head start, and that changes everything. Take B2C customers, whether Shopify, Spotify, or anyone else: none of them were Azure users before, and thanks to the OpenAI API they're now not just Azure users but loyal ones. That's a huge turning point for us.

Capital expenditure and future

Ben Thompson: You've talked about your visibility into revenue and expenses; does that give you an advantage in this competitive landscape? There's no doubt you have to invest in artificial intelligence, but over the past seven years your capital expenditure relative to gross profit has risen sharply from 13%, a huge increase. What makes you believe this will pay off, or is that beside the point, because competition dictates that you invest no matter what?

Satya Nadella: I think you're reading the economics correctly: we are a high-capex business. Most people focus on our capital expenses purely because of artificial intelligence, but even without AI we are a knowledge-intensive and capital-intensive enterprise; that's what being a hyperscaler requires. If you can't invest $50-60 billion in capital expenditure each year, you don't get a ticket to the hyperscale market. It's table stakes for entry.

But at the same time, this will always be subject to market conditions: your spending shouldn't far outrun the growth of your revenue. So there is an absolute governor. The step-function jumps in compute allocated to training can be planned, while inference is ultimately driven by demand. Put the two together, and even through cyclical swings I don't think it's very hard to adjust. It's a pure business-management problem, and I'm not managing it for a quarter.

Ben Thompson: You're not as worried as Wall Street. One quick question, because I love this one. Bill Gates said we overestimate what can happen in two years and underestimate what can happen in ten. Given how much has happened in the past two years, are those units still correct?

Satya Nadella: I think those are probably still the right units. Take Moore's Law as the reference. There's a beautiful chart on the Epoch AI website that I really like: since 1950, the compute used in machine learning runs tracked Moore's Law, doubling every 15 or 16 months or so; then, around 2010, the slope got much steeper, doubling perhaps every six months or even faster, which makes it hard to keep your head clear. Everyone says, "Oh, I'm used to exponential growth." Believe me, living in that world...

Ben Thompson: When that exponential growth started, things became completely different.

Satya Nadella: Yes, it's hard. That's why I think AI security is a super important thing, and we have to keep that in mind, but we also have to keep in mind that new innovations will emerge. So how do you take advantage of new innovations without compromising safety? It's a very different game.
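
The arithmetic behind that "hard to keep your head clear" remark: a doubling every 18 months and a doubling every 6 months compound to wildly different multiples. A quick check, using the doubling periods mentioned above:

```python
# Compound growth under Moore's-Law-style vs. modern ML-compute doubling rates.
for label, months_per_doubling in (("18-month doubling", 18), ("6-month doubling", 6)):
    doublings = 10 * 12 / months_per_doubling        # doublings in 10 years
    print(f"{label}: x{2 ** doublings:,.0f} over 10 years")
# 18-month doubling: x102 over 10 years
# 6-month doubling: x1,048,576 over 10 years
```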

----

The following is an abridged interview with Microsoft CTO Kevin Scott.

Topics: Artificial Intelligence Platforms | Artificial Intelligence Expansion | Artificial Intelligence as a Tool | Partnering with OpenAI | Law of Scale | Capital Expenses

Artificial intelligence platform

Ben Thompson: Let's go back a decade or so. Walk me through Microsoft's journey from artificial intelligence and high-performance computing to forming the partnership with OpenAI. Was there anything that made you realize this was the path to follow?

Kevin Scott: Yes, of course. What's interesting is that I've only been at Microsoft a little over seven and a half years. Ten years ago, when I ran the engineering and operations team at LinkedIn, we could already see very clearly that artificial intelligence was on a very interesting new curve. AI back then wasn't today's generative AI; it was what people did with really sophisticated statistical machine learning, and even ten years ago the benefits we were reaping from scaling those systems up were arriving unexpectedly fast.

I've been working on this for a fairly long time: around the time Google went public, I was building machine learning systems at Google, including the large-scale systems that ran the ad auctions. Even then, the importance of scale was evident. What's relatively recent, about six years ago, is that this growth in scale made AI models start behaving like a platform.

We began to see that scale lets these large language models be reused for many, many different things, rather than the old pattern of building a model for one specific thing, applying a lot of scale, and making that one thing, say ad click-through-rate (CTR) prediction, very good.

Ben Thompson: Actually, that's something I wanted to ask, because you and Satya keep talking about a platform shift, and the word "platform" keeps coming up.

Kevin Scott: Yes.

Ben Thompson: I wanted to ask what you meant by it, but from your answer I gather you mean a platform in the sense of generalization.

Kevin Scott: Exactly. It's a component that can be combined in very flexible and versatile ways with whatever software system you're building. In the old world of artificial intelligence, a company like Microsoft might have hundreds of different teams, each owning everything top to bottom: the data, the machine-learning algorithms, the systems for training, the systems for deployment, and the loop for taking feedback from actual usage back into the models so everything improves over time.

That meant hundreds of small flywheels spinning, rather than one central model-training effort. What you get from scale instead is something useful across many of the applications you already have, plus the possibility of building lots of new things that were impossible before.

Ben Thompson: So let me push on the platform concept again. Maybe I heard this from you, maybe I had it in mind before we talked, but there are platforms like Windows, which is how we usually think about platforms: an application interface and a network effect, a two-sided network with developers on one side and users on the other. But there are also platforms like x86.

Kevin Scott: Yes.

Ben Thompson: I think you're using the term platform closer to x86 than Windows, is that right?

Kevin Scott: Yes.

Ben Thompson: Perhaps the better example is general-purpose processing, because when you talk about the move from dedicated to general purpose, it sounds like the shift from a dedicated processor that does one thing to a general-purpose chip that can be programmed broadly.

Kevin Scott: Yes, I think x86 is probably a very apt comparison, because the interesting thing about x86 is that it was a general substrate on which a huge amount of software could be written, and the capability of the system and the platform kept increasing because it got cheaper and more powerful every 18 months or so. Lots of people could build on it and ride that rapid improvement.

There were clear boundaries between x86 and the operating systems, the PC manufacturers, and the people building applications on top. Sometimes Microsoft built both the apps and the operating system, so there was a bit of both, but what people could do on the Wintel platform was limitless. It wasn't limited to the useful things Microsoft predicted. People could believe it was an interesting platform because you had an exponential called Moore's Law, which would eventually make the thing completely ubiquitous.

Ben Thompson: We'll get to Moore's Law; I know it's something you and Satya discuss often and want to talk about. But you mentioned Wintel. The result of x86 is that later you got Windows, you got Linux, eventually even the Mac, so there are layers, but overall, from a developer's point of view, what they care about is the OS layer.

For AI models, my question is where the real opportunities arise. Let's work backwards. One interesting thing about Nvidia now is that, obviously, there are plenty of secular reasons to be optimistic about Nvidia, but one structural reason worth noting is CUDA. We all used to specialize: we had this framework, that framework. LLMs generalized it, and now something even higher-level is happening where you don't need to know CUDA to build artificial intelligence applications. But is that an actual layer, or will there be an operating system on top of it?

Kevin Scott: Hard to say. Probably not in the sense of an operating system --

Ben Thompson: It's not a traditional operating system, but it also has this background.

Kevin Scott: Yes. I think that's the history of computing: people keep raising the level of abstraction.

Ben Thompson: We're rearranging it because of the new model.

Kevin Scott: Yes, 100%, I think that's completely true. So I don't know what the level of abstraction will be, but it's already very different. We now have prompt engineers who can coax these systems into doing very complicated things simply by issuing instructions in natural language about what you want the system to do or not do. We're building all kinds of tools around the fact that you have to fit a bunch of things into a large language model's context window to make it do what you want. The GraphRAG system we built at Microsoft Research is one such tool: it performs graph-structured context synthesis, so you can use the context very efficiently without sending unnecessary tokens to the model. That's a good thing, because the more tokens you send, the higher the cost and latency; you only want to give the model the information it needs to answer the question or complete the task at hand.
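
Scott's efficiency point can be illustrated with a far simpler mechanism than GraphRAG: rank candidate chunks by relevance and stop adding them once a token budget is reached. The overlap scorer and the one-token-per-word estimate below are toy stand-ins for a real retrieval model and tokenizer.

```python
# Toy context packing: send the model only what it needs (illustrative sketch).
def score(chunk: str, question: str) -> float:
    words = set(question.lower().split())
    return sum(w in words for w in chunk.lower().split())   # crude overlap score

def build_context(chunks: list[str], question: str, token_budget: int = 2000) -> str:
    picked, used = [], 0
    for chunk in sorted(chunks, key=lambda c: score(c, question), reverse=True):
        cost = len(chunk.split())        # rough token estimate
        if used + cost > token_budget:
            break                        # budget reached: stop adding context
        picked.append(chunk)
        used += cost
    return "\n\n".join(picked)           # fewer tokens sent, lower cost and latency
```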

So I don't know exactly what the abstraction will look like; that's why we talk about the Copilot stack even though we haven't fully figured out what everything in the Copilot stack actually is. Over the past few years we've figured out many of the components needed to deploy modern apps, but as the models get more powerful, the level of abstraction will keep rising. Going back to your Windows analogy: the first version of Windows didn't have DirectX, because graphics hardware wasn't powerful enough at the time for anyone to even think about shaders.

Ben Thompson: It's not that nobody had created it; it's that nobody had thought of it yet.

Kevin Scott: Right, so there's still a lot of work to be done. But at a minimum, I think you'll see from us the components you need, beyond a powerful frontier model, to build these really rich and exciting new apps.

Ben Thompson: Is that the Copilot stack? In the long run, it seems to me that a developer considering building an application in 1975 versus 1985 versus 1995 faced completely different sets of decisions; in some ways, what you should build became obvious. How do you see that evolution happening here?

Kevin Scott: Hmm, I think our view now is that as these platforms emerge, the abstractions will come in layers. At the bottom of the abstraction stack you have a large foundation model, and then datasets and retrieval mechanisms that make sure the model can access the information it needs to do its job. On top of that you have a bunch of orchestration, because you may need to operate across multiple models to accomplish a task, whether for cost, quality, or data-privacy reasons.

One thing we really want to see in the year ahead is a breakdown of where inference happens. Some of it can run on the device, a PC or a phone, and you want as much inference as possible, as much of the AI as possible, on the device; only when the device runs out of capability or capacity do you call on something more powerful and sophisticated in the cloud.

Ben Thompson: In that process, the key piece is the orchestration agent, which first decides whether to execute locally or in the cloud. If it goes to the cloud, it also has to consider how to optimize the prompt or request to cut the token count and increase efficiency. Like the Phi-on-Windows point you made earlier: the payoff isn't only Copilot features on Windows, it's also reducing Copilot's cloud costs.
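
A minimal sketch of that orchestration step, under invented thresholds: run locally when the device has capacity and the request is small, otherwise compress the prompt and call the cloud. Both backends here are stubs.

```python
# Orchestration sketch: decide where to run, shrink the request before a cloud call.
def estimate_tokens(text: str) -> int:
    return max(1, len(text) // 4)                 # rule of thumb, not a real tokenizer

def compress_prompt(prompt: str, max_words: int = 200) -> str:
    return " ".join(prompt.split()[:max_words])   # keep the task, drop the excess

def run_on_device(prompt: str) -> str:
    return f"[small local model] {prompt[:40]}"   # zero marginal COGS

def run_in_cloud(prompt: str) -> str:
    return f"[frontier cloud model] {prompt[:40]}"

def orchestrate(prompt: str, device_has_capacity: bool) -> str:
    if device_has_capacity and estimate_tokens(prompt) < 1024:
        return run_on_device(prompt)
    return run_in_cloud(compress_prompt(prompt))  # fewer tokens, lower cost

print(orchestrate("Summarize my last meeting notes.", device_has_capacity=True))
```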

Kevin Scott: I think this is an important question, and the levels of abstraction keep multiplying. The point is, if you're building a genuinely useful product with a very broad audience, you need to distribute it to as many people as possible, so cost (COGS) is definitely a factor. If there are ways to deliver it as a high-quality product while offloading work from the cloud onto a small model, that should absolutely be done.

Ben Thompson: I remember you saying before that we shouldn't worry too much about cost (COGS) because everything is going to get really cheap.

Kevin Scott: Yes.

Artificial intelligence expansion

Kevin Scott: I think the interesting thing about the whole field right now is that the capability of frontier models really is growing exponentially, and I don't think we're anywhere near diminishing returns.

Ben Thompson: If something were to get in the way of scaling, what would it be? Data, or something else?

Kevin Scott: I think data is already getting difficult. At the scale of some frontier models, everyone runs into the challenge of having enough data. I mean, one of the major innovations in the Phi models is that you actually use a frontier model to generate...

Ben Thompson: Synthetic data.

Kevin Scott: We've been doing this for years, even for non-generative models. If you want to build a computer-vision classifier and make sure it doesn't reflect the biases of the underlying training set, you can use a generative model to produce a large amount of synthetic data and obtain a fair distribution of training data, which gets you the model behavior you want. So I think generating synthetic data is an increasingly powerful way to build both small and large models, and for reinforcement learning in particular I think it's really valuable.
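
The rebalancing idea is mechanical enough to sketch: count examples per class, then ask a generative model for enough synthetic examples of the rare classes to even out the distribution. generate_image() below is a placeholder for any image-generation model.

```python
# Sketch: top up under-represented classes with synthetic examples.
from collections import Counter

def rebalance(labels: list[str], generate_image) -> list[tuple[object, str]]:
    counts = Counter(labels)
    target = max(counts.values())
    synthetic = []
    for label, n in counts.items():
        for _ in range(target - n):       # generate what's missing per class
            synthetic.append((generate_image(label), label))
    return synthetic

labels = ["cat"] * 900 + ["dog"] * 100    # skewed real-world training set
extra = rebalance(labels, generate_image=lambda lbl: f"<synthetic {lbl} image>")
print(len(extra))                         # 800 synthetic dog examples
```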

Ben Thompson: What's driving this? You mentioned the foundation models, and on stage you and Sam Altman used that comparison, like we started with smaller animals. What was the smaller animal?

Kevin Scott: Shark. First a shark, then an orca, and now the blue whale, the as-yet-unnamed model in training that will apparently be released at some point.

Ben Thompson: So whatever the answer is, a lot of the efficiency and scalability sits in the smaller models, because the big models can generate all the synthetic data and provide everything; you can optimize. But that doesn't answer the question about the foundation models themselves. How do you think they keep scaling?

Kevin Scott: It's a bit difficult to fully answer this question without revealing a bunch of things I don't want to reveal.

Ben Thompson: I think that's enough.

Kevin Scott: But I think synthetic data is also useful for training large foundation models. Imagine you want a foundation model that is very, very, very good at coding. There are lots of ways to generate synthetic programs with specific characteristics, because programs are deterministic entities: you can synthesize something and run it through something like a model checker to prove "does it compile?", "does it produce the expected set of outputs?", "is it a valid training input?". In domains like these, generating well-formed training inputs is very direct, so you can design a training curriculum, or at least part of one, that makes the model better at the specific thing you're trying to train. But that doesn't mean all of the data can be synthetic.
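
That generate-then-verify loop is easy to demonstrate at toy scale. The sketch below enumerates trivial candidate programs, keeps only those that compile and produce the expected output, and collects the survivors as training pairs; a real pipeline would sample candidates from a frontier model instead of a template.

```python
# Toy "synthesize a program, then verify it" loop (illustrative only).
import contextlib
import io

def candidate_programs():
    for a in range(3):
        for b in range(3):
            yield f"print({a} + {b})", str(a + b)    # (source, expected output)

def verified_training_pairs() -> list[tuple[str, str]]:
    pairs = []
    for src, expected in candidate_programs():
        try:
            code = compile(src, "<synthetic>", "exec")   # "does it compile?"
        except SyntaxError:
            continue
        buf = io.StringIO()
        with contextlib.redirect_stdout(buf):
            exec(code)                                   # "does it produce the output?"
        if buf.getvalue().strip() == expected:
            pairs.append((src, expected))                # keep as a training input
    return pairs

print(len(verified_training_pairs()))                    # 9 verified examples
```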

Ben Thompson: You gave the example of a model for click-through-rate prediction and ad targeting, and here you're talking about coding. The nice thing about all of these is that even though you're building a probabilistic model, the data you're using is deterministic, right?

Kevin Scott: Exactly.

Ben Thompson: So when you reach this kind of general capability, what makes you confident it extends across domains? At one extreme you have pure creativity, where there are no wrong answers, and these systems work very well there. At the other extreme you have domains with verification, where you can actually run AI in parallel and pick the best answer, because you can grade it. But there's a middle ground, what I'd call the "lazy" use case: people want AI to do the work for them, but they don't necessarily have a grader. Can artificial intelligence generalize to that territory?

Kevin Scott: Yes, I think we'll be able to generalize to a lot of it. One thing that gives me confidence is the argument of the Phi paper, "Textbooks Are All You Need." We know how to train human experts with fairly limited curricula to do very specialized things.

It's like how I was trained as a computer scientist: I read lots of computer science papers and textbooks and worked through lots of problem sets, practicing again and again. After a while, I was competent enough to do something useful in the world. That gives me confidence that we'll figure out how to generate enough curriculum for these models, and find a learning function, such that we can build things with pretty strong cognitive capabilities.

Now, what I don't know, and this is something we'll find out before long, is the limit of that. I have a bet with some people that a computer will prove the Riemann hypothesis before mathematicians do. The Riemann hypothesis is one of the great old problems in mathematics: posed by Bernhard Riemann in the 19th century and famously included in David Hilbert's list of problems at the turn of the 20th, it is basically a statement about the distribution of prime numbers. It's an easy problem to state, but extremely smart people have been trying to solve it for a very long time, so I believe the eventual proof will be incredibly complicated, and incredible to see. My prediction is that a computer will get there before humans alone do, probably with human assistance; it won't be completely autonomous.

Artificial intelligence as a tool

Ben Thompson: You said in today's keynote that you've loved tools your whole life.

Kevin Scott: Yes.

Ben Thompson: Will artificial intelligence still be a tool?

Kevin Scott: Yes, I think so.

Ben Thompson: Why is that? Why wouldn't it become something more autonomous?

Kevin Scott: We don't know, but I think we have a lot of clues about what humans want. Since Deep Blue defeated Garry Kasparov at chess in 1997, no human has been better than a computer at chess, yet nobody cares about matches between two computers; everyone cares about the matches between humans.

Ben Thompson: So, is there an idea that maybe AI will take over everything, but we don't care at all because we only care about other humans?

Kevin Scott: I don't think artificial intelligence will replace that; I think it will continue to be the tool we use to make things for each other, serve each other, and do valuable things for each other.

I think what we're looking for is meaning and connection, we want to do things for each other, and I think we have a huge opportunity to use these tools to do more of these things in slightly different ways. But I'm not worried that we'll lose our sense of place or purpose because of this.

Ben Thompson: How will artificial intelligence improve lives in Virginia where you grew up?

Kevin Scott: I told my mom's story on stage this morning; she had a lot of trouble with her health last fall.

Populations are shrinking in many places: China, Italy, Japan, Germany. You can look up when the population peaks in many countries; I think France reaches its peak sometime in the early 2030s.

So we're going to live in a world of declining population, and to maintain our standard of living we'll need to get the work done with fewer people. Over time, for a better standard of living, we have to raise productivity a lot. There has to be a way to do everything that needs doing with fewer people.

Parts of rural America are the canary in the coal mine: we will all face this problem at some point, and already there are no doctors lining up to care for the rapidly aging population there. So I think the way artificial intelligence gets applied in these places is to let people get what they need fairly, so they can have dignity and live good lives. I know this all sounds abstract, and these feel like far-off questions.

Ben Thompson: I'm from a small town in Wisconsin, and I know what you're talking about.

Kevin Scott: Yes, this is a real thing. I think about the medical crisis my mom went through. Everyone in the system was doing their best, but their best still wasn't good enough; I think her outcome might have been very different if I hadn't intervened, and I think of the elderly women who don't have a son to intervene. If artificial intelligence can play some role in that, giving people more agency over their own health care, their own education, their own entrepreneurial opportunities, that has to be a good thing. Of course, that doesn't mean we can ignore its risks and drawbacks.

Ben Thompson: I think in general, and in our circle in particular, unlike many other technological revolutions, we haven't fully considered the potential benefits. When you talk about all these good things, people in the field immediately add, "Oh, I'd better mention the safety issues too." But if you ask people outside the field, they'll say, "Oh, we know the benefits," and that part gets skipped. It's like saying, "No, wait, can we stop and actually talk about what those benefits are?" So I appreciate your thoughts on this.

Kevin Scott: I do think there was at least one technological revolution of this nature, and that was the printing revolution, when there was a printing press.

Ben Thompson: The Church. It took them 10, 15 years, but they caught up with the times quickly enough.

Kevin Scott: Yes, though actually it took longer than that: roughly a century of turbulence and upheaval, and the end result is today's world. Humans today can't imagine a world without writing, books, and the free flow of information.

Ben Thompson: We also ended up with a completely reorganized Westphalian system. We went through years of war; we went through the upheaval of the entire Reformation. A lot happened.

Kevin Scott: Yes, my wife is a trained early modern European historian who is now a philanthropist. The printing revolution is part of our family conversation.

Collaboration with OpenAI

Ben Thompson: Is it because you were an outsider, having worked at Google and LinkedIn, that you could come to Microsoft and say, "I'm not sure you realize how far behind Google you are; you need to do something very aggressive here"?

Kevin Scott: Maybe. I think Microsoft actually realized at the time that it was left far behind.

Ben Thompson: So generally, you didn't need to convince anyone?

Kevin Scott: Yes. The question was what to do. I'm always drawn to problems.

Ben Thompson: You need a place where you can use your talents.

Kevin Scott: Yes, I'm an engineer, and you'll notice that people solve problems in different ways. Some are good at starting, some are good at finishing, and few are good at both. Most of the jobs I've chosen were like the movie Nanny McPhee; I don't know if you've seen it.

Nanny McPhee is a fictional babysitter, and I'm not saying I have magic, but her rule is: when the children need her but don't want her, she must stay; when they want her but no longer need her, she has to go. That's what draws me to a job. It's like, "Okay, this is a tricky situation; I think I can help with this particular problem."

Ben Thompson: Since Microsoft was lagging in some areas, did you see that as both a challenge and an opportunity?

Kevin Scott: It's not that the whole company was lagging. Microsoft was doing well; cloud computing and the other businesses were strong. It was about lagging in artificial intelligence, which in 2017 wasn't yet so obviously important.

Ben Thompson: Did that have anything to do with the product portfolio? For example, if Bing were bigger and had more advertising business, there would have been more ad revenue to justify it. Or was it neglect? What drove it?

Kevin Scott: Hard to say. One thing that's very clear about investing in artificial intelligence now is that you have to be disciplined about how you invest. This isn't an area where you want to let a thousand flowers bloom, spreading your resources across lots of different bets and hoping each one ultimately brings a huge return.

Microsoft had invested a lot of money in artificial intelligence and had a large number of people doing related work, but the effort was scattered across many different areas, and in an enterprise it is really too expensive and complicated to spread artificial intelligence across that many fields. I think that's something people still struggle with.

Ben Thompson: So how did you convince Microsoft? "Look, you've spent all this money, we get it. You have Microsoft Research, you have XYZ. Actually, your core competency, Microsoft, is the ability to spend money, and OpenAI doesn't have money but has the ability to build what needs to be built; we have to work together"?

Kevin Scott: I'd challenge that assertion. I don't think our core competency is spending money. If you look at the company's DNA, I think our core competency is building platforms on which others can build their businesses and products.

Ben Thompson: Fair enough. That way, you'll spend a lot less money.

Kevin Scott: Yes, I think that's true of the successes. So our argument was essentially the one we've made all along: the technology trend we're seeing acts like a platform, and the platform itself really benefits from focus; you have to have a point of view about what you want to invest in. It's not just the capital; it's all the opportunity cost spent developing this new platform.

Ben Thompson: Although Microsoft has always defaulted to "build rather than buy." In this case the question wasn't "buy" but "build versus partner," which is a more precarious position. What evidence or moment convinced the board that "we don't have time to catch up"?

Kevin Scott: I think it was around 2019, when we struck our first partnership with OpenAI and already had a pretty clear understanding of the scaling laws. We knew we needed to act, and there were two or three options. In my judgment, and in Satya's, this was the fastest way to get moving and get to market.

Ben Thompson: But the risk is that you're handing a lot of control to an entity you can't fully control. As the main champion of this bet, how much pressure did you feel in November 2023?

Kevin Scott: That was genuinely stressful. But to repeat: I think Microsoft, as a platform provider, has gotten pretty good over the years at building complicated things with partners. The PC revolution wasn't accomplished by Microsoft alone; it was the result of Microsoft's collaboration with Intel, Nvidia, and the entire OEM ecosystem. Even Azure's success is inseparable from our partnerships with infrastructure providers like Databricks and Snowflake and from the countless services running on our cloud and on others. At this point, if you're serious about these hyperscale platforms, you have to be really good at partnering. You can't think, "I have to do everything myself"; that's too hard.

Ben Thompson: Back to the abstraction question. Setting aside the point that whoever has computing power rules the world, are you confident, and is this the thing that keeps you up at night, that models will eventually be commoditized? Past a certain point you'd obviously have to do some work, but an Office application could run any model, not necessarily an OpenAI model.

Kevin Scott: I don't think it's really about commoditization; it's about the two-step we're executing right now. You have a frontier that's advancing rapidly. If you want to be a modern AI cloud, or build modern AI applications, you'd better have access to frontier models; that's a no-brainer at this point. I think OpenAI has done a great job building those frontier models and making good use of computing resources, and as the frontier advances, you get an ecosystem of very, very smart people finding ways to optimize every part of it.

Scaling laws

Ben Thompson: Do you see Phi as a vindication of your strategy? In basically just a couple of years you went from nothing to building the best small models.

Kevin Scott: Yes, though I think the interesting thing about Phi isn't what it replaced but how well it combines with what we already have, because you can do so many things with frontier models, and I don't want anyone confused about that. Half of the message I delivered at Build today was this: you really need to account for how fast the frontier models are improving; one real mistake is getting so addicted to the linear optimizations everyone is doing that you miss it.

Ben Thompson: Has the tech world forgotten what it feels like to build on Moore's Law? If you go back to the '80s and '90s, it took a while to internalize: you'd build an inefficient application because you wanted to optimize the front end of the user experience, and you simply trusted Intel to solve your performance problems.

Kevin Scott: Yes, you're right.

Ben Thompson: Have some aspects been forgotten?

Kevin Scott: I think so.

Ben Thompson: Because everyone complains about bloated programs, but sometimes you actually need the bloat, because the performance problems will get solved.

Kevin Scott: Yes, you can fix the bloat later.

Ben Thompson: Exactly.

Kevin Scott: You don't want meaningless bloat, but you don't want...

Ben Thompson: You don't want to over optimize to get rid of it.

Kevin Scott: When I think back to the early days of my career, people took pride in it. You'd write these programs, then dig into the inner loop of your critical-path functions and write lots of...

Ben Thompson: Like, OK, you've optimized from 0.0002 ms to 0.0001 ms.

Kevin Scott: There was a time when that really mattered; it was the difference between something useful and garbage. But once you have Moore's Law, that exponential improvement process, if you don't realize it, all you're doing is hand-writing assembly for the inner loop of something and missing every opportunity to write more powerful software.

I worked on compiler optimization in grad school. I have a friend, Todd Proebsting, a professor at the University of Arizona who spent some time at Microsoft Research. He proposed Proebsting's Law, a joke on Moore's Law: compiler-optimization researchers double the performance of computer programs every 18 years.

And he was right. That's one of the reasons I decided not to stay in compiler optimization: you can spend six months on something very, very complicated and move a benchmark by 4%, and in the same time the materials scientists and the architects will have doubled the speed of the hardware. So what are you even doing? Instead of squeezing the old slow system, find a way to exploit the new fast system that's about to arrive.
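
The comparison is stark when written out. Using the two doubling periods above, 18 years for compiler research versus roughly 18 months for hardware, six months of work buys about 2% from the compiler side and about 26% from the hardware:

```python
# Proebsting's Law vs. Moore's Law over six months of effort.
years = 0.5
compiler_gain = 2 ** (years / 18)    # doubling every 18 years
hardware_gain = 2 ** (years / 1.5)   # doubling every 18 months
print(f"compilers: {compiler_gain:.3f}x, hardware: {hardware_gain:.3f}x")
# compilers: 1.019x, hardware: 1.260x
```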

Ben Thompson: What's driving that now? You mentioned the new model on the way, but you also cited the GPT-4 to GPT-4o case: 12 times cheaper and 6 times faster. If you really dig in, GPT-4o may not match GPT-4 in some ways while beating it in others, so it's been optimized. Is that purely model optimization? Is it serving and inference, where you figured out a new way to serve it? What are the drivers?

Kevin Scott: Yes, I think there are two basic drivers. One is that the hardware is getting better: Nvidia has done a lot of work, AMD is doing good work now, Microsoft's own chips are underway, and many companies are building their own silicon for their own environments. Even if smaller, cheaper transistors no longer deliver what they used to for general-purpose computing, at least for now we have plenty of headroom to innovate in how we use these chips for embarrassingly parallel applications like artificial intelligence.

Ben Thompson: Well, there's always room for more innovation. Even if it doesn't all come from the transistor side, you can innovate elsewhere, such as networking.

Kevin Scott: Yes.

Ben Thompson: "We'll recreate him in the aggregate," to borrow the line from Moneyball.

Kevin Scott: We got a great price/performance boost from the hardware, but more importantly we put a lot of effort into innovation above it: optimizing the entire system-software stack and exploiting new data formats. A lot of the work is moving to faster, lower-precision parallel processing, such as using FP8 instead of doing all the model's arithmetic in 32-bit, which uses memory more efficiently and gets more operations done.

Ben Thompson: So there's a counterintuitive claim here: you use less precision, which should make the thing somewhat dumber, and yet it's actually the better answer because price and speed matter more?

Kevin Scott: So far, reducing precision hasn't made anything dumber. Just look at the activations in a neural network: they're very, very sparse. The network is large, but not many activations carry significant signal.
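
A quick illustration of the precision trade-off: store weights in 8 bits instead of 32 and most of the signal survives at a quarter of the memory. Real FP8 formats (e.g. e4m3) differ from the simple int8 scaling below, which is only meant to show the trade-off.

```python
# Simulate low-precision storage of weights (illustrative, not real FP8).
import numpy as np

w = np.random.randn(1024, 1024).astype(np.float32)   # "fp32 weights"
scale = np.abs(w).max() / 127
w8 = np.round(w / scale).astype(np.int8)             # 8-bit storage
w_restored = w8.astype(np.float32) * scale

rel_err = np.abs(w - w_restored).mean() / np.abs(w).mean()
print(f"memory: {w.nbytes // w8.nbytes}x smaller, mean relative error {rel_err:.2%}")
# memory: 4x smaller, mean relative error on the order of 1%
```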

Ben Thompson: What strikes me is that the defining characteristic of this approach is parallelism everywhere. To me, the most attractive applications are the ones that push parallelism to the extreme, and those are exactly the applications that don't need to worry much about numerical precision. If the cost of lower precision is that you need more parallelism, then you want more and more of it.

Kevin Scott: Right. The hardware is getting better, and our techniques are getting better: training techniques, and the techniques for building inference engines, are improving even faster than the hardware.

Capital expenses

Ben Thompson: How do you gain confidence in the money being invested? That's probably the question people keep raising. You say, "OK, we're confident in the revenue," but confident in what, exactly? Office Copilot revenue? API usage? And which is better: to overinvest, or to come up short on training and inference capacity?

Kevin Scott: What we're seeing now is that the downside of excess computing power is relatively small, and that's purely theoretical, because the reality is we don't have excess computing power. Demand for all of these artificial intelligence products and services is so high that we're doing crazy things just to make sure we have enough capacity, and optimizing the whole system hard enough, to meet the demand we're seeing.

Looking ahead, there's plenty of economic opportunity here. The API business has gone from zero to scale faster than anything I've seen. The Copilot business is hugely appealing: Copilot's user engagement is the highest we've seen for any new Microsoft 365 office product, and probably the highest ever. Usually you sell a new enterprise product and it takes quite a while for it to spread through an organization.

Ben Thompson: It sounds like if you could spend more on capital expenditure, you would; are you just supply-constrained?

Kevin Scott: Yes, but if I could invest more in CapEx...

Ben Thompson: Data center energy is also a consideration.

Ben Thompson: Aren't you concerned about demand?

Kevin Scott: No, not right now.

Editor/jayden
