

Nvidia soared more than 8% overnight, lifting US stocks. What did Jensen Huang say?

wallstreetcn ·  Sep 12 07:05

Nvidia CEO Jensen Huang said that supply of Nvidia's Blackwell AI chips is growing more slowly than demand, which has frustrated some customers. He also hinted that, if necessary, Nvidia could reduce its reliance on TSMC and turn to other chip manufacturers. In addition, the US government is reportedly considering allowing Nvidia to export advanced chips to Saudi Arabia.

Jensen Huang, CEO of NVIDIA, the bellwether stock of the AI boom, said on Wednesday that NVIDIA's products have become the most sought-after commodities in the tech industry, with customers competing for limited supply, especially of its AI chips. The limited growth in Blackwell supply from its manufacturing partners has frustrated some customers. He also hinted that, if necessary, NVIDIA could reduce its reliance on TSMC and turn to other chip manufacturers.

He told the audience at a technology conference hosted by Goldman Sachs in San Francisco:

"Demand for our products is so high that everyone wants to be first to get them and to get the largest share. We probably have more emotional customers today, and that's understandable. The relationships are tense, but we are doing our best."

Huang told the audience that the company's latest generation of AI chips, Blackwell, is facing intense demand. NVIDIA outsources Blackwell production, and he said its suppliers are doing their best to keep up with demand and are making progress.

However, the majority of NVIDIA's revenue depends on a few customers, data center operators such as Microsoft and Meta Platforms Inc. When asked whether the massive AI spending is delivering a return on investment for those customers, Huang said that companies have no choice but to embrace "accelerated computing." He explained that NVIDIA's technology not only accelerates traditional workloads such as data processing, but also handles AI tasks that older technologies cannot cope with.

Huang also said that Nvidia relies heavily on TSMC for chip production because TSMC is far ahead of the rest of the chip manufacturing sector.

He added that NVIDIA has developed most of the relevant technology in-house, which would allow the company to shift orders to other suppliers, although he cautioned that such a change could reduce the quality of its chips.

"TSMC's agility and their ability to respond to our needs are truly incredible. That's why we chose them: they are outstanding. But if necessary, of course, we can turn to other suppliers."

In addition, the US government is reportedly considering allowing NVIDIA to export advanced chips to Saudi Arabia, which could help the country train and run the most powerful AI models. People working with the Saudi Data and Artificial Intelligence Authority said Saudi Arabia is working to comply with US security requirements to speed up the process of obtaining these chips.

After the interview was published, Nvidia's stock reversed an intraday decline to close up more than 8% at $116.91, helping the Nasdaq swing from a 1.6% loss to a 2.17% gain. Nvidia's stock has more than doubled this year, after rising 239% in 2023.

The following is an excerpt from Jensen Huang's interview:

1. First, let's talk about what you had in mind when you founded the company 31 years ago. Since then, you have transformed it from a GPU company focused on gaming into one that provides a broad range of hardware and software for the data center industry. Could you talk about that journey? What were you thinking at the start? How did it evolve? What are your key priorities for the future, and how do you see the world ahead?

Jensen Huang: I would say that one thing we got right was foreseeing that there would be another form of computing, one that could augment general-purpose computation and solve problems that general-purpose tools could never solve. This processor would start out doing something extremely difficult for CPUs: computer graphics.

But we gradually expanded into other domains. The first we chose, naturally, was image processing, which complements computer graphics. We extended into physical simulation, because in the video game domain we had chosen you want things to be not just visually appealing but dynamic, capable of creating virtual worlds. Step by step we brought it to scientific computing. One of the first applications was molecular dynamics simulation; another was seismic processing, which is essentially inverse physics, much like CT reconstruction. So we solved problems step by step, expanding into adjacent industries.

The core idea we have always held to is that accelerated computation can solve interesting problems. Our architecture stays consistent: software written today runs on the large installed base you have built up, and software written in the past is accelerated by new technologies. This philosophy of architectural compatibility, of creating a large installed base and growing with the ecosystem, began in 1993 and continues to this day. It is why NVIDIA's CUDA has such a large installed base: we have protected it. Protecting software developers' investments has always been our top priority.

Looking to the future, some of the problems we solved along the way included learning how to be founders, how to be CEOs, how to run a business, and how to build a company, all of which required new skills. It was somewhat like inventing the modern computer gaming industry. People may not realize it, but NVIDIA has the world's largest installed base of video game architecture. GeForce has roughly 300 million players, and it is still growing quickly and remains very active. So every time we enter a new market, we need to learn new algorithms and new market dynamics, and create new ecosystems.

The reason we need to do this is that, unlike a general-purpose computer, where once the processor is built everything eventually runs on it, we build accelerated computers. That means you have to ask yourself: what do you want to accelerate? There is no such thing as a universal accelerator.

2. Can you say more about the differences between general-purpose and accelerated computing?

Jensen Huang: If you look at modern software, the code you write includes a lot of file input and output, sections that set up data structures, and a few magical core algorithms. Those algorithms differ depending on whether they are for computer graphics, image processing, or something else: fluids, particles, inverse physics, imaging. These algorithms are all different. If you build a processor that specializes in them and complements the CPU at the tasks it is good at, you can in theory dramatically accelerate an application. The reason is that typically 5% to 10% of the code accounts for 99.99% of the running time.

So if you offload that 5% of the code to our accelerator, you can technically speed up the application by 100 times. That is not rare; we often accelerate image processing by 500 times. What we are doing now is data processing, one of my favorite applications, because almost everything related to machine learning involves it. It might be SQL data processing, Spark-style data processing, or vector-database processing, over structured or unstructured data: these are all data frames.
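The speedup arithmetic above can be sketched with Amdahl's law. This is a minimal illustration: the 99.99% runtime fraction and the 500x kernel speedup are the figures quoted in the text, not benchmarks.

```python
def amdahl_speedup(runtime_fraction, accel_factor):
    """Overall speedup when `runtime_fraction` of total execution time
    is offloaded to an accelerator that runs it `accel_factor` times faster."""
    return 1.0 / ((1.0 - runtime_fraction) + runtime_fraction / accel_factor)

# If 99.99% of runtime lives in the accelerable kernels (as the 5-10%
# of code mentioned above often does), a 500x kernel speedup yields
# roughly a 476x application speedup:
print(round(amdahl_speedup(0.9999, 500), 1))  # → 476.2
```

Note how the residual 0.01% of un-accelerated runtime caps the overall gain, which is why the text's "100 times" claim is conservative rather than optimistic.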

We accelerate these dramatically, but to do so you need to create a top-level library. In computer graphics we were fortunate to have Silicon Graphics' OpenGL and Microsoft's DirectX, but outside graphics no such libraries really existed. So, for example, one of our most famous libraries is analogous to SQL: SQL is a library for storage computing, and we created the world's first library for neural network computing.

We have cuDNN (a library for neural network computing), cuOpt (combinatorial optimization), cuQuantum (quantum simulation and emulation), and many others, such as cuDF for SQL-like data frame processing. All of these libraries had to be invented; each rearranges the algorithms in an application so that our accelerators can run them. If you use these libraries, you can get 100x acceleration, which is quite amazing.

So the concept is simple and sensible. But the question is: how do you invent these algorithms and get the video game industry to use them, write algorithms for seismic processing and get the entire energy industry to use them, write new algorithms and get the entire AI industry to use them? For every one of these libraries, we first had to do the computer science research, and then go through the process of developing the ecosystem.

We had to persuade everyone to use these libraries, and then think about which kinds of computers they run on, and every computer is different. So we moved step by step into one field after another. We have built a very rich library for autonomous vehicles, an impressive library for robotics, incredible libraries for virtual screening, whether physics-based or neural-network-based, and an amazing library for climate technology.

So we have to go make friends and create markets. It turns out that what NVIDIA is really good at is creating new markets. We have been doing it for so long that NVIDIA's accelerated computing now seems to be everywhere, but we really had to build it step by step, developing the market one industry at a time.

3. Many investors here are very interested in the data center market. Can you share your views on the medium- and long-term opportunity? Your industry is obviously driving what you call the "next industrial revolution." How do you see the current state of the data center market and the challenges ahead?

Jensen Huang: Two things are happening at the same time, and they are often conflated; it helps to discuss them separately. First, suppose AI did not exist. Even in a world without AI, general-purpose computing has stagnated. As we all know, certain principles of semiconductor physics, such as Moore's Law and Dennard scaling, have come to an end. We no longer see CPU performance doubling every year; we are lucky to see it double in ten years. Moore's Law used to mean a tenfold performance increase every five years, a hundredfold every ten.

But that has ended, so we must accelerate everything that can be accelerated. If you're doing SQL processing, accelerate it; if you're doing any kind of data processing, accelerate it; if you run an internet company with a recommendation system, it must be accelerated. The largest recommendation engines are now all accelerated. A few years ago these ran on CPUs; now they are all accelerated. So the first dynamic is that the world's trillions of dollars of data centers will be modernized into accelerated computing data centers. That is inevitable.

In addition, because NVIDIA's accelerated computing has driven down costs so dramatically, computing power over the past decade has grown not a hundredfold but roughly a millionfold. So the question becomes: if your airplane could fly a million times faster, what would you do differently?
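As a back-of-the-envelope check on these figures, the implied compound annual growth rates work out as follows; both decade totals are the article's claims, not independent measurements.

```python
# Compound annual rate implied by "a millionfold in a decade"
# versus classic Moore's Law scaling (100x in ten years).
million_fold_per_year = 1_000_000 ** (1 / 10)   # ≈ 3.98x per year
moores_law_per_year = 100 ** (1 / 10)           # ≈ 1.58x per year
print(f"{million_fold_per_year:.2f}x vs {moores_law_per_year:.2f}x per year")
# → 3.98x vs 1.58x per year
```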

So people suddenly realized: "Why don't we let the computer write the software, instead of imagining the features or designing the algorithms ourselves?" We simply give the computer all the data and let it find the algorithm. That is machine learning, generative AI. We have applied it at scale across many different data domains, where the computer not only knows how to process the data but understands its meaning. And because it understands multiple modalities of data at once, it can translate between them.

Therefore, we can convert from English to images, from images to English, from English to proteins, and from proteins to chemical substances. Because it understands all the data, it can perform all these translation processes, which we call generative AI. It can convert a large amount of text into a small amount of text, or expand a small amount of text into a large amount of text, and so on. We are now in the era of this computer revolution.

What is amazing now is that the first trillions of dollars of data centers are going to be accelerated, and at the same time we have invented this new kind of software called generative AI. Generative AI is not just a tool; it is a skill. And because of that, a whole new industry is being created.

Why is that? Until now, the entire IT industry has created tools and instruments that people use. For the first time, we are creating skills that augment human capabilities. That is why people believe AI will expand beyond the trillion-dollar data center market and the IT industry into the world of skills.

So what are these skills? Digital currency is a skill; autonomous driving is a skill; so are digitized assembly-line workers, robots, digitized customer service chatbots, and digital employees that plan NVIDIA's supply chain. That last one could be a digital agent for SAP. Our company uses ServiceNow extensively, and now we have digital employee services. These digitized humans are the AI wave we are in right now.

4. There is an ongoing debate in the financial markets about whether the return on investment is sufficient as we continue to build AI infrastructure. How do you evaluate the return on investment that customers get in this cycle? If you look back at history, at PC and cloud computing, how did their ROI compare in similar adoption cycles? What is different now?

Jensen Huang: That's a great question. Let's take a look. Before cloud computing, the big trend was virtualization, if you remember. Virtualization basically meant we virtualized all the hardware in the data center into virtual data centers, and could then move workloads across data centers without tying them to a specific machine. The result was higher data center utilization, and we saw data center costs drop by a factor of two to two and a half, almost overnight.

Then, we put these virtual machines into the cloud, and as a result, not only one company but many companies could share the same resources, costs decreased again, and utilization increased again.

All that progress in recent years masked the fundamental change underneath: the end of Moore's Law. We got a twofold or greater cost reduction from higher utilization, even as we ran into the limits of transistors and CPU performance.

Now those utilization gains have reached their limits too, which is why we are seeing computing inflation in data centers. So the first thing happening is accelerated computing. When you process data, for example with Spark, one of the most widely used data processing engines in the world today, and you accelerate it with NVIDIA accelerators, you can get a 20-fold speedup. That means you save roughly 10 times the cost.

Of course, your compute cost rises a bit because you have to pay for NVIDIA GPUs, perhaps doubling, but you cut the computation time by 20 times, so you end up saving 10 times the cost. Returns like that are not uncommon for accelerated computing. So I suggest accelerating everything that can be accelerated with GPUs; the return on investment is immediate.
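The cost arithmetic in the two paragraphs above can be written out explicitly. This is a minimal sketch: the 20x speedup, the 2x price premium, and the $1/hour baseline rate are illustrative figures, not benchmarks.

```python
# Illustrative total-cost math: 20x faster at ~2x the hourly price
# still nets roughly a 10x lower total cost for the same job.
cpu_hours, cpu_rate = 20.0, 1.0     # baseline: 20 hours at $1/hour
gpu_hours = cpu_hours / 20          # 20x acceleration
gpu_rate = cpu_rate * 2             # accelerated compute costs ~2x per hour
cpu_cost = cpu_hours * cpu_rate     # $20 total on CPUs
gpu_cost = gpu_hours * gpu_rate     # $2 total on GPUs
print(cpu_cost / gpu_cost)          # → 10.0
```

The design point is that total cost is rate times time, so a large time reduction dominates a modest rate increase.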

Beyond that, generative AI is the first wave of AI under discussion right now. Infrastructure players, ourselves and all the cloud service providers, are putting infrastructure in the cloud for developers to use these machines to train models, fine-tune models, secure models, and so on. Demand is so high that for every $1 spent with us, a cloud service provider can earn $5 in rental income. This is happening globally, and everything is in short supply.

We have seen some well-known applications, including OpenAI's ChatGPT and GitHub Copilot, as well as the code generators we use internally, and the productivity gains are incredible. Every software engineer in our company now uses a code generator, whether the one we built for CUDA, the one for USD (another language we use), or generators for Verilog, C, and C++.

So I believe the days when every line of code is written by a software engineer are over. In the future, every software engineer will have a digital engineer at their side, assisting them 24/7. That is the future. When I look at NVIDIA, we have 32,000 employees, but around them there may be far more digital engineers, perhaps 100 times as many.

5. Many industries are embracing these changes. Which use cases and industries are you most excited about?

Jensen Huang: In our own company, we use AI in computer graphics. We could not continue in computer graphics without artificial intelligence. We compute only one pixel and infer the other 32. In other words, we "imagine" the remaining 32 pixels to some extent, and they are visually stable and look photorealistic. The image quality and performance are excellent.

Calculating one pixel requires a lot of energy, while predicting the other 32 pixels requires very little energy and can be done very quickly. Therefore, AI is not just about training models, that is only the first step. What is more important is how to use the models. When you use the models, you save a lot of energy and time.
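The ratio described above works out as a toy calculation: the 1-in-33 split is from the text, while the energy claim is qualitative and not modeled here.

```python
# Of every 33 output pixels, only 1 is fully rendered; the other 32
# are inferred by the AI model, per the description above.
computed, inferred = 1, 32
fraction_rendered = computed / (computed + inferred)
print(f"{fraction_rendered:.1%} of pixels are fully rendered")  # → 3.0%
```

So only about 3% of pixels go through the expensive rendering path, which is where the energy and speed savings come from.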

Without AI, we could not serve the autonomous driving industry. Without AI, our work in robotics and digital biology would be impossible. Almost every technology and life sciences company now centers on NVIDIA, using our data processing tools for generating new proteins, small-molecule synthesis, virtual screening, and other areas that artificial intelligence will completely reshape.

6. Let's talk about competition and your competitive barriers. There are currently many public and private companies that want to challenge your leadership position. How do you view your competitive barriers?

Jensen Huang: First, I think a few things make us different. The first thing to remember is that AI is not just about a chip; AI is about the entire infrastructure. Today's computing is not about making a chip that people buy and put into a computer. That model belongs to the 1990s. Today's computers are built as supercomputing clusters, as infrastructure, as supercomputers. It is not just a chip, and it is not a conventional computer either.

So we are in fact building entire data centers. If you look at one of our supercomputing clusters, the software required to manage the system is extremely complex. There is no "Microsoft Windows" you can drop onto these systems; the custom software is developed by us for these superclusters. So it is natural that the company designing the chips, building the supercomputers, and developing this complex software is one and the same, which ensures optimization, performance, and efficiency.

Second, AI is fundamentally about algorithms. We are very good at understanding how algorithms work, and how the computing stack distributes computation across millions of processors running for days on end, while keeping the machines stable, energy-efficient, and able to finish the job quickly.

Lastly, the key to AI computing is the installed base. Having a unified architecture across all cloud platforms and on-premises deployments matters enormously. Whether you are building a supercomputing cluster in the cloud or running an AI model on a device, the same architecture should run all the same software. That is what an installed base means, and this architectural consistency since 1993 is one of the key reasons we are where we are today.

Therefore, if you want to start an AI company today, the most obvious choice is to use NVIDIA's architecture because we are present on all cloud platforms. Regardless of which device you choose, as long as it has the NVIDIA logo, you can directly run the same software.

7. Blackwell is 4 times faster in training and 30 times faster in inference than its predecessor, Hopper. Your pace of innovation is remarkably fast. Can you maintain it, and can your partners keep up?

Jensen Huang: Our basic approach is to keep driving architectural innovation. Each chip architecture's cycle is about two years, and two years is the best case. We also give them a mid-cycle upgrade every year, but a full architectural generation comes roughly every two years, which is already very fast.

We have seven different chips that work together for the entire system. We can launch new AI supercomputing clusters every year that are more powerful than the previous generation. This is because we have multiple parts that can be optimized. Therefore, we can deliver higher performance very quickly, and these performance improvements directly translate into a decrease in Total Cost of Ownership (TCO).

Blackwell's performance improvement means a customer with 1 gigawatt of power available can generate three times the revenue. Performance translates directly into throughput, and throughput translates into revenue.

Therefore, the return on investment from this performance gain is unparalleled, and a 3x revenue gap cannot be closed by cutting chip prices.
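The power-constrained revenue claim above can be sketched as simple throughput arithmetic. This is a simplified model: the 3x factor is the claim from the passage, not a measured benchmark, and the variable names are illustrative.

```python
# At a fixed power budget, revenue scales with throughput, and
# throughput scales with performance per watt.
power_budget_mw = 1000.0                       # 1 gigawatt, expressed in MW
perf_per_mw_old, perf_per_mw_new = 1.0, 3.0    # normalized performance per MW
revenue_ratio = (power_budget_mw * perf_per_mw_new) / (power_budget_mw * perf_per_mw_old)
print(revenue_ratio)  # → 3.0
```

The power budget cancels out of the ratio, which is the point: when power is the binding constraint, revenue tracks performance per watt, not chip count.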

8. How do you view the dependence on the Asian supply chain?

Jensen Huang: The Asian supply chain is complex and highly interconnected. An NVIDIA GPU is not just a chip; it is a complex system built from thousands of components, comparable to an electric vehicle. The Asian supply chain network is therefore very extensive and intricate. We strive to design diversity and redundancy into every link, so that even if problems arise we can quickly shift production elsewhere. In general, even if the supply chain is disrupted, we are able to adjust and ensure continuity of supply.

Currently we manufacture at TSMC because it is the best in the world, not by a little but by a lot. We have a long history of working together, and their flexibility and ability to scale are impressive.

Last year our revenue grew significantly, thanks to the supply chain's rapid response. TSMC's agility and ability to meet our needs are remarkable. In less than a year we have greatly increased capacity, and we will keep expanding next year and the year after. Their agility and capability are excellent. But if needed, we can of course turn to other suppliers.

9. Your company is in a very advantageous market position. We have discussed many very good topics. What are you most worried about?

Jensen Huang: Our company currently works with every AI company in the world, and with every data center. I don't know of a single cloud service provider or computer manufacturer we don't work with. With that kind of scale comes great responsibility. Our customers are emotional, because our products directly affect their revenues and competitiveness. Demand is enormous, and so is the pressure to meet it.

We are now in full production of Blackwell and plan to ship in the fourth quarter and scale up from there. Demand is so high that everyone wants the product as soon as possible and wants the largest share. The tension and intensity are unprecedented.

While it is very exciting to create the next generation of computer technology and see the innovation of various applications, we bear a great responsibility and feel a lot of pressure. But we are trying our best to do a good job. We have adapted to this intensity and will continue to work hard.

Editor/Somer


