
Is 2024 the breakout year for “liquid cooling”? Led by Nvidia, demand for liquid cooling is soaring

Zhitong Finance ·  May 10 22:10

The stock price of Vertiv (VRT.US), one of the liquid cooling solution providers for Nvidia (NVDA.US)'s most powerful AI GPU server, the GB200, has skyrocketed more than 600% since the start of 2023, including a 103% gain so far in 2024. Wall Street analysts argue that, powered by Nvidia, the undisputed leader in AI chips, liquid cooling is set to move from “optional” to “required” for ultra-high-performance AI servers. That implies an enormous future market for liquid cooling solutions, and in stock-price terms the upward run of liquid cooling leaders such as Vertiv may be far from over.

On its latest results and guidance, Vertiv also delivered numbers that pleased the market, signaling that demand for liquid cooling in global AI data centers has surged and, indirectly, that global demand for Nvidia's AI GPUs remains extremely strong. Vertiv, a liquid cooling solution provider for Nvidia's GB200, recently reported that total orders in the first quarter rose 60% year over year, and the period-end order backlog hit a record $6.3 billion. Q1 net sales were $1.639 billion, up 8% year over year, and adjusted operating profit reached $249 million, up 42% year over year.

Beyond the strong first-quarter orders and sales, Vertiv also raised its full-year 2024 guidance above market expectations. At the midpoint, sales are expected to grow about 12% over an already strong 2023, while adjusted operating profit of $1,325 million to $1,375 million implies roughly 28% growth over 2023 at the midpoint.

In China's A-share market, liquid cooling leader Shenzhen Envicool (002837.SZ) also posted an exceptionally strong first-quarter report. During the reporting period, Envicool's revenue reached 746 million yuan, up 41.36% year over year. Net profit attributable to shareholders of the listed company was 61.98 million yuan, up 146.93% year over year, and net profit after deducting non-recurring gains and losses was 54.31 million yuan, up 169.65% year over year.

Looking ahead, adoption of liquid cooling solutions is expected to enter a phase of explosive growth from 2024. According to Dell'Oro Group estimates from February 2024, the data center thermal management market (air cooling plus liquid cooling) will reach $12 billion in 2028, with liquid cooling accounting for roughly $3.5 billion, or nearly one-third of total thermal management spending, up from less than one-tenth today.

IDC, an internationally renowned research firm, recently released a report stating that China's liquid-cooled server market continued to grow rapidly in 2023, reaching $1.55 billion, up 52.6% from 2022, with more than 95% of deployments using cold plate liquid cooling solutions. IDC predicts that from 2023 to 2028 the market will compound at an annual growth rate of 45.8%, reaching $10.2 billion in 2028.
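As a quick sanity check, IDC's 2028 figure follows directly from compounding the 2023 base at the stated growth rate (a minimal sketch; the function name here is ours, not IDC's):

```python
def project_market(base: float, cagr: float, years: int) -> float:
    """Compound a base market size forward at an annual growth rate (CAGR)."""
    return base * (1.0 + cagr) ** years

# IDC: $1.55B in 2023, growing at a 45.8% CAGR through 2028 (5 years)
size_2028 = project_market(1.55, 0.458, 5)
print(f"Projected 2028 size: ${size_2028:.1f}B")  # → $10.2B
```

The compounded figure lands on IDC's published $10.2 billion, confirming the two numbers are internally consistent.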

Liquid cooling: gradually moving from “optional” to “required” for AI server cooling

Today, AI servers built around Nvidia's H100 GPUs use a variety of cooling approaches, but air cooling remains the mainstream choice. Although liquid cooling is gaining ground thanks to its advantages in high-performance computing, such as more effective thermal management and better energy efficiency, liquid-cooled deployments have not yet spread to all systems using H100 GPUs.

In the era of Nvidia's new Blackwell-architecture GPUs (the B100, B200, and GB200), the surge in AI GPU performance means that, both in theory and in practice, air cooling has nearly reached its capacity limit, and the era of liquid cooling has begun. As liquid cooling moves from “optional” to “required” in the AI server field, its market will expand dramatically, becoming one of the important segments of AI computing. Beyond keeping an AI GPU server running around the clock at optimal performance, liquid cooling also helps extend hardware service life.

The Nvidia GB200 supercomputing server's performance is in a class of its own. Nvidia's GB200 system pairs two B200 AI GPUs with the company's in-house Grace CPU, delivering up to a 30x improvement in inference performance on large language model (LLM) workloads. At the same time, compared with the previous Hopper architecture, the GB200 cuts cost and energy consumption by as much as 25x. On the GPT-3 LLM benchmark with 175 billion parameters, the GB200 offers 7x the inference performance of an H100 system and 4x its training speed.

This leap in performance means air cooling is no longer sufficient to keep such a computing system within its thermal limits, which is a key reason Nvidia is adopting liquid cooling at scale in the GB200 AI GPU servers slated for mass production in September.

As AI and machine learning algorithms grow more complex, demand for AI computing power is growing rapidly in step. In particular, when training large AI models or running large-scale AI inference, AI servers need high-performance GPUs to handle these computation-intensive tasks. Such GPUs (for example, Nvidia's GB200) generate significant heat when running and require effective cooling to maintain operational efficiency and hardware longevity. A liquid cooling system can move heat more quickly and effectively from heat sources such as GPUs to radiators, reducing heat buildup, greatly lowering the risk of transistor burnout, and keeping the GPU running at high performance over long periods.
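The benefit can be illustrated with a simple lumped thermal-resistance model (a hedged sketch: the power draw and resistance values below are illustrative assumptions, not published GB200 specifications). A liquid cold plate lowers the junction-to-ambient thermal resistance, which translates directly into a cooler chip at the same power draw:

```python
def junction_temp(power_w: float, r_theta: float, t_ambient_c: float = 35.0) -> float:
    """Steady-state junction temperature: T_j = T_ambient + P * R_theta.

    r_theta is the total junction-to-ambient thermal resistance in deg C per watt.
    """
    return t_ambient_c + power_w * r_theta

POWER_W = 1000.0  # assumed per-module power, roughly the class of a top AI GPU
print(f"{junction_temp(POWER_W, 0.06):.1f}")  # air heatsink, assumed 0.06 C/W → 95.0
print(f"{junction_temp(POWER_W, 0.03):.1f}")  # cold plate, assumed 0.03 C/W → 65.0
```

Halving the thermal resistance in this toy model cuts the temperature rise above ambient in half, which is why liquid cooling becomes mandatory once per-chip power climbs past what air can carry away.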

On technology routes, the industry consensus is that cold plate (indirect) liquid cooling will achieve broad adoption ahead of direct liquid cooling. Liquid cooling systems are classified by how the liquid contacts the hardware: direct liquid cooling includes immersion and spray cooling, while indirect liquid cooling mainly means cold plate solutions. Cold plate technology is mature, requires no changes to existing server form factors, is relatively easy and cheap to implement, and its cooling capacity and power consumption meet AI server requirements, so it is expected to be rolled out first.

According to a research report from the well-known firm MarketsandMarkets, the global data center liquid cooling market is expected to grow from $2.6 billion in 2023 to at least $7.8 billion in 2028, a compound annual growth rate of 24.4% over the forecast period. MarketsandMarkets said that the rise of AI servers, edge computing, and Internet of Things (IoT) devices demands compact and effective cooling solutions, and that liquid cooling excels at handling large data volumes in challenging conditions by cooling small devices and compact servers. Overall, the market is driven primarily by modern data centers' need for better cooling efficiency, energy efficiency, scalability, sustainability, and higher-performance GPUs as they process ever-larger volumes of data.

Wall Street analysts are broadly optimistic that the enormous scale of global corporate investment in AI will support continued expansion of data center capacity, a major tailwind for Vertiv. Most of the company's revenue comes from data center power management products and the liquid and hybrid cooling systems used in data centers; its core business is providing power management and cooling technologies for data centers worldwide.

Vertiv is currently committed to developing advanced liquid cooling solutions for AI data centers. According to public information, the next-generation NVIDIA AI GPU accelerated data center advanced liquid cooling solution developed by Vertiv and AI chip leader Nvidia (NVDA.US) is expected to be suitable for the GB200. Vertiv's high energy density power and cooling solutions are designed to support Nvidia's next-generation GPUs to safely run the most compute-intensive AI workloads with optimal performance and high availability.

According to compiled data, Wall Street analysts have given Vertiv 8 “buy” ratings, 1 “hold” rating, and no “sell” ratings, for a consensus of “strong buy,” with the most optimistic price target at $102 (the stock closed at a record high of $97.94 on Thursday). Oppenheimer & Co. analyst Noah Kaye emphasized that “AI megatrends” are expanding the addressable market for AI data center capacity, and that Vertiv's high-density computing market alone could reach $25 billion by 2026.

This liquid cooling technology leader from China is favored by Wall Street giant Goldman Sachs

Wall Street bank Goldman Sachs believes that artificial intelligence, the fuel driving global equities, is far from spent. In a recent forecast report, the bank said the global stock market is only in the first phase of an AI-led investment boom, one that will extend through a second, third, and fourth stage and lift more and more industries worldwide.

“If Nvidia represents the first phase of the AI trade, the most directly benefiting AI chip phase, then the second phase belongs to the other companies worldwide helping to build AI-related infrastructure,” the bank wrote. “The third phase should favor companies that embed AI in their products to grow revenue, while the fourth phase is a broad AI-driven rise in productivity that many companies around the world can realize.”

In the second phase of the AI investment boom, the focus shifts to companies building AI infrastructure other than Nvidia, including semiconductor equipment providers such as ASML and Applied Materials, chip manufacturers, cloud service providers, data center REITs, data center hardware and equipment companies, software security stocks, and utilities. For this stage, Goldman Sachs specifically named one Chinese listed company in its research report: Shenzhen Envicool, which focuses on precision liquid cooling technology for servers, data centers, and energy storage systems.

IDC predicts that from 2023 to 2028, China's liquid-cooled server market will compound at 45.8% annually, reaching $10.2 billion in 2028. Per IDC, driven by industry demand and policy, the market expanded further in 2023, and ever more partners are joining the liquid cooling ecosystem, indicating a very positive market attitude toward liquid cooling for data centers. As Chinese AI companies and institutions demand ever more intelligent-computing capacity, in both construction and compute supply, the energy consumption of IT equipment in these data centers has risen dramatically, requiring efficient liquid cooling systems to maintain proper operating temperatures; otherwise, the lifecycle management and operation of large-model products will face serious challenges.

Liquid cooling technology is popular all over the world, suggesting that demand for Nvidia's AI GPUs is extremely strong

Liquid cooling leaders such as Vertiv and Envicool have delivered exceptionally strong results, and analysts' bullish expectations for Vertiv keep rising, suggesting that global data centers, and AI data centers in particular, are seeing surging demand for liquid cooling solutions. It also indirectly shows that global demand for Nvidia's AI GPUs, both the Hopper architecture and the newly unveiled Blackwell architecture, remains extremely strong.

Goldman Sachs expects that the four major technology companies, Microsoft, Google, Amazon's AWS, and Facebook parent company Meta, will invest as much as 177 billion US dollars in cloud computing this year, far higher than the 119 billion US dollars last year, and will continue to increase to an astonishing 195 billion US dollars in 2025.

According to media reports, Microsoft and OpenAI are in detailed negotiations over a global data center project costing as much as $100 billion, which would include an AI supercomputer tentatively named “Stargate.” It would be the largest facility in a series of AI supercomputing installations the two AI leaders plan to build over the next six years.

Needless to say, this behemoth of an AI supercomputer will be equipped with millions of units of its core compute hardware, Nvidia's continuously upgraded AI GPUs, intended to power OpenAI's future, more capable GPT models and AI applications even more disruptive than ChatGPT and the Sora text-to-video model.

Although growth in demand for core hardware such as AI GPUs may level off as supply bottlenecks ease, the underlying hardware market will keep expanding, and the shortage of Nvidia's high-performance AI GPUs may be hard to fully resolve in the next few years. This is also the core logic behind Goldman Sachs and other Wall Street banks' expectation that Nvidia will hit the $1,100 mark within the next year (Nvidia closed at $887.47 on Thursday).

In particular, large AI models and the AI software built on them must keep iterating, which means the software side will inevitably continue to purchase or upgrade AI GPU systems, so the AI hardware market will remain extremely large for years to come. According to the latest forecast from market research firm Gartner, the AI chip market will grow 25.6% year over year to $67.1 billion in 2024, and by 2027 it is expected to more than double from its 2023 size, reaching $119.4 billion.


