Mellanox HDR 200G InfiniBand Deep Learning Acceleration Engines Demonstrates Two Times Higher Performance for Artificial Intelligence (AI) Platformswith NVIDIA

梅拉諾克斯 HDR 200G InfiniBand 深度學習加速引擎展現人工智慧 (AI) 平台的效能提升兩倍

Business Wire · 2019/03/18 12:30

Mellanox In-Network Computing “Hierarchical Aggregation and Reduction Protocol” (SHARP)™ Technology in Combination with NVIDIA Collective Communications Library (NCCL) Delivers Performance Breakthrough to AI

Mellanox Technologies, Ltd. (NASDAQ: MLNX), a leading supplier of high-performance, end-to-end smart interconnect solutions for data center servers and storage systems, today announced that its HDR 200G InfiniBand with the “Scalable Hierarchical Aggregation and Reduction Protocol” (SHARP)™ technology has set new performance records, doubling deep learning operations performance. The combination of Mellanox In-Network Computing SHARP with NVIDIA® 100 Tensor Core GPU technology and Collective Communications Library (NCCL) deliver leading efficiency and scalability to deep learning and artificial intelligence applications.

The combination of the state-of-the-art NVIDIA GPUs, Mellanox’s InfiniBand, GPUDirect RDMA and NCCL to train neural networks has already become a de-facto standard when scaling out deep learning frameworks, such as Caffe, Caffe2, Chainer, MXNet, TensorFlow, and PyTorch. With the Mellanox SHARP technology and HDR InfiniBand, deep learning training’s data aggregation operations can be offloaded and accelerated by the InfiniBand network, resulting in improving their performance by two times.

The joint effort with NVIDIA and testing performed in Mellanox’s performance labs, using the Mellanox HDR InfiniBand Quantum connecting four system hosts, each with eight NVIDIA V100 Tensor Core GPUs with NVLink interconnect technology and a single ConnectX-6 HDR adapter per host, have achieved an effective reduction bandwidth of 19.6GB/s by integrating SHARP’s native streaming aggregation capability with NVIDIA’s latest NCCL 2.4 library, which now takes full advantage of the bi-directional bandwidth available from the Mellanox interconnect. This implementation is effectively two times higher bandwidth than NVIDIA’s current tree-based implementation using the same hardware configuration.

In the more common setup for this configuration, four HCAs in each system host are used for balanced performance across a variety of workloads where the initial SHARP and NCCL results yielded an expected 70.3GB/s. For more densely populated GPU-based systems, like NVIDIA DGX-2, which houses 16 NVIDIA V100 Tensor Core GPUs with NVLink in each system node, the in-network capabilities and available bidirectional bandwidth of the Mellanox fabric can be fully leveraged.

“Our long-standing collaboration with NVIDIA has again delivered a robust solution that takes full advantage of the best-of-breed capabilities from Mellanox InfiniBand, including GPUDirect RDMA and now extending in-network computing to NCCL, which delivers two times better performance for AI,” said Gilad Shainer, Vice President of Marketing at Mellanox Technologies. “HDR InfiniBand in-network computing acceleration engines, including the SHARP technology, provide the highest performance and scalability for HPC and AI workloads.”

“Mellanox solutions amplify NVIDIA’s unmatched CUDA-X acceleration libraries using NCCL, our open source collective communication library,” said Ian Buck, vice president and general manager of Accelerated Computing at NVIDIA. “Together, we offer solutions that ensure the most demanding AI applications in the data center benefit from cutting-edge performance and scaling efficiency.”

Supporting Resources:Learn more about Mellanox SHARP™Learn more about Mellanox Quantum™ HDR 200Gb/s InfiniBand Smart SwitchesFollow Mellanox on Twitter , Facebook , Google+ , LinkedIn , and YouTubeJoin the Mellanox Community

About Mellanox

Mellanox Technologies (NASDAQ: MLNX) is a leading supplier of end-to-end Ethernet and InfiniBand smart interconnect solutions and services for servers and storage. Mellanox interconnect solutions increase data center efficiency by providing the highest throughput and lowest latency, delivering data faster to applications, unlocking system performance and improving data security. Mellanox offers a choice of fast interconnect products: adapters, switches, software and silicon that accelerate application performance and maximize business results for a wide range of markets including cloud and hyperscale, high performance computing, artificial intelligence, enterprise data centers, cyber security, storage, financial services and more. More information is available at:http://www.mellanox.com.

Note: Mellanox, ConnectX-6, Mellanox Quantum, Mellanox Scalable Hierarchical Aggregation and Reduction Protocol (SHARP), and Mellanox logo are registered trademarks of Mellanox Technologies, Ltd. All other trademarks are property of their respective owners.

Mellanox 網路內運算「階層式聚合與縮減通訊協定」(SHARP)™ 技術與 NVIDIA 集體通訊庫 (NCCL) 相結合，為人工智慧提供效能突破

資料中心伺服器和儲存系統高效能、端對端智慧互連解決方案的領先供應商梅拉諾克斯科技有限公司 (NASDAQ: MLNX) 今天宣佈，其 HDR 200G InfiniBand 搭載「可擴展分層聚合與減少通訊協定」(SHARP)™ 技術，創造了新的效能記錄，使深度學習運作效能倍增。結合 Mellanox 網內運算 SHARP 與 NVIDIA® 100 張量核心 GPU 技術和集體通訊庫 (NCCL)，為深度學習和人工智慧應用提供領先的效率和可擴充性。

結合最先進的 NVIDIA GPU、梅拉諾克斯的 Infiniband、GPUDirect RDMA 和 NCCL 來訓練神經網路的結合，已經成為深度學習架構 (例如咖啡、咖啡 2、鎖匠、MXNet、TensorFlow 和 PyTorch) 等深度學習架構的事實上標準。借助 Mellanox SHARP 技術和 HDR InfiniBand，可以通過 InfiniBand 網絡卸載和加速深度學習培訓的數據聚合操作，從而使其性能提高兩倍。

與 NVIDIA 的共同努力和在梅拉諾克斯的性能實驗室進行測試，使用梅拉諾克斯 HDR 無限量子連接四個系統主機，每個主機都具有八個 NVIDIA V100 張量子 GPU 和 NVLink 互連技術的單個 ConnectX-6 HDR 適配器，通過整合夏普的本地流媒體聚合能力與 NVIDIA 2.4 的最新庫實現了 19.6GB/s 的帶寬，現在採用了最新的 NCCL 庫的 NCCL 集成功能從梅拉諾克斯互連可用的雙向帶寬。這項實作實際上是 NVIDIA 目前使用相同硬體組態的樹型實作高兩倍的頻寬。

在較為常見的設定中，每個系統主機中有四個 HCA 用於在各種工作負載之間達到平衡效能，其中初始 SHARP 和 NCCL 結果產生預期的 70.3Gb/s。對於使用較密集的 GPU 系統，例如 NVIDIA DGX-2，每個系統節點中容納 16 個 NVIDIA V100 Tensor 核心 GPU，可在每個系統節點使用 Mellanox 網路頻寬和雙向網路功能充分利用。

梅拉諾克斯科技行銷副總裁 Gilad Shainer 表示：「我們與 NVIDIA 的長期合作再次提供強大的解決方案，充分利用 Mellanox InfiniBand 在內的最佳功能，現在將網路內運算擴展至 NCCL，為人工智慧提供兩倍的效能。HDR InfiniBand 網路內部運算加速引擎 (包括 SHARP 技術) 可為 HPC 和人工智慧工作負載提供最高的效能和可擴充性。」

NVIDIA 副總裁兼加速運算總經理 Ian Buck 表示：「使用我們的開放原始碼集體通訊庫 NCCL，梅拉諾克斯解決方案擴大了 NVIDIA 無與倫比的 CUDA-X 加速程式庫。我們攜手合作提供的解決方案，確保資料中心內最嚴苛的 AI 應用程式受益於尖端效能和擴充效率。」

支援資源：了解更多關於梅拉諾克斯夏普™ 了解更多關於梅拉諾克斯量子™ HDR 200Gb/s 無限比賓智慧切換器在推特、臉書、谷歌、LinkedIn 和 YouTube 上關注梅拉諾克斯社群

About Mellanox

關於梅拉諾克斯

Mellanox 科技（納斯達克股票代碼：MLNX）是一家領先的供應商，為服務器和存儲提供端到端以太網和 InfiniBand 智能互連解決方案和服務。Mellanox 互連解決方案透過提供最高輸送量和最低延遲來提高資料中心效率，更快地為應用程式提供資料、釋放系統效能並改善資料安全性。Mellanox 提供一系列快速互連產品：配接卡、交換器、軟體和矽晶片，可加速應用程式效能並在雲端和超大規模、高效能運算、人工智慧、企業資料中心、網路安全、儲存、金融服務等廣泛市場中獲得最佳業務成果。更多信息可在以下位置獲得：http://www.mellanox.com.

注意：梅拉諾克斯，ConnectX-6，梅拉諾克斯量子，梅拉諾克斯可擴展分層聚合和還原協議（SHARP）和梅拉諾克斯標誌是梅拉諾克斯科技有限公司的註冊商標，所有其他商標均為其各自所有者的財產。

譯文內容由第三人軟體翻譯。

以上內容僅用作資訊或教育之目的，不構成與富途相關的任何投資建議。富途竭力但無法保證上述全部內容的真實性、準確性和原創性。

Mellanox HDR 200G InfiniBand Deep Learning Acceleration Engines Demonstrates Two Times Higher Performance for Artificial Intelligence (AI) Platformswith NVIDIA

Mellanox HDR 200G InfiniBand Deep Learning Acceleration Engines Demonstrates Two Times Higher Performance for Artificial Intelligence (AI) Platformswith NVIDIA

風險及免責聲明

聲明