NVIDIA HPC-X

Increase scalability and performance of messaging communications.

NVIDIA® HPC-X® is a comprehensive software package that includes Message Passing Interface (MPI), Symmetrical Hierarchical Memory (SHMEM) and Partitioned Global Address Space (PGAS) communications libraries, and various acceleration packages. This full-featured, tested, and packaged toolkit enables MPI and SHMEM/PGAS programming languages to achieve high performance, scalability, and efficiency and ensures that communication libraries are fully optimized by NVIDIA Quantum InfiniBand networking solutions.

Performance at Any Scale

HPC-X takes advantage of NVIDIA Quantum InfiniBand hardware-based networking acceleration engines to maximize application performance. It dramatically reduces MPI operation time, freeing up valuable CPU resources, and decreases the amount of data traversing the network, allowing unprecedented scale to reach evolving performance demands.

NVIDIA In-Network Computing

Software and Acceleration Packages

HPC-X MPI

MPI is a standardized, language-independent specification for writing message-passing programs. NVIDIA HPC-X MPI is a high-performance, optimized implementation of Open MPI that takes advantage of NVIDIA’s additional acceleration capabilities, while providing seamless integration with industry-leading commercial and open-source application software packages.

HPC-X OpenSHMEM

The HPC-X OpenSHMEM programming library is a one-side communications library that supports a unique set of parallel programming features, including point-to-point and collective routines, synchronizations, atomic operations, and a shared memory paradigm used between the processes of a parallel programming application.

In-Network Computing

NVIDIA Scalable Hierarchical Aggregation and Reduction Protocol (SHARP™) improves the performance of MPI operations by offloading them from the CPU to the switch network and eliminating the need to send data multiple times, decreasing the amount of data traversing the network and dramatically reducing MPI operation time.

Unified Communication X

Unified Communications X (UCX®) is an open-source communication framework for data-centric and high-performance applications. UCX provides a low-overhead, point-to-point communication path for near-native, hardware-level performance with cross-platform support.

Unified Collective Communication

Unified Collective Communication (UCC) is an open-source communication framework for high-performance applications. UCC provides low-overhead collective communications leveraging InfiniBand In-Network Computing acceleration engines.

NCCL/SHARP and UCX Support

NCCL-RDMA plug-ins enable remote direct-memory access (RDMA) and switch-based collectives (SHARP) with the NVIDIA Collective Communication Library (NCCL). The NCCL UCX plug-in replaces the default NCCL verbs-based inter-node communication routines with UCX-based communication routines for enhanced performance.

ClusterKit

ClusterKit is a multifaceted node-assessment tool for high-performance clusters. ClusterKit employs well-known techniques and tests to ensure the health and performance of a cluster.

Key Features

  • Offloads collectives communications from MPI onto NVIDIA Quantum InfiniBand networking hardware
  • Multiple transport support, including Reliable Connection (RC), Dynamic Connected (DC), and Unreliable Datagram (UD)
  • Intra-node shared memory communication
  • Receive-side tag matching
  • Native support for MPI-3
  • Multi-rail support with message striping
  • NVIDIA GPUDirect® with CUDA® support
  • NCCL-RDMA-SHARP plug-in support

Benefits

  • Increases CPU availability, application scalability, and system efficiency for improved application performance
  • Ensures node-level and system-level health and performance
  • Maximizes application performance with underlying hardware architecture
  • Fully optimized for NVIDIA Quantum InfiniBand networking solutions
  • Supports any interconnect based on InfiniBand or Ethernet standards

Resources

See how you can build the most efficient, high-performance network.


Request a Demo

Configure Your Cluster

Take Networking Courses

Ready to Purchase?