NVIDIA GB200 NVL72

New Wave of Computing with NVIDIA GB200 NVL72

Introducing the cutting-edge NVIDIA GB200 compute tray, a revolutionary solution utilising the full potential of NVIDIA's innovative MGX design. With 2 powerful Grace CPUs and 4 advanced Blackwell GPUs, it delivers unparalleled performance to tackle the most demanding generative AI, data analytics and HPC tasks.

Unrivalled Performance in...

LLM Inference

1.7 TB fast memory for trillion-parameter LLMs

Multimodal Transformer Models

900 GB/s bandwidth accelerates multimodal transformer tasks

Generative 3D Capabilities

1.8 TB/s of bidirectional NVLink throughput per GPU for generative 3D models

Data Processing

Up to 18x faster data processing than typical CPUs

Computing Revolution with NVIDIA GB200 NVL72 

Real-Time Inference for Trillion-Parameter LLMs

Experience unparalleled real-time LLM inference with the GB200 NVL72. Its cutting-edge second-generation Transformer Engine, FP4 AI and fifth-generation NVIDIA NVLink deliver up to 30x faster performance for trillion-parameter language models. This breakthrough is powered by a new generation of Tensor Cores that introduce advanced microscaling formats for superior accuracy and throughput. The GB200 NVL72 combines NVLink and liquid cooling in a massive 72-GPU rack that overcomes communication bottlenecks.
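As a rough, purely illustrative sketch of why low-precision formats matter at this scale, the short Python snippet below estimates the weight-only memory footprint of a one-trillion-parameter model at FP16, FP8 and FP4 (simple arithmetic on an assumed parameter count, not a measurement on GB200 hardware).

```python
# Rough estimate of weight-only memory footprint for a 1-trillion-parameter model.
# Illustrative arithmetic only; real deployments also need KV cache, activations
# and framework overhead, so actual requirements are higher.

PARAMS = 1_000_000_000_000  # 1 trillion parameters (assumed model size)

BYTES_PER_PARAM = {
    "FP16/BF16": 2.0,
    "FP8": 1.0,
    "FP4": 0.5,
}

for fmt, nbytes in BYTES_PER_PARAM.items():
    tb = PARAMS * nbytes / 1e12  # terabytes of weights
    print(f"{fmt:>10}: ~{tb:.1f} TB of weights")

# At FP4 the weights of a 1T-parameter model come to roughly 0.5 TB, which is
# why the ~1.7 TB of fast memory per GB200 compute tray (and the far larger
# pooled memory of a full NVL72 rack) can keep such models close to the GPUs
# for real-time inference.
```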

Massive LLM Training at High Speed

Accelerate massive-scale training with the GB200 NVL72's second-generation Transformer Engine featuring FP8 precision, delivering up to 4x faster training for large language models. This groundbreaking performance is powered by fifth-generation NVLink, which connects up to 576 GPUs in a single NVLink domain with over 1 PB/s of total bandwidth and 240 TB of fast memory. The system is also equipped with InfiniBand networking for superior speed and efficiency at scale.
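The NVLink-domain figures above can be cross-checked with simple arithmetic; the Python sketch below derives them from the per-GPU numbers quoted on this page (rounded marketing figures, so the results are approximate).

```python
# Cross-check of the NVLink-domain figures quoted above, using per-GPU numbers
# from this page. These are rounded marketing figures, so results are approximate.

GPUS_PER_DOMAIN = 576           # max GPUs in one fifth-generation NVLink domain
NVLINK_BW_PER_GPU_TBS = 1.8     # TB/s bidirectional NVLink bandwidth per GPU
FAST_MEMORY_TB = 240            # fast memory across the domain (from this page)

total_bw_tbs = GPUS_PER_DOMAIN * NVLINK_BW_PER_GPU_TBS
print(f"Aggregate NVLink bandwidth: ~{total_bw_tbs:.0f} TB/s (~{total_bw_tbs / 1000:.1f} PB/s)")
# ~1,037 TB/s, i.e. just over the 'more than 1 PB/s' quoted above.

per_gpu_fast_gb = FAST_MEMORY_TB / GPUS_PER_DOMAIN * 1000
print(f"Fast memory per GPU: ~{per_gpu_fast_gb:.0f} GB")
# Roughly 417 GB per GPU: the 192 GB of HBM3e plus a share of the Grace CPUs'
# memory pooled over NVLink-C2C.
```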

Benefits of NVIDIA Blackwell GB200

Groundbreaking Blackwell Architecture

The NVIDIA Blackwell architecture brings groundbreaking advancements to accelerated computing, ushering in a new era of unparalleled performance, efficiency and scalability. Each Blackwell GPU packs 208 billion transistors and is manufactured using a custom-built TSMC 4NP process.

Sustainable Computing

Embrace a greener future with the liquid-cooled GB200 NVL72 racks. Reduce your carbon footprint and energy consumption while delivering 25X more performance than NVIDIA H100 air-cooled infrastructure at the same power.

Accelerated Data Processing

Experience high-bandwidth memory performance, NVLink-C2C, and dedicated decompression engines to accelerate key database queries by 18X compared to CPUs. Enjoy a 5X better TCO and revolutionise your data processing capabilities.
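One common way to tap this kind of GPU-side data processing is NVIDIA's RAPIDS cuDF library, which exposes a pandas-like dataframe API on the GPU. The sketch below is a minimal, hypothetical example (the file name and column names are placeholders, and it is not a GB200-specific benchmark).

```python
# Minimal sketch of GPU-accelerated data processing with RAPIDS cuDF.
# File name and column names are placeholders; this illustrates the
# pandas-like API rather than any specific benchmark on GB200 hardware.

import cudf  # requires a RAPIDS installation and an NVIDIA GPU

# Read a Parquet file directly into GPU memory.
df = cudf.read_parquet("transactions.parquet")

# A typical analytical query: filter, group and aggregate, all on the GPU.
summary = (
    df[df["amount"] > 100]
    .groupby("customer_id")
    .agg({"amount": "sum", "order_id": "count"})
)

print(summary.head())
```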

Breakthrough CPU Performance

The NVIDIA Grace CPU redefines modern data centre computing with outstanding performance and memory bandwidth. Experience 2X the energy efficiency of today's leading server processors while powering AI, cloud and HPC applications with unprecedented speed.

Seamless Interconnectivity

Unlock the full potential of exascale computing and trillion-parameter AI models with fifth-generation NVIDIA NVLink. This scale-up interconnect unleashes accelerated performance for swift and seamless communication between every GPU in your server cluster.
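In practice, frameworks reach NVLink through collective communication libraries such as NCCL. The PyTorch sketch below shows a minimal all-reduce across the GPUs in one node; it is a generic illustration (launched with torchrun, with an assumed script name) rather than an NVL72-specific configuration.

```python
# Minimal all-reduce across GPUs using PyTorch's NCCL backend, which routes
# traffic over NVLink when it is available. Launch with torchrun, e.g.:
#   torchrun --nproc_per_node=8 allreduce_demo.py
# Generic illustration, not an NVL72-specific configuration.

import os
import torch
import torch.distributed as dist

def main():
    dist.init_process_group(backend="nccl")      # NCCL selects NVLink paths automatically
    local_rank = int(os.environ["LOCAL_RANK"])   # set by torchrun
    torch.cuda.set_device(local_rank)

    # Each rank contributes a tensor; after all_reduce every rank holds the sum.
    x = torch.ones(1024, device="cuda") * dist.get_rank()
    dist.all_reduce(x, op=dist.ReduceOp.SUM)

    if dist.get_rank() == 0:
        print(f"Sum across {dist.get_world_size()} ranks:", x[0].item())

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```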

High-Performance Networking

NVIDIA Quantum-X800 InfiniBand, NVIDIA Spectrum-X800 Ethernet and NVIDIA BlueField®-3 DPUs enable efficient scalability across hundreds of thousands of Blackwell GPUs for optimal application performance.

Technical Specifications

GPU: NVIDIA GB200 NVL72

GPU Memory: 192 GB HBM3e

Power: 1200W

FP4 Tensor Core: 20 petaFLOPS
FP8/FP6 Tensor Core: 10 petaFLOPS
INT8 Tensor Core: 10 petaOPS
FP16/BF16 Tensor Core: 5 petaFLOPS
TF32 Tensor Core: 2.5 petaFLOPS
FP64 Tensor Core: 45 teraFLOPS
GPU Memory: Up to 192 GB HBM3e
Memory Bandwidth: Up to 8 TB/s
Multi-Instance GPU (MIG): 7
Decompression Engine: Yes
Decoders: 2x 7 NVDEC, 2x 7 NVJPEG
Power: Configurable up to 1,200W
Interconnect: 5th Generation NVLink: 1.8 TB/s; PCIe Gen6: 256 GB/s
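For a rack-level view, the short Python sketch below simply multiplies the per-GPU figures from the table above by the 72 GPUs in an NVL72 rack (straightforward scaling of the quoted numbers; official system-level figures may be rounded differently).

```python
# Scale the per-GPU specifications above to a full 72-GPU NVL72 rack.
# Simple multiplication of the quoted per-GPU figures; vendor rack-level
# numbers may differ slightly because of rounding.

GPUS_PER_RACK = 72

PER_GPU = {
    "FP4 Tensor Core (petaFLOPS)": 20,
    "FP8/FP6 Tensor Core (petaFLOPS)": 10,
    "HBM3e capacity (GB)": 192,
    "HBM3e bandwidth (TB/s)": 8,
}

for name, value in PER_GPU.items():
    print(f"{name}: {value * GPUS_PER_RACK:,} per rack")
# e.g. 72 x 192 GB = 13,824 GB (~13.8 TB) of HBM3e and
#      72 x 20 petaFLOPS = 1,440 petaFLOPS of FP4 compute per rack.
```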

Reserve your NVIDIA GB200 NVL72 today!

Frequently Asked Questions

Our product support and product development go hand in hand to deliver you the best solutions available.

What are the key benefits of NVIDIA Blackwell GB200 for LLM inference and training?

The NVIDIA GB200 delivers groundbreaking performance for LLM inference and training. Its second-generation Transformer Engine with FP4/FP8 precision, massive GPU interconnect bandwidth and large HBM3e memory capacity enable real-time inference on trillion-parameter LLMs and dramatically faster training.

What advantages does the NVIDIA GB200 GPU Card offer for data processing workloads?

The NVIDIA GB200 GPU Card can accelerate key database queries by up to 18x over typical CPUs thanks to its high memory bandwidth, NVLink interconnects, and dedicated decompression engines. This enables a 5x better total cost of ownership for data processing workloads.

What interconnect technologies enable large-scale Blackwell GPU deployments?

The 5th generation NVIDIA NVLink interconnect provides up to 130 TB/s of aggregate GPU bandwidth for seamless multi-GPU communication. Combined with NVIDIA's high-speed InfiniBand, Ethernet and DPU technologies, this enables efficient scalability across thousands of Blackwell GPUs.

How much memory does the NVIDIA GB200 GPU Card have?

The NVIDIA GB200 GPU Card has up to 192 GB of HBM3e memory.

How much power does the NVIDIA GB200 GPU consume?

The NVIDIA GB200 power consumption is up to 1,200W.