NVIDIA GB300 NVL72

NVIDIA GB300 NVL72
for AI Reasoning at Scale

Get ultra-scale AI training, reasoning and inference with the NVIDIA GB300 NVL72 in a single liquid-cooled rack.

Now available on Hyperstack Secure Private Cloud.

Reserve Now

Unrivalled
Performance in…

AI Reasoning and Inference

37 TB of fast memory and 130 TB/s NVLink bandwidth power real-time inference at a scale previously impossible in a single rack.

Trillion-Parameter LLM Training

1440 FP4 PFLOPS with sparsity accelerates large language model training to new levels of speed and cost efficiency.

High-Density HPC Workloads

2,592 Arm Neoverse V2 CPU cores alongside 20 TB of GPU memory ensure CPU-bound and GPU-bound tasks run in lockstep.

Large-Scale Data Processing

Up to 576 TB/s GPU memory bandwidth eliminates bottlenecks across the largest analytic and simulation pipelines

Key Features of
NVIDIA GB300 NVL72

Faster AI
Inference

The NVIDIA GB300 NVL72 delivers 2x the attention-layer acceleration and 1.5x more AI compute FLOPS than standard Blackwell GPUs, helping you run larger reasoning and inference workloads with higher throughput and lower latency.

Expanded GPU Memory

With 1.5x more HBM3e memory capacity, you can support longer context windows, larger batch sizes, and more demanding AI models while maintaining maximum throughput.

Next-Gen Accelerated Computing

Blackwell Ultra brings major advances in accelerated computing, delivering the performance, efficiency, and scalability needed for next-generation AI and HPC workloads.

High-Speed AI Networking

Each GPU connects through dual NVIDIA ConnectX-8 SuperNICs, providing up to 800 Gb/s of high-speed networking for ultra-efficient AI cluster communication and RDMA performance.

High-Bandwidth GPU Interconnect

Fifth-generation NVIDIA NVLink enables high-bandwidth communication between every GPU in the rack, accelerating large-scale AI reasoning and distributed workloads.

Technical Specifications

NVIDIA GB300 NVL72

GPU

NVIDIA GB300 NVL72

Configuration

72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs

NVLink Bandwidth

130 TB/s

Fast Memory

37 TB

GPU Memory / Bandwidth

20 TB / Up to 576 TB/s

CPU Memory / Bandwidth

17 TB LPDDR5X / 14 TB/s

CPU Core Count

2,592 Arm Neoverse V2 cores

CPU Core Count

2,592 Arm Neoverse V2 cores

FP4 Tensor Core

1440 / 1080 PFLOPS

FP8/FP6 Tensor Core

720 PFLOPS

INT8 Tensor Core

24 POPS

FP16/BF16 Tensor Core

360 PFLOPS

TF32 Tensor Core

180 PFLOPS

FP32

6 PFLOPS

FP64 / FP64 Tensor Core

100 TFLOPS

GPU

NVIDIA GB300 NVL72

Configuration

72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs

NVLink Bandwidth

130 TB/s

Fast Memory

37 TB

GPU Memory / Bandwidth

20 TB / Up to 576 TB/s

CPU Memory / Bandwidth

17 TB LPDDR5X / 14 TB/s

CPU Core Count

2,592 Arm Neoverse V2 cores

CPU Core Count

2,592 Arm Neoverse V2 cores

FP4 Tensor Core

1440 / 1080 PFLOPS

FP8/FP6 Tensor Core

720 PFLOPS

INT8 Tensor Core

24 POPS

FP16/BF16 Tensor Core

360 PFLOPS

TF32 Tensor Core

180 PFLOPS

FP32

6 PFLOPS

FP64 / FP64 Tensor Core

100 TFLOPS

Frequently Asked Questions

Our product support and development go hand in hand to deliver you the best solutions available.

What is the NVIDIA GB300 NVL72?

The NVIDIA GB300 NVL72 is a rack-scale AI computing system built around 72 NVIDIA Blackwell Ultra GPUs and 36 NVIDIA Grace CPUs. It delivers 1440 FP4 PFLOPS of AI compute, 37 TB of fast memory, and 130 TB/s of NVLink bandwidth for AI reasoning, inference, and large-scale training.

How does the NVIDIA GB300 NVL72 differ from the NVIDIA GB200 NVL72?

The NVIDIA GB300 NVL72 uses NVIDIA Blackwell Ultra GPUs, which deliver 2x the attention-layer acceleration and 1.5x more AI compute FLOPS compared to the NVIDIA Blackwell GPUs in the GB200 NVL72. Each GPU also features 288 GB of HBM3e memory, with total rack-level fast memory increasing to 37 TB. At the system level, the GB300 NVL72 delivers up to 50x higher AI factory output versus NVIDIA hopper platforms, combining 10x better user-facing latency and 5x greater throughput per megawatt.

What workloads is the NVIDIA GB300 NVL72 best suited for?

The NVIDIA GB300 NVL72 is optimised for AI reasoning inference, large language model training and inference, high-performance computing, and data-intensive analytics. It is particularly suited for test-time scaling workloads and long-context AI models that require massive memory and compute resources.

NVIDIA GB300 NVL72

NVIDIA GB300 NVL72
for AI Reasoning at Scale

Unrivalled
Performance in…

AI Reasoning and Inference

Trillion-Parameter LLM Training

High-Density HPC Workloads

Large-Scale Data Processing

Key Features of
NVIDIA GB300 NVL72

Faster AI
Inference

Expanded GPU Memory

Next-Gen Accelerated Computing

High-Speed AI Networking

High-Bandwidth GPU Interconnect

NVIDIA GB300 NVL72

Access NVIDIA GB300 NVL72
on Secure Private Cloud

Frequently Asked Questions

What is the NVIDIA GB300 NVL72?

How does the NVIDIA GB300 NVL72 differ from the NVIDIA GB200 NVL72?

What workloads is the NVIDIA GB300 NVL72 best suited for?

United Kingdom (Head office)

Registered Office

Spain

Solutions

Resources

Site map

Products

Legal

NVIDIA GB300 NVL72

NVIDIA GB300 NVL72 for AI Reasoning at Scale

Unrivalled Performance in…

AI Reasoning and Inference

Trillion-Parameter LLM Training

High-Density HPC Workloads

Large-Scale Data Processing

Key Features of NVIDIA GB300 NVL72

Faster AI Inference

Expanded GPU Memory

Next-Gen Accelerated Computing

High-Speed AI Networking

High-Bandwidth GPU Interconnect

NVIDIA GB300 NVL72

Access NVIDIA GB300 NVL72 on Secure Private Cloud

Frequently Asked Questions

What is the NVIDIA GB300 NVL72?

How does the NVIDIA GB300 NVL72 differ from the NVIDIA GB200 NVL72?

What workloads is the NVIDIA GB300 NVL72 best suited for?

United Kingdom (Head office)

Registered Office

Spain

Solutions

Resources

Site map

Products

Legal

NVIDIA GB300 NVL72
for AI Reasoning at Scale

Unrivalled
Performance in…

Key Features of
NVIDIA GB300 NVL72

Faster AI
Inference

Access NVIDIA GB300 NVL72
on Secure Private Cloud