NVIDIA GB300 NVL72
NVIDIA GB300 NVL72
for AI Reasoning at Scale
Get ultra-scale AI training, reasoning and inference with the NVIDIA GB300 NVL72 in a single liquid-cooled rack.
Now available on Hyperstack Secure Private Cloud.
Unrivalled
Performance in…
AI Reasoning and Inference
37 TB of fast memory and 130 TB/s NVLink bandwidth power real-time inference at a scale previously impossible in a single rack.
Trillion-Parameter LLM Training
1440 FP4 PFLOPS with sparsity accelerates large language model training to new levels of speed and cost efficiency.
High-Density HPC Workloads
2,592 Arm Neoverse V2 CPU cores alongside 20 TB of GPU memory ensure CPU-bound and GPU-bound tasks run in lockstep.
Large-Scale Data Processing
Up to 576 TB/s GPU memory bandwidth eliminates bottlenecks across the largest analytic and simulation pipelines
Key Features of
NVIDIA GB300 NVL72
Faster AI
Inference
The GB300 NVL72 delivers 2x the attention-layer acceleration and 1.5x more AI compute FLOPS than standard Blackwell GPUs, helping you run larger reasoning and inference workloads with higher throughput and lower latency.
Expanded GPU Memory
With 1.5x more HBM3e memory capacity, you can support longer context windows, larger batch sizes, and more demanding AI models while maintaining maximum throughput.
Next-Gen Accelerated Computing
Blackwell Ultra brings major advances in accelerated computing, delivering the performance, efficiency, and scalability needed for next-generation AI and HPC workloads.
High-Speed AI Networking
Each GPU connects through dual NVIDIA ConnectX-8 SuperNICs, providing up to 800 Gb/s of high-speed networking for ultra-efficient AI cluster communication and RDMA performance.
High-Bandwidth GPU Interconnect
Fifth-generation NVIDIA NVLink enables high-bandwidth communication between every GPU in the rack, accelerating large-scale AI reasoning and distributed workloads.
Technical Specifications
NVIDIA GB300 NVL72
Secure NVIDIA GB300 NVL72
on Secure Private Cloud
Reserve guaranteed access to one of the most powerful AI computing systems available on cloud infrastructure.
Our team will discuss deployment type and configuration based on your requirements after you reserve via Secure Private Cloud.
Frequently Asked Questions
Our product support and development go hand in hand to deliver you the best solutions available.
What is the NVIDIA GB300 NVL72?
The NVIDIA GB300 NVL72 is a rack-scale AI computing system built around 72 NVIDIA Blackwell Ultra GPUs and 36 NVIDIA Grace CPUs. It delivers 1440 FP4 PFLOPS of AI compute, 37 TB of fast memory, and 130 TB/s of NVLink bandwidth for AI reasoning, inference, and large-scale training.
How does the NVIDIA GB300 NVL72 differ from the NVIDIA GB200 NVL72?
The NVIDIA GB300 NVL72 uses NVIDIA Blackwell Ultra GPUs, which deliver 2x the attention-layer acceleration and 1.5x more AI compute FLOPS compared to the NVIDIA Blackwell GPUs in the GB200 NVL72. Each GPU also features 288 GB of HBM3e memory, with total rack-level fast memory increasing to 37 TB. At the system level, the GB300 NVL72 delivers up to 50x higher AI factory output versus NVIDIA hopper platforms, combining 10x better user-facing latency and 5x greater throughput per megawatt.
What workloads is the NVIDIA GB300 NVL72 best suited for?
The NVIDIA GB300 NVL72 is optimised for AI reasoning inference, large language model training and inference, high-performance computing, and data-intensive analytics. It is particularly suited for test-time scaling workloads and long-context AI models that require massive memory and compute resources.