<img alt="" src="https://secure.insightful-enterprise-intelligence.com/783141.png" style="display:none;">

Meet Hyperstack at RAISE 2026, 8th-9th July · Booth #14A · Scale your AI infrastructure with us.

Catch Hyperstack at ISC 2026, 22nd-26th June · Booth #A39 · Let's talk GPU-accelerated workloads

Reserve early access to NVIDIA B300s — arriving Q3/Q4

alert

We’ve been made aware of a fraudulent website impersonating Hyperstack at hyperstack.my.
This domain is not affiliated with Hyperstack or NexGen Cloud.

If you’ve been approached or interacted with this site, please contact our team immediately at support@hyperstack.cloud.

close
NVIDIA GB300 NVL72
nvidia-gb300-nvl72-for-ai-reasoning-at-scale
NVIDIA GB300 NVL72

NVIDIA GB300 NVL72
for AI Reasoning at Scale

Get ultra-scale AI training, reasoning and inference with the NVIDIA GB300 NVL72 in a single liquid-cooled rack.

Now available on Hyperstack Secure Private Cloud.

product-banner-curve

Unrivalled
Performance in…

tick_check

AI Reasoning and Inference

37 TB of fast memory and 130 TB/s NVLink bandwidth power real-time inference at a scale previously impossible in a single rack.

tick_check

Trillion-Parameter LLM Training

1440 FP4 PFLOPS with sparsity accelerates large language model training to new levels of speed and cost efficiency.

tick_check

High-Density HPC Workloads

2,592 Arm Neoverse V2 CPU cores alongside 20 TB of GPU memory ensure CPU-bound and GPU-bound tasks run in lockstep.

tick_check

Large-Scale Data Processing

Up to 576 TB/s GPU memory bandwidth eliminates bottlenecks across the largest analytic and simulation pipelines

Key Features of
NVIDIA GB300 NVL72

key-features-of-nvidia-gb300-nvl72

Faster AI
Inference

The GB300 NVL72 delivers 2x the attention-layer acceleration and 1.5x more AI compute FLOPS than standard Blackwell GPUs, helping you run larger reasoning and inference workloads with higher throughput and lower latency.

key-features-of-nvidia-gb300-nvl72

Expanded GPU Memory

With 1.5x more HBM3e memory capacity, you can support longer context windows, larger batch sizes, and more demanding AI models while maintaining maximum throughput.

key-features-of-nvidia-gb300-nvl72

Next-Gen Accelerated Computing

Blackwell Ultra brings major advances in accelerated computing, delivering the performance, efficiency, and scalability needed for next-generation AI and HPC workloads.

key-features-of-nvidia-gb300-nvl72

High-Speed AI Networking

Each GPU connects through dual NVIDIA ConnectX-8 SuperNICs, providing up to 800 Gb/s of high-speed networking for ultra-efficient AI cluster communication and RDMA performance.

key-features-of-nvidia-gb300-nvl72

High-Bandwidth GPU Interconnect

Fifth-generation NVIDIA NVLink enables high-bandwidth communication between every GPU in the rack, accelerating large-scale AI reasoning and distributed workloads.

Technical Specifications

NVIDIA GB300 NVL72

GPU
NVIDIA GB300 NVL72
Configuration
72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs
NVLink Bandwidth
130 TB/s
Fast Memory
37 TB
GPU Memory / Bandwidth
20 TB / Up to 576 TB/s
CPU Memory / Bandwidth
17 TB LPDDR5X / 14 TB/s
CPU Core Count
2,592 Arm Neoverse V2 cores
CPU Core Count
2,592 Arm Neoverse V2 cores
FP4 Tensor Core
1440 / 1080 PFLOPS
FP8/FP6 Tensor Core
720 PFLOPS
INT8 Tensor Core
24 POPS
FP16/BF16 Tensor Core
360 PFLOPS
TF32 Tensor Core
180 PFLOPS
FP32
6 PFLOPS
FP64 / FP64 Tensor Core
100 TFLOPS
GPU
NVIDIA GB300 NVL72
Configuration
72 NVIDIA Blackwell Ultra GPUs, 36 NVIDIA Grace CPUs
NVLink Bandwidth
130 TB/s
Fast Memory
37 TB
GPU Memory / Bandwidth
20 TB / Up to 576 TB/s
CPU Memory / Bandwidth
17 TB LPDDR5X / 14 TB/s
CPU Core Count
2,592 Arm Neoverse V2 cores
CPU Core Count
2,592 Arm Neoverse V2 cores
FP4 Tensor Core
1440 / 1080 PFLOPS
FP8/FP6 Tensor Core
720 PFLOPS
INT8 Tensor Core
24 POPS
FP16/BF16 Tensor Core
360 PFLOPS
TF32 Tensor Core
180 PFLOPS
FP32
6 PFLOPS
FP64 / FP64 Tensor Core
100 TFLOPS

Secure NVIDIA GB300 NVL72
on Secure Private Cloud

Reserve guaranteed access to one of the most powerful AI computing systems available on cloud infrastructure.

Our team will discuss deployment type and configuration based on your requirements after you reserve via Secure Private Cloud.

Frequently Asked Questions

Our product support and development go hand in hand to deliver you the best solutions available.

What is the NVIDIA GB300 NVL72?

The NVIDIA GB300 NVL72 is a rack-scale AI computing system built around 72 NVIDIA Blackwell Ultra GPUs and 36 NVIDIA Grace CPUs. It delivers 1440 FP4 PFLOPS of AI compute, 37 TB of fast memory, and 130 TB/s of NVLink bandwidth for AI reasoning, inference, and large-scale training.

How does the NVIDIA GB300 NVL72 differ from the NVIDIA GB200 NVL72?

The NVIDIA GB300 NVL72 uses NVIDIA Blackwell Ultra GPUs, which deliver 2x the attention-layer acceleration and 1.5x more AI compute FLOPS compared to the NVIDIA Blackwell GPUs in the GB200 NVL72. Each GPU also features 288 GB of HBM3e memory, with total rack-level fast memory increasing to 37 TB. At the system level, the GB300 NVL72 delivers up to 50x higher AI factory output versus NVIDIA hopper platforms, combining 10x better user-facing latency and 5x greater throughput per megawatt.

What workloads is the NVIDIA GB300 NVL72 best suited for?

The NVIDIA GB300 NVL72 is optimised for AI reasoning inference, large language model training and inference, high-performance computing, and data-intensive analytics. It is particularly suited for test-time scaling workloads and long-context AI models that require massive memory and compute resources.