
NVIDIA HGX B300

Accelerate Large-scale AI with NVIDIA HGX B300

Get next-generation performance for large-scale AI training, inference and reasoning with the NVIDIA HGX B300. Powered by Blackwell Ultra GPUs to accelerate enterprise AI.

Now available on Hyperstack. Starting from $3.50/hour.


Unrivalled Performance in:


Generative AI and LLM Development

Accelerate LLM development with up to 144 PFLOPS FP4 Tensor Core performance.


AI Training at Hyperscale Efficiency

Achieve high model throughput with 72 PFLOPS of FP8/FP6 performance, ultra-fast 14.4 TB/s NVLink bandwidth and high-density compute.


High-Speed AI Inference for Production Deployment

Deploy real-time inference at scale using the Blackwell Ultra architecture, delivering up to 144 PFLOPS sparse FP4 for advanced reasoning tasks.

Key Features of the
NVIDIA HGX B300


Blackwell Ultra Architecture

The NVIDIA B300 GPU is built on the Blackwell Ultra architecture, so you can handle real-time AI reasoning, massive parallelism and dense compute throughput across FP4, FP6 and FP8.


Massive 2.3 TB GPU Memory Space

Eight Blackwell Ultra GPUs deliver a combined 2.3 TB of ultra-high-bandwidth HBM3e, providing 64 TB/s of aggregate bandwidth to keep your largest models fully in memory.


NVLink Switch System for Extreme Scaling

The dual NVIDIA NVLink Switch System provides 14.4 TB/s of aggregate GPU interconnect bandwidth for near-linear scaling across all eight GPUs.
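As a back-of-envelope illustration of what 14.4 TB/s of aggregate bandwidth means, the sketch below estimates the idealised time to move a tensor across the NVLink fabric. The bandwidth figure comes from the spec above; real transfers see lower effective bandwidth, so treat this as an upper bound on speed.

```python
# Back-of-envelope: idealised time to move a tensor over the NVLink fabric,
# assuming the full 14.4 TB/s aggregate bandwidth is achievable
# (real transfers achieve lower effective bandwidth).

AGGREGATE_BW_TBS = 14.4  # aggregate NVLink bandwidth, TB/s

def transfer_time_ms(size_gb: float, bandwidth_tbs: float = AGGREGATE_BW_TBS) -> float:
    """Idealised transfer time in milliseconds for `size_gb` gigabytes."""
    return size_gb / (bandwidth_tbs * 1000) * 1000  # GB / (GB/s) -> s -> ms

# e.g. the FP8 weights of a 70B-parameter model (~70 GB):
print(f"{transfer_time_ms(70):.2f} ms")  # ~4.86 ms in the ideal case
```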

Technical Specifications

NVIDIA HGX B300 Specs

  • GPU: NVIDIA HGX B300
  • Total GPU Memory: 2.3 TB HBM3e
  • Memory Bandwidth: 64 TB/s (aggregate)
  • FP4 Tensor Core: 144 PFLOPS (sparse) / 108 PFLOPS (dense)
  • FP8/FP6 Tensor Core: 72 PFLOPS
  • NVLink Bandwidth: 14.4 TB/s (aggregate)
  • System Memory: 2 TB (configurable to 4 TB)
  • CPU: Dual Intel Xeon 6776P
  • Power Consumption: 14.5 kW (busbar) / 15.1 kW (PSU)
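To put the 2.3 TB memory figure in context, the sketch below estimates how many model parameters fit in aggregate HBM3e at different weight precisions. This is an illustrative upper bound only: real deployments also need memory for the KV cache, activations and framework overhead.

```python
# Rough upper bound: how many parameters fit in 2.3 TB of aggregate GPU
# memory at different weight precisions. Illustrative only -- real workloads
# also need room for KV cache, activations and framework overhead.

TOTAL_MEMORY_TB = 2.3
BYTES_PER_TB = 1e12

BYTES_PER_PARAM = {
    "FP16/BF16": 2.0,
    "FP8": 1.0,
    "FP4": 0.5,
}

def max_params_billions(memory_tb: float, bytes_per_param: float) -> float:
    """Upper bound on parameter count (in billions) that fits in memory."""
    return memory_tb * BYTES_PER_TB / bytes_per_param / 1e9

for precision, nbytes in BYTES_PER_PARAM.items():
    print(f"{precision}: ~{max_params_billions(TOTAL_MEMORY_TB, nbytes):,.0f}B params")
```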

Access
Enterprise-Level Performance
When You Need It

Deploy the NVIDIA B300 GPU instantly on Hyperstack’s high-performance cloud environment.

Starting from $3.50/hour.

Frequently Asked Questions

Our product support and development teams work hand in hand to deliver the best solutions available.

What is the NVIDIA B300 GPU?

The NVIDIA B300 GPU is part of the Blackwell Ultra generation, delivering extreme FP4, FP6 and FP8 performance for LLMs, generative AI and real-time reasoning workloads.

How much will the NVIDIA B300 cost?

On Hyperstack, you can reserve the NVIDIA HGX B300, starting from $3.50/hour.
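For budgeting, a node's cost is simply the hourly rate multiplied by hours used. A minimal sketch, assuming the advertised $3.50/hour starting rate (actual pricing depends on configuration and reservation term):

```python
# Estimate the total cost of running one HGX B300 node at an hourly rate.
# Assumes the advertised $3.50/hour starting price; actual rates depend
# on configuration and reservation term.

HOURLY_RATE_USD = 3.50

def reservation_cost(hours: float, rate: float = HOURLY_RATE_USD) -> float:
    """Total cost in USD for `hours` of continuous use at `rate` per hour."""
    return hours * rate

# e.g. one week of continuous use:
print(f"One week: ${reservation_cost(24 * 7):,.2f}")  # 168 h * $3.50 = $588.00
```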

What are the key specs of the NVIDIA B300 GPU?

The key specifications of NVIDIA B300 include:

  • Blackwell Ultra architecture
  • Up to 144 PFLOPS FP4
  • Up to 72 PFLOPS FP8/FP6
  • HBM3e memory, up to 2.3 TB in HGX configuration
  • NVLink Switch System with 14.4 TB/s bandwidth

How does the NVIDIA B300 compare to the NVIDIA H100 and NVIDIA H200?

The NVIDIA B300 significantly outperforms H100/H200 in FP4 and FP8 compute, offers greater memory bandwidth and delivers faster attention inference. This makes it ideal for next-generation LLMs and reasoning workloads.

What chip architecture does the NVIDIA B300 use?

The NVIDIA B300 uses the NVIDIA Blackwell Ultra architecture, which is ideal for large-scale LLM training and real-time inference.

What workloads is the NVIDIA B300 best suited for?

The NVIDIA HGX B300 is ideal for demanding workloads such as:

  • Large language model training and inference
  • High-throughput generative AI workloads
  • Reinforcement learning and simulation
  • RAG systems and complex reasoning

Which cloud providers will offer NVIDIA B300 GPUs?

Hyperstack offers NVIDIA HGX B300 GPUs with on-demand access and reservation options for long-term workloads.