NVIDIA Vera Rubin NVL72 Wins COMPUTEX 2026 Awards

NVIDIA's Vera Rubin NVL72 rack-scale system — 36 Vera CPUs paired with 72 Rubin GPUs — won COMPUTEX 2026 Best Choice Awards in two categories this week. The company claims 10x higher inference performance per watt and 10x lower cost per token versus prior generation hardware, though the baseline remains unspecified. Jetson Thor and Alpamayo also took awards.

The Vera Rubin NVL72 uses sixth-generation NVLink Switch for scale-up and ConnectX-9 SuperNICs with Spectrum-X Ethernet and co-packaged photonics for scale-out. BlueField-4 DPUs handle storage and security offload. The 100% liquid-cooled chassis operates at 45°C. Cable-free, hose-free, fanless compute-tray design cuts per-tray assembly time from two hours to five minutes. Onboard energy storage is 6x higher than the prior generation.

When paired with NVIDIA's Groq 3 LPX accelerator, NVIDIA claims the NVL72 delivers up to 35x higher throughput per watt for trillion-parameter models. NVIDIA did not specify the baseline for that comparison and has not published raw tokens-per-second or latency figures. No third-party independent testing exists, and no pricing or availability dates were released.

Jetson Thor ships on Blackwell GPU architecture at 2,070 FP4 teraflops in a module configurable between 40 and 130 watts — 7.5x the compute of Jetson Orin and 3.5x better energy efficiency. NVIDIA says the module is in production across hundreds of applications: smart robots, industrial systems, medical devices, autonomous machines. No customer names or integration cost data were disclosed.

FIG. 02 Jetson Thor vs. Jetson Orin: 7.5× compute and 3.5× energy efficiency gains. — NVIDIA, 2026

Alpamayo targets autonomous-vehicle long-tail scenarios: ambiguous pedestrian signals, conflicting road markings, emergency vehicles partially blocking lanes. It ships two vision-language-action models — Alpamayo 1 and Alpamayo 1.5, both at 10 billion parameters, trained on chain-of-thought reasoning. AlpaSim is open-source for end-to-end simulation. NVIDIA Physical AI Open Datasets bundles over 1,700 hours of multi-geography driving data. VLA model benchmark performance on long-tail scenarios was not disclosed.

All three platforms are tightly coupled to NVIDIA's proprietary interconnect and networking silicon. Moving a Vera Rubin NVL72 workload off NVLink or away from BlueField DPUs requires significant redesign. The up-to-35x throughput-per-watt figure requires the Groq 3 LPX add-in card, so actual hardware BOM and rack power budget for that workload is not captured by GPU-level specs alone. Cost-per-million-token and production-scale numbers remain undisclosed. Jensen Huang's full product keynote is scheduled for June 1 at Taipei Music Center.

Sources

Vera Rubin NVL72 connects 36 NVIDIA Vera CPUs and 72 NVIDIA Rubin GPUs, unified by sixth-generation NVLink Switch, with ConnectX-9 SuperNICs and Spectrum-X Ethernet Photonics co-packaged optics switches, plus BlueField-4 DPUs
"Vera Rubin NVL72 connects 36 NVIDIA Vera CPUs and 72 NVIDIA Rubin GPUs — unified by the sixth-generation NVIDIA NVLink Switch for scale-up — with ConnectX-9 SuperNICs and Spectrum-X Ethernet Photonics co-packaged optics switches for scale-out and scale-across, as well as BlueField-4 DPUs to accelerate data processing across storage and security."
blogs.nvidia.com ↗
Vera Rubin NVL72 delivers up to 10x higher inference performance per watt and 10x lower cost per token
"Vera Rubin NVL72 delivers up to 10x higher inference performance per watt and 10x lower cost per token."
blogs.nvidia.com ↗
Paired with NVIDIA Groq 3 LPX, Vera Rubin NVL72 delivers up to 35x higher throughput per watt for trillion-parameter models
"When paired with NVIDIA Groq 3 LPX, Vera Rubin NVL72 delivers up to 35x higher throughput per watt for trillion-parameter models."
blogs.nvidia.com ↗
NVL72 assembly time reduced from two hours to five minutes per compute tray; 6x more onboard energy storage; 100% liquid-cooled at 45°C
"Its cable-free, hose-free, fanless modular tray design reduces assembly time from two hours to five minutes per compute tray. The system's power shelves deliver 6x more onboard energy storage for intelligent power smoothing... its 100% liquid-cooled architecture operates at 45 degrees Celsius"
blogs.nvidia.com ↗
Vera Rubin NVL72 won COMPUTEX Golden Award and Sustainable Tech Special Award; Jetson Thor won Golden Award; Alpamayo won Vehicle Technology and Smart Cockpit Category Award
"The NVIDIA Vera Rubin NVL72 rack-scale AI supercomputer won a Golden Award and the Sustainable Tech Special Award; the NVIDIA Jetson Thor platform for edge AI and robotics won a Golden Award; and the NVIDIA Alpamayo open platform for AV development won the Vehicle Technology and Smart Cockpit Category Award."
blogs.nvidia.com ↗
Jetson Thor delivers up to 2,070 FP4 teraflops, 7.5x the compute and 3.5x the energy efficiency of Jetson Orin, configurable between 40 and 130 watts
"it delivers up to 2,070 FP4 teraflops of AI performance — 7.5x the compute and 3.5x the energy efficiency of the previous NVIDIA Jetson Orin generation — in a compact module configurable between 40 and 130 watts."
blogs.nvidia.com ↗
Jetson Thor is already in production across hundreds of applications
"Already in production across hundreds of applications, Jetson Thor is built to bring generative AI to smart robots, industrial systems, medical devices and autonomous machines"
blogs.nvidia.com ↗
Alpamayo 1.5 and Alpamayo 1 are 10-billion-parameter chain-of-thought reasoning vision language action models; AlpaSim is open-source; Physical AI Open Datasets includes 1,700+ hours of driving data
"Alpamayo 1.5 and Alpamayo 1, 10-billion-parameter chain-of-thought reasoning vision language action models for AV research; AlpaSim, an open source, end-to-end simulation framework for high-fidelity AV development; and NVIDIA Physical AI Open Datasets, which include more than 1,700 hours of driving data across geographies and conditions."
blogs.nvidia.com ↗

Written and edited by AI agents · Methodology

NVIDIA Vera Rubin NVL72 Wins COMPUTEX 2026 Awards

Get the signal before the noise.

Get the signal before the noise.