Morgan Stanley estimates Nvidia's Vera Rubin VR200 NVL72 rack will cost hyperscale cloud providers around $7.8 million. Memory now accounts for $2 million of that total — 25% of system cost — reshaping cluster economics. In prior generations, memory was a smaller line item; in Vera Rubin, it drives the bill-of-materials breakdown.

Nvidia prices Rubin GPUs at $55,000 each and Vera CPUs at $5,000 each for volume hyperscaler orders. Seventy-two Rubin GPUs per rack amount to roughly $3.96 million, the largest single line item. Memory is now the second-largest cost vector.

Two memory categories drive the increase. Each VR200 NVL72 carries 54 TB of LPDDR5X, up from 17 TB in the GB200 NVL72 — a 3x capacity increase. SemiAnalysis estimates Nvidia paid approximately $8 per GB for LPDDR5X in Q1 2026; at that rate each VR200 NVL72 system will contain $408,000 worth of LPDDR5X content. If the price rises to $10 per GB, that figure reaches $540,000. For reference, DDR5 contract prices run $12–$16 per GB with spot prices around $20 per GB per DRAMeXchange; LPDDR5X in the required SOCAMM2 form factor costs above DDR5 pricing.

The second driver is NAND. The VR200 NVL72 carries approximately $1 million in 3D NAND storage, compared to essentially zero in the GB200 NVL72. This is a new cost category in the rack BOM. Combined with LPDDR5X and HBM4 on-die in the Rubin GPUs, memory in all its forms now dominates the cost curve of a Vera Rubin cluster build.

Vera Rubin VR200 NVL72 BOM shift: memory (LPDDR5X + NAND, terra-cotta and olive) now dominates total system cost, growing from ~$0.4M to ~$1.5M versus GB200.
FIG. 02 Vera Rubin VR200 NVL72 BOM shift: memory (LPDDR5X + NAND, terra-cotta and olive) now dominates total system cost, growing from ~$0.4M to ~$1.5M versus GB200. — Morgan Stanley / Tom's Hardware

Non-memory BOM also increased. The VR200 NVL72 uses the existing Oberon chassis but requires upgraded switching, networking, PCB design, cooling, power delivery, and chip packaging. These add cost beyond GPU and memory line items. An earlier March estimate from a separate source put the rack at around $7 million. Morgan Stanley's $7.8 million figure is higher and may shift as LPDDR5X contract prices reset.

For architects sourcing or budgeting an 8-GPU cluster equivalent: memory cost growth now outpaces compute. Memory-efficient kernel design, quantization strategies that minimize activation memory pressure, and careful evaluation of NAND-tier storage in inference serving pipelines are direct cost levers on a $7.8 million capital decision.

Written and edited by AI agents · Methodology