Wiwynn packs 528 million IOPS into liquid-cooled storage server

Wiwynn has unveiled a 2.9 petabyte Nvidia SCADA storage server at Computex 2026, featuring 96 PCIe 6.0 Micron 9650 Pro SSDs and four RTX Pro 6000 Blackwell GPUs in a liquid-cooled 6RU chassis rated for 9 kW. The flash array is capable of 528 million 4K random-read IOPS.

The system is based on Nvidia's SCADA architecture, which removes the CPU from both data and control paths, allowing GPUs to directly initiate storage operations. This is a stricter split than GPUDirect Storage, where the CPU still controls the control plane. In Wiwynn's system, the Nvidia Vera CPU is present but largely sidelined; the four RTX Pro 6000 cards act as storage processors, managing millions of parallel requests smaller than 4 KB across the 96 E3.S drives and forwarding data to compute hosts over four ConnectX-9 SuperNICs. Broadcom PCIe 6.x switches handle the on-board fabric, and the 2.949 PB of raw capacity comes from 30.72 TB Micron 9650 Pro drives. Nvidia positions the design as tier 3.5 in its "Storage Next" vision, targeting vector search, RAG retrieval, graph analytics, and KV-cache serving where thousands of GPU threads issue fine-grained random reads.

FIG. 02 SCADA removes the CPU from both data and control paths, letting GPUs directly initiate storage operations—unlike traditional GPU Direct Storage where the CPU still owns the control plane.

The 528 million IOPS figure addresses the access pattern that stalls inference pipelines: massive thread count, tiny block size, unpredictable address space. However, sequential throughput is limited by the PCIe switches and NICs, not the NAND, meaning the real ceiling is the ConnectX-9 egress and downstream network. The 9 kW draw for six rack units is aggressive for a storage node, and the six cold-plate modules covering every SSD indicate that air cooling is not an option at this density. Wiwynn and Nvidia have not disclosed p50 or p99 latencies under load, sustained throughput figures, $/IOPS, or pricing, but the bill of materials suggests a seven-figure unit before networking.

As this is a showcase unit with no production workload evidence, architects should treat the peak IOPS number as a lab specification until independent benchmarks show how the system behaves under concurrent RAG or KV-cache eviction patterns. The software stack is another open question. SCADA requires applications to issue GPU-initiated storage commands, a programming model that does not map cleanly to existing GPUDirect Storage code, standard POSIX filesystems, or Kubernetes-based inference serving. Adopting it means new drivers, new failure-handling logic, and custom CUDA I/O paths.

The opportunity cost also deserves scrutiny. Using four RTX Pro 6000 GPUs as I/O orchestrators means dedicating high-end accelerators to storage control instead of model forward passes. This trade-off only pencils out when data-movement stalls already dominate pipeline utilization, and when the alternative is repeatedly idling compute GPUs while a CPU-bound storage control path fetches embeddings or cached activations from flash.

The pattern to steal is offloading storage request scheduling and sub-4 KB flash access directly to GPU-resident threads, but only after measuring current retrieval latency and proving the bottleneck is CPU-control-path bound, not PCIe-fabric bound, because four RTX Pro 6000 GPUs serving as I/O processors are four GPUs not generating tokens.

Sources

Wiwynn's SCADA server stores 2.949 PB using 96 × 30.72 TB Micron 9650 Pro PCIe 6.0 drives
"When equipped with 96 30.72 TB Micron 9650 Pro drives with a PCIe 6.0 interface, the server can store 2.949 PB of data."
tomshardware.com ↗
Wiwynn's SCADA server achieves 528 million 4K random-read IOPS
"Wiwynn claims an aggregated random read speed of 528 million 4K IOPS"
tomshardware.com ↗
SCADA removes the CPU from both data and control paths; GPUs directly initiate storage operations — unlike GPUDirect Storage where CPU still owns the control plane
"Even in advanced solutions like GPUDirect Storage, which allows data to be transferred directly from SSDs to GPUs, the CPU still owns the control path and can become a bottleneck."
tomshardware.com ↗
System uses Nvidia Vera CPU, 4× RTX Pro 6000 Blackwell GPUs, 4× PCIe 6.x switches, 4× ConnectX-9 SuperNICs
"The machine is based on Nvidia's Vera CPU, four RTX Pro 6000 Blackwell graphics cards, four PCIe 6.x switches, and four ConnectX-9 SuperNIC cards."
tomshardware.com ↗
6RU chassis, Nvidia MGX rack-compliant, max 9 kW power, fully liquid cooled with six cold plate modules
"Wiwynn's SCADA server is an Nvidia MGX rack-compliant system in an 6RU form-actor that has a maximum power consumption of 9 kW."
tomshardware.com ↗
Sequential throughput is limited by PCIe switches and NICs, not the NAND drives
"sequential read/write speeds limited by the performance of PCIe switches and/or network cards rather than the drives themselves"
tomshardware.com ↗
Nvidia positions SCADA as tier 3.5 storage — behind local NVMe but ahead of remote HDD-based tier 4
"Nvidia clearly positions SCADA as tier 3.5 storage servers located behind local SSDs, but ahead of tier 4 remote storage servers that often rely on hard drives."
tomshardware.com ↗
RTX Pro 6000 GPUs act as storage processors handling millions of small requests and forwarding to compute servers via ConnectX-9
"its RTX 6000 Pro GPUs act more like very sophisticated storage processors that initiate and handle storage transactions, millions of small storage requests on behalf of AI applications, and pass them to the compute server via the ConnectX-9 cards"
tomshardware.com ↗
SCADA is part of Nvidia's Storage Next vision, making storage behave like an extension of GPU memory
"SCADA is a part of Nvidia's Storage Next vision, which is a collection of technologies aimed to make storage behave more like an extension of GPU memory for AI workloads."
tomshardware.com ↗

Written and edited by AI agents · Methodology

Wiwynn packs 528 million IOPS into liquid-cooled storage server

Get the signal before the noise.

Get the signal before the noise.