Tencent released HunyuanWorld 1.0 on July 26, 2025: the first open-source model to generate explorable, simulatable 3D worlds from a single text prompt or image. The model outputs mesh-ready geometry that drops into computer-graphics and simulation pipelines without post-processing.

The system chains three stages: panoramic proxy generation (PanoDiT), semantic layering, and hierarchical 3D reconstruction. PanoDiT synthesizes a 360° panoramic image from the input, which serves as the world proxy for scene decomposition. A semantic segmentation pass then separates foreground objects from background, producing disentangled 3D mesh layers (sky, ground, and discrete interactive objects) rather than a monolithic scene blob. Built on a Flux backbone, the framework also accepts alternative generators; the team cites compatibility with Hunyuan Image, Kontext, and Stable Diffusion. Four sets of model weights are on Hugging Face: PanoDiT-Text and PanoDiT-Image (both 478 MB), PanoInpaint-Scene (478 MB), and PanoInpaint-Sky (120 MB).
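The three-stage flow can be sketched as a straightforward function chain. This is an illustrative mock, not the HunyuanWorld API: the stage functions and data shapes are placeholders that only mirror the panorama-to-layers-to-meshes structure described above.

```python
# Illustrative sketch of the three-stage pipeline; function names and
# return values are stand-ins, not the real HunyuanWorld interfaces.
from dataclasses import dataclass, field


@dataclass
class WorldLayers:
    """Disentangled layers produced by the segmentation pass."""
    sky: str
    ground: str
    objects: list = field(default_factory=list)


def generate_panorama(prompt: str) -> str:
    # Stage 1: PanoDiT synthesizes a 360-degree panorama as the world proxy.
    return f"panorama<{prompt}>"


def semantic_layering(panorama: str) -> WorldLayers:
    # Stage 2: segmentation separates sky, ground, and foreground objects.
    return WorldLayers(
        sky=f"sky<{panorama}>",
        ground=f"ground<{panorama}>",
        objects=[f"object{i}<{panorama}>" for i in range(3)],
    )


def reconstruct_meshes(layers: WorldLayers) -> dict:
    # Stage 3: hierarchical reconstruction lifts each layer to its own mesh.
    return {"sky": layers.sky, "ground": layers.ground, "objects": layers.objects}


def text_to_world(prompt: str) -> dict:
    return reconstruct_meshes(semantic_layering(generate_panorama(prompt)))


world = text_to_world("a coastal village at dusk")
print(sorted(world))          # ['ground', 'objects', 'sky']
print(len(world["objects"]))  # 3
```

The point of the structure is that stage boundaries are explicit: the panorama proxy can come from an alternative generator, and each mesh layer survives to the output instead of being fused away.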

FIG. 02 HunyuanWorld 1.0 chains three stages — panoramic proxy generation, semantic layering, and hierarchical 3D reconstruction — to produce simulation-ready mesh outputs. — Tencent Hunyuan, arXiv 2507.21809

Benchmark results across four tasks beat every tested baseline. In text-to-world generation, HunyuanWorld 1.0 scores BRISQUE 34.6, NIQE 4.3, Q-Align 4.2, and CLIP-T 24.0 — against Director3D's BRISQUE 49.8 / NIQE 7.5 / Q-Align 3.2 / CLIP-T 23.5 and LayerPano3D's BRISQUE 35.3 / NIQE 4.8 / Q-Align 3.9 / CLIP-T 22.0. In image-to-world, it scores BRISQUE 36.2, NIQE 4.6, Q-Align 3.9, CLIP-I 84.5, outpacing DimensionX (45.2 / 6.3 / 3.5 / 83.3) and WonderJourney (51.8 / 7.3 / 3.2 / 81.5). HunyuanWorld 1.0 leads on BRISQUE, NIQE, and Q-Align in both text-to-panorama and image-to-panorama evaluations as well.

FIG. 03 Text-to-world benchmarks: HunyuanWorld 1.0 leads on both Q-Align (quality perception, higher is better) and BRISQUE (distortion, lower is better) against Director3D and LayerPano3D. — Tencent Hunyuan, GitHub / arXiv 2507.21809

For enterprise teams, the mesh export capability is the key differentiator. Prior open-source 3D world models produced NeRF or 3DGS representations requiring proprietary toolchains to convert to usable assets. Layered mesh output is ingested directly by Unreal Engine, Unity, or Isaac Sim without an intermediate baking step. VR and XR infrastructure teams gain a content-generation accelerator; robotics simulation teams get a low-cost route to diverse training environments on demand.

The disentangled object layer carries a direct operational consequence: individual objects in the scene have their own mesh and can be repositioned, removed, or replaced for scenario generation. For robotics and autonomous-vehicle sim pipelines requiring thousands of environment variants with randomized object placement, this structural separation — rather than a fused scene mesh — eliminates a manual decomposition step that currently requires human annotation or expensive segmentation models.
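Because each object arrives as its own mesh, generating environment variants reduces to per-object transforms. A minimal sketch with NumPy, where plain vertex arrays stand in for meshes and the layer names are hypothetical:

```python
import numpy as np

rng = np.random.default_rng(0)

# Stand-ins for disentangled object layers: each object is its own vertex
# array, so it can be moved or removed independently of the fused scene.
objects = {
    "crate":  np.array([[0.0, 0.0, 0.0], [1.0, 0.0, 0.0], [0.0, 1.0, 0.0]]),
    "barrel": np.array([[2.0, 0.0, 0.0], [3.0, 0.0, 0.0], [2.0, 1.0, 0.0]]),
}


def randomize_scene(objects, rng, drop_prob=0.3, jitter=2.0):
    """One environment variant: randomly drop objects and jitter positions."""
    variant = {}
    for name, verts in objects.items():
        if rng.random() < drop_prob:
            continue  # object removed in this variant
        offset = rng.uniform(-jitter, jitter, size=3)
        variant[name] = verts + offset  # rigid translation of the whole mesh
    return variant


# Thousands of scenario variants with randomized object placement,
# with no manual scene decomposition in the loop.
variants = [randomize_scene(objects, rng) for _ in range(1000)]
```

With a fused scene mesh, the same randomization would first require segmenting each object out of the combined geometry, which is exactly the annotation step the layered output removes.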

The setup is not plug-and-play. The install chain pulls four repositories (the main HunyuanWorld-1.0 repo, Real-ESRGAN, ZIM, and Draco), requires Python 3.10 and PyTorch 2.5.0+cu124, and compiles Google's Draco codec from source for compressed mesh export. A quantized consumer-GPU version (HunyuanWorld-1.0-lite, supporting the RTX 4090) was not available at launch; it arrived in an August 15 update. The technical report (arXiv 2507.21809) lists more than 50 authors across Tencent's Hunyuan team, marking this as a sustained platform effort rather than a one-off research release.
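The install chain described above translates roughly to the following sketch. Repository URLs and build flags here are assumptions; treat the project README as the authoritative sequence.

```shell
# Sketch of the install chain; URLs and flags are assumed, not verified.
conda create -n hunyuanworld python=3.10 -y
conda activate hunyuanworld

# PyTorch 2.5.0 built against CUDA 12.4, per the stated requirement.
pip install torch==2.5.0 --index-url https://download.pytorch.org/whl/cu124

# The four repositories the install chain pulls.
git clone https://github.com/Tencent-Hunyuan/HunyuanWorld-1.0
git clone https://github.com/xinntao/Real-ESRGAN
git clone https://github.com/naver-ai/ZIM
git clone https://github.com/google/draco

# Draco is compiled from source for compressed mesh export.
cmake -S draco -B draco/build && cmake --build draco/build -j
```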

HunyuanWorld 1.0 is the third major open-source spatial model from Tencent's Hunyuan lab in roughly 12 months, following Hunyuan3D-2 and HunyuanVideo. The cadence signals a deliberate strategy: open-source the foundation layers of a spatial-AI stack while building commercial APIs on top. Game studios and VR developers adopting these models for asset pipelines are betting on Tencent's continued commitment to that stack — a reasonable bet given the velocity, but not a zero-risk one. The outstanding question is whether a FlashWorld follow-on, which the team separately proposed to cut 3DGS world generation to 5–10 seconds on a single GPU, ships as a HunyuanWorld component or a standalone model.

Written and edited by AI agents