Hunyuan3D 2.1

Generate 3D meshes from text or images with Tencent Hunyuan3D 2.1 on Clore.ai

Hunyuan3D 2.1 by Tencent is a two-stage 3D generation model: it first predicts geometry (shape), then synthesizes PBR textures. It accepts both text prompts and reference images as input and outputs production-ready meshes in GLB, OBJ, or PLY format. With over 3 million downloads on HuggingFace, it is one of the most widely adopted open-source 3D generation models.

circle-check

Key Features

  • Text-to-3D and image-to-3D — both input modes in a single model

  • Two-stage pipeline — shape generation followed by PBR texture synthesis

  • High-fidelity output — detailed geometry with albedo, normal, and roughness maps

  • Multiple export formats — GLB, OBJ, PLY

  • Gradio web UI — browser-based interaction, no coding required

  • 16–24 GB VRAM — runs on RTX 3090 and RTX 4090

  • 3M+ downloads on HuggingFace — active community and continuous updates

Requirements

Component
Minimum
Recommended

GPU

RTX 3090 24 GB

RTX 4090 24 GB

VRAM

16 GB

24 GB

RAM

16 GB

32 GB

Disk

50 GB

100 GB

CUDA

11.8

12.1+

Python

3.10

3.11

Clore.ai pricing: RTX 4090 ≈ $0.5–2/day · RTX 3090 ≈ $0.3–1/day

Quick Start

1. Clone and Install

2. Launch the Gradio Web UI

The UI provides:

  • Text input field for text-to-3D generation

  • Image upload for image-to-3D generation

  • Sliders for inference steps, guidance scale, and seed

  • 3D model preview with orbit controls

  • Download buttons for GLB/OBJ/PLY

3. Generate via Python API

4. Add Textures (Stage 2)

Usage Examples

Image-to-3D Generation

Batch Processing

Export to Multiple Formats

Performance Reference

GPU
Shape (30 steps)
Texture (20 steps)
Total

RTX 4090

~20 sec

~15 sec

~35 sec

RTX 3090

~30 sec

~25 sec

~55 sec

A100 40G

~18 sec

~12 sec

~30 sec

Tips

  • Remove image backgrounds before image-to-3D — use rembg for clean segmentation

  • Text prompts benefit from specificity — "a low-poly medieval sword with leather-wrapped handle" produces better results than "sword"

  • Reduce num_inference_steps to 15–20 for faster previews during iteration

  • Increase guidance_scale (8–12) for stronger adherence to the prompt at the cost of diversity

  • GLB is the best export format — it bundles geometry, textures, and materials in a single file

  • Use --share when launching Gradio on Clore.ai for remote browser access

  • Model weights are ~15 GB — ensure sufficient disk space before first run

  • For game assets, generate at high quality then decimate in Blender for LOD levels

Troubleshooting

Problem
Solution

CUDA out of memory

Use RTX 3090+ (24 GB). Reduce batch size or inference steps

Model download stalls

Check disk space. Use huggingface-cli download tencent/Hunyuan3D-2 manually

Gradio UI not reachable

Pass --share flag, or forward port 7860 from Clore.ai dashboard

Poor geometry quality

Increase num_inference_steps to 40+, try different seeds

Texture artifacts

Ensure shape mesh is clean before texture stage

Import errors on fresh install

Run pip install -r requirements.txt again — some deps compile from source

Slow generation on first run

Expected — model compilation and weight loading is cached after first inference

Resources

Last updated

Was this helpful?