For the complete documentation index, see llms.txt. This page is also available as Markdown.

SadTalker

Create talking head videos from audio and images on Clore.ai

Animate faces with audio to create realistic talking head videos.

Renting on CLORE.AI

  1. Filter by GPU type, VRAM, and price

  2. Choose On-Demand (fixed rate) or Spot (bid price)

  3. Configure your order:

    • Select Docker image

    • Set ports (TCP for SSH, HTTP for web UIs)

    • Add environment variables if needed

    • Enter startup command

  4. Select payment: CLORE, BTC, or USDT/USDC

  5. Create order and wait for deployment

Access Your Server

  • Find connection details in My Orders

  • Web interfaces: Use the HTTP port URL

  • SSH: ssh -p <port> root@<proxy-address>

What is SadTalker?

SadTalker generates talking videos:

  • Lip-sync from any audio

  • Natural head movements

  • Works with single image

  • Expression control

Requirements

Mode
VRAM
Recommended

Basic

4GB

RTX 3060

High Quality

6GB

RTX 3080

Full Face

8GB

RTX 4080

Quick Deploy

Docker Image:

Ports:

Command:

Accessing Your Service

After deployment, find your http_pub URL in My Orders:

  1. Go to My Orders page

  2. Click on your order

  3. Find the http_pub URL (e.g., abc123.clorecloud.net)

Use https://YOUR_HTTP_PUB_URL instead of localhost in examples below.

Installation

Basic Usage

Command Line

Python API

With Face Enhancement

Parameters

Pose Styles

Range
Effect

0-5

Subtle movements

6-20

Normal movements

21-46

Expressive movements

Batch Processing

Gradio Interface

API Server

Text-to-Speech + SadTalker

Complete pipeline:

Expression Control

Quality Settings

Setting
Speed
Quality

No enhancer, 256px

Fast

Basic

GFPGAN, 256px

Medium

Good

GFPGAN, 512px

Slow

Better

RealESRGAN, 512px

Slowest

Best

Preprocessing Options

Troubleshooting

Face Not Detected

  • Use clear, frontal face image

  • Good lighting

  • Avoid occlusions (glasses, hair)

Audio Sync Issues

  • Use 16kHz WAV files

  • Avoid background music

  • Clear speech only

Choppy Movement

  • Increase expression_scale slightly

  • Try different pose_style

  • Use longer audio

Out of Memory

  • Reduce output size

  • Disable enhancer

  • Use crop preprocessing

Performance

Resolution
GPU
Time (10s video)

256px

RTX 3060

~30s

256px

RTX 4090

~15s

512px + GFPGAN

RTX 4090

~45s

Cost Estimate

Typical CLORE.AI marketplace rates (as of 2024):

GPU
Hourly Rate
Daily Rate
4-Hour Session

RTX 3060

~$0.03

~$0.70

~$0.12

RTX 3090

~$0.06

~$1.50

~$0.25

RTX 4090

~$0.10

~$2.30

~$0.40

A100 40GB

~$0.17

~$4.00

~$0.70

A100 80GB

~$0.25

~$6.00

~$1.00

Prices vary by provider and demand. Check CLORE.AI Marketplace for current rates.

Save money:

  • Use Spot market for flexible workloads (often 30-50% cheaper)

  • Pay with CLORE tokens

  • Compare prices across different providers

Next Steps

Last updated

Was this helpful?