Stable Audio

Generate music and sound effects with Stability AI's Stable Audio on CLORE.AI GPUs.

Why Stable Audio?

  • High quality - 44.1kHz stereo audio generation

  • Variable length - Up to 47 seconds with Stable Audio Open, up to 3 minutes with Stable Audio 2.0

  • Versatile - Music, sound effects, ambient sounds

  • Text-to-audio - Describe what you want to hear

  • Open weights - Stable Audio Open available

Model Variants

| Model | Duration | Quality | VRAM | License |
|---|---|---|---|---|
| Stable Audio Open | 47 sec | Good | 8GB | Open |
| Stable Audio 2.0 | 3 min | Excellent | 12GB | Commercial |

Quick Deploy on CLORE.AI

Docker Image:

pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel

Ports:

Command:

Accessing Your Service

After deployment, find your http_pub URL in My Orders:

  1. Go to the My Orders page

  2. Click on your order

  3. Find the http_pub URL (e.g., abc123.clorecloud.net)

Use https://YOUR_HTTP_PUB_URL instead of localhost in examples below.

Hardware Requirements

| Model | Minimum GPU | Recommended GPU |
|---|---|---|
| Stable Audio Open | RTX 3070 8GB | RTX 3090 24GB |
| Stable Audio 2.0 | RTX 3060 12GB | RTX 4090 24GB |

Installation
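
A minimal pre-flight check, assuming the stable-audio-tools workflow: the PyPI package `stable-audio-tools` (imported as `stable_audio_tools`) alongside `torch`, `torchaudio`, and `einops`. The package list and the helper name `missing_packages` are assumptions made for illustration.

```python
from importlib.util import find_spec

# Import names the sketches in this guide rely on (an assumption; adjust as needed).
REQUIRED = ("torch", "torchaudio", "einops", "stable_audio_tools")

def missing_packages(names=REQUIRED):
    """Return the import names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]

missing = missing_packages()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All required packages are importable.")
```

On a fresh container, `pip install stable-audio-tools` should pull in most of these as dependencies. The PyTorch image listed under Quick Deploy already ships `torch`.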

Basic Usage

Text to Music
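
A minimal sketch of text-to-music generation with Stable Audio Open, assuming the `stable-audio-tools` package and the `stabilityai/stable-audio-open-1.0` checkpoint on Hugging Face. The function names `make_conditioning` and `text_to_audio`, the default values, and the output path are illustrative choices. The sampler settings follow the pattern published with the model and may need tuning.

```python
def make_conditioning(prompt, seconds_total, seconds_start=0):
    """Build the text-and-timing conditioning list for a single clip."""
    return [{
        "prompt": prompt,
        "seconds_start": seconds_start,
        "seconds_total": seconds_total,
    }]

def text_to_audio(prompt, seconds_total=30, steps=100, cfg_scale=7.0,
                  out_path="output.wav"):
    """Generate one clip with Stable Audio Open and save it as a WAV file."""
    # Heavy imports live here so the helper above works without a GPU stack.
    import torch
    import torchaudio
    from einops import rearrange
    from stable_audio_tools import get_pretrained_model
    from stable_audio_tools.inference.generation import generate_diffusion_cond

    device = "cuda" if torch.cuda.is_available() else "cpu"
    model, config = get_pretrained_model("stabilityai/stable-audio-open-1.0")
    model = model.to(device)

    audio = generate_diffusion_cond(
        model,
        steps=steps,
        cfg_scale=cfg_scale,
        conditioning=make_conditioning(prompt, seconds_total),
        sample_size=config["sample_size"],
        sigma_min=0.3,
        sigma_max=500,
        sampler_type="dpmpp-3m-sde",
        device=device,
    )

    # Merge the batch dimension, peak-normalize, and convert to 16-bit PCM.
    audio = rearrange(audio, "b d n -> d (b n)").to(torch.float32)
    audio = audio.div(audio.abs().max()).clamp(-1, 1).mul(32767).to(torch.int16).cpu()
    torchaudio.save(out_path, audio, config["sample_rate"])
    return out_path
```

Calling `text_to_audio("128 BPM tech house drum loop", seconds_total=30)` should write `output.wav`. Only the prompt changes between music, sound effects, and ambient clips, so the same function can serve all three. The checkpoint is gated on Hugging Face, so accept its license and authenticate (for example with `huggingface-cli login`) before the first download.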

Sound Effects

Ambient Sounds

Prompt Examples

Music Genres

Sound Effects

Ambient/Background

Advanced Options

Controlling Generation
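
One way to keep the generation controls together is a small settings object. This sketch assumes the keyword names accepted by `generate_diffusion_cond` in `stable-audio-tools` (`steps`, `cfg_scale`, `seed`, `sampler_type`, `sigma_min`, `sigma_max`). The class name `GenerationSettings` and its defaults are illustrative.

```python
from dataclasses import dataclass, asdict

@dataclass(frozen=True)
class GenerationSettings:
    steps: int = 100                     # more steps: slower, often cleaner
    cfg_scale: float = 7.0               # how strongly the prompt steers the output
    seed: int = -1                       # -1 lets the library pick a random seed
    sampler_type: str = "dpmpp-3m-sde"
    sigma_min: float = 0.3
    sigma_max: float = 500.0

    def __post_init__(self):
        # Reject values that cannot produce a valid sampling run.
        if self.steps < 1:
            raise ValueError("steps must be at least 1")
        if self.cfg_scale <= 0:
            raise ValueError("cfg_scale must be positive")

    def to_kwargs(self):
        """Return the settings as keyword arguments for the generation call."""
        return asdict(self)

settings = GenerationSettings(steps=150, cfg_scale=6.0, seed=42)
print(settings.to_kwargs())
```

The resulting dictionary can be unpacked into the generation call, for example `generate_diffusion_cond(model, conditioning=..., sample_size=..., device=..., **settings.to_kwargs())`. A fixed seed makes a result reproducible, which helps when comparing different step counts or CFG values.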

Variable Length
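
Stable Audio Open generates a fixed-size window and the requested duration is passed as conditioning, so shorter clips are usually trimmed after generation. The sketch below computes how many samples to keep. The constants are assumptions (44.1 kHz and a 2,097,152-sample window, about 47.5 seconds); in practice, read `sample_rate` and `sample_size` from the model config.

```python
SAMPLE_RATE = 44_100      # assumed; read model_config["sample_rate"] in practice
SAMPLE_SIZE = 2_097_152   # assumed; read model_config["sample_size"] in practice

def samples_to_keep(seconds, sample_rate=SAMPLE_RATE, sample_size=SAMPLE_SIZE):
    """Number of leading samples to keep for a clip of the requested length."""
    if seconds <= 0:
        raise ValueError("seconds must be positive")
    # Never ask for more samples than the model window contains.
    return min(int(seconds * sample_rate), sample_size)

for seconds in (5, 10, 30, 47, 60):
    kept = samples_to_keep(seconds)
    print(f"{seconds:>2}s requested -> keep {kept} samples ({kept / SAMPLE_RATE:.2f}s)")
```

With a tensor shaped `(channels, samples)`, trimming is then `audio = audio[..., :samples_to_keep(seconds_total)]`. Requests longer than the window are capped at the window length.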

Batch Generation
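
A minimal sketch of batch planning: expand a list of prompts and seeds into one job per output file, then run the jobs one after another against a model that is loaded once. The helper name `plan_batch` and the file naming scheme are assumptions for illustration.

```python
from itertools import product

def plan_batch(prompts, seeds, seconds_total=30):
    """Expand prompts x seeds into one job description per output file."""
    jobs = []
    for (index, prompt), seed in product(enumerate(prompts), seeds):
        jobs.append({
            "conditioning": [{
                "prompt": prompt,
                "seconds_start": 0,
                "seconds_total": seconds_total,
            }],
            "seed": seed,
            "out_path": f"clip_{index:02d}_seed{seed}.wav",
        })
    return jobs

jobs = plan_batch(
    ["warm analog synth pad", "heavy rain on a tin roof"],
    seeds=[1, 2, 3],
)
for job in jobs:
    print(job["out_path"], "<-", job["conditioning"][0]["prompt"])
```

Each job carries the `conditioning` list and `seed` that a `generate_diffusion_cond` call expects. Running jobs sequentially keeps VRAM use flat, at the price of total wall-clock time.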

Gradio Web Interface
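
A minimal sketch of a Gradio front end, assuming the `gradio` package and a generation function of your own with the signature `generate_fn(prompt, seconds, steps, cfg_scale)` that returns the path of a WAV file. The name `build_demo`, the slider ranges, and port 7860 are assumptions; use whichever port you exposed when creating the order.

```python
# Bind to all interfaces so the app is reachable through the http_pub URL.
LAUNCH_KWARGS = {"server_name": "0.0.0.0", "server_port": 7860}  # port is an assumption

def build_demo(generate_fn):
    """Wrap generate_fn(prompt, seconds, steps, cfg_scale) -> wav path in a web UI."""
    import gradio as gr  # imported lazily so this module loads without Gradio

    return gr.Interface(
        fn=generate_fn,
        inputs=[
            gr.Textbox(label="Prompt"),
            gr.Slider(1, 47, value=30, step=1, label="Duration (seconds)"),
            gr.Slider(10, 250, value=100, step=10, label="Steps"),
            gr.Slider(1, 15, value=7, step=0.5, label="CFG scale"),
        ],
        outputs=gr.Audio(type="filepath", label="Generated audio"),
        title="Stable Audio Open",
    )
```

Start the app with `build_demo(my_generate_fn).launch(**LAUNCH_KWARGS)`, where `my_generate_fn` is a hypothetical function you supply. Load the model once outside that function so each request reuses it.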

Performance

| Duration | Steps | GPU | Time |
|---|---|---|---|
| 10 sec | 100 | RTX 3090 | ~15s |
| 10 sec | 100 | RTX 4090 | ~10s |
| 30 sec | 100 | RTX 3090 | ~40s |
| 30 sec | 100 | RTX 4090 | ~25s |
| 47 sec | 100 | RTX 4090 | ~40s |

Quality Tips

Better Music
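
Specific prompts tend to work better than vague ones. The helper below is a sketch that assembles a music prompt from tempo, genre, instruments, and mood. The function name `build_music_prompt` and the ordering of the parts are assumptions, not a documented prompt format.

```python
def build_music_prompt(genre, instruments=(), mood=None, bpm=None):
    """Join the supplied descriptors into one comma-separated prompt."""
    parts = []
    if bpm is not None:
        parts.append(f"{int(bpm)} BPM")
    parts.append(genre)
    parts.extend(instruments)
    if mood:
        parts.append(mood)
    return ", ".join(parts)

print(build_music_prompt(
    "lo-fi hip hop",
    instruments=["mellow piano", "vinyl crackle"],
    mood="relaxed",
    bpm=85,
))
```

Treat the output as a starting point and compare a few variations by ear, ideally with a fixed seed so that only the prompt changes between runs.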

Better Sound Effects

Cost Estimate

Typical CLORE.AI marketplace rates:

| GPU | Hourly rate | 30-second clips per hour (approx.) |
|---|---|---|
| RTX 3060 12GB | ~$0.03 | ~50 |
| RTX 3090 24GB | ~$0.06 | ~90 |
| RTX 4090 24GB | ~$0.10 | ~140 |
| A100 40GB | ~$0.17 | ~200 |

Prices vary. Check the CLORE.AI Marketplace for current rates.
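
The table above can be turned into a cost per clip by dividing the hourly rate by the clips generated per hour. This sketch uses the approximate figures from the table; actual marketplace prices will differ.

```python
# (hourly rate in USD, approximate 30-second clips per hour) from the table above
RATES = {
    "RTX 3060 12GB": (0.03, 50),
    "RTX 3090 24GB": (0.06, 90),
    "RTX 4090 24GB": (0.10, 140),
    "A100 40GB": (0.17, 200),
}

def cost_per_clip(hourly_rate, clips_per_hour):
    """Cost in USD of one clip at the given rate and throughput."""
    return hourly_rate / clips_per_hour

for gpu, (rate, clips) in RATES.items():
    print(f"{gpu}: ~${cost_per_clip(rate, clips) * 1000:.2f} per 1,000 clips")
```

By these figures the RTX 3060 is the cheapest per clip, and the faster cards buy shorter turnaround rather than lower cost.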

Troubleshooting

Out of Memory
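
One way to recover from out-of-memory errors is to retry with a shorter clip. The sketch below halves the requested duration after each failure. The name `generate_with_backoff` and the `generate_fn(seconds)` callback are hypothetical, and the default exception type is Python's built-in `MemoryError` so that the example runs without a GPU.

```python
def generate_with_backoff(generate_fn, seconds, min_seconds=5.0,
                          oom_errors=(MemoryError,)):
    """Call generate_fn(seconds), halving the duration after each memory error.

    Returns (result, seconds_used). Re-raises the error once the next
    attempt would fall below min_seconds.
    """
    while True:
        try:
            return generate_fn(seconds), seconds
        except oom_errors:
            if seconds / 2 < min_seconds:
                raise  # nothing smaller left to try
            seconds /= 2

def fake_generate(seconds):
    """Stand-in for a real generation call: fails above 12 seconds."""
    if seconds > 12:
        raise MemoryError("simulated out-of-memory")
    return f"clip of {seconds}s"

result, used = generate_with_backoff(fake_generate, 47)
print(result, used)
```

On a GPU, pass `oom_errors=(torch.cuda.OutOfMemoryError,)` and consider calling `torch.cuda.empty_cache()` between attempts. A shorter duration only lowers memory use if `generate_fn` also shrinks the `sample_size` it passes to the model. Loading the model in half precision is another option worth trying.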

Poor Quality Output

  • Increase steps (150-200)

  • Adjust CFG scale (try 5-10)

  • Be more specific in prompt

  • Try different seeds

No Sound / Silence

  • Check prompt is descriptive enough

  • Avoid very abstract descriptions

  • Try known-working prompts first

Audio Artifacts

  • Increase steps

  • Lower CFG scale

  • Reduce duration

  • Check for GPU thermal throttling

Stable Audio vs Others

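| Feature | Stable Audio | AudioCraft | Bark |
|---|---|---|---|
| Music | Excellent | Excellent | Poor |
| SFX | Great | Good | Poor |
| Speech | No | No | Yes |
| Max duration | 47s / 3min | 30s | 15s |
| Sample rate | 44.1kHz | 32kHz | 24kHz |
| Open weights | Partial | Yes | Yes |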

Use Stable Audio when you need:

  • High-quality music generation

  • Sound effects for games or video

  • Background music

  • Ambient soundscapes

Next Steps
