For the complete documentation index, see llms.txt. This page is also available as Markdown.

Quickstart

Step 1: Create Account & Add Funds

  1. Go to clore.ai β†’ Sign Up

  2. Verify your email

  3. Go to Account β†’ Deposit

  4. Add funds via CLORE, BTC, USDT, or USDC (minimum ~$5 to start)

Step 2: Pick a GPU

Go to the Marketplace and choose based on your task:

What I Want To Do
Minimum GPU
Budget/Day

Chat with AI (7B models)

RTX 3060 12GB

~$0.15

Chat with AI (32B models)

RTX 4090 24GB

~$0.50

Generate images (FLUX)

RTX 3090 24GB

~$0.30

Generate videos

RTX 4090 24GB

~$0.50

Generate music

Any GPU 4GB+

~$0.15

Voice cloning / TTS

RTX 3060 6GB+

~$0.15

Transcribe audio

RTX 3060 8GB+

~$0.15

Fine-tune a model

RTX 4090 24GB

~$0.50

Run 70B+ models

A100 80GB

~$2.00

Quick GPU Guide

GPU
VRAM
Price
Sweet Spot For

RTX 3060

12GB

$0.15–0.30/day

TTS, music, small models

RTX 3090

24GB

$0.30–1.00/day

Image gen, 32B models

RTX 4090

24GB

$0.50–2.00/day

Everything up to 35B, fast inference

RTX 5090

32GB

$1.50–3.00/day

70B quantized, fastest

A100 80GB

80GB

$2.00–4.00/day

70B FP16, serious training

H100 80GB

80GB

$3.00–6.00/day

400B+ MoE models

Step 3: Deploy

Click Rent on your chosen server, then configure:

  • Order type: On-Demand (guaranteed) or Spot (30–50% cheaper, can be interrupted)

  • Docker image: See recipes below

  • Ports: Always include 22/tcp (SSH) + your app port

  • Environment: Add any API keys needed

πŸš€ One-Click Recipes

Chat with AI (Ollama + Open WebUI)

The easiest way to run local AI β€” ChatGPT-like interface with any open model.

After deploy, open the HTTP URL β†’ create account β†’ pick a model (Llama 4 Scout, Gemma 3, Qwen3.5) β†’ chat!

Image Generation (ComfyUI)

Node-based workflow for FLUX, Stable Diffusion, and more.

Image Generation (Stable Diffusion WebUI)

Classic UI for Stable Diffusion, SDXL, and SD 3.5.

LLM API Server (vLLM)

Production-grade serving with OpenAI-compatible API.

Music Generation (ACE-Step)

Generate full songs with vocals β€” works on any 4GB+ GPU!

SSH in, then:

Step 4: Connect

After your order starts:

  1. Go to My Orders β†’ find your active order

  2. Web UI: Click the HTTP URL (e.g., https://xxx.clorecloud.net)

  3. SSH: ssh -p <port> root@<proxy-address>

Deploy
Typical Startup

Ollama + Open WebUI

3–5 min

ComfyUI

10–15 min

vLLM

5–15 min (depends on model size)

SD WebUI

10–20 min

Step 5: Start Creating

Once your service is running, explore the guides for your specific use case:

πŸ€– Language Models (Chat, Code, Reasoning)

  • Ollama β€” easiest model management

  • Llama 4 Scout β€” Meta's latest, 10M context

  • Gemma 3 β€” Google's 27B that beats 405B models

  • Qwen3.5 β€” beat Claude 4.5 on math (Feb 2026!)

  • DeepSeek-R1 β€” chain-of-thought reasoning

  • vLLM β€” production API serving

🎨 Image Generation

🎬 Video Generation

πŸ”Š Audio & Voice

  • Qwen3-TTS β€” voice cloning, 10+ languages

  • WhisperX β€” transcription + speaker diarization

  • Dia TTS β€” multi-speaker dialog

  • Kokoro β€” tiny TTS, only 2GB VRAM

🎡 Music

  • ACE-Step β€” full songs on < 4GB VRAM

πŸ’» AI Coding

  • TabbyML β€” self-hosted Copilot for $4.50/month

  • Aider β€” terminal AI coding assistant

🧠 Training

  • Unsloth β€” 2x faster, 70% less VRAM

  • Axolotl β€” YAML-based fine-tuning

πŸ’‘ Tips for Beginners

  1. Start with Ollama β€” it's the easiest way to try AI locally

  2. RTX 4090 is the sweet spot β€” handles 90% of use cases at $0.50–2/day

  3. Use Spot orders for experiments β€” 30–50% cheaper

  4. Use On-Demand for important work β€” guaranteed, no interruptions

  5. Download your outputs before the order ends β€” files are deleted after

  6. Pay with CLORE token β€” often better rates than stablecoins

  7. Check RAM and network β€” low RAM is the #1 cause of failures

Troubleshooting

Problem
Solution

HTTP 502 for a long time

Wait 10–20 min for first startup; check RAM β‰₯ 16GB

Service won't start

RAM too low (need 16GB+) or VRAM too small for the model

Slow model download

Normal on first run; prefer 500Mbps+ servers

CUDA out of memory

Use smaller model or bigger GPU; try quantized versions

Can't SSH

Check port is 22/tcp in config; wait for server to fully start

Prefer code over clicking? Install the official SDK:

Or use Python directly:

β†’ Full Python Quickstart | SDK Guide | CLI Automation

Need Help?

Last updated

Was this helpful?