Kohya Training

Train LoRA, Dreambooth, and full fine-tunes for Stable Diffusion using Kohya's trainer.

circle-check

Renting on CLORE.AI

  1. Filter by GPU type, VRAM, and price

  2. Choose On-Demand (fixed rate) or Spot (bid price)

  3. Configure your order:

    • Select Docker image

    • Set ports (TCP for SSH, HTTP for web UIs)

    • Add environment variables if needed

    • Enter startup command

  4. Select payment: CLORE, BTC, or USDT/USDC

  5. Create order and wait for deployment

Access Your Server

  • Find connection details in My Orders

  • Web interfaces: Use the HTTP port URL

  • SSH: ssh -p <port> root@<proxy-address>

What is Kohya?

Kohya_ss is a training toolkit for:

  • LoRA - Lightweight adapters (most popular)

  • Dreambooth - Subject/style training

  • Full fine-tune - Complete model training

  • LyCORIS - Advanced LoRA variants

Requirements

Training Type
Min VRAM
Recommended

LoRA SD 1.5

6GB

RTX 3060

LoRA SDXL

12GB

RTX 3090

Dreambooth SD 1.5

12GB

RTX 3090

Dreambooth SDXL

24GB

RTX 4090

Quick Deploy

Docker Image:

Ports:

Command:

Accessing Your Service

After deployment, find your http_pub URL in My Orders:

  1. Go to My Orders page

  2. Click on your order

  3. Find the http_pub URL (e.g., abc123.clorecloud.net)

Use https://YOUR_HTTP_PUB_URL instead of localhost in examples below.

Using the Web UI

  1. Access at http://<proxy>:<port>

  2. Select training type (LoRA, Dreambooth, etc.)

  3. Configure settings

  4. Start training

Dataset Preparation

Folder Structure

Image Requirements

  • Resolution: 512x512 (SD 1.5) or 1024x1024 (SDXL)

  • Format: PNG or JPG

  • Quantity: 10-50 images for LoRA

  • Quality: Clear, well-lit, varied angles

Caption Files

Create .txt file with same name as image:

myimage.txt:

Auto-Captioning

Use BLIP for automatic captions:

LoRA Training (SD 1.5)

Configuration

In Kohya UI:

Setting
Value

Model

runwayml/stable-diffusion-v1-5

Network Rank

32-128

Network Alpha

16-64

Learning Rate

1e-4

Batch Size

1-4

Epochs

10-20

Optimizer

AdamW8bit

Command Line Training

LoRA Training (SDXL)

Dreambooth Training

Subject Training

Style Training

Training Tips

Optimal Settings

Parameter
Person/Character
Style
Object

Network Rank

64-128

32-64

32

Network Alpha

32-64

16-32

16

Learning Rate

1e-4

5e-5

1e-4

Epochs

15-25

10-15

10-15

Avoiding Overfitting

  • Use regularization images

  • Lower learning rate

  • Fewer epochs

  • Increase network alpha

Avoiding Underfitting

  • More training images

  • Higher learning rate

  • More epochs

  • Lower network alpha

Monitoring Training

TensorBoard

Key Metrics

  • loss - Should decrease then stabilize

  • lr - Learning rate schedule

  • epoch - Training progress

Testing Your LoRA

With Automatic1111

Copy LoRA to:

Use in prompt:

With ComfyUI

Load LoRA node and connect to model.

With Diffusers

Advanced Training

LyCORIS (LoHa, LoKR)

Textual Inversion

Saving & Exporting

Download Trained Model

Convert Formats

Cost Estimate

Typical CLORE.AI marketplace rates (as of 2024):

GPU
Hourly Rate
Daily Rate
4-Hour Session

RTX 3060

~$0.03

~$0.70

~$0.12

RTX 3090

~$0.06

~$1.50

~$0.25

RTX 4090

~$0.10

~$2.30

~$0.40

A100 40GB

~$0.17

~$4.00

~$0.70

A100 80GB

~$0.25

~$6.00

~$1.00

Prices vary by provider and demand. Check CLORE.AI Marketplacearrow-up-right for current rates.

Save money:

  • Use Spot market for flexible workloads (often 30-50% cheaper)

  • Pay with CLORE tokens

  • Compare prices across different providers

Troubleshooting

OOM Error

  • Reduce batch size to 1

  • Enable gradient checkpointing

  • Use 8bit optimizer

  • Lower resolution

Poor Results

  • More/better training images

  • Adjust learning rate

  • Check captions match images

  • Try different network rank

Training Crashes

  • Check CUDA version

  • Update xformers

  • Reduce batch size

  • Check disk space

Last updated

Was this helpful?