Kandinsky

Generate images with powerful multilingual text understanding.

circle-check

What is Kandinsky?

Kandinsky is an image generation model developed by Sber AI:

  • Strong multilingual text understanding

  • High-quality image generation

  • Image mixing and interpolation

  • Inpainting and outpainting support

  • Open source weights

Resources

Model Versions

Version
Resolution
Quality
Speed

Kandinsky 2.1

768x768

Good

Fast

Kandinsky 2.2

1024x1024

Better

Medium

Kandinsky 3

1024x1024

Best

Slower

Hardware Requirements

Model
VRAM
Recommended GPU

Kandinsky 2.2

8GB

RTX 3070

Kandinsky 3

12GB

RTX 3090

Kandinsky 3 (high res)

16GB

RTX 4090

Quick Deploy

Docker Image:

Ports:

Command:

Accessing Your Service

After deployment, find your http_pub URL in My Orders:

  1. Go to My Orders page

  2. Click on your order

  3. Find the http_pub URL (e.g., abc123.clorecloud.net)

Use https://YOUR_HTTP_PUB_URL instead of localhost in examples below.

Installation

Basic Usage

Kandinsky 3

Kandinsky 2.2

Multilingual Prompts

Kandinsky supports multiple languages:

Image Mixing

Inpainting

Image-to-Image

Batch Generation

Gradio Interface

Memory Optimization

Performance

Model
Resolution
GPU
Time

Kandinsky 3

1024x1024

RTX 3090

15s

Kandinsky 3

1024x1024

RTX 4090

10s

Kandinsky 2.2

768x768

RTX 3090

8s

Kandinsky 2.2

768x768

RTX 4090

5s

Troubleshooting

Out of Memory

Problem: CUDA OOM when generating

Solutions:

  • Enable CPU offloading

  • Reduce resolution

  • Use Kandinsky 2.2 instead of 3

  • Enable attention slicing

Poor Text Rendering

Problem: Text in images looks wrong

Solutions:

  • Kandinsky struggles with text rendering (like most diffusion models)

  • Add text in post-processing

  • Use prompts that avoid text

Colors Look Wrong

Problem: Image colors are washed out or oversaturated

Solutions:

  • Adjust guidance scale (try 3-6 range)

  • Specify color preferences in prompt

  • Post-process with color correction

Slow Generation

Problem: Takes too long to generate

Solutions:

  • Reduce inference steps (30 is often enough)

  • Use fp16 precision

  • Use Kandinsky 2.2 for faster results

  • Reduce resolution for previews

Comparison with Other Models

Feature
Kandinsky 3
SDXL
FLUX

Multilingual

Excellent

Limited

Limited

Image Quality

High

Very High

Highest

Speed

Medium

Medium

Slow

VRAM

12GB

12GB

24GB

Inpainting

Yes

Yes

Limited

Cost Estimate

Typical CLORE.AI marketplace rates (as of 2024):

GPU
Hourly Rate
Daily Rate
4-Hour Session

RTX 3060

~$0.03

~$0.70

~$0.12

RTX 3090

~$0.06

~$1.50

~$0.25

RTX 4090

~$0.10

~$2.30

~$0.40

A100 40GB

~$0.17

~$4.00

~$0.70

A100 80GB

~$0.25

~$6.00

~$1.00

Prices vary by provider. Check CLORE.AI Marketplacearrow-up-right for current rates.

Next Steps

  • FLUX Generation - Highest quality images

  • Stable Diffusion - Most popular option

  • PixArt - Fast generation

  • ComfyUI - Advanced workflows

Last updated

Was this helpful?