Images Docker

Images Docker prêtes à déployer pour les charges de travail IA sur Clore.ai

Images Docker prêtes à être déployées pour les charges de travail IA sur CLORE.AI.

Déployez ces images directement sur CLORE.AI Marketplace.

Référence de déploiement rapide

Les plus populaires

Tâche

Image

Ports

Discuter avec l'IA

ollama/ollama

22, 11434

Interface de type ChatGPT

ghcr.io/open-webui/open-webui

22, 8080

Génération d'images

universonic/stable-diffusion-webui

22, 7860

Génération d'images basée sur des nœuds

yanwk/comfyui-boot

22, 8188

Serveur API LLM

vllm/vllm-openai

22, 8000

Modèles de langage

composant Ollama

Exécuteur LLM universel - la façon la plus simple d'exécuter n'importe quel modèle.

Image : ollama/ollama
Ports : 22/tcp, 11434/http
Commande : ollama serve

Après le déploiement :

# Se connecter en SSH au serveur
ssh -p <port> root@<proxy>

# Récupérer et exécuter un modèle
ollama pull llama3.2
ollama run llama3.2

Variables d'environnement :

OLLAMA_HOST=0.0.0.0
OLLAMA_MODELS=/root/.ollama/models

Ouvrir WebUI

Interface de type ChatGPT pour Ollama.

Image : ghcr.io/open-webui/open-webui:ollama
Ports : 22/tcp, 8080/http

Inclut Ollama intégré. Accès via le port HTTP.

Autonome (se connecter à un Ollama existant) :

Image : ghcr.io/open-webui/open-webui:main
Ports : 22/tcp, 8080/http
Environnement : OLLAMA_BASE_URL=http://localhost:11434

vLLM

Service LLM haute performance avec API compatible OpenAI.

Image : vllm/vllm-openai:latest
Ports : 22/tcp, 8000/http
Commande : python -m vllm.entrypoints.openai.api_server --model meta-llama/Meta-Llama-3.1-8B-Instruct --host 0.0.0.0

Pour les modèles plus grands (multi-GPU) :

python -m vllm.entrypoints.openai.api_server \
    --model meta-llama/Meta-Llama-3.1-70B-Instruct \
    --tensor-parallel-size 2 \
    --host 0.0.0.0

Variables d'environnement :

HUGGING_FACE_HUB_TOKEN=<votre-token>  # Pour les modèles restreints

Text Generation Inference (TGI)

Serveur LLM de production de HuggingFace.

Image : ghcr.io/huggingface/text-generation-inference:latest
Ports : 22/tcp, 8080/http
Commande : --model-id meta-llama/Meta-Llama-3.1-8B-Instruct

Variables d'environnement :

HUGGING_FACE_HUB_TOKEN=<votre-token>
MAX_INPUT_LENGTH=4096
MAX_TOTAL_TOKENS=8192

Génération d'images

Stable Diffusion WebUI (AUTOMATIC1111)

Interface SD la plus populaire avec extensions.

Image : universonic/stable-diffusion-webui:latest
Ports : 22/tcp, 7860/http

Pour faible VRAM (8 Go ou moins) :

./webui.sh --listen --medvram --xformers

Pour l'accès API :

./webui.sh --listen --xformers --api

ComfyUI

Flux de travail basé sur des nœuds pour utilisateurs avancés.

Image : yanwk/comfyui-boot:cu126-slim
Ports : 22/tcp, 8188/http
Environnement : CLI_ARGS=--listen 0.0.0.0

Images alternatives :

# Avec extensions courantes
Image : ai-dock/comfyui:latest

# Minimal
Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel

Commande d'installation manuelle :

git clone https://github.com/comfyanonymous/ComfyUI && cd ComfyUI && pip install -r requirements.txt && python main.py --listen 0.0.0.0

Fooocus

Interface SD simplifiée, type Midjourney.

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp, 7865/http
Commande : git clone https://github.com/lllyasviel/Fooocus && cd Fooocus && pip install -r requirements.txt && python launch.py --listen

FLUX

Génération d'images haute qualité et récente.

Utiliser ComfyUI avec les nœuds FLUX :

Image : yanwk/comfyui-boot:cu126-slim
Ports : 22/tcp, 8188/http

Ou via Diffusers :

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

# Après SSH
pip install diffusers transformers accelerate
python << 'EOF'
from diffusers import FluxPipeline
pipe = FluxPipeline.from_pretrained("black-forest-labs/FLUX.1-schnell")
pipe.enable_model_cpu_offload()
image = pipe("A cat", num_inference_steps=4).images[0]
image.save("output.png")
EOF

Génération vidéo

Stable Video Diffusion

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

pip install diffusers transformers accelerate
python << 'EOF'
from diffusers import StableVideoDiffusionPipeline
from diffusers.utils import load_image, export_to_video
pipe = StableVideoDiffusionPipeline.from_pretrained(
    "stabilityai/stable-video-diffusion-img2vid-xt",
    variant="fp16"
)
pipe.to("cuda")
image = load_image("input.png")
frames = pipe(image, num_frames=25).frames[0]
export_to_video(frames, "output.mp4", fps=7)
EOF

AnimateDiff

Utiliser avec ComfyUI :

Image : yanwk/comfyui-boot:cu126-slim
Ports : 22/tcp, 8188/http

Installez les nœuds AnimateDiff via le gestionnaire ComfyUI.

Audio et voix

Whisper (Transcription)

Image : onerahmet/openai-whisper-asr-webservice:latest
Ports : 22/tcp, 9000/http
Environnement : ASR_MODEL=large-v3

Utilisation de l'API :

curl -X POST "http://localhost:9000/asr" \
    -F "[email protected]" \
    -F "task=transcribe"

Bark (Synthèse vocale)

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

pip install bark
python << 'EOF'
from bark import SAMPLE_RATE, generate_audio, preload_models
from scipy.io.wavfile import write as write_wav
preload_models()
audio = generate_audio("Hello, this is a test.")
write_wav("output.wav", SAMPLE_RATE, audio)
EOF

Stable Audio

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

pip install stable-audio-tools
# Requiert un token HF pour l'accès au modèle

Modèles de vision

LLaVA

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

pip install llava
python -m llava.serve.cli --model-path liuhaotian/llava-v1.6-34b

Llama 3.2 Vision

Utiliser Ollama :

Image : ollama/ollama
Ports : 22/tcp, 11434/http

ollama pull llama3.2-vision
ollama run llama3.2-vision "describe this image" --images photo.jpg

Développement & Entraînement

Base PyTorch

Pour configurations personnalisées et entraînement.

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

Inclut : CUDA 12.1, cuDNN 8, PyTorch 2.1

Jupyter Lab

Notebooks interactifs pour ML.

Image : jupyter/pytorch-notebook:cuda12-pytorch-2.1
Ports : 22/tcp, 8888/http

Ou utilisez la base PyTorch avec Jupyter :

pip install jupyterlab
jupyter lab --ip=0.0.0.0 --allow-root --no-browser

Entraînement Kohya

Pour LoRA et l'affinage de modèles.

Image : pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel
Ports : 22/tcp

git clone https://github.com/kohya-ss/sd-scripts
cd sd-scripts
pip install -r requirements.txt
# Utiliser les scripts d'entraînement

Référence des images de base

Officiel NVIDIA

Image

CUDA

Cas d'utilisation

nvidia/cuda:12.1.0-devel-ubuntu22.04

12.1

Développement CUDA

nvidia/cuda:12.1.0-runtime-ubuntu22.04

12.1

Runtime CUDA uniquement

nvidia/cuda:11.8.0-devel-ubuntu22.04

11.8

Compatibilité héritée

Officiel PyTorch

Image

PyTorch

CUDA

pytorch/pytorch:2.5.1-cuda12.4-cudnn9-devel

2.5

12.4

pytorch/pytorch:2.0.1-cuda11.7-cudnn8-devel

2.0

11.7

pytorch/pytorch:1.13.1-cuda11.6-cudnn8-devel

1.13

11.6

HuggingFace

Image

But

huggingface/transformers-pytorch-gpu

Transformers + PyTorch

ghcr.io/huggingface/text-generation-inference

Serveur TGI

Variables d'environnement

Variables courantes

Variable

Description

Exemple

HUGGING_FACE_HUB_TOKEN

Token API HF pour les modèles restreints

hf_xxx

CUDA_VISIBLE_DEVICES

Sélection du GPU

0,1

TRANSFORMERS_CACHE

Répertoire du cache de modèles

/root/.cache

Variables Ollama

Variable

Description

Par défaut

OLLAMA_HOST

Adresse de liaison

127.0.0.1

OLLAMA_MODELS

Répertoire des modèles

~/.ollama/models

OLLAMA_NUM_PARALLEL

Requêtes parallèles

1

Variables vLLM

Variable

Description

VLLM_ATTENTION_BACKEND

Implémentation de l'attention

VLLM_USE_MODELSCOPE

Utiliser ModelScope au lieu de HF

Référence des ports

Port

Protocole

Service

TCP

SSH

7860

HTTP

Gradio (SD WebUI, Fooocus)

7865

HTTP

Alternative Fooocus

8000

HTTP

API vLLM

8080

HTTP

Open WebUI, TGI

8188

HTTP

ComfyUI

8888

HTTP

Jupyter

9000

HTTP

API Whisper

11434

TCP

API Ollama

Conseils

Stockage persistant

Montez des volumes pour conserver les données entre les redémarrages :

docker run -v /data/models:/root/.cache/huggingface ...

Sélection du GPU

Pour systèmes multi-GPU :

docker run --gpus '"device=0,1"' ...
# ou
CUDA_VISIBLE_DEVICES=0,1

Gestion de la mémoire

Si manque de VRAM :

Utilisez des modèles plus petits
Activer le déchargement vers le CPU
Réduire la taille de batch
Utilisez des modèles quantifiés (GGUF Q4)

Prochaines étapes

Comparaison GPU - Choisir le bon GPU
Compatibilité des modèles - Ce qui fonctionne où
Guide de démarrage rapide - Commencer en 5 minutes

PrécédentCalculatrice de coûts SuivantTarification GPU

Mis à jour il y a 1 jour

Ce contenu vous a-t-il été utile ?

hashtagRéférence de déploiement rapide

hashtagLes plus populaires

hashtagModèles de langage

hashtagcomposant Ollama

hashtagOuvrir WebUI

hashtagvLLM

hashtagText Generation Inference (TGI)

hashtagGénération d'images

hashtagStable Diffusion WebUI (AUTOMATIC1111)

hashtagComfyUI

hashtagFooocus

hashtagFLUX

hashtagGénération vidéo

hashtagStable Video Diffusion

hashtagAnimateDiff

hashtagAudio et voix

hashtagWhisper (Transcription)

hashtagBark (Synthèse vocale)

hashtagStable Audio

hashtagModèles de vision

hashtagLLaVA

hashtagLlama 3.2 Vision

hashtagDéveloppement & Entraînement

hashtagBase PyTorch

hashtagJupyter Lab

hashtagEntraînement Kohya

hashtagRéférence des images de base

hashtagOfficiel NVIDIA

hashtagOfficiel PyTorch

hashtagHuggingFace

hashtagVariables d'environnement

hashtagVariables courantes

hashtagVariables Ollama

hashtagVariables vLLM

hashtagRéférence des ports

hashtagConseils

hashtagStockage persistant

hashtagSélection du GPU

hashtagGestion de la mémoire

hashtagProchaines étapes

Référence de déploiement rapide

Les plus populaires

Modèles de langage

composant Ollama

Ouvrir WebUI

vLLM

Text Generation Inference (TGI)

Génération d'images

Stable Diffusion WebUI (AUTOMATIC1111)

ComfyUI

Fooocus

FLUX

Génération vidéo

Stable Video Diffusion

AnimateDiff

Audio et voix

Whisper (Transcription)

Bark (Synthèse vocale)

Stable Audio

Modèles de vision

LLaVA

Llama 3.2 Vision

Développement & Entraînement

Base PyTorch

Jupyter Lab

Entraînement Kohya

Référence des images de base

Officiel NVIDIA

Officiel PyTorch

HuggingFace

Variables d'environnement

Variables courantes

Variables Ollama

Variables vLLM

Référence des ports

Conseils

Stockage persistant

Sélection du GPU

Gestion de la mémoire

Prochaines étapes