ChromaDB

Deploy ChromaDB open-source vector database for AI applications on Clore.ai GPUs

ChromaDB is the leading open-source vector database purpose-built for AI applications. It provides a simple, intuitive API for storing, querying, and managing high-dimensional embeddings — the backbone of modern RAG systems, semantic search, recommendation engines, and LLM memory.

ChromaDB abstracts away the complexity of vector similarity search, letting you focus on building AI applications. It supports both in-memory mode for development and a persistent server mode for production deployments, with Docker support for easy deployment on Clore.ai GPU servers.

Key features:

  • 🚀 Simple Python/JavaScript API — get started in minutes

  • 💾 Persistent storage — data survives container restarts

  • 🔍 Multiple distance metrics — cosine, L2, inner product

  • 📦 Integrated embeddings — built-in support for OpenAI, Cohere, sentence-transformers

  • 🏗️ Multi-tenant — collections for organizing different datasets

  • 🔌 REST API — language-agnostic HTTP interface

  • Fast — HNSW index for approximate nearest-neighbor search

  • 🔗 LangChain/LlamaIndex native — first-class integration

circle-check

Server Requirements

Parameter
Minimum
Recommended

GPU

Any NVIDIA GPU (optional)

NVIDIA RTX 3080+ (for embeddings)

VRAM

Not required for ChromaDB

8–16 GB (for local embedding models)

RAM

4 GB

16–32 GB

CPU

2 cores

8 cores

Disk

10 GB

50–200 GB (for large datasets)

OS

Ubuntu 20.04+

Ubuntu 22.04

Docker

Required

Docker + Docker Compose

Ports

22, 8000

22, 8000

circle-info

ChromaDB itself doesn't require a GPU — it runs efficiently on CPU. However, generating embeddings (converting text to vectors) benefits greatly from GPU acceleration. If you plan to use local embedding models (sentence-transformers, etc.), choose a server with a GPU.


Quick Deploy on CLORE.AI

1. Find a suitable server

Go to CLORE.AI Marketplacearrow-up-right and choose:

  • CPU-only for ChromaDB server + API (store pre-computed embeddings)

  • GPU server if you want to generate embeddings locally as well

2. Configure your deployment

Docker Image:

Port Mappings:

Environment Variables:

Startup Command:

3. Test the deployment


Step-by-Step Setup

Step 1: SSH into your server

Step 2: Create data directory

Step 3: Run ChromaDB container

Step 4: Verify it's running

Step 5: Install Python client

Step 6: Test connectivity from Python

Step 7: (Optional) Enable authentication


Usage Examples

Example 1: Basic Vector Store Operations



Example 3: RAG Pipeline with ChromaDB + OpenAI


Example 4: Multi-Collection Document Management


Example 5: Filtering and Metadata Queries


Configuration

Docker Compose (Production)

Environment Variables Reference

Variable
Default
Description

IS_PERSISTENT

FALSE

Enable persistent storage

ANONYMIZED_TELEMETRY

TRUE

Disable usage tracking

CHROMA_SERVER_LOG_LEVEL

INFO

Log verbosity

CHROMA_MEMORY_LIMIT_BYTES

None

Max memory for segment cache

ALLOW_RESET

FALSE

Allow resetting all data via API

CHROMA_SERVER_AUTH_PROVIDER

None

Authentication provider class


Performance Tips

1. Choose the Right Embedding Model

Model
Dimensions
Speed
Quality
GPU Required

all-MiniLM-L6-v2

384

Fast

Good

No

all-mpnet-base-v2

768

Med

Better

Optional

text-embedding-3-small

1536

Fast

Great

API only

BAAI/bge-large-en-v1.5

1024

Med

Best

Yes

2. Batch Upserts for Speed

3. HNSW Index Tuning

4. Persistent Client for Local Use


Troubleshooting

Issue: Cannot connect to ChromaDB

Issue: Data lost on container restart

Issue: Out of memory errors

Issue: Slow embedding generation

Issue: Collection not found after restart



Clore.ai GPU Recommendations

Use Case
Recommended GPU
Est. Cost on Clore.ai

Development/Testing

RTX 3090 (24GB)

~$0.12/gpu/hr

Production RAG

RTX 3090 (24GB)

~$0.12/gpu/hr

High-throughput Embedding

RTX 4090 (24GB)

~$0.70/gpu/hr

💡 All examples in this guide can be deployed on Clore.aiarrow-up-right GPU servers. Browse available GPUs and rent by the hour — no commitments, full root access.

Last updated

Was this helpful?