Storage Benchmark Kit
This driver provides performance benchmarking capabilities for ChromaDB, a vector database for AI applications.
ChromaDB is an open-source vector database designed for AI applications. This driver allows you to benchmark ChromaDB’s performance for storing and retrieving byte arrays as documents within collections.
The driver supports the following configuration options:
host: ChromaDB server host (default: localhost)port: ChromaDB server port (default: 8000)collectionName: Name of the collection to use (default: sbk_benchmark)embeddingDimension: Dimension for embeddings (default: 384)distanceFunction: Distance function for similarity (default: cosine)ssl: Use SSL connection (default: false)authToken: Authentication token (default: empty)timeoutSeconds: Connection timeout in seconds (default: 30)maxRetries: Maximum connection retries (default: 3)batchSize: Batch size for operations (default: 100)# Using Docker
docker run -p 8000:8000 chromadb/chroma
# Or using Python
pip install chromadb
chroma run --host localhost --port 8000
# Run benchmark with default settings
./sbk -class chromadb -writers 4 -readers 4 -size 1024 -seconds 60
# Custom ChromaDB settings
./sbk -class chromadb -writers 4 -readers 4 -size 1024 -seconds 60 \
-host localhost \
-port 8000 \
-collectionName my_test_collection
You can also modify the default settings in ChromaDB.properties:
host=localhost
port=8000
collectionName=sbk_benchmark
embeddingDimension=384
distanceFunction=cosine
ssl=false
authToken=
timeoutSeconds=30
maxRetries=3
batchSize=100
The driver uses a key generation strategy similar to other SBK drivers:
io.github.amikos-tech:chromadb-java-client:0.1.7 - ChromaDB Java clienttools.jackson - JSON processing (included via ChromaDB client)This driver is licensed under the Apache License 2.0, same as the SBK framework.