Hybrid Search
Dense and sparse vectors fused in one query. Keyword recall meets semantic intent.
sub-50ms @ 10M docs
Vector is the open-source vector database built for AI — sub-millisecond queries, billion-scale indexes, a Python SDK that feels native.
Adopted in production
Powering retrieval at Series-B AI startups and Fortune-500 search teams.
from vector import Client db = Client("vector://localhost:6333") db.upsert("docs", embeddings) hits = db.query( collection="docs", vector=q, limit=5, )
single node · 1B vectors · 1024-dim · cold-cache p99
Benchmark · not a claim
faster than Pinecone
ANN benchmark · 1B vectors
p99 query latency
1024-dim embeddings
vectors per node
HNSW + product quantization
recall@10
vs. exact brute-force
[1] Measured on c6i.4xlarge, 1B 1024-dim float32 vectors, HNSW (M=16, ef=128), 99th-percentile cold-cache reads. Full methodology and reproducible scripts in our public benchmark repo ↗.
Developer experience
No config files. No schema ceremony. Connect, upsert, query — that's the whole API surface you need to ship.
from vector import Client db = Client("vector://localhost:6333") db.create_collection("docs", dim=1536) db.upsert("docs", embeddings) hits = db.query("docs", vector=q, limit=5)
import { Client } from "vector-db"; const db = new Client("vector://localhost:6333"); await db.createCollection("docs", { dim: 1536 }); await db.upsert("docs", embeddings); const hits = await db.query("docs", { vector: q, limit: 5 });
# create collection curl -X PUT localhost:6333/collections/docs \ -H "Content-Type: application/json" \ -d '{"vectors": {"size": 1536, "distance": "cosine"}}' # query curl -X POST localhost:6333/collections/docs/points/query \ -H "Content-Type: application/json" \ -d '{"vector": [0.1, ...], "limit": 5}'
How it works
connect
One URI, no driver soup. Same client works in-process or across the network.
model
Declare a collection with a dimension. Vector infers the rest.
load
Batch upsert streams straight into the index, no rebuild step.
query
Vector + metadata filter in one call. Recall doesn't flinch.
The features you'd reach for after the demo — already first-class.
No enterprise upsell required.
Dense and sparse vectors fused in one query. Keyword recall meets semantic intent.
sub-50ms @ 10M docs
Metadata filtering that respects your index. Predicate pushdown, no recompute.
zero recompute
Sharding and replication built in. Grow from laptop to cluster without a rewrite.
1B vectors · one command
Pricing
Self-host forever, free. Pay only when you want it managed.
For builders & solo projects.
For teams shipping to production.
For regulated & hyperscale.
Self-host forever, free. Apache-2.0. No vendor lock-in. · See the fair comparison vs. managed alternatives ↗
Ship today
Two commands. No signup wall. Your embeddings, indexed in under a minute.