OctaneDB is an open-source vector database for Python that focuses on ultra-fast similarity search over high-dimensional data. It is aimed at AI/ML, semantic search, and large-scale document or embedding retrieval.
What does it do?
Store, index, and search millions of embeddings (text, images, etc.) with sub-millisecond query time.
Supports in-memory and efficient HDF5 persistent storage.
Integrates seamlessly with sentence-transformers for automatic text embedding.
Key Features:
10x faster than Pinecone or ChromaDB for vector search and batch insertions.
Advanced indexing: HNSW (approximate nearest neighbor) and FlatIndex.
Batch search, advanced metadata filtering, and GPU acceleration.
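For readers unfamiliar with the terms in the feature list, here is a rough sketch of what a flat index with batch search and metadata filtering might look like. It is plain NumPy and does not use OctaneDB's API: the class name FlatIndexSketch, its methods, and the where callback are all hypothetical, and it assumes a flat index means exhaustive cosine-similarity search over every stored vector.

```python
import numpy as np


class FlatIndexSketch:
    """Hypothetical brute-force cosine-similarity index (not OctaneDB's API)."""

    def __init__(self, dim):
        self.dim = dim
        self.vectors = np.empty((0, dim), dtype=np.float32)
        self.metadata = []

    def add_batch(self, vectors, metadata):
        """Insert many vectors at once, each with a metadata dict."""
        vectors = np.asarray(vectors, dtype=np.float32)
        if vectors.ndim != 2 or vectors.shape[1] != self.dim:
            raise ValueError(f"expected shape (n, {self.dim})")
        if len(metadata) != len(vectors):
            raise ValueError("need one metadata entry per vector")
        # Store unit-length vectors so a dot product equals cosine similarity.
        norms = np.linalg.norm(vectors, axis=1, keepdims=True)
        norms[norms == 0] = 1.0
        self.vectors = np.vstack([self.vectors, vectors / norms])
        self.metadata.extend(metadata)

    def search(self, queries, k=3, where=None):
        """Return, per query, up to k (id, score) pairs, best first."""
        queries = np.atleast_2d(np.asarray(queries, dtype=np.float32))
        qnorms = np.linalg.norm(queries, axis=1, keepdims=True)
        qnorms[qnorms == 0] = 1.0
        # One matrix product scores every query against every stored vector.
        scores = (queries / qnorms) @ self.vectors.T
        if where is not None:
            # Metadata filter: rows failing the predicate can never match.
            mask = np.array([where(m) for m in self.metadata], dtype=bool)
            scores = np.where(mask, scores, -np.inf)
        results = []
        for row in scores:
            order = np.argsort(-row)[:k]
            results.append(
                [(int(i), float(row[i])) for i in order if np.isfinite(row[i])]
            )
        return results


index = FlatIndexSketch(dim=3)
index.add_batch(
    [[1, 0, 0], [0, 1, 0], [1, 1, 0]],
    [{"lang": "en"}, {"lang": "de"}, {"lang": "en"}],
)
hits = index.search([[1, 0, 0]], k=2)
filtered = index.search([[1, 0, 0]], k=2, where=lambda m: m["lang"] == "de")
```

A flat scan like this is exact but its cost grows linearly with the number of stored vectors. An HNSW index instead navigates a layered proximity graph to find approximate neighbors and so visits far fewer vectors per query, trading some recall for speed.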
Update: Looking through your code, I found flaws within five minutes. Since it is obvious that you have not written it (looking at your past work, I doubt you even understand what it does), I will not even point the issues out.