GitHub Repository
387 stars · Go

Antfly: Distributed, Multimodal Search and Memory and Graphs in Go

by kingcauchy · Mar 17, 2026 · 107 points · 40 comments

AI Analysis

●● Solid · Big Brain · Ship It

Bundles vector, graph, and full-text search with local ML inference into one Go binary.

Strengths
  • Multi-Raft setup using etcd's library with Pebble storage shows serious distributed systems engineering.
  • Built-in Termite service enables local embeddings without external API calls or a separate inference service.
  • Hybrid search combines BM25, vectors, and graph traversal in a single query interface.
Weaknesses
  • Only 38 commits suggest early-stage stability risks compared to established tools like Weaviate.
  • Relies heavily on bundling existing libraries rather than contributing novel storage or consensus algorithms.
Category
Target Audience

Backend engineers building RAG pipelines or search features

Similar To

Weaviate · Elasticsearch · Qdrant

Post Description

Hey HN, I’m excited to share Antfly: a distributed document database and search engine written in Go that combines full-text, vector, and graph search. Use it for distributed multimodal search and memory, or for local dev and small deployments.
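To make "combines full-text and vector search" concrete: one common way to merge two rankings is reciprocal rank fusion (RRF). The post does not say how Antfly fuses results, so the sketch below illustrates the general technique only. The function name `fuseRRF`, the constant `k = 60`, and the document IDs are all assumptions made for the example.

```go
package main

import (
	"fmt"
	"sort"
)

// fuseRRF merges ranked lists of document IDs with reciprocal rank fusion:
// each document scores the sum of 1/(k + rank) over the lists it appears in.
// Hypothetical helper for illustration; not Antfly's API.
func fuseRRF(k float64, rankings ...[]string) []string {
	scores := make(map[string]float64)
	for _, ranking := range rankings {
		for i, id := range ranking {
			scores[id] += 1.0 / (k + float64(i+1)) // ranks are 1-based
		}
	}

	ids := make([]string, 0, len(scores))
	for id := range scores {
		ids = append(ids, id)
	}

	// Highest fused score first; ties are broken by ID so output is deterministic.
	sort.Slice(ids, func(a, b int) bool {
		if scores[ids[a]] != scores[ids[b]] {
			return scores[ids[a]] > scores[ids[b]]
		}
		return ids[a] < ids[b]
	})
	return ids
}

func main() {
	bm25 := []string{"doc-a", "doc-b", "doc-c"}   // hypothetical full-text ranking
	vector := []string{"doc-b", "doc-d", "doc-a"} // hypothetical vector ranking
	fmt.Println(fuseRRF(60, bm25, vector))
}
```

Documents that appear near the top of both lists rise to the top of the fused list, which is why RRF is a popular default when the underlying scores (BM25 vs. cosine similarity) are not directly comparable.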

I built this to give developers a single-binary deployment with native ML inference (via a built-in service called Termite), meaning you don't need external API calls for vector search unless you want to use them.

Some things that might interest this crowd:

Capabilities: Multimodal indexing (images, audio, video), MongoDB-style in-place updates, and streaming RAG.
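For readers unfamiliar with MongoDB-style in-place updates, here is a minimal sketch of the idea: an update document names operators that mutate individual fields rather than replacing the whole record. The post does not list which operators Antfly supports or how its API looks, so `applyUpdate` and the choice of `$set` and `$inc` are assumptions for illustration.

```go
package main

import "fmt"

// applyUpdate mutates doc in place using two MongoDB-style operators:
// "$set" overwrites a field and "$inc" adds a delta to a numeric field.
// Hypothetical helper; Antfly's actual operators and API are not shown in the post.
func applyUpdate(doc map[string]any, update map[string]map[string]any) error {
	for op, fields := range update {
		for name, val := range fields {
			switch op {
			case "$set":
				doc[name] = val
			case "$inc":
				delta, ok := val.(float64)
				if !ok {
					return fmt.Errorf("$inc on %q needs a float64 delta", name)
				}
				cur, _ := doc[name].(float64) // a missing field is treated as 0
				doc[name] = cur + delta
			default:
				return fmt.Errorf("unsupported operator %q", op)
			}
		}
	}
	return nil
}

func main() {
	doc := map[string]any{"title": "old", "views": 2.0}
	err := applyUpdate(doc, map[string]map[string]any{
		"$set": {"title": "new"},
		"$inc": {"views": 3.0},
	})
	fmt.Println(doc, err)
}
```

This sketch applies operators in map iteration order and does not roll back on error, so it is only safe when operators touch different fields; a real database would validate the whole update first and apply it atomically.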

Distributed Systems: Multi-Raft setup built on etcd's library, backed by Pebble (CockroachDB's storage engine). Metadata and data shards get their own Raft groups.
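The Multi-Raft layout described above can be pictured as a routing table: each data shard owns a key range and is replicated by its own Raft group, while a separate group replicates the table itself. The sketch below shows only that routing step and deliberately does not call etcd's Raft library. The types `shard` and `router`, the group IDs, and the range-based sharding scheme are assumptions, since the post does not say how Antfly partitions keys.

```go
package main

import (
	"fmt"
	"sort"
)

// shard owns keys from startKey up to (but not including) the next shard's
// startKey, and is replicated by its own Raft group. Hypothetical types.
type shard struct {
	startKey string
	groupID  uint64
}

// metadataGroupID is the Raft group that would replicate the shard table itself.
const metadataGroupID uint64 = 0

// router maps a document key to the Raft group of the owning shard.
// Invariant: shards is non-empty, sorted by startKey, and the first startKey is "".
type router struct {
	shards []shard
}

func (r router) groupFor(key string) uint64 {
	// Binary search for the first shard starting after key; the owner precedes it.
	i := sort.Search(len(r.shards), func(i int) bool {
		return r.shards[i].startKey > key
	})
	return r.shards[i-1].groupID
}

func main() {
	r := router{shards: []shard{{"", 1}, {"h", 2}, {"q", 3}}}
	for _, key := range []string{"apple", "kiwi", "zebra"} {
		fmt.Printf("%s -> raft group %d (shard table in group %d)\n",
			key, r.groupFor(key), metadataGroupID)
	}
}
```

Keeping one Raft group per shard means a write only needs consensus among that shard's replicas, so unrelated shards do not contend on a single log.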

Single Binary: antfly swarm gives you a single-process deployment with everything running. Good for local dev and small deployments. Scale out by adding nodes when you need to.

Ecosystem: Ships with a Kubernetes operator and an MCP server for LLM tool use.

Native ML inference: Antfly ships with Termite. Think of it as a built-in Ollama that also covers non-generative models (embeddings, reranking, chunking) alongside text generation. No external API calls are needed, but external providers are also supported (OpenAI, Ollama, Bedrock, Gemini, etc.).

License: I went with Elastic License v2, which is not an OSI-approved license. I know that's a topic with strong feelings here. The practical upshot: you can use it, modify it, self-host it, and build products on top of it; you just can't offer Antfly itself as a managed service. It felt like the right tradeoff for sustainability while still making the source available.

Happy to answer questions about the architecture, the Raft implementation, or anything else. Feedback welcome!

Similar Projects

AI/ML · ●● Solid

AgentKV – SQLite for AI agent memory (MMAP vector+graph DB)

Single-file mmap storage plus an HNSW vector index and explicit graph edges is an elegant, practical combo — think "SQLite for agent memory" with CRC-32 crash recovery and zero-server convenience. The C++20 core + nanobind gives zero-copy NumPy views and GIL-free searches, and the claimed FAISS-like throughput makes this genuinely interesting for local setups; main caveat is build/toolchain friction and how rich the surrounding ecosystem becomes.

Wizardry · Niche Gem
shiwang_khera
10 · 3mo ago
AI/ML · ●● Solid

Replaced Neo4j with pure vector search for Graph RAG

Graph RAG without Neo4j — pure vector search beats HippoRAG on multi-hop benchmarks.

Big Brain · Dark Horse
zhangchen
20 · 2mo ago
AI/ML · ●● Solid

Genesys – Causal graph memory for AI agents, not just vectors

ACT-R scoring and active forgetting beat standard vector similarity for agent context.

Big Brain · Niche Gem
rmm2496
10 · 1mo ago