Antfly: Distributed, Multimodal Search and Memory and Graphs in Go

Author: kingcauchy

by kingcauchy · Mar 17, 2026 · 107 points · 40 comments

AI Analysis

●● Solid · Big Brain · Ship It

Bundles vector, graph, and full-text search with local ML inference into one Go binary.

Strengths

•Multi-Raft setup using etcd's library with Pebble storage shows serious distributed systems engineering.
•Built-in Termite service enables local embeddings without external API calls or a separate inference service.
•Hybrid search combines BM25, vectors, and graph traversal in a single query interface.
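
To make the hybrid-search point concrete, here is a minimal sketch of one common way to merge ranked results from several retrievers: reciprocal rank fusion (RRF). This is an assumption for illustration only; the post does not say how Antfly combines BM25, vector, and graph results, and the function name `fuseRRF`, the constant `k = 60`, and the document IDs are all hypothetical.

```go
package main

import (
	"fmt"
	"sort"
)

// fuseRRF merges several ranked lists of document IDs with reciprocal rank
// fusion: a document scores the sum of 1/(k + rank) over every list it
// appears in. Illustrative only; not Antfly's confirmed ranking method.
func fuseRRF(k float64, rankings ...[]string) []string {
	scores := make(map[string]float64)
	for _, ranking := range rankings {
		for i, id := range ranking {
			scores[id] += 1.0 / (k + float64(i+1)) // ranks are 1-based
		}
	}

	ids := make([]string, 0, len(scores))
	for id := range scores {
		ids = append(ids, id)
	}

	// Sort by descending fused score; break ties by ID for determinism.
	sort.Slice(ids, func(a, b int) bool {
		if scores[ids[a]] != scores[ids[b]] {
			return scores[ids[a]] > scores[ids[b]]
		}
		return ids[a] < ids[b]
	})
	return ids
}

func main() {
	bm25 := []string{"doc1", "doc2", "doc3"}   // hypothetical full-text ranking
	vector := []string{"doc2", "doc4", "doc1"} // hypothetical vector ranking
	graph := []string{"doc2", "doc3"}          // hypothetical graph-traversal ranking

	fmt.Println(fuseRRF(60, bm25, vector, graph))
}
```

With these inputs, `doc2` ranks first because it appears near the top of all three lists. Rank-based fusion is a convenient choice here because BM25 scores and vector similarities are not on comparable scales.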

Weaknesses

•Only 38 commits suggest early-stage stability risks compared with established tools like Weaviate.
•Relies heavily on bundling existing libraries (etcd's Raft, Pebble) rather than novel storage or consensus algorithms.

Post Description

Hey HN, I’m excited to share Antfly: a distributed document database and search engine written in Go that combines full-text, vector, and graph search. Use it for distributed multimodal search and memory, or for local dev and small deployments.

I built this to give developers a single-binary deployment with native ML inference (via a built-in service called Termite), meaning you don't need external API calls for vector search unless you want to use them.

Some things that might interest this crowd:

Capabilities: Multimodal indexing (images, audio, video), MongoDB-style in-place updates, and streaming RAG.

Distributed Systems: Multi-Raft setup built on etcd's library, backed by Pebble (CockroachDB's storage engine). Metadata and data shards get their own Raft groups.
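
As a rough illustration of the "one Raft group per shard" idea, the sketch below routes a document key to the Raft group that owns its key range, with a separate group reserved for metadata. Everything here is hypothetical: the `shard` and `router` types, range-based sharding, and the group IDs are assumptions made for the example, not Antfly's actual design or the etcd Raft API.

```go
package main

import (
	"fmt"
	"sort"
)

// shard owns keys from startKey up to (but not including) the next shard's
// startKey, and is replicated by its own Raft group. Hypothetical type.
type shard struct {
	startKey string
	groupID  uint64
}

// router maps keys to Raft groups. The metadata group is kept separate
// from the data shard groups, mirroring the description in the post.
type router struct {
	metadataGroup uint64
	shards        []shard // sorted by startKey; the first starts at ""
}

// newRouter sorts the shards and checks that the key space is fully covered.
func newRouter(metadataGroup uint64, shards []shard) (*router, error) {
	sorted := append([]shard(nil), shards...)
	sort.Slice(sorted, func(a, b int) bool { return sorted[a].startKey < sorted[b].startKey })
	if len(sorted) == 0 || sorted[0].startKey != "" {
		return nil, fmt.Errorf("first shard must start at the empty key")
	}
	return &router{metadataGroup: metadataGroup, shards: sorted}, nil
}

// groupFor returns the Raft group responsible for a document key.
func (r *router) groupFor(key string) uint64 {
	// Find the first shard that starts after key; the owner precedes it.
	i := sort.Search(len(r.shards), func(i int) bool { return r.shards[i].startKey > key })
	return r.shards[i-1].groupID
}

func main() {
	r, err := newRouter(1, []shard{
		{startKey: "", groupID: 2},
		{startKey: "m", groupID: 3},
	})
	if err != nil {
		panic(err)
	}
	fmt.Println("metadata group:", r.metadataGroup)
	fmt.Println("apple ->", r.groupFor("apple"))
	fmt.Println("zebra ->", r.groupFor("zebra"))
}
```

Splitting consensus this way means a write to one shard only involves that shard's replicas, while shard-map changes go through the metadata group.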

Single Binary: antfly swarm gives you a single-process deployment with everything running. Good for local dev and small deployments. Scale out by adding nodes when you need to.

Ecosystem: Ships with a Kubernetes operator and an MCP server for LLM tool use.

Native ML inference: Antfly ships with Termite. Think of it like a built-in Ollama for non-generative models too (embeddings, reranking, chunking, text generation). No external API calls are needed, but external providers are also supported (OpenAI, Ollama, Bedrock, Gemini, etc.).

License: I went with Elastic License v2, not an OSI-approved license. I know that's a topic with strong feelings here. The practical upshot: you can use it, modify it, self-host it, and build products on top of it; you just can't offer Antfly itself as a managed service. Felt like the right tradeoff for sustainability while still making the source available.

Happy to answer questions about the architecture, the Raft implementation, or anything else. Feedback welcome!

Similar Projects

Infrastructure · ●● Solid

HelixDB – A Graph Database built on Object-storage

Graph-vector-FTS in one database, but Weaviate and Neo4j already offer hybrid search.

Big Brain · Ship It

GeorgeCurtis

152422d ago

AI/ML · ●● Solid

AgentKV – SQLite for AI agent memory (MMAP vector+graph DB)

Single-file mmap storage plus an HNSW vector index and explicit graph edges is an elegant, practical combo: think "SQLite for agent memory" with CRC-32 crash recovery and zero-server convenience. The C++20 core with nanobind gives zero-copy NumPy views and GIL-free searches, and the claimed FAISS-like throughput makes this genuinely interesting for local setups. The main caveats are build/toolchain friction and how rich the surrounding ecosystem becomes.

Wizardry · Niche Gem

shiwang_khera

103mo ago

Infrastructure · ●●● Banger