Back to browse
NERDs – Entity-centered long-term memory for LLM agents

NERDs – Entity-centered long-term memory for LLM agents

by tdaltonc·Mar 6, 2026·13 points·5 comments

AI Analysis

●●●BangerBig BrainWizardryZero to One

Entity-centric memory cuts context 90% while matching full-text performance on NovelQA.

Strengths
  • Grounded evaluation on real benchmark (86 novels, 200K+ tokens avg) with measurable token savings (90% reduction) that scale independent of doc length
  • Biologically-inspired architecture (mirrors how human memory organizes around entities/relationships) is conceptually sound and novel in RAG context
  • Public demo across 86 novels with explorable entity pages (Elizabeth Bennet, Winterfell) proves the approach works end-to-end, not just theory
Weaknesses
  • Evaluation limited to narrative QA; unclear how it generalizes to code, logs, or transactional documents where relationships matter less
  • No mention of entity disambiguation (which 'James' in which context?) or hallucinated relationship creation
Category
Target Audience

Researchers, RAG builders, teams processing long documents for QA systems

Similar To

Retrieval-Augmented Generation (RAG) · Graph-based knowledge indexing · LlamaIndex entity extractors

Post Description

Long-running agents struggle to attend to relevant information as context grows, and eventually hit the wall when the context window fills up.

NERDs (Networked Entity Representation Documents) are Wikipedia-style entity pages that LLM agents build for themselves by reading a large corpus chunk-by-chunk. Instead of reprocessing the full text at query time, a downstream agent searches and reasons over these entity documents.

The idea comes from a pattern that keeps showing up: brains, human cognition, knowledge bases, and transformer internals all organize complex information around entities and their relationships. NERDs apply that principle as a preprocessing step for long-context understanding.

We tested on NovelQA (86 novels, avg 200K+ tokens). On entity-tracking questions (characters, relationships, plot, settings) NERDs match full-context performance while using ~90% fewer tokens per question, and token usage stays flat regardless of document length. To highlight the methods limitation, we also tested it on counting tasks and locating specific passages (which aren't entity-centered) where it did not preform as well.

nerdviewer.com lets you browse all the entity docs we generated across the 86 novels. Click through them like a fan-wiki. It's a good way to build intuition for what the agent produces.

Paper: https://www.techrxiv.org/users/1021468/articles/1381483-thin...

Similar Projects

Developer Tools●●●Banger

Adam – An embeddable cross-platform AI agent library

Agent framework in a single C header file that actually runs local models offline.

WizardryBig BrainNiche Gem
marcobambini
2491mo ago