Union-find for chatbot memory instead of flat compaction

Name: Union-find for chatbot memory instead of flat compaction
Availability: InStock
Author: kimjune01

by kimjune01·Mar 15, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●●●BangerBig BrainZero to OneWizardry

15-18pp fact recall boost over flat summarization with full provenance tracing.

Strengths

•Union-find provides O(1) amortized lookup while maintaining message provenance.
•Per-cluster summaries avoid headline facts crowding out footnote details.
•Seven empirical trials with statistical significance (p=0.039) back the claims.

Weaknesses

•Research prototype—needs production implementation before real adoption.
•TF-IDF centroid comparison may not scale to very large conversation histories.

Post Description

Every chatbot handles context overflow the same way — summarize everything into one block and throw away the sources. I replaced that with a union-find forest: messages merge into clusters, each cluster has its own summary, and you can trace any summary back to the messages that produced it.

Ran seven trials against flat summarization. UF led by 15-18pp on fact recall in every trial. One hit significance (p=0.039), the rest are directional. The interesting finding: flat summaries drop "footnote" facts (cron schedules, webhook paths) because they compete against headline facts for space. Per-cluster summaries don't have that pressure.

Code and trial logs: https://github.com/kimjune01/union-find-compaction