Back to browse
We built a learning loop that learns from its own failures

We built a learning loop that learns from its own failures

by DarenWatson·Jun 29, 2026·3 points·0 comments

AI Analysis

●●SolidBig BrainBold Bet

Frozen models with persistent memory stores rediscover rules from failures without retraining.

Strengths
  • Three-store memory architecture (Experience, Lessons, Knowledge) is genuinely novel.
  • Held-out study shows agent recovered hidden rules from trial and error alone.
  • Addresses real gap: most AI systems don't retain learnings between sessions.
Weaknesses
  • Research report without a usable product or demo to test yourself.
  • Claims need independent verification beyond author's own benchmarks.
Category
Target Audience

AI researchers and ML engineers

Similar To

MemoGPT · LangChain Memory · AutoGen Agent Memory

Post Description

We recently published our research on the learning loop behind the Dropstone agent. Instead of throwing more compute at a problem or trying to dynamically update weights, we tested what happens when you give a frozen open-weight model persistent memory of its own mistakes.

We found that by keeping three stores the Experience, Lessons, and Knowledge the agent successfully builds up scar tissue. In a held-out study, it was able to independently rediscover hidden mathematical laws and coding rules purely from trial, error, and data.

Similar Projects

Developer Tools●●Solid

Mimir – Shared memory and inter-agent messaging for Claude Code swarms

Mimir hooks into Claude Code lifecycle events so agents can 'mark' facts (e.g., "API uses snake_case") into a DuckDB-backed memory and RAG pipeline, then auto-injects that context as additionalContext for later agents. It's a pragmatic, well-scoped solution to the annoying problem of agent amnesia — very useful if you run agent swarms, but its impact is limited by Claude Code adoption and the need for the surrounding infra (BGE keys, hooks).

Niche GemShip It
deejaydev
214mo ago
AI/ML●●Solid

A self upgrading agent that learns from failure

Agent writes its own Python tools and saves rules to avoid repeating mistakes.

WizardryBold Bet
grimm76321
101mo ago
AI/MLMid

Soma, a local-first AI OS with 178 cognitive modules and P2P learning

178 modules and P2P learning sound impressive; unclear if implemented or aspirational.

Bold BetRabbit Hole
Undeca
204mo ago