Luna Agent – Custom AI agent in ~2300 lines of Python, no frameworks
Custom agent framework in 2300 lines beats 400K-line bloatware; auditable and runs fully local.
Yet Another Llama 3.2 implementation (in pure numpy)
Pure NumPy Llama 3.2 inference in 100 lines for hacking KV cache compression.
ML researchers, students, hobbyists
llama2.c · minGPT · tinygrad
I would be interested in feedback from people who have written transformer implementations before. Are there any implementation "tricks" that I'm missing (e.g., a cleaner KV cache for PyTorch/JAX, or RoPE tricks)?
NumPy-backed ECS with archetype pooling beats OOP inheritance for Python game loops.
GPT-2 inference in pure C# allocating zero bytes per token beats ONNX Runtime.
Self-bootstrapping agent writes its own improvements in 100 lines of TypeScript.
Beautiful physics visualization, but Chladni simulations exist in MATLAB, Mathematica, and other interactive platforms.
30x faster cold start than vLLM with zero PyTorch dependencies.