Back to browse
GitHub Repository

Ollama for classical ML models. AOT compiler that turns XGBoost, LightGBM, scikit-learn, CatBoost & ONNX models into native C99 inference code. One command to load, one command to serve. 336x faster than Python inference.

683 starsPython

Timber – Ollama for classical ML models, 336x faster than Python

by kossisoroyce·Mar 2, 2026·207 points·33 comments

AI Analysis

●●●BangerWizardrySolve My Problem

336× faster tree model inference; compiles sklearn/XGBoost to C99, serves like Ollama.

Strengths
  • 338x latency improvement over Python with microsecond-scale native calls and no runtime overhead
  • AOT compilation to portable C99 removes Python dependency entirely, enabling edge/IoT/regulated deployments
  • Rigorous benchmark methodology with reproducible scripts; transparent comparison table vs ONNX Runtime/Treelite/lleaves
Weaknesses
  • Only accelerates tree-based models; deep learning and non-tree ensembles need alternative solutions
  • Very early (0 stars/forks); adoption and long-term maintenance unproven
Target Audience

MLOps engineers, fraud/risk teams, edge computing teams, regulated industries

Similar To

Treelite · ONNX Runtime · Seldon Core

Similar Projects

Developer Tools●●Solid

Nimic – write pure Python and compile AOT to native binaries via Nim

Python-to-Nim transpiler with ctypes-backed types when Cython and Numba already exist.

Big BrainNiche Gem
dima-quant
5013d ago