HF viewer – visualize any Hugging Face model
Paste any HF URL to instantly see the full transformer architecture graph.
HF CLI extension to run a local coding agent powered by llmfit and llama.cpp
One command replaces manual GGUF hunting and hardware compatibility guesswork.
Developers running local LLMs, CLI tool users
Ollama · LM Studio · Claude Code
It uses llmfit under the hood to profile your hardware and automatically select the best-fit model and quantization, so there is no manual GGUF hunting. It then launches a Pi Agent on top of the selected model. One command, local, fully open.
If you've been using Claude Code or Codex CLI and want something that runs entirely on your own hardware/models, this is a nice lightweight alternative to try.
Happy to answer questions — and curious what hardware setups people are running this on.
Yet another model runner when Ollama already dominates this space.
The project nails a real pain: instead of guessing whether a 7B or 13B model will fit, llmfit inspects your system and ranks 94 models by fit, speed, context and quality. It also recommends quantization and run modes and supports multi-GPU and MoE setups. The combination of an installable binary, an interactive TUI for quick browsing and JSON output for automation makes it immediately useful. Just remember that its suggestions are heuristics, so you'll still want to validate edge cases with a real run.
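To make the "will it fit" question concrete, here is a rough sketch of the kind of heuristic involved. It is not llmfit's actual algorithm: the bits-per-weight figures, the flat overhead allowance, and the names `Candidate`, `estimated_gb` and `rank_by_fit` are all assumptions made up for illustration.

```python
from dataclasses import dataclass

# Approximate bits per weight for a few quantization levels.
# These are illustrative assumptions, not values taken from llmfit.
BITS_PER_WEIGHT = {"Q4": 4.5, "Q8": 8.5, "F16": 16.0}


@dataclass
class Candidate:
    name: str
    params_billion: float  # parameter count in billions
    quant: str             # key into BITS_PER_WEIGHT


def estimated_gb(c: Candidate, overhead_gb: float = 1.5) -> float:
    """Weight size in GB plus a flat allowance for KV cache and buffers."""
    weight_gb = c.params_billion * BITS_PER_WEIGHT[c.quant] / 8
    return weight_gb + overhead_gb


def rank_by_fit(candidates, available_gb):
    """Keep candidates that fit, preferring more parameters, then higher precision."""
    fitting = [c for c in candidates if estimated_gb(c) <= available_gb]
    return sorted(
        fitting,
        key=lambda c: (c.params_billion, BITS_PER_WEIGHT[c.quant]),
        reverse=True,
    )


candidates = [
    Candidate("7B-Q4", 7, "Q4"),
    Candidate("7B-Q8", 7, "Q8"),
    Candidate("13B-Q4", 13, "Q4"),
    Candidate("13B-F16", 13, "F16"),
]

for c in rank_by_fit(candidates, available_gb=12):
    print(f"{c.name}: ~{estimated_gb(c):.1f} GB")
```

A real tool would also account for context length, memory bandwidth and GPU offload, which is why a recommendation like this is only a starting point for a real run.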
Galaxy classification model, but model card has mostly empty fields.
TPU training wrapper built on torchprime; solves a real problem but torchprime already exists.
Client-side graph expansion with IDF weighting needs zero backend infrastructure.