Back to browse
GitHub Repository

HF CLI extension to run local coding agent powered by llmfit and llama.cpp

416 starsShell

HF-agents, CLI extension to find the best model/quant for your hardware

by clmnt·Mar 18, 2026·2 points·0 comments

AI Analysis

●●SolidSolve My ProblemSlick

One command replaces manual GGUF hunting and hardware compatibility guesswork.

Strengths
  • Hardware profiling via llmfit eliminates trial-and-error model quantization selection.
  • Reuses existing llama-server instance instead of spawning redundant processes.
  • Official HF CLI extension means proper integration with model hub authentication.
Weaknesses
  • Orchestrates existing tools (llmfit, llama.cpp, Pi) rather than novel architecture.
  • Limited to Pi agent — no support for other local coding agent frameworks.
Category
Target Audience

Developers running local LLMs, CLI tool users

Similar To

Ollama · LM Studio · Claude Code

Post Description

We've been building out CLI extensions for the Hugging Face hub, and hf-agents is a fun one to share.

It uses llmfit under the hood to profile your hardware and automatically select the best-fit model and quantization — no manual GGUF hunting. It then launches a Pi Agent on top of it. One command, local, fully open.

If you've been using Claude Code or Codex CLI and want something that runs entirely on your own hardware/models, this is a nice lightweight alternative to try.

Happy to answer questions — and curious what hardware setups people are running this on.

Similar Projects

AI/ML●●Solid

Llmfit;94 models, 30 providers.1 tool to see what runs on your hardware

The project nails a real pain: instead of guessing whether a 7B or 13B model will fit, llmfit inspects your system and ranks 94 models by fit, speed, context and quality, even recommending quantization and run modes and supporting multi‑GPU and MoE setups. The combo of an installable binary, interactive TUI for quick browsing and JSON output for automation makes it immediately useful; just remember its suggestions are heuristics — you’ll still want to validate edge cases with a real run.

Solve My ProblemWizardry
axjns
103mo ago