GitHub Repository

Stop configuring your AI stack. Start using it. One command brings a complete pre-wired LLM stack with hundreds of services to explore.

3,077 stars · Python

A single CLI to manage llama.cpp/vLLM/Ollama models

by everlier·Mar 13, 2026·2 points·1 comment

AI Analysis

●●● Banger · Solve My Problem · Slick

Finally one CLI for Ollama, llama.cpp, and vLLM instead of three separate tools.

Strengths
  • Unified model management across three major LLM backends with identical command syntax.
  • JSON output with jq support enables scripting and automation workflows.
  • 127 releases and 3,000+ stars show real adoption and active maintenance.
Weaknesses
  • Doesn't add features beyond what individual backend CLIs already provide.
  • Model management space is getting crowded as local LLM tooling matures.
Target Audience

ML engineers, developers running local LLMs, AI infrastructure teams

Similar To

Ollama CLI · llama.cpp · Hugging Face CLI

Post Description

Harbor just shipped its 127th release (the next one is special).

One notable new feature is the ability to manage all my LLMs with a single CLI.

# list all models
harbor ls
# pairs well with jq
harbor ls --json

# Ollama
harbor pull qwen3.5:35b
# llama.cpp
harbor pull unsloth/Qwen3.5-35B-A3B-GGUF:Q8_0
# vllm (HuggingFace Hub cache)
harbor pull Qwen/Qwen3.5-35B-A3B

# Remove any of the models by the same id
# used to pull it
harbor rm <id>
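If jq isn't a good fit, the JSON listing can also be consumed from a script. Below is a minimal Python sketch, not part of Harbor itself: `parse_models` and `list_models` are hypothetical helper names, and the sketch assumes `harbor ls --json` prints a single JSON array with one entry per model. The fields inside each entry aren't described in this post, so the entries are returned untouched.

```python
import json
import shutil
import subprocess


def parse_models(raw: str) -> list:
    """Parse the text printed by `harbor ls --json`.

    Assumption: the top-level JSON value is an array with one entry
    per model. The shape of each entry is not documented here, so
    entries are returned as-is.
    """
    data = json.loads(raw)
    if not isinstance(data, list):
        raise ValueError("expected a JSON array of models")
    return data


def list_models() -> list:
    """Run `harbor ls --json` and return the parsed entries."""
    # Fail early with a clear message if the CLI is not installed.
    if shutil.which("harbor") is None:
        raise RuntimeError("harbor CLI not found on PATH")
    result = subprocess.run(
        ["harbor", "ls", "--json"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return parse_models(result.stdout)
```

On a machine with Harbor installed, `len(list_models())` would give the number of listed models under the assumption above. Keeping the parsing in its own function means it can be checked against saved output without invoking the CLI.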

Hopefully it'll be useful for someone too.
