Back to browse
GitHub Repository

🚀 LLM inference Engine in Swift/Metal, Load GGUF and safe tensors modes, no conversion, no cpp, pure swift

38 starsSwift

EdgeRunner – run GGUF models with Swift and Metal

by karc14·Jul 5, 2026·2 points·0 comments

Similar Projects

Host any GGUF model in one command

Ollama and llama.cpp server already do this with more maturity and model support.

Ship It
gauravvij137
303mo ago
AI/ML●●●Banger

SwiftLM – Qwen Chat on iPhone, 100B+ Moe on M5 Pro 64GB (Native Swift)

Native Swift inference with SSD streaming runs 100B MoE models without kernel panics.

WizardryNiche Gem
aegis_camera
123mo ago
AI/ML●●Solid

WayInfer – Native GGUF engine that runs models larger than your RAM

Custom GGUF parser with mmap beats llama.cpp load times, but zero stars means unproven claims.

WizardryBold Bet
ahmedm24
103mo ago