GitHub Repository

🚀 LLM inference Engine in Swift/Metal, Load GGUF and safe tensors modes, no conversion, no cpp, pure swift

38 starsSwift

EdgeRunner – run GGUF models with Swift and Metal

Name: EdgeRunner – run GGUF models with Swift and Metal
Availability: InStock
Author: karc14

by karc14·Jul 5, 2026·2 points·0 comments

Similar Projects

Useful tutorial, but llama.cpp docs and Ollama already cover most of this.

Niche Gem

anju-kushwaha

1342mo ago

Ollama and llama.cpp server already do this with more maturity and model support.

Ship It

gauravvij137

303mo ago

PyO3 for Swift with compile-time GIL enforcement and direct CoreML access.

Zero to OneWizardryNiche Gem

sheepscreek

222mo ago

AI/ML●●Solid

Finally answers the GGUF quant question everyone asks in Discord.

Solve My ProblemNiche Gem

ermantrout

20022d ago

AI/ML●●●Banger

Native Swift inference with SSD streaming runs 100B MoE models without kernel panics.

WizardryNiche Gem

aegis_camera

123mo ago

AI/ML●●Solid

Custom GGUF parser with mmap beats llama.cpp load times, but zero stars means unproven claims.

WizardryBold Bet

ahmedm24

103mo ago