Digest AI vs HN About

GitHub Repository

⚡ Native MLX Swift LLM inference server for Apple Silicon. OpenAI-compatible API, SSD streaming for 100B+ MoE models, TurboQuant KV cache compression, MACOS + iOS iPhone app.

688 starsSwift

SwiftLM – Qwen Chat on iPhone, 100B+ Moe on M5 Pro 64GB (Native Swift)

by aegis_camera·Apr 1, 2026·1 point·2 comments

Visit Project View on HN

AI Analysis

●●●BangerWizardryNiche Gem

Native Swift inference with SSD streaming runs 100B MoE models without kernel panics.

Strengths

•SSD streaming swaps MoE layers directly from NVMe to GPU without trashing Unified Memory.
•Hybrid TurboQuant achieves V3 quality at V2 speeds using custom Metal shaders.
•Zero Python dependencies means no GIL overhead and single binary deployment.

Weaknesses

•SSD streaming marked experimental; stability unproven compared to mature llama.cpp.
•Apple Silicon lock-in excludes Windows and Linux users entirely.

Category

Target Audience

Apple Silicon developers, local LLM enthusiasts, iOS engineers

Similar To

Ollama · llama.cpp · LM Studio

Similar Projects

AI/ML●●Solid

Running Gemma 4 on an iPhone 13 Pro

Clean Swift wrapper for Gemma 4 with vision and audio on iPhone.

Niche GemShip It

dengjiuhong

101mo ago

AI/ML●●●Banger

NVFP4 on Desktop Blackwell – 122B MoE on a Single RTX PRO 6000 31 tok/s

Bypasses NVIDIA's artificial FP4 lock—122B MoE on single desktop GPU at 31 tok/s.

WizardryDark Horse

jcartu

203mo ago

AI/ML●●Solid

I ran Qwen3.5 35B on my iPhone at 5.6 tok/SEC

Runs 19.5GB Qwen3.5 on 12GB RAM iPhone via memory swapping.

WizardryBold Bet

alexintosh

422mo ago

AI/ML●●Solid

Best setup local LLM found for a 5090 (llama.cpp fork + turboquant)

450k context on 32GB VRAM using turboquant KV cache compression.

Big BrainNiche Gem

utopman

226d ago

Productivity●Mid

Font Wizard Pro – a font manager for iPhone and iPad

Yet another font manager when Font Book and iOS already handle this.

Cozy

stalinkay

2012d ago

Developer Tools●●●Banger

CodeLayers – See your codebase's dependency layers in 3D

BFS-layered 3D codebase viz solves force-directed chaos; works in browser for instant PR review.

Eye CandyWizardrySolve My Problem

lnguyen11288

503mo ago