SwiftLM – Qwen Chat on iPhone, 100B+ MoE on M5 Pro 64GB (Native Swift)
Native Swift inference with SSD streaming runs 100B MoE models without kernel panics.
MLX-compatible REAP for pruning MoE models on Apple Silicon
MoE pruning on a MacBook without the CUDA or PyTorch dependency stack.
ML researchers and engineers working with MoE models on Mac
Cerebras REAP · MLX-LM
Specialized routing logic for MoE models, but with no demo or benchmarks.
20x faster MoE inference on existing hardware with hash-verified output correctness.
Full MLX power in Ruby: lazy arrays, Metal GPU, transformer layers, though Ruby adoption is a risk.
Blog post reviewing a laptop that doesn't exist; Show HN is for projects, not articles.
Maps the MacBook lid angle sensor to synthesized creaks using AVAudioEngine.