Mlx-serve – LLM inference server for Apple Silicon, written in Zig

Name: Mlx-serve – LLM inference server for Apple Silicon, written in Zig
Availability: InStock
Author: ddalcu

by ddalcu·Jul 3, 2026·2 points·0 comments

Similar Projects

AI/ML●●Solid

Standardized MLX benchmarking when everyone's currently comparing engines manually.

Niche GemBig Brain

igurss

208d ago

AI/ML●●●Banger

LiteRT beats MLX on Gemma memory while CoreML sips power on the Neural Engine.

Dark HorseBig BrainSolve My Problem

mlboy

1029d ago

AI/ML●●●Banger

Custom Metal shaders beat llama.cpp and MLX—1.67x faster on M4 Max.

WizardrySlickZero to One

sanchitmonga22

2401533mo ago

MLX-powered local TTS plugin for OpenClaw—elegant but audience is Apple Silicon only.

Niche GemCozy

ZacharyZZ

204mo ago

GPU working set estimation catches memory overcommit before your 7B model swaps to SSD.

Solve My ProblemBig BrainNiche Gem

cjarchivist

212mo ago

AI/ML●●Solid

Full MLX power in Ruby: lazy arrays, Metal GPU, transformer layers—but Ruby adoption risk.

WizardryDark HorseNiche Gem

skryl

114mo ago