Back to browse
Multi-agent autoresearch for ANE inference beats Apple's CoreML by 6×

Multi-agent autoresearch for ANE inference beats Apple's CoreML by 6×

by christinetyip·Mar 31, 2026·6 points·0 comments

AI Analysis

●●●BangerWizardryBig BrainZero to One

Cross-chip agent knowledge sharing beats CoreML by 6× on Apple Silicon.

Strengths
  • Agents share optimization strategies across different M-series chips in real time
  • Measurable 6.31× latency improvement on DistilBERT vs Apple's baseline
  • Generalizable approach beyond single model — continuous search over optimization space
Weaknesses
  • Only DistilBERT demonstrated so far, unclear how it scales to larger models
  • Apple Silicon only, limits adoption to Mac ecosystem
Category
Target Audience

ML engineers, Apple Silicon developers, performance optimization teams

Similar To

AutoML · Ray Tune · Optuna

Post Description

We ran an experiment over the weekend to explore whether multiple autonomous agents could collaboratively optimize inference on Apple’s Neural Engine (ANE).

Each agent ran locally on a different Mac (M1–M4), repeatedly modifying how a DistilBERT model is executed on the ANE, benchmarking latency, and sharing results and insights with other agents in real time.

Instead of exploring independently, agents could:

- see what others had tried - reuse working strategies - avoid known failure modes

Across all tested chips, the agents ended up outperforming Apple’s CoreML baseline, with up to 6.31× lower median inference latency on the same hardware.

An interesting pattern we observed: an agent stuck at ~2.1ms latency on M4 was able to break through after incorporating strategies discovered by agents on different chips (M2, M4 Max), eventually reaching ~1.5ms and surpassing CoreML.

Full write-up: https://x.com/christinetyip/status/2039040161439224157

Detailed results: https://ensue-network.ai/lab/ane?view=strategies https://ensue-network.ai/lab/ane

Curious what other optimization problems this kind of setup could be applied to, especially in systems, compilers, or ML infra. Would be interested in exploring similar experiments.

Similar Projects

AI/ML●●●Banger

Orion – Native Training LLMs on the Apple Neural Engine Without CoreML

Direct ANE access bypasses CoreML to enable training—genuinely novel Apple Silicon unlock.

WizardryZero to OneBig Brain
mechramc
213mo ago