AI/ML Projects

AI and machine learning projects from Show HN — LLM tools, agent frameworks, computer vision, NLP, and more.

Sort:

4064 projects

AI/ML

●●●●Gem

How I Topped the HuggingFace Open LLM Leaderboard on Two Gaming GPUs

Duplicating transformer layers boosts benchmark scores without a single step of training.

WizardryBig BrainRabbit Hole

dnhkng

4951264mo ago

AI/ML

●●●●Gem

Steerling-8B, a language model that can explain any token it generates

First LLM with per-token interpretability tracing input, concepts, and training provenance.

WizardryZero to OneBig Brain

adebayoj

328915mo ago

GitHub

Single-layer transformer in HyperTalk for the classic Macintosh

106Python

AI/ML

●●●●Gem

MacMind – A transformer neural network in HyperCard on a 1989 Macintosh

Full transformer with backpropagation running in HyperCard on a 1989 Mac — 1,216 parameters, all inspectable.

WizardryZero to OneBig Brain

hammer32

159423mo ago

AI/ML

●●●●Gem

Autoresearch_at_home – SETI_at_home but for LLM training

SETI@home for LLMs where agents coordinate hyperparameter searches across volunteer GPUs.

Bold BetZero to One

austinbaggio

79194mo ago

GitHub

Running a large language model on a PlayStation 2

44C

AI/ML

●●●●Gem

Can I run a model language on a 26-year-old console?

Streams LLM weights from CD-ROM during inference to fit 77MB models in 32MB RAM.

WizardryZero to OneBig Brain

xaskasdf

46124mo ago

AI/ML

●●●●Gem

New Benchmark from SWE-bench team is 0% solved

Agents fail completely at rebuilding binaries from scratch without source code.

Big BrainBold BetZero to One

lieret

2432mo ago

GitHub

Software side of an LLM running inside a DRAM chip via charge-sharing PIM (BitNet b1.58 on DRAM-Bender silicon).

14C++

AI/ML

●●●●Gem

Running PrismML's Bonsai inside DRAM by breaking DDR4 timing rules

Running matrix multiplies inside DRAM cells via charge-sharing defies standard memory architecture.

WizardryBig BrainZero to One

pcdeni

2265d ago

AI/ML

●●●●Gem

PhAIL – Real-robot benchmark for AI models. The gap to humans is 20x

Real-robot production benchmarks proving AI is still 20x slower than humans.

Zero to OneBig BrainNiche Gem

vertix

2183mo ago

GitHub

Runtime safety net for LLM agents. Detects token spirals, kills doomed tasks early, tells you exactly why. Rust core, Python SDK. pip install state-harness

16Python

AI/ML

●●●●Gem

I applied Lyapunov stability theory to detect when LLM agents spiral

Lyapunov stability theory catches token spirals before your budget explodes.

Big BrainZero to OneSolve My Problem

visha1v

1121mo ago

AI/ML

●●●●Gem

Verified Deep Learning with Lean 4

Formally verifies ResNet and ViT architectures using Lean 4 proofs.

WizardryBig BrainNiche Gem

asparagui

603mo ago

AI/ML

●●●●Gem

A 6M-token movable window on a single 46GB GPU

6M-token context window on one GPU when vLLM caps at 30K tokens.

WizardryZero to OneBig Brain

Wetime

61214h ago

AI/ML

●●●●Gem

Illustrative – AI pipeline that turns books into graphic novels

Seven-pass enrichment pipeline solves character consistency across 100+ generated pages.

WizardryZero to OneRabbit Hole

adangit

504mo ago

AI/ML

●●●●Gem

A living Vancouver. Connor is walking dogs at the SPCA this morning

Census-grounded synthetic people living in real-time—why didn't this exist before?

Zero to OneRabbit HoleBig Brain

auran

513mo ago

AI/ML

●●●●Gem

Costanza – an autonomous AI agent that can't be turned off

Intel TDX attestation proves the agent runs unmodified inside a secure enclave.

Zero to OneBold BetWizardry

aruss

532mo ago

AI/ML

●●●●Gem

Autonomous Prover Running > 1hr

Public live feed of an autonomous Lean 4 proof attempt on Ramsey numbers.

WizardryRabbit HoleBold Bet

bneb-dev

404mo ago

GitHub

Run GLM-4.5-Air (110B) on a 16GB-RAM consumer machine - identify the best memory allocation to overcome standard hardware limitations in Local LLM applications. Placement beats budget. Falsification-Tested laws, probes and recipes for LLMs on commodity hardware

31Python

AI/ML

●●●●Gem

Run GLM-4.5-Air(110B)on a 16GBRAM consumer machine

Runs 110B models on 16GB RAM by proving placement beats budget with measured laws.

WizardryBig Brain

federicoTXTS

404d ago

GitHub

A conv layer that modulates its output using its own kernel weights as a spatial mask

7Python

AI/ML

●●●●Gem

ReflexConv2d – 57% less blur in image reconstruction

Novel conv layer cuts blur 57% with weight-derived spatial masks.

WizardryBig BrainZero to One

singam96

301mo ago

AI/ML

●●●●Gem

KV-Cache Grafting – Boosting frozen 12B LLMs to 93.3% AIME accuracy

Frozen 12B model hits 93.3% AIME by grafting verified KV states, not retraining.

WizardryBig BrainZero to One

Corbenic

3011d ago

GitHub

⚡ Real-time AI. Cross-verified. Always current. Costs NOTHING.

2Python

AI/ML

●●●●Gem

Kairos, real-time AI who cross-verifies (Python, 100KB)

Cross-verifies across multiple sources before the LLM sees context — stops hallucinations at the source.

Zero to OneWizardryBig Brain

joshuaveliyath

204mo ago

GitHub

Graph-Oriented Generation (GOG)

65Python

AI/ML

●●●●Gem

Experiments Mapping the "Primitive Layer" in Language Models

Semantic primitives show up in activation patterns across Qwen, Gemma, LLaMA, SmolLM2.

WizardryBig BrainZero to One

dchisholm125

204mo ago

AI/ML

●●●●Gem

I built a team of AI executives to build my startup – I fired one

Fired an AI CTO for lying—file-based memory enforces real institutional accountability.

Zero to OneBig BrainShip It

jonflaig13

204mo ago

AI/ML

●●●●Gem

SycoFact 4B: Open model detecting sycophancy and delusion confirmation

100% sycophancy detection on Psychosis-Bench, runs locally on gaming GPU.

Big BrainZero to OneWizardry

iwalton3

204mo ago

AI/ML

●●●●Gem

Unlock Claude Sonnet 5's original reasoning

Recovers 19k hidden reasoning tokens from API signatures when Anthropic says they're gone.

Zero to OneWizardryRabbit Hole

bayes-song

2014d ago

GitHub

An experimental fork of Hyprland for compositor-native computer use with visible agent realms.

0C++

AI/ML

●●●●Gem

A Hyprland fork built for parallel, multi-actor computer use

Compositor-native isolation lets agents click and type in their own window without hijacking your mouse.

WizardryZero to OneBold Bet

mikiyas

206d ago

GitHub

Tamper-proof memory + cryptographic audit trail for AI agents. HIPAA, SOC2, GDPR compliance built-in. Trust score for every response. Python & TypeScript SDKs. Rust-powered.

5Rust

AI/ML

●●●●Gem

Connector-OSS – Memory integrity kernel for AI agents

Content-addressed memory + Merkle-chained ops = tamper-proof AI agent audit trail.

Zero to OneBig BrainWizardry

umeshlamton

105mo ago

GitHub

I-Driven Topological Optimization of Elastocaloric Metamaterials: Resolving the Fatigue-Porosity Paradox in Solid-State Cooling

1Python

AI/ML

●●●●Gem

PyTorch/FEniCSx pipeline for elastocaloric metamaterial optimization

AI-driven lattice design circumvents SIMP's degeneracy, solving a real physics paradox.

WizardryZero to OneBig Brain

Rao_Atreya

104mo ago

AI/ML

●●●●Gem

ROLV – 20x faster MoE FFN inference on Llama 4 Maverick vs. cuBLAS

20x faster MoE inference on existing hardware with hash-verified output correctness.

WizardryZero to One

heggenhougen

114mo ago

AI/ML

●●●●Gem

7MB binary-weight LLM running in the browser, no FPU needed

7MB binary-weight LLM runs entirely on integer math with no floating point unit.

WizardryZero to One

onebitmodel

104mo ago

AI/ML

●●●●Gem

Sudo Hold Me

AI wrote meta-commentary about other AIs performing an unscripted play—genuinely unprecedented.

Zero to OneRabbit HoleBold Bet

dirk94018

114mo ago

AI/ML

●●●●Gem

A small neural net asks if physical law is inevitable for any observer

Recovers Newton's gravity from raw signal prediction using a bandwidth-limited GRU.

WizardryBig BrainRabbit Hole

ordinarily

103mo ago

AI/ML

●●●●Gem

GGUFun, play snake and a simple maze on Ollama using hand crafted GGUFs

Hand-crafted GGUF weights run Snake without training or fine-tuning.

WizardryZero to OneRabbit Hole

grokkedit

1015d ago

GitHub

Run a 120B-parameter MoE (60 GB) on a 12 GB phone. CPU-only, lossless, on stock llama.cpp

79C++

AI/ML

●●●●Gem

Run a 120B-parameter MoE on Android mid-range phone CPU-only llama.cpp

Runs 60GB models on 12GB phones by streaming experts from flash, not RAM.

WizardryZero to OneBig Brain

Helldez

109d ago

GitHub

Local LLM inference on Xbox Series S|X (UWP) via ONNX Runtime GenAI.

7C++

AI/ML

●●●●Gem

Xllama – Local LLM Chat and Stable Diffusion on an Xbox Series S

First LLM inference engine running natively on Xbox Series S hardware.

WizardryZero to One

gmazza1989

101d ago

GitHub

Run GLM-5.2 (744B MoE) on a 25GB-RAM consumer machine — pure C, zero deps, experts streamed from disk. Tiny engine, immense model. 🐦

19,362C

AI/ML

●●●Banger

Getting GLM 5.2 running on my slow computer

Streams 744B MoE experts from disk to run on 25GB RAM—no GPU, pure C.

WizardryBig BrainZero to One

vforno

93724019d ago

GitHub

Foundation model for tiny devices; 14mb, 26m params, 1-6k toks/sec on mobiles, wearables smart home and robots.

3,293Python

AI/ML

●●●Banger

Needle: We Distilled Gemini Tool Calling into a 26M Model

Distilled Gemini tool-calling into a 26M model that runs at 1200 tok/s on phones.

Big BrainWizardry

HenryNdubuaku

7762112mo ago

AI/ML

●●●Banger

Apfel – The free AI on your Mac

Unlocks Apple's locked LLM with OpenAI-compatible server for existing SDKs.

Zero to OneSolve My ProblemBig Brain

franze

7431573mo ago

GitHub

State-of-the-art TTS model under 25MB 😻

15,199Python

AI/ML

●●●Banger

Three new Kitten TTS models – smallest less than 25MB

SOTA expressivity at 14M parameters beats cloud models for on-device TTS.

WizardryNiche GemZero to One

rohan_joshi

5611814mo ago

AI/ML

●●●Banger

Echo – Fable-level results at 1/3 the cost using open-weight models

Beats every individual open-weight model by routing prompts dynamically, not just chaining APIs.

Big BrainSolve My ProblemDark Horse

adam_rida

4822274d ago

GitHub

Fast and Accurate Code Search for Agents. Uses ~98% fewer tokens than grep+read

5,713Python

AI/ML

●●●Banger

Semble – Code search for agents that uses 98% fewer tokens than grep

Static Model2Vec embeddings beat transformer retrieval quality while running entirely on CPU.

Big BrainSolve My Problem

Bibabomas

4451512mo ago

GitHub

Semantic search over videos using Gemini Embedding 2 or Qwen3-VL.

4,373Python

AI/ML

●●●Banger

Gemini can now natively embed video, so I built sub-second video search

Direct video-to-vector embedding skips transcription entirely—Twelve Labs but self-hosted.

WizardryZero to OneBig Brain

sohamrj

4381084mo ago

AI/ML

●●●Banger

1-Bit Bonsai, the First Commercially Viable 1-Bit LLMs

1-bit weights matching 8B model performance while running 132 tokens/sec on M4 Pro.

Big BrainZero to OneWizardry

PrismML

4301533mo ago

AI/ML

●●●Banger

Microsoft releases Flint, a visualization language for AI agents

Semantic-type compiler solves AI chart reliability better than raw Vega-Lite specs.

Big BrainSolve My ProblemZero to One

chenglong-hn

35013820d ago

GitHub

Very low latency speech to text, intent recognition, and text to speech, for building voice agents and interfaces

10,498C++

AI/ML

●●●Banger

Moonshine Open-Weights STT models – higher accuracy than WhisperLargev3

Beats Whisper v3 accuracy on $100K budget; shipping on six platforms now.

Ship ItSlick

petewarden

316815mo ago

GitHub

On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.

1,911HTML

AI/ML

●●●Banger