GitHub Repository

GPT-2-style LLM built from scratch in C/CUDA with hand-written backprop, BPE tokenizer, FlashAttention, pretraining, and SFT.

0 starsCuda

NanoEuler – GPT-2 scale model in pure C/CUDA from scratch

Name: NanoEuler – GPT-2 scale model in pure C/CUDA from scratch
Availability: InStock
Author: vforno

by vforno·Jun 19, 2026·2 points·0 comments

Visit Project View on HN

AI Analysis

●●SolidWizardryBig Brain

Hand-written FlashAttention and full gradient checks in pure CUDA with no PyTorch.

Strengths

•Complete from-scratch pipeline: tokenizer, pretraining, and SFT in one repo
•CPU reference implementation validates CUDA gradients via full-model check
•Residual blocks explained as Forward-Euler ODE discretization

Weaknesses

•116M params on single GPU produces fluent but shallow output
•LLM-from-scratch educational projects already exist (nanoGPT)

Similar Projects

AI/ML●Mid

FlashQwen – A from-scratch CUDA inference engine for Qwen3

Another inference engine when vLLM and llama.cpp already dominate.

Bold BetNiche Gem

langtang1996

503d ago

AI/ML●●●Banger

WaveletLM – wavelet-based, attention-free model with O(n log n) scaling

Wavelet-based attention-free architecture beats GPT-2 Medium with 80x less training data.

Zero to OneWizardryBold Bet

anarmorarm

711mo ago

AI/ML●●●Banger

MicroGPT-C – C99 GPT for Edge Training and Tiny Model Pipelines

Karpathy's microgpt in C99, proves tiny coordinated models beat single large models on logic.

WizardryBig Brain

Ajay__soni

103mo ago

Education●●●Banger

How-to-Train-Your-GPT

Build a LLaMA-style model from scratch with zero ML prerequisites or math.

CozyBig Brain

RaiyanYahya

101mo ago

AI/ML●●●Banger

GPT-2 inference in pure C#, 0 bytes allocated per token

GPT-2 inference in pure C# allocating zero bytes per token beats ONNX Runtime.

WizardryBig Brain

dev-on-bike

111mo ago

Education●●Solid

A book that builds GPT-2, Llama 3, DeepSeek from scratch in PyTorch

Loads real Meta and OpenAI weights, not just training from scratch.

Niche GemBig Brain

s1lv3rj1nx

212mo ago