GitHub Repository

A lightweight and modular Gumbel MCTS implementation

7 starsPython

Gumbel-mcts, a high-performance Gumbel MCTS implementation

Name: Gumbel-mcts, a high-performance Gumbel MCTS implementation
Availability: InStock
Author: whiplash451

by whiplash451·Mar 19, 2026·1 point·0 comments

Visit Project View on HN

AI Analysis

●●SolidNiche GemBig Brain

Validated 2-15X speedup over Alpha Zero baseline with identical policy output.

Strengths

•Gold-standard validation against michaelnny/alpha_zero reference implementation ensures correctness.
•Sparse Gumbel variant handles large action spaces like chess where dense fails.
•Numba acceleration delivers hundreds of thousands of simulations per second.

Weaknesses

•Python/numba limits deployment compared to C++ alternatives like libtorch.
•Niche audience—only matters if you're already training game-playing agents.

Post Description

Hi folks,

Over the past few months, I built an efficient MCTS implementation in Python/numba.

As I was building a self-play environment from scratch (for learning purposes), I realized that there were few efficient implementation of this algorithm.

I spent a lot of time validating it against a golden standard baseline [1].

My PUCT implementation is 2-15X faster than the baseline while providing the exact same policy.

I also implemented a Gumbel MCTS, both dense and sparse. The sparse version is useful for games with large action spaces such as chess.

Gumbel makes much better usage of low simulation budgets than PUCT.

Overall, I think this could be useful for the community. I used coding agents to help me along the way, but spent a significant amount of manual work to validate everything myself.

Feedback welcome.

[1] https://github.com/michaelnny/alpha_zero/blob/main/alpha_zer...

Similar Projects

Open Source●●Solid

OpenSfM v1.0

Former maintainers revived abandoned OpenSfM with C++ rewrite and OpenCL GPU acceleration.

Big BrainNiche Gem

AlgerianSam

4021d ago

AI/ML●●Solid

ArXiv Scholar – An Open-Source RAG System for AI Research Papers

Hybrid search over 5,600 papers when Elicit and Semantic Scholar already exist.

Niche GemBig Brain

dubeyaayush07

201mo ago

Infrastructure●●Solid

We built an OCR server that can process 270 dense images/s on a 5090

50x faster than PaddleOCR Python with real TensorRT benchmarks.

WizardryNiche Gem

pfdomizer

822mo ago

Open Source●Mid

SVO Voxelization for Gaussian Splat Collisions

Using an SVO to voxelize Gaussian splats is a sensible way to prune overlap checks — hierarchical voxels fit the problem and should cut costly pairwise collisions. Can't judge the execution: the Reddit thread is blocked with no visible code, benchmarks, or demos, so this currently reads like an intriguing sketch rather than a drop-in tool.

Niche GemWizardry

slimbuck

505mo ago

Developer Tools●Mid

Cn-variants – Tailwind CSS variants in 3 lines of code

Minimalist cva alternative that splits variants into standalone typed functions.

CozySolve My Problem

bastianplsfix

104mo ago

AI/ML●Mid

Analyze Tweets with Sparse Autoencoders

SAE feature explorer, but limited to tweet analysis with unclear research value.

Rabbit Hole

nicetomeetyu

204mo ago