Back to browse
Association rule mining on 21.6M poker hands

Association rule mining on 21.6M poker hands

by et9797·Mar 22, 2026·2 points·0 comments

AI Analysis

●●SolidBig BrainNiche Gem

GPU-accelerated pattern mining from protein research repurposed for poker hand analysis.

Strengths
  • Vectorized CUDA implementation from ET-Miner pipeline originally built for AlphaFold protein mining.
  • 1.4M differential rules completely free with no paywall or account required.
  • 503K patterns with infinite lift—found in winners, never in losing hands.
Weaknesses
  • Niche audience limits broader developer appeal beyond poker players.
  • Poker analytics tools like PioSolver and GTO+ already serve this market.
Category
Target Audience

Poker players, data scientists

Similar To

PioSolver · GTO+ · Holdem Manager

Post Description

We built this as a side project that grew out of something completely different.

I work on ET-Miner [https://zenodo.org/records/18674353], which is a GPU-accelerated frequent itemset mining pipeline based on the infamous apriori-algorithm. We came with the idea to reformulate the algorithm into a fully vectorized implementation, using a boolean transaction matrix representation,CUDA kernels + Rust group builder for index construction to speed up computations. The original use case was mining protein structure patterns from AlphaFold, where we processed 109.2M proteins and extracted 16.8 billion frequent itemsets for protein structural motif discovery. At some point I realized the same pipeline could be pointed at any domain with structured categorical data, so I pointed it at poker, one of my long-standing hobbies.

What we learned: Most of the "surprising" patterns the mining surfaces are things good players already know intuitively: positional advantages, aggression frequency correlations, stack-to-pot ratios. But seeing them as statistically validated itemsets with exact support counts is different from folk wisdom. A few patterns around multi-way pot dynamics and specific board texture interactions were genuinely non-obvious to the poker players we showed them to. Modern GTO solvers have no solutions for these multi-way pot scenarios. Oh, controversial, but donk-betting is a ~40% winner's exclusive rule

Data is completely free to exlore. In total 1.4m rules have been mined from the PHH dataset published here: https://zenodo.org/records/13997158.

Similar Projects

Gaming●●Solid

Mach9 Poker Beta

Rule-based strategy building beats memorizing GTO charts for tournament play.

Niche GemShip It
ChicagoDave
212mo ago
Data●●Solid

Benchmarking Apple Silicon unified mem for GPU-accelerated SQL analysis

The repo does one practical thing well: quantify the real-world impact of Apple Silicon's unified memory on analytics by running six TPC-H queries plus a GPU-favorable QX and shipping the raw charts and code. It's specific and empirical — you get MLX vs NumPy vs DuckDB numbers and PNGs, not just hand-wavy claims — but it's narrowly scoped to M4 hardware and small-ish scales, so its conclusions are useful for experimentation rather than sweeping generalization.

WizardryNiche Gem
sadopc
313mo ago