Auto GPU Kernel – Autonomous GPU-kernel discovery and optimizer
Autonomous kernel optimizer that won an MLSys contest with a 34.93x speedup.
cuTile Rust provides a safe, tile-based kernel programming DSL for the Rust programming language. It features a safe host-side API for passing tensors to asynchronously executed kernel functions.
Extends Rust's ownership model across the GPU boundary, using tile-based partitioning to keep kernels free of data races.
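The idea of pairing ownership with tile partitioning can be sketched on the CPU with the standard library alone. The example below is an analogy only: it does not use cuTile Rust or any GPU API, and the function name `scale_in_tiles` and its parameters are hypothetical. It splits a buffer into disjoint mutable tiles and hands each tile to its own thread, so the borrow checker rejects any attempt by two workers to write the same element.

```rust
use std::thread;

/// Hypothetical helper (not part of cuTile Rust): scale `data` in place,
/// processing one tile per thread.
///
/// `chunks_mut` yields non-overlapping `&mut [f32]` tiles, so each thread
/// holds exclusive access to its tile and a data race cannot be expressed.
fn scale_in_tiles(data: &mut [f32], tile_len: usize, factor: f32) {
    assert!(tile_len > 0, "tile_len must be non-zero");

    // Scoped threads may borrow from the enclosing stack frame; the scope
    // joins every spawned thread before it returns.
    thread::scope(|s| {
        for tile in data.chunks_mut(tile_len) {
            // Move the exclusive tile borrow into the worker thread.
            s.spawn(move || {
                for x in tile.iter_mut() {
                    *x *= factor;
                }
            });
        }
    });
}

fn main() {
    let mut data: Vec<f32> = (0..8).map(|i| i as f32).collect();
    // The last tile is shorter than `tile_len` when the length does not divide evenly.
    scale_in_tiles(&mut data, 3, 2.0);
    println!("{:?}", data);
}
```

The safety here comes from the partitioning step rather than from locks: once the buffer is split into tiles, no two workers can name the same memory. A GPU DSL would presumably enforce the same property across the host/device boundary, but how cuTile Rust does so is not shown here.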
Rust developers writing GPU kernels, HPC engineers
rust-gpu · CUDA · wgpu
Beats PyTorch eager by 5.29x on RMSNorm using autonomous agent loops.
Fencing tokens and lease-expiry races caught with a deterministic test harness: correctness, not just convenience.
Rust-native audio graphs without learning SuperCollider or Max/MSP.
Rust tile server with MapLibre Native rendering when Martin already exists.
Type-safe Rust-to-Haskell FFI with automatic memory management via ForeignPtr.