GitHub Repository

High-performance Rust extensions for Axolotl (no OOM for large datasets) - drop-in acceleration for existing installations.

3 stars · Python

Fast-Axolotl – Rust extensions that make Axolotl fine-tuning 77x faster

Author: ticktockten

by ticktockten · Mar 11, 2026 · 1 point · 0 comments


AI Analysis

●● Solid · Niche Gem · Ship It

77x faster data loading, but it only helps if you already use Axolotl.

Strengths

• Drop-in acceleration: a single import line, with zero config changes.
• The 77x streaming speedup is benchmarked on 50k rows, with timings reported (0.009s vs 0.724s).
• Cross-platform wheels for Linux, macOS, and Windows, covering Python 3.10-3.12.

Weaknesses

• Token packing and batch padding show overhead on small datasets due to FFI costs.
• Accelerating Python ML pipelines with Rust is a well-trodden pattern (Polars, etc.).
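
The FFI point can be made concrete with a simple cost model: a native call pays a fixed crossing cost once, then a smaller per-row cost, so it only wins past some row count. The sketch below is illustrative only; the function names and all timing values are hypothetical and are not measurements from Fast-Axolotl.

```python
import math

def python_time(n_rows, per_row_s):
    """Pure-Python cost: no fixed overhead, higher per-row cost."""
    return n_rows * per_row_s

def native_time(n_rows, per_row_s, call_overhead_s):
    """Native cost: fixed FFI/conversion overhead plus a lower per-row cost."""
    return call_overhead_s + n_rows * per_row_s

def break_even_rows(py_per_row_s, native_per_row_s, call_overhead_s):
    """Smallest row count at which the native path is strictly faster."""
    gain = py_per_row_s - native_per_row_s
    if gain <= 0:
        return None  # the native path never catches up
    return math.floor(call_overhead_s / gain) + 1

# Hypothetical numbers: 1 us/row in Python, 0.1 us/row native, 100 us overhead.
PY, NATIVE, OVERHEAD = 1e-6, 1e-7, 1e-4
n_star = break_even_rows(PY, NATIVE, OVERHEAD)
print(f"native path wins from {n_star} rows upward")
```

Under these made-up numbers the native path loses below roughly a hundred rows, which is the shape of the small-dataset overhead described above.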

Post Description

I built Rust extensions for Axolotl that dramatically speed up data loading and preprocessing for LLM fine-tuning.

The problem: Python data pipelines become the bottleneck when fine-tuning large models. Your GPUs sit idle waiting for data.

The solution: Drop-in Rust acceleration. One import line, zero config changes.

Results on 50k rows:
- Streaming data loading: 0.009s vs 0.724s (77x faster)
- Parallel SHA256 hashing: 0.027s vs 0.052s (1.9x faster)
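
The post does not include its benchmark harness, so the following is only a sketch of how such timings and speedup ratios are commonly computed, using a pure-Python SHA256 baseline over 50k synthetic rows. The helper names (`best_of`, `speedup`, `hash_rows`) and the row contents are assumptions for illustration. Note that the rounded timings quoted above give a ratio of about 80x for streaming; the stated 77x presumably comes from unrounded measurements.

```python
import hashlib
import time

def hash_rows(rows):
    """Baseline: hash each row serially with hashlib."""
    return [hashlib.sha256(r).hexdigest() for r in rows]

def best_of(fn, *args, repeats=3):
    """Return the fastest wall-clock time over several runs, in seconds."""
    best = float("inf")
    for _ in range(repeats):
        start = time.perf_counter()
        fn(*args)
        best = min(best, time.perf_counter() - start)
    return best

def speedup(baseline_s, accelerated_s):
    """Ratio of baseline time to accelerated time."""
    return baseline_s / accelerated_s

# Synthetic stand-in for a 50k-row dataset (hypothetical contents).
rows = [f"row-{i}".encode() for i in range(50_000)]
digests = hash_rows(rows)
baseline_s = best_of(hash_rows, rows)
print(f"serial SHA256 over {len(rows)} rows: {baseline_s:.3f}s")

# Ratios recomputed from the rounded timings quoted in the post.
print(f"streaming: {speedup(0.724, 0.009):.1f}x")  # about 80.4x
print(f"hashing:   {speedup(0.052, 0.027):.1f}x")  # about 1.9x
```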

Works with Parquet, Arrow, JSON, JSONL, CSV. Supports compression. Cross-platform.

Usage:

    import fast_axolotl
    import axolotl  # now accelerated

    pip install fast-axolotl

Built with PyO3 and maturin. MIT licensed. Happy to answer questions about the Rust/Python interop or benchmark methodology.

Similar Projects

AI/ML · ●● Solid

Shard-based scheduling for 100x more fine-tuning experiments on 4 GPUs

Shard-based scheduling cuts GPU wait time, though Ray Tune offers similar early stopping.

Big Brain · Solve My Problem

kamranrapidfire

103mo ago

Data · ●●● Banger

Open load forecasts that beat US grid operators on 6 of 7 RTOs

Beats utility forecasts on 6 of 7 RTOs using only public EIA data and open models.

Big Brain · Dark Horse · Zero to One

tylergibbs1

403mo ago

AI/ML · ● Mid

OpenAI CLIP fine tuned on Galaxy morphology

Galaxy classification model, but model card has mostly empty fields.

Niche Gem

mjupp1

104mo ago

AI/ML · ●● Solid

Pre-training, fine-tuning, and evals platform

Eval-synthesize-train loop automates custom model development better than manual fine-tuning.

Slick · Bold Bet

oli_kitty

403mo ago

AI/ML · ●● Solid

Flint – A 30B model fine-tuned for less repetition

Fine-tuned Qwen 30B that prioritizes output diversity over convergent accuracy.

Niche Gem · Solve My Problem

thmsmxwll

623mo ago

AI/ML · ●●● Banger

ShadowPEFT – Centralized and Detachable Parameter-Efficient Fine-Tuning

Detachable PEFT modules that version independently, unlike LoRA's weight-coupled adapters.

Big Brain · Zero to One · Niche Gem

yokee

622mo ago