Generator SFT and DPO datasets for tool-calling LoRA fine-tuning

by senza1dio · Mar 12, 2026 · 2 points · 1 comment

AI Analysis

●● Solid · Big Brain · Niche Gem

SHA-256 deterministic RNG beats Python hash for reproducible dataset generation.
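
As a rough illustration of the idea (not the project's actual code), the sketch below derives pseudo-random values from a SHA-256 digest of a seed and an index. Python's built-in `hash()` on strings is salted per process unless `PYTHONHASHSEED` is fixed, so it cannot give the same stream across runs. The names `det_int`, `det_uniform`, and `det_choice` are hypothetical.

```python
import hashlib


def det_int(seed: str, index: int) -> int:
    """Map (seed, index) to a 64-bit unsigned integer via SHA-256."""
    digest = hashlib.sha256(f"{seed}:{index}".encode("utf-8")).digest()
    # The first 8 bytes of the digest are enough for a 64-bit value.
    return int.from_bytes(digest[:8], "big")


def det_uniform(seed: str, index: int) -> float:
    """Return a float in [0, 1) that is identical on every run and machine."""
    # Keep the top 53 bits so the result is exactly representable as a float.
    return (det_int(seed, index) >> 11) / 2**53


def det_choice(seed: str, index: int, options: list):
    """Pick one option deterministically for the given (seed, index)."""
    return options[det_int(seed, index) % len(options)]


if __name__ == "__main__":
    tools = ["search", "calculator", "weather"]  # hypothetical tool names
    for i in range(3):
        print(i, det_choice("dataset-v1", i, tools), det_uniform("dataset-v1", i))
```

Because every value depends only on the seed string and the index, rerunning the generator with the same seed reproduces the same dataset, and individual examples can be regenerated without replaying the whole stream.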

Strengths

•Anti-template detection uses four Bloom filter layers in ~8 MB fixed memory.
•Seven configurable quality gates catch low-quality synthetic examples automatically.
•Streaming pipeline yields examples one-at-a-time for constant RAM regardless of dataset size.
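
A minimal sketch of how a fixed-memory Bloom filter and a one-at-a-time generator can fit together is shown here. It is an assumption-laden illustration, not the project's implementation: it uses a single filter of 2 MiB (four such filters would total roughly 8 MB), and the class `BloomFilter` and function `stream_unique` are hypothetical names. How the project's four layers and seven quality gates are actually defined is not stated in the source.

```python
import hashlib
from typing import Iterable, Iterator


class BloomFilter:
    """Fixed-size probabilistic set: no false negatives, rare false positives."""

    def __init__(self, num_bits: int = 1 << 24, num_hashes: int = 4):
        # num_hashes must be <= 8 because each hash uses 4 of the 32 digest bytes.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits // 8)  # 2 MiB at the default size

    def _positions(self, item: str) -> Iterator[int]:
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        # Slice the digest into 4-byte chunks, one bit position per chunk.
        for i in range(self.num_hashes):
            chunk = digest[4 * i : 4 * i + 4]
            yield int.from_bytes(chunk, "big") % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos >> 3] |= 1 << (pos & 7)

    def __contains__(self, item: str) -> bool:
        return all(
            self.bits[pos >> 3] & (1 << (pos & 7)) for pos in self._positions(item)
        )


def stream_unique(examples: Iterable[str], seen: BloomFilter) -> Iterator[str]:
    """Yield examples one at a time, skipping any the filter has probably seen."""
    for text in examples:
        if text in seen:
            continue  # likely a repeat; a false positive drops a rare new item
        seen.add(text)
        yield text


if __name__ == "__main__":
    candidates = ["call get_weather", "call search", "call get_weather"]
    for example in stream_unique(candidates, BloomFilter()):
        print(example)
```

Memory stays constant because the filter's size is fixed up front and the generator never holds more than one example at a time; the trade-off is that an occasional new example may be discarded as a false positive.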

Weaknesses

•Niche audience limits adoption: it only matters if you're already doing LLM fine-tuning.
•Pre-generated datasets are small (1,160 SFT, 120 DPO) compared to commercial alternatives.

Category

Target Audience

ML engineers fine-tuning LLMs for tool use

Similar To

Argilla · Distilabel · Synthetik

Similar Projects

AI/ML · ●● Solid

Afterimage is now open-source for infra-grade dataset generation

Composable YAML-to-dataset pipeline for LLM fine-tuning, in a space where Distilabel already exists.

Big Brain · Niche Gem

monatis

20 · 3mo ago

AI/ML · ● Mid

SFT to convert a base language model into a conversational chat model

Tutorial code for SFT pipeline, but dozens of identical examples exist on GitHub.

Ship It

onurkanbkrc

10 · 4mo ago

AI/ML · ●● Solid

Nova – Self-hosted personal AI learns from corrections & fine-tunes itself

DPO self-fine-tuning from corrections in a sea of Open WebUI clones.

Big Brain · Niche Gem

heliosnova

33 · 4mo ago

AI/ML · ●●● Banger

Open-source LLM and dataset for sports forecasting (Pro Golf)

Beats GPT-5 at golf forecasting via auto-labeled data pipeline; replicable recipe for any domain via SDK.

Big Brain · Zero to One

bturtel

70 · 5mo ago

AI/ML · ●● Solid

DataFlow, Turn raw data into high-quality LLM training datasets

LLM-based cleaning operators beat regex pipelines for messy text data.

Solve My Problem · Ship It

Junnn

20 · 4mo ago

AI/ML · ●● Solid

Shard-based scheduling for 100x more fine-tuning experiments on 4 GPUs

Shard-based scheduling cuts GPU wait time, though Ray Tune offers similar early stopping.

Big Brain · Solve My Problem

kamranrapidfire

10 · 4mo ago