Minisft – from base model to chat model
sft.py packages a complete, small-footprint SFT flow — 4-bit quantized Llama-2-7b, LoRA adapters, OpenAssistant data, plus inference and push-to-HF helpers — so you can run an experiment with a few commands. It's a pragmatic, well-scoped starter for anyone learning parameter-efficient fine-tuning, but it doesn't claim new research and lacks deeper evaluation, recipe tuning guidance, or large-scale training validation.