RewardGuard – detect reward hacking in RL training loops
Catches reward hacking before it tanks your RL training run.
Stop silent data leakage in ML training pipelines.
Catches silent data leakage that train/test metrics miss — DuckDB-powered, 1M rows in 12s.
ML/Data engineers building training pipelines
Great Expectations (data validation framework) · Feast (feature store with temporal logic)
Timefence audits your dataset for any rows where feature_time > label_time, and can rebuild it with point-in-time correct joins. Built on DuckDB, handles 1M labels × 10 features in ~12s. Also has a --strict flag for CI.
pip install timefence timefence quickstart churn-example && cd churn-example timefence audit data/train_LEAKY.parquet
MIT licensed. Happy to answer any questions you might have.Catches reward hacking before it tanks your RL training run.
Vector DBs store memories; this one forgets, consolidates, and flags contradictions like human memory.
Zero-dependency Python library replacing removed stdlib functions with comprehensive platform detection.
The author documents ripping out Ultralytics and training YOLOX end-to-end on an aircraft dataset, releasing code under an MIT license so you can run and modify the whole pipeline yourself. This is the sort of no-frills, reproducible recipe that saves time if you need full control over configs, checkpoints and licensing — not novel research, but genuinely useful for people who hit the limits of packaged repos.
Detects deepfakes locally using optical flow vectors instead of sending files to cloud APIs.
Yet another secret scanner, but this one's a single Python file.