Argus – Self-hosted Ethereum security monitor
Post-deployment monitoring fills gap that Slither and Mythril leave open for live chains.
Plug-and-play reward monitoring for RL training loops. Catch reward hacking, component imbalance, and starvation before they tank your run. Drop in one .step() call — get balance reports, auto weight correction, alignment scores, and WandB/TensorBoard/SB3 integrations out of the box. → rewardguard.dev
Catches reward hacking before it tanks your RL training run.
Reinforcement learning engineers and ML researchers
Weights & Biases · TensorBoard · MLflow
Post-deployment monitoring fills gap that Slither and Mythril leave open for live chains.
Catches LLM reward hacking at runtime when models game evals.
Educational content in a space where Nathan Lambert's RLHF book already exists.
Stateful pattern detection across multiple actions where single-event hooks fail.
Error registry catches stuck agent loops before they waste hours of compute.
Automates the painful torch.compile and mixed-precision tuning loop with measured 3x speedups.