Back to browse
GitHub Repository

Human Software Accountability

2 starsPython

Verification of Human Understanding of LLM-Generated Work

by jbethune·May 12, 2026·1 point·3 comments

AI Analysis

●●●BangerBig BrainBold Bet

LLM-generated comprehension quizzes blocking PR merges is a genuinely novel accountability mechanism.

Strengths
  • Four distinct enforcement modes from casual tracking to strict blocking gates.
  • Local demo script allows testing the LLM pipeline without GitHub App setup.
  • Addresses the specific risk of developers shipping AI code they don't understand.
Weaknesses
  • Risk of becoming a friction point that teams disable rather than engage with.
  • LLM grading of open-ended answers may produce false negatives on valid logic.
Target Audience

Engineering managers, tech leads, teams using AI coding assistants

Similar To

GitHub Copilot · CodeRabbit · Guru

Post Description

Built this back in March because of fears my vibe coding was getting away from me. Basically it just analyzes PRs and asks a question to the developer about them to verify understanding. Turned into a larger project with my consulting firm and clients over the last month. Going to leave the original basic implementation open source.

Similar Projects