Cynical Sally – a personality that remembers you, now on OpenClaw
Personality skin on AI chat when countless chatbot personalities already exist.
Brutally honest senior-engineer code reviews for Claude Code, Cursor & Windsurf - and your terminal. Scores, evidence-backed issues, usable fixes.
CI/CD build gates with --fail-under flag beat generic ChatGPT paste workflows.
Developers wanting automated code reviews with attitude
CodeRabbit · GitHub Copilot · Reviewable
https://github.com/w1ckedxt/cynicalsally-cli
Sally is an AI code reviewer with a consistent personality. She scores your code 0-10, finds real issues with evidence, and gives you actionable fixes. Her words, not mine:
"Everyone has opinions. Mine just happen to be evidence-based."
What makes her different from "paste into ChatGPT":
- 6 specialised tools: code review, explain, refactor, PR review, brainstorm, frontend + marketing review - Works as CLI and MCP server (Claude Code, Cursor, Windsurf) - CI/CD ready with --fail-under flag for build gates - One consistent personality across all platforms
Try it right now, no account needed!
npm install -g @cynicalsally/cli
90 free roasts/month. Every premium tool has one free trial so you can see if her feedback and Full Suite is worth paying for.
She also reviews websites, CVs, and pretty much anything at cynicalsally.com, and as a Chrome/Safari extension, but the CLI is where developers live so that's where the code/project value resides.
Fun fact: Render offered somebody else a collab through a Reddit comment and I commented "I'd like to do thatas well!" or something to that effect. They hit me up and I've been building Sally from the ground up after the initial traction on Reddit via comments.
Built with: Node.js, Claude API (Anthropic), deployed on Render. Happy to answer questions about the architecture or how I built an entire multi-platform product solo.
Personality skin on AI chat when countless chatbot personalities already exist.
AI sidebar roasts websites with attitude, but Chrome store listing is broken.
Recursive review cycles force agent confidence scores before you ever see the output.
RFC 3339 hits 88% accuracy while unix epoch fails 50% of the time.
Pairwise comparison ranking fixes Google's useless 4.8-star inflation for luxury hotels.
LangGraph agent adapts search strategy per query, but LLMs still hallucinate in contracts.