Back to browse
GitHub Repository

Extract and serve CAA-style emotion steering vectors for any HF causal LM

5 starsPython

Per-request emotion steering for vLLM, with batching preserved

by ChrisPoensgen·May 5, 2026·1 point·0 comments

AI Analysis

●●SolidNiche GemBig Brain

Per-request emotion steering that preserves vLLM continuous batching for Qwen3.

Strengths
  • Preserves continuous batching in vLLM while applying per-request steering vectors.
  • Automated layer selection uses AUC probing to find optimal steering locations.
  • Provides both extraction tools and a serving API for immediate integration.
Weaknesses
  • Fast path is strictly pinned to vLLM 0.20.x and Qwen3 architecture only.
  • Niche utility limits appeal to researchers specifically interested in activation addition.
Category
Target Audience

ML engineers and researchers experimenting with model steering

Similar To

LLM-Beamer · Activation Additions

Similar Projects

AI/ML●●Solid

Kronaxis Router – Don't pay frontier prices when a local LLM is enough

LLM cost routing with LoRA awareness when LiteLLM already handles basic proxying.

Big BrainSolve My Problem
JasonDuke
202mo ago