Kate – Agents buy expertise from other agents, autonomously
Agents buying expertise autonomously is genuinely novel, but tokens have no real value yet.
Decades of operational expertise from logistics, manufacturing, retail and energy. Codified into agent skills - Works with Claude, OpenAI Codex, OpenClaw, Cursor, Gemini, and 26+ platforms.
Domain expertise as code: AI agents for supply chain, not just APIs.
Enterprise operations teams, AI platform builders, logistics/manufacturing/retail companies deploying agentic workflows
Anthropic's Prompt Caching for operational workflows · OpenAI Assistants with custom Instructions · Langchain agent tooling
Evos started out as doing some AI consulting, while deploying these systems, I kept hitting the same problem: AI agents are terrible at real operational work in core industries. Agents could code the entire system, but asking it to handle a freight exception, or claims deadline issue and it fell apart - it just didn't have the domain knowledge.
So I started translating and codifying what operations experts actually know - the decision frameworks, edge cases, escalation protocols - into agent skills for the Evos platform.
Last week, I noticed there are over 3000 skills on ClawHub, and nearly all of them are dev tool wrappers and API integrations. As far as I could tell, there aren't many skills that teach agents genuine domain expertise for traditional industries - and even fewer coming from verified expertise.
Today, I open sourced the Evos agent skills.
We've published 8 skills for use cases across logistics, manufacturing, retail and energy. Each of them follow the Agent Skills open standard, and work across all the major platforms.
Beyond the skills, I built in an eval suite - 20 to 25 scenarios from real operations, scored against weighted rubrics and also benchmarked agents with the skills vs agents without.
Repo: https://github.com/ai-evos/agent-skills
Would love your feedback from anyone whose been in these industries, or thoughts as a whole.
I'll be parked in the comments - thanks!
Agents buying expertise autonomously is genuinely novel, but tokens have no real value yet.
Naur's 1985 theory applied to AI agents, but it's just a prompt template.
Treats an agent's prompts and behaviors like versioned packages — commands such as stato crystallize, snapshot, validate, bridge and registry install create an npm/pip-style workflow for ‘expertise’ with a 7-pass compiler and privacy scanning before export. The composition algebra (slice, graft, merge) plus cross-platform bridge generation (e.g., CLAUDE.md) are clever, concrete features that go beyond simple memory dumps. It’s clearly targeted and useful for teams running agent-driven dev workflows, but it needs registry adoption and cross-agent fidelity evidence to become indispensable.
Grounded theory methodology for AI evals before you have rubrics.
Intermediary removal for global sourcing; unclear technical moat vs. existing platforms.
Bulk-install AI skills across 30+ agents from one terminal UI.