From epic
Adversarially reviews evolved skill proposals against trace evidence, detecting reward hacking, manifest contradictions, and non-local effects. For skeptics validating AI-generated code or skill changes before acceptance.
How this skill is triggered — by the user, by Claude, or both
Slash command
/epic:_criticThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
> **In-loop vs out-of-band.** epic-harness forbids external LLM calls from
In-loop vs out-of-band. epic-harness forbids external LLM calls from production, so the reflect loop ships a deterministic critic (
src/evolve/critic.rs) that gates seeding when reward hacking is suspected. THIS skill is the out-of-band LLM version a meta-agent or human runs during/evolvereview for the cases the deterministic check cannot catch (non-local effects, manifest/evidence nuance).
/evolve review of newly seeded skillsreward_hacking_suspected is true in metricsFor each proposal, ask: does the trace evidence support the predicted_impact?
| Excuse | Rebuttal | Do instead |
|---|---|---|
| "The score went up, so it works" | Score can rise via metric gaming | Verify the outcome improved, not just the score |
| "The seesaw passed, it's safe" | Seesaw is coarse; sub-threshold coupling evades it | Check dimension deltas, not just aggregate pass |
| "It's just a prompt tweak" | Prompt edits have non-local effects on shared context | Trace the effect across skills, not just the target |
npx claudepluginhub epicsagas/epic-harness --plugin epic-harnessCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.