From plugin-eval
Evaluate a local Codex skill in engineer-friendly terms. Use when the user says "evaluate this skill", "give me an analysis of the game dev skill", "audit this skill", "why did this score that way", "what should I fix first", or asks for a skill-specific report before benchmarking it.
How this skill is triggered — by the user, by Claude, or both
Slash command
/plugin-eval:evaluate-skillThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Use this skill when the target is a local skill directory or `SKILL.md` file.
Use this skill when the target is a local skill directory or SKILL.md file.
~/.codex/skills/<skill-name> and then repo-local skills/<skill-name>.plugin-eval start <skill-path> --request "<user request>" --format markdown to show the routed path clearly.plugin-eval analyze <skill-path> --format markdown.At a Glance, Why It Matters, Fix First, and Recommended Next Step before drilling into details.plugin-eval init-benchmark <skill-path> and show the setup questions for refining the starter scenarios in .plugin-eval/benchmark.json.plugin-eval measurement-plan <skill-path> --observed-usage <usage.jsonl> --format markdown to recommend what to instrument or improve next.../improve-skill/SKILL.md.name and description qualitySKILL.md or descriptionsEvaluate this skill.Give me an analysis of the game dev skill.Audit this skill.Why did this skill score that way?What should I fix first?Measure the real token usage of this skill.plugin-eval start <skill-path> --request "Evaluate this skill." --format markdown
plugin-eval analyze <skill-path> --format markdown
plugin-eval explain-budget <skill-path> --format markdown
plugin-eval measurement-plan <skill-path> --format markdown
plugin-eval init-benchmark <skill-path>
plugin-eval benchmark <skill-path> --dry-run
../../references/chat-first-workflows.mdnpx claudepluginhub robinebers/converted-plugins --plugin plugin-evalCreates bite-sized, testable implementation plans from specs or requirements, with file structure and task decomposition. Activates before coding multi-step tasks.