Guides evaluation-driven development (EDD) process for agent skills. Use when setting up skill testing workflows, creating skill evaluation scenarios, or establishing Claude A/B feedback loops for skill validation. Provides development methodology, not content guidance.
/plugin marketplace add taisukeoe/agentic-ai-skills-creator
/plugin install skills-helper-experimental@agentic-skills-creator

This skill is limited to using the following tools:
tests/scenarios.md

Run evaluation-driven development cycle for agent skills.
Create evaluations BEFORE writing documentation. This ensures skills solve real problems.
Evaluation scenarios are saved to tests/scenarios.md as part of /creating-effective-skills workflow (Step 7).
Measure Claude's performance WITHOUT the skill:
Create just enough content to address the gaps:
REQUIRED: Run /creating-effective-skills before writing any skill content. This ensures proper naming, description format, and structure from the start.
Note: This step requires Claude Code CLI. Skip this step if using Claude.ai (model selection not available).
REQUIRED: Run /evaluating-skills-with-models with the skill path.
The skill will:
tests/scenarios.md
After evaluation: Document recommended model in skill's metadata.
REQUIRED: Run /improving-skills when observations reveal issues.
Before considering the skill complete:
REQUIRED: Run /reviewing-skills to verify compliance with best practices.
After all reviews pass, output instructions for the user to validate the skill in a fresh session:
## Test Your Skill
Run this command in a new terminal to test with a fresh Claude session:
claude --model {recommended_model} "{evaluation_query}"
After testing, paste the output file or result back into this session for final confirmation.
Replace:
- {recommended_model}: Model determined in Step 4 (e.g., sonnet)
- {evaluation_query}: A representative query from your evaluations

Identify gaps -> Create evaluations -> Baseline -> Write minimal -> Model eval (sub-agents) -> Review -> User validation
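The `claude --model {recommended_model} "{evaluation_query}"` template above can be filled in programmatically when the instructions are generated. The following is a minimal sketch, not part of the skill itself: the function name `build_validation_command` and the example query are hypothetical, and it uses `shlex.quote` (which emits single quotes) instead of the template's double quotes so that queries containing quotes or spaces survive the shell intact.

```python
import shlex

# Template from the "Test Your Skill" instructions; placeholders are
# filled with shell-quoted values (an assumption made for safety).
TEMPLATE = "claude --model {recommended_model} {evaluation_query}"


def build_validation_command(recommended_model: str, evaluation_query: str) -> str:
    """Return a shell-safe command line for a fresh-session test (hypothetical helper)."""
    return TEMPLATE.format(
        recommended_model=shlex.quote(recommended_model),
        evaluation_query=shlex.quote(evaluation_query),
    )


if __name__ == "__main__":
    # Hypothetical query; use a representative one from your evaluations.
    print(build_validation_command("sonnet", "Create a skill for PDF review"))
    # prints: claude --model sonnet 'Create a skill for PDF review'
```

Quoting both values means the command can be pasted into a terminal unchanged, whatever characters the evaluation query contains.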
| Observation | Indicates |
|---|---|
| Unexpected file reading order | Structure not intuitive |
| Missed references | Links need to be explicit |
| Repeated reads of same file | Move content to SKILL.md |
| Never accessed file | Unnecessary or poorly signaled |
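Two rows of the table above (repeated reads and never-accessed files) can be checked mechanically if the file reads from an evaluation run are recorded. The sketch below assumes a hypothetical `read_log` (the ordered list of paths read during a run) and `skill_files` (every file shipped with the skill); neither is a format defined by the skill, and the file paths are illustrative only.

```python
from collections import Counter


def diagnose_reads(read_log: list[str], skill_files: list[str]) -> dict[str, str]:
    """Map each problematic file to the table row it matches (hypothetical helper)."""
    counts = Counter(read_log)
    findings: dict[str, str] = {}
    for path in skill_files:
        if counts[path] == 0:
            # Table row: "Never accessed file"
            findings[path] = "Never accessed: unnecessary or poorly signaled"
        elif counts[path] > 1 and path != "SKILL.md":
            # Table row: "Repeated reads of same file"
            findings[path] = "Repeated reads: move content to SKILL.md"
    return findings


if __name__ == "__main__":
    # Illustrative data, not output from a real evaluation run.
    log = ["SKILL.md", "references/a.md", "references/a.md"]
    files = ["SKILL.md", "references/a.md", "references/b.md"]
    for path, finding in diagnose_reads(log, files).items():
        print(f"{path}: {finding}")
```

The other two rows (unexpected reading order and missed references) depend on what the author intended, so they are left to manual review of the same log.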