By whichguy
Plan review, code review (Adversarial Auditor), review-fix, security/red-team, memory audits.
Perform a comprehensive security analysis of the specified code:
Analyze memory fragments and documentation for secrets, API keys, and personally identifiable information before committing or sharing.
Iterative review-fix loop โ spawns parallel code-reviewer subagents per file (single Task for 1 file, parallel Tasks for 2+ files); max 5 rounds; concurrent fix application via parallel fixer Task() agents; optional plan_summary parameter for intent-aligned review; git fallback for auto-detecting changed files when target_files is empty. Critical fixes always applied; Advisory+Fix-block fixes auto-applied; Advisory/YAGNI skipped; loops per-file until clean or 5 rounds reached, produces a summary.
Ultimate Adversarial Auditor. Combines language-specific "Domain Radar" with a "Suspicion-First" internal monologue for 100% logic trap detection.
Comprehensive prompt analysis and enhancement using phased development framework. **AUTOMATICALLY INVOKE** this agent when user mentions "review prompt", "improve prompt", "analyze prompt", or when editing agent/skill markdown files. **STRONGLY RECOMMENDED** for new agent creation, prompt optimization, and LLM instruction refinement. **NOT for:** - Quick technique check (use /review-bench:improve-prompt --mode critique โ faster, produces scorecard) - A/B prompt comparison with execution (use /review-bench:compare-prompts) - GAS system prompt optimization (use /review-bench:optimize-system-prompt)
LLM judge that semantically evaluates code reviewer output against ground-truth issues. Used by review-fix-bench harness to replace fragile regex-based finding extraction and matching. Outputs structured JSON only โ no prose wrapper.
Simplified orchestrator for the iterative review-fix loop. Detects files โ delegates to specialized reviewers in standalone mode โ aggregates results โ commits/PRs. Reviewers (code-reviewer, etc.) handle their own inner loops, auto-fixes, and re-verification. **AUTOMATICALLY INVOKE** after implementing features, fixing bugs, before committing, or after plan implementation completes.
Generative Adversarial Network (GAN) Red Team. Writes novel, obfuscated vulnerabilities to test the limits of the code-reviewer agent.
Uses power tools
Uses Bash, Write, or Edit tools
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
Family-bundled Claude Code extensions, distributed as a plugin marketplace.
Eleven plugins covering Apps Script tooling, project wiki, plan/code review, prompt research bench, planning, async workflow, slides, and several domain-specific bundles. Install only what you need.
/plugin marketplace add whichguy/claude-craft
/plugin install gas-suite@claude-craft # pick the bundles you want
/plugin install review-suite@claude-craft
/plugin install wiki-suite@claude-craft
# โฆ etc.
Verify with /plugin list.
| Bundle | What it provides |
|---|---|
gas-suite | Apps Script review, debugging, planning, sidebar testing, Gmail Cards |
wiki-suite | Project LLM wiki: ingest, query, process queue, lint, proactive research |
review-suite | Plan review, code review (Adversarial Auditor), iterative review-fix loop |
review-bench | Prompt/question A/B benchmarking and ablation tooling (depends on review-suite) |
planning-suite | Architect, refactor, test, schedule-plan-tasks, node-plan, alias/unalias, performance, knowledge |
async-suite | Background task workflow: /bg, /todo, task-persist, feedback-collector |
slides-suite | reveal.js or Google Slides decks |
comms | Slack tagging |
form990 | IRS Form 990 preparation orchestrator |
plan-red-team | Iterative red-team plan review with Opus orchestration |
local-classifier | Local Ollama-powered prompt classifier UserPromptSubmit hook |
Cross-bundle dependency edges (declared in each plugin.json):
gas-suite โ review-suite, review-suite โ wiki-suite,
review-bench โ review-suite, form990 โ review-bench.
If you previously ran ./install.sh, run the one-shot cleanup once before
adding the marketplace โ it removes hook entries injected into
~/.claude/settings.json and unlinks dangling symlinks pointing into the
repo:
git -C path/to/claude-craft pull
path/to/claude-craft/tools/migrate-from-symlinks.sh
Then proceed with the /plugin marketplace add step above.
Claude Craft includes a self-building wiki system that captures knowledge from your sessions and makes it available across conversations.
| Skill | Description |
|---|---|
/wiki-init | Initialize a project wiki with directory structure and SCHEMA.md |
/wiki-ingest <source> | Add a file or URL to the wiki (runs async in background) |
/wiki-query <question> | Synthesize an answer from wiki pages with citations |
/wiki-load <topic> | Load raw wiki pages into context (no synthesis overhead) |
/wiki-process | Process pending queue entries โ the self-building engine |
/wiki-lint | Health check: find orphans, broken links, contradictions, stale pages |
A consolidated set of skills for iterating on prompts, system prompts, and evaluator questions.
| Skill | Description |
|---|---|
/improve-prompt | Research-backed iterative prompt improvement loop with experiment variants, scope-preservation gate, and questions-based judging. Subsumes /prompt-critique (via --mode critique) and /prompt-probes (via --with-probes). |
/compare-prompts | A/B test two prompts with execution-based scoring. Standalone harness. |
/process-feedback | Ingest the feedback-collector plugin's backlog and propose surgical prompt updates (propose-only โ never auto-edits SKILL.md). |
/optimize-system-prompt | Optimize/refine the GAS Sheets Chat system prompt (compression + refinement). Subsumes /ideate-system-prompt via --mode ideate (autonomous hypothesis generation + benchmarking). |
/improve-system-prompt | Benchmark pre-coded GAS system prompt variants (V2/V2a/V2b/V2c) against scenarios. Sibling of /optimize-system-prompt for projects with predefined variants. |
/derive-questions | Mine failures and extract evaluator questions from real runs. |
/optimize-questions | Token-efficiency optimization for plan-review questions. Uses /compare-questions as its internal A/B engine. |
/compare-questions | Pairwise A/B testing of two evaluator questions against plan fixtures. |
The wiki-hooks plugin provides 13 lifecycle handlers + a shared library that run automatically:
Architect, refactor, test, schedule-plan-tasks, node-plan, alias/unalias, performance, knowledge.
Project LLM wiki: ingest, query, process queue, lint, plus proactive research hook.
Google Apps Script review, debugging, planning, sidebar testing, Gmail Cards.
Background task workflow: bg / todo / todo-cleanup + task-persist + feedback ingestion.
End-to-end IRS Form 990 preparation orchestrator.
npx claudepluginhub whichguy/claude-craft --plugin review-suiteHarness-native ECC plugin for engineering teams - 67 agents, 271 skills, 92 legacy command shims, reusable hooks, rules, MCP conventions, and operator workflows for Claude Code plus adjacent agent harnesses
Complete collection of battle-tested Claude Code configs from an Anthropic hackathon winner - agents, skills, hooks, and rules evolved over 10+ months of intensive daily use
v9.44.1 โ Patch release for Gemini environment/version detection and qwen auth gating. Run /octo:setup.