Explain the autoresearch methodology — a verifiable autonomous experiment loop — and evaluate whether the current repo is a good fit before running the autoresearch-verify and autoresearch-program skills. Use when a user wants to add autonomous experimentation to a project, asks about autoresearch/autoresearch, or is about to invoke the other autoresearch skills.
Generate `program.md` at the repo root from the autoresearch program template, with mutable/immutable scope and verifier commands baked in via light templating. Run after autoresearch-verify produces the verifier script. The generated program.md is the operating directive for a fresh agent session running the autoresearch loop — after this skill runs, the autoresearch skills are no longer in the picture. Use when a user wants to generate the loop directive for an autoresearch run.
Build a repo-specific verifier script for an autoresearch loop. The script has two modes — `light` (fast per-candidate metric evaluation, emits the JSON the loop consumes) and `heavy` (matrix-of-conditions integrity check). Run after autoresearch-method confirms the repo fits and before autoresearch-program. Use when a user wants to build the verifier script for an autoresearch loop.
Ultra-compressed communication mode. Cuts token usage ~75% by speaking like caveman while keeping full technical accuracy. Supports intensity levels: lite, full (default), ultra, wenyan-lite, wenyan-full, wenyan-ultra. Use when user says "caveman mode", "talk like caveman", "use caveman", "less tokens", "be brief", or invokes /caveman. Also auto-triggers when token efficiency is requested.
Interview the user relentlessly about a plan or design. Use when the user wants to stress-test a plan before building, or uses any 'grill' trigger phrases.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimBased on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
bunx skills add https://github.com/will-wright-eng/skills
This command uses the vercel-labs/skills CLI to implement skills in this repo.
A Claude Code plugin that ships three sequential skills for adding a verifiable autonomous experiment loop to a git repository, generalized from karpathy/autoresearch.
After install, invoke them in order: autoresearch-method → autoresearch-verify → autoresearch-program. Once program.md is generated, hand it to a fresh agent session and the loop runs from there.
| Skill | Purpose |
|---|---|
autoresearch-method | Explain the methodology and evaluate whether the current repo is a good fit. |
autoresearch-verify | Build a repo-specific verifier script with light (per-candidate metric) and heavy (integrity matrix) modes. |
autoresearch-program | Generate program.md at the repo root — the operating directive a fresh agent session uses to run the loop. |
From mattpocock/skills — replicating third-party skills (after reading them) reduces prompt-injection risk versus installing from a remote source that can change underneath you.
| Skill | Purpose |
|---|---|
improve-codebase-architecture | Surface deepening opportunities — refactors that turn shallow modules into deep ones, using a fixed architectural vocabulary. |
grill-with-docs | Interview-style session that stress-tests a plan against the project's domain language and updates CONTEXT.md / ADRs inline as decisions crystallise. |
improve-codebase-architecture references grill-with-docs for CONTEXT.md and ADR format docs, so the two skills are designed to be installed together.
From mattpocock/skills — replicated locally to reduce prompt-injection risk.
| Skill | Purpose |
|---|---|
grill-me | Relentless, one-question-at-a-time interview that stress-tests a plan or design before you build, recommending an answer for each decision and exploring the codebase when it can answer a question itself. |
From JuliusBrussee/caveman — replicated locally to reduce prompt-injection risk.
| Skill | Purpose |
|---|---|
caveman | Ultra-compressed response mode — cuts token usage ~75% by stripping articles, filler, and hedging while keeping full technical accuracy. Supports lite / full / ultra and 文言文 (wenyan-*) intensity levels. |
npx claudepluginhub will-wright-eng/skillsUltra-compressed communication mode. Cuts ~75% of tokens while keeping full technical accuracy by speaking like a caveman.
Frontend design skill for UI/UX implementation
Memory compression system for Claude Code - persist context across sessions
Marketing skills for AI agents — conversion optimization, copywriting, SEO, paid ads, ad creative, and growth
Comprehensive UI/UX design plugin for mobile (iOS, Android, React Native) and web applications with design systems, accessibility, and modern patterns
Standalone image generation plugin using Nano Banana MCP server. Generates and edits images, icons, diagrams, patterns, and visual assets via Gemini image models. No Gemini CLI dependency required.