My plugins for Claude Code.
/plugin install <plugin-name>@ben-claude-plugins
| Plugin | Description |
|---|---|
| rodney | Browser automation via the rodney CLI for web scraping, frontend verification, and page interaction |
| eval-designer | Design production-quality LLM evaluations for Langfuse |
| langfuse | Query Langfuse LLM observability platform via the lf CLI |
| obsidian-agent-tools | CDP tools for Obsidian plugin development and testing |
| readme-generator | Generate excellent README files following best practices from https://github.com/matiassingers/awesome-readme |
| resend | Send emails and manage domains, API keys, and templates via the Resend CLI |
| sprite | Manage Sprites - persistent, isolated Linux microVMs for safe code execution |
MIT
Based on adoption, maintenance, documentation, and repository signals. Not a security audit or endorsement.
npx claudepluginhub tavva/ben-claude-plugins --plugin eval-designerTeaches AI coding agents to create promptfoo eval suites with deterministic assertions, provider configs, and best practices
Skills for adding DeepEval evaluations, tracing, datasets, Confident AI reports, and iterative improvement loops to AI applications.
Claude Code skill pack for Langfuse LLM observability (24 skills)
Skills for building LLM evaluations: pipeline audit, error analysis, synthetic data generation, LLM-as-Judge design, evaluator validation, RAG evaluation, and annotation interfaces.
Manage Sprites - persistent, isolated Linux microVMs for safe code execution
Generate excellent README files following best practices from awesome-readme
Obsidian CLI tools for plugin development, testing, and vault automation
Browser automation via the rodney CLI for web scraping, frontend verification, and page interaction
AI agent evaluation toolkit for Copilot Studio. Plan evals, generate test cases, interpret results, and triage failures — grounded in Microsoft's Eval Scenario Library and Triage & Improvement Playbook.
Measure AI output quality, user satisfaction, task success, and design effectiveness.
Own this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claimOwn this plugin?
Verify ownership to unlock analytics, metadata editing, and a verified badge. GitHub access is read-only (username + org membership).
Sign in to claim