By thinkwee
Diagnose and fix issues in reinforcement learning training and evaluation for LLM agents, using a corpus-anchored symptom-to-fix knowledge base to accelerate troubleshooting in single/multi-agent, multi-turn, tool-augmented scenarios.
npx claudepluginhub thinkwee/claude-plugins --plugin agents-meet-rl

Pre-submission auditor for academic papers: verify references actually exist (catch AI-hallucinated and retracted citations), check internal faithfulness (numbers match the tables, figures match the prose, no broken citations/refs), check LaTeX formatting/writing/anonymization, and enforce venue-specific rules (page limits, mandatory sections, checklists) for ACL/EMNLP/NAACL/CVPR/ICCV/ECCV/NeurIPS/ICML/ICLR.
Design patterns for the Langroid multi-agent LLM framework
RL routing + Thompson Sampling bandit for AgentDB. 9 algorithms (Q-Learning, SARSA, DQN, PPO, Actor-Critic, Policy Gradient, Decision Transformer, MCTS, Model-Based RL); /learn-task, /route-task.
LLM post-training — unified interface for SFT, OSFT, LoRA fine-tuning, and GRPO reinforcement learning
Editorial "Agent Architect" bundle for Claude Code from Antigravity Awesome Skills.
ML engineering plugin: Give your AI coding agent ML engineering superpowers.
LLM application development with LangGraph, RAG systems, vector search, and AI agent architectures for Claude 4.6 and GPT-5.4