From thinking-skills
Reviews code, auth, and APIs for security vulnerabilities using adversarial thinking. Enforces concrete reproducible attack paths for findings.
How this skill is triggered — by the user, by Claude, or both
Slash command
/thinking-skills:thinking-red-teamThe summary Claude sees in its skill listing — used to decide when to auto-load this skill
Red teaming is **adversarial security review**: deliberately attacking a system you control to find vulnerabilities before an attacker does. This skill is scoped to security/code vulnerability detection — it is NOT for plan stress-testing or decision challenge (use `thinking-pre-mortem` or `thinking-steel-manning` for those).
Red teaming is adversarial security review: deliberately attacking a system you control to find vulnerabilities before an attacker does. This skill is scoped to security/code vulnerability detection — it is NOT for plan stress-testing or decision challenge (use thinking-pre-mortem or thinking-steel-manning for those).
The anti-fabrication gate is the most important rule: every reported finding MUST include a concrete, reproducible attack path — entry point → exact steps → realized impact. If you cannot describe how the attack actually executes against this code/config, it is not a finding. Drop it. A short report of real, demonstrable vulnerabilities beats a long list of speculation.
Core Principle: Attack yourself before others do. But only report what you can actually break.
Decision flow:
Reviewing a system for security?
→ Can you demonstrate an attack path? → No → Don't report it
→ Is the finding reproducible against this code/config? → No → Drop it
→ Is this plan stress-testing or decision challenge? → Yes → Use thinking-pre-mortem or thinking-steel-manning
→ No → RED TEAM IT
thinking-pre-mortem. For "what's the strongest case against this decision," use thinking-steel-manning.thinking-systems or thinking-pre-mortem.Target: [System/component under review]
Scope: [What to attack — be specific]
Out of scope: [What to skip]
Goal: [What constitutes a successful attack — e.g., unauthorized access, data exfiltration, privilege escalation]
Identify who would attack this system and what they want:
Adversary profiles:
- External attacker (no access): targets public endpoints, auth bypass, injection
- Authenticated user (basic access): targets privilege escalation, IDOR, business logic
- Insider (elevated access): targets data exfiltration, audit bypass
For each profile, ask: "If I wanted to cause maximum damage with their access level, how would I?"
Map every entry point and trust boundary:
| Surface | Exposure | Trust Boundary |
|---|---|---|
| Login form | Public internet | Anonymous → Authenticated |
| API /graphql | Public (with key) | Authenticated → Application |
| Password reset flow | Public | Anonymous → Account owner |
| Admin panel | Internal network | Authenticated → Admin |
| File upload endpoint | Authenticated | User data → Server storage |
For each attack surface, attempt to break the system:
Attack: Credential stuffing against login
Steps:
1. Obtain breach database of known credentials
2. Script requests against /login with varied credentials
3. Observe rate limiting, account lockout, timing differences
Findings:
- Rate limiting: 10 req/min per IP (bypassable via distribution)
- Account lockout: none
- Timing: response time reveals valid vs invalid usernames ← VULNERABILITY
For each attack scenario, record: the entry point, exact steps taken, observed behavior, and whether a vulnerability was found.
For each defense encountered, try to bypass it:
| Defense | Bypass Attempt | Result |
|---|---|---|
| IP-based rate limiting | Distribute requests across IPs | BYPASSED |
| Input validation | Unicode normalization attacks | RESISTED |
| Session timeout | Replay expired token with modified timestamp | BYPASSED |
For EVERY finding, fill out:
Finding: [Title]
Severity: Critical / High / Medium / Low
Attack path (REQUIRED):
Entry point: [URL, endpoint, parameter, file]
Steps: [1. Send X, 2. Observe Y, 3. Escalate to Z]
Realized impact: [What the attacker actually achieves — data access, privilege, DoS]
Remediation: [Concrete fix]
If you cannot fill out the attack path completely, drop the finding. Do not report "Missing HttpOnly flag" as a standalone finding without demonstrating session token theft. Do not report "No rate limiting" without demonstrating a viable brute-force attack.
A completed Red Team report produces:
Findings without a complete attack path are excluded from the report. A report with zero findings is acceptable if no reproducible vulnerabilities were discovered.
| Anti-Pattern | Symptom | Correction |
|---|---|---|
| Speculative claims | Findings that say "could be vulnerable" or "best practice says" without a demonstrated attack path | Drop them; only report what you can actually break |
| Padding the report | Adding "informational" or "best-practice" items to look thorough | A short report of real vulns beats a long list of theory |
| Non-security red-teaming | Using this skill for plan stress-testing or decision challenge | Use thinking-pre-mortem (plans) or thinking-steel-manning (decisions) |
| Skipping the adversary model | Attacking without defining who the attacker is and what access they have | Define the adversary profile first; attacks make sense only in context |
| Missing the attack path | Reporting a vulnerability without showing how to reach it | Every finding needs: entry point → steps → impact |
| Scanner-as-substitute | Running a SAST tool and reporting its output without adversarial thinking | The tool finds patterns; red-teaming finds exploitable paths |
npx claudepluginhub tjboudreaux/cc-thinking-skills --plugin thinking-skillsSubmits code, architectures, or configs to LLMs for adversarial security analysis. Iterates until findings are resolved or PASSed.
Evaluates threats, vulnerabilities, and missing protections using STRIDE and OWASP Top 10. Designed for use by review orchestrators, not direct invocation.
Performs security audits, hardening, threat modeling (STRIDE/PASTA), OWASP checks, code review, incident response, and infrastructure security for any project. Operates in audit, threat-model, approve, block, and monitor modes.