Skill

autoresearch

Autonomous iterative research loop. Takes a topic, runs web searches, fetches sources, synthesizes findings, and files everything into the wiki as structured pages. Based on Karpathy's autoresearch pattern: program.md configures objectives and constraints, the loop runs until depth is reached, output goes directly into the knowledge base. Triggers on: "/autoresearch", "autoresearch", "research [topic]", "deep dive into [topic]", "investigate [topic]", "find everything about [topic]", "research and file", "go research", "build a wiki on".

Invocation

How this skill is triggered — by the user, by Claude, or both

Slash command

/lean-vault:autoresearch

User invocable

Model invocable

Inline context

Default effort

Tool Access

This skill is limited to the following tools:

ReadWriteEditGlobGrepWebFetchWebSearch

Context Preview

The summary Claude sees in its skill listing — used to decide when to auto-load this skill

You are a research agent. You take a topic, run iterative web searches, synthesize findings, and file everything into the wiki. The user gets wiki pages, not a chat response.

Supporting Files

references/program.md

SKILL.md

262 lines · ~2.7k tokens

Stats

LanguageShell

Stars0

MaintenanceGood

Last CommitJun 25, 2026

Actions

View Source View Plugin View on GitHub View README

Stats

Actions

autoresearch: Autonomous Research Loop

You are a research agent. You take a topic, run iterative web searches, synthesize findings, and file everything into the wiki. The user gets wiki pages, not a chat response.

This is based on Karpathy's autoresearch pattern: a configurable program defines your objectives. You run the loop until depth is reached. Output goes into the knowledge base.

Transport

This fork pins the filesystem transport. The research loop writes source / concept / entity pages and manifest updates with Claude's Write/Edit tools using vault-rooted paths. No Obsidian CLI, no REST API. Web fetches (WebFetch/WebSearch) are transport-agnostic.

Mode awareness

Before filing research output, consult the vault's methodology mode via python3 scripts/wiki-mode.py route research "<topic>". The router returns the vault-relative path:

PARA (this fork's mode): wiki/resources/<topic>/<topic>.md (topic-named subfolder under resources)

If .vault-meta/mode.json is absent, the router defaults to PARA. Always route via the helper rather than hardcoding paths.

When the research session produces multiple entity / concept pages alongside the main synthesis, route EACH via the appropriate router call (route entity / route concept), not just the synthesis page. Mode awareness applies to every new file the loop creates.

Web egress hygiene (v1.8.2+)

Autoresearch calls WebFetch and WebSearch to pull arbitrary URLs. Before each fetch and before writing fetched content to the vault, apply these guards:

1. URL validation. Reject these schemes and targets:

file://, javascript:, data: schemes — fetch only http(s)://
RFC1918 private addresses (10.x.x.x, 172.16-31.x.x, 192.168.x.x) and localhost/127.0.0.1 — these would target the user's internal network
Hosts not surfaced by the prior WebSearch step (be conservative; do not follow redirects to domains that never appeared in search results)

The Claude Code WebFetch tool has built-in defenses against many of these. Apply them here as defense-in-depth.

2. Content sanitization before writing fetched HTML into a wiki page. Fetched content can contain prompt-style injections, fake wikilinks, or executable code fences. Before any Write to wiki/sources/<source>.md:

Strip <script>, <iframe>, <style> tags and their contents
Escape [[ and ]] in the source body so adversarial content cannot inject wikilinks into the vault's link graph (encode as \[\[ or HTML-entity [[)
Reject any --- YAML-frontmatter delimiter inside fetched content — the source page's frontmatter is authored by the loop, not by the upstream source
Truncate fetched bodies to ~50KB to avoid context blowout

3. Per-loop cost expectation. A full autoresearch run is up to 3 rounds × 5 sources × 3 angles ≈ 45 WebFetch calls. WebFetch is metered through the Anthropic plan. The max_pages: 15 cap in references/program.md limits FILING cost but does NOT cap FETCH count. Surface the budget expectation to the user before kicking off research on a high-cost topic.

4. Failure mode. If a fetch fails (timeout, 4xx/5xx, content too large, sanitization removed everything), log the URL + reason to wiki/log.md and continue the loop. Do NOT abort the whole run. Do NOT silently swallow — every skipped source is a fact the user needs in the synthesis page's "Open Questions" section.

The router (python3 scripts/wiki-mode.py route) already sanitizes the topic-derived FILENAME via safe_name(). This section adds the second layer: BODY-content hygiene for fetched pages.

Concurrency (v1.7+)

The research loop is a high write-rate skill (often 10-30 page writes per topic). Every wiki page write MUST be preceded by wiki-lock acquire <path>:

bash scripts/wiki-lock.sh acquire wiki/sources/<slug>.md || sleep 2 && bash scripts/wiki-lock.sh acquire wiki/sources/<slug>.md
# … write via §Transport-selected method …
bash scripts/wiki-lock.sh release wiki/sources/<slug>.md

If autoresearch is invoked in parallel (e.g., two /autoresearch commands fired at once on overlapping topics), the locks ensure that the same source/concept/entity page is written by only one loop at a time. The losing acquire skips that page for the current pass and logs wiki/log.md; the page will be picked up in the next iteration of the winning loop's pass.

See skills/wiki-ingest/SKILL.md §Concurrency for the full lock semantics.

Before Starting

Read references/program.md to load the research objectives and constraints. This file is user-configurable. It defines what sources to prefer, how to score confidence, and any domain-specific constraints.

Topic Selection

Three paths to a topic:

A. Explicit topic (always respected)

When the user says /autoresearch [topic] or "research X", use the given topic verbatim and skip the section below.

B. User-chosen (default)

When /autoresearch is invoked without a topic, ask: "What topic should I research?"

Research Loop

Input: topic (from Topic Selection, above)

Round 1. Broad search
1. Decompose topic into 3-5 distinct search angles
2. For each angle: run 2-3 WebSearch queries
3. For top 2-3 results per angle: WebFetch the page
4. Extract from each: key claims, entities, concepts, open questions

Round 2. Gap fill
5. Identify what's missing or contradicted from Round 1
6. Run targeted searches for each gap (max 5 queries)
7. Fetch top results for each gap

Round 3. Synthesis check (optional, if gaps remain)
8. If major contradictions or missing pieces still exist: one more targeted pass
9. Otherwise: proceed to filing

Max rounds: 3 (as set in program.md). Stop when depth is reached or max rounds hit.

Filing Results

After research is complete, create these pages:

wiki/sources/. One page per major reference found

Use source frontmatter (type, source_type, author, date_published, url, confidence, key_claims)
Body: summary of the source, what it contributes to the topic

wiki/concepts/. One page per significant concept extracted

Only create a page if the concept is substantive enough to stand alone
Check the index first: update existing concept pages rather than creating duplicates

wiki/entities/. One page per significant person, org, or product identified

Check the index first: update existing entity pages

wiki/questions/. One synthesis page titled "Research: [Topic]"

This is the master synthesis. Everything comes together here.
Sections: Overview, Key Findings, Entities, Concepts, Contradictions, Open Questions, Sources
Full frontmatter with related links to all pages created in this session

Synthesis Page Structure

---
type: synthesis
title: "Research: [Topic]"
created: YYYY-MM-DD
updated: YYYY-MM-DD
tags:
  - research
  - [topic-tag]
status: developing
related:
  - "[[Every page created in this session]]"
sources:
  - "[[wiki/sources/Source 1]]"
  - "[[wiki/sources/Source 2]]"
---

# Research: [Topic]

## Overview
[2-3 sentence summary of what was found]

## Key Findings
- Finding 1 (Source: [[Source Page]])
- Finding 2 (Source: [[Source Page]])
- ...

## Key Entities
- [[Entity Name]]: role/significance

## Key Concepts
- [[Concept Name]]: one-line definition

## Contradictions
- [[Source A]] says X. [[Source B]] says Y. [Brief note on which is more credible and why]

## Open Questions
- [Question that research didn't fully answer]
- [Gap that needs more sources]

## Sources
- [[Source 1]]: author, date
- [[Source 2]]: author, date

After Filing

Update wiki/index.md. Add all new pages to the right sections

Append to wiki/log.md (at the TOP):

## [YYYY-MM-DD] autoresearch | [Topic]
- Rounds: N
- Sources found: N
- Pages created: [[Page 1]], [[Page 2]], ...
- Synthesis: [[Research: Topic]]
- Key finding: [one sentence]

Update wiki/hot.md with the research summary

Report to User

After filing everything:

Research complete: [Topic]

Rounds: N | Searches: N | Pages created: N

Created:
  wiki/questions/Research: [Topic].md (synthesis)
  wiki/sources/[Source 1].md
  wiki/concepts/[Concept 1].md
  wiki/entities/[Entity 1].md

Key findings:
- [Finding 1]
- [Finding 2]
- [Finding 3]

Open questions filed: N

Constraints

Follow the limits in references/program.md:

Max rounds (default: 3)
Max pages per session (default: 15)
Confidence scoring rules
Source preference rules

If a constraint conflicts with completeness, respect the constraint and note what was left out in the Open Questions section.

How to think (10-principle mapping)

When working on this skill, apply the 10-principle loop. See skills/think/SKILL.md for the canonical framework.

#	Principle	Application here
1	OBSERVE (ext)	Read `references/program.md` to load constraints. Read the topic verbatim. Note what's already in the wiki.
2	OBSERVE (int)	Am I steering the search toward what I already expect to find? Confirmation bias kills research.
3	LISTEN	The user's framing + cultural context + the counter-position the user might NOT have considered.
4	THINK	3-5 distinct search angles that cover the topic without overlap; credibility-weighted source filter.
5	CONNECT (lat)	Cross-source corroboration vs contradiction — the synthesis lives at the intersection, not in any single source.
6	CONNECT (sys)	WebFetch + WebSearch + §Web egress hygiene + wiki-mode router + wiki-lock for multi-writer safety.
7	FEEL	30 pages of low-signal noise wastes the user's time and Anthropic plan budget. Quality over volume.
8	ACCEPT	Missing sources are part of the synthesis — file them under Open Questions, don't paper over.
9	CREATE	Synthesis page + sources + entities + concepts; full traceability per claim.
10	GROW	Open Questions feed the next research cycle; the loop is incremental, not exhaustive.

autoresearch

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

autoresearch

Invocation

Tool Access

Context Preview

Supporting Files

SKILL.md

autoresearch: Autonomous Research Loop

Transport

Mode awareness

Web egress hygiene (v1.8.2+)

Concurrency (v1.7+)

Before Starting

Topic Selection

A. Explicit topic (always respected)

B. User-chosen (default)

Research Loop

Filing Results

Synthesis Page Structure

After Filing

Report to User

Constraints

How to think (10-principle mapping)

Reused across plugins

Similar Skills

autoresearch: Autonomous Research Loop

Transport

Mode awareness

Web egress hygiene (v1.8.2+)

Concurrency (v1.7+)

Before Starting

Topic Selection

A. Explicit topic (always respected)

B. User-chosen (default)

Research Loop

Filing Results

Synthesis Page Structure

After Filing

Report to User

Constraints

How to think (10-principle mapping)

Similar Skills

Reused across plugins