name: research license: MIT description: >- Focused research investigations. Converts questions into structured findings with confidence levels and source citations. Single agent by default; with --parallel (or when the question decomposes into 3+ independent angles) it spawns scout agents whose findings are compressed into a unified brief. Does not make decisions; produces information that informs the next step. user-invocable: true auto-trigger: false trigger_keywords: - research - investigate - look into - find out - research fleet - parallel research - multi-angle research - compare options last-updated: 2026-06-11
/research — Focused Investigation
When to Use
- Evaluating whether a dependency has a newer version or has been superseded
- Finding community best practices for a specific technical problem
- Reading official documentation for an API or library
- Investigating how other projects solve a similar problem
- Checking if a pattern used in the codebase has known issues
- Any time you need external information before making a decision
Don't use when: you need to act on findings immediately (use /marshal which calls /research internally).
Modes
Single (default): one agent, 2-4 queries, 3-6 sources. Steps 1-5 below.
Parallel (/research --parallel): scout agents investigate independent
angles of the same question using Fleet wave mechanics. See Parallel Mode
below. Without the flag, prefer parallel mode (and say so in the plan) when
the question naturally decomposes into 3+ independent angles: evaluating
multiple competing technologies or approaches, distinct sub-questions that
don't depend on each other, or time-sensitive research where parallel
execution matters.
If the question is narrow and focused, stay single-agent. Don't parallelize what a single agent can answer in 5 minutes.
Protocol (Single Mode)
Step 1: FORMULATE
Convert the research question into 2-4 specific search queries:
- Official docs query (e.g., "express.js middleware error handling docs")
- Community/GitHub query (e.g., "express error middleware best practices site:github.com")
- Technical blog/comparison query (e.g., "express vs fastify error handling 2025")
- Release notes query if version-specific (e.g., "express 5.x changelog breaking changes")
State the question clearly in one sentence before searching.
Step 2: SEARCH
Execute searches and read actual content (not just snippets):
- Use WebSearch for discovery, WebFetch for reading actual pages
- Evaluate source credibility: official docs > GitHub repos with stars > recent blog posts > forum answers
- Stop at 3-6 credible sources (not exhaustive — focused)
- If a source contradicts another, note the disagreement
The WebFetch Restrictions under Parallel Mode apply in single mode too: never WebFetch rendered GitHub pages.
Step 3: EXTRACT
For each finding, record:
- What: The specific fact, recommendation, or pattern discovered
- Source: URL or reference
- Relevance: How this applies to the original question (one sentence)
- Confidence: high (official docs, verified), medium (community consensus), low (single source, opinion)
- Action: What the codebase should do with this information (or "informational only")
Step 4: WRITE
Write findings to .planning/research/{topic-slug}.md:
# Research: {Topic}
> Question: {The original question}
> Date: {ISO date}
> Confidence: {overall: high/medium/low}
## Findings
### 1. {Finding title}
**What:** {description}
**Source:** {URL}
**Confidence:** {high/medium/low}
**Action:** {recommendation or "informational"}
### 2. {Finding title}
...
## Summary
{2-3 sentences: what was learned, what the recommendation is}
## Open Questions
{Anything that couldn't be resolved — needs human judgment or deeper investigation}
Step 5: RETURN
Return the summary and recommendation to the caller (user, Marshal, or Archon). The research document persists for future reference.
Parallel Mode (--parallel)
Formerly /research-fleet, which is now a deprecated stub pointing here. Spawns multiple scout agents, each investigating a different angle of the same question. Findings are compressed between waves. Produces a unified research brief from multiple independent perspectives.
Inputs: the question, plus optional angles (specific sub-questions to investigate). If not provided, decompose the question into 3-5 angles automatically.
Step P1: DECOMPOSE
Break the research question into 3-5 independent angles:
Example: "Should we migrate from Express to Fastify?"
- Scout 1: Performance benchmarks (Express vs Fastify vs Hono, latest data)
- Scout 2: Migration effort (breaking changes, middleware compatibility, ecosystem)
- Scout 3: Community health (GitHub stars trend, npm downloads, maintainer activity)
- Scout 4: Production war stories (who migrated, what broke, was it worth it)
Each angle must be:
- Independent (scout doesn't need another scout's findings to do its work)
- Specific (one clear question per scout)
- Answerable (3-6 sources should be sufficient)
Step P2: DEPLOY WAVE 1
Spawn one scout agent per angle using Fleet wave mechanics:
For each scout:
- Create an isolated worktree
- Inject the scout's specific angle as the research question
- Each scout follows the single-mode protocol (formulate, search, extract, write)
- Each scout writes its findings to
.planning/research/fleet-{slug}/{angle-slug}.md
All scouts run in parallel. Wait for all to complete.
Step P3: COMPRESS
After Wave 1 completes:
- Read all scout findings
- Identify:
- Consensus: findings that multiple scouts independently confirmed
- Conflicts: findings that contradict each other (flag these prominently)
- Gaps: angles that didn't produce strong results (consider a Wave 2)
- Surprises: unexpected findings that change the framing of the question
- Compress into a unified brief (~500 tokens)
Step P4: WAVE 2 (Optional)
If gaps or conflicts exist:
- Spawn targeted scouts to resolve specific conflicts or fill gaps
- Each Wave 2 scout receives the compressed brief from Wave 1 as context
- Wave 2 scouts don't re-research what Wave 1 already covered
Skip Wave 2 if Wave 1 produced clear, consistent findings.
Step P5: REPORT
Write the unified report to .planning/research/fleet-{slug}/REPORT.md:
# Research Fleet: {Topic}
> Question: {The original question}
> Date: {ISO date}
> Scouts: {N} across {waves} wave(s)
> Confidence: {overall: high/medium/low}
## Consensus Findings
{Findings confirmed by 2+ scouts}
## Conflicts
{Findings where scouts disagreed — present both sides}
## Key Findings by Angle
### {Angle 1}: {title}
{Summary from scout 1}
Source: {scout report path}
### {Angle 2}: {title}
{Summary from scout 2}
Source: {scout report path}
...
## Recommendation
{2-3 sentences: what the evidence says, what the recommendation is}
## Open Questions
{What couldn't be resolved — needs human judgment}
Also log to .planning/telemetry/agent-runs.jsonl:
{"event":"research-fleet-complete","slug":"{slug}","scouts":0,"waves":0,"timestamp":"ISO"}
Safety Rules (Parallel Mode)
- Maximum 5 scouts per wave (don't burn tokens on diminishing angles)
- Maximum 2 waves (if Wave 2 doesn't resolve it, the question needs human judgment)
- Each scout follows the single-mode quality gates (sources, confidence, evidence)
- Scout findings are independent. No scout reads another scout's output during the same wave.
- Scout timeout: 15 minutes (configurable via
harness.jsonagentTimeouts.research). If a scout exceeds its timeout, skip it and proceed with other scouts' results. A timed-out scout's angle becomes a "Gap" in the final report.
WebFetch Restrictions
Every scout prompt MUST include this instruction:
Do NOT use WebFetch on GitHub repository pages (github.com/{user}/{repo}). These pages are massive HTML documents (500KB+) that hang the fetcher indefinitely. Instead:
- Use WebSearch to find information about repos (search snippets contain what you need)
- If you need a repo's README content, fetch the raw URL:
https://raw.githubusercontent.com/{user}/{repo}/{branch}/README.md- Never fetch rendered GitHub pages: issues, pull requests, repo root, or file views
This restriction exists because a real research-fleet run hung for 38+ minutes
on WebFetch(https://github.com/jehna/readme-best-practices) with zero output.
The circuit breaker didn't catch it because the tool didn't fail — it just
never completed.
What /research Does NOT Do
- Make architectural decisions (that's the caller's job)
- Install packages or modify code
- Search exhaustively (2-4 queries, 3-6 sources per agent, done)
- Evaluate subjective opinions as facts
- Recommend without evidence
Fringe Cases
- No web access available: Fall back to local-only research. Search the codebase, read docs files, check
package.json, and produce findings from local sources. In parallel mode, run all scouts in local-only mode. Note the limitation in the findings document's confidence level. - Search returns nothing relevant: Broaden the query (remove version-specific terms, try synonyms), try one more angle. If still empty, report uncertainty explicitly: "No strong evidence found. Recommend human review." A parallel-mode scout that comes up empty reports "No strong evidence found for this angle" rather than fabricating findings; the gap becomes an Open Question in the final report.
.planning/research/does not exist: Create it (in parallel mode, including thefleet-{slug}/subdirectory) before writing any findings. Never error on a missing output directory.- Conflicting sources: Surface the conflict explicitly in the findings rather than silently picking one. Both sides belong in the document.
- Question is too broad for 3-6 sources: Escalate to parallel mode, or narrow to the single most important sub-question, answer it well, and note what was scoped out.
- Question decomposes into fewer than 3 independent angles: Stay in (or fall back to) single mode rather than forcing artificial parallelism, even if --parallel was passed.
- A scout times out (parallel mode): Treat its angle as a gap. Record it in the final report's Open Questions section. Do not block the rest of the wave.
Contextual Gates
Reversibility: Green. Writes files under .planning/research/ only; delete to undo.
Cost: Single mode spawns no agents and needs no confirmation. Parallel mode spawns up to 5 scouts per wave (max 2 waves); state the scout count before deploying.
Trust: No gates. Read-only investigation plus report files, safe at all trust levels.
Quality Gates
- Every finding must have a source URL
- Confidence levels must be justified (not guessed)
- Summary must answer the original question or state why it can't be answered
- Research document must be written before returning findings
Parallel mode additionally:
- Every scout must produce a findings document
- Conflicts must be explicitly flagged, not silently resolved
- The compressed brief must be written before spawning Wave 2
Exit Protocol
Output findings summary, then the HANDOFF for the mode that ran.
Single mode:
---HANDOFF---
- Research: {topic}
- Findings: {count} sources analyzed
- Recommendation: {one-line summary}
- Document: .planning/research/{slug}.md
- Reversibility: green — one file written to .planning/research/; delete to undo
---
Parallel mode:
---HANDOFF---
- Research (parallel): {topic}
- Scouts: {N} across {waves} wave(s)
- Consensus: {one-line summary of agreed findings}
- Conflicts: {any unresolved disagreements}
- Recommendation: {one-line}
- Report: .planning/research/fleet-{slug}/REPORT.md
- Reversibility: green — delete `.planning/research/fleet-{slug}/` to undo
---