deep-investigation - SKILL.md Agent Skill

name: deep-investigation
description: Progressive iterative research with persistent knowledge vault, entity tracking, and coverage gates. USE WHEN deep investigation, investigate, deep research, map the landscape, competitive analysis, market mapping, technology survey, thorough research.

Deep Investigation

Mode: Progressive iterative research with persistent knowledge vault
Time: 10-60 minutes | Can span multiple sessions via persistent vault files

When to Use

User says "deep investigation", "investigate [topic]", "deep research", "map the [X] landscape"
Competitive analysis, market mapping, technology surveys, threat landscape analysis
Any research needing iterative deepening -- broad discovery first, then progressive deep dives
When surface-level research is insufficient and systematic coverage matters

Concept

Progressive narrowing funnel:

Iteration 1: Broad landscape --> discover entities --> score importance --> deep-dive top entity
Iteration 2: Read vault artifacts --> pick next highest-priority entity --> deep-dive
Iteration 3+: Continue until all critical entities researched and all categories covered

Single-run: One full cycle (landscape through first deep dive).
Multi-session: User returns later; workflow detects existing vault state and continues.

Available Tools

Tool	Use For
WebSearch	Landscape queries, entity discovery, deep-dive queries
WebFetch	Deep reads of sources, entity profile research, URL verification
claude-browser MCP	Auth-walled or JS-heavy pages

Vault Location

~/.augment/MEMORY/RESEARCH/{YYYY-MM-DD}_{topic-slug}/

Vault survives across sessions. Workflow is stateless -- all state lives in files on disk.
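
A minimal sketch of how the vault path could be derived from a topic. The slug rule (lowercase, hyphen-separated) and the helper names `slugify` and `vault_dir` are assumptions for illustration; the skill itself only fixes the `{YYYY-MM-DD}_{topic-slug}` pattern.

```python
import re
from datetime import date
from pathlib import Path

# Root taken from the Vault Location section.
VAULT_ROOT = Path("~/.augment/MEMORY/RESEARCH").expanduser()


def slugify(topic: str) -> str:
    """Lowercase the topic and collapse non-alphanumeric runs into hyphens.

    Assumed rule: the skill does not define how the slug is built.
    """
    return re.sub(r"[^a-z0-9]+", "-", topic.lower()).strip("-")


def vault_dir(topic: str, started: date, root: Path = VAULT_ROOT) -> Path:
    """Return {root}/{YYYY-MM-DD}_{topic-slug} without creating it."""
    return root / f"{started.isoformat()}_{slugify(topic)}"
```

Because the date prefix is fixed when the investigation starts, a returning session would presumably look up the existing directory by slug rather than rebuild the path from today's date.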

Workflow

Step 0: Detect Iteration State

Check vault directory for existing artifacts:

LANDSCAPE.md + ENTITIES.md exist --> CONTINUATION: read ENTITIES.md
- PENDING CRITICAL/HIGH entities? --> Step 4 (Investigate)
- All CRITICAL/HIGH done but thin categories? --> Step 3 (Discover)
- All gates pass? --> Step 5 (Progress Check, then summarize)
Neither exists --> FIRST ITERATION: start at Step 1
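
The routing above can be sketched as a small function. This is an illustration, not the skill's implementation: `detect_next_step` is a hypothetical name, the two counts are assumed to come from reading ENTITIES.md, and a vault holding only one of the two files is treated as a first iteration (the skill does not specify that case).

```python
from pathlib import Path


def detect_next_step(vault: Path, pending_critical_high: int, thin_categories: int) -> int:
    """Return the workflow step number to run next.

    pending_critical_high: count of PENDING entities rated CRITICAL or HIGH.
    thin_categories: count of categories with fewer than 3 entities.
    """
    has_landscape = (vault / "LANDSCAPE.md").is_file()
    has_entities = (vault / "ENTITIES.md").is_file()

    # Assumption: anything short of both files means start from scratch.
    if not (has_landscape and has_entities):
        return 1  # first iteration: Landscape Scan
    if pending_critical_high > 0:
        return 4  # Investigate
    if thin_categories > 0:
        return 3  # Discover
    return 5  # Progress Check
```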

Step 1: Landscape Scan (First Iteration Only)

Goal: Map the full landscape. Do this once, reference cheaply in all later iterations.

Run 4-6 WebSearch queries covering: domain overview, key players, recent developments, historical context, adjacent domains, contrarian views. Deep-read top 3-4 results with WebFetch.

Produce three vault files:

LANDSCAPE.md -- Domain synthesis with sections: Overview (2-3 paragraphs), Domain Structure, Key Dynamics, Entity Categories (discovered dynamically, with estimated counts), Initial Entity Discoveries, Sources (verified URLs only).

ENTITIES.md -- Master catalog table with columns: Entity, Category, Status (PENDING/RESEARCHED/SKIP), Value (CRITICAL/HIGH/MEDIUM/LOW), Effort (EASY/MODERATE/HARD), Profile link. Include legend definitions at the top.

INDEX.md -- Navigation hub linking to LANDSCAPE.md, ENTITIES.md, and profiles as they are created. Track coverage stats: categories complete, entities researched, CRITICAL/HIGH pending.
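
One possible shape for the ENTITIES.md catalog, rendered from in-memory records. The column order and legend values come from the description above; the `Entity` dataclass, the `render_entities` helper, and the markdown pipe-table layout are assumptions.

```python
from dataclasses import dataclass


@dataclass
class Entity:
    name: str
    category: str
    status: str = "PENDING"   # PENDING / RESEARCHED / SKIP
    value: str = "MEDIUM"     # CRITICAL / HIGH / MEDIUM / LOW
    effort: str = "MODERATE"  # EASY / MODERATE / HARD
    profile: str = ""         # relative link, empty until a profile exists


def render_entities(entities: list[Entity]) -> str:
    """Render the legend followed by the master catalog table."""
    lines = [
        "Legend: Status = PENDING/RESEARCHED/SKIP; "
        "Value = CRITICAL/HIGH/MEDIUM/LOW; Effort = EASY/MODERATE/HARD",
        "",
        "| Entity | Category | Status | Value | Effort | Profile |",
        "| --- | --- | --- | --- | --- | --- |",
    ]
    for e in entities:
        link = f"[profile]({e.profile})" if e.profile else ""
        lines.append(
            f"| {e.name} | {e.category} | {e.status} | {e.value} | {e.effort} | {link} |"
        )
    return "\n".join(lines) + "\n"
```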

Step 2: Evaluate (Score Entities)

For each PENDING entity, assess two dimensions:

VALUE: CRITICAL (category definers, must research) > HIGH (major players) > MEDIUM (notable) > LOW (minor)
EFFORT: EASY (abundant public info) > MODERATE (some digging) > HARD (limited info)

Investigation priority: CRITICAL+EASY first, then CRITICAL+HARD, HIGH+EASY, HIGH+HARD (MODERATE effort sorts between EASY and HARD). Take MEDIUM and LOW entities only after all CRITICAL/HIGH entities are done. Update ENTITIES.md with the scores.
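
The priority order amounts to a two-key sort: value first, effort second. A minimal sketch, assuming entities are plain dicts with `status`, `value`, and `effort` keys (a hypothetical in-memory form of the ENTITIES.md rows):

```python
# Lower rank sorts first. MODERATE sits between EASY and HARD,
# following the order of the EFFORT scale.
VALUE_RANK = {"CRITICAL": 0, "HIGH": 1, "MEDIUM": 2, "LOW": 3}
EFFORT_RANK = {"EASY": 0, "MODERATE": 1, "HARD": 2}


def priority_key(entity: dict) -> tuple[int, int]:
    """Sort key: most valuable first, then easiest first."""
    return VALUE_RANK[entity["value"]], EFFORT_RANK[entity["effort"]]


def investigation_queue(entities: list[dict]) -> list[dict]:
    """Return PENDING entities in the order they should be investigated."""
    pending = [e for e in entities if e["status"] == "PENDING"]
    return sorted(pending, key=priority_key)
```

Sorting on the tuple keeps every CRITICAL entity ahead of every HIGH one regardless of effort, which matches the stated order.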

Step 3: Discover (Expand Coverage)

Find entities in categories with fewer than 3 entries. For each thin category:

WebSearch(query: "notable {category} in {domain} space")
WebSearch(query: "{domain} {category} key players landscape")

WebFetch the best results. Add discoveries to ENTITIES.md as PENDING, then score them (Step 2).
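
A sketch of how thin categories could be found and turned into the two queries above. It assumes dict-shaped entities and a list of category names taken from LANDSCAPE.md, and it excludes SKIP entities from the count to match the Breadth Gate in Step 5; the helper names are hypothetical.

```python
from collections import Counter

MIN_PER_CATEGORY = 3  # threshold stated in Step 3


def thin_categories(entities: list[dict], known_categories: list[str]) -> list[str]:
    """Return categories with fewer than MIN_PER_CATEGORY non-SKIP entities."""
    counts = Counter(e["category"] for e in entities if e["status"] != "SKIP")
    # Counter returns 0 for categories that have no entities yet.
    return sorted(c for c in known_categories if counts[c] < MIN_PER_CATEGORY)


def discovery_queries(category: str, domain: str) -> list[str]:
    """Fill the two Step 3 query templates for one thin category."""
    return [
        f"notable {category} in {domain} space",
        f"{domain} {category} key players landscape",
    ]
```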

Step 4: Investigate (Deep Dive -- One Entity)

Quality over quantity. Research ONE entity per iteration.

Select the highest-priority PENDING entity (VALUE desc, EFFORT asc). Run 2-3 focused WebSearch queries:

"[entity] overview analysis {domain}"
"[entity] recent news funding developments"
"[entity] vs competitors strengths weaknesses"

Deep-read top 2-3 results with WebFetch.

Save profile to vault: {Category}/{entity-slug}.md containing: overview, key facts (founding, size, funding, leadership), products, competitive advantages/weaknesses, recent developments, cross-links to related vault entities, verified source URLs.
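
A sketch of the profile path and an empty profile skeleton. The section names paraphrase the list above; the slug rule, the markdown heading layout, and the `TODO` placeholders are assumptions.

```python
import re

# Sections listed in Step 4, in order.
PROFILE_SECTIONS = [
    "Overview",
    "Key Facts",
    "Products",
    "Competitive Advantages and Weaknesses",
    "Recent Developments",
    "Related Entities",
    "Sources",
]


def profile_path(category: str, name: str) -> str:
    """Return {Category}/{entity-slug}.md relative to the vault root."""
    slug = re.sub(r"[^a-z0-9]+", "-", name.lower()).strip("-")  # assumed slug rule
    return f"{category}/{slug}.md"


def profile_skeleton(name: str) -> str:
    """Return an empty profile with one heading per required section."""
    parts = [f"# {name}", ""]
    for section in PROFILE_SECTIONS:
        parts += [f"## {section}", "", "TODO", ""]
    return "\n".join(parts)
```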

Update ENTITIES.md (mark RESEARCHED, add link) and INDEX.md (add to navigation).

Step 5: Progress Check (Coverage Gates)

Two gates must BOTH pass for completion:

Breadth Gate: Every entity category has >= 3 entities (excluding SKIP).
Depth Gate: Every CRITICAL or HIGH entity is RESEARCHED or SKIP.

Both pass: Produce SUMMARY.md (executive synthesis), update INDEX.md with final stats, report completion.
Either fails: Report current coverage, identify gaps (thin categories, pending CRITICAL/HIGH entities), ask user whether to continue or stop.
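
The two gates can be checked mechanically once ENTITIES.md has been read. A minimal sketch, assuming dict-shaped entities and a hypothetical `check_gates` helper that also returns the gaps to report:

```python
from collections import Counter


def check_gates(entities: list[dict], categories: list[str], min_per_category: int = 3) -> dict:
    """Evaluate the Breadth and Depth gates and list any gaps."""
    # Breadth: count non-SKIP entities per category.
    counts = Counter(e["category"] for e in entities if e["status"] != "SKIP")
    thin = sorted(c for c in categories if counts[c] < min_per_category)

    # Depth: every CRITICAL/HIGH entity must be RESEARCHED or SKIP.
    unresolved = sorted(
        e["name"]
        for e in entities
        if e["value"] in ("CRITICAL", "HIGH")
        and e["status"] not in ("RESEARCHED", "SKIP")
    )

    return {
        "breadth": not thin,
        "depth": not unresolved,
        "thin_categories": thin,
        "pending_critical_high": unresolved,
    }
```

Completion requires both `breadth` and `depth` to be true; otherwise the two lists give the gaps to report to the user.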

Output Artifacts

{vault}/
  INDEX.md          -- Navigation hub with coverage stats
  LANDSCAPE.md      -- Broad domain analysis
  ENTITIES.md       -- Master catalog with status tracking
  SUMMARY.md        -- Executive synthesis (on completion)
  {Category}/       -- Entity profiles by category
    entity-a.md
    entity-b.md

All profiles cross-linked. Vault is self-contained and readable standalone.

Single-Run vs Multi-Session

Aspect	Single-Run	Multi-Session
Coverage	Landscape + first deep dive	Full breadth + depth gates
Exit	After Step 4	After Step 5 gates pass
Time	10-15 minutes	30-60 min across sessions

Single-run: complete Steps 1-4, report what remains PENDING.

URL Verification

All vault URLs MUST be verified via WebFetch before inclusion. A broken link in a persistent vault is a compounding error -- it misleads every future session that reads it.
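
A sketch of the filtering step. WebFetch is an agent tool rather than a Python function, so the check is passed in as a callable; `fetch_ok` is a hypothetical stand-in that returns True when a fetch of the URL succeeds.

```python
from typing import Callable, Iterable


def verified_urls(urls: Iterable[str], fetch_ok: Callable[[str], bool]) -> list[str]:
    """Return the URLs that passed verification, deduplicated, in original order."""
    seen: set[str] = set()
    kept: list[str] = []
    for url in urls:
        if url in seen:
            continue  # verify each URL only once
        seen.add(url)
        if fetch_ok(url):
            kept.append(url)
    return kept
```

Only the returned list would be written to the Sources sections of vault files.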