create-literature-report - SKILL.md Agent Skill

name: create-literature-report version: 2.0.0 description: > Reproducible scientific literature research report with citation-verified, hallucination-free synthesis. 6-phase pipeline composing Markov novelty writing, parallel web research, and zero-tolerance citation verification. last_updated: 2026-03-19 activation: See ACTIVATION-CHECKLIST.md invocation:

"create literature report on [topic]"
"research report on [topic]"
"literature survey for [topic]" revision_notes: > v2.0.0: Decomposed monolith into protocols + prompts + state schema. Added ACTIVATION-CHECKLIST, agent prompt templates, WebFetch failure handling, deduplication algorithm, FORCE_STOP protocol, workspace structure. Addresses 19 architectural gaps from 3-reviewer audit. metadata: category: research tags: [literature, citations, web-search, synthesis, hallucination-free] relations: compose_with:
- prd-update
- kg-sync depend_on:
- fetch-docs similar_to: [] belong_to: []

create-literature-report v2.0.0

Zero-hallucination scientific literature synthesis in 6 phases.

Quick Start

Run ACTIVATION-CHECKLIST.md pre-flight
Execute phases 0→5 sequentially
State file enables crash recovery at any sub-step

Parameters

Parameter	Required	Default	Description
`topic`	Yes	—	Research topic or question
`seed_papers`	Yes	—	Markdown list: `- [Author (Year). "Title."](URL)` per line
`output_path`	Yes	—	Path for the .md report file
`repo_context`	No	Current repo	Which repo/manuscript this informs
`target_paper_count`	No	50	Target total papers (seeds + discovered)
`quality_target`	No	MINOR_REVISION	Convergence: MAJOR_REVISION / MINOR_REVISION / ACCEPT
`create_issues`	No	true	Create GitHub issues from findings
`update_kg`	No	false	Propose KG decision node updates
`max_iterations`	No	3	Max council review iterations
`biblio_search_paths`	No	[]	Additional local bibliography directories to search
`report_length_words`	No	6000	Target report body length in words

Architecture: 6-Phase Pipeline

Phase 0: CAPTURE & ALIGN         (~5 min)
    ↓
Phase 1: GATHER (parallel)       (~15 min)
    ↓
Phase 2: SYNTHESIZE              (~15 min)
    ↓
Phase 3: VERIFY (parallel)       (~10 min)
    ↓
Phase 4: REVIEW (iterative)      (~10 min, optional)
    ↓
Phase 5: DELIVER                 (~5 min)

Total time budget: <60 minutes for 50-paper report.

Phase Protocols

Each phase has a dedicated protocol file with exact steps:

Phase	Protocol	Purpose
0	`protocols/phase-0-capture.md`	Save prompt, alignment questions, state init
1	`protocols/phase-1-gather.md`	Parallel research agents, deduplication
2	`protocols/phase-2-synthesize.md`	Markov novelty writing rules
3	`protocols/phase-3-verify.md`	Citation URL/title/author verification
4	`protocols/phase-4-review.md`	Iterated council convergence loop
5	`protocols/phase-5-deliver.md`	Issues, KG updates, final commit

Agent Prompt Templates

Agent	Template	Used in
Local inventory	`prompts/gather-local-inventory.md`	Phase 1
Web research	`prompts/gather-web-research.md`	Phase 1
Citation verifier	`prompts/verify-citation.md`	Phase 3
Review: domain	`prompts/review-domain-expert.md`	Phase 4
Review: novelty	`prompts/review-novelty-assessor.md`	Phase 4

State Management

Schema: state/literature-report-state.schema.json
Example: state/example-state.json
Location: state/literature-report-{topic_slug}-state.json
Sub-step tracking enables precise crash recovery

Workspace Structure

state/
  literature-report-{topic_slug}-state.json
workspace/literature-report-{topic_slug}/
  phase-1-gather/
    local-inventory.json       # Structured inventory from local agent
    web-research-a.json        # Papers from research agent A
    web-research-b.json        # Papers from research agent B
    deduplicated-papers.json   # Final merged, deduplicated list
  phase-3-verify/
    verification-batch-1.json  # Results from verifier agent 1
    verification-batch-2.json  # Results from verifier agent 2
    verification-batch-3.json  # Results from verifier agent 3
    corrections-log.json       # All corrections applied
  phase-4-review/
    iter-1/
      review-domain.md         # Domain expert feedback
      review-novelty.md        # Novelty assessor feedback
      synthesis.md             # L2 synthesis
      verdict.md               # L1 verdict

Convergence Criteria

Target	Criteria	Max Iterations
MAJOR_REVISION	0 hallucinated citations, body text coherent	1
MINOR_REVISION	Above + Markov novelty score ≥2.0/section	2
ACCEPT	Above + all "So What?" criteria met, ≥3 cross-domain insights	3

FORCE_STOP Protocol

If any phase exceeds its time budget or hits 3 consecutive failures:

Save current state with "force_stopped": true, "reason": "..."
Git commit all work so far
Report to user: phase, substep, failure reason, recommended action
Do NOT continue to next phase
On resume: read state, fix the blocking issue, restart current phase

Anti-Patterns (BANNED)

Pattern	Detection	Prevention
Memory-based citations	Phase 3 verifies ALL URLs	Never write a citation without a URL
Annotated bibliography	Phase 4 novelty reviewer	"So What?" test per section
Serial verification	Phase 3 runs ALL in parallel	GATHER→CATEGORIZE→FIX pattern
Skip verification	Phase 3 is MANDATORY	State blocks Phase 4 without Phase 3
Unattributed numbers	Phase 4 domain reviewer	Every number needs `[Author (Year)]`
Generic gaps	Phase 4 novelty reviewer	"More research needed" is BANNED
Citation spam	Phase 2 rules	Max 3 inline citations per sentence

Eval Criteria

Quantitative (automated)

Metric	Target	How to measure
Hallucination rate	0%	`verified / total` from Phase 3
URL validity	100%	All links resolve (Phase 3)
Title accuracy	100%	Verified titles match (Phase 3)
Paper count	≥ target_paper_count	`state.verified_count`

Qualitative (reviewer agents)

Criterion	Target	How to measure
No annotated bibliography	Pass	Phase 4 novelty reviewer
Cross-domain synthesis	≥3 domains connected	Phase 4 novelty reviewer
Specific research gaps	≥3 gaps (not generic)	Phase 4 novelty reviewer
Repo-contextualized	All findings mapped to repo	Phase 4 domain reviewer

Regression test

Run on biomedical-agentic-ai topic with original 27 seeds. Expected: 0 hallucinations, ≥55 verified papers, ≥3 novel insights.

Sub-Skill Dependencies

Skill	Location	Used for
engaging-review-writer	sci-llm-writer	Markov novelty rules (Phase 2)
citation-content-verifier	sci-llm-writer	Verification protocol (Phase 3)
iterated-llm-council	sci-llm-writer	Review loop (Phase 4)

Note: These skills provide principles that are inlined into this skill's protocol files. This skill is self-contained — it does NOT require the sci-llm-writer repo to be present at runtime.

Version History

Version	Date	Changes
2.0.0	2026-03-18	Decomposed into protocols + prompts + state schema. 19 gaps fixed.
1.0.0	2026-03-18	Initial release (monolithic).