name: deep-domain-learning description: "Use when building deep understanding of a new domain, producing a world model document, preparing for expert-level conversations, or needing to reason from first principles in an unfamiliar field. Encodes the Why-Chain Mandate, Quantification Mandate, Evidence Discipline, Grounding Requirement, and Narrative Flow frameworks. Produces a World Model document." version: "1.0.0" type: "codex" tags: ["Context Engineering", "Evaluation", "Meta"] created: "2026-02-25" valid_until: "2026-08-25" derived_from: "Projects/AI_Learning and Mastery/World_Model_Builder_v2.1_FINAL.md" tested_with: ["Claude Opus 4", "Claude Sonnet 4.5"] license: "MIT"
Purpose
Produce a deep, mechanistic World Model of any domain — a structured narrative document that enables the reader to predict, explain, diagnose, design, and feel the inevitability of outcomes in that domain. The output is not a summary or survey; it is a first-principles construction that traces every mechanism to bedrock.
When to Use / When NOT to Use
Use this skill when:
- Learning a new domain deeply enough to reason from first principles (not just recall facts)
- Preparing for expert-level conversations, keynotes, or advisory work in an unfamiliar field
- Building the conceptual foundation for a thesis, investment analysis, or product strategy
- An agent needs domain context before producing high-quality synthesis or analysis
- You need to identify where your intuitions about a domain are wrong
Do NOT use this skill when:
- You need a quick overview or executive summary (ask directly — this produces 10K-30K words)
- The domain is already well-understood by the reader (use Research Synthesis skill instead)
- You need tactical how-to instructions, not conceptual understanding (use a tutorial)
- The topic is narrow enough for a single Q&A exchange
- You need current news or market data (this produces timeless mechanistic understanding, not current events)
Anti-inputs (what this skill does NOT handle):
- Competitive analysis or market sizing (→ Competitive & Market Analysis skill)
- Specification of what to build (→ Specification Writing skill)
- Synthesis of multiple existing research papers (→ Research Synthesis skill)
- Opinion formation or thesis construction (→ Synthesizer agent)
Format Rules (Read First)
These rules govern every World Model document. They are not style preferences — they are quality enforcement mechanisms derived from iterative testing of domain learning outputs.
Dense explanatory prose, not bullet lists. Bullet lists are permitted ONLY for enumeration after narrative explanation. The primary content is flowing paragraphs that build understanding incrementally. A World Model that reads like a slide deck has failed.
Every non-trivial claim carries an inline citation. Format:
(Author, Year)or[Description, Date]. Uncited claims must be explicitly marked as the author's reasoning or common knowledge. A World Model with naked claims is opinion, not knowledge.Every abstraction is grounded in concrete reality. No concept is introduced without at least one observable, measurable, or experimentally demonstrated example. If you can't point to something in the physical world, the abstraction is floating.
Every quantitative claim includes actual numbers with units and context. Never say "large," "significant," "fast," or "expensive" without numbers. Contextualize numbers with comparisons, scaling relationships, and practical implications.
Epistemic status is marked explicitly on every substantive claim. Use the five-tier system: PROVEN, CONFIRMED, CONSENSUS, DEBATED, SPECULATIVE. When uncertain: UNCERTAIN with reason. A reader must know how much weight to put on each claim.
Write in flowing paragraphs that build understanding. Each paragraph makes the reader want to read the next. Structure each concept as: Hook → Build from familiar → Reveal mechanism → Make precise → Show it's real → Connect → Test intuition.
Mathematics is embedded naturally with full intuitive explanation. Every equation or formal relationship gets: (a) the formula, (b) what each variable means in plain language, (c) why the relationship has this shape, (d) what changes when you change each variable. Math without intuition is decoration.
The Opening is written LAST but placed FIRST. Only after the full document is written can you distill the essence accurately.
Output Template (Mandatory Document Skeleton)
Every World Model MUST follow this structure. Do not reorder sections, skip sections, or invent new top-level sections. Adjust page lengths based on the LENGTH parameter.
# Understanding [DOMAIN]: A World Model
> **Date:** [YYYY-MM-DD] | **Depth:** [Conversational / Practitioner / Expert / Frontier] | **Length:** [Focused / Comprehensive / Exhaustive]
---
## The Opening (2-4 pages)
- The core insight that changes everything
- Why this matters to your world model
- The conceptual landscape and journey ahead
- Where your intuition misleads you
- The best analogy (and where it breaks)
## Part I: Bedrock — What Cannot Be Otherwise
[3-5 fundamental constraints, 5-8 essential concepts]
For each constraint:
- State it and why it's inescapable — trace to deeper principle
- Quantify it with real numbers and context
- Show what would break if it weren't true — thought experiment
- Historical attempts to circumvent it and what happened
- How it shapes everything built above — trace consequences upward
- Observable manifestations in real systems
For each concept:
- Why this concept exists — historical problem it solved
- Build it from primitives — construction or measurement
- Precise definition with math, fully explained
- Why this definition and not plausible alternatives
- The mechanism behind it at micro-level
- How to develop intuition — analogies with limitations stated
- Where intuition fails — common misconceptions with corrections
- Observable signatures — how to recognize it in practice
- Deep connections to other concepts
## Part II: The Machinery — How It Actually Works
[2-3 central processes, 3-5 major interactions]
For each process — complete walkthrough:
- Initial state with specific numbers
- Triggering event
- Each step of transformation — mechanism at micro-level, timescale, energy flows
- Parallel or dependent processes — timing and ordering constraints
- The bottleneck — rate-limiting step, why it's slowest
- Feedback loops — what feeds back where, dynamics, equilibrium
- Completion and efficiency — total time, input/output, losses quantified
- Observable in real systems — examples with data and citations
For each interaction:
- Which processes/components interact and why
- Mechanism of interaction — step-by-step with numbers
- Nonlinearities or threshold effects
- Real examples demonstrating the interaction
## Part III: What Emerges — Seeing the Whole
[2-4 emergent phenomena]
For each phenomenon:
- What you observe at system level
- Why it emerges — trace from micro-interactions to macro-behavior
- Mathematical form and what the structure tells us
- Necessary conditions for emergence
- Inevitability — why it's forced by constraints, not accidental
- Observable signatures and measurements
- Robustness — how it responds to perturbations
- Cross-domain connections — where this pattern appears elsewhere
## Part IV: The Human Reality — Context and Consequences
[3-5 major misconceptions, incentive landscapes, decision-making under uncertainty]
For each misconception:
- Why people believe this — superficial logic
- Why it's wrong — show the mechanism they're missing
- Concrete evidence or thought experiment proving it wrong
- The correct intuition — revised mental model
- Historical example of this mistake and consequences
Then:
- Incentive landscapes (stakeholder motivations, misalignments)
- Decision-making under uncertainty (epistemic status of key claims)
- Real-world implications and tradeoffs
## Part V: The Synthesis — Your Updated World Model
- Core framework unified
- Before/after mental models
- Mental simulation test: can you predict, explain, diagnose, design, and feel inevitability?
- Cross-domain transferable insights
## Part VI: Going Deeper — Your Path Forward
- Next topics to explore with specific resources
- Progressive exercises (week 1, month 2-3)
- Signposts to watch for model updates
- Meta-lessons about learning this domain
## References
[All sources cited inline throughout, compiled here]
Domain Frameworks
These five frameworks are the intellectual core of this codex. Each encodes a specific discipline that separates deep understanding from surface knowledge. A World Model produced without these frameworks reads like a Wikipedia article; with them, it reads like a masterful textbook chapter.
Framework 1: The Why-Chain Mandate
Definition: For every key mechanism or concept, recursively ask "why?" until hitting one of three bedrock types: (a) fundamental physical law (e.g., conservation of energy), (b) mathematical axiom (e.g., chain rule), (c) empirical brute fact (e.g., measured physical constant).
Scoring Rubric — Depth of Explanation:
| Depth | Name | Test | Rating |
|---|---|---|---|
| D0 | Label | Names the concept without explaining it | 🔴 Unacceptable |
| D1 | Description | Says what the concept does, not why | 🔴 Unacceptable |
| D2 | Mechanism | Explains how it works at one level | 🟡 Insufficient |
| D3 | Causal chain | Traces 2-3 "why" levels with connections | 🟡 Minimum bar |
| D4 | Bedrock trace | Hits fundamental law, axiom, or brute fact | 🟢 Target |
| D5 | Bedrock + alternatives | D4 plus explains why alternatives don't work | 🟢 Elite |
Decision Rule: Every core concept in Parts I-III must reach D4. Supporting concepts may be D3. No concept below D2 appears in the final document.
Application Example:
- ❌ D1: "Transformers use attention mechanisms to process sequences efficiently."
- ✅ D4: "Attention computes all position-to-position interactions in parallel via matrix multiplication. An RNN must pass information through n-1 intermediate hidden states — a serial dependency chain of O(n·t). Transformers compute QK^T (n×n) in a single matrix multiply, yielding O(n²d) FLOPs but O(1) depth. The dot product q·k = Σ q_i·k_i serves as a differentiable content-based addressing mechanism — maximal when vectors align, minimal when orthogonal. This is identical to the retrieval mechanism in Hopfield networks (1982). The tradeoff: O(n²) memory for O(1) depth. For GPT-3 with context length 2048, that's 4M scores per layer — costly but parallelizable. An LSTM would need 2048 sequential steps. This is why transformers scaled to 100B+ parameters while RNNs topped out at ~200M. (Vaswani et al., 2017; Kaplan et al., 2020)"
Framework 2: The Quantification Mandate
Definition: Every quantitative claim must include actual numbers with units, contextual comparisons, scaling relationships, error bars or ranges, and practical interpretation.
Scoring Rubric — Quantitative Rigor:
| Level | Name | Test | Rating |
|---|---|---|---|
| Q0 | Adjective | "Large," "significant," "fast" — no numbers | 🔴 Unacceptable |
| Q1 | Bare number | A number without context or units | 🔴 Unacceptable |
| Q2 | Number + units | Number with proper units | 🟡 Minimum |
| Q3 | Contextualized | Number + comparison to familiar reference | 🟢 Adequate |
| Q4 | Full quantification | Number + units + comparison + scaling relationship + practical meaning | 🟢 Target |
Decision Rule: Core claims must reach Q3. Key mechanisms must reach Q4. No Q0 or Q1 survives in the final document.
Application Example:
- ❌ Q0: "Training large models requires significant compute."
- ✅ Q4: "Training GPT-3 (175B parameters) required
3640 petaflop-days on V100 GPUs, costing ~$4.6M at 2020 cloud pricing ($3/hr/GPU, ~3000 GPUs, ~1 month). This is 100,000× more compute than GPT-2 (1.5B, ~40 petaflop-days). The scaling follows C ∝ N^1.0 for data-rich training (Kaplan et al., 2020)."
Framework 3: The Evidence Discipline
Definition: Every substantive claim is tagged with its epistemic status — how we know what we claim to know.
Evidence Tier Classification:
| Status | Definition | Signature | Example |
|---|---|---|---|
| PROVEN | Mathematically derived from axioms | Cannot be otherwise | "The gradient of f(g(x)) is f'(g(x))·g'(x) by the chain rule" |
| CONFIRMED | Experimentally measured and replicated | Independent replication exists | "Electron charge is 1.602176634×10⁻¹⁹ C (CODATA 2018, uncertainty <1ppb)" |
| CONSENSUS | Widely accepted by domain experts, not formally proven | Practitioner experience + limited theory | "Adam typically converges faster than SGD for deep learning (Kingma & Ba, 2014)" |
| DEBATED | Experts disagree; show both sides | Active research conflict | "Whether scaling laws continue beyond 10¹² parameters — optimists cite smooth trends, skeptics note phase transitions" |
| SPECULATIVE | Plausible but unconfirmed | Structural similarity without direct evidence | "Attention might implement biological working memory — structural similarity to hippocampal indexing, but direct evidence lacking" |
| UNCERTAIN | Conflicting evidence or insufficient data | State the reason | "Conflicting measurements — see Smith 2020 vs Jones 2021" |
Decision Rule: Every claim in Parts I-III carries an explicit epistemic tag. Part IV discusses epistemic status as a theme. Claims without tags are treated as SPECULATIVE by default.
Framework 4: The Grounding Requirement
Definition: Every abstract concept must be anchored to concrete, observable, measurable reality. No floating abstractions.
Grounding Checklist (apply to every core concept):
| Element | Question | Required? |
|---|---|---|
| Concrete example | Can I point to this in the physical world? | Yes — for every concept |
| Numbers | Can I measure it? What are the values? | Yes — at least one measurement |
| Observable phenomenon | What would I see if I watched this happen? | Yes — real or thought experiment |
| Historical grounding | When was this first observed/discovered? By whom? | Recommended |
| Modern confirmation | How do we measure/verify this today? | Recommended |
Application Example — "Entropy":
- ❌ Ungrounded: "Entropy is a measure of disorder."
- ✅ Grounded: "Take 100 coins. The macrostate 'all heads' has 1 microstate. The macrostate '50 heads, 50 tails' has ~10²⁹ microstates. S = k ln(Ω), so S_all-heads = 0 while S_50-50 ≈ 67k. Systems evolve toward 50-50 because 10²⁹:1 odds. Observable: food coloring spreads in water — never spontaneously un-mixes. Confirmed: Boltzmann 1898, Einstein 1905, modern measurements Bustamante et al., 2005."
Framework 5: Narrative Flow Architecture
Definition: A prescribed structure for introducing each concept or mechanism that builds understanding incrementally. Not a template to fill — a cognitive sequence to follow.
The Seven-Step Concept Introduction:
| Step | Name | Purpose | Example (for "attention mechanism") |
|---|---|---|---|
| 1 | Hook | Why this matters or what puzzle it solves | "How do you let a model look at any part of a sequence at any time?" |
| 2 | Build from familiar | Start with intuition | "When you read a sentence, your eyes don't process each word in strict order..." |
| 3 | Reveal the mechanism | Step-by-step causal chain | "Query, key, and value vectors computed from the input..." |
| 4 | Make it precise | Math or formal definition, fully explained | "QK^T/√d_k followed by softmax..." |
| 5 | Show it's real | Observable examples, experiments, measurements | "Attention weight visualizations show heads specializing..." |
| 6 | Connect it | How it relates to other concepts | "This is differentiable Hopfield retrieval..." |
| 7 | Test intuition | What would happen if it were different? | "Without the √d_k normalization, dot products grow with dimension..." |
Decision Rule: Core concepts in Part I must follow all 7 steps. Supporting concepts may abbreviate to steps 1, 3, 4, 5. No concept skips steps 1 and 3.
Application Method
Quick Version
- Collect the caller's domain, current state, depth target, key questions, bridge domains, and length preference.
- Research deeply — prioritize Tier 1 (papers, textbooks, primary data) over Tier 2 (documentation, case studies) over Tier 3 (secondary sources).
- Identify 3-5 bedrock constraints and 5-8 essential concepts for Part I.
- Map 2-3 central processes and 3-5 interactions for Part II.
- Write Parts I-V applying all five frameworks: Why-Chain to D4+, Quantification to Q3+, Evidence tags on every claim, Grounding for every abstraction, Narrative Flow for every concept.
- Write Part VI (Going Deeper) and the Opening (written last, placed first).
- Run the Final Verification checklist.
Full Version
Step 1: Intake and Calibration
Collect from the caller:
- DOMAIN — specific enough to scope (e.g., "transformer attention mechanisms" not "AI")
- CURRENT STATE — what they know, what confuses them, intuitions that might be wrong
- DEPTH TARGET — one of four levels:
| Level | Output Calibration | Reader Outcome |
|---|---|---|
| Conversational fluency | Explain to others, recognize in practice | Can hold their own in a discussion |
| Working practitioner | Implement, modify, debug intelligently | Can build with this knowledge |
| Deep expert | Design novel variations, see deep implications | Can innovate in this domain |
| Research frontier | Contribute new insights, push boundaries | Can advance the field |
- KEY QUESTIONS — specific questions the document must answer
- BRIDGE DOMAINS — domains the caller already understands deeply (for cross-domain analogies)
- LENGTH — Focused (10-15K words), Comprehensive (20-30K, default), Exhaustive (30K+)
Quality checkpoint: If the domain is too broad (e.g., "biology"), push back and ask for scoping. If depth target is mismatched with length (e.g., "research frontier" + "focused"), flag the tension.
Step 2: Research Phase
Before writing, build a source inventory:
- Tier 1 (strongest): Peer-reviewed papers, textbooks, primary experimental data, fundamental equations
- Tier 2: Technical documentation, expert practitioner accounts, well-documented case studies
- Tier 3: Reputable secondary sources with clear methodology
Decision point: If Tier 1 sources are unavailable for a key mechanism, flag it with DEBATED or SPECULATIVE status in the output. Never fill gaps with Tier 3 and present as CONFIRMED.
Step 3: Structural Planning
Before prose, outline:
- The 3-5 bedrock constraints (what cannot be otherwise in this domain)
- The 5-8 essential concepts (what you must understand to reason in this domain)
- The 2-3 central processes (how the domain's machinery actually works)
- The 3-5 interactions between processes
- The 2-4 emergent phenomena (what arises that isn't obvious from components)
- The 3-5 major misconceptions (where intuition fails)
Decision point: If you can't identify at least 3 bedrock constraints, the domain scoping is too narrow or too abstract. Adjust.
Step 4: Writing — Parts I through III
Apply all five frameworks simultaneously:
- Why-Chain: Every core concept traced to D4 (bedrock)
- Quantification: Every quantitative claim at Q3+ (contextualized numbers)
- Evidence: Every substantive claim tagged with epistemic status
- Grounding: Every abstraction anchored to observable reality
- Narrative Flow: Every core concept follows the 7-step sequence
Quality checkpoint per section: Read back each Part. For every paragraph, ask: "Could a smart person who read only this paragraph and nothing else learn something precise and grounded?" If the answer is no, the paragraph is filler — cut or rewrite.
Step 5: Writing — Parts IV through VI
Part IV shifts from mechanism to human context — where intuitions fail, how incentives distort understanding, what decisions the reader faces. Evidence tags become the theme, not just annotations.
Part VI must be genuinely useful — specific resources (not "read more about X"), concrete exercises with timelines, and meta-lessons the reader can apply to learning other domains.
Step 6: The Opening
Written LAST. Distill the 3-5 most important insights from the completed document. Set up the conceptual journey. Include the single best analogy for the domain and explicitly state where that analogy breaks.
Step 7: Final Verification
Run through every item in the Appendix Checklist below. Fix any gaps before delivering.
Quality Gradients
Intern Tier
- Accurate factual content but reads like a rearranged Wikipedia article
- Bullet-list heavy; concepts named but not derived
- Numbers present but not contextualized ("175 billion parameters")
- No epistemic status marking — all claims presented with equal confidence
- Analogies are surface-level ("attention is like a spotlight")
- Missing the Why-Chain — concepts explained at D1-D2 only
- Reader can recall facts but cannot reason from first principles
Consultant Tier
- Well-structured narrative with clear sections
- Some mechanisms traced through 2-3 levels (D3)
- Numbers contextualized with some comparisons
- Citations present but inconsistent
- Some epistemic marking but not systematic
- Analogies are functional but limitations not stated
- Reader can explain the domain to others but cannot predict novel outcomes
Elite Tier
- Dense narrative prose that builds understanding paragraph by paragraph
- Every core mechanism traced to bedrock (D4-D5)
- Full quantification: numbers + units + comparisons + scaling + practical meaning (Q4)
- Systematic epistemic tagging on every substantive claim
- Every abstraction grounded in observable, measurable reality
- Cross-domain analogies with explicit structural isomorphism and stated limitations
- Reader can predict, explain, diagnose, design, and feel the inevitability of outcomes
- The document itself teaches how to think about the domain, not just what to know
Failure Modes
FM-1: The Wikipedia Trap What it looks like: Document reads like a well-organized encyclopedia entry — accurate but flat. Concepts are defined, not derived. Reader finishes with facts but no working mental model. Why it happens: Writer prioritized breadth (covering all subtopics) over depth (tracing mechanisms to bedrock). Skipped the Why-Chain. Detection: Count the "why" depth of the first 5 core concepts. If average depth is D1-D2, this failure mode is active. Correction: For each concept, ask "why?" three more times and write the answers as connected paragraphs. Cut breadth to make room for depth.
FM-2: The Number Desert What it looks like: Prose is conceptually rich but devoid of actual measurements, values, or quantities. Adjectives ("significant," "rapid," "enormous") replace numbers. Why it happens: Writer treated quantification as optional decoration rather than a core discipline. Often caused by choosing a depth target that doesn't match the domain's quantitative nature. Detection: Scan Part II (Machinery) for paragraphs with zero numbers. If >30% of paragraphs lack numbers, this failure mode is active. Correction: For every adjective describing magnitude, find the actual number and substitute. Add scaling relationships and comparisons.
FM-3: Floating Abstractions What it looks like: Elegant conceptual explanations that never touch ground. The reader understands the abstract idea but can't point to it in the real world. Common in philosophy-adjacent domains. Why it happens: Writer stayed in the "Build from familiar" and "Make it precise" steps of Narrative Flow but skipped "Show it's real." Detection: For each core concept, check: is there at least one observable, measurable, or experimentally demonstrated example? If not, the concept is floating. Correction: Add a concrete grounding paragraph for each floating concept — a measurement, an experiment, a visible phenomenon.
FM-4: Epistemic Flatness What it looks like: All claims presented with identical confidence. Mathematical theorems and speculative hypotheses are grammatically indistinguishable. Reader cannot calibrate what to trust. Why it happens: Writer defaulted to assertive prose throughout without applying the Evidence Discipline. Often a result of using confident language as a stylistic choice. Detection: Search for epistemic tags (PROVEN, CONFIRMED, CONSENSUS, DEBATED, SPECULATIVE). If fewer than 10 tags in a 20K-word document, this failure mode is active. Correction: Re-read every substantive claim and tag it. Rewrite DEBATED and SPECULATIVE claims to present both sides or flag uncertainty.
FM-5: The Slide Deck
What it looks like: Document is 80%+ bullet lists with minimal connecting prose. Reads like presentation notes, not narrative explanation. Understanding doesn't build — it's scattered.
Why it happens: Writer defaulted to list format for efficiency. The Narrative Flow framework was not applied.
Detection: Count lines that start with -, *, or numbered markers vs. prose paragraphs. If bullet content exceeds narrative content by volume, this failure mode is active.
Correction: Convert core content from bullets to flowing paragraphs that build on each other. Keep bullets only for genuine enumeration (lists of items after narrative explanation).
FM-6: Bridge Domain Misuse What it looks like: Cross-domain analogies are used decoratively ("this is like X") rather than structurally ("the mechanism here is identical to X in Y, and that identity reveals Z"). Analogies don't earn their weight. Why it happens: Writer used the caller's bridge domains as metaphors rather than as structural isomorphisms. The analogy adds color but not insight. Detection: For each cross-domain analogy, ask: "Does this analogy reveal a mechanism the reader couldn't see from inside the domain alone?" If not, it's decorative. Correction: Either sharpen the analogy to a structural isomorphism (state the mapping precisely, then show what it reveals) or cut it.
FM-7: Premature Coherence What it looks like: Everything fits together too neatly. No tensions, no open questions, no acknowledged gaps. The document feels like a closed system. Why it happens: Writer smoothed over genuine debates, unresolved questions, and competing explanations to produce a "clean" narrative. Detection: Check Part IV for misconceptions and Part V for open questions. If the document has zero DEBATED or SPECULATIVE tags, this failure mode is likely active. Correction: Actively search for: competing theories, unresolved experimental results, claims that experts disagree on. Add them explicitly.
What's Next
Upstream skills that feed this one:
- None required — this skill takes a raw domain name and produces a World Model from scratch
- Optional: if the caller provides research papers, the Research Synthesis skill can pre-process them into structured evidence
Downstream skills this feeds:
- Research Synthesis — the World Model provides the conceptual framework; Research Synthesis adds current evidence
- Competitive & Market Analysis — deep domain understanding enables structural (not surface) competitive assessment
- Specification Writing — understanding the domain deeply prevents specs that solve the wrong problem
- Synthesizer agent — World Models provide the substrate for thesis construction
Chain interface:
- Receives: Domain name + caller's current state + depth target + key questions + bridge domains + length preference
- Produces: A World Model document (10K-30K+ words) following the mandatory skeleton
- Handoff artifact: The World Model document itself, stored as
{domain_slug}_world_model_{date}.md
Appendix Checklist
Run this checklist before delivering the final document:
- Structure: All six Parts present (Opening, Bedrock, Machinery, Emergence, Human Reality, Synthesis, Going Deeper)
- Opening: Written after the rest of the document; distills the core insight; includes the best analogy with stated limitations
- Why-Chain: Every core concept in Parts I-III traced to bedrock (D4+); no concept below D2
- Quantification: Every quantitative claim has actual numbers with units (Q2+); core claims have full contextualization (Q3-Q4); zero bare adjectives describing magnitude
- Evidence tags: Every substantive claim in Parts I-III carries an epistemic status tag (PROVEN / CONFIRMED / CONSENSUS / DEBATED / SPECULATIVE / UNCERTAIN)
- Grounding: Every abstract concept has at least one concrete, observable, measurable example
- Narrative Flow: Core concepts follow the 7-step introduction sequence; no core concept skips Hook and Reveal
- Prose over bullets: Primary content is flowing narrative paragraphs; bullets used only for genuine enumeration
- Citations: All non-trivial claims have inline citations; References section compiles all sources
- Mathematics: Every equation or formula has full intuitive explanation (what each variable means, why this shape, what changes when you change inputs)
- Misconceptions: Part IV identifies at least 3 places where common intuition is wrong and explains why
- Going Deeper: Part VI includes specific resources (not vague recommendations), progressive exercises with timelines, and meta-lessons
- Mental simulation test: After reading the document, can the reader predict, explain, diagnose, design, and feel inevitability? If any of these five fail, the relevant section is incomplete
- Length: Document matches the requested length parameter (Focused: 10-15K, Comprehensive: 20-30K, Exhaustive: 30K+)
- Filename: Output saved as
{domain_slug}_world_model_{YYYY-MM-DD}.md