bayesian-reasoning - SKILL.md Agent Skill

name: bayesian-reasoning description: "Use when updating beliefs, forecasts, diagnoses, or decision assumptions under uncertainty using Bayesian reasoning: priors/base rates, likelihood, evidence strength, posterior direction, and residual uncertainty. Covers base-rate discipline, likelihood-vs-posterior separation, independent evidence updates, natural-frequency examples, confidence calibration, and when to stop at qualitative probability instead of fake precision. Do NOT use for expected monetary value calculations, strategy-cascade choices (use playing-to-win), industry-structure analysis (use porters-five-forces), or generic task prioritization (use prioritization). Do NOT use for calculate the expected value of these three options. Do NOT use for turn this growth plan into a strategy cascade. Do NOT use for analyze supplier power and substitutes in this industry. Do NOT use for rank these roadmap items by impact and effort. Do NOT use for build a statistical model from a dataset." license: MIT compatibility: "Markdown, decision memos, diagnostic reasoning, research synthesis, forecasting, agent confidence calibration" allowed-tools: Read Grep metadata: relations: "{"related":["prioritization","mental-models","constraint-awareness","problem-approach-router","epistemic-grounding"],"suppresses":["playing-to-win","porters-five-forces"],"verify_with":["epistemic-grounding","methodology"]}" subject: reasoning-strategy public: "true" scope: "Use when updating beliefs, forecasts, diagnoses, or decision assumptions under uncertainty using Bayesian reasoning: priors/base rates, likelihood, evidence strength, posterior direction, and residual uncertainty. Covers base-rate discipline, likelihood-vs-posterior separation, independent evidence updates, natural-frequency examples, confidence calibration, and when to stop at qualitative probability instead of fake precision. Do NOT use for expected monetary value calculations, strategy-cascade choices (use playing-to-win), industry-structure analysis (use porters-five-forces), or generic task prioritization (use prioritization)." taxonomy_domain: foundations/decision-quality stability: stable keywords: "["bayesian reasoning","bayes theorem","bayesian update","base rate","prior probability","posterior probability","likelihood ratio","evidence strength","confidence calibration","probabilistic reasoning"]" examples: "["use Bayesian reasoning to update our confidence after this new evidence","we have a rare bug signal; account for the base rate before concluding the cause","separate prior, likelihood, and posterior for this diagnosis","how should this customer interview change our belief in the product hypothesis?","calibrate my confidence instead of giving a binary yes/no answer"]" anti_examples: "["calculate the expected value of these three options","turn this growth plan into a strategy cascade","analyze supplier power and substitutes in this industry","rank these roadmap items by impact and effort","build a statistical model from a dataset"]" grounding: "{"subject_matter":"Bayesian reasoning for decision-making under uncertainty","grounding_mode":"universal","truth_sources":["https://plato.stanford.edu/entries/bayes-theorem/\",\"https://plato.stanford.edu/entries/epistemology-bayesian/\",\"https://pubmed.ncbi.nlm.nih.gov/17835457/\",\"skills/reasoning-strategy/bayesian-reasoning/references/bayesian-reasoning-sources.md\",\"skills/reasoning-strategy/bayesian-reasoning/references/upstream-displacement-2026-05-26.md\"],\"failure_modes\":[\"base_rate_neglect\",\"likelihood_confused_with_posterior\",\"anecdote_overweighted\",\"correlated_evidence_double_counted\",\"prior_hidden_or_smuggled\",\"false_precision_from_weak_inputs\",\"binary_answer_given_under_uncertainty\"],\"evidence_priority\":\"general_knowledge_first\"}" mental_model: "Bayesian reasoning treats belief as a state that changes when evidence arrives. The primitives are a hypothesis, prior probability or base rate, evidence, likelihood of seeing that evidence if the hypothesis were true, likelihood of seeing it if the hypothesis were false, posterior belief, residual uncertainty, and update history. The key move is comparing how much better the evidence is explained by one hypothesis than by alternatives, then updating from the prior instead of starting from the vividness of the evidence." purpose: "This skill prevents agents from jumping from a salient signal to a confident conclusion. It replaces binary diagnosis, anecdote-weighting, and base-rate neglect with an explicit update loop: start with the prior, estimate evidential force, adjust belief in the right direction, avoid double-counting correlated evidence, and state what would change the posterior next." concept_boundary: "Bayesian reasoning updates probabilities and confidence; it does not by itself choose the action with the best payoff, produce an expected value table, fit a statistical model, create a strategy cascade, analyze industry structure, or rank a backlog. Those downstream tools may consume Bayesian probabilities, but this skill owns the belief update." analogy: "Bayesian reasoning is like adjusting a dimmer switch rather than flipping a light switch: evidence moves confidence up or down from where it started, and stronger evidence moves it farther." misconception: "The common mistake is treating Bayes as a formula that requires precise numbers. The formula is the idealized version; in agent work the practical discipline is often qualitative: make the prior explicit, compare evidence under competing hypotheses, update directionally, and label uncertainty instead of inventing decimals." skill_graph_source_repo: "https://github.com/jacob-balslev/skill-graph" skill_graph_project: Skill Graph skill_graph_canonical_skill: skills/reasoning-strategy/bayesian-reasoning/SKILL.md skill_graph_export_description_projection: anti_examples

Bayesian Reasoning

Concept of the skill

Bayesian reasoning treats belief as a state that changes when evidence arrives. The primitives are a hypothesis, prior probability or base rate, evidence, likelihood of seeing that evidence if the hypothesis were true, likelihood of seeing it if the hypothesis were false, posterior belief, residual uncertainty, and update history.

Concept Card

What it is: Bayesian reasoning is a method for updating belief under uncertainty. It starts from a prior or base rate, evaluates how expected the new evidence is under competing hypotheses, updates toward the hypothesis that better predicts the evidence, and preserves residual uncertainty.

Mental model: Confidence is not reset by each new clue. A belief has an existing level, evidence applies pressure to that level, and the posterior becomes the new prior for the next update.

Why it exists: Agents tend to overreact to vivid recent evidence, ignore base rates, and answer uncertain questions as yes/no. Bayesian reasoning forces the belief state, evidence strength, and update size into the open.

What it is not: It is not an expected-value decision table, a statistical modeling workflow, a generic prioritization method, a strategy framework, or a requirement to fabricate exact probabilities when inputs are weak.

Adjacent concepts: base rates, priors, likelihood ratios, posterior probability, diagnostic reasoning, forecasting, calibration, expected value, hypothesis testing, evidence independence.

One-line analogy: Bayesian reasoning is a confidence ledger: every new piece of evidence is posted against the prior balance before the new balance is reported.

Common misconception: The method is not "new evidence says X, therefore X." Evidence matters by how differently it is predicted by X versus not-X, and by how plausible X was before the evidence arrived.

Coverage

This skill teaches agents to:

State the hypothesis and plausible alternatives before updating.
Make priors and base rates explicit.
Separate likelihood from posterior probability.
Estimate evidence strength by comparing competing explanations.
Update confidence directionally when exact numbers are unjustified.
Avoid double-counting correlated evidence.
Use natural frequencies for rare-event and diagnostic examples.
Report residual uncertainty and the evidence that would change the belief.

Philosophy of the skill

Bayesian reasoning is useful because it makes uncertainty inspectable. A confident answer can hide a weak prior, a diagnostic clue can look decisive while being common under multiple explanations, and a vivid example can overwhelm a large base rate. The Bayesian discipline forces those hidden weights into the answer.

The method is not a demand for spreadsheet precision. In many product, strategy, debugging, and research tasks, the honest output is qualitative: "this evidence raises confidence from low to moderate, but not high, because the base rate is low and the evidence is not independent." That is stronger than an invented 73 percent.

Workflow

1. Define the belief being updated

Name one hypothesis at a time, plus the alternatives.

Hypothesis:
Alternatives:
Decision or question this belief affects:
Current confidence:

Do not update a vague claim such as "this is promising." Rewrite it as a belief that can be supported or weakened.

2. Establish the prior

Use the best available prior source:

Prior source	Use when	Example
Base rate	Similar cases exist	"Only a small share of signups convert without activation."
Historical frequency	The system has logs or repeated runs	"This error has usually been config-related."
Reference class	No direct data exists	"Comparable B2B onboarding changes have mixed results."
Stated assumption	No evidence exists	"Assume low prior and mark it as a placeholder."

If the prior is uncertain, say so. Do not hide the prior by starting from the new evidence.

3. Compare likelihoods

Ask how expected the evidence is under each hypothesis.

Evidence:
If hypothesis is true, how expected is this evidence?
If hypothesis is false or an alternative is true, how expected is this evidence?
Likelihood direction:
Evidence independence:

Evidence is strong only when it is much more expected under one hypothesis than under plausible alternatives.

4. Update the belief

Move confidence in proportion to prior strength and evidence strength.

Situation	Update discipline
Strong prior, weak evidence	Small update
Weak prior, strong diagnostic evidence	Moderate or large update, but still state uncertainty
Rare hypothesis, noisy evidence	Small update unless the evidence is highly diagnostic
Multiple independent signals	Update more than once, but only if independence is credible
Correlated signals	Treat as one evidence cluster, not many independent confirmations

When inputs are rough, use bands: very low, low, moderate, high, very high. Prefer bands to fake decimals.

5. Report the posterior and next evidence

The useful answer includes the belief state and what would change it.

Bayesian update
- Prior/base rate:
- New evidence:
- Likelihood comparison:
- Update:
- Posterior confidence:
- Residual uncertainty:
- Evidence that would change the posterior next:

Natural-Frequency Check

For rare events, translate percentages into counts. This catches base-rate neglect.

Out of 10,000 cases:
- Prior/base-rate cases where the hypothesis is true:
- Cases where the evidence appears if true:
- Cases where the evidence appears if false:
- Total cases with evidence:
- Share of evidence-positive cases where the hypothesis is true:

If a rare event has a 1 percent base rate and a test is 90 percent accurate with a 10 percent false-positive rate, most positive tests may still be false positives. The exact result depends on the numbers, but the lesson is stable: low base rates require very diagnostic evidence.

Anti-Patterns

Anti-pattern	Why it fails	Repair
Base-rate neglect	Treats a vivid clue as if the prior were neutral	State the reference class and prior before the update
Likelihood-posterior swap	"Evidence is likely if H is true" becomes "H is likely"	Compare evidence under H and not-H, then update from the prior
Anecdote overweighting	One case gets treated as representative	Ask whether the evidence is diagnostic or merely salient
Double-counting correlated evidence	Many signals from one source masquerade as independent confirmation	Cluster correlated signals and update once
Hidden prior	The answer smuggles in confidence without naming it	Write the prior or mark it as an assumption
Fake precision	Weak inputs produce precise probabilities	Use confidence bands and evidence gaps
Binary conclusion	An uncertain belief becomes yes/no	Report posterior confidence and next evidence

Boundaries

Use Bayesian reasoning when the task is to update a belief, diagnosis, forecast, or assumption after evidence arrives.

Use another tool when the task is narrower or downstream:

Need	Better owner
Choose the option with highest probability-weighted payoff	Expected value skill when available; otherwise state that EV is downstream
Rank backlog items by impact, urgency, effort, or confidence	`prioritization`
Turn a vague business strategy into integrated choices	`playing-to-win`
Diagnose industry structure and profit-pool pressure	`porters-five-forces`
Choose which reasoning method to apply first	`problem-approach-router`
Ground factual claims to citations and modality	`epistemic-grounding`

Verification

Before finishing, verify:

The hypothesis and alternatives are explicit.
The prior or base rate is stated, or the absence of one is labeled.
Likelihood is not confused with posterior probability.
Evidence strength is judged against competing explanations.
Correlated evidence is not double-counted.
Probability precision matches evidence quality.
The posterior is reported as an update from the prior.
Residual uncertainty and next evidence are named.
The answer does not present expected value, strategy cascade, industry analysis, or generic prioritization as Bayesian reasoning.

References

skills/reasoning-strategy/bayesian-reasoning/references/bayesian-reasoning-sources.md
skills/reasoning-strategy/bayesian-reasoning/references/upstream-displacement-2026-05-26.md

Do NOT Use When

Use another skill when the task falls outside the declared scope, matches an anti_examples prompt, or is owned by a more specific related skill.

Skill Graph context

Classification

Subject: reasoning-strategy
Public: true
Domain: foundations/decision-quality
Scope: Use when updating beliefs, forecasts, diagnoses, or decision assumptions under uncertainty using Bayesian reasoning: priors/base rates, likelihood, evidence strength, posterior direction, and residual uncertainty. Covers base-rate discipline, likelihood-vs-posterior separation, independent evidence updates, natural-frequency examples, confidence calibration, and when to stop at qualitative probability instead of fake precision. Do NOT use for expected monetary value calculations, strategy-cascade choices (use playing-to-win), industry-structure analysis (use porters-five-forces), or generic task prioritization (use prioritization).

When to use

use Bayesian reasoning to update our confidence after this new evidence
we have a rare bug signal; account for the base rate before concluding the cause
separate prior, likelihood, and posterior for this diagnosis
how should this customer interview change our belief in the product hypothesis?
calibrate my confidence instead of giving a binary yes/no answer

Not for

calculate the expected value of these three options
turn this growth plan into a strategy cascade
analyze supplier power and substitutes in this industry
rank these roadmap items by impact and effort
build a statistical model from a dataset

Related skills

Verify with: epistemic-grounding, methodology
Related: prioritization, mental-models, constraint-awareness, problem-approach-router, epistemic-grounding

Concept

Mental model: Bayesian reasoning treats belief as a state that changes when evidence arrives. The primitives are a hypothesis, prior probability or base rate, evidence, likelihood of seeing that evidence if the hypothesis were true, likelihood of seeing it if the hypothesis were false, posterior belief, residual uncertainty, and update history. The key move is comparing how much better the evidence is explained by one hypothesis than by alternatives, then updating from the prior instead of starting from the vividness of the evidence.
Purpose: This skill prevents agents from jumping from a salient signal to a confident conclusion. It replaces binary diagnosis, anecdote-weighting, and base-rate neglect with an explicit update loop: start with the prior, estimate evidential force, adjust belief in the right direction, avoid double-counting correlated evidence, and state what would change the posterior next.
Boundary: Bayesian reasoning updates probabilities and confidence; it does not by itself choose the action with the best payoff, produce an expected value table, fit a statistical model, create a strategy cascade, analyze industry structure, or rank a backlog. Those downstream tools may consume Bayesian probabilities, but this skill owns the belief update.
Analogy: Bayesian reasoning is like adjusting a dimmer switch rather than flipping a light switch: evidence moves confidence up or down from where it started, and stronger evidence moves it farther.
Common misconception: The common mistake is treating Bayes as a formula that requires precise numbers. The formula is the idealized version; in agent work the practical discipline is often qualitative: make the prior explicit, compare evidence under competing hypotheses, update directionally, and label uncertainty instead of inventing decimals.

Grounding

Mode: universal
Truth sources: https://plato.stanford.edu/entries/bayes-theorem/, https://plato.stanford.edu/entries/epistemology-bayesian/, https://pubmed.ncbi.nlm.nih.gov/17835457/, skills/reasoning-strategy/bayesian-reasoning/references/bayesian-reasoning-sources.md, skills/reasoning-strategy/bayesian-reasoning/references/upstream-displacement-2026-05-26.md

Keywords

bayesian reasoning, bayes theorem, bayesian update, base rate, prior probability, posterior probability, likelihood ratio, evidence strength, confidence calibration, probabilistic reasoning