source-evaluation - SKILL.md Agent Skill

name: source-evaluation description: Score sources for reliability and credibility using the NATO Admiralty Code (A1–F6) and calibrate confidence using Sherman Kent probability bands. Use whenever Trevor cites an OSINT source, evaluates conflicting reports, or attaches a confidence band to a key judgment. Triggers on any request involving "how reliable is", "rate this source", "what's our confidence", "probability", or "Admiralty code". metadata: trevor: requires: bins: [python3]

Source Evaluation

Two independent disciplines: rate the source, calibrate the confidence. Both are mandatory for Trevor citations.

NATO Admiralty Code

Score every cited source on two axes:

Reliability of source

Code	Meaning	When to use
A	Completely reliable	Verified track record. Established primaries (UKMTO, IAEA)
B	Usually reliable	Mostly accurate; rare minor errors (major wires, peer-reviewed)
C	Fairly reliable	Mixed track record (regional outlets, well-sourced blogs)
D	Not usually reliable	Bias dominates (state media of an interested party)
E	Unreliable	Pattern of inaccuracy
F	Cannot be judged	Genuinely unknown — not a default for "I didn't check"

Credibility of information

Code	Meaning
1	Confirmed by independent sources
2	Probably true
3	Possibly true
4	Doubtful
5	Improbable
6	Cannot be judged

Cite as A1, B2, C3, etc. Anything below C3 should not anchor a key judgment without independent corroboration.

→ Full template with worked example: analyst/templates/source-evaluation-matrix.md

Sherman Kent Probability Bands

Use these — and only these — for probability language.

Band	Range	Verbal anchor
Almost certain	93–99%	"almost certain"
Highly likely	75–85%	"highly likely"
Likely / Probable	55–70%	"likely", "probable"
Even chance	45–55%	"roughly even chance"
Unlikely	25–35%	"unlikely"
Highly unlikely	10–20%	"highly unlikely"
Almost no chance	1–5%	"remote chance"

Discipline rules

Never use "could", "may", "might" without a band. They smuggle uncertainty.
Never report a point estimate ("65% probability of X"). Report a band.
The verbal anchor and the numeric band must agree. If you write "highly likely" while feeling 50%, downgrade to "even chance".
Sherman Kent bands are about confidence in a judgment, not source reliability. Don't conflate them.

Procedure

1. For every source you cite:
   a. Look up its Admiralty rating in analyst/meta/sources.json
      (add if missing — see analyst/playbooks/source-acquisition.md)
   b. Score the specific information item independently
   c. Attach the combined code (e.g. B2) at point of citation
2. For every key judgment:
   a. Pick a Sherman Kent band that honestly reflects evidence weight
   b. Use the verbal anchor from the table — don't invent new ones
   c. Note in the assessment which evidence anchors the band
3. Quality-gate before delivery: every cited source has a code; every
   key judgment has a band. (See analyst/playbooks/quality-gates.md
   gates 2 and 3.)

Anti-patterns

Defaulting to F6 because rating is effort. F6 is for genuinely unknown sources.
Source laundering: think tank cited a primary, you cite the think tank as if it were the primary. Score the chain you actually have.
Round-tripping reliability through credibility — they're independent axes.
Stair-stepping confidence: starting at "highly likely" and creeping to "almost certain" as the assessment is written.