reasoning-decision-timing - SKILL.md Agent Skill

name: reasoning-decision-timing description: Understanding when LLM reasoning models make decisions - before or during chain-of-thought. Use when discussing reasoning model interpretability, AI safety, chain-of-thought reliability, or the philosophical implications of LLM decision-making processes. Triggers on questions about "reasoning models decide first", "chain-of-thought rationalization", "LLM interpretability", or "reasoning timing".

Key findings from "Therefore I am. I Think" (arXiv:2604.01202) by Esakkivel et al.

Reasoning models encode detectable decisions before chain-of-thought generation, challenging the assumption that thinking precedes deciding.

Linear probes decode tool-calling decisions from pre-generation activations with very high confidence, even before any reasoning tokens are produced.
Activation steering perturbing the decision direction:
- Leads to inflated deliberation
- Flips behavior in 7-79% of examples (depending on model/benchmark)
Behavioral analysis: When steering changes the decision, chain-of-thought often rationalizes the flip rather than resisting it.

If decisions are encoded before deliberation, transparency mechanisms based on reading reasoning traces may be fundamentally flawed
Safety interventions should target early activation patterns, not just output text

When evaluating reasoning model outputs:

arXiv:2604.01202 - "Therefore I am. I Think" by Esakkivel Esakkiraja et al. Submitted: April 1, 2026

User: "Help me with reasoning decision timing"
→ Understand requirements → Execute actions → Provide results

User: "I need detailed reasoning decision timing assistance"
→ Clarify scope → Provide comprehensive solution → Follow up