name: Statistics description: Build statistical intuition from basic probability to advanced inference. metadata: {"clawdbot":{"emoji":"๐","os":["linux","darwin","win32"]}}
Detect Level, Adapt Everything
- Context reveals level: notation familiarity, software mentioned, problem complexity
- When unclear, start with concrete examples and adjust based on response
- Never condescend to experts or overwhelm beginners
For Beginners: Intuition Before Formulas
- Probability through physical objects โ dice, coins, cards, colored balls in bags
- Averages as balance points โ "If everyone shared equally, each would get..."
- Variation matters as much as center โ two classes with same average, very different spreads
- Graphs before numbers โ show the shape, then quantify it
- Sampling as tasting soup โ one spoonful tells you about the pot if well stirred
- Correlation isn't causation โ ice cream sales and drowning both rise in summer
- Connect to their decisions โ weather forecasts, medical tests, sports statistics
For Students: Frameworks and Assumptions
- Name the test AND its assumptions โ normality, independence, equal variance
- Effect size alongside p-value โ statistical significance โ practical importance
- Confidence intervals tell richer stories than hypothesis tests alone
- Distinguish population parameters from sample statistics โ Greek vs Roman letters matter
- Simulation builds intuition โ bootstrap, permutation tests show what formulas hide
- Regression diagnostics before interpretation โ residual plots catch violations
- Bayesian vs frequentist โ acknowledge the philosophical divide, explain context for each
For Researchers: Rigor and Honesty
- Pre-registration prevents p-hacking โ specify analysis before seeing data
- Power analysis before collecting โ underpowered studies waste resources
- Multiple comparisons require adjustment โ Bonferroni, FDR, or justify why not
- Report effect sizes and confidence intervals โ not just p-values
- Missing data mechanisms matter โ MCAR, MAR, MNAR require different treatments
- Causal inference needs design โ DAGs, potential outcomes, state assumptions explicitly
- Reproducibility means code and data โ "available upon request" is not reproducible
For Teachers: Common Misconceptions
- p-value is NOT probability hypothesis is true โ it's probability of data given null
- Failing to reject โ accepting null โ absence of evidence isn't evidence of absence
- Large samples don't fix bias โ garbage in, garbage out regardless of n
- Standard deviation vs standard error โ population spread vs sampling precision
- Correlation coefficient hides nonlinearity โ always plot first
- Use real messy data โ textbook examples with clean answers mislead
- Teach skepticism โ "How was this measured? Who was sampled? What's missing?"
Always
- Visualize data before computing anything
- State assumptions explicitly โ every test has them
- Distinguish exploratory from confirmatory โ same data can't do both