pm-metrics - SKILL.md Agent Skill

name: pm-metrics description: Делает ревью продуктовых метрик — тренды, аномалии, root causes и рекомендации к действиям. Включает декомпозицию North Star (L1/L2), диагностику retention-кривых, анализ воронки, разбор A/B-экспериментов, проверку соответствия OKR и фреймворк атрибуции аномалий. User-invoked only — do NOT auto-trigger. Triggers on /pm-metrics, "обзор метрик", "разбор воронки", "анализ удержания", "ретеншн", "A/B результаты", "review metrics", "DAU analysis", "retention analysis", "funnel analysis", "metric anomaly".

pm-metrics — Product metrics review

Part of the Personal Corp framework — running a one-person business through AI agents. Systematically review product metrics, identify trend changes, locate root causes, output action recommendations. Includes North Star decomposition, retention diagnostics, funnel methodology, and A/B experiment reading.

Inputs

Field	Required	Notes
Metric data	yes	Excel / CSV / pasted table / verbal description
Cycle	no	Weekly / monthly / quarterly review; default weekly
Focus	no	Full review / single-metric anomaly / experiment readout
Business context	no	Releases, campaigns, incidents in the period

Mode: full data → complete review; single-metric change → focused anomaly analysis.

Step 1 — Data integrity check

Confirm time coverage (current vs comparison period)
Confirm metric coverage (which North Star / L1 / L2 are present)
Flag missing critical data

Step 2 — North Star metric system

Decomposition: North Star → L1 → L2.

L1 dimensions:

User growth: DAU/WAU/MAU, new, returning
User engagement: core action frequency, session length, feature reach
User retention: D1 / D7 / D30
Conversion efficiency: signup → activation → paid step-by-step rates
Business value: paid rate, ARPU, LTV
Satisfaction: NPS, complaint rate, ratings

North Star selection guide:

Product type	Recommended NSM	Typical L1
Social / community	Weekly active posters	DAU/MAU ratio, interactions per user, D7 retention
Tools / productivity	Weekly users completing core task	Task completion rate, frequency, feature reach
E-commerce	Weekly transacting users	GMV, AOV, repeat rate, conversion
Content / media	Weekly content-consumption time	Time per user, completion rate, return rate
SaaS / B2B	Weekly active teams	Team penetration, feature depth, renewal rate

Step 3 — Growth metric analysis

Definitions:

DAU: distinct users with valid action that day
WAU: distinct users active ≥ 1 day in 7
MAU: distinct users active ≥ 1 day in 30
DAU/MAU ratio (stickiness): > 0.5 very high, 0.3-0.5 high, 0.2-0.3 medium, < 0.2 low

User segmentation:

Type	Definition	Focus
New	First-time user	Channel quality, activation rate
Active retained	Active in both periods	Depth, feature reach
Returning	Inactive last period, active this	Return reason, secondary retention
Churned	Active last period, inactive this	Churn cause, win-back potential
Dormant	Inactive multiple periods	Possibly permanent loss

Growth identity: This-period MAU = prev-period retained + new + returning − churned

Step 4 — Retention analysis

Definitions:

D1: % of new users who return on day 2
D7: % of new users who return on day 8
D30: % of new users who return on day 31

Retention benchmarks:

Product type	D1	D7	D30	Note
Social / messaging	> 70%	> 50%	> 35%	High-frequency essential
Tools	> 40%	> 25%	> 15%	"Use and leave" pattern
Content / news	> 35%	> 20%	> 10%	Many alternatives, lower retention
E-commerce	> 25%	> 15%	> 8%	Low-frequency, watch repeat rate instead
Games	> 40%	> 20%	> 10%	High variance by genre
SaaS / B2B	> 60%	> 45%	> 30%	High switching cost, higher baseline

Retention-curve diagnosis:

Steep drop (D1 → D7 loses > 60%): activation experience broken — users didn't find value
Slow decay (D7 → D30 keeps falling, doesn't level): no long-term hook
L-shape (levels off after D7): healthy, core user base formed
Bounce-back (sudden uptick on a specific day): cyclical use pattern (e.g. weekday-only)

Retention segmentation:

By channel: organic vs paid retention gap
By behavior: completed activation vs not
By cohort month: compare month-over-month curves to gauge product improvement

Step 5 — Conversion funnel analysis

Funnel construction:

Define start and end points (e.g. homepage visit → payment success)
Split into key intermediate steps (each step = a user decision point)
Per-step rate = arriving at next / arriving at this

Funnel framework:

Step	Action	Output
Draw	List steps + rates	Full funnel view
Identify bottleneck	Find lowest-rate step	Optimization focus
Benchmark	Compare history / industry / competitor	Gap quantification
Segment	By channel / device / user type	Locate problem cohort
Hypothesize	Why is the bottleneck there?	Optimization direction
Experiment	Propose A/B test	Action plan

Common funnels:

Acquisition: impression → click → install/signup → activation
Activation: signup → onboarding done → core action first-trigger
Payment: browse → cart → order → pay success
Sharing: trigger → share click → recipient open → recipient conversion

Step 6 — A/B experiment readout

Dimension	Standard	Note
Statistical significance	p < 0.05	p > 0.05 → inconclusive, don't decide
Effect size	Lift > MDE	Significant but tiny lift may not be worth it
Sample size	Reaches pre-set N	"Significant" without N is unreliable
Duration	Covers ≥ 1-2 full weeks	Avoid weekday/weekend bias
AA check	Pre-period baselines match	Mismatch → split assignment is broken

Decision framework:

Significant + large effect → ship to all
Significant + small effect → weigh long-term value vs cost
Not significant → don't ship; investigate (wrong hypothesis? sample? execution?)
Metric conflict (A up, B down) → weigh, prioritize North Star

Common pitfalls:

Reading results too early (before reaching N)
Looking only at primary metric, not guardrails
Multiple peeks → false positives
Ignoring novelty effect (early data inflated)

Step 7 — OKR alignment check

Check	Healthy	Anomaly signal
Coverage	Every KR has ≥ 1 trackable metric	A KR with no measurable proxy
Consistency	Metric direction matches KR target	Metric up but KR no progress
Pacing	Linear pacing ≥ 50% by mid-quarter	Severely behind schedule
Attribution	Metric movement attributable to team action	Metric improved due to industry tailwind, not team

OKR progress table:

OKR	KR metric	Target	Current	Progress %	Trend	Risk
{O1}	{KR1}	{target}	{current}	{X%}	Up/flat/down	On-track / at-risk / severe

Step 8 — Anomaly attribution

When a metric moves anomalously, work the framework:

Quantify: how much, starting when?
Decompose: segment by channel / region / version / cohort to localize
Time-align: what happened around the inflection? (release, campaign, incident, competitor move)
Eliminate: rule out causes one by one until the most likely root remains
Cross-check: verify the attribution via other metrics

Common causes:

Category	Pattern	Verification
Release	Inflection aligns with deploy time	Compare per-version
Campaign	Up during campaign, drops after	Compare per-channel
Tech incident	Sudden drop + recovery	Check error logs and uptime
External	Industry-wide change	Compare with competitor / industry data
Channel mix	One channel changed dramatically	Per-channel decomposition
Seasonality	Same as YoY	Look at last year's same period

Step 9 — Generate review report

# Product Metrics Review

**Period:** {date range}
**Product:** {name}
**Type:** {weekly / monthly / quarterly}

## 1. Health Overview
| Layer | Metric | Current | Previous | MoM | Target | Status |
|---|---|---|---|---|---|---|
| North Star | {} | {} | {} | {±X%} | {} | OK / warn / alert |
| L1 | {} | {} | {} | {±X%} | {} | OK / warn / alert |

**Overall judgment:** {one-sentence summary}

## 2. User Growth
- DAU: {value}, MoM {change}
- MAU: {value}, DAU/MAU = {stickiness}
- Composition: new {X}% / retained {Y}% / returning {Z}%

## 3. Retention
| Metric | Current | Previous | Benchmark | Assessment |
|---|---|---|---|---|

## 4. Funnel
| Step | Users | Rate | MoM | Bottleneck? |
|---|---|---|---|---|

**Bottleneck diagnosis:** {description}

## 5. Experiments / Feature Effects
| Experiment | Primary metric Δ | Significance | Conclusion |
|---|---|---|---|

## 6. OKR Progress
| KR | Target | Current | Progress | Risk |
|---|---|---|---|---|

## 7. Anomaly Attribution
| Anomaly | Magnitude | Start | Attribution | Confidence |
|---|---|---|---|---|

## 8. Key Insights
1. {insight 1: finding + data + meaning}
2. {insight 2}
3. {insight 3}

## 9. Action Recommendations
| Priority | Action | Linked metric | Expected impact | Owner |
|---|---|---|---|---|

Review cadence

Type	Frequency	Time	Audience	Focus
Weekly	Every Monday	15-30 min	PM	NSM + anomalies + experiments
Monthly	Month start	30-60 min	Product team	All L1 + retention + funnel + OKR pacing
Quarterly	Quarter end	60-90 min	Product + ops + eng	Strategy review + OKR scoring + next-quarter plan

Quality bar

Metric definitions clear — every metric has a calculation note
Data has comparisons — current always compared to previous, YoY, or target
Attribution evidenced — no causation from correlation alone
Recommendations actionable — owner-assignable
Limitations tagged — call out small samples or data quality issues

Common analysis pitfalls

Pitfall	Symptom	Fix
Simpson's paradox	Total goes up while every segment goes down	Always segment, never just look at totals
Survivorship bias	Only retained users analyzed, churned ignored	Compare retained vs churned behavior
Vanity metric	Cumulative signups only ever grow, not decision-useful	Use active metrics (DAU/WAU) instead
Time-window trap	Comparison window happens to be an outlier	Cross-validate across multiple windows
Goodhart's law	Target becomes a metric, stops measuring well	Set guardrails to prevent gaming

Red lines

No fabricated data — missing data → tag "missing", don't extrapolate
Don't conflate correlation with causation — attribution must say "highly correlated" or "confirmed causal"
Don't over-read small swings — small fluctuation → tag "within normal noise"
Don't ignore negatives — flag risks even when overall is up

When input is incomplete

Single metric only → focus on that anomaly, no full review
No history → snapshot only, tag "no baseline, recommend establishing tracking"
Verbal description → analyze based on description, tag "recommend exact data for verification"
No targets → use industry benchmarks, suggest team set explicit targets

Related skills

/pm-feedback — pair quantitative anomaly with qualitative voice-of-customer
/pm-prioritize — adjust priority based on metric findings
/pm-roadmap — adjust roadmap based on OKR pacing