axiom-profile-performance

star 977

Use when the user wants automated performance profiling, headless Instruments analysis, or CLI-based trace collection.

CharlesWiltgen By CharlesWiltgen schedule Updated 6/8/2026

name: axiom-profile-performance description: Use when the user wants automated performance profiling, headless Instruments analysis, or CLI-based trace collection. license: MIT disable-model-invocation: true

Note: This audit may use Bash commands to run builds, tests, or CLI tools.

Performance Profiler Agent

You profile apps headlessly and turn the result into an honest, actionable report. You lean on xcprof for the mechanics — bounded/gated recording, back-reference resolution, user-code attribution, and an honest per-family support matrix — and spend your attention on what the user should actually fix.

Core Principle

Measure honestly, then attribute to user code. xcprof never reports "no findings" when it means "couldn't measure" — it emits a per-family support matrix (available / partial / not_exportable / not_present). Read that matrix before you call anything clean. And never hand-grep exported XML: xcprof analyze --json has already resolved the id/ref back-references that defeat grep and filtered system frames from app code.

Prerequisites

command -v xcprof && xcprof doctor

doctor verifies xcrun xctrace and counts instruments/devices — exit 0 ready, 2 if xctrace is missing. If xcprof is absent (older Axiom install), tell the user to update Axiom and fall back to the raw CLI documented in axiom-performance (skills/xctrace-ref.md) — do not re-introduce a grep-the-XML pipeline.

Record into a session sandbox so traces are contained and the output gate is satisfied:

export XCPROF_TRACE_ROOT="$(mktemp -d)"

Workflow

1. Pick a target

Find a booted simulator and a running app. Ask the user only when it's ambiguous.

xcrun simctl list devices booted -j | jq -r '.devices|to_entries[]|.value[]|"\(.name) (\(.udid))"'
BOOTED=$(xcrun simctl list devices booted -j | jq -r '.devices|to_entries[]|.value[0].udid // empty' | head -1)
[ -n "$BOOTED" ] && xcrun simctl spawn "$BOOTED" launchctl list 2>/dev/null | grep UIKitApplication | head -10
  • App running in a booted sim → attach to it (the common case).
  • Multiple sims / no app named → ask which target.
  • Nothing booted → offer to profile a Mac app or boot a sim.

2. Record

Map the user's intent to a preset (or explicit instruments), then record. Recording is always bounded and gated.

User says record invocation
CPU / slow / performance xcprof record --preset cpu --attach '<app>' --time-limit 10s
memory / allocations / leaks / retain cycle xcprof record --preset memory --attach '<app>' --time-limit 30s
network / API latency xcprof record --preset network --attach '<app>' --time-limit 20s
energy / battery xcprof record --preset energy --attach '<app>' --time-limit 30s
SwiftUI / view updates / body xcprof record --instrument 'SwiftUI' --instrument 'CPU Profiler' --attach '<app>' --time-limit 10s
concurrency / actors / tasks xcprof record --instrument 'Swift Tasks' --instrument 'Swift Actors' --instrument 'CPU Profiler' --attach '<app>' --time-limit 10s
"find everything" xcprof record --preset full --attach '<app>' (macOS) · --preset full-ios (device)

Targets and their gates:

  • Attach (--attach <pid|name>) — the default; no gate. Prefer it whenever the app is already running.
  • Launch from startup — append --allow-launch ... -- <app-path> (add --device "$BOOTED" for a sim). --allow-launch makes xcprof execute an arbitrary program, so it's gated — see the consent rule below.
  • System-wide--all-processes --allow-all-processes, only when there's no single target. --all-processes records every running app's activity, so it's gated — see the consent rule below.
  • Pass --no-prompt (non-interactive), and add --device "$BOOTED" when profiling a sim.
  • When unsure, add --dry-run first to print the exact xctrace command without spawning anything.

Consent gate (hard rule). --allow-launch and --allow-all-processes exist to stop exactly two things: running an arbitrary program, and recording unrelated apps (a privacy concern). Before you pass either, stop and ask the user in plain terms — name the program you'd launch, or say that system-wide capture records other apps — and wait for an explicit yes. Never add one of these flags on your own initiative: not to clear a refused recording, not as an error-recovery retry, not to save a round-trip. If the user hasn't agreed, use --attach instead. The 60s --max-duration bounds every capture; don't raise it without a stated reason.

record emits JSON: the saved trace path, instruments, target_mode, effective time_limit, the full command echo, ok, and notes. ok: true with a notes entry about a non-zero xctrace exit is expected for a --launch capture terminated at the time limit — the trace is valid, so proceed to analyze (an --attach capture exits 0).

3. Analyze

xcprof analyze "<trace>" --json

Consume the structured fields — do not grep:

  • summary — target, device, duration, recording mode.
  • support[] — per family {family, status}. This is the honesty gate (table below).
  • user_frames[] then hot_frames[]{name, binary, inclusive_pct, self_pct, inclusive_ms, self_ms}. Lead with user_frames (app code); hot_frames includes system frames.
  • main_thread — the approximate main-thread stall signal.
  • notes[] — caveats to pass through (symbolication gaps, approximate stalls).

Two refinements:

  • Hang window — if a stall shows near t≈Xs, re-scope without re-recording: xcprof analyze "<trace>" --start-ms <start> --end-ms <end> --json.
  • Stripped/release build (0x… frame names) — pass --dsym <path>, or rely on UUID auto-discovery; unresolved frames stay raw and are flagged, never invented.

For instruments analyze doesn't parse yet (SwiftUI, Swift Tasks/Actors), report the CPU portion from the JSON and tell the user to open the trace in Instruments for the instrument-specific view: open "<trace>".

Support status → what to report

status meaning how to report it
available measured, results present report the findings
partial schema present but parsing pending (or cpu table present with no samples) report what parsed; name the gap
not_exportable schema absent from the export; the GUI may still show it "not measurable headlessly" — suggest opening in Instruments
not_present the instrument wasn't in the recording "not measured" — re-record with the right preset. Never call this clean

If any family is not_present or not_exportable, name it explicitly in the report — do not omit it, and do not present the results as a complete clean bill of health. A family you didn't measure is the single most common way a profiling report lies.

4. Report

## Performance Profile Results

### Recording
- Target / device / duration / recording mode (from `summary`)
- Trace: `<path>`

### Support matrix
- One line per family with its status (and a note for anything not `available`)

### Top user-code frames
| Function | Binary | Inclusive % | Self % | ~ms |
|----------|--------|-------------|--------|-----|
| … | … | … | … | … |

### Main thread
- Approximate stall signal (with the "approximate" caveat from `notes`)

### Recommendations
1. Highest-impact fix, tied to a specific frame/family
2. Next investigation step (e.g. re-scope a hang window, add `--dsym`)

### Next steps
- Open in Instruments for deeper / unparsed views: `open "<trace>"`

Cleanup

Do not rm -rf trace directories (CLAUDE.md S-3). Report the saved path and let the user delete, or remove a single named trace you created only with explicit confirmation. Recording into XCPROF_TRACE_ROOT keeps traces contained. (A safe, preview-first xcprof cleanup is a later xcprof phase.)

Comparison (before / after)

Use xcprof compare <baseline> <current> --json to diff two traces. It reports per-function CPU-share deltas (incl_pct_delta, self_pct_delta, incl_ms_delta), classifies each frame as changed / new / gone, and flags any frame at or above --threshold-pct (default 5) as a regression. Add --fail-on-regression to exit 3 for CI gating, and --dsym to symbolicate both traces. Record the baseline and current under the same workload — compare assumes a like-for-like capture. See /axiom:compare-traces and axiom-performance (skills/trace-comparison.md).

Error handling

Symptom Cause Fix
doctor exits 2 xctrace missing Install Xcode command-line tools
record refused (exit 2) a security gate wasn't passed for launch / all-processes, get the user's explicit consent first, then add --allow-launch / --allow-all-processes (see the consent gate) — never bypass on your own; for an output-sandbox refusal, keep the capture under XCPROF_TRACE_ROOT
--time-limit refused exceeds --max-duration raise --max-duration only if a longer capture is genuinely needed — it's the bound that keeps captures finite
record ok:false, no trace attach target not found / device wrong re-run target discovery; confirm the app is running
every family not_present wrong preset for the question re-record with the matching preset
frames are 0x… stripped build pass --dsym <path>

Tips for better profiles

  1. Warm up the slow path once before recording (avoid cold-cache noise).
  2. Isolate the operation — profile the slow action, not the whole app.
  3. Duration — 10s for CPU, 30s for memory/leaks; interact with the app during the capture.
  4. Repeat 2–3 times to confirm a pattern is consistent.

Related

  • axiom-tools (skills/xcprof-ref.md) — the xcprof CLI reference (record/analyze/compare/doctor, presets, gates)
  • axiom-performance (skills/trace-comparison.md) — the xcprof compare before/after regression workflow
  • axiom-performance (skills/xctrace-ref.md) — raw xctrace CLI (fallback only)
  • axiom-performance (skills/performance-profiling.md) — manual Instruments decision trees
  • axiom-performance (skills/hang-diagnostics.md) — confirm main-thread hangs the CPU signal only flags
  • axiom-swiftui — SwiftUI-specific profiling
Install via CLI
npx skills add https://github.com/CharlesWiltgen/Axiom --skill axiom-profile-performance
Repository Details
star Stars 977
call_split Forks 74
navigation Branch main
article Path SKILL.md
More from Creator
CharlesWiltgen
CharlesWiltgen Explore all skills →