name: test-coverage-improver
description: 'Improve test coverage in the OpenAI Agents Python repository: run bun run coverage, inspect coverage artifacts, identify low-coverage files, propose high-impact tests, and confirm with the user before writing tests.'
Test Coverage Improver
Overview
Use this skill whenever coverage needs assessment or improvement (coverage regressions, failing thresholds, or user requests for stronger tests). It runs the coverage suite, analyzes results, highlights the biggest gaps, and prepares test additions while confirming with the user before changing code.
Quick Start
- From the repo root run
bun run coverageto regeneratecoverage/data. - Collect artifacts:
coverage/. - Summarize coverage: total percentages, lowest files, and uncovered lines/paths.
- Draft test ideas per file: scenario, behavior under test, expected outcome, and likely coverage gain.
- Ask the user for approval to implement the proposed tests; pause until they agree.
- After approval, write the tests in
tests/, rerunbun run coverage, and then runbun run checkbefore marking work complete.
Workflow Details
- Run coverage: Execute
bun run coverageat repo root. Avoid watch flags and keep prior coverage artifacts only if comparing trends. - Parse summaries efficiently:
- Prefer the console output from
bun run coveragefor file-level totals
- Prefer the console output from
- Prioritize targets:
- Public APIs or shared utilities in
src/before examples or docs. - Files with low statement coverage or newly added code at 0%.
- Recent bug fixes or risky code paths (error handling, retries, timeouts, concurrency).
- Public APIs or shared utilities in
- Design impactful tests:
- Hit uncovered paths: error cases, boundary inputs, optional flags, and cancellation/timeouts.
- Cover combinational logic rather than trivial happy paths.
- Place tests under
tests/and avoid flaky async timing.
- Coordinate with the user: Present a numbered, concise list of proposed test additions and expected coverage gains. Ask explicitly before editing code or fixtures.
- After implementation: Rerun coverage, report the updated summary, and note any remaining low-coverage areas.
Notes
- Do not create
scripts/,references/, orassets/unless needed later. - If coverage artifacts are missing or stale, rerun
bun run coverageinstead of guessing.