name: coverity-audit description: Triage Coverity Scan defects (https://scan.coverity.com) for this project — fetch defect lists, fetch per-defect details, and apply triage decisions (Bug / FalsePositive / Intentional with severity, action, and a comment). Use when the user asks to "review Coverity defects", "triage Coverity findings", "fetch Coverity outstanding", or anything mentioning Coverity Scan, CIDs, or scan.coverity.com.
Coverity Scan triage skill
This skill drives the Coverity Scan unofficial JSON API (the public site has
no documented API — the scripts mimic what the browser does) for the project
configured in .env. Scripts auto-detect the repo root and write all
artifacts under <repo-root>/.local/audits/coverity/.
The skill captures operational knowledge (how the API works, where it trips up, what the data means). It does NOT prescribe a review pipeline — how you actually triage each defect (single model, multi-model, manual) is adhoc; agree the approach with the user up front.
MANDATORY — keep this skill alive
If you (the agent) discover a new pattern, gotcha, working flow, correction,
or any piece of knowledge while running this skill — update this SKILL.md
AND commit it BEFORE proceeding. Knowledge that isn't committed is lost.
Examples of things to capture:
- New view IDs encountered (and what each represents)
- New API endpoint or parameter behavior
- New failure mode (Cloudflare quirks, session expiry signals, rate limits)
- New FP guardrail you found via the codebase (idiom that Coverity mismodels)
- New severity / classification mapping detail learned the hard way
MANDATORY — startup sequence when this skill is invoked
Do these steps in this order. Skipping any step costs the user their session.
Step 1 — agree the triage approach with the user
Coverity reviews are usually 1-3 defects. Sometimes hundreds. Ask the user:
- "How many defects are we looking at — one specific CID, or a sweep?"
- "How do you want to review them — you read each one, you spawn one agent per defect, you want me to use multiple models for cross-checking, …?"
Do NOT default to a heavy multi-stage pipeline; that's only worth setting up when there are tens of defects to crunch. For small batches a single agent or the user reading the bundle directly is usually faster.
If the user does want a multi-model approach, a sensible (not prescribed) shape is: cheap models surface ideas, the strongest coding model produces the actual analysis + fix, the strongest reviewing model sanity-checks the result. The user picks the actual CLIs/models — this skill does not assume any specific tool.
Step 2 — ask the user for a fresh cookie
Coverity Scan auth is cookie-based and tied to a live browser session. Give the user the exact recipe:
- Open https://scan.coverity.com/projects/
- ?tab=overview (replace <owner>-<repo>with the project slug, e.g.netdata-netdata).- Click "View Defects" at the top right -- a new tab opens to https://scan4.scan.coverity.com/# (or
scanN.scan.coverity.com-- Coverity load-balances; use whichever URL you land on asCOVERITY_HOST).- Keep that tab open the whole time we work. Closing it kills the session immediately.
- Press F12 to open DevTools, switch to the Network tab.
- Click any defect in the list -- 3-4 requests appear in the Network tab.
- Right-click any of those requests, select Copy > Copy as cURL.
- Paste the entire curl command back to me.
If the curl the user pastes does NOT contain a -b 'cookie=...' line, ask
again -- they probably picked "Copy as fetch" or "Copy as Node.js fetch".
Step 3 — save the cookie in .env
Extract the value of the -b argument from the user's curl and write/update
<repo-root>/.env with:
COVERITY_COOKIE='<paste-the-entire-cookie-blob-here>'
COVERITY_PROJECT_ID=<numeric projectId from the URL or table.json query>
COVERITY_VIEW_OUTSTANDING=<viewId of the Outstanding view>
COVERITY_HOST=https://scan4.scan.coverity.com
.env is gitignored. The cookie blob must include both COVJSESSIONID-build
and XSRF-TOKEN. The scripts extract XSRF-TOKEN automatically.
Step 4 — start the keepalive (background)
Bash tool with run_in_background=true,
command="bash .agents/skills/coverity-audit/scripts/keepalive.sh"
The keepalive exits non-zero the moment a ping fails (session expired, browser tab closed, cookie went bad). The orchestrator's background-task notification fires immediately so you know to ask for a fresh cookie and restart.
Stop the keepalive at the end of the triage session by killing its background task.
Step 5 — proceed with the actual triage work
Now (and only now) is it safe to fetch tables, fetch details, prepare defect bundles, finalize verdicts.
CRITICAL — server-side view state
Coverity's table API has a stateful, server-side view cursor. The
/views/table.json POST changes server-state (which page is "current"),
then the /reports/table.json GET reads whatever the current state is.
This has two consequences:
1. Pagination is two-step, not one-shot
Per page:
POST /views/table.json {projectId, viewId, pageNum}-- moves the cursorGET /reports/table.json?projectId=...&viewId=...-- reads the page
fetch-table.sh handles both steps and the page loop.
2. The user MUST NOT touch the Coverity UI while a fetch is running
If the user clicks a different view, scrolls to a different page, sorts a column, applies a filter — the server-side cursor moves under our scripts' feet. The scripts will then fetch garbage (rows from whatever view the user just opened, not the view the script asked for).
Tell the user explicitly: "I'm about to fetch the defect list. Don't click in the Coverity tab until I say I'm done."
If the user accidentally interferes, re-run the fetch script — it's idempotent on cached pages but not on already-corrupted ones, so delete the output JSONs and re-fetch from page 1.
Coverity views (queue scopes)
Coverity organizes defects into named views, each with a numeric
viewId. The skill operates per-view.
How to find a view's ID
In the Coverity UI:
- Click the project's defects view, then the view selector dropdown.
- Each view has a URL like
https://scan4.scan.coverity.com/#viewId=NNNNN. - That
NNNNNis the value to put in.env(or pass tofetch-table.sh).
Common view types you'll encounter
The default project ships with these (the IDs are project-specific):
- Outstanding — the live queue: defects neither classified nor dismissed.
Where day-to-day triage happens. Save its viewId as
COVERITY_VIEW_OUTSTANDING(also used by the keepalive). - All in project — every defect, including already-classified ones. Useful for batch rescans or full audits.
- Dismissed — defects classified as False Positive / Intentional and ignored. Useful for re-review when the underlying code changed.
- Fixed — defects Coverity now considers fixed. Verification queue.
- Unclassified non-outstanding — corner cases.
When a user asks you to operate on a non-default view, ask them to give you
its viewId from the UI. You can have multiple COVERITY_VIEW_* env vars.
Switching views without confusing the cursor
The view-state cursor described above is per-session. If you want to
fetch view A then view B, do them sequentially (not concurrently): each
fetch-table.sh call sends its own POST that resets the cursor. There is
no need to "go back" -- just call it again with the next viewId.
Coverity attribute reference
Triage values are sent as integer attribute IDs, not names:
Classification (attribute 3)
| ID | Name |
|---|---|
| 20 | Unclassified |
| 21 | Pending |
| 22 | False Positive |
| 23 | Intentional |
| 24 | Bug |
Severity (attribute 1)
| ID | Name |
|---|---|
| 10 | Unspecified |
| 11 | Major |
| 12 | Moderate |
| 13 | Minor |
Action (attribute 2)
| ID | Name |
|---|---|
| 1 | Undecided |
| 2 | Fix Required |
| 3 | Fix Submitted |
| 4 | Modeling Required |
| 5 | Ignore |
External reference (attribute 4)
Free-form string. The scripts always send null.
Sensible (verdict → attributes) mappings
These are what finalize-defect.sh applies when given a verdict from the
suggested vocabulary below:
| Outcome | classification (3) | action (2) |
|---|---|---|
| Real bug, fixed | 24 Bug | 3 Fix Submitted |
| False positive | 22 False Positive | 5 Ignore |
| Cosmetic / harmless | 23 Intentional | 5 Ignore |
Coverity displayImpact → severity ID
finalize-defect.sh maps defect-summary.json#displayImpact to severity:
| displayImpact | severity ID | name |
|---|---|---|
High |
11 | Major |
Medium |
12 | Moderate |
Low |
13 | Minor |
null / missing |
10 | Unspecified |
Suggested verdict vocabulary
This is the taxonomy used historically; you can reuse it or replace it. The
finalize-defect.sh script understands these names directly:
Real bugs (classification = 24 Bug, action = 3 Fix Submitted)
| Verdict | When to use |
|---|---|
TRUE_BUG_MEMORY_CORRUPTION |
OOB read/write, UAF, double-free, type confusion, stack-escape UAF. |
TRUE_BUG_CRASH |
Reachable NULL deref / div-by-zero / assert / fatal. No corruption. |
TRUE_BUG_RESOURCE_LEAK |
fd / memory / lock / refcount leak that accumulates on a reachable path. |
TRUE_BUG_LOGIC |
Wrong result, wrong metric, wrong stored/transmitted data. No crash. |
TRUE_BUG_UB |
Spec-UB (signed overflow, strict aliasing) compiles today but latent. |
False positives (classification = 22 FP, action = 5 Ignore)
| Verdict | When to use |
|---|---|
FALSE_POSITIVE_GUARD_EXISTS |
The code has a check Coverity failed to track. |
FALSE_POSITIVE_UNREACHABLE |
The flagged path cannot be reached under any realistic state. |
FALSE_POSITIVE_TRUSTED_INPUT |
Tainted source is actually trusted (root-owned local file). |
FALSE_POSITIVE_TOOL_MODEL |
Coverity's semantic model is wrong (e.g. doesn't know mallocz cannot return NULL). |
IMPOSSIBLE_CONDITIONS |
Combination of states existing invariants prevent. |
Cosmetic (classification = 23 Intentional, action = 5 Ignore)
| Verdict | When to use |
|---|---|
COSMETIC |
Not a bug: unused value, dead branch, redundant expression. |
Bookkeeping (no Coverity update)
| Verdict | When to use |
|---|---|
CODE_GONE |
The flagged file/function no longer exists. (finalize-defect.sh skips.) |
NEEDS_HUMAN |
Genuinely unsure after reasonable investigation. (finalize-defect.sh skips.) |
Per-defect workdir convention
prepare-defect.sh creates a per-defect directory under
.local/audits/coverity/triage/<scope>/cid-<N>/ with:
defect-summary.json— the row from the table dumpdefect-details.json— the per-defect details (event trace, CWE, checker)source-context.c— ~150 lines of source around the main eventTODO.md— per-defect notes; keep all per-defect artifacts inside
Whatever review approach the user picks, write its outputs here too (e.g. analyzer reports, decider reasoning, commit message draft, build-verify log). Don't pile per-defect stuff at the repo root.
CID vs defectInstanceId
Two IDs for one defect, both flying around the API:
cidis stable across scans. Use it for tracking, comments, your permanent records.defectInstanceIdis per-scan. Thedefectdetails.jsonendpoint takes adefectInstanceId, NOT acid. Eachtable.jsonrow carries the currentlastDefectInstanceIdfor its CID.
When all you have is a CID (e.g. you're acting on an old list, or working from external triage notes), use:
bash .agents/skills/coverity-audit/scripts/resolve-cid-to-diid.sh <cid>
# prints the current defectInstanceId, or "GONE" if Coverity no longer reports it
Internally it queries /reports/defects.json?cid=N and parses
defectInstanceId out of the returned .url field.
Workflow
Fetch a view's table
bash .agents/skills/coverity-audit/scripts/fetch-table.sh \
"${COVERITY_VIEW_OUTSTANDING}" 7 .local/audits/coverity/raw/outstanding
Produces .local/audits/coverity/raw/outstanding-page1.json ... and a
combined flat array at .local/audits/coverity/raw/outstanding-all.json.
Pass the right page count (visible in the UI) for the view.
Reminder: the user must not touch the UI during a fetch.
Fetch per-defect details
bash .agents/skills/coverity-audit/scripts/fetch-details.sh \
.local/audits/coverity/raw/outstanding-all.json \
.local/audits/coverity/details/outstanding
One file per CID at <details>/cid-<N>.json. Idempotent.
Bundle a defect for review
bash .agents/skills/coverity-audit/scripts/prepare-defect.sh <CID> outstanding
Creates .local/audits/coverity/triage/outstanding/cid-<N>/ with the bundle.
The actual review (single-model, multi-model, human) is adhoc — agree
the approach with the user.
Apply a verdict
After review and (if needed) a fix commit:
bash .agents/skills/coverity-audit/scripts/finalize-defect.sh \
<CID> <VERDICT> <scope> .local/audits/coverity/triage/<scope>/cid-<N>/comment.txt \
[<commit-sha>]
<scope> is the same name prepare-defect.sh uses (e.g. outstanding,
dismissed, fixed, unclassified).
This:
- Skips silently for
NEEDS_HUMANandCODE_GONE. - Reads
displayImpactfrom the table dump to derive severity. - Appends
Fix commit: <sha>to the comment when a SHA is given. - Posts JSON to
/sourcebrowser/updatedefecttriage.json.
For scopes other than "outstanding" (re-triaging dismissed/fixed/etc.), finalize-defect prints a warning -- the caller is asserting that the new verdict disagrees with the existing classification. It still applies.
ASCII-only comments — non-negotiable
Coverity Scan sits behind Cloudflare. The WAF rejects bodies containing non-ASCII bytes (em-dashes, smart quotes, accented letters) with a 403 Cloudflare challenge that looks like an expired-session error but isn't.
- Use
--instead of em-dash (U+2014). - Use straight quotes
"'instead of smart quotes. - The scripts reject non-ASCII before the network round-trip.
Project-specific FP guardrails (Netdata)
Most of the cost of a Coverity audit is rejecting false positives. Before calling something a bug in this codebase, rule out these idioms — Coverity mis-models all of them:
z-suffix allocators never fail.mallocz,callocz,reallocz,strdupz,strndupz,mallocz_flexcallfatal()on OOM. They cannot return NULL. Any "possible NULL deref aftermallocz" warning is FP.freez(NULL)is safe. Same forstring_freez, etc.DOUBLE_LINKED_LIST_*macros (libnetdata/linked-lists.h) manage prev/next with their own invariants. Raw pointer manipulation inside them is expected.buffer_*API (libnetdata/buffer/) auto-grows the underlyingbuffer->buffer[]on write. Direct indexing ofbuffer->buffer[N]from callers is the risk, not the API.STRINGis refcounted and interned (libnetdata/string/).string_strdupzincrements refcount on an existing intern;string_dupacquires a new reference. Mixing them is a refcount bug.ARALis a slab allocator (libnetdata/aral/). Objects are reused — UAF looks different here: the same address is re-issued. A pointer that "still works" afteraral_freezmay be a reused object.DICTIONARYhasdictionary_acquired_item_get/_releasewith refcounts. Raw access bypasses the refcount.- Custom locks (
spinlock_lock,rw_spinlock_*) — not pthread. Coverity MISSING_LOCK warnings often misunderstand them. - Platform: glibc + musl. Watch glibc-only assumptions
(e.g.
strerror_rsignature). - Compilers: gcc + clang both must compile.
- Process model: long-running daemon, spawns plugin subprocesses over a line-based stdin/stdout protocol. Plugin stdin is typically trusted; streaming peers are UNTRUSTED remote peers over TCP.
Input trust boundaries (Netdata)
Used to decide whether a tainted-data path is FALSE_POSITIVE_TRUSTED_INPUT:
| Source | Trust | Module |
|---|---|---|
Local /proc, /sys |
Trusted (root-owned) | collectors/proc.plugin/, cgroups.plugin/ |
| Plugin stdin (our plugins) | Trusted | plugins.d/, collectors/*.plugin/ |
| Streaming peer (remote agent) | UNTRUSTED | streaming/, stream-* |
| HTTP request (dashboard API) | Semi-trusted (usually localhost) | web/api/, web/server/ |
| MCP request | Semi-trusted (localhost) | web/mcp/ |
| Cloud (aclk) | Trusted (TLS + token to Netdata Cloud) | aclk/ |
| Config files | Trusted (root-owned) | daemon/config/, health/ |
Failure modes — quick diagnosis
| Symptom | Likely cause |
|---|---|
| HTTP 401 / 403 / 302, response is HTML | Session expired. Recapture cookie from browser. |
| HTTP 403 with Cloudflare challenge HTML | Either non-ASCII in comment, or browser tab closed. |
| keepalive.sh exits non-zero | Same as above. Ask user for a fresh cookie. |
HTTP 200 but defectStatus empty |
XSRF token stale. Recapture cookie. |
Could not parse session from .env |
Wrong cookie format. Paste the FULL -b 'k=v; ...' string. |
| Fetched rows look wrong (different view) | User clicked in the UI mid-fetch. Delete output JSONs and re-fetch. |
.url field missing in /reports/defects.json reply |
Coverity no longer reports this CID -- treat as CODE_GONE. |
| Cookie expires in mid-run despite keepalive | Browser tab was closed. Reopen the tab and recapture cookie. |
Recurring tips
- The
lastDefectInstanceIdfield, NOTcid, is whatdefectdetails.jsonwants. - Coverity caches results aggressively; if a CID disappears from "Outstanding" immediately after finalize, a fresh fetch can take a few seconds to reflect.
- Idempotence:
fetch-details.shskips files already present, so partial runs are safe to re-invoke. prepare-defect.shuses Coverity'sdisplayFileand the main-event line to extract source context. If Coverity's line numbers are stale (after a refactor), the context may not center on the current code -- use it as a hint, not an authoritative location.- Coverity load-balances across
scanN.scan.coverity.comhosts. Whatever hostname your browser landed on must be the one inCOVERITY_HOST; cookies are per-host.