adhoc-analysis - SKILL.md Agent Skill

name: adhoc-analysis description: >- How to conduct ad-hoc analyses: gather data from multiple sources, synthesize findings, save reusable analysis artifacts that anyone can re-run for fresh results.

Ad-Hoc Analysis

Ad-hoc analyses are deep-dive investigations that cross-reference multiple data sources and produce a written report with findings. Saving a reusable artifact is optional: answer in chat first unless the user explicitly asks to save the analysis, create a reusable artifact, or re-run/update an existing saved analysis.

When to Use

Use the ad-hoc analysis workflow when:

The user asks a complex question that requires data from multiple sources
The investigation involves cross-referencing (e.g., CRM deals matched against call recordings)
The user explicitly asks for an "analysis" or "deep dive"

Save a reusable analysis only when:

The user explicitly asks to save, create, publish, or re-run a saved analysis
The user is already viewing or re-running an existing saved analysis
The output needs a durable artifact because it includes generated chart images or a reusable refresh workflow

For one-off questions and exploratory deep dives, query the data and answer in chat. Do not call save-analysis just because the user said "analysis" or "deep dive".

If the user asks for an analysis output that needs a bespoke interactive surface, custom visualization, multi-step workflow, or UI that cannot be faithfully represented as saved-analysis Markdown, generated chart images, and structured resultData, create an extension instead of forcing the request into the saved-analysis format. In production mode, call create-extension automatically and tell the user the requested analysis needed bespoke UI/code, so you built it as an extension.

Workflow

Step 1: Understand the Question (catalog-first, clarify-first)

Orient before gathering data. Consult the injected <data-dictionary> and data-source status first to see which sources are configured and which one owns each fact, then settle scope:

What is being analyzed? (deals, users, campaigns, errors, etc.)
What time range?
What data sources are relevant? (map each fact to the one source that owns it)
What output does the user expect? (summary, ranking, comparison, trend)

If the metric definition, date range, or grain is ambiguous and a wrong guess would change the numbers, use the ask-question clarifying tool (multiple-choice) before gathering data. Ask at most once per turn, and skip it when the dictionary or the user already answered.

Step 2: Gather Data from Multiple Sources

Use the available actions to pull data. Read the relevant .agents/skills/<provider>/SKILL.md before querying each source.

Common data source combinations:

Analysis type	Data sources
Deal/account deep dive	`account-deep-dive` bundle, then targeted HubSpot/Gong follow-up
Sales pipeline analysis	HubSpot deals + Gong calls + Slack mentions
Customer health check	HubSpot deals + Pylon support tickets + BigQuery usage events
Content performance	BigQuery pageviews + GA4 + SEO keywords + HubSpot signups
Engineering velocity	GitHub PRs + Jira tickets + BigQuery deploy events
Churn investigation	Stripe billing + HubSpot deals + Pylon tickets + BigQuery usage

Tips for data gathering:

Start with the primary source (e.g., HubSpot for deals), then enrich with secondary sources
For named deal/account deep dives, call account-deep-dive first with the account, company, domain, deal, or opportunity name. It returns HubSpot deals, associated companies/contacts/tickets/notes/emails, Gong call detail, compact transcript excerpts, coverage counts, and gaps. Use targeted hubspot-records or gong-calls follow-ups only when that bundle leaves a specific gap.
Structure named deal/account reports like Fusion Analytics: executive summary, company/deal overview, key contacts and roles, dated timeline, Gong evidence with call dates/titles, current state, risks/blockers, recommended next steps, and methodology/gaps. Do not answer from an all-deals dump or Gong metadata alone.
Use action filters such as query, properties, objectType, company, and limit to narrow results before cross-referencing
For HubSpot deal cohorts, use hubspot-deals structured filters (product, pipeline, closedStatus, closedDateFrom, closedDateTo) for the cohort definition. Do not use query when the user names a specific HubSpot field such as products.
When any first-class provider action is too narrow, use provider-api-catalog / provider-api-docs and then provider-api-request against the provider's real HTTP API. Do not weaken the analysis just because the convenience action is missing an argument.
When stitching identities across sources, follow cross-source-analysis: match on BOTH a stable id AND email (ids can be reassigned), de-duplicate, and record match quality. Email/company-name/domain matches alone are low-confidence — flag them as caveats, not headline numbers.
If a data source is not configured, mention what's missing and work with what's available — never invent rows to fill a gap.

Step 3: Analyze and Synthesize

Don't just dump raw data. Synthesize findings:

Identify patterns, trends, and outliers
Calculate key metrics (totals, averages, rates, distributions)
Rank or categorize items when useful
Call out surprises or actionable insights
Compare against benchmarks or prior periods when possible
Only report figures you actually retrieved from a source — never present a number you did not query. Attribute each figure to its source and time window.
Make the evidence trail explicit enough to audit: source(s), time window, filters, sample size or row count, join/match method, caveats/gaps, and recommended next action when useful.

Step 4: Generate Charts (when useful)

When the analysis benefits from a visual — trends over time, distributions, comparisons between categories — call generate-chart before formatting the report. The action returns a url you embed directly in the markdown.

generate-chart
  --title "Closed-lost deals by month"
  --type bar
  --labels '["Jan","Feb","Mar"]'
  --data '[18, 22, 14]'

Embed the returned URL in resultMarkdown using standard markdown image syntax:

![Closed-lost deals by month](/api/media/closed-lost-deals-by-month-1234567890.png?v=1234567890)

You can include multiple charts in one analysis. Reach for a chart when it communicates the finding faster than a table — don't force visuals on every analysis.

Include the re-generation step in your saved instructions so re-runs produce fresh charts.

Step 5: Format Results as Markdown

Structure the report clearly:

## Key Findings

- **Finding 1**: Specific insight with supporting numbers
- **Finding 2**: Another insight
- **Finding 3**: Actionable recommendation

## Summary Metrics

| Metric               | Value   |
| -------------------- | ------- |
| Total deals analyzed | 54      |
| Average deal size    | $42,300 |
| Win rate             | 23%     |

## Detailed Analysis

### Category 1

[Detailed breakdown with tables, lists, etc.]

### Category 2

[More detail...]

## Methodology

Data sources: HubSpot (deals, contacts), Gong (calls), Slack (mentions)
Time range: Jan 1 – Mar 31, 2026
Filters: S1+ pipeline, closed-lost only

Step 6: Save the Analysis (only when requested)

Call save-analysis with all required fields only when the user asked for a saved/re-runnable analysis or this turn is updating an existing saved analysis:

save-analysis
  --id "closed-lost-q1-2026"
  --name "Q1 2026 Closed-Lost Analysis"
  --description "Deep dive into 54 Fusion deals closed-lost in Q1, cross-referenced with Gong calls and Slack activity"
  --question "Why are we losing deals in Q1? Cross-reference with Gong calls and Slack."
  --dataSources '["hubspot", "gong", "slack"]'
  --instructions "1. Fetch closed-lost S1+ deals from HubSpot (pipeline: Fusion, close date: Q1 2026)
2. For each deal, fetch associated contacts and their emails
3. Search Gong calls matching those contact emails (lookback: 6 months before close date)
4. For top 15 deals by amount, search Slack for '{company name} fusion'
5. Calculate: total deals, avg deal size, win rate, Gong coverage rate
6. Break down by: lost reason, stage reached, deal size tier
7. Highlight deals with no Gong coverage (blind spots)
8. Save results with save-analysis using id='closed-lost-q1-2026'"
  --resultMarkdown "[the full markdown report]"
  --resultData '{"deals": [...], "metrics": {...}}'

resultData is required. Fill it with compact structured evidence from the real data-source action results you used: row samples, aggregate metrics, match decisions, call/message IDs, short transcript/message excerpts, coded themes, sentiment labels, and explicit provider errors for any gaps. Do not include full Gong transcripts, full tool outputs, or raw provider payload dumps. If you cannot query a source, do not save a guessed analysis; report the unavailable/error result instead.

Critical: Write good instructions. The instructions field is what gets sent to the agent on re-run. Be specific:

Which actions to call with which parameters
What filters to apply
How to match records across sources
What metrics to calculate
What structure the output should have
End with "Save results with save-analysis using id='...'"

Step 7: Navigate to the Result

After saving, navigate the user to see the saved analysis:

navigate --view=analyses --analysisId=closed-lost-q1-2026

Re-Running an Analysis

When a user clicks "Re-run" on a saved analysis, the agent receives:

The original question
The saved instructions (step-by-step)
The analysis ID to update

Follow the instructions to gather fresh data, then call save-analysis with the same id to update the results. The createdAt timestamp is preserved; updatedAt is refreshed.

If the user challenges the coverage of a chat answer or saved analysis ("why aren't you pulling more deals?", "where is the updated response?"), rerun the source query or revise from the corrected cohort and include the updated deliverable in the response. Do not summarize that a revision exists without showing it or saving it.

Actions Reference

Action	Purpose
`save-analysis`	Save or update an analysis (id, name, instructions, results)
`get-analysis`	Retrieve a saved analysis by ID
`list-analyses`	List all saved analyses (id, name, description, timestamps)
`delete-analysis`	Delete a saved analysis
`navigate`	Navigate to analyses view: `--view=analyses [--analysisId=<id>]`

Storage

Analyses are stored in the SQL settings table with key prefix adhoc-analysis-{id}. They respect org/user scoping — org-scoped analyses are visible to all org members.

API endpoints (for UI consumption):

GET /api/analyses — list all
GET /api/analyses/{id} — get one
DELETE /api/analyses/{id} — delete one

Best Practices

Use descriptive IDs — closed-lost-q1-2026 not analysis-1
Include methodology — mention data sources, time ranges, and filters in the report
Write self-contained instructions — another agent (or the same agent in a new session) should be able to re-run from the instructions alone
Include structured data — pass compact resultData with metrics, rows, IDs, short excerpts, and coded themes so the UI can render richer views in the future
Keep reports scannable — lead with key findings, put details below
Note data gaps — if a source was unavailable or matching was imperfect, say so
Suggest next steps — end with actionable recommendations when appropriate
Answer in chat — never deflect — present tables, inline charts, and findings directly. Do not say "check the dashboard" or redirect elsewhere.

Large Fan-Out Analyses (30+ Accounts, Deals, or Calls)

For batch analyses spanning many items, chunk the work instead of trying to hold everything in one pass:

Define the cohort — fetch the full list of items (accounts, deals, calls) from the primary source (HubSpot, BigQuery, etc.).
Chunk and persist — process 5-10 items per iteration. For each chunk, write a short per-item findings note (key signals, gaps, theme tags) as an intermediate result to save-analysis or agent scratch.
Synthesize — after all chunks are complete, read the intermediate notes back in and produce the final cross-item synthesis (patterns, rankings, themes, recommendations).

This is the same pattern as a batch document analysis: fetch → process chunk → write intermediate → synthesize. The chunking keeps context manageable and each iteration independent.

After Completing an Analysis — Record Discoveries

After completing a significant analysis, update LEARNINGS.md (via the resources tool) or save-memory with newly confirmed:

Metric definitions (how a metric is actually calculated in this dataset)
Provider gotchas discovered during the analysis
Schema discoveries (table names, column names, join patterns that worked)
Identity-stitching rules confirmed across sources

resources(action: "read", path: "LEARNINGS.md")
resources(action: "write", path: "LEARNINGS.md", content: "<updated>")

Keep entries short and actionable. This is the learning flywheel — the next analysis benefits from what this one confirmed.