Explore AI Agent Skills & Claude Prompts

Use when STANDING UP a new ML pipeline or onboarding to one for the first time — the cycle-zero work that has to happen before any hypothesis-driven experiment can start. Covers the end-to-end bootstrap: schema design → create small representative dataset → validate features → dry run → small-data run → first full-scale production run. Teaches the three-tier pattern (dry_run → small dataset → full dataset) that catches config and pipeline bugs before they cost full-scale compute. The skill's job ends when the pipeline produces real results; from there, hypothesis-driven iteration belongs in `/deriva-ml:experiment-lifecycle`. Triggers on: 'new ML project', 'set up a new pipeline', 'first model', 'onboard to existing project', 'standing up training', 'how should I get started', 'start small', 'representative dataset', 'development subset', 'what order should I do things', 'best practices for training', 'debug my training' (when the pipeline itself is new/unproven).

schedule Updated 14 days ago

configure-experiment

ALWAYS use this skill when setting up a DerivaML experiment project, adding config groups, or understanding how experiments compose. Triggers on: 'set up experiment', 'config groups', 'project structure', 'hydra defaults', 'DerivaModelConfig', 'experiment preset', 'new project from template'.

schedule Updated 26 days ago

write-hydra-config

Write, bootstrap, and validate hydra-zen config files for DerivaML — DatasetSpecConfig, asset_store, builds(), experiment_config, multirun_config, with_description. Use when adding/editing/updating any config in configs/, when bootstrapping a fresh project's configs from an existing catalog (per-config-group recipes + a worked end-to-end example), or when validating that config RIDs and versions match the catalog (singular validators per group, whole-tree composition, or the single-call deriva_ml_validate_config_file tool). Triggers on: 'write hydra config', 'edit datasets.py', 'edit assets.py', 'bootstrap configs', 'populate configs from catalog', 'validate config', 'validate datasets.py', 'check config matches catalog'.

schedule Updated 27 days ago

compare-model-runs

Use when comparing metrics across multiple ML training executions in DerivaML — ranking model runs by accuracy/F1/loss, finding the best of N recent runs, identifying performance regressions, or aggregating results across a sweep. Covers three metric-storage patterns: features-as-scalars (`deriva_ml_list_feature_values(execution_rids=...)` for one-round-trip catalog query), metrics-as-JSONL-asset files (`Metrics_File` asset, download + parse locally), and prediction-CSV-as-`Execution_Asset` (per-execution tabular CSV plus optional per-analysis summary CSV — the deriva-ml-model-template's default pattern). ALSO use for **artifact provenance tracing** — when the question is 'where did this prediction come from', 'what code produced this asset', 'what dataset version trained this model', or 'why is this metric different from the last run' — `deriva_ml_get_lineage` walks the full data-flow chain in one call; the worked example shows the two-step pattern (lineage walk → workflow resource fetch) that yields the wor

schedule Updated 27 days ago

help

Use when the user asks general questions about DerivaML, Deriva, deriva-mcp, or what they can do with these tools — including 'what is DerivaML', 'how do I use Deriva', 'what can you help me with', 'how does this work', or 'where do I start'. Also use for broad orientation questions about catalogs, datasets, experiments, hydra-zen configuration, ML workflows, or the MCP server when the user is asking 'how do I approach this' rather than requesting a specific action.

schedule Updated 24 days ago

download-bag

ALWAYS use this skill when getting data OUT of a Deriva catalog as a BDBag — exporting a slice of rows + their FK-reachable relations + the bulk objects they reference into a portable, self-describing, checksummed archive. Covers what a BDBag is, the two export paths (server-side export service via `deriva-export` / `DerivaExport`, or client-side orchestration via `deriva-download-cli` / `DerivaDownload`), authoring the export spec (the JSON config that defines what to include), the `bdbag` CLI for validating and materializing bags, asset materialization and caching strategy. Standalone — works on any Deriva catalog. Triggers on: 'download a bag', 'export a bag', 'BDBag', 'export catalog data', 'pull data out', 'download dataset' (when the user means the bag-export mechanism, not the DerivaML Dataset entity), 'deriva-download-cli', 'deriva-export', 'export spec', 'snapshot the catalog', 'bag manifest', 'materialize assets', 'self-describing archive', 'portable export', 'reproducible data drop', 'data package'