orchestrating-capability-development - SKILL.md Agent Skill

name: orchestrating-capability-development description: Use when developing security capabilities (VQL, Nuclei, Janus, Fingerprintx, Scanner integrations) - coordinates architecture, implementation, review, and testing phases with capability-specific quality checks allowed-tools: Skill, Task, TodoWrite, Read, Glob, Grep, AskUserQuestion

Capability Development Orchestration

Systematically guide security capability development through sixteen phases with parallel agent execution, explicit feedback loops, and structured capability directories.

Overview

This skill orchestrates complete security capability development from concept to tested artifact using a 16-phase workflow with capability-specific customizations.

Orchestration is NOT implementation. The orchestrator's responsibilities are:

Analyze capability scope and complexity
Decompose into specialized subtasks
Delegate to appropriate agents (capability-lead, capability-developer, capability-reviewer, capability-tester)
Synthesize results into coherent output
Track progress across the workflow

When to Use This Skill

Use this skill when you need to:

Develop a complete security capability from concept to tested artifact
Create VQL capabilities for Velociraptor-based detection
Write Nuclei templates for vulnerability scanning
Build Janus tool chains for security scanner orchestration
Implement Fingerprintx modules for service detection
Integrate external security scanners into the platform

Symptoms this skill addresses:

Manual orchestration of capability-specific agents
Missing architecture approval for security detection logic
Inconsistent quality checks across capability types
No structured workflow for VQL/Nuclei/Janus development
Lost context between development phases

State Tracking (MANDATORY)

MUST use TodoWrite BEFORE starting any orchestration workflow.

Multi-phase orchestration involves coordinating multiple agents. Without external state tracking, context drift causes forgotten phases, repeated spawns, and missed verification steps.

Create TodoWrite items for each phase BEFORE spawning any agents.

Standard Phase Template

Phase	Name	Purpose	Conditional	Gate
1	Setup	Worktree creation, output directory, MANIFEST.yaml	Always
2	Triage	Classify work type, select phases to execute	Always
3	Codebase Discovery	Explore codebase patterns, detect capability types	Always	G1
4	Skill Discovery	Map technologies to skills, detect gaps, background research	Always
5	Complexity	Technical complexity assessment, execution strategy	Always
6	Brainstorming	Design refinement with human-in-loop	LARGE only
7	Architecture Plan	Technical design AND task decomposition	MEDIUM+
8	Implementation	Capability artifact development	Always	G2
9	Design Verification	Verify implementation matches plan	MEDIUM+
10	Domain Compliance	Capability-specific mandatory patterns validation	Always
11	Code Quality	Capability review for detection accuracy	Always
12	Test Planning	Test strategy and plan creation	MEDIUM+
13	Testing	Test implementation and execution	Always	G3
14	Coverage Verification	Verify test coverage meets threshold	Always
15	Test Quality	No low-value tests, correct assertions, all pass	Always
16	Completion	Final verification, PR, cleanup	Always

G = Compaction Gate - BLOCKING checkpoint requiring compaction-gates.md protocol before next phase.

Work Types: BUGFIX, SMALL, MEDIUM, LARGE (determined in Phase 2: Triage)

Phase Skip Matrix:

Work Type	Skipped Phases
BUGFIX	5, 6, 7, 9, 12
SMALL	5, 6, 7, 9
MEDIUM	None
LARGE	None (all 16 phases)

Quick Reference

Phase	Agents	Execution	Checkpoint
1	-	Setup	-
2	-	Triage	-
3	Explore (1-10 via discovering-codebases)	PARALLEL	-
4	-	Skill mapping	-
5	-	Assessment	-
6	brainstorming skill	Sequential	Human
7	capability-lead + security-lead	PARALLEL	Human
8	capability-developer	Mode-dependent	Per-task if 4+
9	-	Verification	-
10	-	Compliance check	Violations->Human
11	capability-reviewer	Two-stage gated	2+1 retry->escalate
12	test-lead	Sequential	-
13	capability-tester ×3	PARALLEL	-
14	-	Coverage check	-
15	test-lead	Sequential	1 retry->escalate
16	-	Final verification	-

Configuration

This skill uses .claude/config/orchestration-limits.yaml:

Section	Scope	Used For
`inter_phase`	Implementation->Review->Test cycles	tight-feedback-loop pattern
`orchestrator`	Re-invoking entire patterns	Retry limits in Orchestration Guards

No Unilateral Overrides: Agents MUST NOT override config values. To change limits, update the config file or get user approval via AskUserQuestion.

Phase-Specific References

Phase 1: Setup - Worktree creation, output directory, MANIFEST.yaml
Phase 2: Triage - Work type classification (BUGFIX/SMALL/MEDIUM/LARGE)
Phase 3: Codebase Discovery - Explore agent patterns, capability type detection
Phase 4: Skill Discovery - Technology-to-skill mapping, gap detection, background research orchestration
Phase 5: Complexity - Technical assessment, execution strategy
Phase 6: Brainstorming - Design refinement with human-in-loop (LARGE only)
Phase 7: Architecture Plan - Technical design AND task decomposition
Phase 8: Implementation - Capability artifact development
Phase 9: Design Verification - Plan-to-implementation matching
Phase 10: Domain Compliance - Capability-specific mandatory patterns
Phase 11: Code Quality - Capability review for detection accuracy
Phase 12: Test Planning - Test strategy and plan creation
Phase 13: Testing - Test implementation and execution
Phase 14: Coverage Verification - Coverage threshold validation
Phase 15: Test Quality - Assertion quality, no low-value tests
Phase 16: Completion - Final verification, PR, cleanup

Compaction Gates

BLOCKING checkpoints at phase transitions where context accumulates:

Gate	After Phase	Before Phase	Trigger Threshold
1	3	4	70% (140k tokens)
2	8	9	70% (140k tokens)
3	13	14	70% (140k tokens)

Protocol: See compaction-gates.md for:

5-step compaction procedure
"What to Compact / What to Keep" tables
Capability-specific content lists
Rationalization counters

Agent Matrix for Capability Development

Capability Type	Phase 7 Agents	Phase 8 Agents	Phase 11 Agents	Phase 13 Agents
VQL	capability-lead, security-lead	capability-developer	capability-reviewer	capability-tester ×3
Nuclei	capability-lead, security-lead	capability-developer	capability-reviewer	capability-tester ×3
Janus	capability-lead, backend-lead	capability-developer	capability-reviewer, backend-reviewer	capability-tester ×3
Fingerprintx	capability-lead, backend-lead	capability-developer	capability-reviewer, backend-reviewer	capability-tester ×3
Scanner	capability-lead, backend-lead	capability-developer	capability-reviewer, backend-reviewer	capability-tester ×3

Capability Type Matrix

Type	Lead Focus	Developer Focus	Test Focus
VQL	Query structure, artifacts	Velociraptor queries	Query parsing, detection
Nuclei	Template structure, matchers	YAML templates, detection	False positives, CVE coverage
Janus	Pipeline design, tool chain	Go integration, orchestration	Pipeline flow, error handling
Fingerprintx	Probe design, protocol	Go modules, response parsing	Service detection accuracy
Scanner	API design, result normalization	Go/Python client, data mapping	Integration, error cases

See capability-types.md for complete comparison.

Capability-Specific Domain Compliance (Phase 10)

Capability development has specific P0 requirements by type:

VQL Capabilities

Check	Description	Severity
Query Syntax	VQL query parses without errors	P0
Artifact Schema	Output matches expected schema	P0
Performance	Query completes in ≤60s on typical system	P0
Platform Coverage	Works on target platforms (Win/Linux/Mac)	P1

Nuclei Templates

Check	Description	Severity
YAML Syntax	Template validates without errors	P0
Matcher Accuracy	≤2% false positive rate	P0
CVE Metadata	Complete CVE info if CVE-specific	P0
Request Count	≤3 HTTP requests per target	P1

Janus/Fingerprintx/Scanner

Check	Description	Severity
Go Compilation	Compiles without errors	P0
Interface Contract	Implements required interfaces	P0
Error Handling	Graceful failure on tool errors	P0
Rate Limiting	Respects API rate limits (Scanner)	P0

See Phase 10: Domain Compliance for full checklist.

Tight Feedback Loop (Phases 8-11-13)

Implementation->Review->Test cycle with automatic iteration when review or tests fail.

Max iterations: Read from orchestration-limits.yaml
Tracks iteration history in {OUTPUT_DIR}/feedback-scratchpad.md

Pattern: See tight-feedback-loop.md

Checkpoint Configuration

Human approval required at specific phases based on work type:

Work Type	Checkpoints (Phases)
BUGFIX	8 (implementation), 16
SMALL	8 (implementation), 16
MEDIUM	7 (plan), 8, 16
LARGE	6 (design), 7, 8, 16

For large plans (>5 tasks), add intermediate checkpoints during implementation.

See checkpoint-configuration.md for capability-specific configuration.

Parallel Execution

When tasks are independent, spawn in a SINGLE message:

// Phase 7: Architecture - spawn leads in parallel
Task("capability-lead", "Design capability architecture...");
Task("security-lead", "Security assessment...");

// Phase 13: Testing - spawn testers in parallel
Task("capability-tester", "Unit tests for detection logic...");
Task("capability-tester", "Integration tests for tool chain...");
Task("capability-tester", "Edge case tests for false positives...");

Do NOT spawn sequentially when tasks are independent.

File Scope Boundaries

When spawning parallel agents:

Agent	Scope
capability-developer	`{resolved_path}/` (external or internal)
capability-reviewer	READ-ONLY on capability paths
capability-tester	test files only (__test.go, __test.vql)

Path locations:

External (migrated): {CAPABILITIES_ROOT}/modules/{capability}/
Internal (not yet migrated): modules/{module}/ (all modules in the super-repo, including chariot, janus, tabularium, chariot-aegis-capabilities, msp-definitions, praetorian-cli, etc.)

Internal module scope includes:

modules/chariot/backend/ - Guard backend capabilities, handlers, and tasks
modules/tabularium/ - Shared data models and schemas
modules/janus/ - Security tool orchestration
modules/chariot-aegis-capabilities/ - VQL capabilities
Any other modules/ subdirectories required for capability development

See file-scope-boundaries.md for conflict detection protocol.

Rationalization Prevention

You MUST read rationalization-table.md BEFORE proceeding past any compaction gate (after phases 3, 8, or 13).

Read(.claude/skill-library/development/capabilities/orchestrating-capability-development/references/rationalization-table.md)

This file contains anti-rationalization counters for "Momentum Bias", "Context Seems Fine", and other compaction gate bypass patterns.

See rationalization-table.md for capability-specific rationalizations.

Emergency Abort Protocol

Triggers: User request, 3+ escalations in same phase, critical security finding, unrecoverable error, cost/time exceeded.

Flow: Stop work -> Capture state -> Present cleanup options -> Execute cleanup -> Report final state.

See emergency-abort.md for configuration.

Capability Directory Structure

.worktrees/{capability-name}/
├── .capability-development/
│   ├── MANIFEST.yaml              # Capability metadata, status, metrics
│   ├── progress.md                # Session progress tracking
│   ├── skill-manifest.yaml        # Phase 4 output: technology-to-skill mapping
│   ├── discovery.md               # Phase 3 output: codebase patterns
│   ├── architecture.md            # Phase 7 output: technical design
│   ├── plan.md                    # Phase 7 output: implementation tasks
│   ├── feedback-scratchpad.md     # Iteration history for tight feedback loop
│   └── agents/                    # Agent output files
│       ├── capability-lead.md
│       ├── capability-developer.md
│       └── ...
└── [capability artifacts]         # Actual implementation (VQL/Nuclei/Go)

Quality Standards

Each capability type has specific quality targets:

Metric	VQL	Nuclei	Janus	Fingerprintx	Scanner
Detection Accuracy	≥95%	≥95%	N/A	≥98%	N/A
False Positive Rate	≤5%	≤2%	N/A	≤1%	N/A
Pipeline Success	N/A	N/A	≥98%	N/A	N/A
API Integration	N/A	N/A	N/A	N/A	100%

See quality-standards.md for complete metrics.

Exit Criteria

Capability development is complete when:

All applicable phases marked "complete" in MANIFEST.yaml (based on work type)
Discovery artifacts generated (discovery.md, skill-manifest.yaml)
Capability-type quality metrics met (detection accuracy, false positive rate)
All reviewers returned verdict: APPROVED
Test plan created and all tests implemented (quality_score >= 70)
Final verification passed (build, lint, tests)
Metrics tracked in MANIFEST.yaml (tokens, cost, iterations)
Worktree cleaned up (merged and removed, OR kept per user request)
User approves final result
No rationalization phrases; all gate checklists passed; overrides documented

Integration

Called By

/capability command - Primary entry point for users
Parent orchestrators (multi-capability, scanner integration) - Via workflow handoff protocol

Requires (invoke before or at start)

using-git-worktrees (LIBRARY) - Phase 1
- Purpose: Create isolated workspace
- Read(".claude/skill-library/workflow/using-git-worktrees/SKILL.md")
persisting-agent-outputs (CORE) - Phase 1
- Purpose: Discover output directory, set up workspace
- skill: "persisting-agent-outputs"
discovering-codebases-for-planning (CORE) - Phase 3
- Purpose: Parallel discovery with dynamic agent count
- skill: "discovering-codebases-for-planning"

Calls (skill-invocation via Skill tool)

brainstorming (CORE) - Phase 6
- Purpose: Design refinement with human-in-loop
- skill: "brainstorming"
writing-plans (CORE) - Phase 7
- Purpose: Create detailed implementation plan
- skill: "writing-plans"
selecting-plugin-implementation-pattern (LIBRARY) - Phase 2
- Purpose: YAML vs Go plugin decision
- Read(".claude/skill-library/development/capabilities/selecting-plugin-implementation-pattern/SKILL.md")
finishing-a-development-branch (LIBRARY) - Phase 16
- Purpose: Verify, PR options, cleanup
- Read(".claude/skill-library/workflow/finishing-a-development-branch/SKILL.md")

Spawns (agent-dispatch via Task tool)

capability-lead + security-lead - Phase 7
- Purpose: Architecture design and security assessment
- Key mandatory skills: adhering-to-dry, adhering-to-yagni
capability-developer - Phase 8
- Purpose: Capability artifact implementation
- Key mandatory skills: developing-with-tdd, verifying-before-completion
capability-reviewer - Phase 11
- Purpose: Code quality review for detection accuracy
- Key mandatory skills: adhering-to-dry
test-lead - Phases 12, 15
- Purpose: Test strategy and quality validation
- Key mandatory skills: None
capability-tester ×3 - Phase 13
- Purpose: Parallel test implementation (unit, integration, edge cases)
- Key mandatory skills: developing-with-tdd

Pairs With (conditional)

porting-python-capabilities-to-go (LIBRARY) - Porting Python to Go
- Purpose: Coordinates idiom translation
- Read(".claude/skill-library/development/capabilities/porting-python-capabilities-to-go/SKILL.md")
developing-with-subagents (CORE) - Plan has >3 independent tasks
- Purpose: Fresh subagent per task + two-stage review
- skill: "developing-with-subagents"
dispatching-parallel-agents (CORE) - 3+ independent failures
- Purpose: Parallel investigation of unrelated issues
- skill: "dispatching-parallel-agents"
verifying-before-completion (CORE) - Phase 16 completion
- Purpose: Final verification checklist
- skill: "verifying-before-completion"
debugging-systematically (CORE) - Review/test failures
- Purpose: Systematic debugging when failures repeat
- skill: "debugging-systematically"

References

Phase Files

[Phase 1-16 references listed above]

Supporting Documentation

Agent Matrix - Agent selection by capability type
Capability Types - VQL, Nuclei, Janus, Fingerprintx, Scanner comparison
Checkpoint Configuration - Human approval points
Compaction Gates - BLOCKING context management
Directory Structure - Capability workspace organization
Emergency Abort - Safe workflow termination
File Scope Boundaries - Parallel agent conflict prevention
Progress Persistence - Cross-session resume
Quality Standards - Capability-specific quality metrics
Rationalization Table - Capability-specific rationalizations
Tight Feedback Loop - Implementation->Review->Test cycles
Troubleshooting - Common issues and solutions

Prompt Templates

prompts/architect-prompt.md - Phase 7 leads
prompts/developer-prompt.md - Phase 8 developers
prompts/reviewer-prompt.md - Phase 11 reviewers
prompts/tester-prompt.md - Phase 13 testers

External Sources

Pattern	Source
Context Compaction	https://www.anthropic.com/engineering/effective-context-engineering-for-ai-agents
Effort Scaling	https://www.anthropic.com/engineering/multi-agent-research-system
GitHub Flow (branch completion)	https://docs.github.com/en/get-started/using-github/github-flow
REQUIRED SUB-SKILL pattern	https://github.com/obra/superpowers
Tight Feedback Loop	https://awesomeclaude.ai/ralph-wiggum

Related Skills

orchestrating-feature-development - Similar pattern for feature development
orchestrating-integration-development - Similar pattern for third-party integrations
orchestrating-nerva-development - Specialized pattern for fingerprintx modules
persisting-progress-across-sessions - Cross-session persistence for long-running capabilities
developing-with-subagents - Same-session subagent execution with code review