name: repo-value-analysis
description: "Analyze external repositories for adoptable ideas and patterns."
user-invocable: false
argument-hint: "
- Agent
- Read
- Write
- Bash
- Grep
- Glob
routing:
triggers:
- repo value analysis
- does repo add value
- analyze repo for ideas
- what can we learn from
- compare against repo
- read every file in repo pairs_with:
- workflow complexity: Complex category: analysis
Repo Competitive Analysis Pipeline
Overview
This skill conducts systematic 6-phase analysis of external repositories to assess their value for adoption. You dispatch parallel subagents to read and catalog every file in an external repo, inventory your own toolkit in parallel, identify genuine capability gaps, audit those gaps against your actual codebase, and produce a reality-grounded comparison report with adoption recommendations.
The pipeline enforces full file reading (not sampling), parallel execution (up to 8 agent zones simultaneously), and mandatory audit (every recommendation verified before reporting). Optional flags allow local analysis (--local), zone focus (--zone), and quick comparison (--quick skips audit).
Reference Loading Table
| Signal | Load These Files | Why |
|---|---|---|
| errors, error handling | error-handling.md |
Loads detailed guidance from error-handling.md. |
| Phase 2 DEEP-READ: zone agent dispatch | phase2-agent-template.md |
Loads detailed guidance from phase2-agent-template.md. |
| Phase 3 INVENTORY: cataloging our system | phase3-inventory-template.md |
Loads detailed guidance from phase3-inventory-template.md. |
| Phase 5 AUDIT: verifying HIGH/MEDIUM recommendations | phase5-audit-template.md |
Loads detailed guidance from phase5-audit-template.md. |
| Phase 6 REPORT: final report structure | phase6-report-template.md |
Loads detailed guidance from phase6-report-template.md. |
Instructions
Input Parsing
Before starting Phase 1, parse the user's input:
- GitHub URL: Extract repo name from URL (e.g.,
https://github.com/org/repo->repo) - Local path: Validate the path exists and contains files
- Bare repo name: Assume
https://github.com/{name}if it looks likeorg/repo
Set REPO_NAME and REPO_PATH variables for use throughout the pipeline.
Phase 1: CLONE
Goal: Obtain the repository and categorize its contents into zones for parallel deep-read.
Step 1: Clone the repository
git clone --depth 1 <url> /tmp/<REPO_NAME>
If --local flag was provided, skip cloning and use the provided path instead. This allows re-analysis of already-cloned repos without redundant network calls.
Step 2: Count and categorize files
Survey the repository structure:
- Count total files (excluding
.git/) - List top-level directories with file counts
This gives you a baseline for zone complexity and helps identify sub-repo patterns.
Step 3: Define analysis zones
Categorize files into zones based on directory names and file patterns. Zones organize the repo into digestible chunks:
| Zone | Typical directories/patterns | Purpose |
|---|---|---|
| skills | skills/, commands/, prompts/, templates/ |
Reusable skill/prompt definitions |
| agents | agents/, personas/, roles/ |
Agent configurations |
| hooks | hooks/, middleware/, interceptors/ |
Event-driven automation |
| docs | docs/, *.md (non-config), adr/, guides/ |
Documentation and decisions |
| tests | tests/, *_test.*, *.spec.*, __tests__/ |
Test suites |
| config | Config files, CI/CD, *.yaml, *.toml, *.json (root) |
Configuration |
| code | scripts/, src/, lib/, pkg/, *.py, *.go, *.ts |
Source code |
| other | Everything else | Uncategorized files |
Step 4: Cap zones for parallel feasibility
If any zone exceeds ~100 files, split it into sub-zones by subdirectory. Each sub-zone gets its own agent in Phase 2. Cap at ~100 files per agent because:
- Agents MUST read every file in their zone, not sample or skim (sampling introduces bias and misses distinguishing components)
- ~100 files is feasible for a single agent within budget and timeout
- Larger zones are split, so no single agent is overwhelmed
Log the split decisions in the analysis notes for transparency.
Gate: Repository cloned (or local path validated). All files categorized into zones. Zone file counts recorded. No zone exceeds ~100 files (split if needed). Proceed only when gate passes.
Phase 2: DEEP-READ (Parallel)
Goal: Read every file in every zone of the external repository to extract techniques, patterns, and potential capability gaps.
Dispatch 1 Agent per analysis zone (background). Each agent receives the zone name and file list, instructions to read EVERY file (not sample, not skim) to avoid sampling bias, and a structured output template.
See references/phase2-agent-template.md for the full agent instructions template and parallel dispatch rules.
Gate: All zone agents have completed (or timed out after 5 minutes each). At least 75% of agents returned results (tolerance for individual agent failure). Zone finding files exist in /tmp/. Proceed only when gate passes.
Phase 3: INVENTORY (Parallel with Phase 2)
Goal: Catalog our own toolkit simultaneously with Phase 2 deep-read for faster wall-clock time.
Dispatch 1 Agent (in background, concurrent with Phase 2 zone agents) to inventory our system. Running this in parallel is safe because inventory is a read-only catalog of our codebase.
See references/phase3-inventory-template.md for the full agent instructions and parallel-execution rationale.
Gate: Self-inventory agent completed (or timed out after 5 minutes). /tmp/self-inventory.md exists and contains counts for all 4 component types. Proceed only when gate passes.
Phase 4: SYNTHESIZE
Goal: Merge Phase 2 and Phase 3 findings into a draft comparison with candidate adoption recommendations.
Step 1: Read all zone findings and inventory
Read every /tmp/[REPO_NAME]-zone-*.md file and /tmp/self-inventory.md to build a unified picture.
Step 2: Build comparison table
For each capability area discovered in the external repo, document what we have vs what they have:
| Capability | Their Approach | Our Approach | Gap? |
|---|---|---|---|
| ... | ... | ... | Yes/No/Partial |
This table is relative: "what do they have that we lack?" not "what do they have?"
Step 3: Identify candidate recommendations
For each genuine gap (not just a different approach to the same thing):
- Describe what they have
- Describe what we lack
- Rate value honestly: HIGH / MEDIUM / LOW
- HIGH = addresses a real pain point or enables new capability
- MEDIUM = nice to have, improves existing workflow
- LOW = marginal improvement, different but not better
Resist the temptation to over-count differences as gaps. A different naming convention is not a gap worth addressing.
Step 4: Save draft report
Save to research-[REPO_NAME]-comparison.md with:
- Executive summary
- Comparison table
- Candidate recommendations with ratings
- Clear "DRAFT — pending Phase 5 audit" watermark
This draft is intentionally unaudited so you can bail out early if findings look weak.
Gate: Draft report saved. At least 1 candidate recommendation identified (or explicit "no gaps found" conclusion). All recommendations have value ratings. Proceed only when gate passes.
Phase 5: AUDIT (Parallel)
Goal: Reality-check each HIGH and MEDIUM recommendation against our actual codebase to catch "we already have this" false positives.
For each HIGH or MEDIUM recommendation, dispatch 1 Agent (in background). Audit is what separates superficial analysis from rigorous analysis — skipping it produces unverified recommendations that erode trust.
See references/phase5-audit-template.md for the full audit agent instructions, coverage levels (ALREADY EXISTS / PARTIAL / MISSING), and --quick flag behavior.
Gate: All audit agents completed (or timed out after 5 minutes). At least 75% returned results. Audit files exist in /tmp/. Proceed only when gate passes.
Phase 6: REPORT
Goal: Produce the final, reality-grounded report with recommendations verified by Phase 5 audit.
Read audit findings, adjust recommendations (ALREADY EXISTS → move to "Already Covered"; PARTIAL → focus on gaps; MISSING → keep), overwrite research-[REPO_NAME]-comparison.md with the final report, and remove temporary /tmp/ files.
See references/phase6-report-template.md for the full 4-step workflow and final report markdown template.
Gate: Final report saved to research-[REPO_NAME]-comparison.md. Report contains comparison table, adjusted recommendations based on audit findings, and verdict. No "DRAFT" watermark remains. All recommendations have been reality-checked against Phase 5 audit findings (or marked as unaudited if --quick was used). Proceed only when gate passes.
Error Handling
See references/error-handling.md for clone failures, large repos (10k+ files), agent timeouts, no-gaps-found outcome, and self-inventory failures.
References
references/phase2-agent-template.md— Phase 2 DEEP-READ agent templatereferences/phase3-inventory-template.md— Phase 3 INVENTORY agent templatereferences/phase5-audit-template.md— Phase 5 AUDIT agent templatereferences/phase6-report-template.md— Phase 6 REPORT workflow and final templatereferences/error-handling.md— Pipeline error handling