name: ai-startup-advisor description: > Strategic advisor for evaluating software/AI product ideas and generating defensible startup strategies. Use this skill whenever the user wants to evaluate a product idea, brainstorm startup directions, assess defensibility against foundation model companies, choose between build-vs-buy for AI components, or needs a strategic positioning check for a development project. Trigger on phrases like "is this idea defensible", "what should I build", "will OpenAI/Anthropic eat this", "startup idea", "product strategy", "moat", "competitive positioning", "should I build X", "vertical AI opportunity", "business model for AI", or any discussion about whether a software product is worth building given the current AI landscape. Also trigger when the user is evaluating technical architecture decisions through a strategic lens — e.g. "should I use a fine-tuned model or API calls" where the answer has competitive implications. allowed-tools: [Read, Write, Edit, Bash, WebFetch, WebSearch]
AI Startup Advisor
A strategic evaluation framework for software founders navigating the 2025–2026 landscape where foundation model companies (OpenAI, Anthropic, Google) are absorbing the application layer.
This skill helps with two modes:
- Evaluate mode — Score a specific idea against defensibility criteria
- Generate mode — Brainstorm ideas for a given domain or vertical
For deep background research and examples, read references/strategic-research.md in this skill's
directory. Load it when the user needs detailed examples, case studies, or academic backing for a
recommendation.
Core mental model: The Foundation Model Squeeze
The strategic reality is simple: if a foundation model company's next quarterly release could replicate 80% of your value proposition, you are building a substitute, not a complement.
The key test — apply it to every idea:
"OpenAI Next Release" Test: Would this product get worse or better when the underlying foundation model improves? If better → you're a complement (safe). If replaceable → you're a wrapper (danger).
Value in AI follows Teece's Profiting from Innovation framework: when the core innovation (the model) is freely accessible via API, profits flow to owners of complementary assets — domain data, regulatory clearances, workflow integration, customer trust — not to the model accessor.
Mode 1: Evaluate an idea
When the user presents a specific product or startup idea, run it through the Defensibility Scorecard. Score each dimension 0–3 and provide a total with commentary.
Defensibility Scorecard (0–3 per dimension, max 21)
| # | Dimension | 0 (None) | 1 (Weak) | 2 (Moderate) | 3 (Strong) |
|---|---|---|---|---|---|
| 1 | Proprietary Data Flywheel | No unique data; uses public datasets or generic API output | Collects some user data but not structured for compounding advantage | Structured data collection where each customer improves the product | Unique dataset that compounds over time and cannot be replicated without equivalent operations (e.g. Tempus's clinical-molecular DB, CrowdStrike's threat telemetry) |
| 2 | Workflow Integration Depth | Standalone tool, no integration | Light integration (browser extension, Zapier) | Moderate integration into existing tools (plugin, API connector) | Deep embedding in mission-critical workflow (EHR integration, case management system, CI/CD pipeline) with high switching costs |
| 3 | Regulatory / Compliance Barrier | Unregulated space, no certifications needed | Light compliance (SOC 2, GDPR basics) | Moderate regulatory requirements (HIPAA, industry-specific certs) | Heavy regulatory burden that takes years (FDA clearance, FedRAMP, legal bar requirements) |
| 4 | Domain Expertise Moat | Generic capability any dev team could build | Some domain knowledge required but acquirable in weeks | Deep domain expertise required; team needs practitioners | Requires practitioners embedded in the team + years of domain iteration (lawyers at Harvey, clinicians at Abridge) |
| 5 | Counter-Positioning vs. Incumbents | Directly competing with model providers on their turf | Competing with SaaS incumbents who could add AI features | Different business model that incumbents could adopt with effort | Outcome-based pricing or service model that incumbents cannot copy without cannibalizing their own revenue |
| 6 | Production Complexity | Works out of the box with an API call | Some prompt engineering and pipeline work | Significant engineering for reliability, edge cases, monitoring | The last 10% of quality requires 10–100x the effort of a prototype; years of production hardening (compliance engines, safety-critical systems) |
| 7 | Vendor Neutrality Advantage | Tied to one model provider | Works with multiple providers but not a selling point | Multi-model support is a feature customers value | Structural requirement for vendor neutrality — model providers cannot credibly offer this (eval tools, security/guardrails, observability) |
Scoring interpretation
- 18–21: Strong defensibility. Build with confidence. Focus on execution speed.
- 13–17: Moderate defensibility. Identify which dimensions score low and develop a plan to strengthen them within 6–12 months.
- 8–12: Weak defensibility. Viable as a short-term revenue play but vulnerable to platform absorption. Consider pivoting toward a higher-scoring adjacent opportunity.
- 0–7: Wrapper territory. The "OpenAI Next Release" test likely fails. Rethink fundamentally.
How to present the evaluation
Use this structure:
## Defensibility Evaluation: [Idea Name]
### Quick Verdict
[One sentence: defensible / risky / wrapper territory]
### Scorecard
[Table with scores and brief justification per dimension]
### Total: X/21 — [Interpretation]
### Key Risks
[Top 2-3 specific threats, referencing which foundation model companies or incumbents
could absorb this and how]
### Strengthening Moves
[Concrete actions to improve the weakest dimensions — e.g., "Build EHR integration
with Epic to move Workflow Integration from 1→3"]
### Strategic Recommendation
[Build / Pivot / Avoid — with reasoning]
Mode 2: Generate ideas for a domain
When the user asks for ideas within a domain or vertical, use the Opportunity Generator framework.
Step 1: Identify the vertical characteristics
For the given domain, assess:
- Regulatory density: How heavily regulated? (Stronger = better for defensibility)
- Data fragmentation: Is domain data scattered across many siloed sources? (More fragmented = bigger data flywheel opportunity)
- Workflow pain: How manual and error-prone are current workflows? (More painful = higher willingness to pay for automation)
- Incumbent software quality: How good is existing software? (Worse = more displacement opportunity)
- Expert labor cost: How expensive are the humans currently doing this work? (Higher = larger addressable market for service-as-software)
Step 2: Apply the five defensible corridors
For each corridor, generate 1–2 specific ideas tailored to the domain:
- Vertical AI product — Deep domain-specific tool with proprietary data and workflow integration
- Infrastructure / picks-and-shovels — Tooling that the domain needs but model providers won't build (evaluation, compliance, data pipelines)
- Service-as-software — AI delivering a service humans currently perform, priced on outcomes not seats
- AI-native agency — Productized service with software margins, combining AI automation with domain expertise
- Data-centric business — Building and monetizing a proprietary domain-specific dataset
Step 3: Score each generated idea
Run each idea through the Defensibility Scorecard (abbreviated — just the total and top 2 dimensions).
Step 4: Present as a ranked list
## Opportunities in [Domain]
### Domain Profile
[Brief assessment of the 5 vertical characteristics]
### Top Ideas (ranked by defensibility score)
**1. [Idea Name]** — Score: X/21
[2-3 sentence description]
Strongest moats: [top 2 dimensions]
Business model: [pricing approach]
Example comparable: [existing company doing something similar, if any]
**2. [Idea Name]** — Score: X/21
...
Decision frameworks for common strategic questions
"Should I fine-tune or use API calls?"
This is as much a strategic question as a technical one:
- API-only: Faster to ship, but zero model differentiation. Anyone can replicate.
- Fine-tuned on public data: Slight performance edge, but the data is available to competitors.
- Fine-tuned on proprietary data: Real moat — your data flywheel produces a model competitors can't match.
- Custom model architecture: Deepest moat but highest cost. Only justified when domain-specific architectures genuinely outperform general models (e.g., protein folding, molecular simulation).
Recommendation: Start with API calls to validate the market. Invest in fine-tuning only after you have proprietary data worth fine-tuning on. The model is not your moat — the data pipeline feeding it is.
"Should I go model-agnostic or bet on one provider?"
79% of enterprises paying for OpenAI also pay for Anthropic. Design for multi-provider from day one:
- Abstract the model layer behind an internal interface
- Shallow API integration migrates in 20–40 hours; deep integration takes 80–120 hours
- Model-agnostic architecture is itself a mild defensibility signal — it shows sophistication
- Exception: If you're building infrastructure that depends on a specific provider's unique capability (e.g., MCP for Anthropic), that's fine as long as you plan for portability
"SaaS vs. outcome-based pricing?"
The shift from per-seat to per-outcome pricing is the single biggest business model disruption in AI:
- Per-seat SaaS: Legacy model. AI makes it vulnerable because one AI-augmented user does the work of 5. Your revenue shrinks as your product improves.
- Per-outcome: Aligns your revenue with value delivered. Harder to predict revenue early on, but structurally defensible — incumbents can't adopt it without cannibalizing themselves.
- Hybrid: Charge a platform fee + per-outcome variable. De-risks early-stage revenue while preserving strategic positioning.
"Is this a feature or a product?"
Apply this test: If you removed the AI component, would there still be a product worth paying for?
- Just a feature: The AI is doing something generic (summarization, translation, chat). Model providers will ship this.
- A product: The AI is embedded in a specific workflow with domain data, integrations, and compliance. Removing the AI leaves a broken workflow, not a simpler tool.
Red flags to always flag
When evaluating any idea, automatically check for these anti-patterns:
- "Chat with X" pattern — Chat-with-PDF, chat-with-database, chat-with-codebase. These are features, not products. Model providers ship these quarterly.
- "AI-powered [commodity]" — AI writing assistant, AI image generator, AI code completion without deep vertical focus. Competing directly with model providers' showcase apps.
- Single-model dependency — If one API provider going down or changing pricing kills your product, you have no moat.
- No data strategy — If you can't articulate how each customer interaction makes your product better for the next customer, you don't have a flywheel.
- Regulated industry without regulated team — Building for healthcare without clinicians, for legal without lawyers, for finance without compliance experts. Domain expertise is the moat — you need it on the team, not as advisors.
Reference material
For detailed case studies, academic frameworks, funding data, and company examples, read:
references/strategic-research.md
Load this when:
- The user asks "why?" or wants evidence for a recommendation
- You need specific company examples or revenue figures
- The user wants academic citations (Teece, Eisenmann, Zhu, Agrawal)
- A deeper dive into a specific vertical (healthcare, legal, biotech, infrastructure) is needed