research-discovery-setup - SKILL.md Agent Skill

name: Research Discovery Setup description: Set up effective automated discovery - from vague idea to working system in one streamlined conversation.

Research Discovery Setup

Help users configure automated discovery that delivers quality papers consistently.

Quick Start: The 5-Minute Setup

Most common scenario: User has a vague research interest and needs help setting up discovery.

Standard Opening

Agent: "Let's get your discovery set up! Three quick questions:

1. What research area interests you? (even if vague - we'll refine it)
2. How much time daily to review papers? (5 mins? 30 mins?)
3. Need cutting-edge research or learning fundamentals?

I'll configure everything based on your answers."

After User Answers

Agent: "Perfect! Based on what you said, here's your starter configuration:

**Research Focus**: [refined 1-sentence version]
**Keywords**: [2-4 core search terms]
**Sources**: [1-2 most relevant APIs]
**Schedule**: [frequency based on time commitment]
**Quality Filter**: [threshold based on volume needs]

This should get you ~[X] papers [per day/week] to review.

Ready to test it now?"

After Running Test Discovery

Agent: "Test results: [X] papers found

Quick scan of titles:
✓ [Y] papers look highly relevant
~ [Z] papers look somewhat relevant
✗ [W] papers look off-topic

Is this mix working for you? Too many? Too few? Wrong topic?"

Then: Refine based on feedback (see Refinement Patterns below)

Diagnosis: Where Is The User?

Ask These 3 Questions First

Q1: Do they know what they want to research?

✓ Clear idea → Skip to keyword building
✗ Vague idea → Question refinement needed

Q2: Have they tried searching yet?

✓ Yes, bad results → Diagnose what's wrong (see Troubleshooting)
✗ Not yet → Run initial test

Q3: How much time for review?

5-10 min/day → ~10-15 papers/day max
30 min/week → ~20-30 papers/week batch
1+ hour/week → ~50+ papers/week comprehensive

Conversation Patterns

Pattern 1: Vague Idea → Focused Question

User: "I'm interested in AI and finance"

Agent: "Let's narrow that down. Pick ONE direction to start:

Using AI to predict stock prices?
Using AI to analyze financial news sentiment?
Using AI to detect fraud?
Something else?

We can always expand later."

User picks one

Agent: "Great! Now let's scope it:

What specific data? (news? social media? earnings calls?)
What specific outcome? (daily predictions? risk scoring? trading signals?)
What timeframe? (real-time? daily? weekly?)

Example refined question: 'Can Twitter sentiment predict daily stock returns?'

Does something like that capture your interest?"

Pattern 2: Bad Results → Diagnosis

Agent: "Let's diagnose the problem. I need:

What keywords are you using?
Show me 2 papers you WANT
Show me 2 papers you're GETTING but don't want"

[User provides]

Agent analyzes and identifies issue:

Keywords too broad → Add specific terms
Keywords too narrow → Add synonyms, lower threshold
Wrong domain → Add negative keywords
Wrong timeframe → Add date filter

Agent: "The issue is [specific problem]. Here's the fix: [specific change].

Want me to test this now?"

Pattern 3: Optimization (Already Working)

Agent: "You've been running for [X] days. Let's optimize:

Current results:

[Y] papers/day on average
You marked [Z]% as relevant

Assessment: [Good/Too many/Too few/Wrong mix]

Recommendations: [1-2 specific tweaks]

Want to try these adjustments?"

Decision Trees

Too Many Papers (>40/day)

Check relevance:
├─ >70% relevant → Just cap max_papers to 25
├─ 40-70% relevant → Increase threshold +0.1
└─ <40% relevant → Keywords too broad, add specific terms

Too Few Papers (<5/day)

Check specificity:
├─ Keywords very specific → Broaden terms, add synonyms
├─ Keywords normal → Lower threshold -0.1
└─ Field is just slow → Adjust schedule to weekly

Wrong Topic Papers

Check mismatch:
├─ Different domain (crypto vs stocks) → Add negative keywords
├─ Different time period (old papers) → Add date filter: last 2 years
├─ Different methodology → Add method-specific terms
└─ Different language → Add language filter

Missing Key Papers

User shows example paper:
1. Look at that paper's title/abstract
2. Extract key terms it uses
3. Add those terms to search
4. Test again

Quick Reference Cards

Card 1: Question Refinement (30 seconds)

Ask 3 questions:

"What aspect interests you most?"
"Prediction? Analysis? Comparison?"
"Any constraints: domain/time/method?"

Result: "[Specific searchable question]"

Card 2: Keyword Building

Question → Keywords (2-step):
1. Extract core nouns from question
2. Add 1 technical synonym if they used plain language

Example:
Q: "Can Twitter predict stocks?"
K: "twitter sentiment stock prediction"

Card 3: Source Selection

Default recommendations:

CS/ML/AI → arxiv + semantic_scholar
Medical/Bio → pubmed + biorxiv
General Science → openalex + crossref
Economics/Social → openalex + ssrn

Start with 2 sources max, add more only if missing papers.

Card 4: Threshold Settings

Default: 0.7 (works for 80% of cases)

Adjustments:
- User wants only best papers → 0.8
- User wants comprehensive coverage → 0.65
- Getting <5 papers/day → Lower by 0.05
- Getting >40 papers/day → Raise by 0.05

Card 5: Schedule Guide

Based on user's available time:
- 10 min/day → Daily, 10-15 papers, threshold 0.75
- 30 min/3x week → Every other day, 20-30 papers, threshold 0.7
- 1 hour/week → Weekly, 50 papers, threshold 0.65

Refinement Patterns

Refinement 1: Too Much Noise

Symptom: <50% of papers are relevant

Diagnosis: Filter is too permissive OR keywords too broad

Fix:

Option A: Increase threshold (0.7 → 0.75 or 0.8)
Option B: Add specific terms to keywords
Option C: Remove noisy source if one is producing junk

Recommendation: Try A first (fastest), then B if needed

Refinement 2: Too Little Coverage

Symptom: <10 papers/day in active field

Diagnosis: Filter too strict OR keywords too narrow

Fix:

Option A: Lower threshold (0.7 → 0.65)
Option B: Add synonym terms to keywords
Option C: Add another relevant source

Recommendation: Try B first (maintains quality), then A

Refinement 3: Wrong Time Period

Symptom: Getting papers from 2015-2019, user wants recent

Fix:

Add date filter: published_after="2022-01-01"

Or for cutting-edge: published_after="2024-01-01"

Refinement 4: Domain Mismatch

Symptom: Getting crypto papers when researching stocks

Fix:

Add negative keywords:
exclude_terms = ["cryptocurrency", "bitcoin", "crypto", "blockchain"]

Troubleshooting Quick Reference

Problem	Instant Fix
Zero results	Lower threshold to 0.6, broader keywords
100+ results/day	Raise threshold to 0.8, add specific terms
All old papers (>3 years)	Add date filter: last 2 years
Papers in wrong language	Add language filter: English
Wrong scientific domain	Add negative keywords for that domain
Too many duplicates	Normal - Thoth auto-deduplicates by DOI/title
Missing recent papers	Add arxiv source (most current)
Missing published papers	Add openalex or crossref

Advanced Scenarios

Scenario: Comprehensive Literature Review

User: "I'm writing a survey paper, need EVERYTHING"

Agent: "Comprehensive strategy:

Sources: ALL available (arxiv, semantic_scholar, openalex, crossref)
Threshold: 0.65 (permissive)
Schedule: Daily for 2 weeks
Max: 50 papers/day

You'll get 400-700 papers total. This is the 'fire hose' approach. After 2 weeks, we'll refine based on what you found.

Ready for high volume?"

Scenario: Staying Current (Minimal Time)

User: "Just want to stay current, very busy"

Agent: "Time-efficient strategy:

Sources: arxiv only (most current for your field)
Threshold: 0.8 (very strict - only best)
Schedule: Weekly on Monday
Max: 10 papers/week

One 15-minute review session per week. You'll catch major advances without daily commitment.

Sound good?"

Scenario: Learning New Field

User: "I'm new to this field, need foundational papers"

Agent: "Learning strategy:

Sources: openalex + semantic_scholar (broader coverage)
Threshold: 0.7 (balanced)
Date filter: 2019-present (recent but not bleeding edge)
Max: 20 papers/week

Mix of recent reviews and important papers. Good for building understanding.

Ready to start?"

Source-Specific Notes

ArXiv

Best for: CS, Physics, Math, Stats preprints
Freshness: Papers within days of submission
Peer review: Minimal (pre-publication)
When to use: Need cutting-edge, active research areas

Semantic Scholar

Best for: AI-powered semantic search
Freshness: Mix of preprints and published
Coverage: Broad, finds related papers well
When to use: Want discovery of unexpected connections

PubMed

Best for: Medical, biomedical sciences
Freshness: Published papers (peer-reviewed)
Coverage: Comprehensive for medicine
When to use: Medical/bio research only

OpenAlex

Best for: General academic research
Freshness: Mix of everything
Coverage: Most comprehensive (200M+ papers)
When to use: Want broad coverage across fields

Crossref

Best for: DOI metadata
Freshness: Published papers only
Coverage: Journal articles, conferences
When to use: Need published/citable papers

Testing and Validation

After Initial Setup

Agent: "I've configured your discovery. Before scheduling it, let's test:

[Run discovery once]

Got [X] papers. Let's review together:
1. Look at top 5 titles - relevant?
2. Any obvious misses or noise?
3. Is [X] papers manageable for you?

Based on your feedback, we'll adjust before going live."

Weekly Check-In Pattern

Agent: "You've been running for a week. Quick review:

**Volume**: [X] papers/day average
**Your feedback**: [Y]% marked as relevant

[If >70% relevant]: "This is working well! Any tweaks needed?"
[If <50% relevant]: "We need to refine. The issue is likely [diagnosis]"

Summary: The Agent's Mental Model

Start simple: 2 sources, 0.7 threshold, reasonable schedule
Test immediately: Run discovery once to validate
Analyze results: Look at actual papers with user
Refine based on evidence: Adjust what's broken
Iterate: Test again after changes
Deploy when good: Set schedule and let it run

The goal: Get user from "vague idea" to "working discovery" in one conversation, with working system at the end.

Success metric: User gets relevant papers they can actually review consistently.