name: find-literature description: This skill should be used when the user asks to "find papers", "search for papers", "what does the literature say", "find research on", "academic papers about", "literature review", "cite papers", or needs to answer questions using academic literature. metadata: internal: true allowed-tools: Bash(asta literature find *) Bash(asta literature interactive *) Bash(asta papers *) TaskOutput Bash(jq *)
Find Literature
Search academic literature for papers relevant to a query. The search will return a ranked list of papers with relevance scores, summaries, and supporting snippets.
This is an advanced search, so the query can be long and complex. You may ask the user questions to clarify the topic and refine the query before running the search.
Choosing find vs interactive
Two entry points are available. Both run the same retrieval pipeline (criteria extraction, candidate retrieval, re-ranking). The significant difference is what happens around that pipeline — and the right choice is mostly driven by what the session as a whole is about, not by the complexity of any single query:
asta literature interactive— default for literature-focused sessions. Use it when the first interaction in the session is about literature search or exploration, or when the entire session is focused on literature. It runs the full Asta paper-finder agent: better planning for complex or multi-faceted queries (decomposition, iterative search strategies) and a reasoning / results-verification loop that can re-run or refine the search when the initial results don't satisfy the criteria. It also maintains conversation state via athread_id, so follow-ups can build on prior results: filtering ("now narrow to surveys from 2023+"), aggregating ("group by venue"), following relations ("expand on the third paper's citations"), or iterative refinement.asta literature find— one attempt, no agent loop. Use it primarily when literature search is a sub-step inside some other multi-step flow (data analysis, code generation, a broader research workflow, etc.) where the same flow may issue many unrelated searches and the latency of the full agent loop is not worth it for each one. Lower quality, much faster. Don't reach forfindjust because a single query "looks simple" — if the session is literature-centered, preferinteractive.
Continuing a conversation — recommended pattern (--thread-dir): for any
multi-turn search session, pick a directory named .asta/literature/threads/<YYYY-MM-DD>-<slug>/
(date-prefixed slug, matching the convention used by other agents) and pass it
on every turn. The CLI auto-resumes the conversation, writes one artifact per
turn with a .NNN index suffix, and maintains a DIR/index.json that records
turn order, queries, narratives, paper counts, and the thread_id. Use a
meaningful -o basename per turn — the CLI inserts the turn index for you.
# Turn 1 — picks a thread dir; CLI creates DIR/index.json and DIR/transformer-survey.001.json
asta literature interactive "transformer architecture survey" \
--thread-dir .asta/literature/threads/2026-05-04-transformer-architectures \
-o transformer-survey.json
# Turn 2 — same dir; thread_id auto-resumes from index.json.
# Writes DIR/narrow-2023.002.json and appends a turn entry to DIR/index.json.
asta literature interactive "narrow to 2023+ long-context surveys" \
--thread-dir .asta/literature/threads/2026-05-04-transformer-architectures \
-o narrow-2023.json
Start a fresh conversation when the topic shifts substantially: pick a new
--thread-dir. Reading the conversation back is straightforward — jq over
DIR/index.json for the turn list, then open the per-turn JSON files referenced
by turns[].file.
Without --thread-dir, an invocation is a one-shot turn that doesn't continue
any prior conversation. Use that for ad-hoc single queries; for any multi-turn
session, always pass --thread-dir.
Output deltas vs find: the interactive JSON adds thread_id and narrative
(the agent's terminal response text). It does not currently populate
citationContexts or publicationDate per paper; for those, use find or the
asta papers commands.
Tools Available
Paper Finder
Run in background for comprehensive searches (30-60s):
# Save to a temporary file with explicit -o parameter (required)
Bash(command="asta literature find 'query' -o /tmp/literature-search-result.json --timeout 300", run_in_background=true)
After the search completes, move the file to .asta/documents/literature/find/ and index it:
# Create directory if it doesn't exist
mkdir -p .asta/documents/literature/find
# Move the result file to the documents directory
mv /tmp/literature-search-result.json .asta/documents/literature/find/
# Index the file using asta-documents
Bash(command="asta documents add file://.asta/documents/literature/find/literature-search-result.json --name='Literature Search: <query>' --summary='Search results for: <query>'")
Browse results with jq:
jq '[.results | sort_by(-.relevanceScore) | .[0:10][] | {title, year, venue, corpusId, score: .relevanceScore, summary: .relevanceJudgement.relevanceSummary}]' <filename>
Go through all highly relevant papers, extracting relevance criteria, snippets, and citation contexts from each.
Semantic Scholar CLI (fast targeted searches)
# Get paper details
asta papers get ARXIV:2005.14165
# Search papers
asta papers search "transformers" --year 2023- --limit 20
# Get citations
asta papers citations ARXIV:2005.14165
# Author search
asta papers author search "Geoffrey Hinton"
asta papers author papers <author_id>
Recommended fields: --fields title,abstract,year,authors,venue,citationCount
JSON Structure
The output is a LiteratureSearchResult with the following structure:
Top level:
query: string - the search queryresults: array of Paper objects
Each paper in results[]:
corpusId: int - Semantic Scholar corpus IDtitle: string - paper titleabstract: string | null - paper abstractyear: int | null - publication yearauthors: array of {name: string, id: string}venue: string | null - publication venueurl: string | null - Semantic Scholar URLcitationCount: int | null - number of citationsrelevanceScore: float - 0-1 (higher = more relevant to query)relevanceJudgement: object with:relevance: int - overall relevance scorerelevanceSummary: string - AI explanation of relevancerelevanceCriteriaJudgements: array of per-concept judgements with:name: string - concept namerelevance: int - relevance for this conceptrelevantSnippets: array of supporting text excerpts
snippets: array of text excerpts from paper body (if open access):text: string - excerpt textsectionTitle: string - section name
citationContexts: array of citation contexts:text: string - how other papers cite this worksourceCorpusId: int - corpus ID of citing paper
Example access patterns:
# Top 10 papers by relevance
jq '[.results | sort_by(-.relevanceScore) | .[0:10][] | {title, year, score: .relevanceScore}]' <filename>
# Papers with relevance summary
jq '.results[] | {title, summary: .relevanceJudgement.relevanceSummary}' <filename>
# Extract snippets from a specific paper
jq '.results[] | select(.corpusId == 123456) | .snippets[].text' <filename>
Using Results
After a search completes, results can be used by the Literature Report skill.