pp-arxiv

star 1.6k

Printing Press CLI for arXiv. Public Atom API for searching and fetching arXiv e-print metadata.

mvanhorn By mvanhorn schedule Updated 6/16/2026

name: pp-arxiv description: "Printing Press CLI for arXiv. Public Atom API for searching and fetching arXiv e-print metadata." author: "Hiten Shah" license: "Apache-2.0" argument-hint: " [args] | install cli|mcp" allowed-tools: "Read Bash" metadata: openclaw: requires: bins: - arxiv-pp-cli


arXiv — Printing Press CLI

Prerequisites: Install the CLI

This skill drives the arxiv-pp-cli binary. You must verify the CLI is installed before invoking any command from this skill. If it is missing, install it first:

  1. Install via the Printing Press installer. It defaults binaries to $HOME/.local/bin on macOS/Linux and %LOCALAPPDATA%\Programs\PrintingPress\bin on Windows:
    npx -y @mvanhorn/printing-press-library install arxiv --cli-only
    
  2. Verify: arxiv-pp-cli --version
  3. Ensure the reported install directory is on $PATH for the agent/runtime that will invoke this skill.

If the npx install fails (no Node, offline, etc.), fall back to a direct Go install (requires Go 1.26.3 or newer):

go install github.com/mvanhorn/printing-press-library/library/other/arxiv/cmd/arxiv-pp-cli@latest

If --version reports "command not found" after install, the runtime cannot see the binary directory on $PATH. Do not proceed with skill commands until verification succeeds.

When Not to Use This CLI

Do not activate this CLI for requests that require creating, updating, deleting, publishing, commenting, upvoting, inviting, ordering, sending messages, booking, purchasing, or changing remote state. This printed CLI exposes read-only commands for inspection, export, sync, and analysis.

Unique Capabilities

These capabilities aren't available in any other tool for this API.

Research discovery

  • query — Search arXiv with documented query expressions and agent-friendly output controls.
  • query — Fetch latest AI/research papers by category using submitted-date sorting and bounded result counts.
  • query — Fetch exact papers by arXiv ID or versioned arXiv ID.

Command Reference

query — Manage query

  • arxiv-pp-cli query --search-query 'cat:cs.AI' --max-results 5 — Search arXiv papers or fetch recent papers by category.
  • arxiv-pp-cli query --id-list 1706.03762 --max-results 1 — Fetch exact papers by arXiv ID or versioned arXiv ID.

Finding the right command

When you know what you want to do but not which command does it, ask the CLI directly:

arxiv-pp-cli which "<capability in your own words>"

which resolves a natural-language capability query to the best matching command from this CLI's curated feature index. Exit code 0 means at least one match; exit code 2 means no confident match — fall back to --help or use a narrower query.

Auth Setup

No authentication required.

Run arxiv-pp-cli doctor to verify setup.

Agent Mode

Add --agent to any command. Expands to: --json --compact --no-input --no-color --yes.

  • Pipeable — JSON on stdout, errors on stderr

  • Filterable--select keeps a subset of fields. Dotted paths descend into nested structures; arrays traverse element-wise. Critical for keeping context small on verbose APIs:

    arxiv-pp-cli query --search-query 'cat:cs.AI' --max-results 5 --agent --select entries.id,entries.title
    
  • Previewable--dry-run shows the request without sending

  • Live-first — arXiv search is most useful against the live API. Generic sync/local-store commands are present from the Printing Press scaffold, but this CLI should be treated as live-query-first because arXiv /api/query requires caller-supplied search or ID parameters.

  • Non-interactive — never prompts, every input is a flag

  • Read-only — do not use this CLI for create, update, delete, publish, comment, upvote, invite, order, send, or other mutating requests

Response envelope

Commands that read from the local store or the API wrap output in a provenance envelope:

{
  "meta": {"source": "live" | "local", "synced_at": "...", "reason": "..."},
  "results": <data>
}

Parse .results for data and .meta.source to know whether it's live or local. A human-readable N results (live) summary is printed to stderr only when stdout is a terminal — piped/agent consumers get pure JSON on stdout.

Agent Feedback

When you (or the agent) notice something off about this CLI, record it:

arxiv-pp-cli feedback "the --since flag is inclusive but docs say exclusive"
arxiv-pp-cli feedback --stdin < notes.txt
arxiv-pp-cli feedback list --json --limit 10

Entries are stored locally at ~/.arxiv-pp-cli/feedback.jsonl. They are never POSTed unless ARXIV_FEEDBACK_ENDPOINT is set AND either --send is passed or ARXIV_FEEDBACK_AUTO_SEND=true. Default behavior is local-only.

Write what surprised you, not a bug report. Short, specific, one line: that is the part that compounds.

Output Delivery

Every command accepts --deliver <sink>. The output goes to the named sink in addition to (or instead of) stdout, so agents can route command results without hand-piping. Three sinks are supported:

Sink Effect
stdout Default; write to stdout only
file:<path> Atomically write output to <path> (tmp + rename)
webhook:<url> POST the output body to the URL (application/json or application/x-ndjson when --compact)

Unknown schemes are refused with a structured error naming the supported set. Webhook failures return non-zero and log the URL + HTTP status on stderr.

Named Profiles

A profile is a saved set of flag values, reused across invocations. Use it when a scheduled agent calls the same command every run with the same configuration - HeyGen's "Beacon" pattern.

arxiv-pp-cli profile save briefing --json
arxiv-pp-cli --profile briefing query --search-query 'cat:cs.AI' --max-results 5
arxiv-pp-cli profile list --json
arxiv-pp-cli profile show briefing
arxiv-pp-cli profile delete briefing --yes

Explicit flags always win over profile values; profile values win over defaults. agent-context lists all available profiles under available_profiles so introspecting agents discover them at runtime.

Exit Codes

Code Meaning
0 Success
2 Usage error (wrong arguments)
3 Resource not found
5 API error (upstream issue)
7 Rate limited (wait and retry)
10 Config error

Argument Parsing

Parse $ARGUMENTS:

  1. Empty, help, or --help → show arxiv-pp-cli --help output
  2. Starts with install → ends with mcp → MCP installation; otherwise → see Prerequisites above
  3. Anything else → Direct Use (execute as CLI command with --agent)

MCP Server Installation

Install the MCP binary from this CLI's published public-library entry or pre-built release, then register it:

claude mcp add arxiv-pp-mcp -- arxiv-pp-mcp

Verify: claude mcp list

Direct Use

  1. Check if installed: which arxiv-pp-cli If not found, offer to install (see Prerequisites at the top of this skill).
  2. Match the user query to the best command from the Unique Capabilities and Command Reference above.
  3. Execute with the --agent flag:
    arxiv-pp-cli <command> [subcommand] [args] --agent
    
  4. If ambiguous, drill into subcommand help: arxiv-pp-cli <command> --help.
Install via CLI
npx skills add https://github.com/mvanhorn/printing-press-library --skill pp-arxiv
Repository Details
star Stars 1,606
call_split Forks 431
navigation Branch main
article Path SKILL.md
More from Creator