name: onyx-cli description: Query the Onyx knowledge base using the onyx-cli command. Use when the user wants to search company documents, ask questions about internal knowledge, query connected data sources, or look up information stored in Onyx.
Onyx CLI — Agent Tool
onyx-cli is an agent's interface to the Onyx enterprise knowledge platform. It connects to company documents, apps, and people. Use it to answer questions that require internal knowledge — policies, docs, processes, data from connected sources (Confluence, Google Drive, Slack, etc.).
Prerequisites
1. Check if installed
which onyx-cli
2. Install (if needed)
pip install onyx-cli
3. Check if configured
If a human has already run onyx-cli chat (which includes first-time setup), the CLI is ready — no additional setup needed. The config file at ~/.config/onyx-cli/config.json (or $XDG_CONFIG_HOME/onyx-cli/config.json if set) is read automatically.
Environment variables override the config file and can be used as an alternative when no config file exists:
export ONYX_SERVER_URL="https://your-onyx-server.com" # default: https://cloud.onyx.app
export ONYX_PAT="your-pat"
| Variable | Required | Description |
|---|---|---|
ONYX_SERVER_URL |
No | Onyx server URL (default: https://cloud.onyx.app) |
ONYX_PAT |
Yes | Personal access token for authentication (unless config file exists) |
ONYX_PERSONA_ID |
No | Default agent/persona ID |
ONYX_STREAM_MARKDOWN |
No | Enable/disable progressive markdown rendering (true/false) |
If neither a config file nor environment variables are set, tell the user that onyx-cli needs to be configured and ask them to either:
- Run
onyx-cli chatto complete first-time setup interactively, or - Set
ONYX_SERVER_URLandONYX_PATenvironment variables (ONYX_PAT holds your PAT)
4. Verify configuration
onyx-cli validate-config
Exit code 0 on success. Non-zero with a descriptive error on failure (see exit codes below).
Commands
Search documents
onyx-cli search "What is our deployment process?"
Returns ranked, cited documents from the Onyx knowledge base as JSON. Default output is a lean shape: {"results": [{title, url, source_type, content, updated_at}, ...]}. Results contain only documents the LLM judged relevant, ordered by relevance; content is the full chunk text of each. Use --raw for the full API response (adds per-result citation_id).
# Filter by source
onyx-cli search --source slack,google_drive "auth migration status"
# Recent results only
onyx-cli search --days 30 "recent production incidents"
# Use a specific agent for scoped search
onyx-cli search --agent-id 5 "engineering roadmap"
# Full API response for programmatic use
onyx-cli search --raw "API documentation" | jq '.results[].title'
# Skip query expansion for exact matching
onyx-cli search --no-query-expansion "exact error message text"
| Flag | Type | Description |
|---|---|---|
--source |
string | Filter by source type (comma-separated: slack,google_drive) |
--days |
int | Only return results from the last N days |
--agent-id |
int | Agent ID for scoped search (inherits filters, document sets) |
--raw |
bool | Output full API response (adds per-result citation_id) |
--no-query-expansion |
bool | Skip LLM query expansion (faster, less comprehensive) |
--max-output |
int | Max bytes to print before truncating (0 to disable, default 50000 for non-TTY, ignored with --raw) |
Ask a question
onyx-cli ask "What is our company's PTO policy?"
Streams an LLM-generated answer as plain text to stdout. Use search instead when you need the source documents rather than a synthesized answer. When stdout is not a TTY, output is truncated to 50000 bytes and the full response is saved to a temp file (path printed at the end). Use --max-output 0 to disable truncation.
# Use a specific agent
onyx-cli ask --agent-id 5 "Summarize our Q4 roadmap"
# Pipe context in with the question
cat error.log | onyx-cli ask --prompt "Find the root cause"
# Structured NDJSON output
onyx-cli ask --json "List all active API integrations"
| Flag | Type | Description |
|---|---|---|
--agent-id |
int | Agent ID to use (overrides default) |
--json |
bool | Output NDJSON stream events instead of plain text (bypasses truncation) |
--quiet |
bool | Buffer output and print once at end (no streaming) |
--prompt |
str | Question text (use with piped stdin context) |
--max-output |
int | Max bytes to print before truncating (0 to disable, default 50000 for non-TTY) |
List available agents
onyx-cli agents
onyx-cli agents --json
Prints a table of agent IDs, names, and descriptions. Use --json for structured JSON output. Use agent IDs with search --agent-id or ask --agent-id.
Validate configuration
onyx-cli validate-config
Checks config exists, PAT is present, server is reachable, and credentials are valid. Use before search, ask, or agents to confirm the CLI is properly set up.
Output Conventions
- stdout: Results only (answer text, agent list, status)
- stderr: Progress indicators, warnings, errors
- Non-TTY: No ANSI escape codes, no interactive prompts
- Truncation: When stdout is not a TTY,
searchandaskoutput is truncated to 50000 bytes. Full response is saved to a temp file whose path is printed. Read the temp file for more.
Exit Codes
| Code | Name | Meaning |
|---|---|---|
| 0 | Success | Command completed successfully |
| 1 | General | Unknown or unclassified error |
| 2 | BadRequest | Invalid arguments |
| 3 | NotConfigured | Missing config or PAT |
| 4 | AuthFailure | Invalid PAT (401/403) |
| 5 | Unreachable | Server unreachable |
| 6 | RateLimited | Server returned 429 |
| 7 | Timeout | Request timed out |
| 8 | ServerError | Server returned 5xx |
| 9 | NotAvailable | Feature/endpoint does not exist |
Statelessness
Each invocation is independent. search does not create a chat session. ask creates a one-shot chat session. There is no way to chain context across multiple invocations — every call starts fresh.
When to Use
Use onyx-cli search when:
- You need to find specific documents or gather context for a task
- You want to reason over multiple source documents yourself
- The user asks you to look up or find information in company knowledge
- You need cited, structured results (document IDs, source types, content)
Use onyx-cli ask when:
- The user wants a direct answer, summarization, or synthesis
- A human-readable response is more useful than raw documents
- You need the LLM to reason across sources and produce an answer
Do NOT use either when:
- The question is about general programming knowledge (use your own knowledge)
- The user is asking about code in the current repository (use grep/read tools)
- The user hasn't mentioned Onyx and the question doesn't require internal company data
Examples
# Search for documents
onyx-cli search "What is our deployment process?"
onyx-cli search --source slack "auth migration status"
onyx-cli search --raw "API documentation" | jq '.results[].title'
# Ask for an answer
onyx-cli ask "What are the steps to deploy to production?"
onyx-cli ask --agent-id 3 "What were the action items from last week's standup?"
cat error.log | onyx-cli ask --prompt "What does this error mean?"