name: 10x-chat description: Chat with web AI agents (ChatGPT, Gemini, Claude, Grok, Perplexity, NotebookLM) via browser automation. Use when stuck, need cross-validation, want a second-model review, need image generation, want deep research, or want to generate video (Google Flow/Veo, Dreamina/Seedance) from web AI tools.
10x-chat — AI Agent Skill
Use 10x-chat to send prompts to web-based AI agents via automated browser sessions. Supports chat, image generation, deep research, and video generation. Sessions use a shared persisted profile by default.
Architecture (v0.9.0+)
10x-chat uses an HTTP browser daemon — a persistent local Node.js server wrapping Playwright:
- One daemon, multiple sessions: parallel CLI runs share a single Chrome instance via HTTP RPC
- Auto-start/stop: daemon launches on first use, shuts down after 30 min idle
- Crash recovery: if daemon dies, next CLI run auto-restarts it
- No zombie Chrome: proper tab ref-counting and graceful shutdown
- State file:
~/.10x-chat/browser-daemon.json(port, PID, bearer token, chmod 600)
Installation
No install needed. Always use @latest:
npx 10x-chat@latest --version
Use npx (not bunx — symlink conflicts in parallel).
When to use
- Stuck on a bug: ask another model for a fresh perspective
- Code review: send PR diff to GPT / Claude / Gemini for cross-review
- Cross-validation: compare answers from multiple models
- Knowledge gaps: leverage a model with different training data
- Image generation: DALL-E via ChatGPT or Imagen via Gemini
- Deep research: long-form analysis via Perplexity, ChatGPT, or Gemini
- Video generation: text/image-to-video via Google Flow (Veo) or Dreamina (Seedance)
Providers
| Provider | Chat | Image | Research | Video | Models | Notes |
|---|---|---|---|---|---|---|
| chatgpt | ✅ | ✅ (DALL-E) | ✅ | ❌ | — | Runs headed by default (anti-bot) |
| gemini | ✅ | ✅ (Imagen) | ✅ | ❌ | Fast, Thinking (default), Deep Think (Ultra tool), Pro | --model switches mode |
| claude | ✅ | ❌ | ❌ | ❌ | — | Runs headed by default |
| grok | ✅ | ✅ | ❌ | ❌ | — | UI changes often, use @latest |
| perplexity | ✅ | ❌ | ✅ | ❌ | — | Best for research with citations |
| notebooklm | ✅ | ❌ | ❌ | ❌ | — | Add sources first, then chat |
| flow | ❌ | ❌ | ❌ | ✅ (Veo) | Omni Flash (default), Veo 3.1 Lite/Fast/Quality | Google login (shared with Gemini) |
| dreamina | ❌ | ❌ | ❌ | ✅ (Seedance) | Seedance 2.0 Fast (default)/2.0; 1.x often plan-locked | login dreamina (CapCut); text + image-to-video |
Commands
# Login (one-time per provider — opens browser for auth)
npx 10x-chat@latest login chatgpt
npx 10x-chat@latest login gemini
npx 10x-chat@latest login claude
npx 10x-chat@latest login grok
npx 10x-chat@latest login perplexity
npx 10x-chat@latest login notebooklm
# Chat
npx 10x-chat@latest chat -p "Review this code for bugs" --provider chatgpt --file "src/**/*.ts"
npx 10x-chat@latest chat --provider gemini --file "path/to/prompt.md" -p "Complete this task"
npx 10x-chat@latest chat --provider gemini --model Pro -p "Solve this math problem"
npx 10x-chat@latest chat --provider gemini --model "Deep Think" -p "Solve this hard problem"
# Multi-step provider delegate for external AI callers (JSONL over stdin/stdout)
npx 10x-chat@latest delegate gemini --model Pro
# then send JSON lines: {"action":"status"}, {"action":"submit","prompt":"..."}, {"action":"capture"}, {"action":"close"}
# Image generation
npx 10x-chat@latest image -p "A fox astronaut in space" --provider chatgpt
npx 10x-chat@latest image -p "Watercolor landscape" --provider gemini --save-dir ./images
# Deep research (long-form, 5-10 min)
npx 10x-chat@latest research -p "Latest breakthroughs in quantum computing" --provider perplexity
npx 10x-chat@latest research -p "Hard technical research" --provider gemini --model "Deep Think"
npx 10x-chat@latest research -p "Market analysis of EVs" --provider chatgpt --timeout 600000
# Video generation (Flow / Veo default, or Dreamina / Seedance)
npx 10x-chat@latest video -p "Drone shot over snowy peaks at sunrise" --provider flow
npx 10x-chat@latest video -p "She walks, TikTok style" --provider flow --orientation portrait --duration 8 --image ref.png # 9:16 i2v
npx 10x-chat@latest video -p "Neon street, rain" --provider flow --model "Veo 3.1 - Quality" --duration 10
npx 10x-chat@latest login dreamina # one-time CapCut login for Dreamina
npx 10x-chat@latest video -p "A paper boat in a rain gutter, macro" --provider dreamina --aspect 9:16 --dreamina-duration 4
npx 10x-chat@latest video -p "The glowing orb floats up" --provider dreamina --image ref.png --ref-mode omni
# Dry run / clipboard
npx 10x-chat@latest chat --dry-run -p "Debug this error" --file src/
npx 10x-chat@latest chat --copy -p "Explain this" --file "src/**"
# Provider chat history + local session management
npx 10x-chat@latest history --provider gemini
npx 10x-chat@latest history --provider all --limit 10
npx 10x-chat@latest status
npx 10x-chat@latest session <id> --render
# NotebookLM
npx 10x-chat@latest notebooklm list
npx 10x-chat@latest notebooklm create "My Research"
npx 10x-chat@latest notebooklm add-url <id> https://...
npx 10x-chat@latest notebooklm add-file <id> ./paper.pdf
npx 10x-chat@latest notebooklm sources <id>
npx 10x-chat@latest notebooklm summarize <id>
# Install bundled skill to coding agent
npx 10x-chat@latest skill install
Provider delegate (v0.10.18+)
Use delegate <provider> when an external AI caller needs flexible multi-step control instead of a single one-shot chat, image, or research command. It opens one provider tab, keeps it alive, reads JSONL commands from stdin, and emits JSONL responses on stdout.
npx 10x-chat@latest delegate gemini --model Pro <<'EOF'
{"id":1,"action":"status"}
{"id":2,"action":"submit","prompt":"Draft three launch angles for this product."}
{"id":3,"action":"capture","timeoutMs":600000}
{"id":4,"action":"close"}
EOF
printf '%s\n' '{"action":"chat","model":"Thinking","prompt":"Review this pasted plan."}' \
| npx 10x-chat@latest delegate gemini
Actions:
status: return url/title/login stategoto: navigate with{ "url": "https://..." }selectModel: provider UI model/mode switch with{ "model": "Pro" }attach: attach files with{ "files": ["path/to/file.png"] }submit: submit prompt without waitingcapture: wait for and extract the current responsechat: select optional model/files, submit, then captureeval: run a page evaluation script for inspection/debuggingclose: close the browser tab and end the delegate
Flags: --model, --headed, --timeout <ms>, --isolated-profile, --no-login-check.
Parallel sessions (v0.9.0+)
HTTP daemon makes parallel runs stable. All providers share one Chrome:
# Login all once
npx 10x-chat@latest login gemini
npx 10x-chat@latest login claude
npx 10x-chat@latest login chatgpt
# Run concurrently — each opens a tab in the shared daemon
npx 10x-chat@latest chat --provider gemini -p "Your prompt" --file context.md &
npx 10x-chat@latest chat --provider claude -p "Your prompt" --file context.md &
npx 10x-chat@latest chat --provider chatgpt -p "Your prompt" --file context.md &
wait
Browser mode
- Headless (default for gemini, grok, perplexity): no visible window
- Headed (default for chatgpt, claude): visible Chrome window (anti-bot protection)
- Force headed:
--headedflag on any provider
The daemon stores the headless/headed mode in its state file. If you switch modes, the daemon restarts automatically.
Profile modes
Shared (default): One browser profile, all providers share cookies. Login once per Google account covers Gemini + NotebookLM.
Isolated: Separate profile per provider (backward compat): --isolated-profile
# Migrate from isolated to shared
npx 10x-chat@latest migrate
Daemon management
# Check daemon state
cat ~/.10x-chat/browser-daemon.json
# Force stop daemon
# (CLI calls stopDaemon() on Ctrl+C automatically)
kill $(cat ~/.10x-chat/browser-daemon.json | python3 -c "import sys,json; print(json.load(sys.stdin)['pid'])")
Tips
- Always use
@latest: ensures newest fixes - Login first:
login <provider>once per provider. Sessions persist in~/.10x-chat/profiles/ - Use
--headedif a provider is flaky (Grok especially) - Keep file sets small: fewer files + focused prompt = better answers
- Research needs longer timeouts:
--timeout 600000for 10-min research jobs - Image gen can take 1-2 min: use
--timeout 120000when needed - Video gen can take 1-5 min: Dreamina queues generations; keep the default 10-min timeout. For image-to-video, pass
--image(Dreamina) or--start-frame/--end-framewith--mode frames(Flow) - Flow duration:
--duration 4|6|8|10sets clip length in seconds (default is whatever Flow's UI defaults to, currently 8s) - Dreamina duration: use
--dreamina-duration(not--duration) to avoid conflict with Flow's flag - Dreamina models are plan-gated:
Seedance 2.0 Fast(cheapest) and2.0are generally available; the CLI errors clearly if a requested model is locked - Use
--dry-runto preview what will be sent
Known issues
- Grok: UI changes frequently. Always use
@latest. Use--headedfor best reliability - ChatGPT/Grok sessions expire quickly: login again if you get "Not logged in" errors
- Some provider UIs are flaky under automation: retry with
--headedbefore assuming a hard failure
Safety
- Never include credentials, API keys, or tokens in the bundled files
- The tool opens a real browser with real login state. Treat it like your own browser session