gemini-deep-research - SKILL.md Agent Skill

name: gemini-deep-research description: > Platform knowledge for Gemini Deep Research at gemini.google.com. Standalone: spawns browser-researcher agent. Or loaded by browser-researcher at runtime via skill_path. Trigger on "gemini deep research", "use gemini to research", "run deep research on gemini", "google deep research", or when explicitly requesting Gemini's native deep research capability. context: fork model: opus

Gemini Deep Research — Platform Knowledge

Standalone Invocation

If invoked directly via /gemini-deep-research, spawn the browser-researcher agent:

Agent(
  subagent_type="browser-researcher",
  name="gemini-researcher",
  prompt="skill_path: <skill-path>/SKILL.md\nresearch_prompt: <from user>\noutput_path: <from user or .work/gemini-report.md>\ncaller: <your name>",
  run_in_background=true
)

Then relay messages between user and agent via SendMessage. If loaded by browser-researcher via Read, ignore this section.

Platform Configuration

url: https://gemini.google.com/app
backend: playwright-cli
submit_method: enter_key
plan_review: gated
hard_timeout: 90
completion_signal: "I've completed your research"

Playwright-CLI Backend

All browser interaction is driven by scripts/pw-driver.py via Bash. The agent calls subcommands; the driver handles Chrome lifecycle, CDP attach, and all DOM interaction internally.

Phase	Subcommand
Login check	`python3 scripts/pw-driver.py status` (exit 10 = login required)
One-time login	`python3 scripts/pw-driver.py login` (user runs interactively)
Research activation + submit	`python3 scripts/pw-driver.py start-research --prompt-file=<path>`
Plan capture	`python3 scripts/pw-driver.py wait-plan --timeout=120`
Plan approval	`python3 scripts/pw-driver.py approve-plan`
Plan revision	`python3 scripts/pw-driver.py revise-plan --text-file=<path>` (or `--text=<str>`)
Research monitoring	`python3 scripts/pw-driver.py wait-complete --timeout=5400`
Report extraction	`python3 scripts/pw-driver.py extract --output=<path>`
Session close	`python3 scripts/pw-driver.py close`

Exit codes: 0 success · 10 login required · 20 timeout · 30 extraction failure · 40 DOM mismatch.

DOM Reference (verified March 2026)

Report container: div.markdown.markdown-main-panel (select largest by text length)
Prompt input: textbox "Enter a prompt for Gemini"
Deep Research toggle: button "Deselect Deep research" (X on chip)
Model picker: button "Open mode picker"
Completion text: "I've completed your research. Feel free to ask me follow-up questions or request changes."

These selectors may change as Google updates Gemini's UI.

Platform Quirks

Clarifying forms: Gemini may show refinement questions (checkboxes, dropdowns) before research. Handle autonomously based on the research prompt context. Only escalate genuinely ambiguous questions.
Pro model selection: Always select Pro for best results. If already selected, dismiss picker.
Report panel: Opens on the right side after completion. If not visible, look for the report link card in chat and click it.
Research activation: 4-step sequence — Tools button → Deep Research item → model picker → Pro model.