name: "desktop-vision" description: "Desktop vision capabilities — capture screenshots, stream desktop apps in real-time, and analyze UI changes using DesktopLens MCP tools"
Desktop Vision Skill
You have access to DesktopLens MCP tools that let you see and analyze desktop applications outside the browser. Use these tools to capture screenshots, stream windows in real-time, compare UI changes, and manage plugins.
Available Tools
Core Tools
desktoplens_list_windows
List all visible desktop windows. Use this first to find the target window.
filter(optional): Fuzzy search by window title or app name
desktoplens_screenshot
Capture a screenshot of a specific window.
window_idorwindow_title: Target window identifierformat(optional):png|jpeg|webp(default:png)max_width/max_height(optional): Resize constraintsannotate(optional): Add grid overlay for region reference
desktoplens_watch
Start real-time streaming of a window to a Chrome viewer.
window_idorwindow_title: Target window identifierfps(optional): Frame rate 0.5–5 (default: 2)quality(optional):low|medium|high(default:medium)open_browser(optional): Auto-open Chrome viewer (default:true)
desktoplens_stop
Stop streaming session(s).
session_id(optional): Specific session to stop. Omit to stop all.
desktoplens_status
Get server status, platform info, and active streaming sessions.
desktoplens_compare
Compare two screenshots to detect UI changes.
before_snapshot_idorbefore_window_id/before_window_title: Before stateafter_window_idorafter_window_title: After statehighlight_diff(optional): Generate visual diff imagethreshold(optional): Pixel difference sensitivity
Plugin Tools
desktoplens_plugin_search
Search GitHub for DesktopLens plugins.
query: Search keywords
desktoplens_plugin_install
Install a plugin from a local directory path.
source: Path to plugin directory
desktoplens_plugin_list
List all installed plugins with their status and tools.
desktoplens_plugin_remove
Remove an installed plugin.
plugin_name: Name of the plugin to remove
Workflow Patterns
Quick Screenshot
- Call
desktoplens_list_windowsto find the target window - Call
desktoplens_screenshotwith the window ID or title - Analyze the returned screenshot image
Before/After Comparison
- Call
desktoplens_screenshotto capture the "before" state (note thesnapshot_id) - Ask the user to make their changes
- Call
desktoplens_comparewith the before snapshot ID and current window
Real-time Monitoring
- Call
desktoplens_watchto start streaming - The Chrome viewer opens automatically showing the live feed
- Call
desktoplens_statusto check active sessions - Call
desktoplens_stopwhen monitoring is complete
Best Practices
- Always call
desktoplens_list_windowsfirst to verify the target window exists and get its exact ID - Use
window_titlefor convenience (supports fuzzy matching) orwindow_idfor precision - For UI analysis, use
annotate: trueto add a grid overlay that helps reference specific regions - Set appropriate quality levels:
lowfor quick checks,highfor detailed analysis - Stop streaming sessions when no longer needed to free resources
- Screenshots are automatically compressed for Claude Vision token efficiency