name: zai-mcp-server description: "Use Zhipu CodePlan MCP vision server for screenshot/UI understanding: OCR, error screenshot diagnosis, and UI→artifact conversion. Use when you need to analyze images, extract text, or turn UI into code/spec. Triggers: OCR, screenshot, 识图, 提取文字, diagnose error screenshot, ui_to_artifact."
Prerequisites (mcporter)
Add this server to your mcporter.json (merge under mcpServers):
"zai-mcp-server": {
"command": "npx",
"args": ["-y", "@z_ai/mcp-server"],
"env": {
"Z_AI_API_KEY": "YOUR_ZHIPU_API_KEY",
"Z_AI_MODE": "ZHIPU"
}
}
Based on the Zhipu CodePlan MCP server configuration, this skill provides functionality for visual understanding tasks. This server offers several specialized sub-tools:
ui_to_artifact: Converts UI screenshots into code, prompts, specifications, or descriptions.extract_text_from_screenshot: Performs OCR to extract text content from screenshots (supports local file paths or remote URLs).diagnose_error_screenshot: Analyzes error screenshots to provide diagnostic information.understand_technical_diagram: Interprets technical diagrams such as architecture diagrams, flowcharts, etc.analyze_data_visualization: Analyzes data charts and visualizations.
Common Parameters (for most sub-tools):
image_source(string, required): The local file path or remote URL of the image.prompt(string, required): Describes what you want to extract or analyze from the image.
Example (using extract_text_from_screenshot):
mcporter call zai-mcp-server.extract_text_from_screenshot image_source="https://avatars.githubusercontent.com/u/252820863" prompt="图中有什么文字或内容?"