name: model-architecture-diagram description: Return public original model architecture diagrams for user-specified LLM, VLM, MoE, diffusion, OCR, and SGLang/sgl-cookbook model families. Use when the user asks for a model structure chart, architecture diagram, or rendered image link for a specific model such as DeepSeek, GLM, Qwen, Kimi, MiniMax, Step, Hunyuan, or Qwen3-VL.
Model Architecture Diagram
Workflow
Return only public original diagrams indexed by this skill.
- Run the bundled resolver:
python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "<model name>"
- If the resolver returns
kind: existing, return the raw image Markdown it prints and preserve the source attribution line. - If the resolver returns
kind: no_match, tell the user that no public original architecture diagram is indexed for that model.
Source Priority
Use references/diagram-index.json as the source of truth. It stores raw GitHub image URLs from:
datawhalechina/self-llmCalvinXKY/InfraTechTongyi-MAI/Z-ImageWan-Video/Wan2.1Wan-Video/Wan2.2Tencent-Hunyuan/HunyuanVideoTencent-Hunyuan/Hunyuan3D-2brayevalerien/Flux.1-Architecture-Diagram
Prefer detailed implementation, cookbook, or architecture-card diagrams over paper figures. Good sources show module boundaries, dataflow, MoE / attention / cache paths, or model-specific runtime structure rather than only a high-level paper overview. Official repository diagrams and curated implementation diagrams are first choice; paper figures are fallback only when no more detailed public original diagram is indexed.
Do not copy remote image binaries into the skill. Return the raw GitHub URLs so the chat renderer can display the original image.
Existing Diagram Rule
For a direct match, show the original image. Good direct matches include:
- DeepSeek V3/V3.2/V4, GLM-5, Kimi K2/K2.5, MiniMax M2.5, Qwen3.5, Qwen3-VL, and Step 3.5 Flash from InfraTech.
- Hunyuan-A13B, Kimi-VL, Qwen3, Qwen3-VL detail flows, MiniMax M2, and Llama 4 architecture/module diagrams from self-llm.
- Z-Image, Wan2.1, Wan2.2, HunyuanVideo, Hunyuan3D 2.0, and FLUX.1 diffusion architecture/module diagrams from public GitHub sources.
If multiple diagrams match, show all high-confidence matches up to the resolver's default limit. For example, DeepSeek V3 may return the full architecture plus MLA MHA/MQA diagrams.
Hosted Original Diagram Gallery
Do not commit the sgl-cookbook-model-architecture-images/ gallery into the repository. The public-original image set is hosted as a GitHub Release asset and indexed by a GitHub issue.
Current hosted artifact:
- Issue index: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/issues/31
- Release page: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/tag/sgl-cookbook-architecture-images-2026-05-02
- Zip download: https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/download/sgl-cookbook-architecture-images-2026-05-02/sgl-cookbook-model-architecture-images-2026-05-02.zip
- Digest:
sha256:ea432081849a250429d3d1ecf246e267c5cc42f989aaf4b9ca695b581e7fa50f
The artifact contains 44 public original diagram image files from the indexed upstream repositories, plus a lightweight index.html, index.md, manifest.json, HTML contact sheet, and architecture-audit.md.
To inspect the gallery locally:
curl -L -o /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip \
https://github.com/BBuf/AI-Infra-Auto-Driven-SKILLS/releases/download/sgl-cookbook-architecture-images-2026-05-02/sgl-cookbook-model-architecture-images-2026-05-02.zip
unzip -q /tmp/sgl-cookbook-model-architecture-images-2026-05-02.zip -d /tmp
open /tmp/sgl-cookbook-model-architecture-images/index.html
Useful Commands
List known original diagram aliases:
python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py --list-known
Emit JSON for automation:
python3 skills/model-architecture-diagram/scripts/model_architecture_diagram.py "GLM-5" --format json
References
references/diagram-index.json: original diagram link index and aliases.references/source-notes.md: audited source repositories and local cache paths.