swift-ms-docs

star 6

USE THIS SKILL WHEN working with ms-swift (ModelScope Swift): fine-tuning LLMs/VLMs, LoRA/QLoRA/DoRA training, GRPO reinforcement learning, RLHF (DPO/KTO/SimPO), pre-training, Megatron distributed training, inference/deployment with vLLM/SGLang/LmDeploy, quantization (AWQ/GPTQ/BNB/FP8), evaluation, custom datasets/models, or Web-UI. Triggers on: ms-swift, swift train, swift infer, swift deploy, swift export, LoRA fine-tuning ModelScope, GRPO training.

wenerme By wenerme schedule Updated 6/8/2026

name: swift-ms-docs description: "USE THIS SKILL WHEN working with ms-swift (ModelScope Swift): fine-tuning LLMs/VLMs, LoRA/QLoRA/DoRA training, GRPO reinforcement learning, RLHF (DPO/KTO/SimPO), pre-training, Megatron distributed training, inference/deployment with vLLM/SGLang/LmDeploy, quantization (AWQ/GPTQ/BNB/FP8), evaluation, custom datasets/models, or Web-UI. Triggers on: ms-swift, swift train, swift infer, swift deploy, swift export, LoRA fine-tuning ModelScope, GRPO training."

ms-swift Documentation

ms-swift is ModelScope's large model fine-tuning and deployment framework supporting 600+ LLMs and 400+ VLMs across the full training-to-deployment pipeline.

CRITICAL: grep references/ for detailed docs before answering.

Quick Start

pip install ms-swift

# Fine-tune with LoRA
swift sft \
  --model Qwen/Qwen2.5-7B-Instruct \
  --train_type lora \
  --dataset alpaca-zh#5000 \
  --num_train_epochs 1

# Inference
swift infer --model Qwen/Qwen2.5-7B-Instruct --stream true

# Deploy OpenAI-compatible API
swift deploy --model Qwen/Qwen2.5-7B-Instruct

# GRPO reinforcement learning
swift rlhf --rlhf_type grpo --model Qwen/Qwen2.5-7B-Instruct --dataset math-shepherd-mix

Key Capabilities

Feature Description
LoRA/QLoRA/DoRA Lightweight fine-tuning (9GB for 7B model)
GRPO family GRPO, DAPO, GSPO, SAPO, CISPO, RLOO, Reinforce++
RLHF DPO, KTO, RM, CPO, SimPO, ORPO
Megatron TP/PP/CP/EP parallelism for MoE models
Inference vLLM, SGLang, LmDeploy acceleration
Quantization AWQ, GPTQ, FP8, BNB export
Evaluation EvalScope backend, 100+ datasets
Multimodal VLM training with image/video/audio

Key Topics

Getting Started

  • references/GetStarted/Quick-start.md — Quick start
  • references/GetStarted/SWIFT-installation.md — Installation
  • references/GetStarted/Web-UI.md — Web UI guide

Training

  • references/Instruction/Pre-training-and-Fine-tuning.md — SFT/CPT guide
  • references/Instruction/Use-tuners.md — LoRA, QLoRA, DoRA, Adapter, etc.
  • references/Instruction/RLHF.md — DPO, KTO, SimPO, ORPO
  • references/Instruction/Reinforced-Fine-tuning.md — GRPO overview
  • references/Instruction/GRPO/GetStarted/GRPO.md — GRPO quick start
  • references/Instruction/GRPO/DeveloperGuide/ — Reward functions, multi-turn, gym env
  • references/Instruction/GRPO/AdvancedResearch/ — DAPO, GSPO, SAPO, CISPO, RLOO

Inference & Deployment

  • references/Instruction/Inference-and-deployment.md — Inference + deploy
  • references/Instruction/Export-and-push.md — Quantization export, model push

Megatron Distributed

  • references/Megatron-SWIFT/Quick-start.md — Megatron quick start
  • references/Megatron-SWIFT/Command-line-parameters.md — All parameters
  • references/Megatron-SWIFT/GRPO.md — Megatron GRPO
  • references/Megatron-SWIFT/LoRA-Training.md — Megatron LoRA

Customization

  • references/Customization/Custom-dataset.md — Custom datasets
  • references/Customization/Custom-model.md — Custom models
  • references/Customization/Architecture.md — Architecture overview

Best Practices

  • references/BestPractices/ — Qwen3, GRPO, Embedding, Reranker, NPU, VLM
  • references/Instruction/Supported-models-and-datasets.md — 600+ models list
  • references/Instruction/Command-line-parameters.md — All CLI parameters
  • references/Instruction/Frequently-asked-questions.md — FAQ

References

  • references/ — 59 English doc files covering full training-to-deployment pipeline
  • references/GetStarted/ — Installation and quick start
  • references/Instruction/ — Training, inference, GRPO, RLHF
  • references/Megatron-SWIFT/ — Megatron distributed training
  • references/BestPractices/ — Model-specific best practices
  • references/Customization/ — Custom models and datasets
Install via CLI
npx skills add https://github.com/wenerme/ai --skill swift-ms-docs
Repository Details
star Stars 6
call_split Forks 1
navigation Branch main
article Path SKILL.md
More from Creator