This repository contains the SGLang-native ModelOpt FP8 transformer override for Wan-AI/Wan2.2-T2V-A14B-Diffusers.
It is intended to be used with SGLang Diffusion through --transformer-path while keeping the base model separate:
sglang generate \
  --model-path Wan-AI/Wan2.2-T2V-A14B-Diffusers \
  --transformer-path lmsys/wan22-t2v-a14b-modelopt-fp8-sglang-transformer \
  --prompt "A cinematic scene with detailed lighting" \
  --save-output
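For scripted runs, the same invocation can be assembled in Python. This is a minimal sketch that only builds the argument list from the flags shown above. The helper name `build_generate_command` is hypothetical, and launching the command assumes the `sglang` CLI is installed and on `PATH`.

```python
import shlex

BASE_MODEL = "Wan-AI/Wan2.2-T2V-A14B-Diffusers"
TRANSFORMER_OVERRIDE = "lmsys/wan22-t2v-a14b-modelopt-fp8-sglang-transformer"


def build_generate_command(prompt: str) -> list[str]:
    """Return the argv for the `sglang generate` call shown above.

    Hypothetical helper: it only mirrors the flags used in this README.
    """
    return [
        "sglang", "generate",
        "--model-path", BASE_MODEL,                  # base model stays separate
        "--transformer-path", TRANSFORMER_OVERRIDE,  # FP8 transformer override
        "--prompt", prompt,
        "--save-output",
    ]


cmd = build_generate_command("A cinematic scene with detailed lighting")
print(shlex.join(cmd))  # shell-quoted form, handy for logging
# To launch it (assumes sglang is installed):
#   import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string avoids quoting problems when the prompt contains spaces or quotes.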
The repository is intentionally minimal and contains only:
- config.json
- *.safetensors weight shard files
- *.safetensors.index.json when the checkpoint is sharded

Validation images, benchmark outputs, profiler traces, and conversion scratch artifacts are not stored in this model repository.
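A local copy of the override can be sanity-checked against this layout. The sketch below is illustrative only: `unexpected_files`, `find_key`, and `check_override_dir` are hypothetical helpers, and it assumes the checkpoint is a flat directory of files. It also looks for the `quant_method=modelopt` and `quant_algo=FP8` values this card gives for `config.json`. The exact nesting of those keys is not stated here, so the code searches the JSON recursively rather than assuming a path.

```python
import fnmatch
import json
from pathlib import Path

# File name patterns this card lists as the full repository contents.
ALLOWED_PATTERNS = ("config.json", "*.safetensors", "*.safetensors.index.json")


def unexpected_files(names):
    """Return file names that match none of the allowed patterns."""
    return [n for n in names
            if not any(fnmatch.fnmatchcase(n, p) for p in ALLOWED_PATTERNS)]


def find_key(obj, key):
    """Depth-first search for the first value stored under `key`."""
    if isinstance(obj, dict):
        if key in obj:
            return obj[key]
        children = obj.values()
    elif isinstance(obj, list):
        children = obj
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def check_override_dir(path):
    """Return a list of human-readable problems for a local checkpoint copy."""
    root = Path(path)
    names = sorted(p.name for p in root.iterdir() if p.is_file())
    problems = [f"unexpected file: {n}" for n in unexpected_files(names)]
    if "config.json" not in names:
        return problems + ["missing config.json"]
    config = json.loads((root / "config.json").read_text())
    # Assumption: the key nesting is unknown, so search the whole document.
    for key, expected in (("quant_method", "modelopt"), ("quant_algo", "FP8")):
        actual = find_key(config, key)
        if actual != expected:
            problems.append(f"{key} is {actual!r}, expected {expected!r}")
    return problems
```

An empty list from `check_override_dir("path/to/local/copy")` means the directory matches the description. Hub metadata files such as a README would be reported as unexpected, so extend `ALLOWED_PATTERNS` if the local copy includes them.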
config.json with quant_method=modelopt and quant_algo=FP8.

Base model
Wan-AI/Wan2.2-T2V-A14B-Diffusers