Diffusion ModelOpt
Collection
SGLang-native diffusion transformer overrides converted with NVIDIA ModelOpt. • 8 items • Updated
This repository contains the SGLang-native ModelOpt FP8 transformer override for hunyuanvideo-community/HunyuanVideo.
It is intended to be used with SGLang Diffusion through --transformer-path while keeping the base model separate:
sglang generate \
--model-path hunyuanvideo-community/HunyuanVideo \
--transformer-path lmsys/hunyuanvideo-modelopt-fp8-sglang-transformer \
--prompt "A cinematic scene with detailed lighting" \
--save-output
The repository is intentionally minimal and contains only:
config.json*.safetensors weight shard files*.safetensors.index.json when the checkpoint is shardedValidation images, benchmark outputs, profiler traces, and conversion scratch artifacts are not stored in this model repository.
config.json with quant_method=modelopt and quant_algo=FP8.Base model
tencent/HunyuanVideo