flux1-dev-modelopt-fp8-sglang-transformer

This repository contains the SGLang-native ModelOpt FP8 transformer override for black-forest-labs/FLUX.1-dev.

It is intended to be used with SGLang Diffusion through --transformer-path while keeping the base model separate:

sglang generate \
  --model-path black-forest-labs/FLUX.1-dev \
  --transformer-path lmsys/flux1-dev-modelopt-fp8-sglang-transformer \
  --prompt "A cinematic scene with detailed lighting" \
  --save-output

The repository is intentionally minimal and contains only:

config.json
*.safetensors weight shard files
*.safetensors.index.json when the checkpoint is sharded

Validation images, benchmark outputs, profiler traces, and conversion scratch artifacts are not stored in this model repository.
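
The expected file layout can be checked mechanically. The sketch below is illustrative only: it takes the three patterns from the list above and reports any file name that matches none of them. The helper name unexpected_files and the sample listing (including the shard file names) are hypothetical and not part of this repository or of SGLang. Repository housekeeping files such as a README are not in the list above, so this sketch would flag them too.

```python
from fnmatch import fnmatchcase

# Patterns copied from the file list above; anything else counts as unexpected.
EXPECTED_PATTERNS = ("config.json", "*.safetensors", "*.safetensors.index.json")


def unexpected_files(filenames):
    """Return the names that match none of the expected patterns."""
    return [
        name
        for name in filenames
        if not any(fnmatchcase(name, pattern) for pattern in EXPECTED_PATTERNS)
    ]


# Hypothetical listing, for illustration only; real shard names may differ.
listing = [
    "config.json",
    "model-00001-of-00002.safetensors",
    "model.safetensors.index.json",
    "benchmark.png",
]
print(unexpected_files(listing))
```

With this sample listing, only benchmark.png is reported, since it is the one name outside the three patterns.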

Notes

The quantization config is stored in config.json, with quant_method=modelopt and quant_algo=FP8.
Use this checkpoint with an SGLang version that includes diffusion ModelOpt support for the corresponding model family.
The original base model license and usage terms still apply.
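
As a rough sanity check on the first note, the following sketch looks for quant_method and quant_algo in a parsed config and compares them with the values stated above. The exact nesting of these keys inside config.json is not specified here, so the code searches recursively rather than assuming a path. The helper names find_key and is_modelopt_fp8 and the sample fragment (including the quantization_config wrapper key) are assumptions made for illustration.

```python
import json


def find_key(node, key):
    """Depth-first search for the first value stored under `key` in parsed JSON."""
    if isinstance(node, dict):
        if key in node:
            return node[key]
        children = node.values()
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = find_key(child, key)
        if found is not None:
            return found
    return None


def is_modelopt_fp8(config):
    """True when the config declares quant_method=modelopt and quant_algo=FP8."""
    return (
        find_key(config, "quant_method") == "modelopt"
        and find_key(config, "quant_algo") == "FP8"
    )


# Hypothetical config fragment; the real nesting in config.json may differ.
sample = json.loads(
    '{"quantization_config": {"quant_method": "modelopt", "quant_algo": "FP8"}}'
)
print(is_modelopt_fp8(sample))
```

To run the same check on a downloaded copy, load the local config.json with json.load and pass the result to is_modelopt_fp8.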
