flux1-dev-modelopt-fp8-sglang-transformer

This repository contains the SGLang-native ModelOpt FP8 transformer override for black-forest-labs/FLUX.1-dev.

It is intended to be used with SGLang Diffusion through --transformer-path while keeping the base model separate:

sglang generate \
  --model-path black-forest-labs/FLUX.1-dev \
  --transformer-path lmsys/flux1-dev-modelopt-fp8-sglang-transformer \
  --prompt "A cinematic scene with detailed lighting" \
  --save-output

Contents

The repository is intentionally minimal and contains only:

  • config.json
  • *.safetensors weight shard files
  • *.safetensors.index.json when the checkpoint is sharded

Validation images, benchmark outputs, profiler traces, and conversion scratch artifacts are not stored in this model repository.

Notes

  • Quantization config is stored in config.json with quant_method=modelopt and quant_algo=FP8.
  • Use this checkpoint with an SGLang version that includes diffusion ModelOpt support for the corresponding model family.
  • The original base model license and usage terms still apply.
Downloads last month
-
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for lmsys/flux1-dev-modelopt-fp8-sglang-transformer

Quantized
(67)
this model

Collection including lmsys/flux1-dev-modelopt-fp8-sglang-transformer