Skip to content

Quickstart Guide

Note: For more advanced configurations, see the tutorial and options reference.

Feature Compatibility

For the complete and most accurate feature matrix, refer to the main README.

Model Quickstart Guides

Model Params PEFT LoRA Full-Rank Quantization Mixed Precision Grad Checkpoint Flow Shift TwinFlow LayerSync ControlNet Sliders† Guide
PixArt Sigma 0.6B–0.9B int8 optional bf16 SIGMA.md
NVLabs Sana 1.6B–4.8B int8 optional bf16 ✓+ SANA.md
Kwai Kolors 2.7B not recommended bf16 KOLORS.md
Stable Diffusion 3 2B–8B int8/fp8/nf4 optional bf16 ✓+ ✓ (SLG) SD3.md
Flux.1 8B–12B ✓* int8/fp8/nf4 optional bf16 ✓+ FLUX.md
Flux.2 32B ✓* int8/fp8/nf4 optional bf16 ✓+ FLUX2.md
Flux Kontext 8B–12B ✓* int8/fp8/nf4 optional bf16 ✓+ FLUX_KONTEXT.md
Z-Image Turbo 6B ✓* int8 optional bf16 ZIMAGE.md
ACE-Step 3.5B ✓* int8 optional bf16 ACE_STEP.md
Chroma 1 8.9B ✓* int8/fp8/nf4 optional bf16 ✓+ CHROMA.md
Auraflow 6B ✓* int8/fp8/nf4 optional bf16 ✓+ ✓ (SLG) AURAFLOW.md
HiDream I1 17B (8.5B MoE) ✓* int8/fp8/nf4 optional bf16 HIDREAM.md
OmniGen 3.8B int8/fp8 optional bf16 OMNIGEN.md
Stable Diffusion XL 2.6B not recommended bf16 SDXL.md
Lumina2 2B int8 optional bf16 LUMINA2.md
Cosmos2 2B not recommended bf16 COSMOS2IMAGE.md
LTX Video ~2.5B int8/fp8 optional bf16 LTXVIDEO.md
LTX Video 2 19B ✓* int8/fp8 optional bf16 LTXVIDEO2.md
Hunyuan Video 1.5 8.3B ✓* int8 optional bf16 HUNYUANVIDEO.md
Wan 2.x 1.3B–14B ✓* int8 optional bf16 WAN.md
Wan 2.2 S2V 14B ✓* int8 optional bf16 WAN_S2V.md
Qwen Image 20B ✓* required (int8/nf4) bf16 QWEN_IMAGE.md
Qwen Image Edit 20B ✓* required (int8/nf4) bf16 QWEN_EDIT.md
Stable Cascade (C) 1B, 3.6B prior ✓* not supported fp32 (required) STABLE_CASCADE_C.md
Kandinsky 5.0 Image 6B (lite) ✓* int8 optional bf16 KANDINSKY5_IMAGE.md
Kandinsky 5.0 Video 2B (lite), 19B (pro) ✓* int8 optional bf16 KANDINSKY5_VIDEO.md
LongCat-Video 13.6B ✓* int8/fp8 optional bf16 ✓+ LONGCAT_VIDEO.md
LongCat-Video Edit 13.6B ✓* int8/fp8 optional bf16 ✓+ LONGCAT_VIDEO_EDIT.md
LongCat-Image 6B ✓* int8/fp8 optional bf16 LONGCAT_IMAGE.md
LongCat-Image Edit 6B ✓* int8/fp8 optional bf16 LONGCAT_EDIT.md

✓ = supported, ✓ = requires DeepSpeed/FSDP2 for full-rank, ✗ = not supported, ✓+ indicates checkpointing is recommended due to VRAM pressure. TwinFlow ✓ means native support when twinflow_enabled=true (diffusion models need diff2flow_enabled+twinflow_allow_diff2flow). LayerSync ✓ means the backbone exposes transformer hidden states for self-alignment; ✗ marks UNet-style backbones without that buffer. †Sliders apply to LoRA and LyCORIS (including full-rank LyCORIS "full"). All models support LyCORIS.*

ℹ️ Wan quickstart includes 2.1 + 2.2 stage presets and the time-embedding toggle. Flux Kontext covers editing workflows built atop Flux.1.

⚠️ These quickstarts are living documents. Expect occasional updates as new models land or training recipes improve.

Fast paths: Z-Image Turbo & Flux Schnell

  • Z-Image Turbo: Fully supported LoRA with TREAD; runs fast on NVIDIA and macOS even without quant (int8 works too). Often the bottleneck is just trainer setup.
  • Flux Schnell: The quickstart config handles the fast noise schedule and assistant LoRA stack automatically; no extra flags needed to train Schnell LoRAs.

Advanced Experimental Features

  • Diff2Flow: Allows training standard epsilon/v-prediction models (SD1.5, SDXL, DeepFloyd, etc.) using a Flow Matching loss objective. This bridges the gap between older architectures and modern flow-based training.
  • Scheduled Sampling: Reduces exposure bias by letting the model generate its own intermediate noisy latents during training ("rollout"). This helps the model learn to recover from its own generation errors.

Common Issues

Dataset has fewer samples than expected

If your dataset ends up with fewer usable samples than you expected, files may have been filtered during processing. Common reasons include:

  • Files too small: Images below minimum_image_size are filtered out
  • Aspect ratio out of range: Images outside minimum_aspect_ratio/maximum_aspect_ratio bounds are excluded
  • Duration limits: Audio/video files exceeding duration limits are skipped

Viewing filtering statistics: - In the WebUI, browse to your dataset directory and select it to see filtering statistics - Check the logs during dataset processing for statistics like: Sample processing statistics: {'total_processed': 100, 'skipped': {'too_small': 15, ...}}

For detailed troubleshooting, see Troubleshooting filtered datasets in the dataloader documentation.