Skip to content
LLM-friendly formats:

Core Concepts

Model Families

Each family is a distinct transformer backbone with its own characteristics.

FamilyModalityDefault BackendStatus
fluximagenunchakuactive
zimageimagenunchakuactive
wanvideotorchcoming soon

Backends

Three inference backends, each with different tradeoffs:

BackendStackNotes
nunchakuNVFP4 on BlackwellFastest, 4-bit quantized
torchdiffusers + CUDAFlexible, full precision
tensorrtTRT-LLM + ModelOptNVIDIA-optimized, production

Formats

Image Formats

FormatDimensionsAspect
10241024×10241:1
512512×5121:1
portrait768×10243:4
landscape1024×7684:3

Video Formats

FormatDimensionsAspect
720p1280×72016:9
480p832×480~16:9
square640×6401:1

Tasks

TaskRequiresProducesDescription
t2vpromptvideotext to video
i2vprompt + imagevideoanimate a still
t2ipromptimagetext to image
i2iprompt + imageimagetransform / style transfer
editprompt + image + maskimageinpaint / outpaint