diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Files

dg845 c10bdd9b73 Add LTX 2.0 Video Pipelines (#12915 )

* Initial LTX 2.0 transformer implementation

* Add tests for LTX 2 transformer model

* Get LTX 2 transformer tests working

* Rename LTX 2 compile test class to have LTX2

* Remove RoPE debug print statements

* Get LTX 2 transformer compile tests passing

* Fix LTX 2 transformer shape errors

* Initial script to convert LTX 2 transformer to diffusers

* Add more LTX 2 transformer audio arguments

* Allow LTX 2 transformer to be loaded from local path for conversion

* Improve dummy inputs and add test for LTX 2 transformer consistency

* Fix LTX 2 transformer bugs so consistency test passes

* Initial implementation of LTX 2.0 video VAE

* Explicitly specify temporal and spatial VAE scale factors when converting

* Add initial LTX 2.0 video VAE tests

* Add initial LTX 2.0 video VAE tests (part 2)

* Get diffusers implementation on par with official LTX 2.0 video VAE implementation

* Initial LTX 2.0 vocoder implementation

* Use RMSNorm implementation closer to original for LTX 2.0 video VAE

* start audio decoder.

* init registration.

* up

* simplify and clean up

* up

* Initial LTX 2.0 text encoder implementation

* Rough initial LTX 2.0 pipeline implementation

* up

* up

* up

* up

* Add imports for LTX 2.0 Audio VAE

* Conversion script for LTX 2.0 Audio VAE Decoder

* Add Audio VAE logic to T2V pipeline

* Duplicate scheduler for audio latents

* Support num_videos_per_prompt for prompt embeddings

* LTX 2.0 scheduler and full pipeline conversion

* Add script to test full LTX2Pipeline T2V inference

* Fix pipeline return bugs

* Add LTX 2 text encoder and vocoder to ltx2 subdirectory __init__

* Fix more bugs in LTX2Pipeline.__call__

* Improve CPU offload support

* Fix pipeline audio VAE decoding dtype bug

* Fix video shape error in full pipeline test script

* Get LTX 2 T2V pipeline to produce reasonable outputs

* Make LTX 2.0 scheduler more consistent with original code

* Fix typo when applying scheduler fix in T2V inference script

* Refactor Audio VAE to be simpler and remove helpers (#7)

* remove resolve causality axes stuff.

* remove a bunch of helpers.

* remove adjust output shape helper.

* remove the use of audiolatentshape.

* move normalization and patchify out of pipeline.

* fix

* up

* up

* Remove unpatchify and patchify ops before audio latents denormalization (#9)

---------

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

* Add support for I2V (#8)

* start i2v.

* up

* up

* up

* up

* up

* remove uniform strategy code.

* remove unneeded code.

* Denormalize audio latents in I2V pipeline (analogous to T2V change) (#11)

* test i2v.

* Move Video and Audio Text Encoder Connectors to Transformer (#12)

* Denormalize audio latents in I2V pipeline (analogous to T2V change)

* Initial refactor to put video and audio text encoder connectors in transformer

* Get LTX 2 transformer tests working after connector refactor

* precompute run_connectors,.

* fixes

* Address review comments

* Calculate RoPE double precisions freqs using torch instead of np

* Further simplify LTX 2 RoPE freq calc

* Make connectors a separate module (#18)

* remove text_encoder.py

* address yiyi's comments.

* up

* up

* up

* up

---------

Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

* up (#19)

* address initial feedback from lightricks team (#16)

* cross_attn_timestep_scale_multiplier to 1000

* implement split rope type.

* up

* propagate rope_type to rope embed classes as well.

* up

* When using split RoPE, make sure that the output dtype is same as input dtype

* Fix apply split RoPE shape error when reshaping x to 4D

* Add export_utils file for exporting LTX 2.0 videos with audio

* Tests for T2V and I2V (#6)

* add ltx2 pipeline tests.

* up

* up

* up

* up

* remove content

* style

* Denormalize audio latents in I2V pipeline (analogous to T2V change)

* Initial refactor to put video and audio text encoder connectors in transformer

* Get LTX 2 transformer tests working after connector refactor

* up

* up

* i2v tests.

* up

* Address review comments

* Calculate RoPE double precisions freqs using torch instead of np

* Further simplify LTX 2 RoPE freq calc

* revert unneded changes.

* up

* up

* update to split style rope.

* up

---------

Co-authored-by: Daniel Gu <dgu8957@gmail.com>

* up

* use export util funcs.

* Point original checkpoint to LTX 2.0 official checkpoint

* Allow the I2V pipeline to accept image URLs

* make style and make quality

* remove function map.

* remove args.

* update docs.

* update doc entries.

* disable ltx2_consistency test

* Simplify LTX 2 RoPE forward by removing coords is None logic

* make style and make quality

* Support LTX 2.0 audio VAE encoder

* Apply suggestions from code review

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* Remove print statement in audio VAE

* up

* Fix bug when calculating audio RoPE coords

* Ltx 2 latent upsample pipeline (#12922)

* Initial implementation of LTX 2.0 latent upsampling pipeline

* Add new LTX 2.0 spatial latent upsampler logic

* Add test script for LTX 2.0 latent upsampling

* Add option to enable VAE tiling in upsampling test script

* Get latent upsampler working with video latents

* Fix typo in BlurDownsample

* Add latent upsample pipeline docstring and example

* Remove deprecated pipeline VAE slicing/tiling methods

* make style and make quality

* When returning latents, return unpacked and denormalized latents for T2V and I2V

* Add model_cpu_offload_seq for latent upsampling pipeline

---------

Co-authored-by: Daniel Gu <dgu8957@gmail.com>

* Fix latent upsampler filename in LTX 2 conversion script

* Add latent upsample pipeline to LTX 2 docs

* Add dummy objects for LTX 2 latent upsample pipeline

* Set default FPS to official LTX 2 ckpt default of 24.0

* Set default CFG scale to official LTX 2 ckpt default of 4.0

* Update LTX 2 pipeline example docstrings

* make style and make quality

* Remove LTX 2 test scripts

* Fix LTX 2 upsample pipeline example docstring

* Add logic to convert and save a LTX 2 upsampling pipeline

* Document LTX2VideoTransformer3DModel forward pass

---------

Co-authored-by: sayakpaul <spsayakpaul@gmail.com>

2026-01-07 21:24:27 -08:00

allegro

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

animatediff

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

audioldm2

[CI] Fix failing Pipeline CPU tests (#12681 )

2025-11-19 21:19:24 +05:30

aura_flow

Update Ruff to latest Version (#10919 )

2025-04-09 16:51:34 +05:30

bria

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

bria_fibo

Bria fibo (#12545 )

2025-10-28 16:27:48 +05:30

chroma

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

chronoedit

add ChronoEdit (#12593 )

2025-11-09 22:07:00 -08:00

cogvideo

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

cogview3

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

cogview4

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

consisid

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

consistency_models

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

controlnet

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

controlnet_flux

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

controlnet_hunyuandit

fix 3 xpu failures uts w/ latest pytorch (#12408 )

2025-09-30 14:07:48 +05:30

controlnet_sd3

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

cosmos

Cosmos Predict2.5 Base: inference pipeline, scheduler & chkpt conversion (#12852 )

2025-12-19 05:38:18 +05:30

ddim

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

ddpm

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

deepfloyd_if

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

dit

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

easyanimate

[tests] disable xformer tests for pipelines it isn't popular. (#12277 )

2025-09-24 09:02:25 +05:30

flux

[Feat] TaylorSeer Cache (#12648 )

2025-12-06 05:39:54 +05:30

flux2

let's go Flux2 🚀 (#12711 )

2025-11-25 21:49:04 +05:30

hidream_image

[tests] disable xformer tests for pipelines it isn't popular. (#12277 )

2025-09-24 09:02:25 +05:30

hunyuan_image_21

HunyuanImage21 (#12333 )

2025-10-23 22:31:12 -10:00

hunyuan_video

[Feat] TaylorSeer Cache (#12648 )

2025-12-06 05:39:54 +05:30

hunyuan_video1_5

Hunyuanvideo15 (#12696 )

2025-11-30 20:27:59 -10:00

hunyuandit

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

ip_adapters

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

kandinsky

[CI] disable installing transformers from main in ci for now. (#12397 )

2025-09-26 18:41:17 +05:30

kandinsky2_2

[CI] Fix failing Pipeline CPU tests (#12681 )

2025-11-19 21:19:24 +05:30

kandinsky3

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

kandinsky5

Kandinsky 5.0 Video Pro and Image Lite (#12664 )

2025-12-03 00:46:37 -10:00

kolors

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

latent_consistency_models

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

latent_diffusion

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

latte

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

ledits_pp

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

longcat_image

Add support for LongCat-Image (#12828 )

2025-12-15 07:45:17 -10:00

ltx

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

ltx2

Add LTX 2.0 Video Pipelines (#12915 )

2026-01-07 21:24:27 -08:00

lumina

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

lumina2

post release 0.33.0 (#11255 )

2025-04-15 06:50:08 -10:00

marigold

fix marigold ut case fail on xpu (#12350 )

2025-09-24 09:32:06 +05:30

mochi

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

omnigen

[tests] disable xformer tests for pipelines it isn't popular. (#12277 )

2025-09-24 09:02:25 +05:30

ovis_image

Add support for Ovis-Image (#12740 )

2025-12-02 11:48:07 -10:00

pag

docs: cleanup of runway model (#12503 )

2025-10-17 14:10:50 -07:00

pixart_alpha

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

pixart_sigma

fix pytest tests/pipelines/pixart_sigma/test_pixart.py::PixArtSigmaPi… (#12842 )

2025-12-15 14:36:01 +05:30

pndm

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

prx

Prx (#12525 )

2025-10-21 17:09:22 -07:00

qwenimage

[tests] disable xformer tests for pipelines it isn't popular. (#12277 )

2025-09-24 09:02:25 +05:30

sana

SANA-Video Image to Video pipeline SanaImageToVideoPipeline support (#12634 )

2025-11-17 00:23:34 -08:00

sana_video

SANA-Video Image to Video pipeline SanaImageToVideoPipeline support (#12634 )

2025-11-17 00:23:34 -08:00

shap_e

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

skyreels_v2

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_audio

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_cascade

[CI] xfail the test_wuerstchen_prior test (#12530 )

2025-10-22 08:45:47 -10:00

stable_diffusion

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_diffusion_2

[CI] Fix failing Pipeline CPU tests (#12681 )

2025-11-19 21:19:24 +05:30

stable_diffusion_3

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_diffusion_adapter

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_diffusion_image_variation

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_diffusion_xl

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_unclip

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

stable_video_diffusion

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

visualcloze

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

wan

[WIP]Add Wan2.2 Animate Pipeline (Continuation of #12442 by tolgacangoz) (#12526 )

2025-11-12 16:52:31 -10:00

z_image

Add ZImageImg2ImgPipeline (#12751 )

2025-12-07 22:06:23 -10:00

__init__.py

Reorganize pipeline tests (#963 )

2022-10-24 16:34:01 +02:00

pipeline_params.py

[Modular] Fast Tests (#11937 )

2025-08-08 19:42:13 +05:30

test_pipeline_utils.py

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

test_pipelines_auto.py

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

test_pipelines_combined.py

[chore] change to 2025 licensing for remaining (#11741 )

2025-06-18 20:56:00 +05:30

test_pipelines_common.py

[Feat] TaylorSeer Cache (#12648 )

2025-12-06 05:39:54 +05:30

test_pipelines_onnx_common.py

[Refactor] Move testing utils out of src (#12238 )

2025-08-28 19:53:02 +05:30

test_pipelines.py

[ci] xfail more incorrect transformer imports. (#12455 )

2025-10-17 10:35:19 +05:30