mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00
Commit Graph

6111 Commits

Author SHA1 Message Date
Daniel Gu
90edc6abc9 Fix more bugs in LTX2Pipeline.__call__ 2025-12-23 10:41:27 +01:00
Daniel Gu
a56cf23483 Add LTX 2 text encoder and vocoder to ltx2 subdirectory __init__ 2025-12-23 10:40:56 +01:00
Daniel Gu
fa7d9f77f1 Fix pipeline return bugs 2025-12-23 08:49:11 +01:00
Daniel Gu
3bf736979f Add script to test full LTX2Pipeline T2V inference 2025-12-23 08:43:37 +01:00
Daniel Gu
595f485ad8 LTX 2.0 scheduler and full pipeline conversion 2025-12-23 07:41:28 +01:00
Daniel Gu
cbb10b8dca Support num_videos_per_prompt for prompt embeddings 2025-12-23 07:01:17 +01:00
Daniel Gu
6e6ce20595 Duplicate scheduler for audio latents 2025-12-23 06:40:35 +01:00
Daniel Gu
54bfc5d617 Add Audio VAE logic to T2V pipeline 2025-12-23 03:51:22 +01:00
Daniel Gu
ae3b6e7cc2 Merge branch 'ltx-2-transformer' into ltx-2-t2v-pipeline 2025-12-23 02:59:33 +01:00
Daniel Gu
d303e2a6ff Conversion script for LTX 2.0 Audio VAE Decoder 2025-12-23 02:48:15 +01:00
Daniel Gu
5f7e43d17f Add imports for LTX 2.0 Audio VAE 2025-12-23 02:08:51 +01:00
dg845
7bb4cf76ce Merge pull request #5 from huggingface/audio-decoder
Audio decoder
2025-12-22 17:00:11 -08:00
sayakpaul
409d651bab resolve conflicts. 2025-12-22 15:59:31 +05:30
sayakpaul
8134da6a56 up 2025-12-22 15:55:29 +05:30
Sayak Paul
059999a3f7 up 2025-12-22 10:24:55 +00:00
sayakpaul
58257eb0e0 up 2025-12-22 15:45:56 +05:30
Sayak Paul
5f0f2a03f7 up 2025-12-22 10:06:39 +00:00
Daniel Gu
d0f9cdaab1 Rough initial LTX 2.0 pipeline implementation 2025-12-22 10:07:20 +01:00
Daniel Gu
0028955c37 Initial LTX 2.0 text encoder implementation 2025-12-22 10:06:01 +01:00
sayakpaul
4904fd6fa5 up 2025-12-22 13:46:58 +05:30
sayakpaul
907896d533 simplify and clean up 2025-12-22 13:41:41 +05:30
sayakpaul
e54cd6bb1d up 2025-12-22 13:03:40 +05:30
sayakpaul
f4c2435d61 init registration. 2025-12-22 12:25:36 +05:30
sayakpaul
b34ddb1736 start audio decoder. 2025-12-22 12:23:31 +05:30
Daniel Gu
6c56954fa8 Use RMSNorm implementation closer to original for LTX 2.0 video VAE 2025-12-20 02:40:38 +01:00
dg845
b1cf6ff8a9 Merge pull request #2 from huggingface/ltx-2-video-vae
LTX 2.0 Video VAE Implementation
2025-12-19 16:36:38 -08:00
dg845
8bfeb4af56 Merge pull request #3 from huggingface/ltx-2-vocoder
LTX 2.0 Vocoder Implementation
2025-12-19 16:21:31 -08:00
Daniel Gu
c6a11a5530 Initial LTX 2.0 vocoder implementation 2025-12-19 12:17:10 +01:00
Daniel Gu
a748975a7c Get diffusers implementation on par with official LTX 2.0 video VAE implementation 2025-12-19 07:02:38 +01:00
Daniel Gu
491aae08d8 Add initial LTX 2.0 video VAE tests (part 2) 2025-12-17 11:39:09 +01:00
Daniel Gu
5b950d6fef Add initial LTX 2.0 video VAE tests 2025-12-17 11:30:15 +01:00
Daniel Gu
baf23e2da3 Explicitly specify temporal and spatial VAE scale factors when converting 2025-12-17 11:14:45 +01:00
Daniel Gu
269cf7b40d Initial implementation of LTX 2.0 video VAE 2025-12-17 10:51:34 +01:00
Daniel Gu
bda3ff13db Fix LTX 2 transformer bugs so consistency test passes 2025-12-16 10:53:43 +01:00
Daniel Gu
a7bc052e89 Improve dummy inputs and add test for LTX 2 transformer consistency 2025-12-16 10:44:02 +01:00
Daniel Gu
57a8b9c330 Allow LTX 2 transformer to be loaded from local path for conversion 2025-12-16 10:38:03 +01:00
Daniel Gu
d86f89ddea Add more LTX 2 transformer audio arguments 2025-12-16 07:58:12 +01:00
Daniel Gu
a5f2d2da6c Initial script to convert LTX 2 transformer to diffusers 2025-12-15 07:09:42 +01:00
Daniel Gu
aeecc4d712 Fix LTX 2 transformer shape errors 2025-12-15 06:38:57 +01:00
Daniel Gu
5765759cd3 Get LTX 2 transformer compile tests passing 2025-12-15 03:38:34 +01:00
Daniel Gu
780fb61d32 Remove RoPE debug print statements 2025-12-13 10:37:24 +01:00
Daniel Gu
e100b8f2a3 Rename LTX 2 compile test class to have LTX2 2025-12-13 10:34:11 +01:00
Daniel Gu
980591de53 Get LTX 2 transformer tests working 2025-12-13 04:57:23 +01:00
Daniel Gu
b3096c3c9e Add tests for LTX 2 transformer model 2025-12-13 04:55:41 +01:00
Daniel Gu
aa602ac483 Initial LTX 2.0 transformer implementation 2025-12-12 07:52:33 +01:00
Sayak Paul
8b4722de57 Fix Qwen Edit Plus modular for multi-image input (#12601)
* try to fix qwen edit plus multi images (modular)

* up

* up

* test

* up

* up
2025-12-09 10:08:30 -10:00
YiYi Xu
07ea0786e8 [Modular]z-image (#12808)
* initiL

* up up

* fix: z_image -> z-image

* style

* copy

* fix more

* some docstring fix
2025-12-09 08:08:41 -10:00
David El Malih
54fa0745c3 Improve docstrings and type hints in scheduling_dpmsolver_singlestep.py (#12798)
feat: add flow sigmas, dynamic shifting, and refine type hints in DPMSolverSinglestepScheduler
2025-12-08 08:58:57 -08:00
David Lacalle Castillo
3d02cd543e [PRX] Improve model compilation (#12787)
* Reimplement img2seq & seq2img in PRX to enable ONNX build without Col2Im (incompatible with TensorRT).

* Apply style fixes

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-12-08 17:42:17 +05:30
CalamitousFelicitousness
2246d2c7c4 Add ZImageImg2ImgPipeline (#12751)
* Add ZImageImg2ImgPipeline

Updated the pipeline structure to include ZImageImg2ImgPipeline
    alongside ZImagePipeline.
Implemented the ZImageImg2ImgPipeline class for image-to-image
    transformations, including necessary methods for
    encoding prompts, preparing latents, and denoising.
Enhanced the auto_pipeline to map the new ZImageImg2ImgPipeline
    for image generation tasks.
Added unit tests for ZImageImg2ImgPipeline to ensure
    functionality and performance.
Updated dummy objects to include ZImageImg2ImgPipeline for
    testing purposes.

* Address review comments for ZImageImg2ImgPipeline

- Add `# Copied from` annotations to encode_prompt and _encode_prompt
- Add ZImagePipeline to auto_pipeline.py for AutoPipeline support

* Add ZImage pipeline documentation

---------

Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>
2025-12-07 22:06:23 -10:00