naykun
f9c1e612fb
Qwen Image Layered Support ( #12853 )
...
* [qwen-image] qwen image layered support
* [qwen-image] update doc
* [qwen-image] fix pr comments
* Apply style fixes
* make fix-copies
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-17 16:57:57 +05:30
Wang, Yi
87f7d11143
extend TorchAoTest::test_model_memory_usage to other platform ( #12768 )
...
* extend TorchAoTest::test_model_memory_usage to other platform
Signe-off-by: Wang, Yi <yi.a.wang@inel.com >
* add some comments
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
---------
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
2025-12-17 13:44:08 +05:30
junqiangwu
5e48f466b9
fix the prefix_token_len bug ( #12845 )
2025-12-15 22:02:25 -10:00
junqiangwu
a748a839ad
Add support for LongCat-Image ( #12828 )
...
* Add LongCat-Image
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* fix code
* add doc
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image_edit.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image_edit.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* fix code & mask style & fix-copies
* Apply style fixes
* fix single input rewrite error
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: hadoop-imagen <hadoop-imagen@psxfb7pxrbvmh3oq-worker-0.psxfb7pxrbvmh3oq.hadoop-aipnlp.svc.cluster.local >
2025-12-15 07:45:17 -10:00
DN6
d9b73ffd51
update
2025-12-15 16:12:50 +05:30
DN6
dcd6026d17
update
2025-12-15 16:12:15 +05:30
DN6
eae7543712
update
2025-12-15 16:02:38 +05:30
Yuqian Hong
58519283e7
Support for control-lora ( #10686 )
...
* run control-lora on diffusers
* cannot load lora adapter
* test
* 1
* add control-lora
* 1
* 1
* 1
* fix PeftAdapterMixin
* fix module_to_save bug
* delete json print
* resolve conflits
* merged but bug
* change peft.py
* 1
* delete state_dict print
* fix alpha
* Create control_lora.py
* Add files via upload
* rename
* no need modify as peft updated
* add doc
* fix code style
* styling isn't that hard 😉
* empty
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-15 15:52:42 +05:30
Wang, Yi
0c1ccc0775
fix pytest tests/pipelines/pixart_sigma/test_pixart.py::PixArtSigmaPi… ( #12842 )
...
fix pytest tests/pipelines/pixart_sigma/test_pixart.py::PixArtSigmaPipelineIntegrationTests::test_pixart_512 in xpu
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-15 14:36:01 +05:30
DN6
d08e0bb545
update
2025-12-15 14:19:27 +05:30
naykun
b8a4cbac14
[qwen-image] edit 2511 support ( #12839 )
...
* [qwen-image] edit 2511 support
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-15 12:35:01 +05:30
Wang, Yi
17c0e79dbd
support CP in native flash attention ( #12829 )
...
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-12 13:18:39 +05:30
Sayak Paul
1567243463
[lora] Remove lora docs unneeded and add " # Copied from ..." ( #12824 )
...
* remove unneeded docs on load_lora_weights().
* remove more.
* up[
* up
* up
2025-12-12 08:31:27 +05:30
Sayak Paul
0eac64c7a6
Update distributed_inference.md to correct syntax ( #12827 )
2025-12-11 08:46:43 -08:00
Sayak Paul
10e820a2dd
post release 0.36.0 ( #12804 )
...
* post release 0.36.0
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-11 22:01:59 +05:30
DN6
c366b5a817
update
2025-12-11 13:37:06 +05:30
DN6
0fdd9d3a60
update
2025-12-11 11:41:17 +05:30
DN6
489480b02a
update
2025-12-11 11:27:59 +05:30
DN6
fe451c367b
update
2025-12-11 11:04:47 +05:30
Sayak Paul
6708f5c76d
[docs] improve distributed inference cp docs. ( #12810 )
...
* improve distributed inference cp docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-12-10 08:25:07 -08:00
Dhruv Nair
be3c2a0667
[WIP] Add Flux2 modular ( #12763 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-12-10 12:19:07 +05:30
Sayak Paul
8b4722de57
Fix Qwen Edit Plus modular for multi-image input ( #12601 )
...
* try to fix qwen edit plus multi images (modular)
* up
* up
* test
* up
* up
2025-12-09 10:08:30 -10:00
YiYi Xu
07ea0786e8
[Modular]z-image ( #12808 )
...
* initiL
* up up
* fix: z_image -> z-image
* style
* copy
* fix more
* some docstring fix
2025-12-09 08:08:41 -10:00
David El Malih
54fa0745c3
Improve docstrings and type hints in scheduling_dpmsolver_singlestep.py ( #12798 )
...
feat: add flow sigmas, dynamic shifting, and refine type hints in DPMSolverSinglestepScheduler
2025-12-08 08:58:57 -08:00
David Lacalle Castillo
3d02cd543e
[PRX] Improve model compilation ( #12787 )
...
* Reimplement img2seq & seq2img in PRX to enable ONNX build without Col2Im (incompatible with TensorRT).
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-08 17:42:17 +05:30
CalamitousFelicitousness
2246d2c7c4
Add ZImageImg2ImgPipeline ( #12751 )
...
* Add ZImageImg2ImgPipeline
Updated the pipeline structure to include ZImageImg2ImgPipeline
alongside ZImagePipeline.
Implemented the ZImageImg2ImgPipeline class for image-to-image
transformations, including necessary methods for
encoding prompts, preparing latents, and denoising.
Enhanced the auto_pipeline to map the new ZImageImg2ImgPipeline
for image generation tasks.
Added unit tests for ZImageImg2ImgPipeline to ensure
functionality and performance.
Updated dummy objects to include ZImageImg2ImgPipeline for
testing purposes.
* Address review comments for ZImageImg2ImgPipeline
- Add `# Copied from` annotations to encode_prompt and _encode_prompt
- Add ZImagePipeline to auto_pipeline.py for AutoPipeline support
* Add ZImage pipeline documentation
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2025-12-07 22:06:23 -10:00
YiYi Xu
671149e036
[HunyuanVideo1.5] support step-distilled ( #12802 )
...
* support step-distilled
* style
2025-12-07 21:50:36 -10:00
jiqing-feng
f67639b0bb
add post init for safty checker ( #12794 )
...
* add post init for safty checker
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* check transformers version before post init
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* Apply style fixes
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-08 11:31:03 +05:30
jingyu-ml
5a74319715
Update the TensorRT-ModelOPT to Nvidia-ModelOPT ( #12793 )
...
Update the naming
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-08 10:07:04 +05:30
Tran Thanh Luan
6290fdfda4
[Feat] TaylorSeer Cache ( #12648 )
...
* init taylor_seer cache
* make compatible with any tuple size returned
* use logger for printing, add warmup feature
* still update in warmup steps
* refractor, add docs
* add configurable cache, skip compute module
* allow special cache ids only
* add stop_predicts (cooldown)
* update docs
* apply ruff
* update to handle multple calls per timestep
* refractor to use state manager
* fix format & doc
* chores: naming, remove redundancy
* add docs
* quality & style
* fix taylor precision
* Apply style fixes
* add tests
* Apply style fixes
* Remove TaylorSeerCacheTesterMixin from flux2 tests
* rename identifiers, use more expressive taylor predict loop
* torch compile compatible
* Apply style fixes
* Update src/diffusers/hooks/taylorseer_cache.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* update docs
* make fix-copies
* fix example usage.
* remove tests on flux kontext
---------
Co-authored-by: toilaluan <toilaluan@github.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-06 05:39:54 +05:30
David El Malih
256e010674
Improve docstrings and type hints in scheduling_deis_multistep.py ( #12796 )
...
* feat: Add `flow_prediction` to `prediction_type`, introduce `use_flow_sigmas`, `flow_shift`, `use_dynamic_shifting`, and `time_shift_type` parameters, and refine type hints for various arguments.
* style: reformat argument wrapping in `_convert_to_beta` and `index_for_timestep` method signatures.
2025-12-05 08:48:01 -08:00
Sayak Paul
8430ac2a2f
[docs] minor fixes to kandinsky docs ( #12797 )
...
up
2025-12-05 08:33:05 -08:00
sayakpaul
bb9e713d02
move kandisnky docs.
2025-12-05 21:44:24 +07:00
Álvaro Somoza
c98c157a9e
[Docs] Add Z-Image docs ( #12775 )
...
* initial
* toctree
* fix
* apply review and fix
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-12-05 11:05:47 -03:00
swappy
f12d161d67
Fix broken group offloading with block_level for models with standalone layers ( #12692 )
...
* fix: group offloading to support standalone computational layers in block-level offloading
* test: for models with standalone and deeply nested layers in block-level offloading
* feat: support for block-level offloading in group offloading config
* fix: group offload block modules to AutoencoderKL and AutoencoderKLWan
* fix: update group offloading tests to use AutoencoderKL and adjust input dimensions
* refactor: streamline block offloading logic
* Apply style fixes
* update tests
* update
* fix for failing tests
* clean up
* revert to use skip_keys
* clean up
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-12-05 18:54:05 +05:30
David Bertoin
8d415a6f48
PRX Set downscale_freq_shift to 0 for consistency with internal implementation ( #12791 )
...
fix timestepembeddings downscale_freq_shift to be consitant with Photoroom's original code
2025-12-04 10:57:14 -10:00
Sayak Paul
7de51b826c
[lora] support more ZImage LoRAs ( #12790 )
...
up
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2025-12-04 09:01:11 -10:00
Jiang
cd00ba685b
fix spatial compression ratio error for AutoEncoderKLWan doing tiled encode ( #12753 )
...
fix spatial compression ratio compute error for AutoEncoderKLWan
Co-authored-by: lirui.926 <lirui.926@bytedance.com >
2025-12-04 08:57:13 -10:00
David El Malih
2842c14c5f
Improve docstrings and type hints in scheduling_unipc_multistep.py ( #12767 )
...
refactor: add type hints and update docstrings for UniPCMultistepScheduler parameters and methods.
2025-12-04 10:10:54 -08:00
Sayak Paul
c318686090
Update attention_backends.md to format kernels ( #12757 )
2025-12-04 07:48:23 -08:00
hlky
6028613226
Z-Image-Turbo from_single_file ( #12756 )
...
* Z-Image-Turbo `from_single_file`
* compute_dtype
* -device cast
2025-12-04 20:22:48 +05:30
Sayak Paul
a1f36ee3ef
[Z-Image] various small changes, Z-Image transformer tests, etc. ( #12741 )
...
* start zimage model tests.
* up
* up
* up
* up
* up
* up
* up
* up
* up
* up
* up
* up
* Revert "up"
This reverts commit bca3e27c96 .
* expand upon compilation failure reason.
* Update tests/models/transformers/test_models_transformer_z_image.py
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
* reinitialize the padding tokens to ones to prevent NaN problems.
* updates
* up
* skipping ZImage DiT tests
* up
* up
---------
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com >
2025-12-03 19:35:46 +05:30
Sayak Paul
d96cbacacd
[tests] fix hunuyanvideo 1.5 offloading tests. ( #12782 )
...
fix hunuyanvideo 1.5 offloading tests.
2025-12-03 18:07:59 +05:30
Aditya Borate
5ab5946931
Fix: leaf_level offloading breaks after delete_adapters ( #12639 )
...
* Fix(peft): Re-apply group offloading after deleting adapters
* Test: Add regression test for group offloading + delete_adapters
* Test: Add assertions to verify output changes after deletion
* Test: Add try/finally to clean up group offloading hooks
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-03 17:39:11 +05:30
Lev Novitskiy
d0c54e5563
Kandinsky 5.0 Video Pro and Image Lite ( #12664 )
...
* add transformer pipeline first version
---------
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Charles <charles@huggingface.co >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: dmitrienkoae <dmitrienko.ae@phystech.edu >
Co-authored-by: nvvaulin <nvvaulin@gmail.com >
2025-12-03 00:46:37 -10:00
Dhruv Nair
1908c47600
Deprecate upcast_vae in SDXL based pipelines ( #12619 )
...
* update
* update
* Revert "update"
This reverts commit 73906381ab .
* Revert "update"
This reverts commit 21a03f93ef .
* update
* update
* update
* update
* update
2025-12-03 15:53:23 +05:30
Sayak Paul
759ea58708
[core] reuse AttentionMixin for compatible classes ( #12463 )
...
* remove attn_processors property
* more
* up
* up more.
* up
* add AttentionMixin to AuraFlow.
* up
* up
* up
* up
2025-12-03 13:58:33 +05:30
Sayak Paul
f48f9c250f
[core] start varlen variants for attn backend kernels. ( #12765 )
...
* start varlen variants for attn backend kernels.
* maybe unflatten heads.
* updates
* remove unused function.
* doc
* up
2025-12-03 13:34:52 +05:30
Kimbing Ng
3c05b9f71c
Fixes #12673 . record_stream in group offloading is not working properly ( #12721 )
...
* Fixes #12673 .
Wrong default_stream is used. leading to wrong execution order when record_steram is enabled.
* update
* Update test
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-03 11:37:11 +05:30
Jerry Wu
9379b2391b
Fix TPU (torch_xla) compatibility Error about tensor repeat func along with empty dim. ( #12770 )
...
* Refactor image padding logic to pervent zero tensor in transformer_z_image.py
* Apply style fixes
* Add more support to fix repeat bug on tpu devices.
* Fix for dynamo compile error for multi if-branches.
---------
Co-authored-by: Mingjia Li <mingjiali@tju.edu.cn >
Co-authored-by: Mingjia Li <mail@mingjia.li >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-02 12:51:23 -10:00