sayakpaul
ce12b356df
up
2025-12-17 18:09:26 +05:30
Sayak Paul
d6498d26d7
Merge branch 'main' into pipeline-specific-mixins
2025-12-17 20:35:48 +08:00
naykun
f9c1e612fb
Qwen Image Layered Support ( #12853 )
...
* [qwen-image] qwen image layered support
* [qwen-image] update doc
* [qwen-image] fix pr comments
* Apply style fixes
* make fix-copies
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-17 16:57:57 +05:30
Wang, Yi
87f7d11143
extend TorchAoTest::test_model_memory_usage to other platform ( #12768 )
...
* extend TorchAoTest::test_model_memory_usage to other platform
Signe-off-by: Wang, Yi <yi.a.wang@inel.com >
* add some comments
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
---------
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
2025-12-17 13:44:08 +05:30
sayakpaul
d6f59bcbe4
resolve conflicts.
2025-12-17 09:42:24 +05:30
junqiangwu
5e48f466b9
fix the prefix_token_len bug ( #12845 )
2025-12-15 22:02:25 -10:00
junqiangwu
a748a839ad
Add support for LongCat-Image ( #12828 )
...
* Add LongCat-Image
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/transformers/transformer_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* fix code
* add doc
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image_edit.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image_edit.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/longcat_image/pipeline_longcat_image.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* fix code & mask style & fix-copies
* Apply style fixes
* fix single input rewrite error
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: hadoop-imagen <hadoop-imagen@psxfb7pxrbvmh3oq-worker-0.psxfb7pxrbvmh3oq.hadoop-aipnlp.svc.cluster.local >
2025-12-15 07:45:17 -10:00
Yuqian Hong
58519283e7
Support for control-lora ( #10686 )
...
* run control-lora on diffusers
* cannot load lora adapter
* test
* 1
* add control-lora
* 1
* 1
* 1
* fix PeftAdapterMixin
* fix module_to_save bug
* delete json print
* resolve conflits
* merged but bug
* change peft.py
* 1
* delete state_dict print
* fix alpha
* Create control_lora.py
* Add files via upload
* rename
* no need modify as peft updated
* add doc
* fix code style
* styling isn't that hard 😉
* empty
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-15 15:52:42 +05:30
Wang, Yi
0c1ccc0775
fix pytest tests/pipelines/pixart_sigma/test_pixart.py::PixArtSigmaPi… ( #12842 )
...
fix pytest tests/pipelines/pixart_sigma/test_pixart.py::PixArtSigmaPipelineIntegrationTests::test_pixart_512 in xpu
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-15 14:36:01 +05:30
naykun
b8a4cbac14
[qwen-image] edit 2511 support ( #12839 )
...
* [qwen-image] edit 2511 support
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-15 12:35:01 +05:30
Wang, Yi
17c0e79dbd
support CP in native flash attention ( #12829 )
...
Signed-off-by: Wang, Yi <yi.a.wang@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-12 13:18:39 +05:30
Sayak Paul
1567243463
[lora] Remove lora docs unneeded and add " # Copied from ..." ( #12824 )
...
* remove unneeded docs on load_lora_weights().
* remove more.
* up[
* up
* up
2025-12-12 08:31:27 +05:30
Sayak Paul
0eac64c7a6
Update distributed_inference.md to correct syntax ( #12827 )
2025-12-11 08:46:43 -08:00
Sayak Paul
10e820a2dd
post release 0.36.0 ( #12804 )
...
* post release 0.36.0
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-11 22:01:59 +05:30
sayakpaul
04f0f95a7a
yup
2025-12-11 14:02:58 +05:30
sayakpaul
8cfaaa2fb6
yp
2025-12-11 13:39:49 +05:30
sayakpaul
f037930b90
up
2025-12-11 11:32:54 +05:30
sayakpaul
3e374fda38
up
2025-12-11 11:17:28 +05:30
sayakpaul
14f51c51f0
up
2025-12-11 10:47:26 +05:30
sayakpaul
9f9eb6ae5b
up
2025-12-11 10:46:23 +05:30
Sayak Paul
69927d9e39
Merge branch 'main' into qwen-pipeline-mixin
2025-12-11 13:12:42 +08:00
Sayak Paul
6708f5c76d
[docs] improve distributed inference cp docs. ( #12810 )
...
* improve distributed inference cp docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-12-10 08:25:07 -08:00
Dhruv Nair
be3c2a0667
[WIP] Add Flux2 modular ( #12763 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-12-10 12:19:07 +05:30
Sayak Paul
8b4722de57
Fix Qwen Edit Plus modular for multi-image input ( #12601 )
...
* try to fix qwen edit plus multi images (modular)
* up
* up
* test
* up
* up
2025-12-09 10:08:30 -10:00
YiYi Xu
07ea0786e8
[Modular]z-image ( #12808 )
...
* initiL
* up up
* fix: z_image -> z-image
* style
* copy
* fix more
* some docstring fix
2025-12-09 08:08:41 -10:00
David El Malih
54fa0745c3
Improve docstrings and type hints in scheduling_dpmsolver_singlestep.py ( #12798 )
...
feat: add flow sigmas, dynamic shifting, and refine type hints in DPMSolverSinglestepScheduler
2025-12-08 08:58:57 -08:00
David Lacalle Castillo
3d02cd543e
[PRX] Improve model compilation ( #12787 )
...
* Reimplement img2seq & seq2img in PRX to enable ONNX build without Col2Im (incompatible with TensorRT).
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-08 17:42:17 +05:30
sayakpaul
a6d5c4d9c6
up
2025-12-08 14:17:59 +05:30
sayakpaul
e5c40f4f6e
uo
2025-12-08 14:06:09 +05:30
Sayak Paul
131ea1676e
Merge branch 'main' into qwen-pipeline-mixin
2025-12-08 16:26:52 +08:00
sayakpaul
e122079279
Revert "up"
...
This reverts commit 044392d65f .
2025-12-08 13:56:32 +05:30
sayakpaul
b4432fffab
Revert "path change for StableDiffusionLoraLoaderMixin"
...
This reverts commit 6d881198f3 .
2025-12-08 13:56:18 +05:30
sayakpaul
6d881198f3
path change for StableDiffusionLoraLoaderMixin
2025-12-08 13:38:07 +05:30
sayakpaul
044392d65f
up
2025-12-08 13:36:29 +05:30
CalamitousFelicitousness
2246d2c7c4
Add ZImageImg2ImgPipeline ( #12751 )
...
* Add ZImageImg2ImgPipeline
Updated the pipeline structure to include ZImageImg2ImgPipeline
alongside ZImagePipeline.
Implemented the ZImageImg2ImgPipeline class for image-to-image
transformations, including necessary methods for
encoding prompts, preparing latents, and denoising.
Enhanced the auto_pipeline to map the new ZImageImg2ImgPipeline
for image generation tasks.
Added unit tests for ZImageImg2ImgPipeline to ensure
functionality and performance.
Updated dummy objects to include ZImageImg2ImgPipeline for
testing purposes.
* Address review comments for ZImageImg2ImgPipeline
- Add `# Copied from` annotations to encode_prompt and _encode_prompt
- Add ZImagePipeline to auto_pipeline.py for AutoPipeline support
* Add ZImage pipeline documentation
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2025-12-07 22:06:23 -10:00
YiYi Xu
671149e036
[HunyuanVideo1.5] support step-distilled ( #12802 )
...
* support step-distilled
* style
2025-12-07 21:50:36 -10:00
sayakpaul
be586607de
remove fluxcontrolmixin
2025-12-08 12:19:50 +05:30
sayakpaul
cf3053b565
Merge branch 'main' into qwen-pipeline-mixin
2025-12-08 12:07:04 +05:30
jiqing-feng
f67639b0bb
add post init for safty checker ( #12794 )
...
* add post init for safty checker
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* check transformers version before post init
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* Apply style fixes
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-12-08 11:31:03 +05:30
jingyu-ml
5a74319715
Update the TensorRT-ModelOPT to Nvidia-ModelOPT ( #12793 )
...
Update the naming
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-08 10:07:04 +05:30
Tran Thanh Luan
6290fdfda4
[Feat] TaylorSeer Cache ( #12648 )
...
* init taylor_seer cache
* make compatible with any tuple size returned
* use logger for printing, add warmup feature
* still update in warmup steps
* refractor, add docs
* add configurable cache, skip compute module
* allow special cache ids only
* add stop_predicts (cooldown)
* update docs
* apply ruff
* update to handle multple calls per timestep
* refractor to use state manager
* fix format & doc
* chores: naming, remove redundancy
* add docs
* quality & style
* fix taylor precision
* Apply style fixes
* add tests
* Apply style fixes
* Remove TaylorSeerCacheTesterMixin from flux2 tests
* rename identifiers, use more expressive taylor predict loop
* torch compile compatible
* Apply style fixes
* Update src/diffusers/hooks/taylorseer_cache.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* update docs
* make fix-copies
* fix example usage.
* remove tests on flux kontext
---------
Co-authored-by: toilaluan <toilaluan@github.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-12-06 05:39:54 +05:30
David El Malih
256e010674
Improve docstrings and type hints in scheduling_deis_multistep.py ( #12796 )
...
* feat: Add `flow_prediction` to `prediction_type`, introduce `use_flow_sigmas`, `flow_shift`, `use_dynamic_shifting`, and `time_shift_type` parameters, and refine type hints for various arguments.
* style: reformat argument wrapping in `_convert_to_beta` and `index_for_timestep` method signatures.
2025-12-05 08:48:01 -08:00
Sayak Paul
8430ac2a2f
[docs] minor fixes to kandinsky docs ( #12797 )
...
up
2025-12-05 08:33:05 -08:00
sayakpaul
bb9e713d02
move kandisnky docs.
2025-12-05 21:44:24 +07:00
Álvaro Somoza
c98c157a9e
[Docs] Add Z-Image docs ( #12775 )
...
* initial
* toctree
* fix
* apply review and fix
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/api/pipelines/z_image.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-12-05 11:05:47 -03:00
swappy
f12d161d67
Fix broken group offloading with block_level for models with standalone layers ( #12692 )
...
* fix: group offloading to support standalone computational layers in block-level offloading
* test: for models with standalone and deeply nested layers in block-level offloading
* feat: support for block-level offloading in group offloading config
* fix: group offload block modules to AutoencoderKL and AutoencoderKLWan
* fix: update group offloading tests to use AutoencoderKL and adjust input dimensions
* refactor: streamline block offloading logic
* Apply style fixes
* update tests
* update
* fix for failing tests
* clean up
* revert to use skip_keys
* clean up
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-12-05 18:54:05 +05:30
David Bertoin
8d415a6f48
PRX Set downscale_freq_shift to 0 for consistency with internal implementation ( #12791 )
...
fix timestepembeddings downscale_freq_shift to be consitant with Photoroom's original code
2025-12-04 10:57:14 -10:00
Sayak Paul
7de51b826c
[lora] support more ZImage LoRAs ( #12790 )
...
up
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2025-12-04 09:01:11 -10:00
Jiang
cd00ba685b
fix spatial compression ratio error for AutoEncoderKLWan doing tiled encode ( #12753 )
...
fix spatial compression ratio compute error for AutoEncoderKLWan
Co-authored-by: lirui.926 <lirui.926@bytedance.com >
2025-12-04 08:57:13 -10:00
David El Malih
2842c14c5f
Improve docstrings and type hints in scheduling_unipc_multistep.py ( #12767 )
...
refactor: add type hints and update docstrings for UniPCMultistepScheduler parameters and methods.
2025-12-04 10:10:54 -08:00