Dhruv Nair
617c208bb4
[Docs] Update Wan Docs with memory optimizations ( #11089 )
...
* update
* update
2025-03-28 19:05:56 +05:30
hlky
5d970a4aa9
WanI2V encode_image ( #11164 )
...
* WanI2V encode_image
2025-03-28 18:05:34 +05:30
kentdan3msu
de6a88c2d7
Set self._hf_peft_config_loaded to True when LoRA is loaded using load_lora_adapter in PeftAdapterMixin class ( #11155 )
...
set self._hf_peft_config_loaded to True on successful lora load
Sets the `_hf_peft_config_loaded` flag if a LoRA is successfully loaded in `load_lora_adapter`. Fixes bug huggingface/diffusers/issues/11148
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-26 18:31:18 +01:00
Dhruv Nair
7dc52ea769
[Quantization] dtype fix for GGUF + fix BnB tests ( #11159 )
...
* update
* update
* update
* update
2025-03-26 22:22:16 +05:30
Junsong Chen
739d6ec731
add a timestep scale for sana-sprint teacher model ( #11150 )
2025-03-25 08:47:39 -10:00
Aryan
1ddf3f3a19
Improve information about group offloading and layerwise casting ( #11101 )
...
* update
* Update docs/source/en/optimization/memory.md
* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* apply review suggestions
* update
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-24 23:25:59 +05:30
Jun Yeop Na
7aac77affa
[doc] Fix Korean Controlnet Train doc ( #11141 )
...
* remove typo from korean controlnet train doc
* removed more paragraphs to remain in sync with the english document
2025-03-24 09:38:21 -07:00
Aryan
8907a70a36
New HunyuanVideo-I2V ( #11066 )
...
* update
* update
* update
* add tests
* update docs
* raise value error
* warning for true cfg and guidance scale
* fix test
2025-03-24 21:18:40 +05:30
Junsong Chen
5dbe4f5de6
[fix SANA-Sprint] ( #11142 )
...
* fix bug in sana conversion script;
* add more model paths;
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-23 23:38:14 -10:00
Yuxuan Zhang
1d37f42055
Modify the implementation of retrieve_timesteps in CogView4-Control. ( #11125 )
...
* 1
* change to channel 1
* cogview4 control training
* add CacheMixin
* 1
* remove initial_input_channels change for val
* 1
* update
* use 3.5
* new loss
* 1
* use imagetoken
* for megatron convert
* 1
* train con and uc
* 2
* remove guidance_scale
* Update pipeline_cogview4_control.py
* fix
* use cogview4 pipeline with timestep
* update shift_factor
* remove the uncond
* add max length
* change convert and use GLMModel instead of GLMForCasualLM
* fix
* [cogview4] Add attention mask support to transformer model
* [fix] Add attention mask for padded token
* update
* remove padding type
* Update train_control_cogview4.py
* resolve conflicts with #10981
* add control convert
* use control format
* fix
* add missing import
* update with cogview4 formate
* make style
* Update pipeline_cogview4_control.py
* Update pipeline_cogview4_control.py
* remove
* Update pipeline_cogview4_control.py
* put back
* Apply style fixes
---------
Co-authored-by: OleehyO <leehy0357@gmail.com >
Co-authored-by: yiyixuxu <yixu310@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-23 21:17:14 +05:30
Tolga Cangöz
0213179ba8
Update README and example code for AnyText usage ( #11028 )
...
* [Documentation] Update README and example code with additional usage instructions for AnyText
* [Documentation] Update README for AnyTextPipeline and improve logging in code
* Remove wget command for font file from example docstring in anytext.py
2025-03-23 21:15:57 +05:30
hlky
a7d53a5939
Don't override torch_dtype and don't use when quantization_config is set ( #11039 )
...
* Don't use `torch_dtype` when `quantization_config` is set
* up
* djkajka
* Apply suggestions from code review
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-21 21:58:38 +05:30
YiYi Xu
8a63aa5e4f
add sana-sprint ( #11074 )
...
* add sana-sprint
---------
Co-authored-by: Junsong Chen <cjs1020440147@icloud.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2025-03-21 06:21:18 -10:00
Aryan
844221ae4e
[core] FasterCache ( #10163 )
...
* init
* update
* update
* update
* make style
* update
* fix
* make it work with guidance distilled models
* update
* make fix-copies
* add tests
* update
* apply_faster_cache -> apply_fastercache
* fix
* reorder
* update
* refactor
* update docs
* add fastercache to CacheMixin
* update tests
* Apply suggestions from code review
* make style
* try to fix partial import error
* Apply style fixes
* raise warning
* update
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-21 09:35:04 +05:30
CyberVy
9b2c0a7dbe
fix _callback_tensor_inputs of sd controlnet inpaint pipeline missing some elements ( #11073 )
...
* Update pipeline_controlnet_inpaint.py
* Apply style fixes
2025-03-20 23:56:12 -03:00
Parag Ekbote
f424b1b062
Notebooks for Community Scripts-8 ( #11128 )
...
Add 4 Notebooks and update the missing links for the
example README.
2025-03-20 12:24:46 -07:00
YiYi Xu
e9fda3924f
remove F.rms_norm for now ( #11126 )
...
up
2025-03-20 07:55:01 -10:00
Dhruv Nair
2c1ed50fc5
Provide option to reduce CPU RAM usage in Group Offload ( #11106 )
...
* update
* update
* clean up
2025-03-20 17:01:09 +05:30
Fanli Lin
15ad97f782
[tests] make cuda only tests device-agnostic ( #11058 )
...
* enable bnb on xpu
* add 2 more cases
* add missing change
* add missing change
* add one more
* enable cuda only tests on xpu
* enable big gpu cases
2025-03-20 10:12:35 +00:00
hlky
9f2d5c9ee9
Flux with Remote Encode ( #11091 )
...
* Flux img2img remote encode
* Flux inpaint
* -copied from
2025-03-20 09:44:08 +00:00
Junsong Chen
dc62e6931e
[fix bug] PixArt inference_steps=1 ( #11079 )
...
* fix bug when pixart-dmd inference with `num_inference_steps=1`
* use return_dict=False and return [1] element for 1-step pixart model, which works for both lcm and dmd
2025-03-20 07:44:30 +00:00
Fanli Lin
56f740051d
[tests] enable bnb tests on xpu ( #11001 )
...
* enable bnb on xpu
* add 2 more cases
* add missing change
* add missing change
* add one more
2025-03-19 16:33:11 +00:00
Linoy Tsaban
a34d97cef0
[Wan LoRAs] make T2V LoRAs compatible with Wan I2V ( #11107 )
...
* @hlky t2v->i2v
* Apply style fixes
* try with ones to not nullify layers
* fix method name
* revert to zeros
* add check to state_dict keys
* add comment
* copies fix
* Revert "copies fix"
This reverts commit 051f534d18 .
* remove copied from
* Update src/diffusers/loaders/lora_pipeline.py
Co-authored-by: hlky <hlky@hlky.ac >
* Update src/diffusers/loaders/lora_pipeline.py
Co-authored-by: hlky <hlky@hlky.ac >
* update
* update
* Update src/diffusers/loaders/lora_pipeline.py
Co-authored-by: hlky <hlky@hlky.ac >
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Linoy <linoy@hf.co >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-19 21:44:19 +05:30
Yuqian Hong
fc28791fc8
[BUG] Fix Autoencoderkl train script ( #11113 )
...
* add disc_optimizer step (not fix)
* support syncbatchnorm in discriminator
2025-03-19 16:49:02 +05:30
Sayak Paul
ae14612673
[CI] uninstall deps properly from pr gpu tests. ( #11102 )
...
uninstall deps properly from pr gpu tests.
2025-03-19 08:58:36 +05:30
hlky
0ab8fe49bf
Quality options in export_to_video ( #11090 )
...
* Quality options in `export_to_video`
* make style
2025-03-18 10:32:33 -10:00
Aryan
3be6706018
Fix Group offloading behaviour when using streams ( #11097 )
...
* update
* update
2025-03-18 14:44:10 +05:30
Cheng Jin
cb1b8b21b8
Resolve stride mismatch in UNet's ResNet to support Torch DDP ( #11098 )
...
Modify UNet's ResNet implementation to resolve stride mismatch in Torch's DDP
2025-03-18 07:38:13 +00:00
Juan Acevedo
27916822b2
update readme instructions. ( #11096 )
...
Co-authored-by: Juan Acevedo <jfacevedo@google.com >
2025-03-17 20:07:48 -10:00
co63oc
3fe3bc0642
Fix pipeline_flux_controlnet.py ( #11095 )
...
* Fix pipeline_flux_controlnet.py
* Fix style
2025-03-17 19:52:15 -10:00
Aryan
813d42cc96
Group offloading improvements ( #11094 )
...
update
2025-03-18 11:18:00 +05:30
Sayak Paul
b4d7e9c632
make PR GPU tests conditioned on styling. ( #11099 )
2025-03-18 11:15:35 +05:30
Aryan
2e83cbbb6d
LTX 0.9.5 ( #10968 )
...
* update
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-17 16:43:36 -10:00
C
33d10af28f
Fix Wan I2V Quality ( #11087 )
...
* fix_wan_i2v_quality
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update pipeline_wan_i2v.py
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-17 06:24:57 -10:00
Sayak Paul
100142586f
[CI] pin transformers version for benchmarking. ( #11067 )
...
pin transformers version for benchmarking.
2025-03-16 10:27:35 +05:30
Yuxuan Zhang
82188cef04
CogView4 Control Block ( #10809 )
...
* cogview4 control training
---------
Co-authored-by: OleehyO <leehy0357@gmail.com >
Co-authored-by: yiyixuxu <yixu310@gmail.com >
2025-03-15 07:15:56 -10:00
Sayak Paul
cc19726f3d
[Tests] add requires peft decorator. ( #11037 )
...
* add requires peft decorator.
* install peft conditionally.
* conditional deps.
Co-authored-by: DN6 <dhruv.nair@gmail.com >
---------
Co-authored-by: DN6 <dhruv.nair@gmail.com >
2025-03-15 12:56:41 +05:30
Dimitri Barbot
be54a95b93
Fix deterministic issue when getting pipeline dtype and device ( #10696 )
...
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-15 07:50:58 +05:30
Juan Acevedo
6b9a3334db
reverts accidental change that removes attn_mask in attn. Improves fl… ( #11065 )
...
reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop.
Co-authored-by: Juan Acevedo <jfacevedo@google.com >
2025-03-14 12:47:01 -10:00
Andreas Jörg
8ead643bb7
[examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch ( #11051 )
...
Fix: dtype mismatch of prompt embeddings in sd3 controlnet training
Co-authored-by: Andreas Jörg <andreasjoerg@MacBook-Pro-von-Andreas-2.fritz.box >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-14 17:33:15 +05:30
Sayak Paul
124ac3e81f
[LoRA] feat: support non-diffusers wan t2v loras. ( #11059 )
...
feat: support non-diffusers wan t2v loras.
2025-03-14 16:01:25 +05:30
Sayak Paul
2f0f281b0d
[Tests] restrict memory tests for quanto for certain schemes. ( #11052 )
...
* restrict memory tests for quanto for certain schemes.
* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* fixes
* style
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-14 10:35:19 +05:30
ZhengKai91
ccc8321651
Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed ( #10820 )
...
* get_1d_rotary_pos_embed support npu
* Update src/diffusers/models/embeddings.py
---------
Co-authored-by: Kai zheng <kaizheng@KaideMacBook-Pro.local >
Co-authored-by: hlky <hlky@hlky.ac >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:58:03 -10:00
Yaniv Galron
5e48cd27d4
making ``formatted_images`` initialization compact ( #10801 )
...
compact writing
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:27:14 -10:00
hlky
5551506b29
Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline ( #10827 )
...
* Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:24:21 -10:00
Sayak Paul
20e4b6a628
[LoRA] change to warning from info when notifying the users about a LoRA no-op ( #11044 )
...
* move to warning.
* test related changes.
2025-03-12 21:20:48 +05:30
hlky
4ea9f89b8e
Wan Pipeline scaling fix, type hint warning, multi generator fix ( #11007 )
...
* Wan Pipeline scaling fix, type hint warning, multi generator fix
* Apply suggestions from code review
2025-03-12 12:05:52 +00:00
hlky
733b44ac82
[hybrid inference 🍯 🐝 ] Add VAE encode ( #11017 )
...
* [hybrid inference 🍯 🐝 ] Add VAE encode
* _toctree: add vae encode
* Add endpoints, tests
* vae_encode docs
* vae encode benchmarks
* api reference
* changelog
* Update docs/source/en/hybrid_inference/overview.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-12 11:23:41 +00:00
hlky
8b4f8ba764
Use output_size in repeat_interleave ( #11030 )
2025-03-12 07:30:21 +00:00
Dhruv Nair
5428046437
[Refactor] Clean up import utils boilerplate ( #11026 )
...
* update
* update
* update
2025-03-12 07:48:34 +05:30