Aryan
2e83cbbb6d
LTX 0.9.5 ( #10968 )
...
* update
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-17 16:43:36 -10:00
C
33d10af28f
Fix Wan I2V Quality ( #11087 )
...
* fix_wan_i2v_quality
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update pipeline_wan_i2v.py
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-17 06:24:57 -10:00
Sayak Paul
100142586f
[CI] pin transformers version for benchmarking. ( #11067 )
...
pin transformers version for benchmarking.
2025-03-16 10:27:35 +05:30
Yuxuan Zhang
82188cef04
CogView4 Control Block ( #10809 )
...
* cogview4 control training
---------
Co-authored-by: OleehyO <leehy0357@gmail.com >
Co-authored-by: yiyixuxu <yixu310@gmail.com >
2025-03-15 07:15:56 -10:00
Sayak Paul
cc19726f3d
[Tests] add requires peft decorator. ( #11037 )
...
* add requires peft decorator.
* install peft conditionally.
* conditional deps.
Co-authored-by: DN6 <dhruv.nair@gmail.com >
---------
Co-authored-by: DN6 <dhruv.nair@gmail.com >
2025-03-15 12:56:41 +05:30
Dimitri Barbot
be54a95b93
Fix deterministic issue when getting pipeline dtype and device ( #10696 )
...
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-15 07:50:58 +05:30
Juan Acevedo
6b9a3334db
reverts accidental change that removes attn_mask in attn. Improves fl… ( #11065 )
...
reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop.
Co-authored-by: Juan Acevedo <jfacevedo@google.com >
2025-03-14 12:47:01 -10:00
Andreas Jörg
8ead643bb7
[examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch ( #11051 )
...
Fix: dtype mismatch of prompt embeddings in sd3 controlnet training
Co-authored-by: Andreas Jörg <andreasjoerg@MacBook-Pro-von-Andreas-2.fritz.box >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-14 17:33:15 +05:30
Sayak Paul
124ac3e81f
[LoRA] feat: support non-diffusers wan t2v loras. ( #11059 )
...
feat: support non-diffusers wan t2v loras.
2025-03-14 16:01:25 +05:30
Sayak Paul
2f0f281b0d
[Tests] restrict memory tests for quanto for certain schemes. ( #11052 )
...
* restrict memory tests for quanto for certain schemes.
* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* fixes
* style
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-14 10:35:19 +05:30
ZhengKai91
ccc8321651
Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed ( #10820 )
...
* get_1d_rotary_pos_embed support npu
* Update src/diffusers/models/embeddings.py
---------
Co-authored-by: Kai zheng <kaizheng@KaideMacBook-Pro.local >
Co-authored-by: hlky <hlky@hlky.ac >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:58:03 -10:00
Yaniv Galron
5e48cd27d4
making ``formatted_images`` initialization compact ( #10801 )
...
compact writing
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:27:14 -10:00
hlky
5551506b29
Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline ( #10827 )
...
* Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:24:21 -10:00
Sayak Paul
20e4b6a628
[LoRA] change to warning from info when notifying the users about a LoRA no-op ( #11044 )
...
* move to warning.
* test related changes.
2025-03-12 21:20:48 +05:30
hlky
4ea9f89b8e
Wan Pipeline scaling fix, type hint warning, multi generator fix ( #11007 )
...
* Wan Pipeline scaling fix, type hint warning, multi generator fix
* Apply suggestions from code review
2025-03-12 12:05:52 +00:00
hlky
733b44ac82
[hybrid inference 🍯 🐝 ] Add VAE encode ( #11017 )
...
* [hybrid inference 🍯 🐝 ] Add VAE encode
* _toctree: add vae encode
* Add endpoints, tests
* vae_encode docs
* vae encode benchmarks
* api reference
* changelog
* Update docs/source/en/hybrid_inference/overview.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-12 11:23:41 +00:00
hlky
8b4f8ba764
Use output_size in repeat_interleave ( #11030 )
2025-03-12 07:30:21 +00:00
Dhruv Nair
5428046437
[Refactor] Clean up import utils boilerplate ( #11026 )
...
* update
* update
* update
2025-03-12 07:48:34 +05:30
39th president of the United States, probably
e7ffeae0a1
Fix for multi-GPU WAN inference ( #10997 )
...
Ensure that hidden_state and shift/scale are on the same device when running with multiple GPUs
Co-authored-by: Jimmy <39@🇺🇸 .com>
2025-03-11 07:42:12 -10:00
CyberVy
d87ce2cefc
Fix missing **kwargs in lora_pipeline.py ( #11011 )
...
* Update lora_pipeline.py
* Apply style fixes
* fix-copies
---------
Co-authored-by: hlky <hlky@hlky.ac >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-11 07:34:27 -10:00
wonderfan
36d0553af2
chore: fix help messages in advanced diffusion examples ( #10923 )
2025-03-11 07:33:55 -10:00
hlky
7e0db46f73
Fix SD3 IPAdapter feature extractor ( #11027 )
2025-03-11 16:29:27 +00:00
Sayak Paul
e4b056fe65
[LoRA] support wan i2v loras from the world. ( #11025 )
...
* support wan i2v loras from the world.
* remove copied from.
* updates
* add lora.
2025-03-11 20:43:29 +05:30
Eliseu Silva
4e3ddd5afa
fix: mixture tiling sdxl pipeline - adjust generating time_ids & embeddings ( #11012 )
...
small fix on generating time_ids & embeddings
2025-03-11 04:20:18 -03:00
Dhruv Nair
9add071592
[Quantization] Allow loading TorchAO serialized Tensor objects with torch>=2.6 ( #11018 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-03-11 10:52:01 +05:30
Tolga Cangöz
b88fef4785
[Research Project] Add AnyText: Multilingual Visual Text Generation And Editing ( #8998 )
...
* Add initial template
* Second template
* feat: Add TextEmbeddingModule to AnyTextPipeline
* feat: Add AuxiliaryLatentModule template to AnyTextPipeline
* Add bert tokenizer from the anytext repo for now
* feat: Update AnyTextPipeline's modify_prompt method
This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe.
* Fill in the `forward` pass of `AuxiliaryLatentModule`
* `make style && make quality`
* `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library`
* Update error handling to raise and logging
* Add `create_glyph_lines` function into `TextEmbeddingModule`
* make style
* Up
* Up
* Up
* Up
* Remove several comments
* refactor: Remove ControlNetConditioningEmbedding and update code accordingly
* Up
* Up
* up
* refactor: Update AnyTextPipeline to include new optional parameters
* up
* feat: Add OCR model and its components
* chore: Update `TextEmbeddingModule` to include OCR model components and dependencies
* chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task
* `make style`
* refactor: Update `AnyTextPipeline`'s docstring
* Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once
* simplify
* `make style`
* Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function
* Simplify for now
* `make style`
* Up
* feat: Add scripts to convert AnyText controlnet to diffusers
* `make style`
* Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule`
* make style
* Up
* Simplify
* Up
* feat: Add safetensors module for loading model file
* Fix device issues
* Up
* Up
* refactor: Simplify
* refactor: Simplify code for loading models and handling data types
* `make style`
* refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule
* refactor: Update dtype in embedding_manager.py to match proj.weight
* Up
* Add attribution and adaptation information to pipeline_anytext.py
* Update usage example
* Will refactor `controlnet_cond_embedding` initialization
* Add `AnyTextControlNetConditioningEmbedding` template
* Refactor organization
* style
* style
* Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding`
* Follow one-file policy
* style
* [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel
* [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py
* [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py
* Refactor AnyTextControlNet to use configurable conditioning embedding channels
* Complete control net conditioning embedding in AnyTextControlNetModel
* up
* [FIX] Ensure embeddings use correct device in AnyTextControlNetModel
* up
* up
* style
* [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline
* [UPDATE] Update example code in anytext.py to use correct font file and improve clarity
* down
* [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing
* update pillow
* [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity
* [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file
* [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency
* 🆙
* style
* [UPDATE] Revise README.md for clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py
* style
* Update examples/research_projects/anytext/README.md
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
* Remove commented-out image preparation code in AnyTextPipeline
* Remove unnecessary blank line in README.md
2025-03-11 01:49:37 +05:30
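The `modify_prompt` entry in the AnyText commit above describes replacing the selected string prompts with a placeholder. A rough sketch of that idea in plain Python follows. It assumes the text to render is wrapped in double quotes and that `*` is the placeholder; both choices, and the name `replace_quoted`, are illustrative assumptions rather than the pipeline's actual code, and the Chinese-text translation step mentioned in the commit is omitted.

```python
import re
from typing import List, Tuple

PLACEHOLDER = "*"  # assumed placeholder token, for illustration only


def replace_quoted(prompt: str) -> Tuple[str, List[str]]:
    """Swap each double-quoted span for a placeholder and return the spans."""
    # Normalize curly quotes so a single pattern covers both quote styles.
    prompt = prompt.replace("\u201c", '"').replace("\u201d", '"')
    texts = re.findall(r'"(.*?)"', prompt)
    for text in texts:
        # Replace one occurrence at a time so repeated strings stay aligned
        # with their position in the returned list.
        prompt = prompt.replace(f'"{text}"', f" {PLACEHOLDER} ", 1)
    return prompt, texts
```

Calling `replace_quoted('A sign that says "OPEN"')` returns the prompt with the quoted span replaced by the placeholder, together with the list of extracted strings, so the strings can be rendered separately.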
Sayak Paul
e7e6d85282
[Tests] improve quantization tests by additionally measuring the inference memory savings ( #11021 )
...
* memory usage tests
* fixes
* gguf
2025-03-10 21:42:24 +05:30
Aryan
8eefed65bd
[LoRA] CogView4 ( #10981 )
...
* update
* make fix-copies
* update
2025-03-10 20:24:05 +05:30
Sayak Paul
26149c0ecd
[LoRA] Improve warning messages when LoRA loading becomes a no-op ( #10187 )
...
* updates
* updates
* updates
* updates
* notebooks revert
* fix-copies.
* seeing
* fix
* revert
* fixes
* fixes
* fixes
* remove print
* fix
* conflicts ii.
* updates
* fixes
* better filtering of prefix.
---------
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-10 09:28:32 +05:30
Ishan Modi
0703ce8800
[Single File] Add single file loading for SANA Transformer ( #10947 )
...
* added support for from_single_file
* added diffusers mapping script
* added testcase
* bug fix
* updated tests
* corrected code quality
* corrected code quality
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-10 08:38:30 +05:30
Dhruv Nair
f5edaa7894
[Quantization] Add Quanto backend ( #10756 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update docs/source/en/quantization/quanto.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update src/diffusers/quantizers/quanto/utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-10 08:33:05 +05:30
Dhruv Nair
9a1810f0de
Fix for fetching variants only ( #10646 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-03-10 07:45:44 +05:30
Sayak Paul
1fddee211e
[LoRA] Improve copied from comments in the LoRA loader classes ( #10995 )
...
* more sanity of mind with copied from ...
* better
* better
2025-03-08 19:59:21 +05:30
Kinam Kim
b38450d5d2
Add STG to community pipelines ( #10960 )
...
* Support STG for video pipelines
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update README.md
* Update pipeline_stg_cogvideox.py
* Update pipeline_stg_hunyuan_video.py
* Update pipeline_stg_ltx.py
* Update pipeline_stg_ltx_image2video.py
* Update pipeline_stg_mochi.py
* Update pipeline_stg_hunyuan_video.py
* Update pipeline_stg_ltx.py
* Update pipeline_stg_ltx_image2video.py
* Update pipeline_stg_mochi.py
* update
* remove rescaling
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-08 00:28:24 +05:30
Dhruv Nair
1357931d74
[Single File] Add single file support for Wan T2V/I2V ( #10991 )
...
* update
* update
* update
* update
* update
* update
* update
2025-03-07 22:13:25 +05:30
Sayak Paul
a2d3d6af44
[LoRA] remove full key prefix from peft. ( #11004 )
...
remove full key prefix from peft.
2025-03-07 21:51:59 +05:30
hlky
363d1ab7e2
Wan VAE move scaling to pipeline ( #10998 )
2025-03-07 10:42:17 +00:00
C
6a0137eb3b
Fix Graph Breaks When Compiling CogView4 ( #10959 )
...
* Fix Graph Breaks When Compiling CogView4
Eliminate this:
```
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] Recompiling function forward in /home/zeyi/repos/diffusers/src/diffusers/models/transformers/transformer_cogview4.py:374
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] triggered by the following guard failure(s):
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] - 0/3: ___check_obj_id(L['self'].rope.freqs_h, 139976127328032)
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] - 0/2: ___check_obj_id(L['self'].rope.freqs_h, 139976107780960)
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] - 0/1: ___check_obj_id(L['self'].rope.freqs_h, 140022511848960)
V0304 10:24:23.421000 3131076 torch/_dynamo/guards.py:2813] [0/4] [__recompiles] - 0/0: ___check_obj_id(L['self'].rope.freqs_h, 140024081342416)
```
* Update transformer_cogview4.py
* fix cogview4 rotary pos embed
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-06 22:57:17 -10:00
Aryan
2e5203be04
Hunyuan I2V ( #10983 )
...
* update
* update
* update
* add tests
* update
* add model tests
* update docs
* update
* update example
* fix defaults
* update
2025-03-07 12:52:48 +05:30
yupeng1111
d55f41102a
fix wan i2v pipeline bugs ( #10975 )
...
* fix wan i2v pipeline bugs
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-06 18:57:41 -10:00
LittleNyima
748cb0fab6
Add CogVideoX DDIM Inversion to Community Pipelines ( #10956 )
...
* add cogvideox ddim inversion script
* implement as a pipeline, and add documentation
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-03-06 10:46:38 -10:00
Dhruv Nair
790a909b54
[Single File] Add user agent to SF download requests. ( #10979 )
...
update
2025-03-06 10:45:20 -10:00
CyberVy
54ab475391
Fix Flux Controlnet Pipeline _callback_tensor_inputs Missing Some Elements ( #10974 )
...
* Update pipeline_flux_controlnet.py
* Update pipeline_flux_controlnet_image_to_image.py
* Update pipeline_flux_controlnet_inpainting.py
* Update pipeline_flux_controlnet_inpainting.py
* Update pipeline_flux_controlnet_inpainting.py
2025-03-06 14:26:20 -03:00
dependabot[bot]
f103993094
Bump jinja2 from 3.1.5 to 3.1.6 in /examples/research_projects/realfill ( #10984 )
...
Bumps [jinja2](https://github.com/pallets/jinja ) from 3.1.5 to 3.1.6.
- [Release notes](https://github.com/pallets/jinja/releases )
- [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst )
- [Commits](https://github.com/pallets/jinja/compare/3.1.5...3.1.6 )
---
updated-dependencies:
- dependency-name: jinja2
dependency-type: direct:production
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-03-06 11:59:51 +00:00
Sayak Paul
1be0202502
[CI] remove synchronized. ( #10980 )
...
removed synchronized.
2025-03-06 17:03:19 +05:30
Pierre Chapuis
ea81a4228d
fix default values of Flux guidance_scale in docstrings ( #10982 )
2025-03-06 16:37:45 +05:30
hlky
b15027636a
Fix loading OneTrainer Flux LoRA ( #10978 )
...
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-06 13:53:36 +05:30
Sayak Paul
6e2a93de70
[tests] fix tests for save load components ( #10977 )
...
fix tests
2025-03-06 12:30:37 +05:30
Jun Yeop Na
37b8edfb86
[train_dreambooth_lora.py] Fix the LR Schedulers when num_train_epochs is passed in a distributed training env ( #10973 )
...
* updated train_dreambooth_lora to fix the LR schedulers for `num_train_epochs` in distributed training env
* fixed formatting
* remove trailing newlines
* fixed style error
2025-03-06 10:06:24 +05:30
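The scheduler fix in #10973 above concerns how many steps the LR scheduler is sized for when only `num_train_epochs` is given and the dataloader is sharded across processes. A minimal sketch of that arithmetic follows. It assumes the scheduler advances once per process for every optimizer update; the function name `scheduler_steps` and its parameters are hypothetical and are not copied from `train_dreambooth_lora.py`.

```python
import math
from typing import Optional, Tuple


def scheduler_steps(
    dataloader_len: int,
    num_processes: int,
    gradient_accumulation_steps: int,
    num_train_epochs: int,
    lr_warmup_steps: int,
    max_train_steps: Optional[int] = None,
) -> Tuple[int, int]:
    """Return (warmup_steps, total_steps) used to size an LR scheduler.

    Assumption: the scheduler is stepped once per process per optimizer
    update, so both counts are scaled by num_processes.
    """
    warmup = lr_warmup_steps * num_processes
    if max_train_steps is None:
        # After sharding, each process sees only its slice of the dataloader.
        sharded_len = math.ceil(dataloader_len / num_processes)
        updates_per_epoch = math.ceil(sharded_len / gradient_accumulation_steps)
        total = num_train_epochs * updates_per_epoch * num_processes
    else:
        total = max_train_steps * num_processes
    return warmup, total
```

Deriving the step count from the unsharded dataloader length would overestimate the total by roughly a factor of `num_processes`, so the learning rate would decay more slowly than intended.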
Célina
fbf6b856cc
use style bot GH Action from huggingface_hub ( #10970 )
...
use style bot GH action from hfh
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-03-05 23:39:50 +05:30