Aryan
2ffbb88f1c
[training] CogVideoX-I2V LoRA ( #9482 )
...
* update
* update
* update
* update
* update
* add coauthor
Co-Authored-By: yuan-shenghai <963658029@qq.com >
* add coauthor
Co-Authored-By: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com >
* update
Co-Authored-By: yuan-shenghai <963658029@qq.com >
* update
---------
Co-authored-by: yuan-shenghai <963658029@qq.com >
Co-authored-by: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com >
2024-10-16 02:07:07 +05:30
Ahnjj_DEV
d40da7b68a
Fix some documentation in ./src/diffusers/models/adapter.py ( #9591 )
...
* Fix some documentation in ./src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update src/diffusers/models/adapter.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* run make style
* make style & fix
* make style : 0.1.5 version ruff
* revert changes to examples
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2024-10-15 10:27:39 -07:00
wony617
a3e8d3f7de
[docs] refactoring docstrings in models/embeddings_flax.py ( #9592 )
...
* [docs] refactoring docstrings in `models/embeddings_flax.py`
* Update src/diffusers/models/embeddings_flax.py
* make style
---------
Co-authored-by: Aryan <aryan@huggingface.co >
2024-10-15 19:15:14 +05:30
wony617
fff4be8e23
[docs] refactoring docstrings in community/hd_painter.py ( #9593 )
...
* [docs] refactoring docstrings in community/hd_painter.py
* Update examples/community/hd_painter.py
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
* make style
---------
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2024-10-15 18:50:12 +05:30
Jiwook Han
355bb641e3
[doc] Fix some docstrings in src/diffusers/training_utils.py ( #9606 )
...
* refac: docstrings in training_utils.py
* fix: manual edits
* run make style
* add docstring at cast_training_params
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-10-15 18:47:52 +05:30
Charchit Sharma
92d2baf643
refactor image_processor.py file ( #9608 )
...
* refactor image_processor file
* changes as requested
* +1 edits
* quality fix
* indent issue
---------
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-15 17:20:33 +05:30
0x名無し
dccf39f01e
Dreambooth lora flux bug 3dtensor to 2dtensor ( #9653 )
...
* fixed issue #9350 , Tensor is deprecated
* ran make style
2024-10-15 17:18:13 +05:30
Sayak Paul
99d87474fd
[Chore] fix import of EntryNotFoundError. ( #9676 )
...
fix import of EntryNotFoundError.
2024-10-15 14:07:08 +05:30
Robin
79b118e863
[Fix] when run load pretain with local_files_only, local variable 'cached_folder' referenced before assignment ( #9376 )
...
Fix local variable 'cached_folder' referenced before assignment in hub_utils.py
Fix when use `local_files_only=True` with `subfolder`, local variable 'cached_folder' referenced before assignment issue.
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 20:49:36 -10:00
hlky
9d0616189e
Slight performance improvement to Euler, EDMEuler, FlowMatchHeun, KDPM2Ancestral ( #9616 )
...
* Slight performance improvement to Euler
* Slight performance improvement to EDMEuler
* Slight performance improvement to FlowMatchHeun
* Slight performance improvement to KDPM2Ancestral
* Update KDPM2AncestralDiscreteSchedulerTest
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 19:34:25 -10:00
hlky
5f0df17703
Refactor SchedulerOutput and add pred_original_sample in DPMSolverSDE, Heun, KDPM2Ancestral and KDPM2 ( #9650 )
...
Refactor SchedulerOutput and add pred_original_sample
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 18:11:01 -10:00
hlky
957e5cabff
Convert list/tuple of HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel ( #9651 )
...
Convert list/tuple of HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 18:09:30 -10:00
hlky
3e4c5707c3
Convert list/tuple of SD3ControlNetModel to SD3MultiControlNetModel ( #9652 )
...
Convert list/tuple of SD3ControlNetModel to SD3MultiControlNetModel
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 17:57:34 -10:00
hlky
1bcd19e4d0
Add pred_original_sample to if not return_dict path ( #9649 )
...
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 17:56:54 -10:00
SahilCarterr
22ed39f571
Added Lora Support to SD3 Img2Img Pipeline ( #9659 )
...
* add lora
2024-10-14 11:39:20 -10:00
Tolga Cangöz
56c21150d8
[Community Pipeline] Add 🪆 Matryoshka Diffusion Models ( #9157 )
2024-10-14 11:38:44 -10:00
Leo Jiang
5956b68a69
Improve the performance and suitable for NPU computing ( #9642 )
...
* Improve the performance and suitable for NPU
* Improve the performance and suitable for NPU computing
* Improve the performance and suitable for NPU
* Improve the performance and suitable for NPU
* Improve the performance and suitable for NPU
* Improve the performance and suitable for NPU
---------
Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-10-14 21:39:33 +05:30
Yuxuan.Zhang
8d81564b27
CogView3Plus DiT ( #9570 )
...
* merge 9588
* max_shard_size="5GB" for colab running
* conversion script updates; modeling test; refactor transformer
* make fix-copies
* Update convert_cogview3_to_diffusers.py
* initial pipeline draft
* make style
* fight bugs 🐛 🪳
* add example
* add tests; refactor
* make style
* make fix-copies
* add co-author
YiYi Xu <yixu310@gmail.com >
* remove files
* add docs
* add co-author
Co-Authored-By: YiYi Xu <yixu310@gmail.com >
* fight docs
* address reviews
* make style
* make model work
* remove qkv fusion
* remove qkv fusion tets
* address review comments
* fix make fix-copies error
* remove None and TODO
* for FP16(draft)
* make style
* remove dynamic cfg
* remove pooled_projection_dim as a parameter
* fix tests
---------
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-14 19:30:36 +05:30
Ryan Lin
68d16f7806
Flux - soft inpainting via differential diffusion ( #9268 )
...
* Flux - soft inpainting via differential diffusion
* .
* track changes to FluxInpaintPipeline
* make mask arrangement simplier
* make style
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
Co-authored-by: asomoza <somoza.alvaro@gmail.com >
2024-10-14 10:07:48 -03:00
Sayak Paul
86bcbc389e
[Tests] increase transformers version in test_low_cpu_mem_usage_with_loading ( #9662 )
...
increase transformers version in test_low_cpu_mem_usage_with_loading
2024-10-13 22:39:38 +05:30
Jinzhe Pan
6a5f06488c
[docs] Fix xDiT doc image damage ( #9655 )
...
* docs: fix xDiT doc image damage
* doc: move xdit images to hf dataset
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-10-12 13:05:07 +05:30
Sayak Paul
c7a6d77b5f
[CI] replace ubuntu version to 22.04. ( #9656 )
...
replace ubuntu version to 22.04.
2024-10-12 11:55:36 +05:30
hlky
0f8fb75c7b
FluxMultiControlNetModel ( #9647 )
2024-10-11 14:39:19 -03:00
M Saqlain
3033f08201
Add Differential Diffusion to Kolors ( #9423 )
...
* Added diff diff support for kolors img2img
* Fized relative imports
* Fized relative imports
* Added diff diff support for Kolors
* Fized import issues
* Added map
* Fized import issues
* Fixed naming issues
* Added diffdiff support for Kolors img2img pipeline
* Removed example docstrings
* Added map input
* Updated latents
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
* Updated `original_with_noise`
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
* Improved code quality
---------
Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com >
2024-10-11 10:47:31 -03:00
GSSun
164ec9f423
fix IsADirectoryError when running the training code for sd3_dreambooth_lora_16gb.ipynb ( #9634 )
...
Add files via upload
fix IsADirectoryError when running the training code
2024-10-11 13:33:39 +05:30
Subho Ghosh
38a3e4df92
flux controlnet control_guidance_start and control_guidance_end implement ( #9571 )
...
* flux controlnet control_guidance_start and control_guidance_end implement
* minor fix - added docstrings, consistent controlnet scale flux and SD3
2024-10-10 09:29:02 -03:00
Sayak Paul
e16fd93d0a
[LoRA] fix dora test to catch the warning properly. ( #9627 )
...
fix dora test.
2024-10-10 11:47:49 +05:30
Pakkapon Phongthawee
07bd2fabb6
make controlnet support interrupt ( #9620 )
...
* make controlnet support interrupt
* remove white space in controlnet interrupt
2024-10-09 12:03:13 -10:00
SahilCarterr
af28ae2d5b
add PAG support for SD Img2Img ( #9463 )
...
* added pag to sd img2img pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-09 10:40:58 -10:00
Sayak Paul
31058cdaef
[LoRA] allow loras to be loaded with low_cpu_mem_usage. ( #9510 )
...
* allow loras to be loaded with low_cpu_mem_usage.
* add flux support but note https://github.com/huggingface/diffusers/pull/9510\#issuecomment-2378316687
* low_cpu_mem_usage.
* fix-copies
* fix-copies again
* tests
* _LOW_CPU_MEM_USAGE_DEFAULT_LORA
* _peft_version default.
* version checks.
* version check.
* version check.
* version check.
* require peft 0.13.1.
* explicitly specify low_cpu_mem_usage=False.
* docs.
* transformers version 4.45.2.
* update
* fix
* empty
* better name initialize_dummy_state_dict.
* doc todos.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* style
* fix-copies
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-10-09 10:57:16 +05:30
Yijun Lee
ec9e5264c0
refac/pipeline_output ( #9582 )
2024-10-08 16:11:13 -10:00
sanaka
acd6d2c42f
Fix the bug that joint_attention_kwargs is not passed to the FLUX's transformer attention processors ( #9517 )
...
* Update transformer_flux.py
2024-10-08 11:25:48 -10:00
v2ray
86bd991ee5
Fixed noise_pred_text referenced before assignment. ( #9537 )
...
* Fixed local variable noise_pred_text referenced before assignment when using PAG with guidance scale and guidance rescale at the same time.
* Fixed style.
* Made returning text pred noise an argument.
2024-10-08 09:27:10 -10:00
Sayak Paul
02eeb8e77e
[LoRA] Handle DoRA better ( #9547 )
...
* handle dora.
* print test
* debug
* fix
* fix-copies
* update logits
* add warning in the test.
* make is_dora check consistent.
* fix-copies
2024-10-08 21:47:44 +05:30
glide-the
66eef9a6dc
fix: CogVideox train dataset _preprocess_data crop video ( #9574 )
...
* Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure.
Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none'].
* The number 127.5 may experience precision loss during division operations.
* wandb request pil image Type
* Resizing bug
* del jupyter
* make style
* Update examples/cogvideo/README.md
* make style
---------
Co-authored-by: --unset <--unset>
Co-authored-by: Aryan <aryan@huggingface.co >
2024-10-08 12:52:52 +05:30
Sayak Paul
63a5c8742a
Update distributed_inference.md to include transformer.device_map ( #9553 )
...
* Update distributed_inference.md to include `transformer.device_map`
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-10-08 08:03:51 +05:30
Eliseu Silva
1287822973
Fix for use_safetensors parameters, allow use of parameter on loading submodels ( #9576 ) ( #9587 )
...
* Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576 )
2024-10-07 10:41:32 -10:00
Yijun Lee
a80f689200
refac: docstrings in import_utils.py ( #9583 )
...
* refac: docstrings in import_utils.py
* Update import_utils.py
2024-10-07 13:27:35 -07:00
captainzz
2cb383f591
fix vae dtype when accelerate config using --mixed_precision="fp16" ( #9601 )
...
* fix vae dtype when accelerate config using --mixed_precision="fp16"
* Add param for upcast vae
2024-10-07 21:00:25 +05:30
Sayak Paul
31010ecc45
[Chore] add a note on the versions in Flux LoRA integration tests ( #9598 )
...
add a note on the versions.
2024-10-07 17:43:48 +05:30
Clem
3159e60d59
fix xlabs FLUX lora conversion typo ( #9581 )
...
* fix startswith syntax in xlabs lora conversion
* Trigger CI
https://github.com/huggingface/diffusers/pull/9581#issuecomment-2395530360
2024-10-07 10:47:54 +05:30
YiYi Xu
99f608218c
[sd3] make sure height and size are divisible by 16 ( #9573 )
...
* check size
* up
2024-10-03 08:36:26 -10:00
Xiangchendong
7f323f0f31
fix cogvideox autoencoder decode ( #9569 )
...
Co-authored-by: Aryan <aryan@huggingface.co >
2024-10-02 09:07:06 -10:00
Darren Hsu
61d37640ad
Support bfloat16 for Upsample2D ( #9480 )
...
* Support bfloat16 for Upsample2D
* Add test and use is_torch_version
* Resolve comments and add decorator
* Simplify require_torch_version_greater_equal decorator
* Run make style
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-10-01 16:08:12 -10:00
JuanCarlosPi
33fafe3d14
Add PAG support to StableDiffusionControlNetPAGInpaintPipeline ( #8875 )
...
* Add pag to controlnet inpainting pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-09-30 20:04:42 -10:00
hlky
c4a8979f30
Add beta sigmas to other schedulers and update docs ( #9538 )
2024-09-30 09:00:54 -10:00
Sayak Paul
f9fd511466
[LoRA] support Kohya Flux LoRAs that have text encoders as well ( #9542 )
...
* support kohya flux loras that have tes.
2024-09-30 07:59:39 -10:00
Sayak Paul
8e7d6c03a3
[chore] fix: retain memory utility. ( #9543 )
...
* fix: retain memory utility.
* fix
* quality
* free_memory.
2024-09-28 21:08:45 +05:30
Anand Kumar
b28675c605
[train_instruct_pix2pix.py]Fix the LR schedulers when num_train_epochs is passed in a distributed training env ( #9316 )
...
Fixed pix2pix lr scheduler
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-09-28 21:01:37 +05:30
Aryan
bd4df2856a
[refactor] remove conv_cache from CogVideoX VAE ( #9524 )
...
* remove conv cache from the layer and pass as arg instead
* make style
* yiyi's cleaner implementation
Co-Authored-By: YiYi Xu <yixu310@gmail.com >
* sayak's compiled implementation
Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-09-28 17:09:30 +05:30