Patrick von Platen
1ade42f729
make style
2023-10-23 19:43:54 +02:00
Shyam Marjit
677df5ac12
fixed SDXL text encoder training bug #5016 ( #5078 )
...
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-23 19:43:43 +02:00
Andrei Filatov
16851efa0f
Update README.md ( #5497 )
...
Right now, only "main" branch has this community pipeline code. So, adding it manually into pipeline
2023-10-23 18:57:43 +02:00
linjiapro
45db049973
Fix the order of width and height of original size in SDXL training script ( #5382 )
...
* wip
* wip
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-18 22:38:52 +02:00
Liang Hou
a68f5062fb
style(sdxl): remove identity assignments ( #5418 )
2023-10-18 22:38:20 +02:00
Patrick von Platen
85dccab7fd
Add latent consistency ( #5438 )
...
* Add latent consistency
* Update examples/community/README.md
* Add latent consistency
* make fix copies
* Apply suggestions from code review
2023-10-18 14:18:31 +02:00
Susheel Thapa
324d18fba2
Chore: Typo fixed in multiple files ( #5422 )
2023-10-17 08:17:03 -07:00
Sayak Paul
cc12f3ec92
[Examples] Update with HFApi ( #5393 )
...
* update training examples to use HFAPI.
* update training example.
* reflect the changes in the korean version too.
* Empty-Commit
2023-10-16 19:34:46 +05:30
Heinz-Alexander Fuetterer
0ea78f9707
chore: fix typos ( #5386 )
...
* chore: fix typos
* Update src/diffusers/pipelines/shap_e/renderer.py
Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com >
---------
Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com >
2023-10-16 15:23:37 +02:00
Kashif Rasul
d03c9099bc
[Wuerstchen] text to image training script ( #5052 )
...
* initial script
* formatting
* prior trainer wip
* add efficient_net_encoder
* add CLIPTextModel
* add prior ema support
* optimizer
* fix typo
* add dataloader
* prompt_embeds and image_embeds
* intial training loop
* fix output_dir
* fix add_noise
* accelerator check
* make effnet_transforms dynamic
* fix training loop
* add validation logging
* use loaded text_encoder
* use PreTrainedTokenizerFast
* load weigth from pickle
* save_model_card
* remove unused file
* fix typos
* save prior pipeilne in its own folder
* fix imports
* fix pipe_t2i
* scale image_embeds
* remove snr_gamma
* format
* initial lora prior training
* log_validation and save
* initial gradient working
* remove save/load hooks
* set set_attn_processor on prior_prior
* add lora script
* typos
* use LoraLoaderMixin for prior pipeline
* fix usage
* make fix-copies
* yse repo_id
* write_lora_layers is a staitcmethod
* use defualts
* fix defaults
* undo
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_prior.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/loaders.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/loaders.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/wuerstchen/modeling_wuerstchen_prior.py
* Update src/diffusers/loaders.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/loaders.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* add graident checkpoint support to prior
* gradient_checkpointing
* formatting
* Update examples/wuerstchen/text_to_image/README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/README.md
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/train_text_to_image_lora_prior.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update src/diffusers/loaders.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* Update examples/wuerstchen/text_to_image/train_text_to_image_prior.py
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* use default unet and text_encoder
* fix test
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
2023-10-16 15:00:33 +02:00
Sayak Paul
93df5bb670
[Examples] fix unconditioning generation training example for mixed-precision training ( #5407 )
...
* fix: unconditional generation example
* fix: float in loss.
* apply styling.
2023-10-16 14:11:35 +05:30
Sayak Paul
0fa32bd674
[Examples] use loralinear instead of depecrecated lora attn procs. ( #5331 )
...
* use loralinear instead of depecrecated lora attn procs.
* fix parameters()
* fix saving
* add back support for add kv proj.
* fix: param accumul,ation.
* propagate the changes.
2023-10-11 13:02:42 +02:00
ssusie
aea73834f6
Adding PyTorch XLA support for sdxl inference ( #5273 )
...
* Added mark_step for sdxl to run with pytorch xla. Also updated README with instructions for xla
* adding soft dependency on torch_xla
* fix some styling
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2023-10-11 12:35:37 +02:00
Humphrey009
9c82b68f07
fix problem of 'accelerator.is_main_process' to run in mutiple GPUs ( #5340 )
...
fix problem of 'accelerator.is_main_process' to run in mutiple GPUs or NPUs
Co-authored-by: jiaqiw <wangjiaqi50@huawei.com >
2023-10-10 15:39:22 +05:30
Julien Simon
d3e0750d5d
Add missing dependency in requirements file ( #5345 )
...
Update requirements_sdxl.txt
Add missing 'datasets'
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-10 15:37:58 +05:30
Pu Cao
cc2c4ae759
fix inference in custom diffusion ( #5329 )
...
* Update train_custom_diffusion.py
* make style
* Empty-Commit
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-09 10:08:01 +02:00
chuzh
6bd55b54bc
Fix [core/GLIGEN]: TypeError when iterating over 0-d tensor with In-painting mode when EulerAncestralDiscreteScheduler is used ( #5305 )
...
* fix(gligen_inpaint_pipeline): 🐛 Wrap the timestep() 0-d tensor in a list to convert to 1-d tensor. This avoids the TypeError caused by trying to directly iterate over a 0-dimensional tensor in the denoising stage
* test(gligen/gligen_text_image): unit test using the EulerAncestralDiscreteScheduler
---------
Co-authored-by: zhen-hao.chu <zhen-hao.chu@vitrox.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-09 09:54:01 +02:00
Zeng Xian
0513a8cfd8
fix typo in train dreambooth lora description ( #5332 )
2023-10-08 14:54:33 +02:00
Bagheera
02a8d664a2
Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction … ( #5238 )
...
Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction by adding 1 to SNR rather than the resulting loss weights
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-10-05 20:52:27 +02:00
Patrick von Platen
dfcce3ca6e
[Research folder] Add SDXL example ( #5275 )
...
* [SDXL Flax] Add research folder
* Add co-author
Co-authored-by: Juan Acevedo <jfacevedo@google.com >
---------
Co-authored-by: Juan Acevedo <jfacevedo@google.com >
2023-10-03 07:51:46 +02:00
Patrick von Platen
bdd16116f3
[Schedulers] Fix callback steps ( #5261 )
...
* fix all
* make fix copies
* make fix copies
2023-10-02 19:52:53 +02:00
Sayak Paul
d56825e4b4
fix: how print training resume logs. ( #5117 )
...
* fix: how print training resume logs.
* propagate changes to text-to-image scripts.
* propagate changes to instructpix2pix.
* propagate changes to dreambooth
* propagate changes to custom diffusion and instructpix2pix
* propagate changes to kandinsky
* propagate changes to textual inv.
* debug
* fix: checkpointing.
* debug
* debug
* debug
* back to the square
* debug
* debug
* change condition order.
* debug
* debug
* debug
* debug
* revert to original
* clean
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-10-02 18:29:52 +02:00
Nicholas Bardy
1c4c4c48d9
Correct file name in t2i adapter training readme ( #5207 )
...
Update README_sdxl.md
2023-09-28 14:51:02 +02:00
Benjamin Paine
693a0d08e4
Remove Offensive Language from Community Pipelines ( #5206 )
...
* Update run_onnx_controlnet.py
* Update run_tensorrt_controlnet.py
2023-09-27 22:02:25 +02:00
Sayak Paul
cdcc01be0e
[Examples] add compute_snr() to training utils. ( #5188 )
...
add compute_snr() to training utils.
2023-09-27 21:42:20 +05:30
Bagheera
4a06c74547
Min-SNR Gamma: follow-up fix for zero-terminal SNR models on v-prediction or epsilon ( #5177 )
...
* merge with main
* fix flax example
* fix onnx example
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-26 18:14:52 +05:30
Bagheera
89d8f84893
Timestep bias for fine-tuning SDXL ( #5094 )
...
* Timestep bias for fine-tuning SDXL
* Adjust parameter choices to include "range" and reword the help statements
* Condition our use of weighted timesteps on the value of timestep_bias_strategy
* style
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-26 13:45:37 +05:30
Bagheera
539846a7d5
SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know ( #5155 )
...
* SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know
* SDXL microconditioning documentation should indicate the correct default order of parameters, so that developers know
* empty
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-25 20:22:09 +02:00
Anh71me
28254c79b6
Fix type annotation ( #5146 )
...
* Fix type annotation on Scheduler.from_pretrained
* Fix type annotation on PIL.Image
2023-09-25 19:26:39 +02:00
Bagheera
d558811b26
Min-SNR gamma support for Dreambooth training ( #5107 )
...
* min-SNR gamma for Dreambooth training
* Align the mse_loss_weights style with SDXL training example
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-21 22:53:06 +01:00
Bagheera
24563ca654
SNR gamma fixes for v_prediction training ( #5106 )
...
Co-authored-by: bghira <bghira@users.github.com >
2023-09-20 21:18:56 +01:00
Bagheera
74e43a4fbd
Resolve v_prediction issue for min-SNR gamma weighted loss function ( #5096 )
...
* Resolve v_prediction issue for min-SNR gamma weighted loss function
* Combine MSE loss calculation of epsilon and velocity, with a note about the application of the epsilon code to sample prediction
* style
---------
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-19 17:31:27 +01:00
Bagheera
81331f3b7d
Add x-prediction / prediction_type=sample support for SDXL fine-tuning ( #5095 )
...
Co-authored-by: bghira <bghira@users.github.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-19 16:57:44 +01:00
maksymbekuzarovSC
65c162a5b3
Fixed get_word_inds mistake/typo in P2P community pipeline (breaking code examples) ( #5089 )
...
Fixed `get_word_inds` mistake/typo in P2P community pipeline
The function `get_word_inds` was taking a string of text and either a word (str) or a word index (int) and returned the indices of token(s) the word would be encoded to.
However, there was a typo, in which in the second `if` branch the word was checked to be a `str` **again**, not `int`, which resulted in an [example code from the docs](https://github.com/huggingface/diffusers/tree/main/examples/community#prompt2prompt-pipeline ) to result in an error
2023-09-19 11:34:49 +02:00
Ruoxi
16b9a57d29
Implement CustomDiffusionAttnProcessor2_0. ( #4604 )
...
* Implement `CustomDiffusionAttnProcessor2_0`
* Doc-strings and type annotations for `CustomDiffusionAttnProcessor2_0`. (#1 )
* Update attnprocessor.md
* Update attention_processor.py
* Interops for `CustomDiffusionAttnProcessor2_0`.
* Formatted `attention_processor.py`.
* Formatted doc-string in `attention_processor.py`
* Conditional CustomDiffusion2_0 for training example.
* Remove unnecessary reference impl in comments.
* Fix `save_attn_procs`.
2023-09-18 14:49:00 +02:00
Lee Dong Joo
b089102a8e
fix guidance_rescale docstring ( #5063 )
2023-09-18 13:39:12 +02:00
Kashif Rasul
73bb97adfc
[LoRA] fix typo in attention_processor.py ( #5066 )
...
* [LoRA] fix typo in attention_processor.py
fixes #5062
* make style
* make fix-copies, logger comented for torch compile
2023-09-16 14:43:18 +02:00
Sayak Paul
38a664a3d6
fix: validation_image arg ( #5053 )
2023-09-15 12:20:50 +01:00
Gang Wu
9f40d7970e
[FIX BUG] type of args in train_instruct_pix2pix_sdxl.py ( #4955 )
2023-09-15 12:53:07 +02:00
dotieuthien
941473a12f
Fix import in examples ( #5048 )
...
* convert tensorrt controlnet
* Fix code quality
* Fix code quality
* Fix code quality
* Fix code quality
* Fix code quality
* Fix code quality
* Fix number controlnet condition
* Add convert SD XL to onnx
* Add convert SD XL to tensorrt
* Add convert SD XL to tensorrt
* Add examples in comments
* Add examples in comments
* Add test onnx controlnet
* Add tensorrt test
* Remove copied
* Move file test to examples/community
* Remove script
* Remove script
* Remove text
* Fix import
---------
Co-authored-by: dotieuthien <thien.do@mservice.com.vn >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-15 12:48:06 +02:00
YiYi Xu
e70cb1243f
[WIP] adding Kandinsky training scripts ( #4890 )
...
* Add files via upload
Co-authored-by: Shahmatov Arseniy <62886550+cene555@users.noreply.github.com >
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2023-09-14 06:58:20 -10:00
Patrick von Platen
342c5c02c0
[Release 0.21] Bump version ( #5018 )
...
* [Release 0.21] Bump version
* fix & remove
* fix more
* fix all, upload
2023-09-14 18:28:57 +02:00
UmerHA
169fc4add5
Add Prompt2Prompt pipeline ( #4563 )
...
* Initial commit P2P
* Replaced CrossAttention, added test skeleton
* bug fixes
* Updated docstring
* Removed unused function
* Created tests
* improved tests
- made fast inference tests faster
- corrected image shape assertions
* Corrected expected output shape in tests
* small fix: test inputs
* Update tests
- used conditional unet2d
- set expected image slices
- edit_kwargs are now not popped, so pipe can be run multiple times
* Fixed bug in int tests
* Fixed tests
* Linting
* Create prompt2prompt.md
* Added to docs toc
* Ran make fix-copies
* Fixed code blocks in docs
* Using same interface as StableDiffusionPipeline
* Fixed small test bug
* Added all options SDPipeline.__call_ has
* Fixed docstring; made __call__ like in SD
* Linting
* Added test for multiple prompts
* Improved docs
* Incorporated feedback
* Reverted formatting on unrelated files
* Moved prompt2prompt to community
- Moved prompt2prompt pipeline from main to community
- Deleted tests
- Moved documentation to community and shorted it
* Update src/diffusers/utils/dummy_torch_and_transformers_objects.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-14 16:39:59 +02:00
Patrick von Platen
1037287e2b
examples fix t2i training ( #5001 )
...
* examples fix t2i training
* make style
2023-09-12 23:52:41 +02:00
Kashif Rasul
73bf620dec
fix E721 Do not compare types, use isinstance() ( #4992 )
2023-09-12 16:52:25 +02:00
Dhruv Nair
b6e0b016ce
Lazy Import for Diffusers ( #4829 )
...
* initial commit
* move modules to import struct
* add dummy objects and _LazyModule
* add lazy import to schedulers
* clean up unused imports
* lazy import on models module
* lazy import for schedulers module
* add lazy import to pipelines module
* lazy import altdiffusion
* lazy import audio diffusion
* lazy import audioldm
* lazy import consistency model
* lazy import controlnet
* lazy import dance diffusion ddim ddpm
* lazy import deepfloyd
* lazy import kandinksy
* lazy imports
* lazy import semantic diffusion
* lazy imports
* lazy import stable diffusion
* move sd output to its own module
* clean up
* lazy import t2iadapter
* lazy import unclip
* lazy import versatile and vq diffsuion
* lazy import vq diffusion
* helper to fetch objects from modules
* lazy import sdxl
* lazy import txt2vid
* lazy import stochastic karras
* fix model imports
* fix bug
* lazy import
* clean up
* clean up
* fixes for tests
* fixes for tests
* clean up
* remove import of torch_utils from utils module
* clean up
* clean up
* fix mistake import statement
* dedicated modules for exporting and loading
* remove testing utils from utils module
* fixes from merge conflicts
* Update src/diffusers/pipelines/kandinsky2_2/__init__.py
* fix docs
* fix alt diffusion copied from
* fix check dummies
* fix more docs
* remove accelerate import from utils module
* add type checking
* make style
* fix check dummies
* remove torch import from xformers check
* clean up error message
* fixes after upstream merges
* dummy objects fix
* fix tests
* remove unused module import
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-11 09:56:22 +02:00
Will Berman
d73e6ad050
guard save model hooks to only execute on main process ( #4929 )
2023-09-08 10:30:06 -07:00
Sayak Paul
d0cf681a1f
[Tests] add: tests for t2i adapter training. ( #4947 )
...
add: tests for t2i adapter training.
2023-09-08 19:45:39 +05:30
Suraj Patil
dfec61f4b3
[examples] T2IAdapter training script ( #4934 )
...
* add t2i_example script
* remove in channels logic
* remove comments
* remove use_euler arg
* add requirements
* only use canny example
* use datasets
* comments
* make log_validation consistent with other scripts
* add readme
* fix title in readme
* update check_min_version
* change a few minor things.
* add doc entry
* add: test for t2i adapter training
* remove use_auth_token
* fix: logged info.
* remove tests for now.
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-08 10:03:02 +05:30
Bagheera
cfdfcf2018
Add --vae_precision option to the SDXL pix2pix script so that we have… ( #4881 )
...
* Add --vae_precision option to the SDXL pix2pix script so that we have the option of avoiding float32 overhead
* style
---------
Co-authored-by: bghira <bghira@users.github.com >
2023-09-05 09:04:06 +02:00