mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00
Commit Graph

510 Commits

Author SHA1 Message Date
Harutatsu Akiyama
428dbfecd9 [SDXL and IP2P]: instruction pix2pix XL training and pipeline (#4079)
* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* [Community] Implementation of the IADB community pipeline (#3996)

* community pipeline: implementation of iadb

* iadb.py: reformat using black

* iadb.py: linting update

* add kandinsky to readme table (#4081)

Co-authored-by: yiyixuxu <yixu310@gmail.com>

* [From Single File] Force accelerate to be installed (#4078)

force accelerate to be installed

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Support instruction pix2pix sdxl

* Clean up IP2P SDXL code

* Clean up IP2P SDXL code

* [IP2P and SDXL] clean up code

* [IP2P and SDXL] clean up code

* [IP2P and SDXL] clean up code

* [IP2P SDXL] Address code reviews

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews, add docs, tests

* [IP2P SDXL] Address code reviews

* [IP2P SDXL] Address code reviews

* [IP2P SDXL] Add README_SDXL

* [IP2P SDXL] Address code reviews

* [IP2P SDXL] Address code reviews

* [IP2P SDXL] Fix the copy problems

* [IP2P SDXL] Add license

* [IP2P SDXL] Add license

* [IP2P SDXL] Add license

* [IP2P SDXL] Address code review for selecting VAE and others

* [IP2P SDXL] Update README_sdxl

* [IP2P SDXL] Update __init__

* [IP2P SDXL] Update dummy_torch_and_transformers_and_invisible_watermark_objects

* address patrick's comments and some additions to readmes.

---------

Co-authored-by: Harutatsu Akiyama <kf.zy.qin@gmail.com>
Co-authored-by: Thomas Chambon <36728882+tchambon@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2023-07-25 18:19:35 +05:30
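
A minimal usage sketch of the StableDiffusionXLInstructPix2PixPipeline introduced by #4079; the model id, image URL, and parameter values below are placeholders, not part of the PR:

```python
# Hedged sketch of the SDXL instruction-pix2pix pipeline added in #4079.
# The checkpoint id and image URL are placeholders; substitute real ones.
import torch
from diffusers import StableDiffusionXLInstructPix2PixPipeline
from diffusers.utils import load_image

pipe = StableDiffusionXLInstructPix2PixPipeline.from_pretrained(
    "your-org/sdxl-instruct-pix2pix", torch_dtype=torch.float16  # placeholder model id
).to("cuda")

image = load_image("https://example.com/mountain.png").resize((768, 768))  # placeholder input
edited = pipe(
    prompt="make it snowy",
    image=image,
    num_inference_steps=30,
    image_guidance_scale=1.5,  # how strongly to stay close to the input image
    guidance_scale=7.5,
).images[0]
edited.save("edited.png")
```
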
Ragnar Rova
4e2a021829 Model path for sdxl wrong in dreambooth README (#4261) 2023-07-25 18:06:50 +05:30
Sayak Paul
5ef6b8fa53 Update README_sdxl.md to change the note on default hyperparameters (#4258) 2023-07-25 16:57:48 +05:30
Will Berman
3dd339379d do not pass list to accelerator.init_trackers (#4248) 2023-07-24 21:10:37 -07:00
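
The fix above boils down to passing a plain string project name to accelerate; a sketch of the expected call, with the logging backend and config values as illustrative assumptions:

```python
# Sketch: accelerate expects a string project name, not a list, for init_trackers (#4248).
from accelerate import Accelerator

accelerator = Accelerator(log_with="tensorboard", project_dir="logs")  # illustrative settings
accelerator.init_trackers(
    project_name="dreambooth",          # a single string, not ["dreambooth"]
    config={"learning_rate": 1e-4},     # example hyperparameters to log
)
```
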
nupurkmr9
5652c43f83 Resolve bf16 error as mentioned in this [issue](https://github.com/huggingface/diffusers/issues/4139#issuecomment-1639977304) (#4214)
* resolve bf16 error

* resolve bf16 error

* resolve bf16 error

* resolve bf16 error

* resolve bf16 error

* resolve bf16 error

* resolve bf16 error
2023-07-25 05:41:19 +05:30
Sayak Paul
365e8461ac [SDXL DreamBooth LoRA] add support for text encoder fine-tuning (#4097)
* Allow low precision sd xl

* finish

* finish

* feat: initial draft for supporting text encoder lora finetuning for SDXL DreamBooth

* fix: variable assignments.

* add: autocast block.

* add debugging

* vae dtype hell

* fix: vae dtype hell.

* fix: vae dtype hell 3.

* clean up

* lora text encoder loader.

* fix: unwrapping models.

* add: tests.

* docs.

* handle unexpected keys.

* fix vae dtype in the final inference.

* fix scope problem.

* fix: save_model_card args.

* initialize: prefix to None.

* fix: dtype issues.

* apply fixes.

* debugging.

* debugging

* debugging

* debugging

* debugging

* debugging

* add: fast tests.

* pre-tokenize.

* address: will's comments.

* fix: loader and tests.

* fix: dataloader.

* simplify dataloader.

* length.

* simplification.

* make style && make quality

* simplify state_dict munging

* fix: tests.

* fix: state_dict packing.

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-25 05:35:48 +05:30
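
Once training with the script above finishes, the saved LoRA (UNet layers plus any text-encoder layers from the new fine-tuning option) can be loaded back with load_lora_weights; a sketch, with the LoRA repository id as a placeholder:

```python
# Sketch: loading a DreamBooth LoRA trained with the SDXL script, including any
# text-encoder LoRA layers saved alongside the UNet ones. The LoRA repo id is a placeholder.
import torch
from diffusers import StableDiffusionXLPipeline

pipe = StableDiffusionXLPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0", torch_dtype=torch.float16
).to("cuda")
pipe.load_lora_weights("your-username/sdxl-dreambooth-lora")  # placeholder LoRA repo/folder

image = pipe("a photo of sks dog in a bucket", num_inference_steps=25).images[0]
image.save("dreambooth_lora.png")
```
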
Sayak Paul
fed12376c5 [ControlNet SDXL training] fixes in the training script (#4223)
* fix: #4206

* add: sdxl controlnet training smoketest.

* remove unnecessary token inits.

* add: licensing to model card.

* include SDXL licensing in the model card and make public visibility default

* debugging

* debugging

* disable local file download.

* fix: training test.

* fix: ckpt prefix.
2023-07-25 05:31:48 +05:30
Apoorva Kulkarni
cbb1ead60b docs: Add missing import statement in textual_inversion inference example (#4227)
docs: Add missing import statement in textual_inversion inference instructions
2023-07-24 11:07:53 -07:00
Apoorva Kulkarni
2e53936c97 docs: Typo in dreambooth example README.md (#4203)
fix: Typo in dreambooth example README.md
2023-07-21 15:16:38 -07:00
Kadir Nar
bcc570b910 📄 Renamed File for Better Understanding (#4056)
* 📄 Renamed File for Better Understanding

Renamed the 'rl' file to 'run_locomotion'. This change improves the clarity and readability of the codebase: the 'rl' name was ambiguous, and 'run_locomotion' describes the file's purpose more clearly.

Thanks 🙌

* 📁 [Docs] Renamed Directory for Better Clarity

Renamed the 'rl' directory to 'reinforcement_learning'. This change provides a clearer understanding of the directory's purpose and its contents.

* Update examples/reinforcement_learning/README.md

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* 📝 Update README

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-21 09:08:27 -07:00
Sayak Paul
4dcab9227a [SDXL ControlNet Training] Follow-up fixes (#4188)
* hash computation. thanks to @lhoestq

* disable dtype casting.

* remove comments.
2023-07-21 20:55:33 +05:30
Patrick von Platen
d620070bb3 [ControlNet Training] Remove safety from controlnet (#4180)
Remove safety from controlnet
2023-07-21 08:03:59 +05:30
Sayak Paul
3eb498e7b4 [Core] add: controlnet support for SDXL (#4038)
* add: controlnet sdxl.

* modifications to controlnet.

* run styling.

* add: __init__.pys

* incorporate https://github.com/huggingface/diffusers/pull/4019 changes.

* run make fix-copies.

* resize the conditioning images.

* remove autocast.

* run styling.

* disable autocast.

* debugging

* device placement.

* back to autocast.

* remove comment.

* save some memory by reusing the vae and unet in the pipeline.

* apply styling.

* Allow low precision sd xl

* finish

* finish

* changes to accommodate the improved VAE.

* modifications to how we handle vae encoding in the training.

* make style

* make existing controlnet fast tests pass.

* change vae checkpoint cli arg.

* fix: vae pretrained paths.

* fix: steps in get_scheduler().

* debugging.

* debugging.

* fix: weight conversion.

* add: docs.

* add: limited tests.

* add: datasets to the requirements.

* update docstrings and incorporate the usage of watermarking.

* incorporate fix from #4083

* fix watermarking dependency handling.

* run make-fix-copies.

* Empty-Commit

* Update requirements_sdxl.txt

* remove vae upcasting part.

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* run make style

* run make fix-copies.

* disable support for multicontrolnet.

* Apply suggestions from code review

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* run make fix-copies.

* style.

* fix-copies.

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-18 18:25:34 +05:30
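
A usage sketch of the SDXL ControlNet support added in #4038, assuming a Canny-conditioned ControlNet; the controlnet model id and conditioning image are placeholders:

```python
# Sketch: text-to-image with an SDXL ControlNet (#4038). Model ids and the edge map are placeholders.
import torch
from diffusers import StableDiffusionXLControlNetPipeline, ControlNetModel
from diffusers.utils import load_image

controlnet = ControlNetModel.from_pretrained(
    "your-org/controlnet-canny-sdxl", torch_dtype=torch.float16  # placeholder checkpoint
)
pipe = StableDiffusionXLControlNetPipeline.from_pretrained(
    "stabilityai/stable-diffusion-xl-base-1.0",
    controlnet=controlnet,
    torch_dtype=torch.float16,
).to("cuda")

canny_image = load_image("https://example.com/canny_edges.png")  # pre-computed edge map
image = pipe(
    "aerial view of a futuristic city",
    image=canny_image,
    controlnet_conditioning_scale=0.5,
    num_inference_steps=30,
).images[0]
image.save("controlnet_sdxl.png")
```
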
takuoko
6427aa995e [Enhance] Add rank in dreambooth (#4112)
add rank in dreambooth
2023-07-18 11:30:06 +05:30
Patrick von Platen
71c918b848 [Invisible watermark] Correct version (#4087) 2023-07-14 09:30:43 +05:30
Gabriel Birnbaum
f3802eb805 fix requirement in SDXL (#4082) 2023-07-14 02:58:20 +02:00
Thomas Chambon
2eceaaef0f [Community] Implementation of the IADB community pipeline (#3996)
* community pipeline: implementation of iadb

* iadb.py: reformat using black

* iadb.py: linting update
2023-07-13 16:49:41 +02:00
Ruoxi
ece55227ff Multiply lr scheduler steps by num_processes. (#3983)
* Multiply lr scheduler steps by `num_processes`.

* Stop multiplying steps by gradient accumulation.
2023-07-13 17:50:25 +05:30
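
The #3983 change amounts to scaling the scheduler's step counts by the number of processes, since each process steps the scheduler once per optimizer step; a sketch of the pattern, with example values standing in for the usual CLI arguments:

```python
# Sketch: with accelerate, every process advances the lr scheduler each optimizer step,
# so warmup/total step counts are multiplied by num_processes (#3983).
import torch
from accelerate import Accelerator
from diffusers.optimization import get_scheduler

accelerator = Accelerator()
model = torch.nn.Linear(4, 4)                      # stand-in for the UNet
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

lr_warmup_steps, max_train_steps = 500, 15000      # example --lr_warmup_steps / --max_train_steps
lr_scheduler = get_scheduler(
    "cosine",
    optimizer=optimizer,
    num_warmup_steps=lr_warmup_steps * accelerator.num_processes,
    num_training_steps=max_train_steps * accelerator.num_processes,
)
```
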
Patrick von Platen
e9eb0938f4 make style 2023-07-12 19:24:47 +02:00
junming huang
a29ea36d62 Update train_unconditional.py (#3899)
increase the timeout when using a big dataset or high resolution
2023-07-12 19:24:28 +02:00
Patrick von Platen
b9feed8795 move to 0.19.0dev (#4048) 2023-07-11 22:49:12 +02:00
Sayak Paul
3d74dc2abd [Examples] Add a training script for SDXL DreamBooth LoRA (#4016)
* add dreambooth lora script for SDXL incorporating latest changes.

* remove use_auth_token=True.

* add: documentation

* remove unneeded cli.

* increase the number of training steps in the readme.

* add LoraLoaderMixin to the subclassing mix.

* add sdxl lora dreambooth test.

* add: inference code sample.

* add: refiner output.

* add LoraLoaderMixin to the mix of classes of StableDiffusionXLImg2ImgPipeline.

* change default resolution of DreamBoothDataset.

* better sdxl report path.

* Apply suggestions from code review

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

---------

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2023-07-11 07:38:41 +05:30
Patrick von Platen
4a3e574807 make style 2023-07-09 16:02:59 +00:00
Will Berman
c2a28c346c Refactor LoRA (#3778)
* refactor to support patching LoRA into T5

instantiate the lora linear layer on the same device as the regular linear layer

get lora rank from state dict

tests

fmt

can create lora layer in float32 even when rest of model is float16

fix loading model hook

remove load_lora_weights_ and T5 dispatching

remove Unet#attn_processors_state_dict

docstrings

* text encoder monkeypatch class method

* fix test

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-09 18:02:46 +02:00
Batuhan Taskaya
04ddad484e Add 'rank' parameter to Dreambooth LoRA training script (#3945) 2023-07-07 17:26:10 +05:30
Patrick von Platen
187ea539ae Improve SD XL (#3968)
* improve sd xl

* correct more

* finish

* make style

* fix more
2023-07-06 18:11:20 +02:00
Prathik Rao
1997614aa9 avoid upcasting by assigning dtype to noise tensor (#3713)
* avoid upcasting by assigning dtype to noise tensor

* make style

* Update train_unconditional.py

* Update train_unconditional.py

* make style

* add unit test for pickle

* revert change

---------

Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
2023-07-04 07:19:49 +05:30
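
The #3713 fix reads as: create the noise tensor directly in the training dtype so adding it to half-precision inputs does not upcast them to float32; a minimal sketch under that reading:

```python
# Sketch: create noise in the same dtype/device as the inputs to avoid an implicit
# upcast to float32 when training in fp16/bf16 (#3713). Shapes are illustrative.
import torch

weight_dtype = torch.bfloat16
clean_images = torch.randn(4, 3, 64, 64, dtype=weight_dtype)  # stand-in batch

# Instead of torch.randn(clean_images.shape), which defaults to float32:
noise = torch.randn(clean_images.shape, dtype=weight_dtype, device=clean_images.device)
noisy_images = clean_images + noise          # stays in bf16, no upcast
assert noisy_images.dtype == weight_dtype
```
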
Andrés Mauricio Repetto Ferrero
572d8e2002 Adding better way to define multiple concepts and also validation capabilities. (#3807)
* - Added validation parameters
- Changed some parameter descriptions to better explain their use.
- Fixed a few typos.
- Added concept_list parameter for better management of multiple subjects
- Changed logic for image validation

* - Fixed bad logic for class data root directories

* Defaulting validation_steps to None for an easier logic

* Fixed multiple validation prompts

* Fixed bug on validation negative prompt

* Changed validation logic for tracker.

* Added uuid for validation image labeling

* Fix error when comparing validation prompts and validation negative prompts

* Improved the error message shown when there are more validation negative prompts than validation prompts

* - Changed image tracking number from epoch to global_step
- Added Typing for functions

* Added more validations for when the concept_list parameter is used together with the regular parameters.

* Fixed error message

* Added more validations for validation parameters

* Improved messaging for errors

* Fixed validation error for parameters with default values

* - Added train step to image name for validation
- reformatted code

* - Added train step to image's name for validation
- reformatted code

* Updated README.md file.

* reverted back original script of train_dreambooth.py

* reverted back original script of train_dreambooth.py

* left one blank line at the eof

* reverted back setup.py

* reverted back setup.py

* added the same logic for when prior-preservation parameters are used without enabling the flag while using the concept_list parameter.

* Ran black formatter.

* fixed a few strings

* fixed import sort with isort and removed fstrings without placeholder

* fixed import order with ruff (since with isort wasn't ok)

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-03 17:55:45 +02:00
takuoko
cdf2ae8a84 [Enhance] Add LoRA rank args in train_text_to_image_lora (#3866)
* add rank args in lora finetune

* del network_alpha
2023-06-29 17:09:59 +05:30
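
The new rank argument ends up in the LoRA attention processors attached to the UNet; a sketch of that wiring in the diffusers 0.17-era API, with the rank value standing in for what --rank would supply and the SD 1.5 checkpoint used for illustration:

```python
# Sketch: wiring a configurable LoRA rank into the UNet's attention processors,
# as the --rank argument added in #3866 does.
import torch
from diffusers import UNet2DConditionModel
from diffusers.models.attention_processor import LoRAAttnProcessor

unet = UNet2DConditionModel.from_pretrained(
    "runwayml/stable-diffusion-v1-5", subfolder="unet", torch_dtype=torch.float32
)

rank = 8  # value that --rank would supply
lora_attn_procs = {}
for name in unet.attn_processors.keys():
    # Self-attention layers (attn1) have no cross-attention dimension.
    cross_attention_dim = None if name.endswith("attn1.processor") else unet.config.cross_attention_dim
    if name.startswith("mid_block"):
        hidden_size = unet.config.block_out_channels[-1]
    elif name.startswith("up_blocks"):
        block_id = int(name[len("up_blocks.")])
        hidden_size = list(reversed(unet.config.block_out_channels))[block_id]
    else:  # down_blocks
        block_id = int(name[len("down_blocks.")])
        hidden_size = unet.config.block_out_channels[block_id]
    lora_attn_procs[name] = LoRAAttnProcessor(
        hidden_size=hidden_size, cross_attention_dim=cross_attention_dim, rank=rank
    )
unet.set_attn_processor(lora_attn_procs)
```
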
Sayak Paul
4870626728 [Examples] Improve the model card pushed from the train_text_to_image.py script (#3810)
* refactor: readme serialized from the example when push_to_hub is True.

* fix: batch size arg.

* a bit better formatting

* minor fixes.

* add note on env.

* Apply suggestions from code review

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* condition wandb info better

* make mixed_precision assignment in cli args explicit.

* separate inference block for sample images.

* Apply suggestions from code review

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>

* address more comments.

* autocast mode.

* correct none image type problem.

* fix: list assignment.

* minor fix.

---------

Co-authored-by: Pedro Cuenca <pedro@huggingface.co>
2023-06-20 08:59:41 +05:30
Will Berman
3ddc2b7395 [train text to image] add note to loading from checkpoint (#3806)
add note to loading from checkpoint
2023-06-16 11:54:49 +05:30
Will Berman
d49e2dd54c manual check for checkpoints_total_limit instead of using accelerate (#3681)
* manual check for checkpoints_total_limit instead of using accelerate

* remove controlnet_conditioning_embedding_out_channels
2023-06-15 15:38:54 -07:00
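
The checkpoint-limit change replaces accelerate's built-in limit with an explicit prune of the oldest checkpoint-<step> directories before saving a new one; a sketch of that pattern:

```python
# Sketch: manually enforcing --checkpoints_total_limit by deleting the oldest
# checkpoint-<step> directories before writing a new checkpoint (#3681).
import os
import shutil

def prune_checkpoints(output_dir: str, checkpoints_total_limit: int) -> None:
    checkpoints = [d for d in os.listdir(output_dir) if d.startswith("checkpoint")]
    checkpoints = sorted(checkpoints, key=lambda x: int(x.split("-")[1]))
    # Leave room for the checkpoint that is about to be saved.
    if len(checkpoints) >= checkpoints_total_limit:
        num_to_remove = len(checkpoints) - checkpoints_total_limit + 1
        for removing_checkpoint in checkpoints[:num_to_remove]:
            shutil.rmtree(os.path.join(output_dir, removing_checkpoint))
```
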
Naga Sai Abhinay
231bdf2e56 UnCLIP Image Interpolation -> Keep same initial noise across interpolation steps (#3782)
* Maintain same decoder start noise for all interp steps

* Correct comment

* use batch_size for consistency
2023-06-15 15:15:40 +02:00
Patrick von Platen
908e5e9cc6 Fix some bad comment in training scripts (#3798)
* relax tolerance slightly

* correct incorrect naming
2023-06-15 15:07:51 +02:00
takuoko
1ae15fa64c [Enhance] Update reference (#3723)
* update reference pipeline

* update reference pipeline

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-06-15 14:34:12 +02:00
Patrick von Platen
c42f6ee43e Post 0.17.0 release (#3721)
* Post release

* Post release
2023-06-08 18:08:49 +02:00
Zachary Mueller
79fa94ea8b Apply deprecations from Accelerate (#3714)
Apply deprecations
2023-06-08 16:44:22 +02:00
Kadir Nar
cd6186907c [Community] Support StableDiffusionCanvasPipeline (#3590)
* added StableDiffusionCanvasPipeline pipeline

* Added utils codes to pipe_utils file.

* make style

* delete mixture.py and Text2ImageRegion class

* make style

* Added the codes to the readme.md file.

* Moved functions from pipeline_utils to mix_canvas
2023-06-07 17:43:33 +01:00
Alex McKinney
cd9d0913d9 Fixes eval generator init in train_text_to_image_lora.py (#3678) 2023-06-07 15:37:13 +05:30
Max-We
12a232efa9 Fix schedulers zero SNR and rescale classifier free guidance (#3664)
* Implement option for rescaling betas to zero terminal SNR

* Implement rescale classifier free guidance in pipeline_stable_diffusion.py

* focus on DDIM

* make style

* make style

* make style

* make style

* Apply suggestions from Peter Lin

* Apply suggestions from Peter Lin

* make style

* Apply suggestions from code review

* Apply suggestions from code review

* make style

* make style

---------

Co-authored-by: MaxWe00 <gitlab.9v1lq@slmail.me>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-06-07 10:57:10 +01:00
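
The guidance-rescale part of #3664 follows the formula from "Common Diffusion Noise Schedules and Sample Steps are Flawed": the guided prediction is rescaled toward the standard deviation of the text-conditioned prediction. A standalone sketch of that computation, with illustrative tensors:

```python
# Sketch of the classifier-free-guidance rescaling added in #3664.
import torch

def rescale_noise_cfg(noise_cfg, noise_pred_text, guidance_rescale=0.7):
    std_text = noise_pred_text.std(dim=list(range(1, noise_pred_text.ndim)), keepdim=True)
    std_cfg = noise_cfg.std(dim=list(range(1, noise_cfg.ndim)), keepdim=True)
    # Rescale the guided prediction to the text-conditioned std (fixes over-exposure),
    # then blend with the original to avoid overly flat images.
    noise_pred_rescaled = noise_cfg * (std_text / std_cfg)
    return guidance_rescale * noise_pred_rescaled + (1 - guidance_rescale) * noise_cfg

# Usage inside a denoising loop (tensors are illustrative stand-ins):
noise_pred_uncond = torch.randn(2, 4, 64, 64)
noise_pred_text = torch.randn(2, 4, 64, 64)
noise_cfg = noise_pred_uncond + 7.5 * (noise_pred_text - noise_pred_uncond)
noise_cfg = rescale_noise_cfg(noise_cfg, noise_pred_text, guidance_rescale=0.7)
```
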
Sayak Paul
8669e8313d [LoRA] feat: add lora attention processor for pt 2.0. (#3594)
* feat: add lora attention processor for pt 2.0.

* explicit context manager for SDPA.

* switch to flash attention

* make shapes compatible to work optimally with SDPA.

* fix: circular import problem.

* explicitly specify the flash attention kernel in sdpa

* fall back to efficient attention context manager.

* remove explicit dispatch.

* fix: removed processor.

* fix: remove optional from type annotation.

* feat: make changes regarding LoRAAttnProcessor2_0.

* remove confusing warning.

* formatting.

* relax tolerance for PT 2.0

* fix: loading message.

* remove unnecessary logging.

* add: entry to the docs.

* add: network_alpha argument.

* relax tolerance.
2023-06-06 14:56:05 +05:30
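
The processor added in #3594 routes LoRA attention through PyTorch 2.0's scaled_dot_product_attention; a sketch of selecting it when available, with example layer dimensions:

```python
# Sketch: picking the PyTorch 2.0 LoRA attention processor (#3594) when
# scaled_dot_product_attention is available, falling back otherwise.
import torch.nn.functional as F
from diffusers.models.attention_processor import LoRAAttnProcessor, LoRAAttnProcessor2_0

lora_attn_processor_class = (
    LoRAAttnProcessor2_0 if hasattr(F, "scaled_dot_product_attention") else LoRAAttnProcessor
)
# Constructed like LoRAAttnProcessor, e.g. for one cross-attention block (example dims):
proc = lora_attn_processor_class(hidden_size=320, cross_attention_dim=768, rank=4)
```
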
Patrick von Platen
262d539a8a Correct multi gpu dreambooth (#3673)
Correct multi gpu
2023-06-05 11:03:11 +01:00
Will Berman
0fc2fb71c1 dreambooth upscaling fix added latents (#3659) 2023-06-05 10:32:16 +01:00
0x1355
de45af4a46 Allow setting num_cycles for cosine_with_restarts lr scheduler (#3606)
Expose num_cycles kwarg of get_scheduler() through args.lr_num_cycles.
2023-06-05 10:18:29 +05:30
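
The new --lr_num_cycles flag is forwarded to get_scheduler's num_cycles keyword; a sketch with example values:

```python
# Sketch: passing num_cycles through to the cosine_with_restarts schedule (#3606).
import torch
from diffusers.optimization import get_scheduler

model = torch.nn.Linear(4, 4)                      # stand-in for the trained model
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

lr_scheduler = get_scheduler(
    "cosine_with_restarts",
    optimizer=optimizer,
    num_warmup_steps=500,
    num_training_steps=10000,
    num_cycles=3,  # value that --lr_num_cycles would supply
)
```
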
Will Berman
7a39691362 linting fix (#3653) 2023-06-02 13:33:19 -07:00
Will Berman
5911a3aa47 dreambooth if docs - stage II, more info (#3628)
* dreambooth if docs - stage II, more info

* Update docs/source/en/training/dreambooth.mdx

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update docs/source/en/training/dreambooth.mdx

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>

* Update docs/source/en/training/dreambooth.mdx

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

* download instructions for downsized images

* update source README to match docs

---------

Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2023-06-02 10:37:13 -07:00
asfiyab-nvidia
d3717e6368 add Stable Diffusion TensorRT Inpainting pipeline (#3642)
* add tensorrt inpaint pipeline

Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>

* run make style

Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>

---------

Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-06-02 18:14:31 +01:00
Kadir Nar
0dbdc0cbae [Community Doc] Updated the filename and readme file. (#3634)
* Updated the filename and readme file.

* reformatter

* reformatter
2023-06-02 17:53:09 +01:00
Kashif Rasul
f1d4743394 fixed typo in example train_text_to_image.py (#3608)
fixed typo
2023-06-02 20:54:54 +05:30
Takuma Mori
8e552bb4fe Support Kohya-ss style LoRA file format (in a limited capacity) (#3437)
* add _convert_kohya_lora_to_diffusers

* make style

* add scaffold

* match result: unet attention only

* fix monkey-patch for text_encoder

* with CLIPAttention

While the terrible images are no longer produced,
the results do not match those from the hook version.
This may be due to not setting the network_alpha value.

* add to support network_alpha

* generate diff image

* fix monkey-patch for text_encoder

* add test_text_encoder_lora_monkey_patch()

* verify that it's okay to release the attn_procs

* fix closure version

* add comment

* Revert "fix monkey-patch for text_encoder"

This reverts commit bb9c61e6fa.

* Fix to reuse utility functions

* make LoRAAttnProcessor targets to self_attn

* fix LoRAAttnProcessor target

* make style

* fix split key

* Update src/diffusers/loaders.py

* remove TEXT_ENCODER_TARGET_MODULES loop

* add print memory usage

* remove test_kohya_loras_scaffold.py

* add: doc on LoRA civitai

* remove print statement and refactor in the doc.

* fix state_dict test for kohya-ss style lora

* Apply suggestions from code review

Co-authored-by: Takuma Mori <takuma104@gmail.com>

---------

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2023-06-02 17:40:24 +05:30
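
With #3437, load_lora_weights can ingest a Kohya-ss style .safetensors file directly (within the limits the title notes); a sketch, with the base model and LoRA file path as placeholders:

```python
# Sketch: loading a Kohya-ss style LoRA file via the converter added in #3437.
# The base model id and the LoRA file name are placeholders.
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
pipe.load_lora_weights(".", weight_name="kohya_style_lora.safetensors")  # placeholder path

image = pipe("masterpiece, best quality, mountain landscape", num_inference_steps=25).images[0]
image.save("kohya_lora.png")
```
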