diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Patrick von Platen	71c918b848	[Invisible watermark] Correct version (#4087 )	2023-07-14 09:30:43 +05:30
Gabriel Birnbaum	f3802eb805	fix requirement in SDXL (#4082 )	2023-07-14 02:58:20 +02:00
Thomas Chambon	2eceaaef0f	[Community] Implementation of the IADB community pipeline (#3996 ) * community pipeline: implementation of iadb * iadb.py: reformat using black * iadb.py: linting update	2023-07-13 16:49:41 +02:00
Ruoxi	ece55227ff	Multiply lr scheduler steps by `num_processes`. (#3983 ) * Multiply lr scheduler steps by `num_processes`. * Stop multiplying steps by gradient accumulation.	2023-07-13 17:50:25 +05:30
Patrick von Platen	e9eb0938f4	make style	2023-07-12 19:24:47 +02:00
junming huang	a29ea36d62	Update train_unconditional.py (#3899 ) increase the time of timeout when using big dataset or high resolution	2023-07-12 19:24:28 +02:00
Patrick von Platen	b9feed8795	move to 0.19.0dev (#4048 )	2023-07-11 22:49:12 +02:00
Sayak Paul	3d74dc2abd	[Examples] Add a training script for SDXL DreamBooth LoRA (#4016 ) * add dreambooth lora script for SDXL incorporating latest changes. * remove use_auth_token=True. * add: documentation * remove unneeded cli. * increase the number of training steps in the readme. * add LoraLoaderMixin to the subclassing mix. * add sdxl lora dreambooth test. * add: inference code sample. * add: refiner output. * add LoraLoaderMixin to the mix of classes of StableDiffusionXLImg2ImgPipeline. * change default resolution of DreamBoothDataset. * better sdxl report path. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-07-11 07:38:41 +05:30
Patrick von Platen	4a3e574807	make style	2023-07-09 16:02:59 +00:00
Will Berman	c2a28c346c	Refactor LoRA (#3778 ) * refactor to support patching LoRA into T5 instantiate the lora linear layer on the same device as the regular linear layer get lora rank from state dict tests fmt can create lora layer in float32 even when rest of model is float16 fix loading model hook remove load_lora_weights_ and T5 dispatching remove Unet#attn_processors_state_dict docstrings * text encoder monkeypatch class method * fix test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-07-09 18:02:46 +02:00
Batuhan Taskaya	04ddad484e	Add 'rank' parameter to Dreambooth LoRA training script (#3945 )	2023-07-07 17:26:10 +05:30
Patrick von Platen	187ea539ae	Improve SD XL (#3968 ) * improve sd xl * correct more * finish * make style * fix more	2023-07-06 18:11:20 +02:00
Prathik Rao	1997614aa9	avoid upcasting by assigning dtype to noise tensor (#3713 ) * avoid upcasting by assigning dtype to noise tensor * make style * Update train_unconditional.py * Update train_unconditional.py * make style * add unit test for pickle * revert change --------- Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-07-04 07:19:49 +05:30
Andrés Mauricio Repetto Ferrero	572d8e2002	Adding better way to define multiple concepts and also validation capabilities. (#3807 ) * - Added validation parameters - Changed some parameter descriptions to better explain their use. - Fixed a few typos. - Added concept_list parameter for better management of multiple subjects - changed logic for image validation * - Fixed bad logic for class data root directories * Defaulting validation_steps to None for an easier logic * Fixed multiple validation prompts * Fixed bug on validation negative prompt * Changed validation logic for tracker. * Added uuid for validation image labeling * Fix error when comparing validation prompts and validation negative prompts * Improved error message when negative prompts for validation are more than the number of prompts * - Changed image tracking number from epoch to global_step - Added Typing for functions * Added some validations more when using concept_list parameter and the regular ones. * Fixed error message * Added more validations for validation parameters * Improved messaging for errors * Fixed validation error for parameters with default values * - Added train step to image name for validation - reformatted code * - Added train step to image's name for validation - reformatted code * Updated README.md file. * reverted back original script of train_dreambooth.py * reverted back original script of train_dreambooth.py * left one blank line at the eof * reverted back setup.py * reverted back setup.py * added same logic for when parameters for prior preservation are used without enabling the flag while using concept_list parameter. * Ran black formatter. * fixed a few strings * fixed import sort with isort and removed fstrings without placeholder * fixed import order with ruff (since with isort wasn't ok) --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-07-03 17:55:45 +02:00
takuoko	cdf2ae8a84	[Enhance] Add LoRA rank args in train_text_to_image_lora (#3866 ) * add rank args in lora finetune * del network_alpha	2023-06-29 17:09:59 +05:30
Sayak Paul	4870626728	[Examples] Improve the model card pushed from the `train_text_to_image.py` script (#3810 ) * refactor: readme serialized from the example when push_to_hub is True. * fix: batch size arg. * a bit better formatting * minor fixes. * add note on env. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * condition wandb info better * make mixed_precision assignment in cli args explicit. * separate inference block for sample images. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * address more comments. * autocast mode. * correct none image type problem. * ifx: list assignment. * minor fix. --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-06-20 08:59:41 +05:30
Will Berman	3ddc2b7395	[train text to image] add note to loading from checkpoint (#3806 ) add note to loading from checkpoint	2023-06-16 11:54:49 +05:30
Will Berman	d49e2dd54c	manual check for checkpoints_total_limit instead of using accelerate (#3681 ) * manual check for checkpoints_total_limit instead of using accelerate * remove controlnet_conditioning_embedding_out_channels	2023-06-15 15:38:54 -07:00
Naga Sai Abhinay	231bdf2e56	UnCLIP Image Interpolation -> Keep same initial noise across interpolation steps (#3782 ) * Maintain same decoder start noise for all interp steps * Correct comment * use batch_size for consistency	2023-06-15 15:15:40 +02:00
Patrick von Platen	908e5e9cc6	Fix some bad comment in training scripts (#3798 ) * relax tolerance slightly * correct incorrect naming	2023-06-15 15:07:51 +02:00
takuoko	1ae15fa64c	[Enhance] Update reference (#3723 ) * update reference pipeline * update reference pipeline --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-15 14:34:12 +02:00
Patrick von Platen	c42f6ee43e	Post 0.17.0 release (#3721 ) * Post release * Post release	2023-06-08 18:08:49 +02:00
Zachary Mueller	79fa94ea8b	Apply deprecations from Accelerate (#3714 ) Apply deprecations	2023-06-08 16:44:22 +02:00
Kadir Nar	cd6186907c	[Community] Support StableDiffusionCanvasPipeline (#3590 ) * added StableDiffusionCanvasPipeline pipeline * Added utils codes to pipe_utils file. * make style * delete mixture.py and Text2ImageRegion class * make style * Added the codes to the readme.md file. * Moved functions from pipeline_utils to mix_canvas	2023-06-07 17:43:33 +01:00
Alex McKinney	cd9d0913d9	Fixes eval generator init in `train_text_to_image_lora.py` (#3678 )	2023-06-07 15:37:13 +05:30
Max-We	12a232efa9	Fix schedulers zero SNR and rescale classifier free guidance (#3664 ) * Implement option for rescaling betas to zero terminal SNR * Implement rescale classifier free guidance in pipeline_stable_diffusion.py * focus on DDIM * make style * make style * make style * make style * Apply suggestions from Peter Lin * Apply suggestions from Peter Lin * make style * Apply suggestions from code review * Apply suggestions from code review * make style * make style --------- Co-authored-by: MaxWe00 <gitlab.9v1lq@slmail.me> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-07 10:57:10 +01:00
Sayak Paul	8669e8313d	[LoRA] feat: add lora attention processor for pt 2.0. (#3594 ) * feat: add lora attention processor for pt 2.0. * explicit context manager for SDPA. * switch to flash attention * make shapes compatible to work optimally with SDPA. * fix: circular import problem. * explicitly specify the flash attention kernel in sdpa * fall back to efficient attention context manager. * remove explicit dispatch. * fix: removed processor. * fix: remove optional from type annotation. * feat: make changes regarding LoRAAttnProcessor2_0. * remove confusing warning. * formatting. * relax tolerance for PT 2.0 * fix: loading message. * remove unnecessary logging. * add: entry to the docs. * add: network_alpha argument. * relax tolerance.	2023-06-06 14:56:05 +05:30
Patrick von Platen	262d539a8a	Correct multi gpu dreambooth (#3673 ) Correct multi gpu	2023-06-05 11:03:11 +01:00
Will Berman	0fc2fb71c1	dreambooth upscaling fix added latents (#3659 )	2023-06-05 10:32:16 +01:00
0x1355	de45af4a46	Allow setting num_cycles for cosine_with_restarts lr scheduler (#3606 ) Expose num_cycles kwarg of get_schedule() through args.lr_num_cycles.	2023-06-05 10:18:29 +05:30
Will Berman	7a39691362	linting fix (#3653 )	2023-06-02 13:33:19 -07:00
Will Berman	5911a3aa47	dreambooth if docs - stage II, more info (#3628 ) * dreambooth if docs - stage II, more info * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * download instructions for downsized images * update source README to match docs --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 10:37:13 -07:00
asfiyab-nvidia	d3717e6368	add Stable Diffusion TensorRT Inpainting pipeline (#3642 ) * add tensorrt inpaint pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * run make style Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-02 18:14:31 +01:00
Kadir Nar	0dbdc0cbae	[Community Doc] Updated the filename and readme file. (#3634 ) * Updated the filename and readme file. * reformatter * reformetter	2023-06-02 17:53:09 +01:00
Kashif Rasul	f1d4743394	fixed typo in example train_text_to_image.py (#3608 ) fixed typo	2023-06-02 20:54:54 +05:30
Takuma Mori	8e552bb4fe	Support Kohya-ss style LoRA file format (in a limited capacity) (#3437 ) * add _convert_kohya_lora_to_diffusers * make style * add scaffold * match result: unet attention only * fix monkey-patch for text_encoder * with CLIPAttention While the terrible images are no longer produced, the results do not match those from the hook ver. This may be due to not setting the network_alpha value. * add to support network_alpha * generate diff image * fix monkey-patch for text_encoder * add test_text_encoder_lora_monkey_patch() * verify that it's okay to release the attn_procs * fix closure version * add comment * Revert "fix monkey-patch for text_encoder" This reverts commit `bb9c61e6fa`. * Fix to reuse utility functions * make LoRAAttnProcessor targets to self_attn * fix LoRAAttnProcessor target * make style * fix split key * Update src/diffusers/loaders.py * remove TEXT_ENCODER_TARGET_MODULES loop * add print memory usage * remove test_kohya_loras_scaffold.py * add: doc on LoRA civitai * remove print statement and refactor in the doc. * fix state_dict test for kohya-ss style lora * Apply suggestions from code review Co-authored-by: Takuma Mori <takuma104@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 17:40:24 +05:30
Sayak Paul	55dbfa0229	[Docs] include the instruction-tuning blog link in the InstructPix2Pix docs (#3644 ) include the instruction-tuning blog link.	2023-06-02 08:04:35 +05:30
Will Berman	4f14b36329	Full Dreambooth IF stage II upscaling (#3561 ) * update dreambooth lora to work with IF stage II * Update dreambooth script for IF stage II upscaler	2023-05-31 09:39:31 -07:00
Will Berman	f751b8844e	update dreambooth lora to work with IF stage II (#3560 )	2023-05-31 09:39:03 -07:00
Prathik Rao	abb89da4de	update code to reflect latest changes as of May 30th (#3616 ) * update code to reflect latest changes as of May 30th * update text to image example * reflect changes to textual inversion * make style * fix typo * Revert unnecessary readme changes --------- Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-05-31 11:29:04 +02:00
Patrick von Platen	0cc3a7a123	Make sure we also change the config when setting `encoder_hid_dim_type=="text_proj"` and allow xformers (#3615 ) * fix if * make style * make style * add tests for xformers * make style * update	2023-05-30 20:47:14 +01:00
Patrick von Platen	9d3ff0794d	fix tests (#3614 )	2023-05-30 18:59:07 +01:00
Patrick von Platen	160c377ddc	Make style	2023-05-30 13:14:09 +01:00
Denis	bb22d546c0	[Community] CLIP Guided Images Mixing with Stable DIffusion Pipeline (#3587 ) * added clip_guided_images_mixing_stable_diffusion file and readme description * apply pre-commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-30 13:13:45 +01:00
takuoko	07ef4855cd	[Community, Enhancement] Add reference tricks in README (#3589 ) add reference tricks	2023-05-30 12:38:16 +01:00
Kadir Nar	6cbddf558a	[Community] Support StableDiffusionTilingPipeline (#3586 ) * added mixture pipeline * added docstring * update docstring	2023-05-30 12:24:15 +01:00
Leon Lin	1d1f648c6b	fix dreambooth attention mask (#3541 )	2023-05-26 10:58:50 -07:00
Sayak Paul	8e69708b0d	[Examples/DreamBooth] refactor save_model_card utility in dreambooth examples (#3543 ) refactor save_model_card utility in dreambooth examples.	2023-05-24 16:16:28 +05:30
takuoko	b134f6a8b6	[Community] ControlNet Reference (#3508 ) add controlnet reference and bugfix Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-23 13:20:34 +01:00
yingjieh	edc6505193	[Community Pipelines]Accelerate inference of stable diffusion by IPEX on CPU (#3105 ) * add stable_diffusion_ipex community pipeline * Update readme.md * reformat * reformat * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update README.md * Update README.md * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-23 10:55:14 +02:00

1 2 3 4 5 ...

496 Commits