* update code to reflect latest changes as of May 30th
* update text-to-image example
* reflect changes to textual inversion
* make style
* fix typo
* Revert unnecessary readme changes
---------
Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>
* DataLoader will now bake in any rotation or other transform stored in an image's EXIF metadata
Images may carry rotations in their EXIF data. Previously those transforms were ignored during training, so the model saw incorrectly oriented images and produced unexpected results
* Fixed the data-loading EXIF issue in the main DreamBooth training script as well
* Run make style (black & isort)
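The fix amounts to applying the EXIF orientation at load time, before any other transform. A minimal sketch using Pillow's `ImageOps.exif_transpose` (the `load_training_image` helper is illustrative, not the script's actual code):

```python
# Minimal sketch of the fix, not the exact training-script diff: honor the EXIF
# Orientation tag before any other transform so augmentations see the image as intended.
from PIL import Image, ImageOps

def load_training_image(path):  # helper name is illustrative
    image = Image.open(path)
    image = ImageOps.exif_transpose(image)  # bake in the stored rotation/flip
    return image.convert("RGB")
```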
* update IF stage I pipelines
add fixed-variance schedulers and LoRA loading
* added LoRA attention processor for the added-KV projections
* allow loading into an alternative LoRA attention processor
* make the VAE optional
* throw away the predicted variance
* allow loading into the added-KV LoRA layer
* allow loading T5
* allow pre-computing text embeddings
* set the new variance type in the schedulers
* fix copies
* refactor all prompt embedding code
class prompts are now included in the pre-encoding code
the maximum tokenizer length is now configurable
the embedding attention mask is now configurable
(a sketch of the pre-computation flow follows this list)
* fix for when variance type is not defined on scheduler
* do not pre-compute the validation prompt embeddings if no validation prompt is given
* add example test for IF LoRA DreamBooth
* add check that --train_text_encoder is not combined with pre-computed text embeddings
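The pre-computation flow referenced above, as a rough sketch: encode the prompts once with the T5 text encoder and keep only the embeddings, so the encoder does not have to stay in memory during training. The checkpoint id, helper name, and default max length are illustrative assumptions, not the script's exact code.

```python
import torch
from transformers import T5EncoderModel, T5Tokenizer

model_id = "DeepFloyd/IF-I-XL-v1.0"  # example IF stage I checkpoint
tokenizer = T5Tokenizer.from_pretrained(model_id, subfolder="tokenizer")
text_encoder = T5EncoderModel.from_pretrained(model_id, subfolder="text_encoder")

def encode(prompt, max_length=77):  # max tokenizer length is configurable in the script
    inputs = tokenizer(
        prompt,
        padding="max_length",
        max_length=max_length,
        truncation=True,
        return_tensors="pt",
    )
    with torch.no_grad():
        # whether to pass the attention mask is likewise configurable
        return text_encoder(inputs.input_ids, attention_mask=inputs.attention_mask)[0]

instance_embeds = encode("a photo of sks dog")
class_embeds = encode("a photo of a dog")  # class prompts are pre-encoded too
del text_encoder  # the T5 encoder can now be freed before training starts
```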
* Set --only_save_embeds to False by default
Given how the option is named, defaulting to False is the more intuitive behavior.
* Refactor only_save_embeds to save_as_full_pipeline
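For context, the renamed flag behaves roughly like this (help text paraphrased, not the script's verbatim wording):

```python
import argparse

parser = argparse.ArgumentParser()
parser.add_argument(
    "--save_as_full_pipeline",
    action="store_true",  # defaults to False: only the learned embeddings are saved
    help="Save the full pipeline instead of just the learned embeddings.",
)
args = parser.parse_args([])
print(args.save_as_full_pipeline)  # False unless the flag is passed explicitly
```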
* EDICT pipeline initial commit
- Starting point taken from https://github.com/Joqsan/edict-diffusion
* refactor __init__() method
* minor refactoring
* refactor scheduler code
- remove scheduler and move its methods to the EDICTPipeline class
* make CFG optional
- refactor encode_prompt().
- include an optional generator for sampling with the VAE.
- minor variable renaming
* add EDICT pipeline description to README.md
* replace preprocess() with VaeImageProcessor
* run make style and make quality commands
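The preprocessing swap in a nutshell; a sketch assuming a standard Stable Diffusion VAE scale factor of 8, not the EDICT pipeline's exact code:

```python
from diffusers.image_processor import VaeImageProcessor
from PIL import Image

image_processor = VaeImageProcessor(vae_scale_factor=8)

image = Image.open("input.png").convert("RGB")
# preprocess() replaces the hand-rolled helper: it converts the PIL image to a
# torch tensor scaled to [-1, 1] with dimensions compatible with the VAE.
pixel_values = image_processor.preprocess(image)
```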
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* 👽 QoL improvements for LoRA.
* better function name?
* fix: LoRA weight loading with the new format.
* address Patrick's comments.
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* reword guidance to encourage the use of load_lora_weights().
* fix: function name.
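Typical usage of the recommended entry point (the checkpoint path below is a placeholder):

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")

# load_lora_weights() loads LoRA layers into the UNet and, when present, the text
# encoder, which is why the docs now point users to it rather than unet.load_attn_procs().
pipe.load_lora_weights("path/to/lora/checkpoint")

image = pipe("a photo of sks dog in a bucket", num_inference_steps=25).images[0]
```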
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* add: LoRA text encoder support for DreamBooth example.
* fix initialization.
* fix: modification call.
* add: entry in the readme.
* use the dog dataset from the Hub.
* fix: params passed to gradient clipping (see the sketch after this list).
* add entry to the LoRA doc.
* add: tests for lora.
* remove unnecessary list comprehension.
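A conceptual sketch of what training the text encoder LoRA adds, referenced in the list above; the parameter lists are dummy stand-ins, not the example script's real LoRA layers:

```python
import itertools
import torch

unet_lora_params = [torch.nn.Parameter(torch.zeros(4, 4))]          # stand-in for UNet LoRA layers
text_encoder_lora_params = [torch.nn.Parameter(torch.zeros(4, 4))]  # stand-in for text encoder LoRA layers

# With --train_text_encoder, both sets of LoRA parameters go to the optimizer...
optimizer = torch.optim.AdamW(
    itertools.chain(unet_lora_params, text_encoder_lora_params), lr=1e-4
)

# ...and the same combined set must be passed to gradient clipping, which is what the
# "fix: params passed to gradient clipping" entry above addresses.
torch.nn.utils.clip_grad_norm_(
    itertools.chain(unet_lora_params, text_encoder_lora_params), max_norm=1.0
)
```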
* Added distillation for quantization example on textual inversion.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* refined README and code style.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* Update text2images.py
* refined the model loading code and added a compatibility check.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* fixed code style.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* fix C403 [*] Unnecessary `list` comprehension (rewrite as a `set` comprehension)
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
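For readers unfamiliar with the rule, the C403 rewrite looks like this (variable names are made up):

```python
placeholder_tokens = ["<cat-toy>", " <cat-toy-1> "]

# Before: an unnecessary list comprehension is materialized inside set()
token_set = set([token.strip() for token in placeholder_tokens])

# After: a direct set comprehension, lint-clean and slightly cheaper
token_set = {token.strip() for token in placeholder_tokens}
```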
---------
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
ControlNet training: center crop input images to a multiple of 8
The pipeline code resizes inputs to multiples of 8. Skipping that resizing in the
training script causes the encoded image to have different height/width dimensions
than the encoded conditioning image (which goes through a separate encoder that is
part of the ControlNet model).
We now resize and center crop the inputs so each image and its conditioning image
are the same size (and match every other image in the batch). We also check that
the requested resolution is a multiple of 8.
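A sketch consistent with the description above, assuming torchvision transforms and the script's `--resolution` argument (the values shown are illustrative):

```python
from torchvision import transforms

resolution = 512  # must be a multiple of 8 so the encoded latents line up
if resolution % 8 != 0:
    raise ValueError("`resolution` must be divisible by 8.")

# The image and its conditioning image get the same resize + center crop, so both
# encoders see inputs of identical height/width.
image_transforms = transforms.Compose([
    transforms.Resize(resolution, interpolation=transforms.InterpolationMode.BILINEAR),
    transforms.CenterCrop(resolution),
    transforms.ToTensor(),
    transforms.Normalize([0.5], [0.5]),
])

conditioning_image_transforms = transforms.Compose([
    transforms.Resize(resolution, interpolation=transforms.InterpolationMode.BILINEAR),
    transforms.CenterCrop(resolution),
    transforms.ToTensor(),
])
```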