diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Sayak Paul	8669e8313d	[LoRA] feat: add lora attention processor for pt 2.0. (#3594) * feat: add lora attention processor for pt 2.0. * explicit context manager for SDPA. * switch to flash attention * make shapes compatible to work optimally with SDPA. * fix: circular import problem. * explicitly specify the flash attention kernel in sdpa * fall back to efficient attention context manager. * remove explicit dispatch. * fix: removed processor. * fix: remove optional from type annotation. * feat: make changes regarding LoRAAttnProcessor2_0. * remove confusing warning. * formatting. * relax tolerance for PT 2.0 * fix: loading message. * remove unnecessary logging. * add: entry to the docs. * add: network_alpha argument. * relax tolerance.	2023-06-06 14:56:05 +05:30
Patrick von Platen	262d539a8a	Correct multi gpu dreambooth (#3673) Correct multi gpu	2023-06-05 11:03:11 +01:00
Will Berman	0fc2fb71c1	dreambooth upscaling fix added latents (#3659)	2023-06-05 10:32:16 +01:00
0x1355	de45af4a46	Allow setting num_cycles for cosine_with_restarts lr scheduler (#3606) Expose num_cycles kwarg of get_schedule() through args.lr_num_cycles.	2023-06-05 10:18:29 +05:30
Will Berman	7a39691362	linting fix (#3653)	2023-06-02 13:33:19 -07:00
Will Berman	5911a3aa47	dreambooth if docs - stage II, more info (#3628) * dreambooth if docs - stage II, more info * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * download instructions for downsized images * update source README to match docs --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 10:37:13 -07:00
asfiyab-nvidia	d3717e6368	add Stable Diffusion TensorRT Inpainting pipeline (#3642) * add tensorrt inpaint pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * run make style Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-02 18:14:31 +01:00
Kadir Nar	0dbdc0cbae	[Community Doc] Updated the filename and readme file. (#3634) * Updated the filename and readme file. * reformatter * reformetter	2023-06-02 17:53:09 +01:00
Kashif Rasul	f1d4743394	fixed typo in example train_text_to_image.py (#3608) fixed typo	2023-06-02 20:54:54 +05:30
Takuma Mori	8e552bb4fe	Support Kohya-ss style LoRA file format (in a limited capacity) (#3437) * add _convert_kohya_lora_to_diffusers * make style * add scaffold * match result: unet attention only * fix monkey-patch for text_encoder * with CLIPAttention While the terrible images are no longer produced, the results do not match those from the hook ver. This may be due to not setting the network_alpha value. * add to support network_alpha * generate diff image * fix monkey-patch for text_encoder * add test_text_encoder_lora_monkey_patch() * verify that it's okay to release the attn_procs * fix closure version * add comment * Revert "fix monkey-patch for text_encoder" This reverts commit `bb9c61e6fa`. * Fix to reuse utility functions * make LoRAAttnProcessor targets to self_attn * fix LoRAAttnProcessor target * make style * fix split key * Update src/diffusers/loaders.py * remove TEXT_ENCODER_TARGET_MODULES loop * add print memory usage * remove test_kohya_loras_scaffold.py * add: doc on LoRA civitai * remove print statement and refactor in the doc. * fix state_dict test for kohya-ss style lora * Apply suggestions from code review Co-authored-by: Takuma Mori <takuma104@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 17:40:24 +05:30
Sayak Paul	55dbfa0229	[Docs] include the instruction-tuning blog link in the InstructPix2Pix docs (#3644) include the instruction-tuning blog link.	2023-06-02 08:04:35 +05:30
Will Berman	4f14b36329	Full Dreambooth IF stage II upscaling (#3561) * update dreambooth lora to work with IF stage II * Update dreambooth script for IF stage II upscaler	2023-05-31 09:39:31 -07:00
Will Berman	f751b8844e	update dreambooth lora to work with IF stage II (#3560)	2023-05-31 09:39:03 -07:00
Prathik Rao	abb89da4de	update code to reflect latest changes as of May 30th (#3616) * update code to reflect latest changes as of May 30th * update text to image example * reflect changes to textual inversion * make style * fix typo * Revert unnecessary readme changes --------- Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-05-31 11:29:04 +02:00
Patrick von Platen	0cc3a7a123	Make sure we also change the config when setting `encoder_hid_dim_type=="text_proj"` and allow xformers (#3615) * fix if * make style * make style * add tests for xformers * make style * update	2023-05-30 20:47:14 +01:00
Patrick von Platen	9d3ff0794d	fix tests (#3614)	2023-05-30 18:59:07 +01:00
Patrick von Platen	160c377ddc	Make style	2023-05-30 13:14:09 +01:00
Denis	bb22d546c0	[Community] CLIP Guided Images Mixing with Stable DIffusion Pipeline (#3587) * added clip_guided_images_mixing_stable_diffusion file and readme description * apply pre-commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-30 13:13:45 +01:00
takuoko	07ef4855cd	[Community, Enhancement] Add reference tricks in README (#3589) add reference tricks	2023-05-30 12:38:16 +01:00
Kadir Nar	6cbddf558a	[Community] Support StableDiffusionTilingPipeline (#3586) * added mixture pipeline * added docstring * update docstring	2023-05-30 12:24:15 +01:00
Leon Lin	1d1f648c6b	fix dreambooth attention mask (#3541)	2023-05-26 10:58:50 -07:00
Sayak Paul	8e69708b0d	[Examples/DreamBooth] refactor save_model_card utility in dreambooth examples (#3543) refactor save_model_card utility in dreambooth examples.	2023-05-24 16:16:28 +05:30
takuoko	b134f6a8b6	[Community] ControlNet Reference (#3508) add controlnet reference and bugfix Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-23 13:20:34 +01:00
yingjieh	edc6505193	[Community Pipelines]Accelerate inference of stable diffusion by IPEX on CPU (#3105) * add stable_diffusion_ipex community pipeline * Update readme.md * reformat * reformat * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update examples/community/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update README.md * Update README.md * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-23 10:55:14 +02:00
Will Berman	67cd460154	do not scale the initial global step by gradient accumulation steps when loading from checkpoint (#3506)	2023-05-22 15:19:56 -07:00
takuoko	c4359d63e3	[Community] reference only control (#3435) * add reference only control * add reference only control * add reference only control * fix lint * fix lint * reference adain * bugfix EulerAncestralDiscreteScheduler * fix style fidelity rule * fix default output size * del unused line * fix deterministic	2023-05-22 16:21:54 +01:00
Patrick von Platen	2b56e8ca68	make style	2023-05-22 16:49:46 +02:00
Ambrosiussen	b8b5daaee3	DataLoader respecting EXIF data in Training Images (#3465) * DataLoader will now bake in any transforms or image manipulations contained in the EXIF Images may have rotations stored in EXIF. Training using such images will cause those transforms to be ignored while training and thus produce unexpected results * Fixed the Dataloading EXIF issue in main DreamBooth training as well * Run make style (black & isort)	2023-05-22 15:49:35 +01:00
Will Berman	8d646f2294	dreambooth docs torch.compile note (#3471) * dreambooth docs torch.compile note * Update examples/dreambooth/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-19 07:40:14 +05:30
Will Berman	7200985eab	Add IF dreambooth docs (#3470)	2023-05-17 11:56:10 -07:00
Will Berman	c9f939bf98	Update full dreambooth script to work with IF (#3425)	2023-05-17 10:42:20 -07:00
wfng92	2faf91dbde	Add min snr to text2img lora training script (#3459) add min snr to text2img lora training script	2023-05-17 16:37:45 +05:30
Patrick von Platen	3ebd2d1f9e	Make dreambooth lora more robust to orig unet (#3462) * Make dreambooth lora more robust to orig unet * up	2023-05-17 11:20:13 +01:00
asfiyab-nvidia	9d44e2fb66	add stable diffusion tensorrt img2img pipeline (#3419) * add stable diffusion tensorrt img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update docstrings Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>	2023-05-16 14:28:01 +01:00
Sayak Paul	3a237f4fa2	fix: deepseepd_plugin retrieval from accelerate state (#3410)	2023-05-12 10:02:22 +01:00
Patrick von Platen	f92253015c	Fix various bugs with LoRA Dreambooth and Dreambooth script (#3353) * Improve checkpointing lora * fix more * Improve doc string * Update src/diffusers/loaders.py * make stytle * Apply suggestions from code review * Update src/diffusers/loaders.py * Apply suggestions from code review * Apply suggestions from code review * better * Fix all * Fix multi-GPU dreambooth * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix all * make style * make style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-11 19:28:09 +01:00
Stas Bekman	af2a237676	[deepspeed] partial ZeRO-3 support (#3076) * [deepspeed] partial ZeRO-3 support * cleanup * improve deepspeed fixes * Improve * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-11 16:59:20 +01:00
Will Berman	a757b2db6e	if dreambooth lora (#3360) * update IF stage I pipelines add fixed variance schedulers and lora loading * added kv lora attn processor * allow loading into alternative lora attn processor * make vae optional * throw away predicted variance * allow loading into added kv lora layer * allow load T5 * allow pre compute text embeddings * set new variance type in schedulers * fix copies * refactor all prompt embedding code class prompts are now included in pre-encoding code max tokenizer length is now configurable embedding attention mask is now configurable * fix for when variance type is not defined on scheduler * do not pre compute validation prompt if not present * add example test for if lora dreambooth * add check for train text encoder and pre compute text embeddings	2023-05-09 10:24:36 -07:00
Lucca Zenóbio	0407c3e7d0	Fix pipeline class on README (#3345) Update README.md	2023-05-06 12:06:52 +01:00
Adrià Arrufat	e9aa0925a8	Rename --only_save_embeds to --save_as_full_pipeline (#3206) * Set --only_save_embeds to False by default Due to how the option is named, it makes more sense to behave like this. * Refactor only_save_embeds to save_as_full_pipeline	2023-05-06 12:00:30 +01:00
Isamu Isozaki	fa9e35fca4	Added input pretubation (#3292) * Added input pretubation * Fixed spelling	2023-05-04 18:12:32 +05:30
Markus Pobitzer	2dd408504a	Add Stable Diffusion RePaint to community pipelines (#3320) * Add Stable Diffsuion RePaint to community pipelines - Adds Stable Diffsuion RePaint to community pipelines - Add Readme enty for pipeline * Fix: Remove wrong import - Remove wrong import - Minor change in comments * Fix: Code formatting of stable_diffusion_repaint * Fix: ruff errors in stable_diffusion_repaint	2023-05-03 17:59:49 +01:00
Sayak Paul	efc48da23b	fix: scale_lr and sync example readme and docs. (#3299) * fix: scale_lr and sync example readme and docs. * fix doc link.	2023-05-03 10:13:05 +05:30
Patrick von Platen	d464214464	Let's make sure that dreambooth always uploads to the Hub (#3272) * Update Dreambooth README * Adapt all docs as well * automatically write model card * fix * make style	2023-04-28 11:39:50 +01:00
timegate	6290668254	Add multiple conditions to StableDiffusionControlNetInpaintPipeline (#3125) * try multi controlnet inpaint * multi controlnet inpaint * multi controlnet inpaint	2023-04-28 10:58:10 +01:00
Joqsan	462b4edd31	[Community Pipelines] EDICT pipeline implementation (#3153) * EDICT pipeline initial commit - Starting point taking from https://github.com/Joqsan/edict-diffusion * refactor __init__() method * minor refactoring * refactor scheduler code - remove scheduler and move its methods to the EDICTPipeline class * make CFG optional - refactor encode_prompt(). - include optional generator for sampling with vae. - minor variable renaming * add EDICT pipeline description to README.md * replace preprocess() with VaeImageProcessor * run make style and make quality commands --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 10:11:29 +01:00
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Patrick von Platen	2ced899cc7	Revert "Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline"" (#3265) Revert "Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline" (#3201)" This reverts commit `91a2a80eb2`.	2023-04-27 16:45:37 +01:00
Pedro Cuenca	70ef774fa0	Remove required from tracker_project_name (#3260) Remove required from tracker_project_name. As observed by https://github.com/off99555 in https://github.com/huggingface/diffusers/issues/2695#issuecomment-1470755050, it already has a default value.	2023-04-27 16:59:18 +05:30
Pedro Cuenca	e0a2bd15f9	Write model card in controlnet training script (#3229) Write model card in controlnet training script.	2023-04-26 21:22:27 +02:00

470 Commits