diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Patrick von Platen	2b56e8ca68	make style	2023-05-22 16:49:46 +02:00
Ambrosiussen	b8b5daaee3	DataLoader respecting EXIF data in Training Images (#3465 ) * DataLoader will now bake in any transforms or image manipulations contained in the EXIF Images may have rotations stored in EXIF. Training using such images will cause those transforms to be ignored while training and thus produce unexpected results * Fixed the Dataloading EXIF issue in main DreamBooth training as well * Run make style (black & isort)	2023-05-22 15:49:35 +01:00
Will Berman	8d646f2294	dreambooth docs torch.compile note (#3471 ) * dreambooth docs torch.compile note * Update examples/dreambooth/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-19 07:40:14 +05:30
Will Berman	7200985eab	Add IF dreambooth docs (#3470 )	2023-05-17 11:56:10 -07:00
Will Berman	c9f939bf98	Update full dreambooth script to work with IF (#3425 )	2023-05-17 10:42:20 -07:00
wfng92	2faf91dbde	Add min snr to text2img lora training script (#3459 ) add min snr to text2img lora training script	2023-05-17 16:37:45 +05:30
Patrick von Platen	3ebd2d1f9e	Make dreambooth lora more robust to orig unet (#3462 ) * Make dreambooth lora more robust to orig unet * up	2023-05-17 11:20:13 +01:00
asfiyab-nvidia	9d44e2fb66	add stable diffusion tensorrt img2img pipeline (#3419 ) * add stable diffusion tensorrt img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update docstrings Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>	2023-05-16 14:28:01 +01:00
Sayak Paul	3a237f4fa2	fix: deepseepd_plugin retrieval from accelerate state (#3410 )	2023-05-12 10:02:22 +01:00
Patrick von Platen	f92253015c	Fix various bugs with LoRA Dreambooth and Dreambooth script (#3353 ) * Improve checkpointing lora * fix more * Improve doc string * Update src/diffusers/loaders.py * make stytle * Apply suggestions from code review * Update src/diffusers/loaders.py * Apply suggestions from code review * Apply suggestions from code review * better * Fix all * Fix multi-GPU dreambooth * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix all * make style * make style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-11 19:28:09 +01:00
Stas Bekman	af2a237676	[deepspeed] partial ZeRO-3 support (#3076 ) * [deepspeed] partial ZeRO-3 support * cleanup * improve deepspeed fixes * Improve * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-11 16:59:20 +01:00
Will Berman	a757b2db6e	if dreambooth lora (#3360 ) * update IF stage I pipelines add fixed variance schedulers and lora loading * added kv lora attn processor * allow loading into alternative lora attn processor * make vae optional * throw away predicted variance * allow loading into added kv lora layer * allow load T5 * allow pre compute text embeddings * set new variance type in schedulers * fix copies * refactor all prompt embedding code class prompts are now included in pre-encoding code max tokenizer length is now configurable embedding attention mask is now configurable * fix for when variance type is not defined on scheduler * do not pre compute validation prompt if not present * add example test for if lora dreambooth * add check for train text encoder and pre compute text embeddings	2023-05-09 10:24:36 -07:00
Lucca Zenóbio	0407c3e7d0	Fix pipeline class on README (#3345 ) Update README.md	2023-05-06 12:06:52 +01:00
Adrià Arrufat	e9aa0925a8	Rename --only_save_embeds to --save_as_full_pipeline (#3206 ) * Set --only_save_embeds to False by default Due to how the option is named, it makes more sense to behave like this. * Refactor only_save_embeds to save_as_full_pipeline	2023-05-06 12:00:30 +01:00
Isamu Isozaki	fa9e35fca4	Added input pretubation (#3292 ) * Added input pretubation * Fixed spelling	2023-05-04 18:12:32 +05:30
Markus Pobitzer	2dd408504a	Add Stable Diffusion RePaint to community pipelines (#3320 ) * Add Stable Diffsuion RePaint to community pipelines - Adds Stable Diffsuion RePaint to community pipelines - Add Readme enty for pipeline * Fix: Remove wrong import - Remove wrong import - Minor change in comments * Fix: Code formatting of stable_diffusion_repaint * Fix: ruff errors in stable_diffusion_repaint	2023-05-03 17:59:49 +01:00
Sayak Paul	efc48da23b	fix: scale_lr and sync example readme and docs. (#3299 ) * fix: scale_lr and sync example readme and docs. * fix doc link.	2023-05-03 10:13:05 +05:30
Patrick von Platen	d464214464	Let's make sure that dreambooth always uploads to the Hub (#3272 ) * Update Dreambooth README * Adapt all docs as well * automatically write model card * fix * make style	2023-04-28 11:39:50 +01:00
timegate	6290668254	Add multiple conditions to StableDiffusionControlNetInpaintPipeline (#3125 ) * try multi controlnet inpaint * multi controlnet inpaint * multi controlnet inpaint	2023-04-28 10:58:10 +01:00
Joqsan	462b4edd31	[Community Pipelines] EDICT pipeline implementation (#3153 ) * EDICT pipeline initial commit - Starting point taking from https://github.com/Joqsan/edict-diffusion * refactor __init__() method * minor refactoring * refactor scheduler code - remove scheduler and move its methods to the EDICTPipeline class * make CFG optional - refactor encode_prompt(). - include optional generator for sampling with vae. - minor variable renaming * add EDICT pipeline description to README.md * replace preprocess() with VaeImageProcessor * run make style and make quality commands --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 10:11:29 +01:00
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180 ) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Patrick von Platen	2ced899cc7	Revert "Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline"" (#3265 ) Revert "Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline" (#3201)" This reverts commit `91a2a80eb2`.	2023-04-27 16:45:37 +01:00
Pedro Cuenca	70ef774fa0	Remove required from tracker_project_name (#3260 ) Remove required from tracker_project_name. As observed by https://github.com/off99555 in https://github.com/huggingface/diffusers/issues/2695#issuecomment-1470755050, it already has a default value.	2023-04-27 16:59:18 +05:30
Pedro Cuenca	e0a2bd15f9	Write model card in controlnet training script (#3229 ) Write model card in controlnet training script.	2023-04-26 21:22:27 +02:00
Patrick von Platen	f842396367	Post release for 0.16.0 (#3244 ) * Post release * fix more	2023-04-26 17:43:09 +01:00
Patrick von Platen	6ba0efb9a1	Release: v0.16.0	2023-04-26 13:35:01 +02:00
Lucca Zenóbio	0ddc5bf7b9	fix mixed precision training on train_dreambooth_inpaint_lora (#3138 ) cast to weight dtype	2023-04-25 15:22:57 +05:30
Will Berman	91a2a80eb2	Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline" (#3201 ) Revert "[Community Pipelines] Update lpw_stable_diffusion pipeline (#3197)" This reverts commit `9965cb50ea`.	2023-04-22 12:36:55 -07:00
SkyTNT	9965cb50ea	[Community Pipelines] Update lpw_stable_diffusion pipeline (#3197 ) * Update lpw_stable_diffusion.py * fix cpu offload	2023-04-22 15:07:45 +01:00
Chengrui Wang	20e426cb5d	Fix bug in train_dreambooth_lora (#3183 ) * Update train_dreambooth_lora.py fix bug * Update train_dreambooth_lora.py	2023-04-22 09:04:28 +05:30
Patrick von Platen	2c04e5855c	Multi Vector Textual Inversion (#3144 ) * Multi Vector * Improve * fix multi token * improve test * make style * Update examples/test_examples.py * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * update * Finish * Apply suggestions from code review --------- Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-04-21 19:06:19 +01:00
asfiyab-nvidia	05d9baeacd	Fix TensorRT community pipeline device set function (#3157 ) pass silence_dtype_warnings as kwarg Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-21 18:53:10 +01:00
Sayak Paul	3045fb2763	[DreamBooth] add text encoder LoRA support in the DreamBooth training script (#3130 ) * add: LoRA text encoder support for DreamBooth example. * fix initialization. * fix: modification call. * add: entry in the readme. * use dog dataset from hub. * fix: params to clip. * add entry to the LoRA doc. * add: tests for lora. * remove unnecessary list comprehension./	2023-04-20 17:25:17 +05:30
XinyuYe-Intel	a5b242d30d	Added distillation for quantization example on textual inversion. (#2760 ) * Added distillation for quantization example on textual inversion. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * refined readme and code style. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * Update text2images.py * refined code of model load and added compatibility check. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * fixed code style. Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> * fix C403 [*] Unnecessary `list` comprehension (rewrite as a `set` comprehension) Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com> --------- Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>	2023-04-20 11:55:42 +01:00
nupurkmr9	3979aac996	adding custom diffusion training to diffusers examples (#3031 ) * diffusers==0.14.0 update * custom diffusion update * custom diffusion update * custom diffusion update * custom diffusion update * custom diffusion update * custom diffusion update * custom diffusion * custom diffusion * custom diffusion * custom diffusion * custom diffusion * apply formatting and get rid of bare except. * refactor readme and other minor changes. * misc refactor. * fix: repo_id issue and loaders logging bug. * fix: save_model_card. * fix: save_model_card. * fix: save_model_card. * add: doc entry. * refactor doc,. * custom diffusion * custom diffusion * custom diffusion * apply style. * remove tralining whitespace. * fix: toctree entry. * remove unnecessary print. * custom diffusion * custom diffusion * custom diffusion test * custom diffusion xformer update * custom diffusion xformer update * custom diffusion xformer update --------- Co-authored-by: Nupur Kumari <nupurkumari@Nupurs-MacBook-Pro.local> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Nupur Kumari <nupurkumari@nupurs-mbp.wifi.local.cmu.edu>	2023-04-20 09:31:42 +02:00
Will Berman	7e6886f5e9	controlnet training resize inputs to multiple of 8 (#3135 ) controlnet training center crop input images to multiple of 8 The pipeline code resizes inputs to multiples of 8. Not doing this resizing in the training script is causing the encoded image to have different height/width dimensions than the encoded conditioning image (which uses a separate encoder that's part of the controlnet model). We resize and center crop the inputs to make sure they're the same size (as well as all other images in the batch). We also check that the initial resolution is a multiple of 8.	2023-04-19 10:46:51 -07:00
asfiyab-nvidia	bba1c1de15	Add TensorRT SD/txt2img Community Pipeline to diffusers along with TensorRT utils (#2974 ) * Add SD/txt2img Community Pipeline to diffusers along with TensorRT utils Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update installation command Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update tensorrt installation Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * changes 1. Update setting of cache directory 2. Address comments: merge utils and pipeline code. 3. Address comments: Add section in README Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * apply make style Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-19 17:51:03 +01:00
Patrick von Platen	4bc157ffa9	Correct textual inversion readme (#3145 ) * Update README.md * Apply suggestions from code review	2023-04-18 16:35:12 +01:00
Patrick von Platen	f2df39fa0e	make style	2023-04-18 14:03:17 +02:00
Cristian Garcia	8ecdd3ef65	Optimize log_validation in train_controlnet_flax (#3110 ) extract pipeline from log_validation	2023-04-18 13:03:00 +01:00
Sayak Paul	3b641eabe9	feat: verfication of multi-gpu support for select examples. (#3126 ) * feat: verfication of multi-gpu support for select examples. * add: multi-gpu training sections to the relvant doc pages.	2023-04-18 08:36:13 +05:30
Patrick von Platen	703307efcc	Fix config deprecation (#3129 ) * Better deprecation message * Better deprecation message * Better doc string * Fixes * fix more * fix more * Improve __getattr__ * correct more * fix more * fix * Improve more * more improvements * fix more * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * make style * Fix all rest & add tests & remove old deprecation fns --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-04-17 17:16:28 +01:00
YiYi Xu	1bd4c9e93d	remvoe one line as requested by gc team (#3077 ) remvoe one line	2023-04-14 06:39:25 -10:00
Andreas Steiner	d06e06940b	Adds profiling flags, computes train metrics average. (#3053 ) * WIP controlnet training - bugfix --streaming - bugfix running report_to!='wandb' - adds memory profile before validation * Adds final logging statement. * Sets train epochs to 11. Looking at a longer ~16ep run, we see only good validation images after ~11ep: https://wandb.ai/andsteing/controlnet_fill50k/runs/3j2hx6n8 * Removes --logging_dir (it's not used). * Adds --profile flags. * Updates --output_dir=runs/fill-circle-{timestamp}. * Compute mean of `train_metrics`. Previously `train_metrics[-1]` was logged, resulting in very bumpy train metrics. * Improves logging a bit. - adds l2_grads gradient norm logging - adds steps_per_sec - sets walltime as x coordinate of train/step - logs controlnet_params config * Adds --ccache (doesn't really help though). * minor fix in controlnet flax example (#2986) * fix the error when push_to_hub but not log validation * contronet_from_pt & controlnet_revision * add intermediate checkpointing to the guide * Bugfix --profile_steps * Sets `RACKER_PROJECT_NAME='controlnet_fill50k'`. * Logs fractional epoch. * Adds relative `walltime` metric. * Adds `StepTraceAnnotation` and uses `global_step` insetad of `step`. * Applied `black`. * Streamlines commands in README a bit. * Removes `--ccache`. This makes only a very small difference (~1 min) with this model size, so removing the option introduced in cdb3cc. * Re-ran `black`. * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Converts spaces to tab. * Removes repeated args. * Skips first step (compilation) in profiling * Updates README with profiling instructions. * Unifies tabs/spaces in README. * Re-ran style & quality. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-12 08:29:18 -10:00
Patrick von Platen	0a73b4d3cd	[Post release] v0.16.0dev (#3072 )	2023-04-12 17:18:30 +01:00
Patrick von Platen	e7534542a2	Release: v0.15.0	2023-04-12 15:15:31 +00:00
Sayak Paul	5a7d35e29c	Fix InstructPix2Pix training in multi-GPU mode (#2978 ) * fix: norm group test for UNet3D. * fix: unet rejig. * fix: unwrapping when running validation inputs. * unwrapping the unet too. * fix: device. * better unwrapping. * unwrapping before ema. * unwrapping.	2023-04-12 10:13:53 +01:00
Sayak Paul	e607a582cf	[Examples] Fix type-casting issue in the ControlNet training script (#2994 ) * fix: norm group test for UNet3D. * fix: type-casting issue in controlnet training.	2023-04-12 06:35:06 +05:30
Chanchana Sornsoontorn	52c4d32d41	Fix typo and format BasicTransformerBlock attributes (#2953 ) * ⚙️chore(train_controlnet) fix typo in logger message * ⚙️chore(models) refactor modules order; make them the same as calling order When printing the BasicTransformerBlock to stdout, I think it's crucial that the attributes order are shown in proper order. And also previously the "3. Feed Forward" comment was not making sense. It should have been close to self.ff but it's instead next to self.norm3 * correct many tests * remove bogus file * make style * correct more tests * finish tests * fix one more * make style * make unclip deterministic * ⚙️chore(models/attention) reorganize comments in BasicTransformerBlock class --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-12 00:31:05 +02:00
Will Berman	67ec9cf513	accelerate min version for ProjectConfiguration import (#3042 )	2023-04-11 10:12:28 -07:00

1 2 3 4 5 ...

444 Commits