diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Bagheera	02a8d664a2	Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction … (#5238 ) Min-SNR Gamma: correct the fix for SNR weighted loss in v-prediction by adding 1 to SNR rather than the resulting loss weights Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-10-05 20:52:27 +02:00
Bagheera	4a06c74547	Min-SNR Gamma: follow-up fix for zero-terminal SNR models on v-prediction or epsilon (#5177 ) * merge with main * fix flax example * fix onnx example --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-26 18:14:52 +05:30
Bagheera	24563ca654	SNR gamma fixes for v_prediction training (#5106 ) Co-authored-by: bghira <bghira@users.github.com>	2023-09-20 21:18:56 +01:00
Patrick von Platen	342c5c02c0	[Release 0.21] Bump version (#5018 ) * [Release 0.21] Bump version * fix & remove * fix more * fix all, upload	2023-09-14 18:28:57 +02:00
Will Berman	d73e6ad050	guard save model hooks to only execute on main process (#4929 )	2023-09-08 10:30:06 -07:00
Sayak Paul	d0c30cfd37	make post-release (#4650 )	2023-08-17 14:16:25 +05:30
Sayak Paul	d67eba0f31	[Utility] adds an image grid utility (#4576 ) * add: utility for image grid. * add: return type. * change necessary places. * add to utility page.	2023-08-12 10:34:51 +05:30
Patrick von Platen	ea1fcc28a4	[SDXL] Allow SDXL LoRA to be run with less than 16GB of VRAM (#4470 ) * correct * correct blocks * finish * finish * finish * Apply suggestions from code review * fix * up * up * up * Update examples/dreambooth/README_sdxl.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Apply suggestions from code review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-08-04 20:06:38 +02:00
Patrick von Platen	b7b6d6138d	[SDXL] Make watermarker optional under certain circumstances to improve usability of SDXL 1.0 (#4346 ) * improve sdxl * more fixes * improve sdxl * improve sdxl * improve sdxl * finish	2023-07-28 19:29:22 +02:00
Sayak Paul	54fab2cd5f	Update README_sdxl.md to correct the header (#4330 ) Update README_sdxl.md	2023-07-28 09:22:14 +05:30
Sayak Paul	961173064d	Honor the SDXL 1.0 licensing from the training scripts. (#4319 ) * honor the original license. * train_instruct_pix2pix_xl -> train_instruct_pix2pix_sdxl	2023-07-28 01:28:36 +05:30
Patrick von Platen	20e92586c1	0.20.0dev0 (#4299 ) * 0.20.0dev0 * make style	2023-07-26 23:06:18 +02:00
Patrick von Platen	6a6dfe1cbd	Rename (#4294 ) * up * Apply suggestions from code review * Apply suggestions from code review * up	2023-07-26 20:41:21 +02:00
Sayak Paul	fed12376c5	[ControlNet SDXL training] fixes in the training script (#4223 ) * fix: #4206 * add: sdxl controlnet training smoketest. * remove unnecessary token inits. * add: licensing to model card. * include SDXL licensing in the model card and make public visibility default * debugging * debugging * disable local file download. * fix: training test. * fix: ckpt prefix.	2023-07-25 05:31:48 +05:30
Sayak Paul	4dcab9227a	[SDXL ControlNet Training] Follow-up fixes (#4188 ) * hash computation. thanks to @lhoestq * disable dtype casting. * remove comments.	2023-07-21 20:55:33 +05:30
Patrick von Platen	d620070bb3	[ControlNet Training] Remove safety from controlnet (#4180 ) Remove safety from controlnet	2023-07-21 08:03:59 +05:30
Sayak Paul	3eb498e7b4	[Core] add: controlnet support for SDXL (#4038 ) * add: controlnet sdxl. * modifications to controlnet. * run styling. * add: __init__.pys * incorporate https://github.com/huggingface/diffusers/pull/4019 changes. * run make fix-copies. * resize the conditioning images. * remove autocast. * run styling. * disable autocast. * debugging * device placement. * back to autocast. * remove comment. * save some memory by reusing the vae and unet in the pipeline. * apply styling. * Allow low precision sd xl * finish * finish * changes to accommodate the improved VAE. * modifications to how we handle vae encoding in the training. * make style * make existing controlnet fast tests pass. * change vae checkpoint cli arg. * fix: vae pretrained paths. * fix: steps in get_scheduler(). * debugging. * debugging./ * fix: weight conversion. * add: docs. * add: limited tests./ * add: datasets to the requirements. * update docstrings and incorporate the usage of watermarking. * incorporate fix from #4083 * fix watermarking dependency handling. * run make-fix-copies. * Empty-Commit * Update requirements_sdxl.txt * remove vae upcasting part. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * run make style * run make fix-copies. * disable suppot for multicontrolnet. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * run make fix-copies. * dtyle/. * fix-copies. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-07-18 18:25:34 +05:30
Ruoxi	ece55227ff	Multiply lr scheduler steps by `num_processes`. (#3983 ) * Multiply lr scheduler steps by `num_processes`. * Stop multiplying steps by gradient accumulation.	2023-07-13 17:50:25 +05:30
Patrick von Platen	b9feed8795	move to 0.19.0dev (#4048 )	2023-07-11 22:49:12 +02:00
Will Berman	d49e2dd54c	manual check for checkpoints_total_limit instead of using accelerate (#3681 ) * manual check for checkpoints_total_limit instead of using accelerate * remove controlnet_conditioning_embedding_out_channels	2023-06-15 15:38:54 -07:00
Patrick von Platen	c42f6ee43e	Post 0.17.0 release (#3721 ) * Post release * Post release	2023-06-08 18:08:49 +02:00
Zachary Mueller	79fa94ea8b	Apply deprecations from Accelerate (#3714 ) Apply deprecations	2023-06-08 16:44:22 +02:00
Will Berman	67cd460154	do not scale the initial global step by gradient accumulation steps when loading from checkpoint (#3506 )	2023-05-22 15:19:56 -07:00
Pedro Cuenca	70ef774fa0	Remove required from tracker_project_name (#3260 ) Remove required from tracker_project_name. As observed by https://github.com/off99555 in https://github.com/huggingface/diffusers/issues/2695#issuecomment-1470755050, it already has a default value.	2023-04-27 16:59:18 +05:30
Pedro Cuenca	e0a2bd15f9	Write model card in controlnet training script (#3229 ) Write model card in controlnet training script.	2023-04-26 21:22:27 +02:00
Patrick von Platen	f842396367	Post release for 0.16.0 (#3244 ) * Post release * fix more	2023-04-26 17:43:09 +01:00
Patrick von Platen	6ba0efb9a1	Release: v0.16.0	2023-04-26 13:35:01 +02:00
Will Berman	7e6886f5e9	controlnet training resize inputs to multiple of 8 (#3135 ) controlnet training center crop input images to multiple of 8 The pipeline code resizes inputs to multiples of 8. Not doing this resizing in the training script is causing the encoded image to have different height/width dimensions than the encoded conditioning image (which uses a separate encoder that's part of the controlnet model). We resize and center crop the inputs to make sure they're the same size (as well as all other images in the batch). We also check that the initial resolution is a multiple of 8.	2023-04-19 10:46:51 -07:00
Patrick von Platen	f2df39fa0e	make style	2023-04-18 14:03:17 +02:00
Cristian Garcia	8ecdd3ef65	Optimize log_validation in train_controlnet_flax (#3110 ) extract pipeline from log_validation	2023-04-18 13:03:00 +01:00
Sayak Paul	3b641eabe9	feat: verfication of multi-gpu support for select examples. (#3126 ) * feat: verfication of multi-gpu support for select examples. * add: multi-gpu training sections to the relvant doc pages.	2023-04-18 08:36:13 +05:30
Andreas Steiner	d06e06940b	Adds profiling flags, computes train metrics average. (#3053 ) * WIP controlnet training - bugfix --streaming - bugfix running report_to!='wandb' - adds memory profile before validation * Adds final logging statement. * Sets train epochs to 11. Looking at a longer ~16ep run, we see only good validation images after ~11ep: https://wandb.ai/andsteing/controlnet_fill50k/runs/3j2hx6n8 * Removes --logging_dir (it's not used). * Adds --profile flags. * Updates --output_dir=runs/fill-circle-{timestamp}. * Compute mean of `train_metrics`. Previously `train_metrics[-1]` was logged, resulting in very bumpy train metrics. * Improves logging a bit. - adds l2_grads gradient norm logging - adds steps_per_sec - sets walltime as x coordinate of train/step - logs controlnet_params config * Adds --ccache (doesn't really help though). * minor fix in controlnet flax example (#2986) * fix the error when push_to_hub but not log validation * contronet_from_pt & controlnet_revision * add intermediate checkpointing to the guide * Bugfix --profile_steps * Sets `RACKER_PROJECT_NAME='controlnet_fill50k'`. * Logs fractional epoch. * Adds relative `walltime` metric. * Adds `StepTraceAnnotation` and uses `global_step` insetad of `step`. * Applied `black`. * Streamlines commands in README a bit. * Removes `--ccache`. This makes only a very small difference (~1 min) with this model size, so removing the option introduced in cdb3cc. * Re-ran `black`. * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Converts spaces to tab. * Removes repeated args. * Skips first step (compilation) in profiling * Updates README with profiling instructions. * Unifies tabs/spaces in README. * Re-ran style & quality. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-04-12 08:29:18 -10:00
Patrick von Platen	0a73b4d3cd	[Post release] v0.16.0dev (#3072 )	2023-04-12 17:18:30 +01:00
Patrick von Platen	e7534542a2	Release: v0.15.0	2023-04-12 15:15:31 +00:00
Sayak Paul	e607a582cf	[Examples] Fix type-casting issue in the ControlNet training script (#2994 ) * fix: norm group test for UNet3D. * fix: type-casting issue in controlnet training.	2023-04-12 06:35:06 +05:30
Chanchana Sornsoontorn	52c4d32d41	Fix typo and format BasicTransformerBlock attributes (#2953 ) * ⚙️chore(train_controlnet) fix typo in logger message * ⚙️chore(models) refactor modules order; make them the same as calling order When printing the BasicTransformerBlock to stdout, I think it's crucial that the attributes order are shown in proper order. And also previously the "3. Feed Forward" comment was not making sense. It should have been close to self.ff but it's instead next to self.norm3 * correct many tests * remove bogus file * make style * correct more tests * finish tests * fix one more * make style * make unclip deterministic * ⚙️chore(models/attention) reorganize comments in BasicTransformerBlock class --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-12 00:31:05 +02:00
Will Berman	67ec9cf513	accelerate min version for ProjectConfiguration import (#3042 )	2023-04-11 10:12:28 -07:00
YiYi Xu	dcfa6e1d20	add Min-SNR loss to Controlnet flax train script (#3016 ) * add wandb team and min-snr loss * make style * apply feedbacks	2023-04-10 07:56:54 +05:30
YiYi Xu	2de36fae7b	minor fix in controlnet flax example (#2986 ) * fix the error when push_to_hub but not log validation * contronet_from_pt & controlnet_revision * add intermediate checkpointing to the guide	2023-04-06 10:27:41 -10:00
YiYi Xu	ee20d1f8b9	update flax controlnet training script (#2951 ) * load_from_disk + checkpointing_steps * apply feedback	2023-04-04 15:49:44 -10:00
YiYi Xu	0c63c3839a	allow use custom local dataset for controlnet training scripts (#2928 ) use custom local datset Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-04 10:37:47 -07:00
Lucain	a87e88b783	Use `upload_folder` in training scripts (#2934 ) use upload folder in training scripts Co-authored-by: testbot <lucainp@hf.co>	2023-04-04 16:19:12 +01:00
Patrick von Platen	a0263b2e5b	make style	2023-04-04 15:18:39 +02:00
Ernie Chu	62c01d267a	Ensure validation image RGB not RGBA (#2945 ) * ensure validation image RGB not RGBA * ensure validation image RGB not RGBA --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-04 14:17:59 +01:00
YiYi Xu	b3d5cc4a36	add flax requirement (#2894 ) Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-03-30 17:10:26 +01:00
Sayak Paul	d82b032319	[Examples] Add streaming support to the ControlNet training example in JAX (#2859 ) * improve stable unclip doc. * feat: add streaming support to controlnet flax training script. * fix: CLI arg. * fix: torch dataloader shuffle setting. * fix: dataset length. * fix: wandb config. * fix: steps_per_epoch in the training loop. * add: entry about streaming in the readme * get column names from iterable dataset + fix final logging --------- Co-authored-by: yiyixuxu <yixu310@gmail.com>	2023-03-29 06:42:08 +05:30
YiYi Xu	d4f846fa74	[WIP]Flax training script for controlnet (#2818 ) * add train_controlnet_flax --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-27 19:13:35 -10:00
Henrik Forstén	79eb3d07d0	Controlnet training (#2545 ) * Controlnet training code initial commit Works with circle dataset: https://github.com/lllyasviel/ControlNet/blob/main/docs/train.md * Script for adding a controlnet to existing model * Fix control image transform Control image should be in 0..1 range. * Add license header and remove more unused configs * controlnet training readme * Allow nonlocal model in add_controlnet.py * Formatting * Remove unused code * Code quality * Initialize controlnet in training script * Formatting * Address review comments * doc style * explicit constructor args and submodule names * hub dataset NOTE - not tested * empty prompts * add conditioning image * rename * remove instance data dir * image_transforms -> -1,1 . conditioning_image_transformers -> 0, 1 * nits * remove local rank config I think this isn't necessary in any of our training scripts * validation images * proportion_empty_prompts typo * weight copying to controlnet bug * call log validation fix * fix * gitignore wandb * fix progress bar and resume from checkpoint iteration * initial step fix * log multiple images * fix * fixes * tracker project name configurable * misc * add controlnet requirements.txt * update docs * image labels * small fixes * log validation using existing models for pipeline * fix for deepspeed saving * memory usage docs * Update examples/controlnet/train_controlnet.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/train_controlnet.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * remove extra is main process check * link to dataset in intro paragraph * remove unnecessary paragraph * note on deepspeed * Update examples/controlnet/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * assert -> value error * weights and biases note * move images out of git * remove .gitignore --------- Co-authored-by: William Berman <WLBberman@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-14 20:16:30 -07:00
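
The message for 02a8d664a2 (#5238) describes the v-prediction fix as "adding 1 to SNR rather than the resulting loss weights". A minimal sketch of one reading of that sentence follows. The function name `min_snr_loss_weight` and its scalar signature are hypothetical and are not taken from the training scripts, which operate on tensors; the sketch also assumes a strictly positive SNR for epsilon prediction.

```python
def min_snr_loss_weight(snr: float, gamma: float, prediction_type: str = "epsilon") -> float:
    """Hypothetical helper: per-timestep loss weight under Min-SNR gamma.

    Assumption: for v-prediction, 1 is added to the SNR itself before the
    clamp and the division, not to the weight that comes out afterwards.
    """
    if prediction_type == "v_prediction":
        # Add 1 to the SNR value, as the commit message describes.
        snr = snr + 1.0
    if snr <= 0:
        # Not covered by this sketch (e.g. zero-terminal SNR with epsilon).
        raise ValueError("SNR must be positive to compute a loss weight")
    # Clamp the SNR at gamma, then normalise by the SNR.
    return min(snr, gamma) / snr


if __name__ == "__main__":
    for snr in (0.5, 2.0, 10.0):
        print(snr, min_snr_loss_weight(snr, gamma=5.0),
              min_snr_loss_weight(snr, gamma=5.0, prediction_type="v_prediction"))
```

Under this reading, a v-prediction timestep with an SNR of zero still yields a finite weight, because the divisor is never smaller than 1.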

48 Commits
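
The message for 7e6886f5e9 (#3135) says the ControlNet training script checks that the resolution is a multiple of 8 and then resizes and center crops every input to the same size. A minimal sketch of that geometry in plain Python follows. The helper name `resize_and_center_crop_box` is hypothetical, and resizing the shorter side to the target resolution is an assumption about how the resize is done; the sketch only computes sizes and does not touch image data.

```python
def resize_and_center_crop_box(width: int, height: int, resolution: int):
    """Hypothetical helper: size after resizing and the centered crop box.

    Returns ((new_width, new_height), (left, top, right, bottom)).
    """
    if resolution % 8 != 0:
        # The commit message states that the resolution must be a multiple of 8.
        raise ValueError("resolution must be a multiple of 8")
    # Assumption: scale so that the shorter side equals the target resolution.
    scale = resolution / min(width, height)
    new_width = max(resolution, round(width * scale))
    new_height = max(resolution, round(height * scale))
    # Center the square crop inside the resized image.
    left = (new_width - resolution) // 2
    top = (new_height - resolution) // 2
    return (new_width, new_height), (left, top, left + resolution, top + resolution)


if __name__ == "__main__":
    print(resize_and_center_crop_box(1024, 768, 512))
```

Applying the same box to the image and to its conditioning image keeps the two the same height and width, which is the mismatch the commit message describes.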