diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Aryan V S	9df566e6da	[Community] StyleAligned Pipeline (#6489 ) * add stylealigned sdxl pipeline * bugfix * update docs * remove einops dependency * update README * update example docstring	2024-01-11 14:35:55 +01:00
Sayak Paul	be0b425762	[Training] make checkpointing compatible when using `torch.compile` (part II) (#6511 ) make checkpointing compatible when using torch.compile.	2024-01-11 18:37:30 +05:30
dg845	17cece072a	Fix bug in LCM Distillation Scripts when args.unet_time_cond_proj_dim is used (#6523 ) * Fix bug where unet's time_cond_proj_dim is not set correctly if using args.unet_time_cond_proj_dim. * make style	2024-01-11 08:21:07 +05:30
Rahul Raman	2d1f2182cc	example: Train Instruct pix2 pix with lora implementation (#6469 ) * base template file - train_instruct_pix2pix.py * additional import and parser argument requried for lora * finetune only instructpix2pix model -- no need to include these layers * inject lora layers * freeze unet model -- only lora layers are trained * training modifications to train only lora parameters * store only lora parameters * move train script to research project * run quality and style code checks * move train script to a new folder * add README * update README * update references in README --------- Co-authored-by: Rahul Raman <rahulraman@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-10 06:38:19 +05:30
Sayak Paul	4497b3ec98	[Training] make DreamBooth SDXL LoRA training script compatible with torch.compile (#6483 ) * make it torch.compile comaptible * make the text encoder compatible too. * style	2024-01-09 20:11:26 +05:30
Yifan Zhou	fc63ebdd3a	[Community Pipeline] Rerender-A-Video: Zero-Shot Video-to-Video Translation (#6332 ) * upload codes and doc * lint * lint * lint * update code * remove blank lines * Fix load url	2024-01-09 14:55:34 +01:00
jiqing-feng	aa1797e109	enable stable-xl textual inversion (#6421 ) * enable stable-xl textual inversion * check if optimizer_2 exists * check text_encoder_2 before using * add textual inversion for sdxl in a single file * fix style * fix example style * reset for error changes * add readme for sdxl * fix style * disable autocast as it will cause cast error when weight_dtype=bf16 * fix spelling error * fix style and readme and 8bit optimizer * add README_sdxl.md link * add tracker key on log_validation * run style * rm the second center crop --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-09 15:12:33 +05:30
Sayak Paul	a483a8eddf	Rename REAMDE.md to README.md	2024-01-05 20:49:44 +05:30
Vinh H. Pham	3848606c7e	[Community Pipeline] Add gluegen (#6433 ) * init works * add gluegen pipeline * add gluegen code * add another way to load language adapter * make style * Update README.md * change doc	2024-01-05 13:05:40 +01:00
Sayak Paul	2a97067b84	[Experimental] Diffusion LoRA DPO training (#6422 ) * add: experimental script for diffusion dpo training. * random_crop cli. * fix: caption tokenization. * fix: pixel_values index. * fix: grad? * debug * fix: reduction. * fixes in the loss calculation. * style * fix: unwrap call. * fix: validation inference. * add: initial sdxl script * debug * make sure images in the tuple are of same res * fix model_max_length * report print * boom * fix: numerical issues. * fix: resolution * comment about resize. * change the order of the training transformation. * save call. * debug * remove print * manually detaching necessary? * use the same vae for validation. * add: readme.	2024-01-05 16:40:06 +05:30
Sayak Paul	9d945b2b90	0.25.0 post release (#6358 ) * post release * style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2024-01-05 16:13:27 +05:30
Junsheng121	d184291c7d	null-text-inversion-pipeline-implementation (#6329 ) * null-text-inversion-implementation * edited * edited * edited * edited * edited * edit * makestyle --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-05 11:35:21 +01:00
Linoy Tsaban	2fada8dc1b	[bug fix] fixes #6444 - checkpointing save issue in advanced dreambooth lora sdxl script (#6464 ) * unwrap text encoder when saving hook only for full text encoder tuning * unwrap text encoder when saving hook only for full text encoder tuning * save embeddings in each checkpoint as well * save embeddings in each checkpoint as well * save embeddings in each checkpoint as well * Update examples/advanced_diffusion_training/train_dreambooth_lora_sdxl_advanced.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-05 15:35:24 +05:30
jiqing-feng	f2d51a28f7	Intel Gen 4 Xeon and later support bf16 (#6367 ) * Intel Gen 4 Xeon and later support bf16 * fix bf16 notes	2024-01-05 11:47:28 +05:30
dg845	f3d1333e02	Improve LCM(-LoRA) Distillation Scripts (#6420 ) * Make WDS pipeline interpolation type configurable. * Make the VAE encoding batch size configurable. * Make lora_alpha and lora_dropout configurable for LCM LoRA scripts. * Generalize scalings_for_boundary_conditions function and make the timestep scaling configurable. * Make LoRA target modules configurable for LCM-LoRA scripts. * Move resolve_interpolation_mode to src/diffusers/training_utils.py and make interpolation type configurable in non-WDS script. * apply suggestions from review	2024-01-05 06:55:13 +05:30
Sayak Paul	aad18faa3e	Update README_sdxl.md to update the LR (#6432 ) Update README_sdxl.md	2024-01-03 20:55:51 +05:30
Sayak Paul	d700140076	[LoRA deprecation] handle rest of the stuff related to deprecated lora stuff. (#6426 ) * handle rest of the stuff related to deprecated lora stuff. * fix: copies * don't modify the uNet in-place. * fix: temporal autoencoder. * manually remove lora layers. * don't copy unet. * alright * remove lora attn processors from unet3d * fix: unet3d. * styl * Empty-Commit	2024-01-03 20:54:09 +05:30
Aryan V S	e30b661437	Update lpw_xl pipeline to latest diffusers (#6411 ) * add clip_skip, freeu, qkv * fix * add ip-adapter support * callback on step end * update * fix NoneType bug * fix * add guidance scale embedding * add textual inversion	2024-01-02 16:28:45 +01:00
Linoy Tsaban	b4077af212	[bug fix] using snr gamma and prior preservation loss in the dreambooth lora sdxl training scripts (#6356 ) * change timesteps used to calculate snr when --with_prior_preservation is enabled * change timesteps used to calculate snr when --with_prior_preservation is enabled (canonical script) * style * revert canonical script to before snr gamma change --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-02 09:21:39 -06:00
2510	8a366b835c	Fix gradient-checkpointing option is ignored in SDXL+LoRA training. (#6388 ) (#6402 ) * Fix gradient-checkpointing option is ignored in SDXL+LoRA training. (#6388) * Fix gradient-checkpointing option is ignored in SD+LoRA training. * Fix gradient checkpoint is not applied to text encoders. (SDXL+LoRA) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-01-01 08:51:04 +05:30
apolinário	1622265e13	Add WebUI format support to Advanced Training Script (#6403 ) * Add WebUI format support to Advanced Training Script * style --------- Co-authored-by: multimodalart <joaopaulo.passos+multimodal@gmail.com>	2023-12-30 08:45:49 -06:00
gzguevara	9f283b01d2	changed w&b report link (#6387 )	2023-12-29 19:49:11 +05:30
gzguevara	e7044a4221	multi-subject-dreambooth-inpainting with 🤗 datasets (#6378 ) * files added * fixing code quality * fixing code quality * fixing code quality * fixing code quality * sorted import block * seperated import wandb * ruff on script --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-29 09:33:49 +05:30
Sayak Paul	1ac07d8a8d	[Training examples] Follow up of #6306 (#6346 ) * add to dreambooth lora. * add: t2i lora. * add: sdxl t2i lora. * style * lcm lora sdxl. * unwrap * fix: enable_adapters().	2023-12-28 07:37:50 +05:30
apolinário	1fff527702	Fix keys for lora format on advanced training scripts (#6361 ) fix keys for lora format on advanced training scripts	2023-12-27 11:38:03 -06:00
apolinário	645a62bf3b	Add PEFT to advanced training script (#6294 ) * Fix ProdigyOPT in SDXL Dreambooth script * style * style * Add PEFT to Advanced Training Script * style * style * ✨ style ✨ * change order for logic operation * add lora alpha * style * Align PEFT to new format * Update train_dreambooth_lora_sdxl_advanced.py Apply #6355 fix --------- Co-authored-by: multimodalart <joaopaulo.passos+multimodal@gmail.com>	2023-12-27 10:00:32 -03:00
Andy W	43672b4a22	Fix "push_to_hub only create repo in consistency model lora SDXL training script" (#6102 ) * fix * style fix --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-27 15:25:19 +05:30
dg845	9df3d84382	Fix LCM distillation bug when creating the guidance scale embeddings using multiple GPUs. (#6279 ) Fix bug when creating the guidance embeddings using multiple GPUs. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-27 14:25:21 +05:30
Jianqi Pan	c751449011	fix: use retrieve_latents (#6337 )	2023-12-27 10:44:26 +05:30
Dhruv Nair	c1e8bdf1d4	Move ControlNetXS into Community Folder (#6316 ) * update * update * update * update * update * make style * remove docs * update * move to research folder. * fix-copies * remove _toctree entry. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-27 08:15:23 +05:30
Sayak Paul	78b87dc25a	[LoRA] make LoRAs trained with `peft` loadable when `peft` isn't installed (#6306 ) * spit diffusers-native format from the get go. * rejig the peft_to_diffusers mapping.	2023-12-27 08:01:10 +05:30
Will Berman	0af12f1f8a	amused update links to new repo (#6344 ) * amused update links to new repo * lint	2023-12-26 22:46:28 +01:00
priprapre	fa31704420	[SDXL-IP2P] Update README_sdxl, Replace the link for wandb log with the correct run (#6270 ) Replace the link for wandb log with the correct run	2023-12-26 21:13:11 +01:00
Sayak Paul	6683f97959	[Training] Add `datasets` version of LCM LoRA SDXL (#5778 ) * add: script to train lcm lora for sdxl with 🤗 datasets * suit up the args. * remove comments. * fix num_update_steps * fix batch unmarshalling * fix num_update_steps_per_epoch * fix; dataloading. * fix microconditions. * unconditional predictions debug * fix batch size. * no need to use use_auth_token * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * make vae encoding batch size an arg * final serialization in kohya * style * state dict rejigging * feat: no separate teacher unet. * debug * fix state dict serialization * debug * debug * debug * remove prints. * remove kohya utility and make style * fix serialization * fix * add test * add peft dependency. * add: peft * remove peft * autocast device determination from accelerator * autocast * reduce lora rank. * remove unneeded space * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * style * remove prompt dropout. * also save in native diffusers ckpt format. * debug * debug * debug * better formation of the null embeddings. * remove space. * autocast fixes. * autocast fix. * hacky * remove lora_sayak * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * style * make log validation leaner. * move back enabled in. * fix: log_validation call. * add: checkpointing tests * taking my chances to see if disabling autocasting has any effect? * start debugging * name * name * name * more debug * more debug * index * remove index. * print length * print length * print length * move unet.train() after add_adapter() * disable some prints. * enable_adapters() manually. * remove prints. * some changes. * fix params_to_optimize * more fixes * debug * debug * remove print * disable grad for certain contexts. * Add support for IPAdapterFull (#5911) * Add support for IPAdapterFull Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix a bug in `add_noise` function (#6085) * fix * copies --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> * [Advanced Diffusion Script] Add Widget default text (#6100) add widget * [Advanced Training Script] Fix pipe example (#6106) * IP-Adapter for StableDiffusionControlNetImg2ImgPipeline (#5901) * adapter for StableDiffusionControlNetImg2ImgPipeline * fix-copies * fix-copies --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * IP adapter support for most pipelines (#5900) * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_upscale.py * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_attend_and_excite.py * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_instruct_pix2pix.py * update tests * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_panorama.py * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_sag.py * support ip-adapter in src/diffusers/pipelines/stable_diffusion_safe/pipeline_stable_diffusion_safe.py * support ip-adapter in src/diffusers/pipelines/latent_consistency_models/pipeline_latent_consistency_text2img.py * support ip-adapter in src/diffusers/pipelines/latent_consistency_models/pipeline_latent_consistency_img2img.py * support ip-adapter in src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_ldm3d.py * revert changes to sd_attend_and_excite and sd_upscale * make style * fix broken tests * update ip-adapter implementation to latest * apply suggestions from review --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix: lora_alpha * make vae casting conditional/ * param upcasting * propagate comments from https://github.com/huggingface/diffusers/pull/6145 Co-authored-by: dg845 <dgu8957@gmail.com> * [Peft] fix saving / loading when unet is not "unet" (#6046) * [Peft] fix saving / loading when unet is not "unet" * Update src/diffusers/loaders/lora.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * undo stablediffusion-xl changes * use unet_name to get unet for lora helpers * use unet_name --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * [Wuerstchen] fix fp16 training and correct lora args (#6245) fix fp16 training Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * [docs] fix: animatediff docs (#6339) fix: animatediff docs * add: note about the new script in readme_sdxl. * Revert "[Peft] fix saving / loading when unet is not "unet" (#6046)" This reverts commit `4c7e983bb5`. * Revert "[Wuerstchen] fix fp16 training and correct lora args (#6245)" This reverts commit `0bb9cf0216`. * Revert "[docs] fix: animatediff docs (#6339)" This reverts commit `11659a6f74`. * remove tokenize_prompt(). * assistive comments around enable_adapters() and diable_adapters(). --------- Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: Fabio Rigano <57982783+fabiorigano@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: apolinário <joaopaulo.passos@gmail.com> Co-authored-by: Charchit Sharma <charchitsharma11@gmail.com> Co-authored-by: Aryan V S <contact.aryanvs@gmail.com> Co-authored-by: dg845 <dgu8957@gmail.com> Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com>	2023-12-26 21:22:05 +05:30
Kashif Rasul	35b81fffae	[Wuerstchen] fix fp16 training and correct lora args (#6245 ) fix fp16 training Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-26 11:40:04 +01:00
dg845	a3d31e3a3e	Change LCM-LoRA README Script Example Learning Rates to 1e-4 (#6304 ) Change README LCM-LoRA example learning rates to 1e-4.	2023-12-25 21:29:20 +05:30
Jianqi Pan	84c403aedb	fix: cannot set guidance_scale (#6326 ) fix: set guidance_scale	2023-12-25 21:16:57 +05:30
Sayak Paul	f4b0b26f7e	[Tests] Speed up example tests (#6319 ) * remove validation args from textual onverson tests * reduce number of train steps in textual inversion tests * fix: directories. * debig * fix: directories. * remove validation tests from textual onversion * try reducing the time of test_text_to_image_checkpointing_use_ema * fix: directories * speed up test_text_to_image_checkpointing * speed up test_text_to_image_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * fix * speed up test_instruct_pix2pix_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * set checkpoints_total_limit to 2. * test_text_to_image_lora_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints speed up * speed up test_unconditional_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * debug * fix: directories. * speed up test_instruct_pix2pix_checkpointing_checkpoints_total_limit * speed up: test_controlnet_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * speed up test_controlnet_sdxl * speed up dreambooth tests * speed up test_dreambooth_lora_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * speed up test_custom_diffusion_checkpointing_checkpoints_total_limit_removes_multiple_checkpoints * speed up test_text_to_image_lora_sdxl_text_encoder_checkpointing_checkpoints_total_limit * speed up # checkpoint-2 should have been deleted * speed up examples/text_to_image/test_text_to_image.py::TextToImage::test_text_to_image_checkpointing_checkpoints_total_limit * additional speed ups * style	2023-12-25 19:50:48 +05:30
mwkldeveloper	2d43094ffc	fix RuntimeError: Input type (float) and bias type (c10::Half) should be the same in train_text_to_image_lora.py (#6259 ) * fix RuntimeError: Input type (float) and bias type (c10::Half) should be the same * format source code * format code * remove the autocast blocks within the pipeline * add autocast blocks to pipeline caller in train_text_to_image_lora.py	2023-12-24 14:34:35 +05:30
Sayak Paul	90b9479903	[LoRA PEFT] fix LoRA loading so that correct alphas are parsed (#6225 ) * initialize alpha too. * add: test * remove config parsing * store rank * debug * remove faulty test	2023-12-24 09:59:41 +05:30
apolinário	df76a39e1b	Fix Prodigy optimizer in SDXL Dreambooth script (#6290 ) * Fix ProdigyOPT in SDXL Dreambooth script * style * style	2023-12-22 06:42:04 -06:00
Bingxin Ke	3369bc810a	[Community Pipeline] Add Marigold Monocular Depth Estimation (#6249 ) * [Community Pipeline] Add Marigold Monocular Depth Estimation - add single-file pipeline - update README * fix format - add one blank line * format script with ruff * use direct image link in example code --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-22 15:41:46 +05:30
Will Berman	4039815276	open muse (#5437 ) amused rename Update docs/source/en/api/pipelines/amused.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> AdaLayerNormContinuous default values custom micro conditioning micro conditioning docs put lookup from codebook in constructor fix conversion script remove manual fused flash attn kernel add training script temp remove training script add dummy gradient checkpointing func clarify temperatures is an instance variable by setting it remove additional SkipFF block args hardcode norm args rename tests folder fix paths and samples fix tests add training script training readme lora saving and loading non-lora saving/loading some readme fixes guards Update docs/source/en/api/pipelines/amused.md Co-authored-by: Suraj Patil <surajp815@gmail.com> Update examples/amused/README.md Co-authored-by: Suraj Patil <surajp815@gmail.com> Update examples/amused/train_amused.py Co-authored-by: Suraj Patil <surajp815@gmail.com> vae upcasting add fp16 integration tests use tuple for micro cond copyrights remove casts delegate to torch.nn.LayerNorm move temperature to pipeline call upsampling/downsampling changes	2023-12-21 11:40:55 -08:00
YShow	35a969d297	[Training] remove depcreated method from lora scripts again (#6266 ) * remove depcreated method from lora scripts * check code quality	2023-12-21 14:17:52 +05:30
lvzi	6ca9c4af05	fix: unscale fp16 gradient problem & potential error (#6086 ) (#6231 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-21 09:09:26 +05:30
dependabot[bot]	0532cece97	Bump transformers from 4.34.0 to 4.36.0 in /examples/research_projects/realfill (#6255 ) Bump transformers in /examples/research_projects/realfill Bumps [transformers](https://github.com/huggingface/transformers) from 4.34.0 to 4.36.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.34.0...v4.36.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2023-12-21 09:03:17 +05:30
hako-mikan	ff43dba7ea	[Fix] Fix Regional Prompting Pipeline (#6188 ) * Update regional_prompting_stable_diffusion.py * reformat * reformat * reformat * reformat * reformat * reformat * reformat * regormat * reformat * reformat * reformat * reformat * Update regional_prompting_stable_diffusion.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-20 10:37:19 +05:30
Sayak Paul	288ceebea5	[T2I LoRA training] fix: unscale fp16 gradient problem (#6119 ) * fix: unscale fp16 gradient problem * fix for dreambooth lora sdxl * make the type-casting conditional. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-19 09:54:17 +05:30
Haofan Wang	7d0a47f387	Update train_text_to_image_lora.py (#6144 ) * Update train_text_to_image_lora.py * Fix typo? --------- Co-authored-by: M. Tolga Cangöz <46008593+standardAI@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 19:33:05 +01:00
Aryan V S	67b3d3267e	Support img2img and inpaint in lpw-xl (#6114 ) * add img2img and inpaint support to lpw-xl * update community README --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 19:19:11 +01:00

1 2 3 4 5 ...

743 Commits