diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Steven Liu	ddd8bd53ed	[docs] LCM training (#5796 ) * first draft * feedback	2023-11-29 16:08:05 +01:00
Steven Liu	bae14c8bcb	[docs] Update training docs (#5512 ) * first draft * try hfoption syntax * fix hfoption id * add text2image * fix tag * feedback * feedbacks * add textual inversion * DreamBooth * lora * controlnet * instructpix2pix * custom diffusion * t2i * separate training methods and models * sdxl * kandinsky * wuerstchen * light edits	2023-11-14 10:29:56 -08:00
Garry Dolley	1328aeb274	[Docs] Clarify that these are two separate examples (#5734 ) * [Docs] Running the pipeline twice does not appear to be the intention of these examples One is with `cross_attention_kwargs` and the other (next line) removes it * [Docs] Clarify that these are two separate examples One using `scale` and the other without it	2023-11-09 14:26:14 -08:00
M. Tolga Cangöz	a35e72b032	[`Docs`] Fix typos, improve, update at Using Diffusers' Tecniques page (#5627 ) Fix typos, improve, update; better visualization	2023-11-03 13:51:41 +01:00
Susheel Thapa	324d18fba2	Chore: Typo fixed in multiple files (#5422 )	2023-10-17 08:17:03 -07:00
Heinz-Alexander Fuetterer	0ea78f9707	chore: fix typos (#5386 ) * chore: fix typos * Update src/diffusers/pipelines/shap_e/renderer.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-10-16 15:23:37 +02:00
Sayak Paul	e6faf607f7	add: entry for DDPO support. (#5250 ) * add: entry for DDPO support. * move to training * address steven's comments./	2023-10-05 14:29:00 +02:00
Sayak Paul	bfc606301f	add doc around fusing multiple loras. (#5056 ) * add doc around fusing multiple loras. * Apply suggestions from code review Co-authored-by: apolinário <joaopaulo.passos@gmail.com> * address poli's comments. --------- Co-authored-by: apolinário <joaopaulo.passos@gmail.com>	2023-09-18 12:42:58 +01:00
YiYi Xu	e70cb1243f	[WIP] adding Kandinsky training scripts (#4890 ) * Add files via upload Co-authored-by: Shahmatov Arseniy <62886550+cene555@users.noreply.github.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-14 06:58:20 -10:00
Sayak Paul	8009272f48	[Tests and Docs] Add a test on serializing pipelines with components containing fused LoRA modules (#4962 ) * add: test to ensure pipelines can be saved with fused lora modules. * add docs about serialization with fused lora. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Empty-Commit * Update docs/source/en/training/lora.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-13 10:01:37 +01:00
Sayak Paul	914c513ee0	[Docs] add t2i adapter entry to overview of training scripts. (#4946 ) add t2i adapter entry to overview of training scripts.	2023-09-09 06:52:11 +05:30
Suraj Patil	dfec61f4b3	[examples] T2IAdapter training script (#4934 ) * add t2i_example script * remove in channels logic * remove comments * remove use_euler arg * add requirements * only use canny example * use datasets * comments * make log_validation consistent with other scripts * add readme * fix title in readme * update check_min_version * change a few minor things. * add doc entry * add: test for t2i adapter training * remove use_auth_token * fix: logged info. * remove tests for now. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-08 10:03:02 +05:30
Sayak Paul	6c314ad0ce	[Docs] add doc entry to explain lora fusion and use of different scales. (#4893 ) * add doc entry to explain lora fusion and use of different scales. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-06 07:38:13 +05:30
Sayak Paul	19f3161d94	[Docs] improve the LoRA doc. (#4838 ) * improve the LoRA doc. * include fuse_lora and unfuse_lora * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-08-31 00:13:15 +05:30
Patrick von Platen	6e1af3a777	[Docs] Fix docs controlnet missing /Tip (#4717 )	2023-08-22 18:40:26 +02:00
Sayak Paul	351aab60e9	Update text2image.md to fix the links (#4626 )	2023-08-16 09:53:10 +05:30
Sayak Paul	5175d3d7a5	add: train to text image with sdxl script. (#4505 ) * add: train to text image with sdxl script. Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> * fix: partial func. * fix: default value of output_dir. * make style * set num inference steps to 25. * remove mentions of LoRA. * up min version * add: ema cli arg * run device placement while running step. * precompute vae encodings too. * fix * debug * should work now. * debug * debug * goes alright? * style * debugging * debugging * debugging * debugging * fix * reinit scheduler if prediction_type was passed. * akways cast vae in float32 * better handling of snr. Co-authored-by: bghira <bghira@users.github.com> * the vae should be also passed * add: docs. * add: sdlx t2i tests * save the pipeline * autocast. * fix: save_model_card * fix: save_model_card. --------- Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: bghira <bghira@users.github.com>	2023-08-16 09:02:49 +05:30
Steven Liu	cd7071e750	[docs] Add safetensors flag (#4245 ) * add safetensors flag * apply review	2023-08-10 12:37:23 -07:00
Sayak Paul	b2456717e6	Update lora.md to clarify SDXL support (#4503 ) * Update lora.md * Update lora.md	2023-08-07 11:06:30 +05:30
takuoko	9c29bc2df8	[Examples] Support train_text_to_image_lora_sdxl.py (#4365 ) * add train_text_to_image_lora_sdxl.py * add train_text_to_image_lora_sdxl.py * add test and minor fix * Update examples/text_to_image/README_sdxl.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix unwrap_model rule * add invisible-watermark in requirements * del invisible-watermark * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/train_text_to_image_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * del comment & update readme --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-06 13:47:20 +05:30
Sayak Paul	816ca0048f	[LoRA] Fix SDXL text encoder LoRAs (#4371 ) * temporarily disable text encoder loras. * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debbuging. * modify doc. * rename tests. * print slices. * fix: assertions * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-02 17:00:56 +05:30
Sayak Paul	ba43ce3476	minor doc fixes. (#4380 )	2023-07-31 12:15:56 +05:30
Sayak Paul	4a4cdd6b07	[Feat] Support SDXL Kohya-style LoRA (#4287 ) * sdxl lora changes. * better name replacement. * better replacement. * debugging * debugging * debugging * debugging * debugging * remove print. * print state dict keys. * print * distingisuih better * debuggable. * fxi: tyests * fix: arg from training script. * access from class. * run style * debug * save intermediate * some simplifications for SDXL LoRA * styling * unet config is not needed in diffusers format. * fix: dynamic SGM block mapping for SDXL kohya loras (#4322) * Use lora compatible layers for linear proj_in/proj_out (#4323) * improve condition for using the sgm_diffusers mapping * informative comment. * load compatible keys and embedding layer maaping. * Get SDXL 1.0 example lora to load * simplify * specif ranks and hidden sizes. * better handling of k rank and hidden * debug * debug * debug * debug * debug * fix: alpha keys * add check for handling LoRAAttnAddedKVProcessor * sanity comment * modifications for text encoder SDXL * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * denugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * up * up * up * up * up * up * unneeded comments. * unneeded comments. * kwargs for the other attention processors. * kwargs for the other attention processors. * debugging * debugging * debugging * debugging * improve * debugging * debugging * more print * Fix alphas * debugging * debugging * debugging * debugging * debugging * debugging * clean up * clean up. * debugging * fix: text --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Batuhan Taskaya <batuhan@python.org>	2023-07-28 19:49:49 +02:00
camenduru	c6ae9b7df6	Where did this 'x' come from, Elon? (#4277 ) * why mdx? * why mdx? * why mdx? * no x for kandinksy either --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-07-26 18:18:14 +02:00
Batuhan Taskaya	ff8f58086b	Load Kohya-ss style LoRAs with auxilary states (#4147 ) * Support to load Kohya-ss style LoRA file format (without restrictions) Co-Authored-By: Takuma Mori <takuma104@gmail.com> Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> * tmp: add sdxl to mlp_modules --------- Co-authored-by: Takuma Mori <takuma104@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-07-26 00:24:19 +02:00
Harutatsu Akiyama	428dbfecd9	[SDXL and IP2P]: instruction pix2pix XL training and pipeline (#4079 ) * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * [Community] Implementation of the IADB community pipeline (#3996) * community pipeline: implementation of iadb * iadb.py: reformat using black * iadb.py: linting update * add kandinsky to readme table (#4081) Co-authored-by: yiyixuxu <yixu310@gmail,com> * [From Single File] Force accelerate to be installed (#4078) force accelerate to be installed * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Support instruction pix2pix sdxl * Clean up IP2P SDXL code * Clean up IP2P SDXL code * [IP2P and SDXL] clean up code * [IP2P and SDXL] clean up code * [IP2P and SDXL] clean up code * [IP2P SDXL] Address code reviews * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews, add docs, tests * [IP2P SDXL] Address code reviews * [IP2P SDXL] Address code reviews * [IP2P SDXL] Add README_SDXL * [IP2P SDXL] Address code reviews * [IP2P SDXL] Address code reviews * [IP2P SDXL] Fix the copy problems * [IP2P SDXL] Add license * [IP2P SDXL] Add license * [IP2P SDXL] Add license * [IP2P SDXL] Address code reivew for selecting VAE andd others * [IP2P SDXL] Update README_sdxl * [IP2P SDXL] Update __init__ * [IP2P SDXL] Update dummy_torch_and_transformers_and_invisible_watermark_objects * address patrick's comments and some additions to readmes. --------- Co-authored-by: Harutatsu Akiyama <kf.zy.qin@gmail.com> Co-authored-by: Thomas Chambon <36728882+tchambon@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-07-25 18:19:35 +05:30
Sayak Paul	692b7a907d	[Feat] add: utility for unloading lora. (#4034 ) * add: test for testing unloading lora. * add :reason to skipif. * initial implementation of lora unload(). * apply styling. * add: doc. * change checkpoints. * reinit generator * finalize slow test. * add fast test for unloading lora.	2023-07-14 16:30:18 +05:30
Sayak Paul	3d74dc2abd	[Examples] Add a training script for SDXL DreamBooth LoRA (#4016 ) * add dreambooth lora script for SDXL incorporating latest changes. * remove use_auth_token=True. * add: documentation * remove unneeded cli. * increase the number of training steps in the readme. * add LoraLoaderMixin to the subclassing mix. * add sdxl lora dreambooth test. * add: inference code sample. * add: refiner output. * add LoraLoaderMixin to the mix of classes of StableDiffusionXLImg2ImgPipeline. * change default resolution of DreamBoothDataset. * better sdxl report path. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-07-11 07:38:41 +05:30
Steven Liu	f96b760658	[docs] Fix Colab notebook cells (#3777 ) fix colab notebook cells	2023-06-14 10:21:39 -07:00
Patrick von Platen	74fd735eb0	Add draft for lora text encoder scale (#3626 ) * Add draft for lora text encoder scale * Improve naming * fix: training dreambooth lora script. * Apply suggestions from code review * Update examples/dreambooth/train_dreambooth_lora.py * Apply suggestions from code review * Apply suggestions from code review * add lora mixin when fit * add lora mixin when fit * add lora mixin when fit * fix more * fix more --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-06 22:47:46 +01:00
Will Berman	0fc2fb71c1	dreambooth upscaling fix added latents (#3659 )	2023-06-05 10:32:16 +01:00
Will Berman	5911a3aa47	dreambooth if docs - stage II, more info (#3628 ) * dreambooth if docs - stage II, more info * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * download instructions for downsized images * update source README to match docs --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 10:37:13 -07:00
Takuma Mori	8e552bb4fe	Support Kohya-ss style LoRA file format (in a limited capacity) (#3437 ) * add _convert_kohya_lora_to_diffusers * make style * add scaffold * match result: unet attention only * fix monkey-patch for text_encoder * with CLIPAttention While the terrible images are no longer produced, the results do not match those from the hook ver. This may be due to not setting the network_alpha value. * add to support network_alpha * generate diff image * fix monkey-patch for text_encoder * add test_text_encoder_lora_monkey_patch() * verify that it's okay to release the attn_procs * fix closure version * add comment * Revert "fix monkey-patch for text_encoder" This reverts commit `bb9c61e6fa`. * Fix to reuse utility functions * make LoRAAttnProcessor targets to self_attn * fix LoRAAttnProcessor target * make style * fix split key * Update src/diffusers/loaders.py * remove TEXT_ENCODER_TARGET_MODULES loop * add print memory usage * remove test_kohya_loras_scaffold.py * add: doc on LoRA civitai * remove print statement and refactor in the doc. * fix state_dict test for kohya-ss style lora * Apply suggestions from code review Co-authored-by: Takuma Mori <takuma104@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 17:40:24 +05:30
Sayak Paul	55dbfa0229	[Docs] include the instruction-tuning blog link in the InstructPix2Pix docs (#3644 ) include the instruction-tuning blog link.	2023-06-02 08:04:35 +05:30
Steven Liu	ab986769f1	[docs] Maintenance (#3552 ) * doc fixes * fix latex * parenthesis on inside	2023-05-26 12:04:15 -07:00
Sayak Paul	49ad61c204	[Docs] add note on local directory path. (#3397 ) add note on local directory path. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-21 15:26:56 +05:30
Steven Liu	e589bdb956	[docs] Distributed inference (#3376 ) * distributed inference * move to inference section * apply feedback * update with split_between_processes * apply feedback	2023-05-19 10:07:33 -07:00
Sayak Paul	e343443565	add: if entry in the dreambooth training docs. (#3472 )	2023-05-19 07:47:28 +05:30
Steven Liu	c49e9ede4d	[docs] Adapt a model (#3326 ) * first draft * apply feedback * conv_in.weight thrown away	2023-05-10 16:02:48 -07:00
Steven Liu	4bae76e453	[docs] Improve LoRA docs (#3311 ) * update docs * add to toctree * apply feedback	2023-05-04 11:28:44 +05:30
Sayak Paul	efc48da23b	fix: scale_lr and sync example readme and docs. (#3299 ) * fix: scale_lr and sync example readme and docs. * fix doc link.	2023-05-03 10:13:05 +05:30
YiYi Xu	14b460614b	[doc] add link to training script (#3271 ) add link to training script Co-authored-by: yiyixuxu <yixu@yis-macbook-pro.lan>	2023-04-28 07:14:30 -10:00
Patrick von Platen	d464214464	Let's make sure that dreambooth always uploads to the Hub (#3272 ) * Update Dreambooth README * Adapt all docs as well * automatically write model card * fix * make style	2023-04-28 11:39:50 +01:00
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180 ) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Jair Trejo	eb29dbad17	Fix typo in textual inversion JAX training script (#3123 ) The pipeline is built as `pipe` but then used as `pipeline`.	2023-04-27 16:24:12 +01:00
Patrick von Platen	2c04e5855c	Multi Vector Textual Inversion (#3144 ) * Multi Vector * Improve * fix multi token * improve test * make style * Update examples/test_examples.py * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * update * Finish * Apply suggestions from code review --------- Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-04-21 19:06:19 +01:00
Steven Liu	391cfcd7d7	[docs] Clarify training args (#3146 ) * clarify training arg * apply feedback	2023-04-21 11:03:44 -07:00
Sayak Paul	e573ae06e2	Update custom_diffusion.mdx to credit the author (#3163 ) * Update custom_diffusion.mdx * fix: unnecessary list comprehension.	2023-04-21 18:44:08 +01:00
Sayak Paul	3045fb2763	[DreamBooth] add text encoder LoRA support in the DreamBooth training script (#3130 ) * add: LoRA text encoder support for DreamBooth example. * fix initialization. * fix: modification call. * add: entry in the readme. * use dog dataset from hub. * fix: params to clip. * add entry to the LoRA doc. * add: tests for lora. * remove unnecessary list comprehension./	2023-04-20 17:25:17 +05:30
Mishig	a121e05feb	Update custom_diffusion.mdx (#3165 ) Add missing newlines for rendering the links correctly	2023-04-20 11:04:06 +02:00

1 2

71 Commits