diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Sayak Paul	288ceebea5	[T2I LoRA training] fix: unscale fp16 gradient problem (#6119 ) * fix: unscale fp16 gradient problem * fix for dreambooth lora sdxl * make the type-casting conditional. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-19 09:54:17 +05:30
Haofan Wang	7d0a47f387	Update train_text_to_image_lora.py (#6144 ) * Update train_text_to_image_lora.py * Fix typo? --------- Co-authored-by: M. Tolga Cangöz <46008593+standardAI@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 19:33:05 +01:00
Aryan V S	67b3d3267e	Support img2img and inpaint in lpw-xl (#6114 ) * add img2img and inpaint support to lpw-xl * update community README --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 19:19:11 +01:00
TilmannR	4e77056885	Update README.md (#6191 ) Typo: The script for LoRA training is `train_text_to_image_lora_prior.py` not `train_text_to_image_prior_lora.py`. Alternatively you could rename the file and keep the README.md unchanged.	2023-12-18 19:08:29 +01:00
Sayak Paul	b98b314b7a	[Training] remove depcreated method from lora scripts. (#6207 ) remove depcreated method from lora scripts.	2023-12-18 15:52:43 +05:30
Yudong Jin	49644babd3	Fix the test script in examples/text_to_image/README.md (#6209 ) * Update examples/text_to_image/README.md * Update examples/text_to_image/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 15:36:00 +05:30
dg845	49db233b35	Clean Up Comments in LCM(-LoRA) Distillation Scripts. (#6145 ) * Clean up comments in LCM(-LoRA) distillation scripts. * Calculate predicted source noise noise_pred correctly for all prediction_types. * make style * apply suggestions from review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-15 18:18:16 +05:30
Linoy Tsaban	29dfe22a8e	[advanced dreambooth lora sdxl training script] load pipeline for inference only if validation prompt is used (#6171 ) * load pipeline for inference only if validation prompt is used * move things outside * load pipeline for inference only if validation prompt is used * fix readme when validation prompt is used --------- Co-authored-by: linoytsaban <linoy@huggingface.co> Co-authored-by: apolinário <joaopaulo.passos@gmail.com>	2023-12-14 11:45:33 -06:00
Monohydroxides	c46711e895	[Community] Add SDE Drag pipeline (#6105 ) * Add community pipeline: sde_drag.py * Update README.md * Update README.md Update example code and visual example * Update sde_drag.py Update code example.	2023-12-14 20:47:20 +05:30
M. Tolga Cangöz	0a401b95b7	[`Docs`] Fix typos (#6122 ) Fix typos and trim trailing whitespaces	2023-12-11 10:55:28 -08:00
apolinário	2a111bc9fe	[Advanced Training Script] Fix pipe example (#6106 )	2023-12-08 15:56:35 +01:00
apolinário	16e6997f0d	[Advanced Diffusion Script] Add Widget default text (#6100 ) add widget	2023-12-08 12:45:27 +01:00
Aryan V S	978dec9014	[Community] AnimateDiff + Controlnet Pipeline (#5928 ) * begin work on animatediff + controlnet pipeline * complete todos, uncomment multicontrolnet, input checks Co-Authored-By: EdoardoBotta <botta.edoardo@gmail.com> * update Co-Authored-By: EdoardoBotta <botta.edoardo@gmail.com> * add example * update community README * Update examples/community/README.md --------- Co-authored-by: EdoardoBotta <botta.edoardo@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-06 21:01:41 -10:00
Younes Belkada	c2717317f0	[`PEFT`] Adapt example scripts to use PEFT (#5388 ) * adapt example scripts to use PEFT * Update examples/text_to_image/train_text_to_image_lora.py * fix * add for SDXL * oops * make sure to install peft * fix * fix * fix dreambooth and lora * more fixes * add peft to requirements.txt * fix * final fix * add peft version in requirements * remove comment * change variable names * add few lines in readme * add to reqs * style * fix issues * fix lora dreambooth xl tests * init_lora_weights to gaussian and add out proj where missing * ammend requirements. * ammend requirements.txt * add correct peft versions --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-07 09:39:29 +05:30
Lucain	75ada25048	Harmonize HF environment variables + deprecate use_auth_token (#6066 ) * Harmonize HF environment variables + deprecate use_auth_token * fix import * fix	2023-12-06 22:22:31 +01:00
apolinário	466d32c442	[Advanced Diffusion Training] Cache latents to avoid VAE passes for every training step (#6076 ) * add cache latents * style	2023-12-06 14:46:53 +01:00
Pedro Cuenca	ab6672fecd	Use CC12M for LCM WDS training example (#5908 ) * Fix SD scripts - there are only 2 items per batch * Adjustments to make the SDXL scripts work with other datasets * Use public webdataset dataset for examples * make style * Minor tweaks to the readmes. * Stress that the database is illustrative.	2023-12-06 10:35:36 +01:00
apolinário	6e221334cd	[advanced_dreambooth_lora_sdxl_tranining_script] save embeddings locally fix (#6058 ) * Update train_dreambooth_lora_sdxl_advanced.py * remove global function args from dreamboothdataset class * style * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-05 13:52:34 +01:00
Radamés Ajna	eacf5e34eb	Fix demofusion (#6049 ) * Update pipeline_demofusion_sdxl.py * Update README.md	2023-12-05 18:10:46 +05:30
Linoy Tsaban	880c0fdd36	[advanced dreambooth lora training script][bug_fix] change token_abstraction type to str (#6040 ) * improve help tags * style fix * changes token_abstraction type to string. support multiple concepts for pivotal using a comma separated string. * style fixup * changed logger to warning (not yet available) * moved the token_abstraction parsing to be in the same block as where we create the mapping of identifier to token --------- Co-authored-by: Linoy <linoy@huggingface.co>	2023-12-04 18:38:44 +01:00
RuoyiDu	c36f1c3160	[Community Pipeline] DemoFusion: Democratising High-Resolution Image Generation With No $$$ (#6022 ) * Add files via upload * Update README.md * Update pipeline_demofusion_sdxl.py * Update pipeline_demofusion_sdxl.py * Update examples/community/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-04 19:44:57 +05:30
Levi McCallum	e185084a5d	Add variant argument to dreambooth lora sdxl advanced (#6021 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-04 12:04:15 +01:00
gujing	bf92e746c0	fix StableDiffusionTensorRT super args error (#6009 )	2023-12-04 10:06:23 +05:30
Linoy Tsaban	b785a155d6	[advanced dreambooth lora sdxl training script] improve help tags (#6035 ) * improve help tags * style fix --------- Co-authored-by: Linoy <linoy@huggingface.co>	2023-12-04 09:41:25 +05:30
Long(Tony) Lian	618260409f	LLMGroundedDiffusionPipeline: inherit from DiffusionPipeline and fix peft (#6023 ) * LLMGroundedDiffusionPipeline: inherit from DiffusionPipeline and fix peft * Use main in the revision in the examples * Add "Copied from" statements in comments * Fix formatting with ruff	2023-12-01 09:58:25 -10:00
Patrick von Platen	dadd55fb36	Post Release: v0.24.0 (#5985 ) * Post Release: v0.24.0 * post pone deprecation * post pone deprecation * Add model_index.json	2023-12-01 18:43:44 +01:00
Patrick von Platen	0f55c17e17	fix style	2023-12-01 15:59:34 +00:00
hako-mikan	46c751e970	[Community Pipeline] Regional Prompting Pipeline (#6015 ) * Update README.md * Update README.md * Add files via upload * Update README.md * Update examples/community/README.md --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-01 16:22:59 +01:00
Linoy Tsaban	c1e4529541	[advanced_dreambooth_lora_sdxl_tranining_script] readme fix (#6019 ) readme	2023-12-01 15:14:57 +01:00
Linoy Tsaban	d29d97b616	[examples/advanced_diffusion_training] bug fixes and improvements for LoRA Dreambooth SDXL advanced training script (#5935 ) * imports and readme bug fixes * bug fix - ensures text_encoder params are dtype==float32 (when using pivotal tuning) even if the rest of the model is loaded in fp16 * added pivotal tuning to readme * mapping token identifier to new inserted token in validation prompt (if used) * correct default value of --train_text_encoder_frac * change default value of --adam_weight_decay_text_encoder * validation prompt generations when using pivotal tuning bug fix * style fix * textual inversion embeddings name change * style fix * bug fix - stopping text encoder optimization halfway * readme - will include token abstraction and new inserted tokens when using pivotal tuning - added type to --num_new_tokens_per_abstraction * style fix --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-12-01 14:18:43 +01:00
Kristian Mischke	141cd52d56	Fix LLMGroundedDiffusionPipeline super class arguments (#5993 ) * make `requires_safety_checker` a kwarg instead of a positional argument as it's more future-proof * apply `make style` formatting edits * add image_encoder to arguments and pass to super constructor	2023-11-30 10:15:14 -10:00
Kashif Rasul	01782c220e	[Wuerstchen] Adapt lora training example scripts to use PEFT (#5959 ) * Adapt lora example scripts to use PEFT * add to_out.0	2023-11-29 16:18:20 +01:00
Linh Nguyen	636feba552	Rename output_dir argument (#5916 ) Fix typo in output_dir argument: "text-inversion-model" → "dreambooth-model"	2023-11-29 15:47:16 +01:00
Andrés Romero	79dc7df03e	[bug fix] Inpainting for MultiAdapter (#5922 ) * bug in MultiAdapter for Inpainting * adapter_input is a list for MultiAdapter --------- Co-authored-by: andres <andres@hax.ai> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-11-29 15:46:26 +01:00
Sayak Paul	fdd003d8e2	[Tests] Refactor `test_examples.py` for better readability (#5946 ) * control and custom diffusion * dreambooth * instructpix2pix and dreambooth ckpting * t2i adapters. * text to image ft * textual inversion * unconditional * workflows * import fix * fix import	2023-11-29 18:43:59 +05:30
estelleafl	5ae3c3a56b	[ldm3d] Ldm3d upscaler to community pipeline (#5870 ) --------- Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-11-28 09:00:39 -10:00
T. Xu	14a0d21d2e	[Community Pipeline] Diffusion Posterior Sampling for General Noisy Inverse Problems (#5939 ) * [community pipeline] dps impl * add type checking * pass ruff check * ruff formatter	2023-11-27 14:29:42 +01:00
Viktor Grygorchuk	20f0cbc88f	fix: error on device for `lpw_stable_diffusion_xl` pipeline if `pipe.enable_sequential_cpu_offload()` enabled (#5885 ) fix: set device for pipe.enable_sequential_cpu_offload()	2023-11-27 13:47:47 +01:00
ginjia	d3cda804e7	add LoRA weights load and fuse support for IPEX pipeline (#5920 ) add IPEX pipeline LoRA weights loading support	2023-11-27 13:32:43 +01:00
dg845	07eac4d65a	Fix LCM Stable Diffusion distillation bug related to parsing unet_time_cond_proj_dim (#5893 ) * Fix bug related to parsing unet_time_cond_proj_dim. * Fix analogous bug in the SD-XL LCM distillation script.	2023-11-27 13:00:40 +01:00
Wang, Yi	c7bfb8b22a	set the model to train state before accelerator prepare (#5099 ) Signed-off-by: Wang, Yi A <yi.a.wang@intel.com>	2023-11-27 12:43:49 +01:00
Patrick von Platen	6d2e19f746	[Examples] Allow downloading variant model files (#5531 ) * add variant * add variant * Apply suggestions from code review * reformat * fix: textual_inversion.py * fix: variant in model_info --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2023-11-27 10:43:20 +05:30
Linoy Tsaban	3003ff4947	[bug fix] fix small bug in readme template of sdxl lora training script (#5914 ) readme improvement and metadata fix	2023-11-23 19:08:49 +01:00
Linoy Tsaban	5ffa603244	[bug fix] fix small bug in readme template of sdxl lora training script (#5906 ) * readme bug fix * style fix --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-23 12:11:50 +01:00
Linoy Tsaban	0eeee618cf	Adds an advanced version of the SD-XL DreamBooth LoRA training script supporting pivotal tuning (#5883 ) * sdxl dreambooth lora training script with pivotal tuning * bug fix - args missing from parse_args * code quality fixes * comment unnecessary code from TokenEmbedding handler class * fixup --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-22 16:27:56 +01:00
Andrés Romero	93f1a14cab	ControlNet+Adapter pipeline, and ControlNet+Adapter+Inpaint pipeline (#5869 ) * ControlNet+Adapter pipeline, and +Inpaint pipeline --------- Co-authored-by: andres <andres@hax.ai>	2023-11-21 08:59:29 -10:00
Patrick von Platen	13d73d9303	[Lora] Seperate logic (#5809 ) * [Lora] Seperate logic * [Lora] Seperate logic * [Lora] Seperate logic * add comments to explain the code better * add comments to explain the code better	2023-11-21 18:58:37 +01:00
Linoy Tsaban	6fac1369d0	Add features to the Dreambooth LoRA SDXL training script (#5508 ) * Additions: - support for different lr for text encoder - support for Prodigy optimizer - support for min snr gamma - support for custom captions and dataset loading from the hub * adjusted --caption_column behaviour (to -not- use the second column of the dataset by default if --caption_column is not provided) * fixed --output_dir / --model_dir_name confusion * added --repeats, --adam_weight_decay_text_encoder + some fixes * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/dreambooth/train_dreambooth_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * - import compute_snr from diffusers/training_utils.py - cluster adamw together - when using 'prodigy', if --train_text_encoder == True and --text_encoder_lr != --learning rate, changes the lr of the text encoders optimization params to be --learning_rate (otherwise errors) * shape fixes when custom captions are used * formatting and a little cleanup * code styling * --repeats default value fixed, changed to 1 * bug fix - removed redundant lines of embedding concatenation when using prior_preservation (that duplicated class_prompt embeddings) * changed dataset loading logic according to the following usecases (to avoid unnecessary dependency on datasets)- 1. user provides --dataset_name 2. user provides local dir --instance_data_dir that contains a metadata .jsonl file 3. user provides local dir --instance_data_dir that contains only images in cases [1,2] we import datasets and use load_dataset method, in case [3] we process the data same as in the original script setting * styling fix * arg name fix * adjusted the --repeats logic * -removed redundant arg and 'if' when loading local folder with prompts -updated readme template -some default val fixes -custom caption tests * image path fix for readme * code style * bug fix * --caption_column arg * readme fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-11-21 17:38:43 +01:00
co63oc	ee519cfef5	Update README.md (#5855 )	2023-11-21 11:56:13 +01:00
Patrick von Platen	3303aec5f8	make style	2023-11-20 12:54:52 +01:00

1 2 3 4 5 ...

696 Commits