diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Álvaro Somoza	edcbe8038b	Fix huggingface-hub failing tests (#11994 ) * login * more logins * uploads * missed login * another missed login * downloads * examples and more logins * fix * setup * Apply style fixes * fix * Apply style fixes	2025-07-29 02:34:58 -04:00
Quentin Gallouédec	c8bb1ff53e	Use HF Papers (#11567 ) * Use HF Papers * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-19 06:22:33 -10:00
drhead	2ada094bff	Add extra performance features for EMAModel, torch._foreach operations and better support for non-blocking CPU offloading (#7685 ) * Add support for _foreach operations and non-blocking to EMAModel * default foreach to false * add non-blocking EMA offloading to SD1.5 T2I example script * fix whitespace * move foreach to cli argument * linting * Update README.md re: EMA weight training * correct args.foreach_ema * add tests for foreach ema * code quality * add foreach to from_pretrained * default foreach false * fix linting --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: drhead <a@a.a>	2024-06-24 14:03:47 +05:30
Tolga Cangöz	98730c5dd7	Errata (#8322 ) * Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit `fd742b30b4`. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style	2024-06-05 13:59:09 -07:00
Bagheera	8edaf3b79c	7879 - adjust documentation to use naruto dataset, since pokemon is now gated (#7880 ) * 7879 - adjust documentation to use naruto dataset, since pokemon is now gated * replace references to pokemon in docs * more references to pokemon replaced * Japanese translation update --------- Co-authored-by: bghira <bghira@users.github.com>	2024-05-07 09:36:39 -07:00
39th president of the United States, probably	9d16daaf64	Add DREAM training (#6381 ) A new function compute_dream_and_update_latents has been added to the training utilities that allows you to do DREAM rectified training in line with the paper https://arxiv.org/abs/2312.00210. The method can be used with an extra argument in the train_text_to_image.py script. Co-authored-by: Jimmy <39@🇺🇸.com>	2024-04-27 07:19:15 +05:30
M. Tolga Cangöz	5a54dc9e95	Fix typos in text_to_image examples (#7050 ) Update copyright information and fix typos in text_to_image examples	2024-02-21 16:40:45 -08:00
Yudong Jin	49644babd3	Fix the test script in examples/text_to_image/README.md (#6209 ) * Update examples/text_to_image/README.md * Update examples/text_to_image/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-18 15:36:00 +05:30
M. Tolga Cangöz	0a401b95b7	[`Docs`] Fix typos (#6122 ) Fix typos and trim trailing whitespaces	2023-12-11 10:55:28 -08:00
Younes Belkada	c2717317f0	[`PEFT`] Adapt example scripts to use PEFT (#5388 ) * adapt example scripts to use PEFT * Update examples/text_to_image/train_text_to_image_lora.py * fix * add for SDXL * oops * make sure to install peft * fix * fix * fix dreambooth and lora * more fixes * add peft to requirements.txt * fix * final fix * add peft version in requirements * remove comment * change variable names * add few lines in readme * add to reqs * style * fix issues * fix lora dreambooth xl tests * init_lora_weights to gaussian and add out proj where missing * ammend requirements. * ammend requirements.txt * add correct peft versions --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-07 09:39:29 +05:30
Sayak Paul	5175d3d7a5	add: train to text image with sdxl script. (#4505 ) * add: train to text image with sdxl script. Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> * fix: partial func. * fix: default value of output_dir. * make style * set num inference steps to 25. * remove mentions of LoRA. * up min version * add: ema cli arg * run device placement while running step. * precompute vae encodings too. * fix * debug * should work now. * debug * debug * goes alright? * style * debugging * debugging * debugging * debugging * fix * reinit scheduler if prediction_type was passed. * akways cast vae in float32 * better handling of snr. Co-authored-by: bghira <bghira@users.github.com> * the vae should be also passed * add: docs. * add: sdlx t2i tests * save the pipeline * autocast. * fix: save_model_card * fix: save_model_card. --------- Co-authored-by: CaptnSeraph <s3raph1m@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: bghira <bghira@users.github.com>	2023-08-16 09:02:49 +05:30
takuoko	9c29bc2df8	[Examples] Support train_text_to_image_lora_sdxl.py (#4365 ) * add train_text_to_image_lora_sdxl.py * add train_text_to_image_lora_sdxl.py * add test and minor fix * Update examples/text_to_image/README_sdxl.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix unwrap_model rule * add invisible-watermark in requirements * del invisible-watermark * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/README_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update examples/text_to_image/train_text_to_image_lora_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * del comment & update readme --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-06 13:47:20 +05:30
Sayak Paul	4870626728	[Examples] Improve the model card pushed from the `train_text_to_image.py` script (#3810 ) * refactor: readme serialized from the example when push_to_hub is True. * fix: batch size arg. * a bit better formatting * minor fixes. * add note on env. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * condition wandb info better * make mixed_precision assignment in cli args explicit. * separate inference block for sample images. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * address more comments. * autocast mode. * correct none image type problem. * ifx: list assignment. * minor fix. --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-06-20 08:59:41 +05:30
Will Berman	3ddc2b7395	[train text to image] add note to loading from checkpoint (#3806 ) add note to loading from checkpoint	2023-06-16 11:54:49 +05:30
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180 ) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Sayak Paul	3b641eabe9	feat: verfication of multi-gpu support for select examples. (#3126 ) * feat: verfication of multi-gpu support for select examples. * add: multi-gpu training sections to the relvant doc pages.	2023-04-18 08:36:13 +05:30
Sayak Paul	24947317a6	[Examples] Add support for Min-SNR weighting strategy for better convergence (#2899 ) * improve stable unclip doc. * feat: support for applying min-snr weighting for faster convergence. * add: support for validation logging with wandb * make not a required arg. * fix: arg name. * fix: cli args. * fix: tracker config. * fix: loss calculation. * fix: validation logging. * fix: unwrap call. * fix: validation logging. * fix: internval. * fix: checkpointing push to hub. * fix: `c8a2856c6d`\#commitcomment-106913193 * fix: norm group test for UNet3D. * address PR comments. * remove unneeded code. * add: entry in the readme and docs. * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> --------- Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-04-06 19:08:40 +05:30
Mishig	8e35ef0142	[doc wip] literalinclude (#2718 )	2023-03-23 13:42:54 +01:00
zxypro	f0b661b8fb	[Docs]Fix invalid link to Pokemons dataset (#2583 )	2023-03-07 14:26:09 +01:00
Pedro Cuenca	8178c840f2	Mention training problems with xFormers 0.0.16 (#2254 )	2023-02-06 11:19:26 +01:00
Sayak Paul	7d96b38b70	[examples] Fix CLI argument in the launch script command for text2image with LoRA (#2171 ) * Update README.md * Update README.md	2023-01-31 09:47:09 +01:00
Sayak Paul	c1184918c5	[docs] Adds a doc on LoRA support for diffusers (#2086 ) * add: a doc on LoRA support in diffusers. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * apply PR suggestions. * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * remove visually incoherent elements. Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-01-25 12:23:12 +01:00
Sayak Paul	ffb3a26c5c	[LoRA] Adds example on text2image fine-tuning with LoRA (#2031 ) * example on fine-tuning with LoRA. * apply make quality. * fix: pipeline loading. * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * apply suggestions for PR review. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * apply make style and make quality. * chore: remove mention of dreambooth from text2image. * add: weight path and wandb run link. * Apply suggestions from code review * apply make style. * make style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-01-23 08:31:07 +01:00
Katsuya	8874027efc	Make xformers optional even if it is available (#1753 ) * Make xformers optional even if it is available * Raise exception if xformers is used but not available * Rename use_xformers to enable_xformers_memory_efficient_attention * Add a note about xformers in README * Reformat code style	2022-12-27 19:47:50 +01:00
Suraj Patil	c228331068	[examples] add check_min_version (#1550 ) * add check_min_version for examples * move __version__ to the top * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * fix comment * fix error_message * adapt the install message Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-06 14:36:50 +01:00
Pedro Gabriel Gengo Lourenço	4f596599f4	Fix training docs to install datasets (#1476 ) Fixed doc to install from training packages	2022-12-02 15:52:04 +01:00
Suraj Patil	6c56f05097	v-prediction training support (#1455 ) * add get_velocity * add v prediction for training * fix saving * add revision arg * fix saving * save checkpoints dreambooth * fix saving embeds * add instruction in readme * quality * noise_pred -> model_pred	2022-11-28 17:46:54 +01:00
Suraj Patil	8b84f85192	[examples] fix mixed_precision arg (#1359 ) * use accelerator to check mixed_precision * default `mixed_precision` to `None` * pass mixed_precision to accelerate launch	2022-11-22 13:35:23 +01:00
Pedro Cuenca	6b185b6acd	Update training and fine-tuning docs (#1020 ) * Update training and fine-tuning docs. * Update examples README. * Update README. * Add Flax fine-tuning section. * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-28 21:02:08 +02:00
Suraj Patil	52f2128dc6	update readme for flax examples (#1026 )	2022-10-27 15:25:25 +02:00
Duong A. Nguyen	abe058221c	[Flax] Add finetune Stable Diffusion (#999 ) * [Flax] Add finetune Stable Diffusion * temporary fix * drop_last and seed * add dtype for mixed precision training * style * Add Flax example	2022-10-27 14:08:21 +02:00
Suraj Patil	66a5279a94	stable diffusion fine-tuning (#356 ) * begin text2image script * loading the datasets, preprocessing & transforms * handle input features correctly * add gradient checkpointing support * fix output names * run unet in train mode not text encoder * use no_grad instead of freezing params * default max steps None * pad to longest * don't pad when tokenizing * fix encode on multi gpu * fix stupid bug * add random flip * add ema * fix ema * put ema on cpu * improve EMA model * contiguous_format * don't warp vae and text encode in accelerate * remove no_grad * use randn_like * fix resize * improve few things * log epoch loss * set log level * don't log each step * remove max_length from collate * style * add report_to option * make scale_lr false by default * add grad clipping * add an option to use 8bit adam * fix logging in multi-gpu, log every step * more comments * remove eval for now * adress review comments * add requirements file * begin readme * begin readme * fix typo * fix push to hub * populate readme * update readme * remove use_auth_token from the script * address some review comments * better mixed precision support * remove redundant to * create ema model early * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * better description for train_data_dir * add diffusers in requirements * update dataset_name_mapping * update readme * add inference example Co-authored-by: anton-l <anton@huggingface.co> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-10-11 19:03:39 +02:00

32 Commits