diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Pedro Cuenca	1d7b4b60b7	Ruff: apply same rules as in transformers (#2827 ) * Apply same ruff settings as in transformers See https://github.com/huggingface/transformers/blob/main/pyproject.toml Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com> * Apply new style rules * Style Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com> * style * remove list, ruff wouldn't auto fix. --------- Co-authored-by: Aaron Gokaslan <aaronGokaslan@gmail.com>	2023-03-27 16:18:57 +02:00
Sayak Paul	abb22b4eeb	Update `examples` README.md to include the latest examples (#2839 )	2023-03-27 19:34:58 +05:30
Haofan Wang	4a98d6e097	Update train_text_to_image_lora.py (#2795 )	2023-03-24 11:45:35 +05:30
Haofan Wang	dc5b4e2342	Update train_text_to_image_lora.py (#2767 ) * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py * format	2023-03-23 14:28:47 +01:00
Nipun Jindal	055c90f589	[2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipeline (#2779 ) [2737]: Add DPMSolverMultistepScheduler to CLIP guided community pipelines Co-authored-by: njindal <njindal@adobe.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-23 14:20:24 +01:00
Naoki Ainoya	14e3a28c12	Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' (#2732 ) The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.	2023-03-23 13:49:22 +01:00
Mishig	8e35ef0142	[doc wip] literalinclude (#2718 )	2023-03-23 13:42:54 +01:00
Sayak Paul	9dc84448ac	[Examples] InstructPix2Pix instruct training script (#2478 ) * add: initial implementation of the pix2pix instruct training script. * shorten cli arg. * fix: main process check. * fix: dataset column names. * simplify tokenization. * proper placement of null conditions. * apply styling. * remove debugging message for conditioning do. * complete license. * add: requirements.tzt * wandb column name order. * fix: augmentation. * change: dataset_id. * fix: convert_to_np() call. * fix: reshaping. * fix: final ema copy. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address PR comments. * add: readme details. * config fix. * downgrade version. * reduce image width in the readme. * note on hyperparameters during generation. * add: output images. * update readme. * minor edits to readme. * debugging statement. * explicitly placement of the pipeline. * bump minimum diffusers version. * fix: device attribute error. * weight dtype. * debugging. * add dtype inform. * add seoarate te and vae. * add: explicit casting/ * remove casting. * up. * up 2. * up 3. * autocast. * disable mixed-precision in the final inference. * debugging information. * autocasting. * add: instructpix2pix training section to the docs. * Empty-Commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-23 10:15:01 +05:30
Haofan Wang	e0d8c9ef83	Support for Offset Noise in examples (#2753 ) * add noise offset * make style	2023-03-23 09:36:17 +05:30
Patrick von Platen	ca1a22296d	[MS Text To Video] Add first text to video (#2738 ) * [MS Text To Video} Add first text to video * upload * make first model example * match unet3d params * make sure weights are correcctly converted * improve * forward pass works, but diff result * make forward work * fix more * finish * refactor video output class. * feat: add support for a video export utility. * fix: opencv availability check. * run make fix-copies. * add: docs for the model components. * add: standalone pipeline doc. * edit docstring of the pipeline. * add: right path to TransformerTempModel * add: first set of tests. * complete fast tests for text to video. * fix bug * up * three fast tests failing. * add: note on slow tests * make work with all schedulers * apply styling. * add slow tests * change file name * update * more correction * more fixes * finish * up * Apply suggestions from code review * up * finish * make copies * fix pipeline tests * fix more tests * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * apply suggestions * up * revert --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-03-22 18:39:33 +01:00
Hyowon Ha	58bcf46a8f	Add guidance start/end parameters to StableDiffusionControlNetImg2ImgPipeline (#2731 ) * Add guidance start/end parameters to community controlnet img2img pipeline * Fix formats	2023-03-21 14:38:43 +01:00
Nipun Jindal	0042efd015	[1929]: Add CLIP guidance for Img2Img stable diffusion pipeline (#2723 ) * [Img2Img]: Copyover img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline * [Img2Img]: img2img pipeline --------- Co-authored-by: njindal <njindal@adobe.com>	2023-03-21 13:53:00 +01:00
Andy	116f70cbf8	Enabling gradient checkpointing for VAE (#2536 ) * updated black format * update black format * make style format * updated line endings * update code formatting * Update examples/research_projects/onnxruntime/text_to_image/train_text_to_image.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/vae.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/models/vae.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * added vae gradient checkpointing test * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Will Berman <wlbberman@gmail.com>	2023-03-17 14:59:38 -07:00
Will Berman	a4b2c2f150	train_unconditional save restore unet parameters (#2706 )	2023-03-16 16:15:56 -07:00
Patrick von Platen	e828232780	Rename attention (#2691 ) * rename file * rename attention * fix more * rename more * up * more deprecation imports * fixes	2023-03-16 00:35:54 +01:00
Henrik Forstén	79eb3d07d0	Controlnet training (#2545 ) * Controlnet training code initial commit Works with circle dataset: https://github.com/lllyasviel/ControlNet/blob/main/docs/train.md * Script for adding a controlnet to existing model * Fix control image transform Control image should be in 0..1 range. * Add license header and remove more unused configs * controlnet training readme * Allow nonlocal model in add_controlnet.py * Formatting * Remove unused code * Code quality * Initialize controlnet in training script * Formatting * Address review comments * doc style * explicit constructor args and submodule names * hub dataset NOTE - not tested * empty prompts * add conditioning image * rename * remove instance data dir * image_transforms -> -1,1 . conditioning_image_transformers -> 0, 1 * nits * remove local rank config I think this isn't necessary in any of our training scripts * validation images * proportion_empty_prompts typo * weight copying to controlnet bug * call log validation fix * fix * gitignore wandb * fix progress bar and resume from checkpoint iteration * initial step fix * log multiple images * fix * fixes * tracker project name configurable * misc * add controlnet requirements.txt * update docs * image labels * small fixes * log validation using existing models for pipeline * fix for deepspeed saving * memory usage docs * Update examples/controlnet/train_controlnet.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/train_controlnet.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * remove extra is main process check * link to dataset in intro paragraph * remove unnecessary paragraph * note on deepspeed * Update examples/controlnet/README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * assert -> value error * weights and biases note * move images out of git * remove .gitignore --------- Co-authored-by: William Berman <WLBberman@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-14 20:16:30 -07:00
aengusng8	268ebcb015	Add ddim noise comparative analysis pipeline (#2665 ) * add DDIM Noise Comparative Analysis pipeline * update README * add comments * run BLACK format	2023-03-14 18:09:55 +01:00
Will Berman	ebd44957fc	image generation main process checks (#2631 )	2023-03-14 01:28:03 -07:00
Ruizhe Wang	ce08cb72fb	[Dreambooth] Editable number of class images (#2251 ) * [Dreambooth] Editable number of class images * 'class_num=None' bug fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-10 14:15:16 +01:00
Patrick von Platen	cbbad0af69	correct example	2023-03-08 20:14:19 +01:00
Haofan Wang	00132de359	Support LoRA for text encoder (#2588 ) * add lora * Update examples/research_projects/lora/README.md --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-03-08 20:14:01 +01:00
Michael Gartsbein	46bef6e31d	community stablediffusion controlnet img2img pipeline (#2584 ) Co-authored-by: mishka <gartsocial@gmail.com>	2023-03-07 13:31:56 -08:00
zxypro	f0b661b8fb	[Docs]Fix invalid link to Pokemons dataset (#2583 )	2023-03-07 14:26:09 +01:00
Isamu Isozaki	8552fd7efa	Added multitoken training for textual inversion. Issue 369 (#661 ) * Added multitoken training for textual inversion * Updated assertion * Removed duplicate save code * Fixed undefined bug * Fixed save * Added multitoken clip model +util helper * Removed code splitting * Removed class * Fixed errors * Fixed errors * Added loading functionality * Loading via dict instead * Fixed bug of invalid index being loaded * Fixed adding placeholder token only adding 1 token * Fixed bug when initializing tokens * Fixed bug when initializing tokens * Removed flawed logic * Fixed vector shuffle * Fixed tokenizer's inconsistent __call__ method * Fixed tokenizer's inconsistent __call__ method * Handling list input * Added exception for adding invalid tokens to token map * Removed unnecessary files and started working on progressive tokens * Set at minimum load one token * Changed to global step * Added method to load automatic1111 tokens * Fixed bug in load * Quality+style fixes * Update quality/style fixes * Cast embeddings to fp16 when loading * Fixed quality * Started moving things over * Clearing diffs * Clearing diffs * Moved everything * Requested changes	2023-03-07 12:09:36 +01:00
Pedro Cuenca	d3ce6f4b1e	Support revision in Flax text-to-image training (#2567 ) Support revision in Flax text-to-image training.	2023-03-07 08:16:31 +01:00
Will Berman	ca7ca11bcd	community controlnet inpainting pipelines (#2561 ) * community controlnet inpainting pipelines * add community member attribution re: @pcuenca	2023-03-06 12:55:31 -08:00
YiYi Xu	c7da8fd233	add intermediate logging for dreambooth training script (#2557 ) * add intermediate logging --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-03-06 08:13:12 -10:00
Patrick von Platen	b8bfef2ab9	make style	2023-03-06 19:11:45 +01:00
haixinxu	f3f626d556	Allow textual_inversion_flax script to use save_steps and revision flag (#2075 ) * Update textual_inversion_flax.py * Update textual_inversion_flax.py * Typo sorry. * Format source	2023-03-06 19:11:27 +01:00
Patrick von Platen	56958e1177	[Training] Fix tensorboard typo (#2566 )	2023-03-06 15:13:38 +01:00
Naga Sai Abhinay	2e3541d7f4	[Community Pipeline] Unclip Image Interpolation (#2400 ) * unclip img interpolation poc * Added code sample and refactoring.	2023-03-05 16:55:30 -08:00
Alex McKinney	5e5ce13e2f	adds `xformers` support to `train_unconditional.py` (#2520 )	2023-03-03 18:35:59 +01:00
Patrick von Platen	3d2648d743	[Post release] Push post release (#2546 )	2023-03-03 18:11:01 +01:00
Patrick von Platen	f20c8f5a1a	Release: v0.14.0	2023-03-03 16:45:08 +01:00
Isamu Isozaki	d9b9533c7e	Textual inv make save log both steps (#2178 ) * Initial commit * removed images * Made logging the same as save * Removed logging function * Quality fixes * Quality fixes * Tested * Added support back for validation_epochs * Fixing styles * Did changes * Change to log_validation * Add extra space after wandb import * Add extra space after wandb Co-authored-by: Will Berman <wlbberman@gmail.com> * Fixed spacing --------- Co-authored-by: Will Berman <wlbberman@gmail.com>	2023-03-02 19:04:18 +01:00
Patrick von Platen	eadf0e2555	[Copyright] 2023 (#2524 )	2023-03-01 10:31:00 +01:00
Pedro Cuenca	5de4347663	Fix test `train_unconditional` (#2481 ) * Fix tensorboard tracking with `accelerate` @ `main` * Fix `train_unconditional.py` with accelerate from main.	2023-02-24 14:31:16 -08:00
Haofan Wang	589faa8c88	Update train_text_to_image_lora.py (#2464 ) * Update train_text_to_image_lora.py * Update train_text_to_image_lora.py	2023-02-23 11:08:21 +05:30
Patrick von Platen	3231712b7d	Post release 0.14	2023-02-17 23:57:46 +02:00
Patrick von Platen	b2c1e0d6d4	Release: v0.13.0	2023-02-17 23:38:05 +02:00
Will Berman	bfdffbea32	add xformers 0.0.16 warning message (#2345 ) * add xformers 0.0.16 warning message * fix version check to check whole version string	2023-02-17 13:25:46 -08:00
Will Berman	5979089713	Revert "Release: v0.13.0" (#2405 ) This reverts commit `024c4376fb`.	2023-02-17 10:48:16 -08:00
Patrick von Platen	024c4376fb	Release: v0.13.0	2023-02-17 18:46:00 +02:00
Patrick von Platen	ca980fd0d1	[Examples] Make sure EMA works with any device (#2382 ) * Fix EMA * up * update	2023-02-16 21:27:47 +01:00
Will Berman	b214bb25f8	train_text_to_image EMAModel saving (#2341 )	2023-02-16 14:40:28 +01:00
Damian Stewart	e3ddbe25ed	Fix 3-way merging with the checkpoint_merger community pipeline (#2355 ) correctly locate 3rd file; also correct misleading docs	2023-02-16 10:52:41 +01:00
Will Berman	46def7265f	checkpointing_steps_total_limit->checkpoints_total_limit (#2374 )	2023-02-16 00:28:58 -08:00
Will Berman	296b01e1a1	add total number checkpoints to training scripts (#2367 ) * add total number checkpoints to training scripts * Update examples/dreambooth/train_dreambooth.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-02-15 23:58:06 -08:00
Patrick von Platen	91925fbb76	Fix callback type hints - no optional function argument (#2357 ) replace type hints	2023-02-14 14:35:05 -08:00
Ben Evans	0db19da01f	Log Unconditional Image Generation Samples to W&B (#2287 ) * Log Unconditional Image Generation Samples to WandB * Check for wandb installation and parity between onnxruntime script * Log epoch to wandb * Check for tensorboard logger early on * style fixes --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-02-14 23:11:12 +01:00

... 17 18 19 20 21 ...

1269 Commits