diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Dorsa Rohani	c10f875ff0	Add Diffusion Policy for Reinforcement Learning (#9824 ) * enable cpu ability * model creation + comprehensive testing * training + tests * all tests working * remove unneeded files + clarify docs * update train tests * update readme.md * remove data from gitignore * undo cpu enabled option * Update README.md * update readme * code quality fixes * diffusion policy example * update readme * add pretrained model weights + doc * add comment * add documentation * add docstrings * update comments * update readme * fix code quality * Update examples/reinforcement_learning/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/reinforcement_learning/diffusion_policy.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * suggestions + safe globals for weights_only=True * suggestions + safe weights loading * fix code quality * reformat file --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-02 09:18:44 +05:30
Leo Jiang	a98a839de7	Reduce Memory Cost in Flux Training (#9829 ) * Improve NPU performance * Improve NPU performance * Improve NPU performance * Improve NPU performance * [bugfix] bugfix for npu free memory * [bugfix] bugfix for npu free memory * [bugfix] bugfix for npu free memory * Reduce memory cost for flux training process --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-01 12:19:32 +05:30
Boseong Jeon	3deed729e6	Handling mixed precision for dreambooth flux lora training (#9565 ) Handling mixed precision and add unwarp Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-11-01 10:16:05 +05:30
ScilenceForest	7ffbc2525f	Update train_controlnet_flux.py,Fix size mismatch issue in validation (#9679 ) Update train_controlnet_flux.py Fix the problem of inconsistency between size of image and size of validation_image which causes np.stack to report error. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-01 10:15:10 +05:30
Leo Jiang	9dcac83057	NPU Adaption for FLUX (#9751 ) * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com>	2024-11-01 09:03:15 +05:30
Abhipsha Das	c75431843f	[Model Card] standardize advanced diffusion training sd15 lora (#7613 ) * modelcard generation edit * add missed tag * fix param name * fix var * change str to dict * add use_dora check * use correct tags for lora * make style && make quality --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-11-01 03:23:00 +05:30
Sayak Paul	8ce37ab055	[training] use the lr when using 8bit adam. (#9796 ) * use the lr when using 8bit adam. * remove lr as we pack it in params_to_optimize. --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-10-31 15:51:42 +05:30
Sayak Paul	09b8aebd67	[training] fixes to the quantization training script and add AdEMAMix optimizer as an option (#9806 ) * fixes * more fixes.	2024-10-31 15:46:00 +05:30
Raul Ciotescu	c5376c5695	adds the pipeline for pixart alpha controlnet (#8857 ) * add the controlnet pipeline for pixart alpha --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: junsongc <cjs1020440147@icloud.com>	2024-10-28 08:48:04 -10:00
Linoy Tsaban	743a5697f2	[flux dreambooth lora training] make LoRA target modules configurable + small bug fix (#9646 ) * make lora target modules configurable and change the default * style * make lora target modules configurable and change the default * fix bug when using prodigy and training te * fix mixed precision training as proposed in https://github.com/huggingface/diffusers/pull/9565 for full dreambooth as well * add test and notes * style * address sayaks comments * style * fix test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-28 17:27:41 +02:00
Linoy Tsaban	db5b6a9630	[SD 3.5 Dreambooth LoRA] support configurable training block & layers (#9762 ) * configurable layers * configurable layers * update README * style * add test * style * add layer test, update readme, add nargs * readme * test style * remove print, change nargs * test arg change * style * revert nargs 2/2 * address sayaks comments * style * address sayaks comments	2024-10-28 16:07:54 +02:00
Biswaroop	493aa74312	[Fix] remove setting lr for T5 text encoder when using prodigy in flux dreambooth lora script (#9473 ) * fix: removed setting of text encoder lr for T5 as it's not being tuned * fix: removed setting of text encoder lr for T5 as it's not being tuned --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-10-28 13:07:30 +02:00
Vinh H. Pham	3b5b1c5698	[Fix] train_dreambooth_lora_flux_advanced ValueError: unexpected save model: <class 'transformers.models.t5.modeling_t5.T5EncoderModel'> (#9777 ) fix save state te T5	2024-10-28 12:52:27 +02:00
Sayak Paul	fddbab7993	[research_projects] Update README.md to include a note about NF5 T5-xxl (#9775 ) Update README.md	2024-10-26 22:13:03 +09:00
Ina	73b59f5203	[refactor] enhance readability of flux related pipelines (#9711 ) * flux pipline: readability enhancement.	2024-10-25 11:01:51 -10:00
Sayak Paul	df073ba137	[research_projects] add flux training script with quantization (#9754 ) * add flux training script with quantization * remove exclamation	2024-10-26 00:07:57 +09:00
Linoy Tsaban	bfa0aa4ff2	[SD3-5 dreambooth lora] update model cards (#9749 ) * improve readme * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-23 23:16:53 +03:00
Sayak Paul	e45c25d03a	post-release 0.31.0 (#9742 ) * post-release * style	2024-10-22 20:42:30 +05:30
Yu Zheng	b0ffe92230	Update sd3 controlnet example (#9735 ) * use make_image_grid in diffusers.utils * use checkpoint on the Hub	2024-10-22 09:02:16 +05:30
Tolga Cangöz	1b64772b79	Fix `schedule_shifted_power` usage in 🪆Matryoshka Diffusion Models (#9723 ) * [matryoshka.py] Add schedule_shifted_power attribute and update get_schedule_shifted method	2024-10-21 14:23:50 -10:00
G.O.D	63a0c9e5f7	[bugfix] reduce float value error when adding noise (#9004 ) * Update train_controlnet.py reduce float value error for bfloat16 * Update train_controlnet_sdxl.py * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-10-21 13:26:05 -10:00
hlky	89565e9171	Add prompt scheduling callback to community scripts (#9718 )	2024-10-19 14:22:22 -03:00
Linoy Tsaban	2541d141d5	[advanced flux lora script] minor updates to readme (#9705 ) * fix arg naming * fix arg naming * fix arg naming * fix arg naming	2024-10-18 15:35:44 +03:00
Linoy Tsaban	9a7f824645	[Flux] Add advanced training script + support textual inversion inference (#9434 ) * add ostris trainer to README & add cache latents of vae * add ostris trainer to README & add cache latents of vae * style * readme * add test for latent caching * add ostris noise scheduler `9ee1ef2a0a/toolkit/samplers/custom_flowmatch_sampler.py (L95)` * style * fix import * style * fix tests * style * --change upcasting of transformer? * update readme according to main * add pivotal tuning for CLIP * fix imports, encode_prompt call,add TextualInversionLoaderMixin to FluxPipeline for inference * TextualInversionLoaderMixin support for FluxPipeline for inference * move changes to advanced flux script, revert canonical * add latent caching to canonical script * revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160 * revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160 * style * remove redundant line and change code block placement to align with logic * add initializer_token arg * add transformer frac for range support from pure textual inversion to the orig pivotal tuning * support pure textual inversion - wip * adjustments to support pure textual inversion and transformer optimization in only part of the epochs * fix logic when using initializer token * fix pure_textual_inversion_condition * fix ti/pivotal loading of last validation run * remove embeddings loading for ti in final training run (to avoid adding huggingface hub dependency) * support pivotal for t5 * adapt pivotal for T5 encoder * adapt pivotal for T5 encoder and support in flux pipeline * t5 pivotal support + support fo pivotal for clip only or both * fix param chaining * fix param chaining * README first draft * readme * readme * readme * style * fix import * style * add fix from https://github.com/huggingface/diffusers/pull/9419 * add to readme, change function names * te lr changes * readme * change concept tokens logic * fix indices * change arg name * style * dummy test * revert dummy test * reorder pivoting * add warning in case the token abstraction is not the instance prompt * experimental - wip - specific block training * fix documentation and token abstraction processing * remove transformer block specification feature (for now) * style * fix copies * fix indexing issue when --initializer_concept has different amounts * add if TextualInversionLoaderMixin to all flux pipelines * style * fix import * fix imports * address review comments - remove necessary prints & comments, use pin_memory=True, use free_memory utils, unify warning and prints * style * logger info fix * make lora target modules configurable and change the default * make lora target modules configurable and change the default * style * make lora target modules configurable and change the default, add notes to readme * style * add tests * style * fix repo id * add updated requirements for advanced flux * fix indices of t5 pivotal tuning embeddings * fix path in test * remove `pin_memory` * fix filename of embedding * fix filename of embedding --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-17 12:22:11 +03:00
Linoy Tsaban	ee4ab23892	[SD3 dreambooth-lora training] small updates + bug fixes (#9682 ) * add latent caching + smol updates * update license * replace with free_memory * add --upcast_before_saving to allow saving transformer weights in lower precision * fix models to accumulate * fix mixed precision issue as proposed in https://github.com/huggingface/diffusers/pull/9565 * smol update to readme * style * fix caching latents * style * add tests for latent caching * style * fix latent caching --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-16 11:13:37 +03:00
Aryan	2ffbb88f1c	[training] CogVideoX-I2V LoRA (#9482 ) * update * update * update * update * update * add coauthor Co-Authored-By: yuan-shenghai <963658029@qq.com> * add coauthor Co-Authored-By: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com> * update Co-Authored-By: yuan-shenghai <963658029@qq.com> * update --------- Co-authored-by: yuan-shenghai <963658029@qq.com> Co-authored-by: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com>	2024-10-16 02:07:07 +05:30
wony617	fff4be8e23	[docs] refactoring docstrings in `community/hd_painter.py` (#9593 ) * [docs] refactoring docstrings in community/hd_painter.py * Update examples/community/hd_painter.py Co-authored-by: Aryan <contact.aryanvs@gmail.com> * make style --------- Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-15 18:50:12 +05:30
0x名無し	dccf39f01e	Dreambooth lora flux bug 3dtensor to 2dtensor (#9653 ) * fixed issue #9350, Tensor is deprecated * ran make style	2024-10-15 17:18:13 +05:30
Tolga Cangöz	56c21150d8	[`Community Pipeline`] Add 🪆Matryoshka Diffusion Models (#9157 )	2024-10-14 11:38:44 -10:00
Leo Jiang	5956b68a69	Improve the performance and suitable for NPU computing (#9642 ) * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU computing * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-14 21:39:33 +05:30
Ryan Lin	68d16f7806	Flux - soft inpainting via differential diffusion (#9268 ) * Flux - soft inpainting via differential diffusion * . * track changes to FluxInpaintPipeline * make mask arrangement simplier * make style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: asomoza <somoza.alvaro@gmail.com>	2024-10-14 10:07:48 -03:00
M Saqlain	3033f08201	Add Differential Diffusion to Kolors (#9423 ) * Added diff diff support for kolors img2img * Fized relative imports * Fized relative imports * Added diff diff support for Kolors * Fized import issues * Added map * Fized import issues * Fixed naming issues * Added diffdiff support for Kolors img2img pipeline * Removed example docstrings * Added map input * Updated latents Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Updated `original_with_noise` Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Improved code quality --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2024-10-11 10:47:31 -03:00
GSSun	164ec9f423	fix IsADirectoryError when running the training code for sd3_dreambooth_lora_16gb.ipynb (#9634 ) Add files via upload fix IsADirectoryError when running the training code	2024-10-11 13:33:39 +05:30
glide-the	66eef9a6dc	fix: CogVideox train dataset _preprocess_data crop video (#9574 ) * Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure. Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none']. * The number 127.5 may experience precision loss during division operations. * wandb request pil image Type * Resizing bug * del jupyter * make style * Update examples/cogvideo/README.md * make style --------- Co-authored-by: --unset <--unset> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-08 12:52:52 +05:30
captainzz	2cb383f591	fix vae dtype when accelerate config using --mixed_precision="fp16" (#9601 ) * fix vae dtype when accelerate config using --mixed_precision="fp16" * Add param for upcast vae	2024-10-07 21:00:25 +05:30
Sayak Paul	8e7d6c03a3	[chore] fix: retain memory utility. (#9543 ) * fix: retain memory utility. * fix * quality * free_memory.	2024-09-28 21:08:45 +05:30
Anand Kumar	b28675c605	[train_instruct_pix2pix.py]Fix the LR schedulers when `num_train_epochs` is passed in a distributed training env (#9316 ) Fixed pix2pix lr scheduler Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-28 21:01:37 +05:30
PromeAI	534848c370	[examples] add train flux-controlnet scripts in example. (#9324 ) * add train flux-controlnet scripts in example. * fix error * fix subfolder error * fix preprocess error * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix readme * fix note error * add some Tutorial for deepspeed * fix some Format Error * add dataset_path example * remove print, add guidance_scale CLI, readable apply * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update,push_to_hub,save_weight_dtype,static method,clear_objs_and_retain_memory,report_to=wandb * add push to hub in readme * apply weighting schemes * add note * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * make code style and quality * fix some unnoticed error * make code style and quality * add example controlnet in readme * add test controlnet * rm Remove duplicate notes * Fix formatting errors * add new control image * add model cpu offload * update help for adafactor * make quality & style * make quality and style * rename flux_controlnet_model_name_or_path * fix back src/diffusers/pipelines/flux/pipeline_flux_controlnet.py * fix dtype error by pre calculate text emb * rm image save * quality fix * fix test * fix tiny flux train error * change report to to tensorboard * fix save name error when test * Fix shrinking errors --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Your Name <you@example.com>	2024-09-27 13:31:47 +05:30
Sayak Paul	6ca5a58e43	[Community Pipeline] Batched implementation of Flux with CFG (#9513 ) * batched implementation of flux cfg. * style. * readme * remove comments.	2024-09-25 15:25:15 +05:30
captainzz	bab17789b5	fix bugs for sd3 controlnet training (#9489 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-23 13:40:44 -10:00
Aryan	2b443a5d62	[training] CogVideoX Lora (#9302 ) * cogvideox lora training draft * update * update * update * update * update * make fix-copies * update * update * apply suggestions from review * apply suggestions from reveiw * fix typo * Update examples/cogvideo/train_cogvideox_lora.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * fix lora alpha * use correct lora scaling for final test pipeline * Update examples/cogvideo/train_cogvideox_lora.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * apply suggestions from review; prodigy optimizer YiYi Xu <yixu310@gmail.com> * add tests * make style * add README * update * update * make style * fix * update * add test skeleton * revert lora utils changes * add cleaner modifications to lora testing utils * update lora tests * deepspeed stuff * add requirements.txt * deepspeed refactor * add lora stuff to img2vid pipeline to fix tests * fight tests * add co-authors Co-Authored-By: Fu-Yun Wang <1697256461@qq.com> Co-Authored-By: zR <2448370773@qq.com> * fight lora runner tests * import Dummy optim and scheduler only wheh required * update docs * add coauthors Co-Authored-By: Fu-Yun Wang <1697256461@qq.com> * remove option to train text encoder Co-Authored-By: bghira <bghira@users.github.com> * update tests * fight more tests * update * fix vid2vid * fix typo * remove lora tests; todo in follow-up PR * undo img2vid changes * remove text encoder related changes in lora loader mixin * Revert "remove text encoder related changes in lora loader mixin" This reverts commit `f8a8444487`. * update * round 1 of fighting tests * round 2 of fighting tests * fix copied from comment * fix typo in lora test * update styling Co-Authored-By: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: zR <2448370773@qq.com> Co-authored-by: Fu-Yun Wang <1697256461@qq.com> Co-authored-by: bghira <bghira@users.github.com>	2024-09-19 14:37:57 +05:30
Anatoly Belikov	5d476f57c5	adapt masked im2im pipeline for SDXL (#7790 ) * adapt masked im2im pipeline for SDXL * usage for masked im2im stable diffusion XL pipeline * style * style * style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-17 16:55:49 -10:00
Linoy Tsaban	8fcfb2a456	[Flux with CFG] add flux pipeline with cfg support (#9445 ) * true_cfg * add check negative prompt/embeds inputs * move to community pipelines * move to community pipelines * revert true cfg changes to the orig pipline * style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-16 12:09:34 -10:00
suzukimain	b52119ae92	[docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8 (#9428 ) * [docs] Replace runwayml/stable-diffusion-v1-5 with Lykon/dreamshaper-8 Updated documentation as runwayml/stable-diffusion-v1-5 has been removed from Huggingface. * Update docs/source/en/using-diffusers/inpaint.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Replace with stable-diffusion-v1-5/stable-diffusion-v1-5 * Update inpaint.md --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-09-16 10:18:45 -07:00
Linoy Tsaban	37e3603c4a	[Flux Dreambooth lora] add latent caching (#9160 ) * add ostris trainer to README & add cache latents of vae * add ostris trainer to README & add cache latents of vae * style * readme * add test for latent caching * add ostris noise scheduler `9ee1ef2a0a/toolkit/samplers/custom_flowmatch_sampler.py (L95)` * style * fix import * style * fix tests * style * --change upcasting of transformer? * update readme according to main * keep only latent caching * add configurable param for final saving of trained layers- --upcast_before_saving * style * Update examples/dreambooth/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * use clear_objs_and_retain_memory from utilities * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-15 15:30:31 +03:00
Leo Jiang	e2ead7cdcc	Fix the issue on sd3 dreambooth w./w.t. lora training (#9419 ) * Fix dtype error * [bugfix] Fixed the issue on sd3 dreambooth training * [bugfix] Fixed the issue on sd3 dreambooth training --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-14 16:29:38 +05:30
Juan Acevedo	45aa8bb187	Ptxla sd training (#9381 ) * enable pxla training of stable diffusion 2.x models. * run linter/style and run pipeline test for stable diffusion and fix issues. * update xla libraries * fix read me newline. * move files to research folder. * update per comments. * rename readme. --------- Co-authored-by: Juan Acevedo <jfacevedo@google.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-12 08:35:06 +05:30
Yu Zheng	c002731d93	[examples] add controlnet sd3 example (#9249 ) * add controlnet sd3 example * add controlnet sd3 example * update controlnet sd3 example * add controlnet sd3 example test * fix quality and style * update test * update test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-11 07:04:37 +05:30
Sayak Paul	adf1f911f0	[Tests] fix some fast gpu tests. (#9379 ) fix some fast gpu tests.	2024-09-11 06:50:02 +05:30
Linoy Tsaban	55ac421f7b	improve README for flux dreambooth lora (#9290 ) * improve readme * improve readme * improve readme * improve readme	2024-09-05 17:53:23 +05:30

1 2 3 4 5 ...

1044 Commits