diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
SkyCol	074e12358b	Add prompt about wandb in examples/dreambooth/readme. (#10014 ) Add files via upload	2024-11-25 18:42:06 +05:30
Linoy Tsaban	c4b5d2ff6b	[SD3 dreambooth lora] smol fix to checkpoint saving (#9993 ) * smol change to fix checkpoint saving & resuming (as done in train_dreambooth_sd3.py) * style * modify comment to explain reasoning behind hidden size check	2024-11-24 18:51:06 +02:00
Parag Ekbote	cc7d88f247	Move IP Adapter Scripts to research project (#9960 ) * Move files to research-projects. * docs: add IP Adapter training instructions * Delete venv * Update examples/ip_adapter/tutorial_train_sdxl.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Cherry-picked commits and re-moved files to research_projects. * make style. * Update toctree and delete ip_adapter. * Nit Fix * Fix nit. * Fix nit. * Create training script for single GPU and set model format to .safetensors * Add sample inference script and restore _toctree * Restore toctree.yaml * fix spacing. * Update toctree.yaml --------- Co-authored-by: AMohamedAakhil <a.aakhilmohamed@gmail.com> Co-authored-by: BootesVoid <78485654+AMohamedAakhil@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-19 10:37:22 -08:00
Linoy Tsaban	acf479bded	[advanced flux training] bug fix + reduce memory cost as in #9829 (#9838 ) * memory improvement as done here: https://github.com/huggingface/diffusers/pull/9829 * fix bug * fix bug * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-19 08:43:36 +05:30
Parag Ekbote	03bf77c4af	Notebooks for Community Scripts-2 (#9952 ) 4 Notebooks for Community Scripts and minor script improvements.	2024-11-18 12:58:57 -08:00
Grant Sherrick	c3c94fe71b	Add server example (#9918 ) * Add server example. * Minor updates to README. * Add fixes after local testing. * Apply suggestions from code review Updates to README from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * More doc updates. * Maybe this will work to build the docs correctly? * Fix style issues. * Fix toc. * Minor reformatting. * Move docs to proper loc. * Fix missing tick. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Sync docs changes back to README. * Very minor update to docs to add space. --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-11-18 09:26:13 -08:00
Parag Ekbote	e255920719	Move Wuerstchen Dreambooth to research_projects (#9935 ) update file paths to research_projects folder. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-16 18:56:16 +05:30
Parag Ekbote	1dbd26fa23	Notebooks for Community Scripts Examples (#9905 ) * Add Notebooks on Community Scripts	2024-11-12 14:08:48 -10:00
Sayak Paul	d720b2132e	[Advanced LoRA v1.5] fix: gradient unscaling problem (#7018 ) fix: gradient unscaling problem Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-11-08 19:31:43 -04:00
SahilCarterr	9cc96a64f1	[FIX] Fix TypeError in DreamBooth SDXL when use_dora is False (#9879 ) * fix use_dora * fix style and quality * fix use_dora with peft version --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-08 19:09:24 -04:00
Michael Tkachuk	5b972fbd6a	Enabling gradient checkpointing in eval() mode (#9878 ) * refactored	2024-11-08 09:03:26 -10:00
Sayak Paul	ded3db164b	[Core] introduce `controlnet` module (#8768 ) * move vae flax module. * controlnet module. * prepare for PR. * revert a commit * gracefully deprecate controlnet deps. * fix * fix doc path * fix-copies * fix path * style * style * conflicts * fix * fix-copies * sparsectrl. * updates * fix * updates * updates * updates * fix --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-11-06 22:08:55 -04:00
SahilCarterr	76b7d86a9a	Updated _encode_prompt_with_clip and encode_prompt in train_dreamboth_sd3 (#9800 ) * updated encode prompt and clip encod prompt --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-05 15:08:50 -10:00
Sookwan Han	e2b3c248d8	Add new community pipeline for 'Adaptive Mask Inpainting', introduced in [ECCV2024] ComA (#9228 ) * Add new community pipeline for 'Adaptive Mask Inpainting', introduced in [ECCV2024] Beyond the Contact: Discovering Comprehensive Affordance for 3D Objects from Pre-trained 2D Diffusion Models	2024-11-05 15:05:58 -10:00
Dorsa Rohani	c10f875ff0	Add Diffusion Policy for Reinforcement Learning (#9824 ) * enable cpu ability * model creation + comprehensive testing * training + tests * all tests working * remove unneeded files + clarify docs * update train tests * update readme.md * remove data from gitignore * undo cpu enabled option * Update README.md * update readme * code quality fixes * diffusion policy example * update readme * add pretrained model weights + doc * add comment * add documentation * add docstrings * update comments * update readme * fix code quality * Update examples/reinforcement_learning/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/reinforcement_learning/diffusion_policy.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * suggestions + safe globals for weights_only=True * suggestions + safe weights loading * fix code quality * reformat file --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-02 09:18:44 +05:30
Leo Jiang	a98a839de7	Reduce Memory Cost in Flux Training (#9829 ) * Improve NPU performance * Improve NPU performance * Improve NPU performance * Improve NPU performance * [bugfix] bugfix for npu free memory * [bugfix] bugfix for npu free memory * [bugfix] bugfix for npu free memory * Reduce memory cost for flux training process --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-01 12:19:32 +05:30
Boseong Jeon	3deed729e6	Handling mixed precision for dreambooth flux lora training (#9565 ) Handling mixed precision and add unwrap Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-11-01 10:16:05 +05:30
ScilenceForest	7ffbc2525f	Update train_controlnet_flux.py,Fix size mismatch issue in validation (#9679 ) Update train_controlnet_flux.py Fix the problem of inconsistency between size of image and size of validation_image which causes np.stack to report error. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-11-01 10:15:10 +05:30
Leo Jiang	9dcac83057	NPU Adaption for FLUX (#9751 ) * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX * NPU implementation for FLUX --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com>	2024-11-01 09:03:15 +05:30
Abhipsha Das	c75431843f	[Model Card] standardize advanced diffusion training sd15 lora (#7613 ) * modelcard generation edit * add missed tag * fix param name * fix var * change str to dict * add use_dora check * use correct tags for lora * make style && make quality --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-11-01 03:23:00 +05:30
Sayak Paul	8ce37ab055	[training] use the lr when using 8bit adam. (#9796 ) * use the lr when using 8bit adam. * remove lr as we pack it in params_to_optimize. --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-10-31 15:51:42 +05:30
Sayak Paul	09b8aebd67	[training] fixes to the quantization training script and add AdEMAMix optimizer as an option (#9806 ) * fixes * more fixes.	2024-10-31 15:46:00 +05:30
Raul Ciotescu	c5376c5695	adds the pipeline for pixart alpha controlnet (#8857 ) * add the controlnet pipeline for pixart alpha --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: junsongc <cjs1020440147@icloud.com>	2024-10-28 08:48:04 -10:00
Linoy Tsaban	743a5697f2	[flux dreambooth lora training] make LoRA target modules configurable + small bug fix (#9646 ) * make lora target modules configurable and change the default * style * make lora target modules configurable and change the default * fix bug when using prodigy and training te * fix mixed precision training as proposed in https://github.com/huggingface/diffusers/pull/9565 for full dreambooth as well * add test and notes * style * address sayaks comments * style * fix test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-28 17:27:41 +02:00
Linoy Tsaban	db5b6a9630	[SD 3.5 Dreambooth LoRA] support configurable training block & layers (#9762 ) * configurable layers * configurable layers * update README * style * add test * style * add layer test, update readme, add nargs * readme * test style * remove print, change nargs * test arg change * style * revert nargs 2/2 * address sayaks comments * style * address sayaks comments	2024-10-28 16:07:54 +02:00
Biswaroop	493aa74312	[Fix] remove setting lr for T5 text encoder when using prodigy in flux dreambooth lora script (#9473 ) * fix: removed setting of text encoder lr for T5 as it's not being tuned * fix: removed setting of text encoder lr for T5 as it's not being tuned --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-10-28 13:07:30 +02:00
Vinh H. Pham	3b5b1c5698	[Fix] train_dreambooth_lora_flux_advanced ValueError: unexpected save model: <class 'transformers.models.t5.modeling_t5.T5EncoderModel'> (#9777 ) fix save state te T5	2024-10-28 12:52:27 +02:00
Sayak Paul	fddbab7993	[research_projects] Update README.md to include a note about NF5 T5-xxl (#9775 ) Update README.md	2024-10-26 22:13:03 +09:00
Ina	73b59f5203	[refactor] enhance readability of flux related pipelines (#9711 ) * flux pipeline: readability enhancement.	2024-10-25 11:01:51 -10:00
Sayak Paul	df073ba137	[research_projects] add flux training script with quantization (#9754 ) * add flux training script with quantization * remove exclamation	2024-10-26 00:07:57 +09:00
Linoy Tsaban	bfa0aa4ff2	[SD3-5 dreambooth lora] update model cards (#9749 ) * improve readme * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-23 23:16:53 +03:00
Sayak Paul	e45c25d03a	post-release 0.31.0 (#9742 ) * post-release * style	2024-10-22 20:42:30 +05:30
Yu Zheng	b0ffe92230	Update sd3 controlnet example (#9735 ) * use make_image_grid in diffusers.utils * use checkpoint on the Hub	2024-10-22 09:02:16 +05:30
Tolga Cangöz	1b64772b79	Fix `schedule_shifted_power` usage in 🪆Matryoshka Diffusion Models (#9723 ) * [matryoshka.py] Add schedule_shifted_power attribute and update get_schedule_shifted method	2024-10-21 14:23:50 -10:00
G.O.D	63a0c9e5f7	[bugfix] reduce float value error when adding noise (#9004 ) * Update train_controlnet.py reduce float value error for bfloat16 * Update train_controlnet_sdxl.py * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-10-21 13:26:05 -10:00
hlky	89565e9171	Add prompt scheduling callback to community scripts (#9718 )	2024-10-19 14:22:22 -03:00
Linoy Tsaban	2541d141d5	[advanced flux lora script] minor updates to readme (#9705 ) * fix arg naming * fix arg naming * fix arg naming * fix arg naming	2024-10-18 15:35:44 +03:00
Linoy Tsaban	9a7f824645	[Flux] Add advanced training script + support textual inversion inference (#9434 ) * add ostris trainer to README & add cache latents of vae * add ostris trainer to README & add cache latents of vae * style * readme * add test for latent caching * add ostris noise scheduler `9ee1ef2a0a/toolkit/samplers/custom_flowmatch_sampler.py (L95)` * style * fix import * style * fix tests * style * --change upcasting of transformer? * update readme according to main * add pivotal tuning for CLIP * fix imports, encode_prompt call,add TextualInversionLoaderMixin to FluxPipeline for inference * TextualInversionLoaderMixin support for FluxPipeline for inference * move changes to advanced flux script, revert canonical * add latent caching to canonical script * revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160 * revert changes to canonical script to keep it separate from https://github.com/huggingface/diffusers/pull/9160 * style * remove redundant line and change code block placement to align with logic * add initializer_token arg * add transformer frac for range support from pure textual inversion to the orig pivotal tuning * support pure textual inversion - wip * adjustments to support pure textual inversion and transformer optimization in only part of the epochs * fix logic when using initializer token * fix pure_textual_inversion_condition * fix ti/pivotal loading of last validation run * remove embeddings loading for ti in final training run (to avoid adding huggingface hub dependency) * support pivotal for t5 * adapt pivotal for T5 encoder * adapt pivotal for T5 encoder and support in flux pipeline * t5 pivotal support + support fo pivotal for clip only or both * fix param chaining * fix param chaining * README first draft * readme * readme * readme * style * fix import * style * add fix from https://github.com/huggingface/diffusers/pull/9419 * add to readme, change function names * te lr changes * readme * change concept tokens logic * fix indices * change arg name * style * dummy test * revert dummy test * reorder pivoting * add warning in case the token abstraction is not the instance prompt * experimental - wip - specific block training * fix documentation and token abstraction processing * remove transformer block specification feature (for now) * style * fix copies * fix indexing issue when --initializer_concept has different amounts * add if TextualInversionLoaderMixin to all flux pipelines * style * fix import * fix imports * address review comments - remove necessary prints & comments, use pin_memory=True, use free_memory utils, unify warning and prints * style * logger info fix * make lora target modules configurable and change the default * make lora target modules configurable and change the default * style * make lora target modules configurable and change the default, add notes to readme * style * add tests * style * fix repo id * add updated requirements for advanced flux * fix indices of t5 pivotal tuning embeddings * fix path in test * remove `pin_memory` * fix filename of embedding * fix filename of embedding --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-17 12:22:11 +03:00
Linoy Tsaban	ee4ab23892	[SD3 dreambooth-lora training] small updates + bug fixes (#9682 ) * add latent caching + smol updates * update license * replace with free_memory * add --upcast_before_saving to allow saving transformer weights in lower precision * fix models to accumulate * fix mixed precision issue as proposed in https://github.com/huggingface/diffusers/pull/9565 * smol update to readme * style * fix caching latents * style * add tests for latent caching * style * fix latent caching --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-16 11:13:37 +03:00
Aryan	2ffbb88f1c	[training] CogVideoX-I2V LoRA (#9482 ) * update * update * update * update * update * add coauthor Co-Authored-By: yuan-shenghai <963658029@qq.com> * add coauthor Co-Authored-By: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com> * update Co-Authored-By: yuan-shenghai <963658029@qq.com> * update --------- Co-authored-by: yuan-shenghai <963658029@qq.com> Co-authored-by: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com>	2024-10-16 02:07:07 +05:30
wony617	fff4be8e23	[docs] refactoring docstrings in `community/hd_painter.py` (#9593 ) * [docs] refactoring docstrings in community/hd_painter.py * Update examples/community/hd_painter.py Co-authored-by: Aryan <contact.aryanvs@gmail.com> * make style --------- Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-15 18:50:12 +05:30
0x名無し	dccf39f01e	Dreambooth lora flux bug 3dtensor to 2dtensor (#9653 ) * fixed issue #9350, Tensor is deprecated * ran make style	2024-10-15 17:18:13 +05:30
Tolga Cangöz	56c21150d8	[`Community Pipeline`] Add 🪆Matryoshka Diffusion Models (#9157 )	2024-10-14 11:38:44 -10:00
Leo Jiang	5956b68a69	Improve the performance and suitable for NPU computing (#9642 ) * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU computing * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-14 21:39:33 +05:30
Ryan Lin	68d16f7806	Flux - soft inpainting via differential diffusion (#9268 ) * Flux - soft inpainting via differential diffusion * . * track changes to FluxInpaintPipeline * make mask arrangement simpler * make style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: asomoza <somoza.alvaro@gmail.com>	2024-10-14 10:07:48 -03:00
M Saqlain	3033f08201	Add Differential Diffusion to Kolors (#9423 ) * Added diff diff support for kolors img2img * Fixed relative imports * Fixed relative imports * Added diff diff support for Kolors * Fixed import issues * Added map * Fixed import issues * Fixed naming issues * Added diffdiff support for Kolors img2img pipeline * Removed example docstrings * Added map input * Updated latents Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Updated `original_with_noise` Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Improved code quality --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2024-10-11 10:47:31 -03:00
GSSun	164ec9f423	fix IsADirectoryError when running the training code for sd3_dreambooth_lora_16gb.ipynb (#9634 ) Add files via upload fix IsADirectoryError when running the training code	2024-10-11 13:33:39 +05:30
glide-the	66eef9a6dc	fix: CogVideox train dataset _preprocess_data crop video (#9574 ) * Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure. Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none']. * The number 127.5 may experience precision loss during division operations. * wandb request pil image Type * Resizing bug * del jupyter * make style * Update examples/cogvideo/README.md * make style --------- Co-authored-by: --unset <--unset> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-08 12:52:52 +05:30
captainzz	2cb383f591	fix vae dtype when accelerate config using --mixed_precision="fp16" (#9601 ) * fix vae dtype when accelerate config using --mixed_precision="fp16" * Add param for upcast vae	2024-10-07 21:00:25 +05:30
Sayak Paul	8e7d6c03a3	[chore] fix: retain memory utility. (#9543 ) * fix: retain memory utility. * fix * quality * free_memory.	2024-09-28 21:08:45 +05:30


1058 Commits