diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Rafie Walker	6946facf69	Implement SD3 loss weighting (#8528 ) * Add lognorm and cosmap weighting * Implement mode sampling * Update examples/dreambooth/train_dreambooth_lora_sd3.py * Update examples/dreambooth/train_dreambooth_lora_sd3.py * Update examples/dreambooth/train_dreambooth_sd3.py * Update examples/dreambooth/train_dreambooth_sd3.py * Update examples/dreambooth/train_dreambooth_sd3.py * Update examples/dreambooth/train_dreambooth_lora_sd3.py * Update examples/dreambooth/train_dreambooth_sd3.py * Update examples/dreambooth/train_dreambooth_sd3.py * Update examples/dreambooth/train_dreambooth_lora_sd3.py * keep timestamp sampling fully on cpu --------- Co-authored-by: Kashif Rasul <kashif.rasul@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-16 20:15:50 +01:00
Jonathan Rahn	a899e42fc7	add `sentencepiece` to requirements.txt for SD3 dreambooth (#8538 ) * add `sentencepiece` requirement for SD3 add `sentencepiece` requirement * Empty-Commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-14 22:48:36 +01:00
Sayak Paul	2e4841ef1e	post release 0.29.0 (#8492 ) post release	2024-06-13 06:14:20 -10:00
Haofan Wang	8bea943714	Update requirements_sd3.txt (#8521 )	2024-06-13 17:02:17 +01:00
Ameer Azam	0240d4191a	Update README_sd3.md (#8490 ) because in Readme it was not correct train_dreambooth_sd3.py to train_dreambooth_lora_sd3	2024-06-12 21:08:36 +01:00
Dhruv Nair	04717fd861	Add Stable Diffusion 3 (#8483 ) * up * add sd3 * update * update * add tests * fix copies * fix docs * update * add dreambooth lora * add LoRA * update * update * update * update * import fix * update * Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * import fix 2 * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * update * update * update * fix ckpt id * fix more ids * update * missing doc * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update 
docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update' * fix * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py * Update src/diffusers/models/autoencoders/autoencoder_kl.py * note on gated access. * requirements * licensing --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-06-12 20:44:00 +01:00
Sayak Paul	d457beed92	Update README.md to update the MaPO project (#8470 ) Update README.md	2024-06-11 10:10:45 +01:00
Tolga Cangöz	98730c5dd7	Errata (#8322 ) * Fix typos * Trim trailing whitespaces * Remove a trailing whitespace * chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0 * Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0" This reverts commit `fd742b30b4`. * pokemon -> naruto * `DPMSolverMultistep` -> `DPMSolverMultistepScheduler` * Improve Markdown stylization * Improve style * Improve style * Refactor pipeline variable names for consistency * up style	2024-06-05 13:59:09 -07:00
Hzzone	d3881f35b7	Gligen training (#7906 ) * add training code of gligen * fix code quality tests. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-05 16:26:42 +04:00
satani99	352d96eb82	Modularize train_text_to_image_lora_sdxl inferencing during and after training in example (#8335 ) * Modularized the train_lora_sdxl file * Modularized the train_lora_sdxl file * Modularized the train_lora_sdxl file --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-31 04:52:22 +05:30
Genius Patrick	3511a9623f	fix(training): lr scheduler doesn't work properly in distributed scenarios (#8312 )	2024-05-30 15:23:19 +05:30
Tolga Cangöz	a2ecce26bc	Fix Copying Mechanism typo/bug (#8232 ) * Fix copying mechanism typos * fix copying mecha * Revert, since they are in TODO * Fix copying mechanism	2024-05-29 09:37:18 -07:00
satani99	3bc3b48c10	Modularize train_text_to_image_lora SD inferencing during and after training in example (#8283 ) * Modularized the train_lora file * Modularized the train_lora file * Modularized the train_lora file * Modularized the train_lora file * Modularized the train_lora file --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-29 10:08:02 +05:30
Sayak Paul	581d8aacf7	post release v0.28.0 (#8286 ) * post release v0.28.0 * style	2024-05-29 07:13:22 +05:30
Sajad Norouzi	67bef2027c	Add Kohya fix to SD pipeline for high resolution generation (#7633 ) add kohya high resolution fix.	2024-05-28 10:00:04 -10:00
Tolga Cangöz	0ab63ff647	Fix CPU Offloading Usage & Typos (#8230 ) * Fix typos * Fix `pipe.enable_model_cpu_offload()` usage * Fix cpu offloading * Update numbers	2024-05-24 11:25:29 -07:00
Tolga Cangöz	db33af065b	Fix a grammatical error in the `raise` messages (#8272 ) Fix grammatical error	2024-05-24 11:15:00 -07:00
Yifan Zhou	46a9db0336	[Community Pipeline] FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation (#8239 ) * code and doc * update paper link * remove redundant codes * add example video --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-24 14:44:20 +05:30
Sai-Suraj-27	df2bc5ef28	fix: Fixed few `docstrings` according to the Google Style Guide (#7717 ) Fixed few docstrings according to the Google Style Guide.	2024-05-20 10:26:05 -07:00
Alphin Jain	1221b28eac	Fix AttributeError in train_lcm_distill_lora_sdxl_wds.py (#7923 ) Fix conditional teacher model check in train_lcm_distill_lora_sdxl_wds.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-16 15:49:54 +05:30
Sai-Suraj-27	2afea72d29	refactor: Refactored code by Merging `isinstance` calls (#7710 ) * Merged isinstance calls to make the code simpler. * Corrected formatting errors using ruff. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-05-15 18:33:19 -10:00
Isamu Isozaki	d27e996ccd	Adding VQGAN Training script (#5483 ) * Init commit * Removed einops * Added default movq config for training * Update explanation of prompts * Fixed inheritance of discriminator and init_tracker * Fixed incompatible api between muse and here * Fixed output * Setup init training * Basic structure done * Removed attention for quick tests * Style fixes * Fixed vae/vqgan styles * Removed redefinition of wandb * Fixed log_validation and tqdm * Nothing commit * Added commit loss to lookup_from_codebook * Update src/diffusers/models/vq_model.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Adding preliminary README * Fixed one typo * Local changes * Fixed main issues * Merging * Update src/diffusers/models/vq_model.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Testing+Fixed bugs in training script * Some style fixes * Added wandb to docs * Fixed timm test * get testing suite ready. * remove return loss * remove return_loss * Remove diffs * Remove diffs * fix ruff format --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-15 08:47:12 +05:30
Jingyang Zhang	69fdb8720f	[Pipeline] Adding BoxDiff to community examples (#7947 ) add boxdiff to community examples	2024-05-14 11:18:29 -10:00
bssrdf	cdda94f412	fix VAE loading issue in train_dreambooth (#7632 ) * fixed vae loading issue #7619 * rerun make style && make quality * bring back model_has_vae and add change \ to / in config_file_name on windows os to make match work * add missing import platform * bring back import model_info * make config_file_name OS independent * switch to using Path.as_posix() to resolve OS dependence * improve style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: bssrdf <bssrdf@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-05-14 08:19:53 +05:30
dependabot[bot]	5b830aa356	Bump transformers from 4.36.0 to 4.38.0 in /examples/research_projects/realfill (#7635 ) Bump transformers in /examples/research_projects/realfill Bumps [transformers](https://github.com/huggingface/transformers) from 4.36.0 to 4.38.0. - [Release notes](https://github.com/huggingface/transformers/releases) - [Commits](https://github.com/huggingface/transformers/compare/v4.36.0...v4.38.0) --- updated-dependencies: - dependency-name: transformers dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2024-05-14 08:17:06 +05:30
Kohei	9e7bae9881	Update requirements.txt for text_to_image (#7892 ) Update requirements.txt If the datasets library is old, it will not read the metadata.jsonl and the label will default to an integer of type int. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-14 08:09:12 +05:30
HelloWorldBeginner	98ba18ba55	Add Ascend NPU support for SDXL. (#7916 ) Co-authored-by: mhh001 <mahonghao1@huawei.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-12 13:34:23 +02:00
HelloWorldBeginner	0267c5233a	fix bugs when using deepspeed in sdxl (#7917 ) fix bugs when using deepspeed Co-authored-by: mhh001 <mahonghao1@huawei.com>	2024-05-11 20:49:09 +02:00
Mark Van Aken	be4afa0bb4	#7535 Update FloatTensor type hints to Tensor (#7883 ) * find & replace all FloatTensors to Tensor * apply formatting * Update torch.FloatTensor to torch.Tensor in the remaining files * formatting * Fix the rest of the places where FloatTensor is used as well as in documentation * formatting * Update new file from FloatTensor to Tensor	2024-05-10 09:53:31 -10:00
Tolga Cangöz	c1c42698c9	Remove dead code and fix f-string issue (#7720 ) * Remove dead code * PylancereportGeneralTypeIssues: Strings nested within an f-string cannot use the same quote character as the f-string prior to Python 3.12. * Remove dead code	2024-05-08 13:15:28 -10:00
Hyoungwon Cho	c2217142bd	Modification on the PAG community pipeline (re) (#7876 ) * edited_pag_implementation * update --------- Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-05-07 16:35:15 -10:00
Bagheera	8edaf3b79c	7879 - adjust documentation to use naruto dataset, since pokemon is now gated (#7880 ) * 7879 - adjust documentation to use naruto dataset, since pokemon is now gated * replace references to pokemon in docs * more references to pokemon replaced * Japanese translation update --------- Co-authored-by: bghira <bghira@users.github.com>	2024-05-07 09:36:39 -07:00
HelloWorldBeginner	58237364b1	Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. (#7816 ) * Add Ascend NPU support for SDXL fine-tuning and fix the model saving bug when using DeepSpeed. * fix check code quality * Decouple the NPU flash attention and make it an independent module. * add doc and unit tests for npu flash attention. --------- Co-authored-by: mhh001 <mahonghao1@huawei.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-03 08:14:34 -10:00
YiYi Xu	21a7ff12a7	update the logic of `is_sequential_cpu_offload` (#7788 ) * up * add comment to the tests + fix dit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-05-01 06:25:57 -10:00
Linoy Tsaban	26a7851e1e	Add B-Lora training option to the advanced dreambooth lora script (#7741 ) * add blora * add blora * add blora * add blora * little changes * little changes * remove redundancies * fixes * add B LoRA to readme * style * inference * defaults + path to loras+ generation * minor changes * style * minor changes * minor changes * blora arg * added --lora_unet_blocks * style * Update examples/advanced_diffusion_training/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * add commit hash to B-LoRA repo cloning * change inference, remove cloning * change inference, remove cloning add section about configurable unet blocks * change inference, remove cloning add section about configurable unet blocks * Apply suggestions from code review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-30 09:46:30 +05:30
Nilesh	235d34cf56	Check for latents, before calling prepare_latents - sdxlImg2Img (#7582 ) * Check for latents, before calling prepare_latents - sdxlImg2Img * Added latents check for all the img2img pipeline * Fixed silly mistake while checking latents as None	2024-04-28 14:53:29 -10:00
39th president of the United States, probably	9d16daaf64	Add DREAM training (#6381 ) A new function compute_dream_and_update_latents has been added to the training utilities that allows you to do DREAM rectified training in line with the paper https://arxiv.org/abs/2312.00210. The method can be used with an extra argument in the train_text_to_image.py script. Co-authored-by: Jimmy <39@🇺🇸.com>	2024-04-27 07:19:15 +05:30
Steven Liu	7404f1e9dc	[docs] Clean up toctree (#7715 ) * toctree * optim * feedback * improve overview	2024-04-23 09:30:33 -07:00
Abhinav Gopal	d1e3f489e9	Animatediff Controlnet Community Pipeline IP Adapter Fix (#7413 ) * fixed encode_image function signature in controlnet animatediff * copied encode_image from stable diffusion pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-19 15:35:07 -10:00
Sai-Suraj-27	db969cc16d	fix: Fixed `type annotations` for compatibility with python 3.8 (#7648 ) * Fixed type annotations for compatibility with python 3.8 * Add required imports.	2024-04-18 19:34:09 -10:00
Dhruv Nair	90250d9e48	Cast height, width to int inside prepare latents (#7691 ) update	2024-04-18 19:30:39 -10:00
Fabio Rigano	b5c8b555d7	Move IP Adapter Face ID to core (#7186 ) * Switch to peft and multi proj layers * Move Face ID loading and inference to core --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-18 14:13:27 -10:00
UmerHA	fda1531d8a	Fixing implementation of ControlNet-XS (#6772 ) * CheckIn - created DownSubBlocks * Added extra channels, implemented subblock fwd * Fixed connection sizes * checkin * Removed iter, next in forward * Models for SD21 & SDXL run through * Added back pipelines, cleared up connections * Cleaned up connection creation * added debug logs * updated logs * logs: added input loading * Update umer_debug_logger.py * log: Loading hint * Update umer_debug_logger.py * added logs * Changed debug logging * debug: added more logs * Fixed num_norm_groups * Debug: Logging all of SDXL input * Update umer_debug_logger.py * debug: updated logs * checkim * Readded tests * Removed debug logs * Fixed Slow Tests * Added value checks | Updated model_cpu_offload_seq * accelerate-offloading works ; fast tests work * Made unet & addon explicit in controlnet * Updated slow tests * Added dtype/device to ControlNetXS * Filled in test model paths * Added image_encoder/feature_extractor to XL pipe * Fixed fast tests * Added comments and docstrings * Fixed copies * Added docs ; Updates slow tests * Moved changes to UNetMidBlock2DCrossAttn * tiny cleanups * Removed stray prints * Removed ip adapters + freeU - Removed ip adapters + freeU as they don't make sense for ControlNet-XS - Fixed imports of UNet components * Fixed test_save_load_float16 * Make style, quality, fix-copies * Changed loading/saving API for ControlNetXS - Changed loading/saving API for ControlNetXS - other small fixes * Removed ControlNet-XS from research examples * Make style, quality, fix-copies * Small fixes - deleted ControlNetXSModel.init_original - added time_embedding_mix to StableDiffusionControlNetXSPipeline .from_pretrained / StableDiffusionXLControlNetXSPipeline.from_pretrained - fixed copy hints * checkin May 11 '23 * CheckIn Mar 12 '24 * Fixed tests for SD * Added tests for UNetControlNetXSModel * Fixed SDXL tests * cleanup * Delete Pipfile * CheckIn Mar 20 Started replacing sub blocks by `ControlNetXSCrossAttnDownBlock2D` and `ControlNetXSCrossAttnUplock2D` * check-in Mar 23 * checkin 24 Mar * Created init for UNetCnxs and CnxsAddon * CheckIn * Made from_modules, from_unet and no_control work * make style,quality,fix-copies & small changes * Fixed freezing * Added gradient ckpt'ing; fixed tests * Fix slow tests(+compile) ; clear naming confusion * Don't create UNet in init ; removed class_emb * Incorporated review feedback - Deleted get_base_pipeline / get_controlnet_addon for pipes - Pipes inherit from StableDiffusionXLPipeline - Made module dicts for cnxs-addon's down/mid/up classes - Added support for qkv fusion and freeU * Make style, quality, fix-copies * Implemented review feedback * Removed compatibility check for vae/ctrl embedding * make style, quality, fix-copies * Delete Pipfile * Integrated review feedback - Importing ControlNetConditioningEmbedding now - get_down/mid/up_block_addon now outside class - renamed `do_control` to `apply_control` * Reduced size of test tensors For this, added `norm_num_groups` as parameter everywhere * Renamed cnxs-`Addon` to cnxs-`Adapter` - `ControlNetXSAddon` -> `ControlNetXSAdapter` - `ControlNetXSAddonDownBlockComponents` -> `DownBlockControlNetXSAdapter`, and similarly for mid/up - `get_mid_block_addon` -> `get_mid_block_adapter`, and similarly for mid/up * Fixed save_pretrained/from_pretrained bug * Removed redundant code --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-16 21:56:20 +05:30
kabachuha	2f23437618	Add (Scheduled) Pseudo-Huber Loss training scripts to research projects (#7527 ) * add scheduled pseudo-huber loss training scripts See #7488 * add reduction modes to huber loss * [DB Lora] 2 multiplier to huber loss cause of 1/2 a^2 conv. pairing of `c6495def1f` [DB Lora] add option for smooth l1 (huber / delta) Pairing of `dd22958caa` * [DB Lora] unify huber scheduling Pairing of `19a834c3ab` * [DB Lora] add snr huber scheduler Pairing of `47fb1a6854` * fixup examples link * use snr schedule by default in DB * update all huber scripts with snr * code quality * huber: make style && make quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-13 07:26:08 +05:30
dg845	0bee4d336b	LCM Distill Scripts Fix Bug when Initializing Target U-Net (#6848 ) * Initialize target_unet from unet rather than teacher_unet so that we correctly add time_embedding.cond_proj if necessary. * Use UNet2DConditionModel.from_config to initialize target_unet from unet's config. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-11 07:52:12 -10:00
satani99	37e9d695af	Modularize instruct_pix2pix SD inferencing during and after training in examples (#7603 ) * Modularize instruct_pix2pix code * quality check * quality check --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-10 11:19:16 +05:30
Christopher Beckham	8e46d97cd8	Add missing restore() EMA call in train SDXL script (#7599 ) * Restore unet params back to normal from EMA when validation call is finished * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-09 18:07:55 +05:30
Bagheera	8e963d1c2a	7529 do not disable autocast for cuda devices (#7530 ) * 7529 do not disable autocast for cuda devices * Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue * add autocast fix to other training examples * disable native_amp for dreambooth (sdxl) * disable native_amp for pix2pix (sdxl) * remove tests from remaining files * disable native_amp on huggingface accelerator for every training example that uses it * convert more usages of autocast to nullcontext, make style fixes * make style fixes * style. * Empty-Commit --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-02 20:15:06 +05:30
Álvaro Somoza	73ba81090e	[Community pipeline] SDXL Differential Diffusion Img2Img Pipeline (#7550 ) * initial-commit pipeline created * updated README.md	2024-04-01 18:15:30 -10:00
YiYi Xu	7956c36aaa	add a `from_pipe` method to `DiffusionPipeline` (#7241 ) * add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 13:02:00 -10:00

929 Commits