diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Yushu	a38dd79512	[Pipeline] Fix error of SVD pipeline when num_videos_per_prompt > 1 (#7786 ) swap the order for do_classifier_free_guidance concat with repeat Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-29 16:24:16 +05:30
Nilesh	235d34cf56	Check for latents, before calling prepare_latents - sdxlImg2Img (#7582 ) * Check for latents, before calling prepare_latents - sdxlImg2Img * Added latents check for all the img2img pipeline * Fixed silly mistake while checking latents as None	2024-04-28 14:53:29 -10:00
Sayak Paul	56bd7e67c2	[Scheduler] introduce sigma schedule. (#7649 ) * introduce sigma schedule. Co-authored-by: Suraj Patil <surajp815@gmail.com> * address yiyi * update docstrings. * implement the schedule for EDMDPMSolverMultistepScheduler --------- Co-authored-by: Suraj Patil <surajp815@gmail.com>	2024-04-27 07:40:35 +05:30
39th president of the United States, probably	9d16daaf64	Add DREAM training (#6381 ) A new function compute_dream_and_update_latents has been added to the training utilities that allows you to do DREAM rectified training in line with the paper https://arxiv.org/abs/2312.00210. The method can be used with an extra argument in the train_text_to_image.py script. Co-authored-by: Jimmy <39@🇺🇸.com>	2024-04-27 07:19:15 +05:30
Beinsezii	0d2d424fbe	Add PixArtSigmaPipeline to AutoPipeline mapping (#7783 )	2024-04-26 09:10:20 -10:00
Steven Liu	e24e54fdfa	[docs] Fix AutoPipeline docstring (#7779 ) fix Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-26 10:09:36 -07:00
btlorch	ebc99a77aa	Convert RGB to BGR for the SDXL watermark encoder (#7013 ) * Convert channel order to BGR for the watermark encoder. Convert the watermarked BGR images back to RGB. Fixes #6292 * Revert channel order before stacking images to overcome limitations that negative strides are currently not supported --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-25 14:44:53 -10:00
Sayak Paul	142f353e1c	Fix lora device test (#7738 ) * fix lora device test * fix more. * fix more/ * quality * empty --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-25 18:05:27 +05:30
Sayak Paul	e963621649	[PixArt] fix small nits in pixart sigma (#7767 ) fix small nits in pixart sigma	2024-04-25 06:37:35 +05:30
Junsong Chen	39215aa30e	PixArt-Sigma Implementation (#7654 ) * support PixArt-DMD --------- Co-authored-by: jschen <chenjunsong4@h-partners.com> Co-authored-by: badayvedat <badayvedat@gmail.com> Co-authored-by: Vedat Baday <54285744+badayvedat@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-04-23 22:33:08 -10:00
Sai-Suraj-27	fc9fecc217	fix: Fixed a wrong decorator by modifying it to `@classmethod` (#7653 ) * Fixed wrong decorator by modifying it to @classmethod. * Updated the method and it's argument. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-22 14:41:35 -10:00
Fabio Rigano	065f251766	Restore AttnProcessor2_0 in unload_ip_adapter (#7727 ) * Restore AttnProcessor2_0 in unload_ip_adapter * Fix style * Update test --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-22 13:59:03 -10:00
Jenyuan-Huang	21c747fa0f	Support InstantStyle (#7668 ) * enable control ip-adapter per-transformer block on-the-fly --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: ResearcherXman <xhs.research@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-22 13:20:19 -10:00
Phil Butler	09129842e7	Remove redundant lines (#7396 ) Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-22 09:32:16 -10:00
Sai-Suraj-27	db969cc16d	fix: Fixed `type annotations` for compatability with python 3.8 (#7648 ) * Fixed type annotations for compatability with python 3.8 * Add required imports.	2024-04-18 19:34:09 -10:00
Dhruv Nair	3cfe187dc7	Cleanup ControlnetXS (#7701 ) * update * update	2024-04-18 19:32:00 -10:00
Dhruv Nair	90250d9e48	Cast height, width to int inside prepare latents (#7691 ) update	2024-04-18 19:30:39 -10:00
YiYi Xu	e5674015f3	adding back test_conversion_when_using_device_map (#7704 ) * style * Fix device map nits (#7705) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-18 19:21:32 -10:00
Fabio Rigano	b5c8b555d7	Move IP Adapter Face ID to core (#7186 ) * Switch to peft and multi proj layers * Move Face ID loading and inference to core --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-18 14:13:27 -10:00
Sayak Paul	9d50f7eec1	[Core] `is_cosxl_edit` arg in SDXL ip2p. (#7650 ) * is_cosxl_edit arg in SDXL ip2p. * Empty-Commit Co-authored-by: Yiyi Xu <yixu310@gmail.com> * doc * remove redundant logic. * reflect drhuv's comments. --------- Co-authored-by: Yiyi Xu <yixu310@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-16 22:15:55 +05:30
UmerHA	fda1531d8a	Fixing implementation of ControlNet-XS (#6772 ) * CheckIn - created DownSubBlocks * Added extra channels, implemented subblock fwd * Fixed connection sizes * checkin * Removed iter, next in forward * Models for SD21 & SDXL run through * Added back pipelines, cleared up connections * Cleaned up connection creation * added debug logs * updated logs * logs: added input loading * Update umer_debug_logger.py * log: Loading hint * Update umer_debug_logger.py * added logs * Changed debug logging * debug: added more logs * Fixed num_norm_groups * Debug: Logging all of SDXL input * Update umer_debug_logger.py * debug: updated logs * checkim * Readded tests * Removed debug logs * Fixed Slow Tests * Added value ckecks \| Updated model_cpu_offload_seq * accelerate-offloading works ; fast tests work * Made unet & addon explicit in controlnet * Updated slow tests * Added dtype/device to ControlNetXS * Filled in test model paths * Added image_encoder/feature_extractor to XL pipe * Fixed fast tests * Added comments and docstrings * Fixed copies * Added docs ; Updates slow tests * Moved changes to UNetMidBlock2DCrossAttn * tiny cleanups * Removed stray prints * Removed ip adapters + freeU - Removed ip adapters + freeU as they don't make sense for ControlNet-XS - Fixed imports of UNet components * Fixed test_save_load_float16 * Make style, quality, fix-copies * Changed loading/saving API for ControlNetXS - Changed loading/saving API for ControlNetXS - other small fixes * Removed ControlNet-XS from research examples * Make style, quality, fix-copies * Small fixes - deleted ControlNetXSModel.init_original - added time_embedding_mix to StableDiffusionControlNetXSPipeline .from_pretrained / StableDiffusionXLControlNetXSPipeline.from_pretrained - fixed copy hints * checkin May 11 '23 * CheckIn Mar 12 '24 * Fixed tests for SD * Added tests for UNetControlNetXSModel * Fixed SDXL tests * cleanup * Delete Pipfile * CheckIn Mar 20 Started replacing sub blocks by `ControlNetXSCrossAttnDownBlock2D` and `ControlNetXSCrossAttnUplock2D` * check-in Mar 23 * checkin 24 Mar * Created init for UNetCnxs and CnxsAddon * CheckIn * Made from_modules, from_unet and no_control work * make style,quality,fix-copies & small changes * Fixed freezing * Added gradient ckpt'ing; fixed tests * Fix slow tests(+compile) ; clear naming confusion * Don't create UNet in init ; removed class_emb * Incorporated review feedback - Deleted get_base_pipeline / get_controlnet_addon for pipes - Pipes inherit from StableDiffusionXLPipeline - Made module dicts for cnxs-addon's down/mid/up classes - Added support for qkv fusion and freeU * Make style, quality, fix-copies * Implemented review feedback * Removed compatibility check for vae/ctrl embedding * make style, quality, fix-copies * Delete Pipfile * Integrated review feedback - Importing ControlNetConditioningEmbedding now - get_down/mid/up_block_addon now outside class - renamed `do_control` to `apply_control` * Reduced size of test tensors For this, added `norm_num_groups` as parameter everywhere * Renamed cnxs-`Addon` to cnxs-`Adapter` - `ControlNetXSAddon` -> `ControlNetXSAdapter` - `ControlNetXSAddonDownBlockComponents` -> `DownBlockControlNetXSAdapter`, and similarly for mid/up - `get_mid_block_addon` -> `get_mid_block_adapter`, and similarly for mid/up * Fixed save_pretrained/from_pretrained bug * Removed redundant code --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-16 21:56:20 +05:30
Benjamin Bossan	2523390c26	FIX Setting device for DoRA parameters (#7655 ) Fix a bug that causes the the call to set_lora_device to ignore the DoRA parameters.	2024-04-12 13:55:46 +02:00
Sai-Suraj-27	279de3c3ff	fix: Replaced deprecated `logger.warn` with `logger.warning` (#7643 ) Fixed deprecated logger.warn with logger.warning.	2024-04-11 09:43:01 -10:00
Yiqin Zhao	8e14535708	Fixed YAML loading. (#7579 )	2024-04-11 09:08:42 -10:00
Steven Munn	42f25d601a	Skip PEFT LoRA Scaling if the scale is 1.0 (#7576 ) * Skip scaling if scale is identity * move check for weight one to scale and unscale lora * fix code style/quality * Empty-Commit --------- Co-authored-by: Steven Munn <stevenjmunn@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Munn <5297082+stevenjlm@users.noreply.github.com>	2024-04-11 11:02:31 +05:30
Sayak Paul	33c5d125cb	[Core] fix img2img pipeline for Playground (#7627 ) * playground vae encoding should use std and mean of the vae. * style. * fix-copies.	2024-04-11 09:07:38 +05:30
YiYi Xu	aa1f00fd01	Fix cpu offload related slow tests (#7618 ) * fix * up --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-04-10 14:53:45 -10:00
IDKiro	b99b1617cf	add the option of upsample function for tiny vae (#7604 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-10 09:27:39 +05:30
Sayak Paul	3e4a6bd2d4	[Core] add "balanced" `device_map` support to pipelines (#6857 ) * get device <-> component mapping when using multiple gpus. * condition the device_map bits. * relax condition * device_map progress. * device_map enhancement * some cleaning up and debugging * Apply suggestions from code review Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * incorporate suggestions from PR. * remove multi-gpu condition for now. * guard check the component -> device mapping * fix: device_memory variable * dispatching transformers model to have force_hooks=True * better guarding for transformers device_map * introduce support balanced_low_memory and balanced_ultra_low_memory. * remove device_map patch. * fix: intermediate variable scoping. * fix: condition in cpu offload. * fix: flax class restrictions. * remove modifications from cpu_offload and model_offload * incorporate changes. * add a simple forward pass test * add: torch_device in get_inputs() * add: tests * remove print * safe-guard to(), model offloading and cpu offloading when balanced is used as a device_map. * style * remove . * safeguard device_map with more checks and remove invalid device_mapping strategues. * make a class attribute and adjust tests accordingly. * fix device_map check * fix test * adjust comment * fix: device_map attribute * fix: dispatching. * max_memory test for pipeline * version guard the tests * fix guard. * address review feedback. * reset_device_map method. * add: test for reset_hf_device_map * fix a couple things. * add reset_device_map() in the error message. * add tests for checking reset_device_map doesn't have unintended consequences. * fix reset_device_map and offloading tests. * create _get_final_device_map utility. * hf_device_map -> _hf_device_map * add documentation * add notes suggested by Marc. * styling. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * move updates within gpu condition. * other docs related things * note on ignore a device not specified in . * provide a suggestion if device mapping errors out. * fix: typo. * _hf_device_map -> hf_device_map * Empty-Commit * add: example hf_device_map. --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2024-04-10 08:59:05 +05:30
Sayak Paul	44f6b859bf	[Core] refactor `transformer_2d` forward logic into meaningful conditions. (#7489 ) * refactor transformer_2d forward logic into meaningful conditions. * Empty-Commit * fix: _operate_on_patched_inputs * fix: _operate_on_patched_inputs * check * fix: patch output computation block. * fix: _operate_on_patched_inputs. * remove print. * move operations to blocks. * more readability neats. * empty commit * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Revert "Apply suggestions from code review" This reverts commit `12178b1aa0`. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-10 08:33:19 +05:30
Fabio Rigano	a0cf607667	Multi-image masking for single IP Adapter (#7499 ) * Support multiimage masking --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-09 09:20:57 -10:00
w4ffl35	7e39516627	Allow more arguments to be passed to convert_from_ckpt (#7222 ) Allow safety and feature extractor arguments to be passed to convert_from_ckpt Allows management of safety checker and feature extractor from outside of the convert ckpt class. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-08 10:13:48 +05:30
Nguyễn Công Tú Anh	56a76082ed	Add AudioLDM2 TTS (#5381 ) * add audioldm2 tts * change gpt2 max new tokens * remove unnecessary pipeline and class * add TTS to AudioLDM2Pipeline * add TTS docs * delete unnecessary file * remove unnecessary import * add audioldm2 slow testcase * fix code quality * remove AudioLDMLearnablePositionalEmbedding * add variable check vits encoder * add use_learned_position_embedding --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-08 10:11:24 +05:30
YiYi Xu	6133d98ff7	[IF\| add set_begin_index for all IF pipelines (#7577 ) add set_begin_index for all if pipelines	2024-04-05 06:54:07 -10:00
Abhinav Gopal	35db2fdea9	Update pipeline_animatediff_video2video.py (#7457 ) * Update pipeline_animatediff_video2video.py * commit with test for whether latent input can be passed into animatediffvid2vid	2024-04-03 19:34:28 +05:30
Sayak Paul	a9a5b14f35	[Core] refactor transformers 2d into multiple init variants. (#7491 ) * refactor transformers 2d into multiple legacy variants. * fix: init. * fix recursive init. * add inits. * make transformer block creation more modular. * complete refactor. * remove forward * debug * remove legacy blocks and refactor within the module itself. * remove print * guard caption projection * remove fetcher. * reduce the number of args. * fix: norm_type * group variables that are shared. * remove _get_transformer_blocks * harmonize the init function signatures. * transformer_blocks to common * repeat .	2024-04-03 12:56:17 +05:30
Beinsezii	aa19025989	UniPC Multistep add `rescale_betas_zero_snr` (#7531 ) * UniPC Multistep add `rescale_betas_zero_snr` Same patch as DPM and Euler with the patched final alpha cumprod BF16 doesn't seem to break down, I think cause UniPC upcasts during some phases already? We could still force an upcast since it only loses ≈ 0.005 it/s for me but the difference in output is very small. A better endeavor might upcasting in step() and removing all the other upcasts elsewhere? * UniPC ZSNR UT * Re-add `rescale_betas_zsnr` doc oops	2024-04-02 17:23:55 -10:00
Beinsezii	19ab04ff56	UniPC Multistep fix tensor dtype/device on order=3 (#7532 ) * UniPC UTs iterate solvers on FP16 It wasn't catching errs on order==3. Might be excessive? * UniPC Multistep fix tensor dtype/device on order=3 * UniPC UTs Add v_pred to fp16 test iter For completions sake. Probably overkill?	2024-04-02 15:41:29 -10:00
Sayak Paul	4a34307702	add: utility to format our docs too 📜 (#7314 ) * add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.	2024-04-02 20:49:43 +05:30
Bagheera	8e963d1c2a	7529 do not disable autocast for cuda devices (#7530 ) * 7529 do not disable autocast for cuda devices * Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue * add autocast fix to other training examples * disable native_amp for dreambooth (sdxl) * disable native_amp for pix2pix (sdxl) * remove tests from remaining files * disable native_amp on huggingface accelerator for every training example that uses it * convert more usages of autocast to nullcontext, make style fixes * make style fixes * style. * Empty-Commit --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-02 20:15:06 +05:30
Sayak Paul	000fa82a1e	[Chore] remove class assignments for linear and conv. (#7553 ) * remove class assignments for linear and conv. * fix: self.nn	2024-04-02 13:01:04 +05:30
YiYi Xu	7956c36aaa	add a `from_pipe` method to `DiffusionPipeline` (#7241 ) * add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 13:02:00 -10:00
Jianbing Wu	9bef9f4be7	Fix SVD bug (shape of `time_context`) (#7268 ) * Fix SVD bug (shape of `time_context`) * Formatting code * Formatting src/diffusers/models/transformers/transformer_temporal.py by `make style && make quality` --------- Co-authored-by: kevinkhwu <kevinkhwu@tencent.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 14:05:52 +05:30
Stephen	ca61287daa	Fix IP Adapter Support for SAG Pipeline (#7260 ) * fix ip adapter support * Update sag pipelines tests, adjust sag pipeline to pass tests --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-03-30 06:15:29 -10:00
Beinsezii	f0c81562a4	Add `final_sigma_zero` to UniPCMultistep (#7517 ) * Add `final_sigma_zero` to UniPCMultistep Effectively the same trick as DDIM's `set_alpha_to_one` and DPM's `final_sigma_type='zero'`. Currently False by default but maybe this should be True? * `final_sigma_zero: bool` -> `final_sigmas_type: str` Should 1:1 match DPM Multistep now. * Set `final_sigmas_type='sigma_min'` in UniPC UTs	2024-03-29 22:23:45 -10:00
UmerHA	77103d71ca	Quick-Fix for #7352 block-lora (#7523 ) Fixed important typo	2024-03-30 06:42:28 +05:30
UmerHA	0302446819	Implements Blockwise lora (#7352 ) * Initial commit * Implemented block lora - implemented block lora - updated docs - added tests * Finishing up * Reverted unrelated changes made by make style * Fixed typo * Fixed bug + Made text_encoder_2 scalable * Integrated some review feedback * Incorporated review feedback * Fix tests * Made every module configurable * Adapter to new lora test structure * Final cleanup * Some more final fixes - Included examples in `using_peft_for_inference.md` - Added hint that only attns are scaled - Removed NoneTypes - Added test to check mismatching lens of adapter names / weights raise error * Update using_peft_for_inference.md * Update using_peft_for_inference.md * Make style, quality, fix-copies * Updated tutorial;Warning if scale/adapter mismatch * floats are forwarded as-is; changed tutorial scale * make style, quality, fix-copies * Fixed typo in tutorial * Moved some warnings into `lora_loader_utils.py` * Moved scale/lora mismatch warnings back * Integrated final review suggestions * Empty commit to trigger CI * Reverted emoty commit to trigger CI --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-29 21:15:57 +05:30
Sayak Paul	fac761694a	[Tests] Speed up some fast pipeline tests (#7477 ) * speed up test_vae_slicing in animatediff * speed up test_karras_schedulers_shape for attend and excite. * style. * get the static slices out. * specify torch print options. * modify * test run with controlnet * specify kwarg * fix: things * not None * flatten * controlnet img2img * complete controlet sd * finish more * finish more * finish more * finish more * finish the final batch * add cpu check for expected_pipe_slice. * finish the rest * remove print * style * fix ssd1b controlnet test * checking ssd1b * disable the test. * make the test_ip_adapter_single controlnet test more robust * fix: simple inpaint * multi * disable panorama * enable again * panorama is shaky so leave it for now * remove print * raise tolerance.	2024-03-29 14:11:38 +05:30
Lvkesheng Shen	e49c04d5d6	Bug fix for controlnetpipeline check_image (#7103 ) * Bug fix for controlnetpipeline check_image Bug fix for controlnetpipeline check_image when using multicontrolnet and prompt list * Update test_inference_multiple_prompt_input function * Update test_controlnet.py add test for multiple prompts and multiple image conditioning * Update test_controlnet.py Fix format error --------- Co-authored-by: Lvkesheng Shen <45848260+Fantast416@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-28 08:25:18 -10:00
YiYi Xu	f238cb0736	cpu_offload: remove all hooks before offload (#7448 ) * add remove_all_hooks * a few more fix and tests * up * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * split tests * add --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2024-03-28 08:23:02 -10:00

1 2 3 4 5 ...

2204 Commits