diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Vinh H. Pham	7a95f8d9d8	[Tests] Improve transformers model test suite coverage - Temporal Transformer (#8932 ) * add test for temporal transformer * remove unused variable * fix code quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-23 15:36:30 +05:30
Aritra Roy Gosthipaty	8b21feed42	[Tests] reduce the model size in the audioldm2 fast test (#7846 ) * chore: initial model size reduction * chore: fixing expected values for failing tests * requested edits --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-23 14:34:07 +05:30
Sayak Paul	af400040f5	[Tests] proper skipping of request caching test (#8908 ) proper skipping of request caching test	2024-07-22 12:52:57 -10:00
shinetzh	3b04cdc816	fix loop bug in SlicedAttnProcessor (#8836 ) * fix loop bug in SlicedAttnProcessor --------- Co-authored-by: neoshang <neoshang@tencent.com>	2024-07-19 18:14:29 -10:00
Sayak Paul	0f09b01ab3	[Core] fix: shard loading and saving when variant is provided. (#8869 ) fix: shard loading and saving when variant is provided.	2024-07-17 08:26:28 +05:30
Tolga Cangöz	e87bf62940	[`Cont'd`] Add the SDE variant of ~~DPM-Solver~~ and DPM-Solver++ to DPM Single Step (#8269 ) * Add the SDE variant of DPM-Solver and DPM-Solver++ to DPM Single Step --------- Co-authored-by: cmdr2 <secondary.cmdr2@gmail.com>	2024-07-16 15:40:02 -10:00
Aryan	bbd2f9d4e9	[tests] fix typo in pag tests (#8845 ) * fix typo in pag tests * fix typo	2024-07-12 17:41:34 +05:30
Nguyễn Công Tú Anh	d704b3bf8c	add PAG support sd15 controlnet (#8820 ) * add pag support sd15 controlnet * fix quality import * remove unecessary import * remove if state * fix tests * remove useless function * add sd1.5 controlnet pag docs --------- Co-authored-by: anhnct8 <anhnct8@fpt.com>	2024-07-12 15:42:56 +05:30
Dhruv Nair	11d18f3217	Add single file loading support for AnimateDiff (#8819 ) * update * update * update * update	2024-07-12 09:51:57 +05:30
Sayak Paul	2261510bbc	[Core] Add AuraFlow (#8796 ) * add lavender flow transformer --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-11 08:50:19 -10:00
Álvaro Somoza	87b9db644b	[Core] Add Kolors (#8812 ) * initial draft	2024-07-11 06:09:17 -10:00
Xin Ma	b8cf84a3f9	Latte: Latent Diffusion Transformer for Video Generation (#8404 ) * add Latte to diffusers * remove print * remove print * remove print * remove unuse codes * remove layer_norm_latte and add a flag * remove layer_norm_latte and add a flag * update latte_pipeline * update latte_pipeline * remove unuse squeeze * add norm_hidden_states.ndim == 2: # for Latte * fixed test latte pipeline bugs * fixed test latte pipeline bugs * delete sh * add doc for latte * add licensing * Move Transformer3DModelOutput to modeling_outputs * give a default value to sample_size * remove the einops dependency * change norm2 for latte * modify pipeline of latte * update test for Latte * modify some codes for latte * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * video_length -> num_frames; update prepare_latents copied from * make fix-copies * make style * typo: videe -> video * update * modify for Latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify for Latte pipeline * Delete .vscode directory * make style * make fix-copies * add latte transformer 3d to docs _toctree.yml * update example * reduce frames for test * fixed bug of _text_preprocessing * set num frame to 1 for testing * remove unuse print * add 
text = self._clean_caption(text) again --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-07-11 15:06:22 +05:30
Sayak Paul	a785992c1d	[Tests] fix more sharding tests (#8797 ) * fix * fix * ugly * okay * fix more * fix oops	2024-07-09 13:09:36 +05:30
Xu Cao	35cc66dc4c	Add pipeline_stable_diffusion_3_inpaint.py for SD3 Inference (#8709 ) * Add pipeline_stable_diffusion_3_inpaint --------- Co-authored-by: Xu Cao <xucao2@jrehg-work-01.cs.illinois.edu> Co-authored-by: IrohXu <irohcao@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-08 15:53:02 -10:00
Tolga Cangöz	57084dacc5	Remove unnecessary lines (#8569 ) * Remove unused line --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-08 10:42:02 -10:00
PommesPeter	98388670d2	[Alpha-VLLM Team] Add Lumina-T2X to diffusers (#8652 ) --------- Co-authored-by: zhuole1025 <zhuole1025@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-07 17:12:09 -10:00
YiYi Xu	9e9ed353a2	fix loading sharded checkpoints from subfolder (#8798 ) * fix load sharded checkpoints from subfolder{ * style * os.path.join * add a small test --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-07-06 11:32:04 -10:00
Dhruv Nair	0bab9d6be7	[Single File] Allow loading T5 encoder in mixed precision (#8778 ) * update * update * update * update	2024-07-05 10:29:38 +05:30
Sayak Paul	31adeb41cd	[Tests] fix sharding tests (#8764 ) fix sharding tests	2024-07-04 08:50:59 +05:30
Aryan	a7b9634e95	Fix minor bug in SD3 img2img test (#8779 ) fix minor bug in sd3 img2img	2024-07-03 07:45:37 -10:00
Sayak Paul	984d340534	Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability." (#8773 ) Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670)" This reverts commit `a2071a1837`.	2024-07-03 07:05:01 +05:30
Sayak Paul	a2071a1837	[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space	2024-07-03 07:04:37 +05:30
Shauray Singh	8690e8b9d6	add PAG support for SD architecture (#8725 ) * add pag to sd pipelines	2024-06-29 09:26:11 -10:00
Dhruv Nair	150142c537	[Tests] Fix precision related issues in slow pipeline tests (#8720 ) update	2024-06-28 08:13:46 +05:30
Mathis Koroglu	3e0d128da7	Motion Model / Adapter versatility (#8301 ) * Motion Model / Adapter versatility - allow to use a different number of layers per block - allow to use a different number of transformer per layers per block - allow a different number of motion attention head per block - use dropout argument in get_down/up_block in 3d blocks * Motion Model added arguments renamed & refactoring * Add test for asymmetric UNetMotionModel	2024-06-27 11:11:29 +05:30
vincedovy	a536e775fb	Fix json WindowsPath crash (#8662 ) * Add check for WindowsPath in to_json_string On Windows, os.path.join returns a WindowsPath. to_json_string does not convert this from a WindowsPath to a string. Added check for WindowsPath to to_json_saveable. * Remove extraneous convert to string in test_check_path_types (tests/others/test_config.py) * Fix style issues in tests/others/test_config.py * Add unit test to test_config.py to verify that PosixPath and WindowsPath (depending on system) both work when converted to JSON * Remove distinction between PosixPath and WindowsPath in ConfigMixIn.to_json_string(). Conditional now tests for Path, and uses Path.as_posix() to convert to string. --------- Co-authored-by: Vincent Dovydaitis <vincedovy@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-27 10:30:55 +05:30
Dhruv Nair	effe4b9784	Update xformers SD3 test (#8712 ) update	2024-06-26 10:24:27 -10:00
XCL	fa2abfdb03	[Tencent Hunyuan Team] Add Hunyuan-DiT ControlNet Inference (#8694 ) * add controlnet support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-06-26 00:43:03 -10:00
Dhruv Nair	0f0b531827	Add decorator for compile tests (#8703 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-26 11:26:47 +05:30
YiYi Xu	540399f540	add PAG support (#7944 ) * first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <ethan9867@gmail.com> Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-06-25 08:40:02 -10:00
Sayak Paul	f088027e93	[Marigold tests] add `is_flaky` decorator to some Marigold tests (#8696 ) okay	2024-06-25 06:27:28 -10:00
Linoy Tsaban	c6e08ecd46	[Sd3 Dreambooth LoRA] Add text encoder training for the clip encoders (#8630 ) * add clip text-encoder training * no dora * text encoder traing fixes * text encoder traing fixes * text encoder training fixes * text encoder training fixes * text encoder training fixes * text encoder training fixes * add text_encoder layers to save_lora * style * fix imports * style * fix text encoder * review changes * review changes * review changes * minor change * add lora tag * style * add readme notes * add tests for clip encoders * style * typo * fixes * style * Update tests/lora/test_lora_layers_sd3.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README_sd3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * minor readme change --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-25 18:00:19 +05:30
Sayak Paul	4ad7a1f5fd	[Chore] create a utility for calculating the expected number of shards. (#8692 ) create a utility for calculating the expected number of shards.	2024-06-25 17:05:39 +05:30
Tolga Cangöz	f040c27d4c	Errata - Fix typos and improve style (#8571 ) * Fix typos * Fix typos & up style * chore: Update numbers --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 10:07:22 -07:00
Tolga Cangöz	138fac703a	Discourage using deprecated `revision` parameter (#8573 ) * Discourage using `revision` * `make style && make quality` * Refactor code to use 'variant' instead of 'revision' * `revision="bf16"` -> `variant="bf16"` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 10:06:49 -07:00
Dong	3fca52022f	🎨 fix xl playground device (#8550 ) * 🎨 fix xl playground device * 🎨 run `make fix-copies` * 🎨 run `make fix-copies` * edit xl_controlnet_img2img file * edit playground img2img test slow * Update tests/pipelines/stable_diffusion_xl/test_stable_diffusion_xl_img2img.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 16:49:55 +05:30
Tolga Cangöz	c375903db5	Errata - Fix typos & improve contributing page (#8572 ) * Fix typos & improve contributing page * `make style && make quality` * fix typos * Fix typo --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 14:13:03 +05:30
drhead	2ada094bff	Add extra performance features for EMAModel, torch._foreach operations and better support for non-blocking CPU offloading (#7685 ) * Add support for _foreach operations and non-blocking to EMAModel * default foreach to false * add non-blocking EMA offloading to SD1.5 T2I example script * fix whitespace * move foreach to cli argument * linting * Update README.md re: EMA weight training * correct args.foreach_ema * add tests for foreach ema * code quality * add foreach to from_pretrained * default foreach false * fix linting --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: drhead <a@a.a>	2024-06-24 14:03:47 +05:30
Álvaro Somoza	e7b9a0762b	[SD3 LoRA] Fix list index out of range (#8584 ) * fix * add check * key present is checked before * test case draft * aply suggestions * changed testing repo, back to old class * forgot docstring --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-21 21:17:34 +05:30
Sayak Paul	8eb17315c8	[LoRA] get rid of the legacy lora remnants and make our codebase lighter (#8623 ) * get rid of the legacy lora remnants and make our codebase lighter * fix depcrecated lora argument * fix * empty commit to trigger ci * remove print * empty	2024-06-21 16:36:05 +05:30
YiYi Xu	c71c19c5e6	a few fix for shard checkpoints (#8656 ) fix Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-06-21 12:50:58 +05:30
Sayak Paul	668e34c6e0	[LoRA SD3] add support for lora fusion in sd3 (#8616 ) * add support for lora fusion in sd3 * add test to ensure fused lora and effective lora produce same outpouts	2024-06-20 14:25:51 +05:30
Sayak Paul	25d7bb3ea6	[Flax tests] reduce tolerance for a flax test (#8640 ) reduce tolerance for a flax test	2024-06-20 00:48:08 +04:00
王奇勋	e5564d45bf	Support SD3 ControlNet and Multi-ControlNet. (#8566 ) * sd3 controlnet --------- Co-authored-by: haofanwang <haofanwang.ai@gmail.com>	2024-06-18 14:59:22 -10:00
Gæros	298ce67999	[LoRA] text encoder: read the ranks for all the attn modules (#8324 ) * [LoRA] text encoder: read the ranks for all the attn modules * In addition to out_proj, read the ranks of adapters for q_proj, k_proj, and v_proj * Allow missing adapters (UNet already supports this) * ruff format loaders.lora * [LoRA] add tests for partial text encoders LoRAs * [LoRA] update test_simple_inference_with_partial_text_lora to be deterministic * [LoRA] comment justifying test_simple_inference_with_partial_text_lora * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-18 21:10:50 +01:00
Marc Sun	96399c3ec6	Fix sharding when no device_map is passed (#8531 ) * Fix sharding when no device_map is passed * style * add tests * align * add docstring * format --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-18 05:47:23 -10:00
YiYi Xu	614d0c64e9	remove the deprecated prepare_mask_and_masked_image function (#8512 ) remove prepare mask fn Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-13 14:59:21 +01:00
Dhruv Nair	04717fd861	Add Stable Diffusion 3 (#8483 ) * up * add sd3 * update * update * add tests * fix copies * fix docs * update * add dreambooth lora * add LoRA * update * update * update * update * import fix * update * Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * import fix 2 * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * update * update * update * fix ckpt id * fix more ids * update * missing doc * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update 
docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update' * fix * update * Update src/diffusers/models/autoencoders/autoencoder_kl.py * Update src/diffusers/models/autoencoders/autoencoder_kl.py * note on gated access. * requirements * licensing --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-06-12 20:44:00 +01:00
Sayak Paul	7d887118b9	[Core] support saving and loading of sharded checkpoints (#7830 ) * feat: support saving a model in sharded checkpoints. * feat: make loading of sharded checkpoints work. * add tests * cleanse the loading logic a bit more. * more resilience while loading from the Hub. * parallelize shard downloads by using snapshot_download()/ * default to a shard size. * more fix * Empty-Commit * debug * fix * uality * more debugging * fix more * initial comments from Benjamin * move certain methods to loading_utils * add test to check if the correct number of shards are present. * add a test to check if loading of sharded checkpoints from the Hub is okay * clarify the unit when passed as an int. * use hf_hub for sharding. * remove unnecessary code * remove unnecessary function * lucain's comments. * fixes * address high-level comments. * fix test * subfolder shenanigans./ * Update src/diffusers/utils/hub_utils.py Co-authored-by: Lucain <lucainp@gmail.com> * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> * remove _huggingface_hub_version as not needed. * address more feedback. * add a test for local_files_only=True/ * need hf hub to be at least 0.23.2 * style * final comment. * clean up subfolder. * deal with suffixes in code. * _add_variant default. * use weights_name_pattern * remove add_suffix_keyword * clean up downloading of sharded ckpts. * don't return something special when using index.json * fix more * don't use bare except * remove comments and catch the errors better * fix a couple of things when using is_file() * empty --------- Co-authored-by: Lucain <lucainp@gmail.com>	2024-06-07 14:49:10 +05:30
Tolga Cangöz	ec1aded12e	Optimize test files by fixing CPU-offloading usage (#8409 ) * Refactor code to remove unnecessary calls to `to(torch_device)` * Refactor code to remove unnecessary calls to `to("cuda")` * Update pipeline_stable_diffusion_diffedit.py	2024-06-06 09:51:26 -10:00

1104 Commits