diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Dhruv Nair	d2df40c6f3	Add VAE tiling option for SD3 (#8791 ) update	2024-07-11 09:49:39 -10:00
Sayak Paul	2261510bbc	[Core] Add AuraFlow (#8796 ) * add lavender flow transformer --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-11 08:50:19 -10:00
Álvaro Somoza	87b9db644b	[Core] Add Kolors (#8812 ) * initial draft	2024-07-11 06:09:17 -10:00
Xin Ma	b8cf84a3f9	Latte: Latent Diffusion Transformer for Video Generation (#8404 ) * add Latte to diffusers * remove print * remove print * remove print * remove unuse codes * remove layer_norm_latte and add a flag * remove layer_norm_latte and add a flag * update latte_pipeline * update latte_pipeline * remove unuse squeeze * add norm_hidden_states.ndim == 2: # for Latte * fixed test latte pipeline bugs * fixed test latte pipeline bugs * delete sh * add doc for latte * add licensing * Move Transformer3DModelOutput to modeling_outputs * give a default value to sample_size * remove the einops dependency * change norm2 for latte * modify pipeline of latte * update test for Latte * modify some codes for latte * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * video_length -> num_frames; update prepare_latents copied from * make fix-copies * make style * typo: videe -> video * update * modify for Latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify for Latte pipeline * Delete .vscode directory * make style * make fix-copies * add latte transformer 3d to docs _toctree.yml * update example * reduce frames for test * fixed bug of _text_preprocessing * set num frame to 1 for testing * remove unuse print * add 
text = self._clean_caption(text) again --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-07-11 15:06:22 +05:30
Alan Du	673eb60f1c	Reformat docstring for `get_timestep_embedding` (#8811 ) * Reformat docstring for `get_timestep_embedding` --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-10 15:54:44 -10:00
Xu Cao	35cc66dc4c	Add pipeline_stable_diffusion_3_inpaint.py for SD3 Inference (#8709 ) * Add pipeline_stable_diffusion_3_inpaint --------- Co-authored-by: Xu Cao <xucao2@jrehg-work-01.cs.illinois.edu> Co-authored-by: IrohXu <irohcao@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-08 15:53:02 -10:00
Tolga Cangöz	57084dacc5	Remove unnecessary lines (#8569 ) * Remove unused line --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-08 10:42:02 -10:00
Zhuoqun(Jack) Chen	70611a1068	Fix static typing and doc typos (#8807 ) * Fix static typing and doc typos * Fix more same type hint typos with make fix-copies	2024-07-08 09:09:33 -10:00
PommesPeter	98388670d2	[Alpha-VLLM Team] Add Lumina-T2X to diffusers (#8652 ) --------- Co-authored-by: zhuole1025 <zhuole1025@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-07 17:12:09 -10:00
YiYi Xu	9e9ed353a2	fix loading sharded checkpoints from subfolder (#8798 ) * fix load sharded checkpoints from subfolder{ * style * os.path.join * add a small test --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2024-07-06 11:32:04 -10:00
Dhruv Nair	0bab9d6be7	[Single File] Allow loading T5 encoder in mixed precision (#8778 ) * update * update * update * update	2024-07-05 10:29:38 +05:30
Sayak Paul	31adeb41cd	[Tests] fix sharding tests (#8764 ) fix sharding tests	2024-07-04 08:50:59 +05:30
XCL	6b6b4bcffe	[Tencent Hunyuan Team] Add checkpoint conversion scripts and changed controlnet (#8783 ) * add conversion files; changed controlnet for hunyuandit * style --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-07-03 07:45:18 -10:00
Sayak Paul	06ee4db3e7	[Chore] add dummy lora attention processors to prevent failures in other libs (#8777 ) add dummy lora attention processors to prevent failures in other libs	2024-07-03 13:11:00 +05:30
Sayak Paul	984d340534	Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability." (#8773 ) Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670)" This reverts commit `a2071a1837`.	2024-07-03 07:05:01 +05:30
Sayak Paul	a2071a1837	[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670 ) * introduce to promote reusability. * up * add more tests * up * remove comments. * fix fuse_nan test * clarify the scope of fuse_lora and unfuse_lora * remove space	2024-07-03 07:04:37 +05:30
YiYi Xu	d9f71ab3c3	correct `attention_head_dim` for `JointTransformerBlock` (#8608 ) * add * update sd3 controlnet * Update src/diffusers/models/controlnet_sd3.py --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-07-02 07:42:25 -10:00
Dhruv Nair	c104482b9c	Fix warning in UNetMotionModel (#8756 ) * update * Update src/diffusers/models/unets/unet_motion_model.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-02 11:07:13 +05:30
YiYi Xu	8b1e3ec93e	[hunyuan-dit] refactor `HunyuanCombinedTimestepTextSizeStyleEmbedding` (#8761 ) up Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-02 10:11:04 +05:30
Haofan Wang	0bae6e447c	Allow from_transformer in SD3ControlNetModel (#8749 ) * Update controlnet_sd3.py --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-01 07:38:38 -10:00
Dhruv Nair	0368483b61	Remove legacy single file model loading mixins (#8754 ) update	2024-07-01 07:20:19 -10:00
YiYi Xu	ddb9d8548c	[doc] add a tip about using SDXL refiner with hunyuan-dit and pixart (#8735 ) * up * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-07-01 06:30:09 -10:00
Lucain	49979753e1	Always raise from previous error (#8751 )	2024-07-01 14:22:30 +05:30
XCL	a3904d7e34	[Tencent Hunyuan Team] Add HunyuanDiT-v1.2 Support (#8747 ) * add v1.2 support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-06-30 21:33:38 -10:00
Shauray Singh	8690e8b9d6	add PAG support for SD architecture (#8725 ) * add pag to sd pipelines	2024-06-29 09:26:11 -10:00
Luo Chaofan	a216b0bb7f	fix: ValueError when using FromOriginalModelMixin in subclasses #8440 (#8454 ) * fix: ValueError when using FromOriginalModelMixin in subclasses #8440 (cherry picked from commit `9285997843`) * Update src/diffusers/loaders/single_file_model.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update single_file_model.py * Update single_file_model.py --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-28 17:15:46 +05:30
Sayak Paul	d5dd8df3b4	[Chore] perform better deprecation for vqmodeloutput (#8719 ) perform better deprecation for vqmodeloutput	2024-06-27 12:16:37 +05:30
Mathis Koroglu	3e0d128da7	Motion Model / Adapter versatility (#8301 ) * Motion Model / Adapter versatility - allow to use a different number of layers per block - allow to use a different number of transformer per layers per block - allow a different number of motion attention head per block - use dropout argument in get_down/up_block in 3d blocks * Motion Model added arguments renamed & refactoring * Add test for asymmetric UNetMotionModel	2024-06-27 11:11:29 +05:30
vincedovy	a536e775fb	Fix json WindowsPath crash (#8662 ) * Add check for WindowsPath in to_json_string On Windows, os.path.join returns a WindowsPath. to_json_string does not convert this from a WindowsPath to a string. Added check for WindowsPath to to_json_saveable. * Remove extraneous convert to string in test_check_path_types (tests/others/test_config.py) * Fix style issues in tests/others/test_config.py * Add unit test to test_config.py to verify that PosixPath and WindowsPath (depending on system) both work when converted to JSON * Remove distinction between PosixPath and WindowsPath in ConfigMixIn.to_json_string(). Conditional now tests for Path, and uses Path.as_posix() to convert to string. --------- Co-authored-by: Vincent Dovydaitis <vincedovy@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-27 10:30:55 +05:30
Álvaro Somoza	3b01d72a64	Modify FlowMatch Scale Noise (#8678 ) * initial fix * apply suggestion * delete step_index line	2024-06-27 00:36:33 -04:00
Sayak Paul	adbb04864d	[LoRA] fix conversion utility so that lora dora loads correctly (#8688 ) fix conversion utility so that lora dora loads correctly	2024-06-27 08:58:32 +05:30
Sayak Paul	5b51ad0052	[LoRA] fix vanilla fine-tuned lora loading. (#8691 ) fix vanilla fine-tuned lora loading.	2024-06-26 07:38:57 -10:00
Sayak Paul	10b4e354b6	[Chore] remove deprecation from transformer2d regarding the output class. (#8698 ) * remove deprecation from transformer2d regarding the output class. * up * deprecate more	2024-06-26 07:35:36 -10:00
Donald.Lee	ea6938aea5	Fix: unet save_attn_procs at UNet2DconditionLoadersMixin (#8699 ) * fix: unet save_attn_procs at custom diffusion * style: recover unchanaged parts(max line length 119) / mod: add condition * style: recover unchanaged parts(max line length 119) --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-26 22:30:49 +05:30
XCL	fa2abfdb03	[Tencent Hunyuan Team] Add Hunyuan-DiT ControlNet Inference (#8694 ) * add controlnet support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-06-26 00:43:03 -10:00
Dhruv Nair	0f0b531827	Add decorator for compile tests (#8703 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-26 11:26:47 +05:30
YiYi Xu	540399f540	add PAG support (#7944 ) * first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <ethan9867@gmail.com> Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-06-25 08:40:02 -10:00
Linoy Tsaban	c6e08ecd46	[Sd3 Dreambooth LoRA] Add text encoder training for the clip encoders (#8630 ) * add clip text-encoder training * no dora * text encoder traing fixes * text encoder traing fixes * text encoder training fixes * text encoder training fixes * text encoder training fixes * text encoder training fixes * add text_encoder layers to save_lora * style * fix imports * style * fix text encoder * review changes * review changes * review changes * minor change * add lora tag * style * add readme notes * add tests for clip encoders * style * typo * fixes * style * Update tests/lora/test_lora_layers_sd3.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README_sd3.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * minor readme change --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-25 18:00:19 +05:30
Steven Liu	df4ad6f4ac	[docs] Fix Pillow import (#8684 ) fix import error	2024-06-24 10:13:15 -07:00
Tolga Cangöz	f040c27d4c	Errata - Fix typos and improve style (#8571 ) * Fix typos * Fix typos & up style * chore: Update numbers --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 10:07:22 -07:00
Tolga Cangöz	138fac703a	Discourage using deprecated `revision` parameter (#8573 ) * Discourage using `revision` * `make style && make quality` * Refactor code to use 'variant' instead of 'revision' * `revision="bf16"` -> `variant="bf16"` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 10:06:49 -07:00
Dong	3fca52022f	🎨 fix xl playground device (#8550 ) * 🎨 fix xl playground device * 🎨 run `make fix-copies` * 🎨 run `make fix-copies` * edit xl_controlnet_img2img file * edit playground img2img test slow * Update tests/pipelines/stable_diffusion_xl/test_stable_diffusion_xl_img2img.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 16:49:55 +05:30
Tolga Cangöz	c375903db5	Errata - Fix typos & improve contributing page (#8572 ) * Fix typos & improve contributing page * `make style && make quality` * fix typos * Fix typo --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 14:13:03 +05:30
drhead	2ada094bff	Add extra performance features for EMAModel, torch._foreach operations and better support for non-blocking CPU offloading (#7685 ) * Add support for _foreach operations and non-blocking to EMAModel * default foreach to false * add non-blocking EMA offloading to SD1.5 T2I example script * fix whitespace * move foreach to cli argument * linting * Update README.md re: EMA weight training * correct args.foreach_ema * add tests for foreach ema * code quality * add foreach to from_pretrained * default foreach false * fix linting --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: drhead <a@a.a>	2024-06-24 14:03:47 +05:30
Haofan Wang	f1f542bdd4	Update pipeline_stable_diffusion_3_controlnet.py (#8660 ) Co-authored-by: YiYi Xu <yixu310@gmail,com>	2024-06-23 15:27:59 +05:30
Sayak Paul	a9c403c001	[LoRA] refactor lora conversion utility. (#8295 ) * refactor lora conversion utility. * remove error raises. * add onetrainer support too.	2024-06-22 08:29:12 +05:30
Álvaro Somoza	e7b9a0762b	[SD3 LoRA] Fix list index out of range (#8584 ) * fix * add check * key present is checked before * test case draft * aply suggestions * changed testing repo, back to old class * forgot docstring --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-21 21:17:34 +05:30
Sayak Paul	8eb17315c8	[LoRA] get rid of the legacy lora remnants and make our codebase lighter (#8623 ) * get rid of the legacy lora remnants and make our codebase lighter * fix depcrecated lora argument * fix * empty commit to trigger ci * remove print * empty	2024-06-21 16:36:05 +05:30
YiYi Xu	c71c19c5e6	a few fix for shard checkpoints (#8656 ) fix Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-06-21 12:50:58 +05:30
Steaunk	adc31940a9	Fix Typo in StableDiffusion3 (#8642 ) * fix typo in __call__ of pipeline_stable_diffusion_3.py * fix typo in __call__ of pipeline_stable_diffusion_3_img2img.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-21 08:45:48 +05:30

2361 Commits