diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Dhruv Nair	617c208bb4	[Docs] Update Wan Docs with memory optimizations (#11089 ) * update * update	2025-03-28 19:05:56 +05:30
hlky	5d970a4aa9	WanI2V encode_image (#11164 ) * WanI2V encode_image	2025-03-28 18:05:34 +05:30
kentdan3msu	de6a88c2d7	Set self._hf_peft_config_loaded to True when LoRA is loaded using `load_lora_adapter` in PeftAdapterMixin class (#11155 ) set self._hf_peft_config_loaded to True on successful lora load Sets the `_hf_peft_config_loaded` flag if a LoRA is successfully loaded in `load_lora_adapter`. Fixes bug huggingface/diffusers/issues/11148 Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-26 18:31:18 +01:00
Dhruv Nair	7dc52ea769	[Quantization] dtype fix for GGUF + fix BnB tests (#11159 ) * update * update * update * update	2025-03-26 22:22:16 +05:30
Junsong Chen	739d6ec731	add a timestep scale for sana-sprint teacher model (#11150 )	2025-03-25 08:47:39 -10:00
Aryan	1ddf3f3a19	Improve information about group offloading and layerwise casting (#11101 ) * update * Update docs/source/en/optimization/memory.md * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * apply review suggestions * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-24 23:25:59 +05:30
Jun Yeop Na	7aac77affa	[doc] Fix Korean Controlnet Train doc (#11141 ) * remove typo from korean controlnet train doc * removed more paragraphs to remain in sync with the english document	2025-03-24 09:38:21 -07:00
Aryan	8907a70a36	New HunyuanVideo-I2V (#11066 ) * update * update * update * add tests * update docs * raise value error * warning for true cfg and guidance scale * fix test	2025-03-24 21:18:40 +05:30
Junsong Chen	5dbe4f5de6	[fix SANA-Sprint] (#11142 ) * fix bug in sana conversion script; * add more model paths; --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-23 23:38:14 -10:00
Yuxuan Zhang	1d37f42055	Modify the implementation of retrieve_timesteps in CogView4-Control. (#11125 ) * 1 * change to channel 1 * cogview4 control training * add CacheMixin * 1 * remove initial_input_channels change for val * 1 * update * use 3.5 * new loss * 1 * use imagetoken * for megatron convert * 1 * train con and uc * 2 * remove guidance_scale * Update pipeline_cogview4_control.py * fix * use cogview4 pipeline with timestep * update shift_factor * remove the uncond * add max length * change convert and use GLMModel instead of GLMForCasualLM * fix * [cogview4] Add attention mask support to transformer model * [fix] Add attention mask for padded token * update * remove padding type * Update train_control_cogview4.py * resolve conflicts with #10981 * add control convert * use control format * fix * add missing import * update with cogview4 formate * make style * Update pipeline_cogview4_control.py * Update pipeline_cogview4_control.py * remove * Update pipeline_cogview4_control.py * put back * Apply style fixes --------- Co-authored-by: OleehyO <leehy0357@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-23 21:17:14 +05:30
Tolga Cangöz	0213179ba8	Update README and example code for AnyText usage (#11028 ) * [Documentation] Update README and example code with additional usage instructions for AnyText * [Documentation] Update README for AnyTextPipeline and improve logging in code * Remove wget command for font file from example docstring in anytext.py	2025-03-23 21:15:57 +05:30
hlky	a7d53a5939	Don't override `torch_dtype` and don't use when `quantization_config` is set (#11039 ) * Don't use `torch_dtype` when `quantization_config` is set * up * djkajka * Apply suggestions from code review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-21 21:58:38 +05:30
YiYi Xu	8a63aa5e4f	add sana-sprint (#11074 ) * add sana-sprint --------- Co-authored-by: Junsong Chen <cjs1020440147@icloud.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-03-21 06:21:18 -10:00
Aryan	844221ae4e	[core] FasterCache (#10163 ) * init * update * update * update * make style * update * fix * make it work with guidance distilled models * update * make fix-copies * add tests * update * apply_faster_cache -> apply_fastercache * fix * reorder * update * refactor * update docs * add fastercache to CacheMixin * update tests * Apply suggestions from code review * make style * try to fix partial import error * Apply style fixes * raise warning * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-21 09:35:04 +05:30
CyberVy	9b2c0a7dbe	fix _callback_tensor_inputs of sd controlnet inpaint pipeline missing some elements (#11073 ) * Update pipeline_controlnet_inpaint.py * Apply style fixes	2025-03-20 23:56:12 -03:00
Parag Ekbote	f424b1b062	Notebooks for Community Scripts-8 (#11128 ) Add 4 Notebooks and update the missing links for the example README.	2025-03-20 12:24:46 -07:00
YiYi Xu	e9fda3924f	remove F.rms_norm for now (#11126 ) up	2025-03-20 07:55:01 -10:00
Dhruv Nair	2c1ed50fc5	Provide option to reduce CPU RAM usage in Group Offload (#11106 ) * update * update * clean up	2025-03-20 17:01:09 +05:30
Fanli Lin	15ad97f782	[tests] make cuda only tests device-agnostic (#11058 ) * enable bnb on xpu * add 2 more cases * add missing change * add missing change * add one more * enable cuda only tests on xpu * enable big gpu cases	2025-03-20 10:12:35 +00:00
hlky	9f2d5c9ee9	Flux with Remote Encode (#11091 ) * Flux img2img remote encode * Flux inpaint * -copied from	2025-03-20 09:44:08 +00:00
Junsong Chen	dc62e6931e	[fix bug] PixArt inference_steps=1 (#11079 ) * fix bug when pixart-dmd inference with `num_inference_steps=1` * use return_dict=False and return [1] element for 1-step pixart model, which works for both lcm and dmd	2025-03-20 07:44:30 +00:00
Fanli Lin	56f740051d	[tests] enable bnb tests on xpu (#11001 ) * enable bnb on xpu * add 2 more cases * add missing change * add missing change * add one more	2025-03-19 16:33:11 +00:00
Linoy Tsaban	a34d97cef0	[Wan LoRAs] make T2V LoRAs compatible with Wan I2V (#11107 ) * @hlky t2v->i2v * Apply style fixes * try with ones to not nullify layers * fix method name * revert to zeros * add check to state_dict keys * add comment * copies fix * Revert "copies fix" This reverts commit `051f534d18`. * remove copied from * Update src/diffusers/loaders/lora_pipeline.py Co-authored-by: hlky <hlky@hlky.ac> * Update src/diffusers/loaders/lora_pipeline.py Co-authored-by: hlky <hlky@hlky.ac> * update * update * Update src/diffusers/loaders/lora_pipeline.py Co-authored-by: hlky <hlky@hlky.ac> * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Linoy <linoy@hf.co> Co-authored-by: hlky <hlky@hlky.ac>	2025-03-19 21:44:19 +05:30
Yuqian Hong	fc28791fc8	[BUG] Fix Autoencoderkl train script (#11113 ) * add disc_optimizer step (not fix) * support syncbatchnorm in discriminator	2025-03-19 16:49:02 +05:30
Sayak Paul	ae14612673	[CI] uninstall deps properly from pr gpu tests. (#11102 ) uninstall deps properly from pr gpu tests.	2025-03-19 08:58:36 +05:30
hlky	0ab8fe49bf	Quality options in `export_to_video` (#11090 ) * Quality options in `export_to_video` * make style	2025-03-18 10:32:33 -10:00
Aryan	3be6706018	Fix Group offloading behaviour when using streams (#11097 ) * update * update	2025-03-18 14:44:10 +05:30
Cheng Jin	cb1b8b21b8	Resolve stride mismatch in UNet's ResNet to support Torch DDP (#11098 ) Modify UNet's ResNet implementation to resolve stride mismatch in Torch's DDP	2025-03-18 07:38:13 +00:00
Juan Acevedo	27916822b2	update readme instructions. (#11096 ) Co-authored-by: Juan Acevedo <jfacevedo@google.com>	2025-03-17 20:07:48 -10:00
co63oc	3fe3bc0642	Fix pipeline_flux_controlnet.py (#11095 ) * Fix pipeline_flux_controlnet.py * Fix style	2025-03-17 19:52:15 -10:00
Aryan	813d42cc96	Group offloading improvements (#11094 ) update	2025-03-18 11:18:00 +05:30
Sayak Paul	b4d7e9c632	make PR GPU tests conditioned on styling. (#11099 )	2025-03-18 11:15:35 +05:30
Aryan	2e83cbbb6d	LTX 0.9.5 (#10968 ) * update --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-03-17 16:43:36 -10:00
C	33d10af28f	Fix Wan I2V Quality (#11087 ) * fix_wan_i2v_quality * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update pipeline_wan_i2v.py --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-03-17 06:24:57 -10:00
Sayak Paul	100142586f	[CI] pin transformers version for benchmarking. (#11067 ) pin transformers version for benchmarking.	2025-03-16 10:27:35 +05:30
Yuxuan Zhang	82188cef04	CogView4 Control Block (#10809 ) * cogview4 control training --------- Co-authored-by: OleehyO <leehy0357@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2025-03-15 07:15:56 -10:00
Sayak Paul	cc19726f3d	[Tests] add requires peft decorator. (#11037 ) * add requires peft decorator. * install peft conditionally. * conditional deps. Co-authored-by: DN6 <dhruv.nair@gmail.com> --------- Co-authored-by: DN6 <dhruv.nair@gmail.com>	2025-03-15 12:56:41 +05:30
Dimitri Barbot	be54a95b93	Fix deterministic issue when getting pipeline dtype and device (#10696 ) Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-15 07:50:58 +05:30
Juan Acevedo	6b9a3334db	reverts accidental change that removes attn_mask in attn. Improves fl… (#11065 ) reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop. Co-authored-by: Juan Acevedo <jfacevedo@google.com>	2025-03-14 12:47:01 -10:00
Andreas Jörg	8ead643bb7	[examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch (#11051 ) Fix: dtype mismatch of prompt embeddings in sd3 controlnet training Co-authored-by: Andreas Jörg <andreasjoerg@MacBook-Pro-von-Andreas-2.fritz.box> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-14 17:33:15 +05:30
Sayak Paul	124ac3e81f	[LoRA] feat: support non-diffusers wan t2v loras. (#11059 ) feat: support non-diffusers wan t2v loras.	2025-03-14 16:01:25 +05:30
Sayak Paul	2f0f281b0d	[Tests] restrict memory tests for quanto for certain schemes. (#11052 ) * restrict memory tests for quanto for certain schemes. * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fixes * style --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-14 10:35:19 +05:30
ZhengKai91	ccc8321651	Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed (#10820 ) * get_1d_rotary_pos_embed support npu * Update src/diffusers/models/embeddings.py --------- Co-authored-by: Kai zheng <kaizheng@KaideMacBook-Pro.local> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-03-13 09:58:03 -10:00
Yaniv Galron	5e48cd27d4	making ```formatted_images``` initialization compact (#10801 ) compact writing Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-03-13 09:27:14 -10:00
hlky	5551506b29	Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827 ) * Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-03-13 09:24:21 -10:00
Sayak Paul	20e4b6a628	[LoRA] change to warning from info when notifying the users about a LoRA no-op (#11044 ) * move to warning. * test related changes.	2025-03-12 21:20:48 +05:30
hlky	4ea9f89b8e	Wan Pipeline scaling fix, type hint warning, multi generator fix (#11007 ) * Wan Pipeline scaling fix, type hint warning, multi generator fix * Apply suggestions from code review	2025-03-12 12:05:52 +00:00
hlky	733b44ac82	[hybrid inference 🍯🐝] Add VAE encode (#11017 ) * [hybrid inference 🍯🐝] Add VAE encode * _toctree: add vae encode * Add endpoints, tests * vae_encode docs * vae encode benchmarks * api reference * changelog * Update docs/source/en/hybrid_inference/overview.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-12 11:23:41 +00:00
hlky	8b4f8ba764	Use `output_size` in `repeat_interleave` (#11030 )	2025-03-12 07:30:21 +00:00
Dhruv Nair	5428046437	[Refactor] Clean up import utils boilerplate (#11026 ) * update * update * update	2025-03-12 07:48:34 +05:30

5274 Commits