diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
YiYi Xu	0021bfa1e1	support Wan-FLF2V (#11353 ) * update transformer --------- Co-authored-by: Aryan <aryan@huggingface.co>	2025-04-18 10:27:50 -10:00
Sayak Paul	b00a564dac	[docs] add note about use_duck_shape in auraflow docs. (#11348 ) add note about use_duck_shape in auraflow docs.	2025-04-17 10:25:39 +05:30
Sayak Paul	efc9d68b15	[chore] fix lora docs utils (#11338 ) fix lora docs utils	2025-04-17 09:25:53 +05:30
Ishan Modi	d63e6fccb1	[BUG] fixed _toctree.yml alphabetical ordering (#11277 ) update	2025-04-16 09:04:22 -07:00
Sayak Paul	ce1063acfa	[docs] add a snippet for compilation in the auraflow docs. (#11327 ) * add a snippet for compilation in the auraflow docs. * include speedups.	2025-04-16 11:12:09 +05:30
Hameer Abbasi	9352a5ca56	[LoRA] Add LoRA support to AuraFlow (#10216 ) * Add AuraFlowLoraLoaderMixin * Add comments, remove qkv fusion * Add Tests * Add AuraFlowLoraLoaderMixin to documentation * Add Suggested changes * Change attention_kwargs->joint_attention_kwargs * Rebasing derp. * fix * fix * Quality fixes. * make style * `make fix-copies` * `ruff check --fix` * Attept 1 to fix tests. * Attept 2 to fix tests. * Attept 3 to fix tests. * Address review comments. * Rebasing derp. * Get more tests passing by copying from Flux. Address review comments. * `joint_attention_kwargs`->`attention_kwargs` * Add `lora_scale` property for te LoRAs. * Make test better. * Remove useless property. * Skip TE-only tests for AuraFlow. * Support LoRA for non-CLIP TEs. * Restore LoRA tests. * Undo adding LoRA support for non-CLIP TEs. * Undo support for TE in AuraFlow LoRA. * `make fix-copies` * Sync with upstream changes. * Remove unneeded stuff. * Mirror `Lumina2`. * Skip for MPS. * Address review comments. * Remove duplicated code. * Remove unnecessary code. * Remove repeated docs. * Propagate attention. * Fix TE target modules. * MPS fix for LoRA tests. * Unrelated TE LoRA tests fix. * Fix AuraFlow LoRA tests by applying to the right denoiser layers. Co-authored-by: AstraliteHeart <81396681+AstraliteHeart@users.noreply.github.com> * Apply style fixes * empty commit * Fix the repo consistency issues. * Remove unrelated changes. * Style. * Fix `test_lora_fuse_nan`. * fix quality issues. * `pytest.xfail` -> `ValueError`. * Add back `skip_mps`. * Apply style fixes * `make fix-copies` --------- Co-authored-by: Warlord-K <warlordk28@gmail.com> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: AstraliteHeart <81396681+AstraliteHeart@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-15 10:41:28 +05:30
Sayak Paul	cefa28f449	[docs] Promote `AutoModel` usage (#11300 ) * docs: promote the usage of automodel. * bitsandbytes * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-04-15 09:25:40 +05:30
Beinsezii	8819cda6c0	Add `skrample` section to `community_projects.md` (#11319 ) Update community_projects.md https://github.com/huggingface/diffusers/discussions/11158#discussioncomment-12681691	2025-04-14 12:12:59 -10:00
Ishan Modi	f1f38ffbee	[ControlNet] Adds controlnet for SanaTransformer (#11040 ) * added controlnet for sana transformer * improve code quality * addressed PR comments * bug fixes * added test cases * update * added dummy objects * addressed PR comments * update * Forcing update * add to docs * code quality * addressed PR comments * addressed PR comments * update * addressed PR comments * added proper styling * update * Revert "added proper styling" This reverts commit `344ee8a701`. * manually ordered * Apply suggestions from code review --------- Co-authored-by: Aryan <contact.aryanvs@gmail.com>	2025-04-13 19:19:39 +05:30
Adrien B	ed41db8525	Update autoencoderkl_allegro.md (#11303 ) Correction typo	2025-04-13 09:41:30 +05:30
hlky	0ef29355c9	HiDream Image (#11231 ) * HiDream Image --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-04-11 06:31:34 -10:00
hlky	552cd32058	[docs] AutoModel (#11250 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-09 16:42:23 +05:30
Sayak Paul	f685981ed0	[docs] minor updates to dtype map docs. (#11237 ) minor updates to dtype map docs.	2025-04-09 08:38:17 +05:30
Sayak Paul	b924251dd8	minor update to sana sprint docs. (#11236 )	2025-04-09 08:17:45 +05:30
Sayak Paul	4b27c4a494	[feat] implement `record_stream` when using CUDA streams during group offloading (#11081 ) * implement record_stream for better performance. * fix * style. * merge #11097 * Update src/diffusers/hooks/group_offloading.py Co-authored-by: Aryan <aryan@huggingface.co> * fixes * docstring. * remaining todos in low_cpu_mem_usage * tests * updates to docs. --------- Co-authored-by: Aryan <aryan@huggingface.co>	2025-04-08 21:17:49 +05:30
Benjamin Bossan	fb54499614	[LoRA] Implement hot-swapping of LoRA (#9453 ) * [WIP][LoRA] Implement hot-swapping of LoRA This PR adds the possibility to hot-swap LoRA adapters. It is WIP. Description As of now, users can already load multiple LoRA adapters. They can offload existing adapters or they can unload them (i.e. delete them). However, they cannot "hotswap" adapters yet, i.e. substitute the weights from one LoRA adapter with the weights of another, without the need to create a separate LoRA adapter. Generally, hot-swapping may not appear not super useful but when the model is compiled, it is necessary to prevent recompilation. See #9279 for more context. Caveats To hot-swap a LoRA adapter for another, these two adapters should target exactly the same layers and the "hyper-parameters" of the two adapters should be identical. For instance, the LoRA alpha has to be the same: Given that we keep the alpha from the first adapter, the LoRA scaling would be incorrect for the second adapter otherwise. Theoretically, we could override the scaling dict with the alpha values derived from the second adapter's config, but changing the dict will trigger a guard for recompilation, defeating the main purpose of the feature. I also found that compilation flags can have an impact on whether this works or not. E.g. when passing "reduce-overhead", there will be errors of the type: > input name: arg861_1. data pointer changed from 139647332027392 to 139647331054592 I don't know enough about compilation to determine whether this is problematic or not. Current state This is obviously WIP right now to collect feedback and discuss which direction to take this. If this PR turns out to be useful, the hot-swapping functions will be added to PEFT itself and can be imported here (or there is a separate copy in diffusers to avoid the need for a min PEFT version to use this feature). Moreover, more tests need to be added to better cover this feature, although we don't necessarily need tests for the hot-swapping functionality itself, since those tests will be added to PEFT. Furthermore, as of now, this is only implemented for the unet. Other pipeline components have yet to implement this feature. Finally, it should be properly documented. I would like to collect feedback on the current state of the PR before putting more time into finalizing it. * Reviewer feedback * Reviewer feedback, adjust test * Fix, doc * Make fix * Fix for possible g++ error * Add test for recompilation w/o hotswapping * Make hotswap work Requires https://github.com/huggingface/peft/pull/2366 More changes to make hotswapping work. Together with the mentioned PEFT PR, the tests pass for me locally. List of changes: - docstring for hotswap - remove code copied from PEFT, import from PEFT now - adjustments to PeftAdapterMixin.load_lora_adapter (unfortunately, some state dict renaming was necessary, LMK if there is a better solution) - adjustments to UNet2DConditionLoadersMixin._process_lora: LMK if this is even necessary or not, I'm unsure what the overall relationship is between this and PeftAdapterMixin.load_lora_adapter - also in UNet2DConditionLoadersMixin._process_lora, I saw that there is no LoRA unloading when loading the adapter fails, so I added it there (in line with what happens in PeftAdapterMixin.load_lora_adapter) - rewritten tests to avoid shelling out, make the test more precise by making sure that the outputs align, parametrize it - also checked the pipeline code mentioned in this comment: https://github.com/huggingface/diffusers/pull/9453#issuecomment-2418508871; when running this inside the with torch._dynamo.config.patch(error_on_recompile=True) context, there is no error, so I think hotswapping is now working with pipelines. * Address reviewer feedback: - Revert deprecated method - Fix PEFT doc link to main - Don't use private function - Clarify magic numbers - Add pipeline test Moreover: - Extend docstrings - Extend existing test for outputs != 0 - Extend existing test for wrong adapter name * Change order of test decorators parameterized.expand seems to ignore skip decorators if added in last place (i.e. innermost decorator). * Split model and pipeline tests Also increase test coverage by also targeting conv2d layers (support of which was added recently on the PEFT PR). * Reviewer feedback: Move decorator to test classes ... instead of having them on each test method. * Apply suggestions from code review Co-authored-by: hlky <hlky@hlky.ac> * Reviewer feedback: version check, TODO comment * Add enable_lora_hotswap method * Reviewer feedback: check _lora_loadable_modules * Revert changes in unet.py * Add possibility to ignore enabled at wrong time * Fix docstrings * Log possible PEFT error, test * Raise helpful error if hotswap not supported I.e. for the text encoder * Formatting * More linter * More ruff * Doc-builder complaint * Update docstring: - mention no text encoder support yet - make it clear that LoRA is meant - mention that same adapter name should be passed * Fix error in docstring * Update more methods with hotswap argument - SDXL - SD3 - Flux No changes were made to load_lora_into_transformer. * Add hotswap argument to load_lora_into_transformer For SD3 and Flux. Use shorter docstring for brevity. * Extend docstrings * Add version guards to tests * Formatting * Fix LoRA loading call to add prefix=None See: https://github.com/huggingface/diffusers/pull/10187#issuecomment-2717571064 * Run make fix-copies * Add hot swap documentation to the docs * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-04-08 17:05:31 +05:30
Steven Liu	fc7a867ae5	[docs] MPS update (#11212 ) mps	2025-04-07 14:32:27 -10:00
Tolga Cangöz	13e48492f0	[LTX0.9.5] Refactor `LTXConditionPipeline` for text-only conditioning (#11174 ) * Refactor `LTXConditionPipeline` to add text-only conditioning * style * up * Refactor `LTXConditionPipeline` to streamline condition handling and improve clarity * Improve condition checks * Simplify latents handling based on conditioning type * Refactor rope_interpolation_scale preparation for clarity and efficiency * Update LTXConditionPipeline docstring to clarify supported input types * Add LTX Video 0.9.5 model to documentation * Clarify documentation to indicate support for text-only conditioning without passing `conditions` * refactor: comment out unused parameters in LTXConditionPipeline * fix: restore previously commented parameters in LTXConditionPipeline * fix: remove unused parameters from LTXConditionPipeline * refactor: remove unnecessary lines in LTXConditionPipeline	2025-04-04 16:43:15 +02:00
hlky	e5c6027ef8	[docs] `torch_dtype` map (#11194 )	2025-04-02 12:46:28 +01:00
Dhruv Nair	df1d7b01f1	[WIP] Add Wan Video2Video (#11053 ) * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update	2025-04-01 17:22:11 +05:30
Mark	eb50defff2	[Docs] Fix environment variables in `installation.md` (#11179 )	2025-03-31 09:15:25 -07:00
Dhruv Nair	617c208bb4	[Docs] Update Wan Docs with memory optimizations (#11089 ) * update * update	2025-03-28 19:05:56 +05:30
Aryan	1ddf3f3a19	Improve information about group offloading and layerwise casting (#11101 ) * update * Update docs/source/en/optimization/memory.md * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * apply review suggestions * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-24 23:25:59 +05:30
Jun Yeop Na	7aac77affa	[doc] Fix Korean Controlnet Train doc (#11141 ) * remove typo from korean controlnet train doc * removed more paragraphs to remain in sync with the english document	2025-03-24 09:38:21 -07:00
Aryan	8907a70a36	New HunyuanVideo-I2V (#11066 ) * update * update * update * add tests * update docs * raise value error * warning for true cfg and guidance scale * fix test	2025-03-24 21:18:40 +05:30
YiYi Xu	8a63aa5e4f	add sana-sprint (#11074 ) * add sana-sprint --------- Co-authored-by: Junsong Chen <cjs1020440147@icloud.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-03-21 06:21:18 -10:00
Aryan	844221ae4e	[core] FasterCache (#10163 ) * init * update * update * update * make style * update * fix * make it work with guidance distilled models * update * make fix-copies * add tests * update * apply_faster_cache -> apply_fastercache * fix * reorder * update * refactor * update docs * add fastercache to CacheMixin * update tests * Apply suggestions from code review * make style * try to fix partial import error * Apply style fixes * raise warning * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-21 09:35:04 +05:30
Aryan	2e83cbbb6d	LTX 0.9.5 (#10968 ) * update --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-03-17 16:43:36 -10:00
hlky	5551506b29	Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827 ) * Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-03-13 09:24:21 -10:00
hlky	733b44ac82	[hybrid inference 🍯🐝] Add VAE encode (#11017 ) * [hybrid inference 🍯🐝] Add VAE encode * _toctree: add vae encode * Add endpoints, tests * vae_encode docs * vae encode benchmarks * api reference * changelog * Update docs/source/en/hybrid_inference/overview.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-12 11:23:41 +00:00
Sayak Paul	e4b056fe65	[LoRA] support wan i2v loras from the world. (#11025 ) * support wan i2v loras from the world. * remove copied from. * upates * add lora.	2025-03-11 20:43:29 +05:30
Dhruv Nair	9add071592	[Quantization] Allow loading TorchAO serialized Tensor objects with torch>=2.6 (#11018 ) * update * update * update * update * update * update * update * update * update	2025-03-11 10:52:01 +05:30
Dhruv Nair	f5edaa7894	[Quantization] Add Quanto backend (#10756 ) * update * updaet * update * update * update * update * update * update * update * update * update * update * Update docs/source/en/quantization/quanto.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * Update src/diffusers/quantizers/quanto/utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-10 08:33:05 +05:30
Dhruv Nair	1357931d74	[Single File] Add single file support for Wan T2V/I2V (#10991 ) * update * update * update * update * update * update * update	2025-03-07 22:13:25 +05:30
Aryan	2e5203be04	Hunyuan I2V (#10983 ) * update * update * update * add tests * update * add model tests * update docs * update * update example * fix defaults * update	2025-03-07 12:52:48 +05:30
Sayak Paul	cc22058324	Update evaluation.md (#10938 ) * Update evaluation.md * Update docs/source/en/conceptual/evaluation.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-03-04 13:58:16 +05:30
Parag Ekbote	982f9b38d6	Add Example of IPAdapterScaleCutoffCallback to Docs (#10934 ) * Add example of Ip-Adapter-Callback. * Add image links from HF Hub.	2025-03-03 08:32:45 -08:00
Bubbliiiing	5e3b7d2d8a	Add EasyAnimateV5.1 text-to-video, image-to-video, control-to-video generation model (#10626 ) * Update EasyAnimate V5.1 * Add docs && add tests && Fix comments problems in transformer3d and vae * delete comments and remove useless import * delete process * Update EXAMPLE_DOC_STRING * rename transformer file * make fix-copies * make style * refactor pt. 1 * update toctree.yml * add model tests * Update layer_norm for norm_added_q and norm_added_k in Attention * Fix processor problem * refactor vae * Fix problem in comments * refactor tiling; remove einops dependency * fix docs path * make fix-copies * Update src/diffusers/pipelines/easyanimate/pipeline_easyanimate_control.py * update _toctree.yml * fix test * update * update * update * make fix-copies * fix tests --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-03 18:37:19 +05:30
hlky	fc4229a0c3	Add `remote_decode` to `remote_utils` (#10898 ) * Add `remote_decode` to `remote_utils` * test dependency * test dependency * dependency * dependency * dependency * docstrings * changes * make style * apply * revert, add new options * Apply style fixes * deprecate base64, headers not needed * address comments * add license header * init test_remote_decode * more * more test * more test * skeleton for xl, flux * more test * flux test * flux packed * no scaling * -save * hunyuanvideo test * Apply style fixes * init docs * Update src/diffusers/utils/remote_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * comments * Apply style fixes * comments * hybrid_inference/vae_decode * fix * tip? * tip * api reference autodoc * install tip --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-02 17:10:01 +00:00
YiYi Xu	2d8a41cae8	[Alibaba Wan Team] continue on #10921 Wan2.1 (#10922 ) * Add wanx pipeline, model and example * wanx_merged_v1 * change WanX into Wan * fix i2v fp32 oom error Link: https://code.alibaba-inc.com/open_wanx2/diffusers/codereview/20607813 * support t2v load fp32 ckpt * add example * final merge v1 * Update autoencoder_kl_wan.py * up * update middle, test up_block * up up * one less nn.sequential * up more * up * more * [refactor] [wip] Wan transformer/pipeline (#10926) * update * update * refactor rope * refactor pipeline * make fix-copies * add transformer test * update * update * make style * update tests * tests * conversion script * conversion script * update * docs * remove unused code * fix _toctree.yml * update dtype * fix test * fix tests: scale * up * more * Apply suggestions from code review * Apply suggestions from code review * style * Update scripts/convert_wan_to_diffusers.py * update docs * fix --------- Co-authored-by: Yitong Huang <huangyitong.hyt@alibaba-inc.com> Co-authored-by: 亚森 <wangjiayu.wjy@alibaba-inc.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-03-02 17:24:26 +05:30
Anton Obukhov	3fab6624fd	Marigold Update: v1-1 models, Intrinsic Image Decomposition pipeline, documentation (#10884 ) * minor documentation fixes of the depth and normals pipelines * update license headers * update model checkpoints in examples fix missing prediction_type in register_to_config in the normals pipeline * add initial marigold intrinsics pipeline update comments about num_inference_steps and ensemble_size minor fixes in comments of marigold normals and depth pipelines * update uncertainty visualization to work with intrinsics * integrate iid --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-02-25 14:13:02 -10:00
Dhruv Nair	87599691b9	[Docs] Fix toctree sorting (#10894 ) update	2025-02-24 10:05:32 -10:00
Aryan	64af74fc58	[docs] Add CogVideoX Schedulers (#10885 ) update	2025-02-24 07:02:59 -10:00
Steven Liu	db21c97043	[docs] Flux group offload (#10847 ) * flux group-offload * feedback	2025-02-24 08:47:08 -08:00
Steven Liu	3fdf173084	[docs] Update prompt weighting docs (#10843 ) * sd_embed * feedback	2025-02-24 08:46:26 -08:00
Steven Liu	64dec70e56	[docs] LoRA support (#10844 ) * lora * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-02-22 08:53:02 +05:30
SahilCarterr	85fcbaf314	[Fix] Docs overview.md (#10858 ) Fix docs	2025-02-21 08:03:22 -08:00
Aryan	e3bc4aab2e	SkyReels Hunyuan T2V & I2V (#10837 ) * update * make fix-copies * update * tests * update * update * add co-author Co-Authored-By: Langdx <82783347+Langdx@users.noreply.github.com> * add co-author Co-Authored-By: howe <howezhang2018@gmail.com> * update --------- Co-authored-by: Langdx <82783347+Langdx@users.noreply.github.com> Co-authored-by: howe <howezhang2018@gmail.com>	2025-02-21 06:48:15 +05:30
Daniel Regado	d9ee3879b0	SD3 IP-Adapter runtime checkpoint conversion (#10718 ) * Added runtime checkpoint conversion * Updated docs * Fix for quantized model	2025-02-20 10:35:57 -10:00
Sayak Paul	f550745a2b	[Utils] add utilities for checking if certain utilities are properly documented (#7763 ) * add; utility to check if attn_procs,norms,acts are properly documented. * add support listing to the workflows. * change to 2024. * small fixes. * does adding detailed docstrings help? * uncomment image processor check * quality * fix, thanks to @mishig. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style * JointAttnProcessor2_0 * fixes * fixes * fixes * fixes * fixes * fixes * Update docs/source/en/api/normalization.md Co-authored-by: hlky <hlky@hlky.ac> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-02-20 12:37:00 +05:30

1 2 3 4 5 ...

1010 Commits