sayakpaul
93eee19d50
up
2025-09-24 10:35:16 +05:30
Yao Matrix
08c29020dd
fix marigold ut case fail on xpu ( #12350 )
...
Signed-off-by: Yao, Matrix <matrix.yao@intel.com >
2025-09-24 09:32:06 +05:30
Yao Matrix
7a58734994
xpu enabling for 4 cases ( #12345 )
...
Signed-off-by: Yao, Matrix <matrix.yao@intel.com >
2025-09-24 09:31:45 +05:30
Sayak Paul
9ef118509e
[tests] disable xformer tests for pipelines it isn't popular. ( #12277 )
...
disable xformer tests for pipelines it isn't popular.
2025-09-24 09:02:25 +05:30
Dhruv Nair
7c54a7b38a
Fix Custom Code loading ( #12378 )
...
* update
* update
* update
2025-09-24 08:53:41 +05:30
Sayak Paul
09e777a3e1
[tests] Single scheduler in lora tests ( #12315 )
...
* single scheduler please.
* up
* up
* up
2025-09-24 08:36:50 +05:30
Steven Liu
a72bc0c4bb
[docs] Attention backends ( #12320 )
...
* init
* feedback
* update
* feedback
* fixes
2025-09-23 10:59:46 -07:00
Dhruv Nair
80de641c1c
Allow Automodel to support custom model code ( #12353 )
...
* update
* update
2025-09-23 07:31:42 -10:00
Steven Liu
76810eca2b
[docs] Schedulers ( #12246 )
...
* init
* toctree
* scheduler suggestions
* toctree
2025-09-23 10:29:16 -07:00
SahilCarterr
1448b03585
[Fix] chroma docs ( #12360 )
...
* Fixes chroma docs
* fix docs
fixed docs are now consistent
2025-09-22 13:04:13 -07:00
Sayak Paul
5796735015
add test and doc for QwenImageEdit Plus ( #12363 )
...
* up
* xfail some tests
* up
* up
2025-09-22 21:57:30 +05:30
Sayak Paul
d8310a8fca
[lora] factor out the overlaps in save_lora_weights(). ( #12027 )
...
* factor out the overlaps in save_lora_weights().
* remove comment.
* remove comment.
* up
* fix-copies
2025-09-22 15:14:39 +05:30
SahilCarterr
78031c2938
[Fix] enable_xformers_memory_efficient_attention() in Flux Pipeline ( #12337 )
...
* FIxes enable_xformers_memory_efficient_attention()
* Update attention.py
2025-09-22 12:37:41 +05:30
Chen Mingyi
d83d35c1bb
Fix bug with VAE slicing in autoencoder_dc.py ( #12343 )
2025-09-22 12:25:34 +05:30
Sayak Paul
843355f89f
[tests] xfail some kandinsky tests. ( #12364 )
...
xfail some kandinsky tests.
2025-09-22 11:17:47 +05:30
Jason Cox
c006a95df1
Fix example server install instructions ( #12362 )
...
* Upgrade huggingface-hub to version 0.35.0
Updated huggingface-hub version from 0.26.1 to 0.35.0.
* Add uvicorn and accelerate to requirements
* Fix install instructions for server
2025-09-22 08:37:17 +05:30
naykun
df267ee4e8
feat: Add QwenImageEditPlus to support future feature upgrades ( #12357 )
...
* feat: add support of qwenimageeditplus
* add copies statement
* fix copies statement
* remove vl_processor reference
2025-09-21 06:10:52 -10:00
Dhruv Nair
edd614ea38
[CI] Fix TRANSFORMERS_FLAX_WEIGHTS_NAME import issue ( #12354 )
...
update
2025-09-20 09:01:40 +05:30
Dave Lage
7e7e62c6ff
Convert alphas for embedders for sd-scripts to ai toolkit conversion ( #12332 )
...
* Convert alphas for embedders for sd-scripts to ai toolkit conversion
* Add kohya embedders conversion test
* Apply style fixes
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-09-18 12:41:22 +05:30
Fredy
eda9ff8300
Add RequestScopedPipeline for safe concurrent inference, tokenizer lock and non-mutating retrieve_timesteps ( #12328 )
...
* Basic implementation of request scheduling
* Basic editing in SD and Flux Pipelines
* Small Fix
* Fix
* Update for more pipelines
* Add examples/server-async
* Add examples/server-async
* Updated RequestScopedPipeline to handle a single tokenizer lock to avoid race conditions
* Fix
* Fix _TokenizerLockWrapper
* Fix _TokenizerLockWrapper
* Delete _TokenizerLockWrapper
* Fix tokenizer
* Update examples/server-async
* Fix server-async
* Optimizations in examples/server-async
* We keep the implementation simple in examples/server-async
* Update examples/server-async/README.md
* Update examples/server-async/README.md for changes to tokenizer locks and backward-compatible retrieve_timesteps
* The changes to the diffusers core have been undone and all logic is being moved to exmaples/server-async
* Update examples/server-async/utils/*
* Fix BaseAsyncScheduler
* Rollback in the core of the diffusers
* Update examples/server-async/README.md
* Complete rollback of diffusers core files
* Simple implementation of an asynchronous server compatible with SD3-3.5 and Flux Pipelines
* Update examples/server-async/README.md
* Fixed import errors in 'examples/server-async/serverasync.py'
* Flux Pipeline Discard
* Update examples/server-async/README.md
* Apply style fixes
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-09-18 11:33:43 +05:30
DefTruth
efb7a299af
Fix many type hint errors ( #12289 )
...
* fix hidream type hint
* fix hunyuan-video type hint
* fix many type hint
* fix many type hint errors
* fix many type hint errors
* fix many type hint errors
* make stype & make quality
2025-09-16 18:52:15 -10:00
Zijian Zhou
d06750a5fd
Fix autoencoder_kl_wan.py bugs for Wan2.2 VAE ( #12335 )
...
* Update autoencoder_kl_wan.py
When using the Wan2.2 VAE, the spatial compression ratio calculated here is incorrect. It should be 16 instead of 8. Pass it in directly via the config to ensure it’s correct here.
* Update autoencoder_kl_wan.py
2025-09-16 13:43:15 -10:00
Sari Hleihil
8c72cd12ee
Added LucyEditPipeline ( #12340 )
...
* Added LucyEditPipeline
* add import & stype
missing copied from
* Fix example doc string
---------
Co-authored-by: yiyixuxu <yixu310@gmail.com >
2025-09-16 13:41:05 -10:00
Samarth Agrawal
751e250f70
fixed bug in defining embed dim for UNet1D ( #12111 )
...
* fixed bug in defining embed dim
* matched 1d temb process to 2d
* Update src/diffusers/models/unets/unet_1d.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-09-16 12:18:48 +05:30
Linoy Tsaban
b50014067d
Add Wan2.2 VACE - Fun ( #12324 )
...
* support Wan2.2-VACE-Fun-A14B
* support Wan2.2-VACE-Fun-A14B
* support Wan2.2-VACE-Fun-A14B
* Apply style fixes
* test
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-09-15 21:31:26 +05:30
Daniel Socek
f5c113e439
Use SDP on BF16 in GPU/HPU migration ( #12310 )
...
* Use SDP on BF16 in GPU/HPU migration
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
* Formatting fix for enabling SDP with BF16 precision on HPU
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
---------
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
2025-09-12 08:00:36 -10:00
Sayak Paul
5e181eddfe
Deprecate slicing and tiling methods from DiffusionPipeline ( #12271 )
...
* deprecate slicing from flux pipeline.
* propagate.
* tiling
* up
* up
2025-09-11 10:04:35 +05:30
Justin Ruan
55f0b3d758
Fix AttributeError of VisualClozeProcessor ( #12121 )
...
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-09-11 04:17:34 +05:30
Sayak Paul
eb7ef26736
[quant] allow components_to_quantize to be a non-list for single components ( #12234 )
...
* allow non list components_to_quantize.
* up
* Apply suggestions from code review
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* [docs] components_to_quantize (#12287 )
init
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-09-10 09:47:08 -10:00
ttio2tech
e1b7f1f240
fix for the qwen controlnet pipeline - wrong device can be used ( #12309 )
...
fix the device for textencoder
2025-09-10 08:59:08 -10:00
Sayak Paul
9e7ae568d6
[feat] cache allocator warmup for from_single_model ( #12305 )
...
* add
* add a test
2025-09-10 12:55:32 +05:30
Sayak Paul
f7b79452b4
[modular] fix flux modular pipelines for t2i and i2i ( #12272 )
...
fix flux modular pipelines for t2i and i2i
2025-09-10 12:39:55 +05:30
Sayak Paul
43459079ab
[core] feat: support group offloading at the pipeline level ( #12283 )
...
* feat: support group offloading at the pipeline level.
* add tests
* up
* [docs] Pipeline group offloading (#12286 )
init
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-09-10 09:09:57 +05:30
kaixuanliu
4067d6c4b6
adjust criteria for marigold-intrinsics example on XPU ( #12290 )
...
adjust criteria for XPU
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2025-09-10 03:06:03 +05:30
calcuis
28106fcac4
gguf new quant type support (with demo) ( #12076 )
...
* Update utils.py
not perfect but works
engine:
https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/quant2c.py
inference example(s):
https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/k6.py
https://github.com/calcuis/gguf-connector/blob/main/src/gguf_connector/k5.py
gguf file sample(s):
https://huggingface.co/calcuis/kontext-gguf/tree/main
https://huggingface.co/calcuis/krea-gguf/tree/main
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-09-09 17:10:21 +05:30
Leo Jiang
c222570a9b
DeepSpeed adaption for flux-kontext ( #12240 )
...
Co-authored-by: J石页 <jiangshuo9@h-partners.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-09-09 12:58:08 +05:30
Frank (Haofan) Wang
4e36bb0d23
Support ControlNet-Inpainting for Qwen-Image ( #12301 )
...
* add qwen-image-cn-inpaint
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com >
2025-09-08 14:59:26 -10:00
YiYi Xu
f50b18eec7
[Modular] Qwen ( #12220 )
...
* add qwen modular
2025-09-08 00:27:02 -10:00
Steven Liu
fc337d5853
[docs] Models ( #12248 )
...
* init
* fix
* feedback
* feedback
2025-09-05 11:52:09 -07:00
Steven Liu
32798bf242
[docs] Inference section cleanup ( #12281 )
...
init
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-09-05 09:34:37 -07:00
Steven Liu
c2e5ece08b
[docs] Sharing pipelines/models ( #12280 )
...
init
2025-09-04 11:43:47 -07:00
co63oc
764b62473a
fix some typos ( #12265 )
...
Signed-off-by: co63oc <co63oc@users.noreply.github.com >
2025-09-03 21:28:24 +05:30
Ju Hoon Park
6682956333
Add AttentionMixin to WanVACETransformer3DModel ( #12268 )
...
* Add AttentionMixin to WanVACETransformer3DModel
to enable methods like `set_attn_processor()`.
* Import AttentionMixin in transformer_wan_vace.py
Special thanks to @tolgacangoz 🙇♂️
2025-09-03 15:05:41 +05:30
Sayak Paul
ffc8c0c1e1
[tests] feat: add AoT compilation tests ( #12203 )
...
* feat: add a test for aot.
* up
2025-09-03 11:15:27 +05:30
Ishan Modi
4acbfbf13b
[Quantization] Add TRT-ModelOpt as a Backend ( #11173 )
...
* initial commit
* update
* updates
* update
* update
* update
* update
* update
* update
* addressed PR comments
* update
* addressed PR comments
* update
* update
* update
* update
* update
* update
* updates
* update
* update
* addressed PR comments
* updates
* code formatting
* update
* addressed PR comments
* addressed PR comments
* addressed PR comments
* addressed PR comments
* fix docs and dependencies
* fixed dependency test
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-09-03 10:14:52 +05:30
Steven Liu
6549b04ec6
[docs] AutoPipeline ( #12160 )
...
* refresh
* feedback
* feedback
* supported models
* fix
2025-09-02 21:06:26 -07:00
Sayak Paul
130fd8df54
[core] use kernels to support _flash_3_hub attention backend ( #12236 )
...
* feat: try loading fa3 using kernels when available.
* up
* change to Hub.
* up
* up
* up
* switch env var.
* up
* up
* up
* up
* up
* up
2025-09-03 08:48:07 +05:30
Dhruv Nair
bcd4d77ba6
[CI] Remove big accelerator requirements from Quanto Tests ( #12266 )
...
update
2025-09-03 08:29:31 +05:30
Linoy Tsaban
006d092751
[Flux LoRA] fix for prior preservation and mixed precision sampling, follow up on #11873 ( #12264 )
...
* propagate fixes from https://github.com/huggingface/diffusers/pull/11873/ to flux script
* propagate fixes from https://github.com/huggingface/diffusers/pull/11873/ to flux script
* propagate fixes from https://github.com/huggingface/diffusers/pull/11873/ to flux script
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-09-02 11:30:33 +03:00
Ziheng Zhang
9e4a75b142
[docs] Fix VAE scale factor calculation in distributed inference docs ( #12259 )
...
docs: Fix VAE scale factor calculation
2025-09-01 16:34:16 -10:00