Inigo Goiri
841504bb1a
Add support to pass image embeddings to the WAN I2V pipeline. ( #11175 )
...
* Add support to pass image embeddings to the pipeline.
---------
Co-authored-by: hlky <hlky@hlky.ac>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
2025-04-07 15:47:06 -10:00
Steven Liu
fc7a867ae5
[docs] MPS update ( #11212 )
...
mps
2025-04-07 14:32:27 -10:00
alex choi
5ded26cdc7
ensure dtype match between diffused latents and vae weights ( #8391 )
2025-04-07 12:59:10 -10:00
Yao Matrix
506f39af3a
enable 1 case on XPU ( #11219 )
...
enable case on XPU: 1. tests/quantization/bnb/test_mixed_int8.py::BnB8bitTrainingTests::test_training
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
2025-04-07 08:24:21 +01:00
Mikko Tukiainen
8ad68c1393
Add missing MochiEncoder3D.gradient_checkpointing attribute ( #11146 )
...
* Add missing 'gradient_checkpointing = False' attr
* Add (limited) tests for Mochi autoencoder
* Apply style fixes
* pass 'conv_cache' as arg instead of kwarg
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-06 02:46:45 +05:30
Edna
41afb6690c
Add Wan with STG as a community pipeline ( #11184 )
...
* Add stg wan to community pipelines
* remove debug prints
* remove unused comment
* Update doc
* Add credit + fix typo
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-05 04:00:40 +02:00
Tolga Cangöz
13e48492f0
[LTX0.9.5] Refactor LTXConditionPipeline for text-only conditioning ( #11174 )
...
* Refactor `LTXConditionPipeline` to add text-only conditioning
* style
* up
* Refactor `LTXConditionPipeline` to streamline condition handling and improve clarity
* Improve condition checks
* Simplify latents handling based on conditioning type
* Refactor rope_interpolation_scale preparation for clarity and efficiency
* Update LTXConditionPipeline docstring to clarify supported input types
* Add LTX Video 0.9.5 model to documentation
* Clarify documentation to indicate support for text-only conditioning without passing `conditions`
* refactor: comment out unused parameters in LTXConditionPipeline
* fix: restore previously commented parameters in LTXConditionPipeline
* fix: remove unused parameters from LTXConditionPipeline
* refactor: remove unnecessary lines in LTXConditionPipeline
2025-04-04 16:43:15 +02:00
Suprhimp
94f2c48d58
[feat]Add strength in flux_fill pipeline (denoising strength for fluxfill) ( #10603 )
...
* [feat]add strength in flux_fill pipeline
* Update src/diffusers/pipelines/flux/pipeline_flux_fill.py
* Update src/diffusers/pipelines/flux/pipeline_flux_fill.py
* Update src/diffusers/pipelines/flux/pipeline_flux_fill.py
* [refactor] refactor after review
* [fix] change comment
* Apply style fixes
* empty
* fix
* update prepare_latents from flux.img2img pipeline
* style
* Update src/diffusers/pipelines/flux/pipeline_flux_fill.py
---------
2025-04-04 11:23:30 -03:00
Dhruv Nair
aabf8ce20b
Fix Single File loading for LTX VAE ( #11200 )
...
update
2025-04-04 18:02:39 +05:30
Kenneth Gerald Hamilton
f10775b1b5
Fixed requests.get function call by adding timeout parameter. ( #11156 )
...
* Fixed requests.get function call by adding timeout parameter.
* declare DIFFUSERS_REQUEST_TIMEOUT in constants and import when needed
* remove unneeded os import
* Apply style fixes
---------
Co-authored-by: Sai-Suraj-27 <sai.suraj.27.729@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-04 07:23:14 +01:00
célina
6edb774b5e
Update Style Bot workflow ( #11202 )
...
update style bot workflow
2025-04-03 19:31:49 +02:00
Basile Lewandowski
480510ada9
Change KolorsPipeline LoRA Loader to StableDiffusion ( #11198 )
...
Change LoRA Loader to StableDiffusion
Replace the SDXL LoRA Loader Mixin inheritance with the StableDiffusion one
2025-04-03 11:21:11 -03:00
Abhipsha Das
d9023a671a
[Model Card] standardize advanced diffusion training sdxl lora ( #7615 )
...
* model card gen code
* push modelcard creation
* remove optional from params
* add import
* add use_dora check
* correct lora var use in tags
* make style && make quality
---------
Co-authored-by: Aryan <aryan@huggingface.co>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-04-03 07:43:01 +05:30
Eliseu Silva
c4646a3931
feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline ( #11188 )
...
* feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline for Image SR.
* added pipeline
2025-04-02 11:33:19 -10:00
Dhruv Nair
c97b709afa
Add CacheMixin to Wan and LTX Transformers ( #11187 )
...
* update
* update
* update
2025-04-02 10:16:31 -10:00
lakshay sharma
b0ff822ed3
Update import_utils.py ( #10329 )
...
added onnxruntime-vitisai for custom build onnxruntime pkg
2025-04-02 20:47:10 +01:00
hlky
78c2fdc52e
SchedulerMixin from_pretrained and ConfigMixin Self type annotation ( #11192 )
2025-04-02 08:24:02 -10:00
hlky
54dac3a87c
Fix enable_sequential_cpu_offload in CogView4Pipeline ( #11195 )
...
* Fix enable_sequential_cpu_offload in CogView4Pipeline
* make fix-copies
2025-04-02 16:51:23 +01:00
hlky
e5c6027ef8
[docs] torch_dtype map ( #11194 )
2025-04-02 12:46:28 +01:00
hlky
da857bebb6
Revert save_model in ModelMixin save_pretrained and use safe_serialization=False in test ( #11196 )
2025-04-02 12:45:36 +01:00
Fanli Lin
52b460feb9
[tests] HunyuanDiTControlNetPipeline inference precision issue on XPU ( #11197 )
...
* add xpu part
* fix more cases
* remove some cases
* no canny
* format fix
2025-04-02 12:45:02 +01:00
hlky
d8c617ccb0
allow models to run with a user-provided dtype map instead of a single dtype ( #10301 )
...
* allow models to run with a user-provided dtype map instead of a single dtype
* make style
* Add warning, change `_` to `default`
* make style
* add test
* handle shared tensors
* remove warning
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-04-02 09:05:46 +01:00
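To illustrate the dtype-map idea in #10301 above, here is a minimal sketch of a per-component lookup with a `default` entry. It assumes the map is keyed by component name, which the `_` to `default` rename suggests but this log does not confirm. The helper `resolve_dtype`, the component names, and the use of plain strings in place of real dtype objects are all hypothetical, and the library's own lookup may differ.

```python
def resolve_dtype(dtype_map: dict, component: str, fallback: str = "float32") -> str:
    """Pick a dtype for one component from a user-provided map (hypothetical helper)."""
    # A component-specific entry wins; otherwise use the "default" entry;
    # if neither is present, fall back to a fixed dtype.
    return dtype_map.get(component, dtype_map.get("default", fallback))


# Hypothetical map: one component overridden, everything else uses "default".
dtype_map = {"transformer": "bfloat16", "default": "float16"}
transformer_dtype = resolve_dtype(dtype_map, "transformer")
vae_dtype = resolve_dtype(dtype_map, "vae")
```

Passing a single dtype corresponds to a map that contains only the `default` entry.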
Bruno Magalhaes
fe2b397426
remove unnecessary call to F.pad ( #10620 )
...
* rewrite memory count without implicitly using dimensions by @ic-synth
* replace F.pad by built-in padding in Conv3D
* in-place sums to reduce memory allocations
* fixed trailing whitespace
* file reformatted
* in-place sums
* simpler in-place expressions
* removed in-place sum, may affect backward propagation logic
* removed in-place sum, may affect backward propagation logic
* removed in-place sum, may affect backward propagation logic
* reverted change
2025-04-02 08:19:51 +01:00
Eliseu Silva
be0b7f55cc
fix: for checking mandatory and optional pipeline components ( #11189 )
...
fix: optional components verification on load
2025-04-02 08:07:24 +01:00
jiqing-feng
4d5a96e40a
fix autocast ( #11190 )
...
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
2025-04-02 07:26:27 +01:00
Yao Matrix
a7f07c1ef5
map BACKEND_RESET_MAX_MEMORY_ALLOCATED to reset_peak_memory_stats on XPU ( #11191 )
...
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
2025-04-02 07:25:48 +01:00
Dhruv Nair
df1d7b01f1
[WIP] Add Wan Video2Video ( #11053 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-04-01 17:22:11 +05:30
Fanli Lin
5a6edac087
[tests] no hard-coded cuda ( #11186 )
...
no cuda only
2025-04-01 12:14:31 +01:00
kakukakujirori
e8fc8b1f81
Bug fix in LTXImageToVideoPipeline.prepare_latents() when latents is already set ( #10918 )
...
* Bug fix in ltx
* Assume packed latents.
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
2025-03-31 12:15:43 -10:00
hlky
d6f4774c1c
Add latents_mean and latents_std to SDXLLongPromptWeightingPipeline ( #11034 )
2025-03-31 11:32:29 -10:00
Mark
eb50defff2
[Docs] Fix environment variables in installation.md ( #11179 )
2025-03-31 09:15:25 -07:00
Aryan
2c59af7222
Raise warning and round down if Wan num_frames is not 4k + 1 ( #11167 )
...
* update
* raise warning and round to nearest multiple of scale factor
2025-03-31 13:33:28 +05:30
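To illustrate the rounding named in #11167 above, here is a minimal sketch that snaps a frame count down to the nearest value of the form 4k + 1 and emits a warning when it does so. The function name `round_num_frames` and the default `scale_factor=4` are assumptions made for this example. This is not the pipeline's actual code, which may round differently at the boundaries.

```python
import warnings


def round_num_frames(num_frames: int, scale_factor: int = 4) -> int:
    """Round num_frames down to the nearest value of the form scale_factor * k + 1."""
    if (num_frames - 1) % scale_factor != 0:
        warnings.warn(
            f"num_frames - 1 must be divisible by {scale_factor}; rounding down."
        )
        # Drop the remainder of (num_frames - 1), then add the leading frame back.
        num_frames = (num_frames - 1) // scale_factor * scale_factor + 1
    # Never return fewer than one frame.
    return max(num_frames, 1)
```

Under these assumptions, 81 is returned unchanged, while 83 and 84 both become 81.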
hlky
75d7e5cc45
Fix LatteTransformer3DModel dtype mismatch with enable_temporal_attentions ( #11139 )
2025-03-29 15:52:56 +01:00
Dhruv Nair
617c208bb4
[Docs] Update Wan Docs with memory optimizations ( #11089 )
...
* update
* update
2025-03-28 19:05:56 +05:30
hlky
5d970a4aa9
WanI2V encode_image ( #11164 )
...
* WanI2V encode_image
2025-03-28 18:05:34 +05:30
kentdan3msu
de6a88c2d7
Set self._hf_peft_config_loaded to True when LoRA is loaded using load_lora_adapter in PeftAdapterMixin class ( #11155 )
...
set self._hf_peft_config_loaded to True on successful lora load
Sets the `_hf_peft_config_loaded` flag if a LoRA is successfully loaded in `load_lora_adapter`. Fixes bug huggingface/diffusers/issues/11148
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-03-26 18:31:18 +01:00
Dhruv Nair
7dc52ea769
[Quantization] dtype fix for GGUF + fix BnB tests ( #11159 )
...
* update
* update
* update
* update
2025-03-26 22:22:16 +05:30
Junsong Chen
739d6ec731
add a timestep scale for sana-sprint teacher model ( #11150 )
2025-03-25 08:47:39 -10:00
Aryan
1ddf3f3a19
Improve information about group offloading and layerwise casting ( #11101 )
...
* update
* Update docs/source/en/optimization/memory.md
* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
* apply review suggestions
* update
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
2025-03-24 23:25:59 +05:30
Jun Yeop Na
7aac77affa
[doc] Fix Korean Controlnet Train doc ( #11141 )
...
* remove typo from korean controlnet train doc
* removed more paragraphs to remain in sync with the english document
2025-03-24 09:38:21 -07:00
Aryan
8907a70a36
New HunyuanVideo-I2V ( #11066 )
...
* update
* update
* update
* add tests
* update docs
* raise value error
* warning for true cfg and guidance scale
* fix test
2025-03-24 21:18:40 +05:30
Junsong Chen
5dbe4f5de6
[fix SANA-Sprint] ( #11142 )
...
* fix bug in sana conversion script;
* add more model paths;
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-03-23 23:38:14 -10:00
Yuxuan Zhang
1d37f42055
Modify the implementation of retrieve_timesteps in CogView4-Control. ( #11125 )
...
* 1
* change to channel 1
* cogview4 control training
* add CacheMixin
* 1
* remove initial_input_channels change for val
* 1
* update
* use 3.5
* new loss
* 1
* use imagetoken
* for megatron convert
* 1
* train con and uc
* 2
* remove guidance_scale
* Update pipeline_cogview4_control.py
* fix
* use cogview4 pipeline with timestep
* update shift_factor
* remove the uncond
* add max length
* change convert and use GLMModel instead of GLMForCausalLM
* fix
* [cogview4] Add attention mask support to transformer model
* [fix] Add attention mask for padded token
* update
* remove padding type
* Update train_control_cogview4.py
* resolve conflicts with #10981
* add control convert
* use control format
* fix
* add missing import
* update with cogview4 format
* make style
* Update pipeline_cogview4_control.py
* Update pipeline_cogview4_control.py
* remove
* Update pipeline_cogview4_control.py
* put back
* Apply style fixes
---------
Co-authored-by: OleehyO <leehy0357@gmail.com>
Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-23 21:17:14 +05:30
Tolga Cangöz
0213179ba8
Update README and example code for AnyText usage ( #11028 )
...
* [Documentation] Update README and example code with additional usage instructions for AnyText
* [Documentation] Update README for AnyTextPipeline and improve logging in code
* Remove wget command for font file from example docstring in anytext.py
2025-03-23 21:15:57 +05:30
hlky
a7d53a5939
Don't override torch_dtype and don't use when quantization_config is set ( #11039 )
...
* Don't use `torch_dtype` when `quantization_config` is set
* up
* djkajka
* Apply suggestions from code review
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-03-21 21:58:38 +05:30
YiYi Xu
8a63aa5e4f
add sana-sprint ( #11074 )
...
* add sana-sprint
---------
Co-authored-by: Junsong Chen <cjs1020440147@icloud.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
Co-authored-by: Aryan <aryan@huggingface.co>
2025-03-21 06:21:18 -10:00
Aryan
844221ae4e
[core] FasterCache ( #10163 )
...
* init
* update
* update
* update
* make style
* update
* fix
* make it work with guidance distilled models
* update
* make fix-copies
* add tests
* update
* apply_faster_cache -> apply_fastercache
* fix
* reorder
* update
* refactor
* update docs
* add fastercache to CacheMixin
* update tests
* Apply suggestions from code review
* make style
* try to fix partial import error
* Apply style fixes
* raise warning
* update
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-03-21 09:35:04 +05:30
CyberVy
9b2c0a7dbe
fix _callback_tensor_inputs of sd controlnet inpaint pipeline missing some elements ( #11073 )
...
* Update pipeline_controlnet_inpaint.py
* Apply style fixes
2025-03-20 23:56:12 -03:00
Parag Ekbote
f424b1b062
Notebooks for Community Scripts-8 ( #11128 )
...
Add 4 Notebooks and update the missing links for the example README.
2025-03-20 12:24:46 -07:00
YiYi Xu
e9fda3924f
remove F.rms_norm for now ( #11126 )
...
up
2025-03-20 07:55:01 -10:00