Sayak Paul
393aefcdc7
[tests] fix audioldm2 for transformers main. ( #11522 )
...
fix audioldm2 for transformers main.
2025-05-08 21:13:42 +05:30
Aryan
6674a5157f
Conditionally import torchvision in Cosmos transformer ( #11524 )
...
fix
2025-05-08 19:37:47 +05:30
scxue
784db0eaab
Add cross attention type for Sana-Sprint training in diffusers. ( #11514 )
...
* test permission
* Add cross attention type for Sana-Sprint.
* Add Sana-Sprint training script in diffusers.
* make style && make quality;
* modify the attention processor with `set_attn_processor` and change `SanaAttnProcessor3_0` to `SanaVanillaAttnProcessor`
* Add import for SanaVanillaAttnProcessor
* Add README file.
* Apply suggestions from code review
* style
* Update examples/research_projects/sana/README.md
---------
Co-authored-by: lawrence-cj <cjs1020440147@icloud.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-05-08 18:55:29 +05:30
Linoy Tsaban
66e50d4e24
[LoRA] make lora alpha and dropout configurable ( #11467 )
...
* add lora_alpha and lora_dropout
* Apply style fixes
* add lora_alpha and lora_dropout
* Apply style fixes
* revert lora_alpha until #11324 is merged
* Apply style fixes
* empty commit
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-05-08 11:54:50 +03:00
sayakpaul
c5c34a4591
Revert "fix audioldm"
...
This reverts commit 87e508f11f .
2025-05-08 11:30:29 +05:30
sayakpaul
87e508f11f
fix audioldm
2025-05-08 11:30:11 +05:30
YiYi Xu
53bd367b03
clean up the __init__ for stable_diffusion ( #11500 )
...
up
2025-05-07 07:01:17 -10:00
Aryan
7b904941bc
Cosmos ( #10660 )
...
* begin transformer conversion
* refactor
* refactor
* refactor
* refactor
* refactor
* refactor
* update
* add conversion script
* add pipeline
* make fix-copies
* remove einops
* update docs
* gradient checkpointing
* add transformer test
* update
* debug
* remove prints
* match sigmas
* add vae pt. 1
* finish CV* vae
* update
* update
* update
* update
* update
* update
* make fix-copies
* update
* make fix-copies
* fix
* update
* update
* make fix-copies
* update
* update tests
* handle device and dtype for safety checker; required in latest diffusers
* remove enable_gqa and use repeat_interleave instead
* enforce safety checker; use dummy checker in fast tests
* add review suggestion for ONNX export
Co-Authored-By: Asfiya Baig <asfiyab@nvidia.com >
* fix safety_checker issues when not passed explicitly
We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker
* use cosmos guardrail package
* auto format docs
* update conversion script to support 14B models
* update name CosmosPipeline -> CosmosTextToWorldPipeline
* update docs
* fix docs
* fix group offload test failing for vae
---------
Co-authored-by: Asfiya Baig <asfiyab@nvidia.com >
2025-05-07 20:59:09 +05:30
Sayak Paul
fb29132b98
[docs] minor updates to bitsandbytes docs. ( #11509 )
...
* minor updates to bitsandbytes docs.
* Apply suggestions from code review
2025-05-06 18:52:18 +05:30
Valeriy Selitskiy
79371661d1
[lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility ( #11441 ) ( #11487 )
...
* [lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441)
* Update src/diffusers/loaders/lora_conversion_utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-05-06 18:44:58 +05:30
Yao Matrix
8c661ea586
enable lora cases on XPU ( #11506 )
...
* enable lora cases on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* remove hunyuanvideo xpu expectation
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
2025-05-06 14:59:50 +05:30
Aryan
d7ffe60166
Hunyuan Video Framepack ( #11428 )
...
* add transformer
* add pipeline
* fixes
* make fix-copies
* update
* add flux mu shift
* update example snippet
* debug
* cleanup
* batch_size=1 optimization
* add pipeline test
* fix for model cpu offloading
* add last_image support; credits: https://github.com/lllyasviel/FramePack/pull/167
* update example with flf2v
* update penguin url
* fix test
* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071032371
* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071087689
* Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-05-06 14:59:38 +05:30
Sayak Paul
10bee525e7
[LoRA] use removeprefix to preserve sanity. ( #11493 )
...
* use removeprefix to preserve sanity.
* f-string.
2025-05-06 12:17:57 +05:30
Sayak Paul
d88ae1f52a
update dep table. ( #11504 )
...
* update dep table.
* fix
2025-05-06 11:14:07 +05:30
Sayak Paul
53f1043cbb
Update setup.py to pin min version of peft ( #11502 )
2025-05-06 10:23:16 +05:30
Aryan
1fa5639438
Fix torchao docs typo for fp8 granular quantization ( #11473 )
...
update
2025-05-06 07:54:28 +05:30
RogerSinghChugh
ed4efbd63d
Update training script for txt to img sdxl with lora supp with new interpolation. ( #11496 )
...
* Update training script for txt to img sdxl with lora supp with new interpolation.
* ran make style and make quality.
2025-05-05 12:33:28 -04:00
Yijun Lee
9c29e938d7
Set LANCZOS as the default interpolation method for image resizing. ( #11492 )
...
* Set LANCZOS as the default interpolation method for image resizing.
* style: run make style and quality checks
2025-05-05 12:18:40 -04:00
Sayak Paul
071807c853
[training] feat: enable quantization for hidream lora training. ( #11494 )
...
* feat: enable quantization for hidream lora training.
* better handle compute dtype.
* finalize.
* fix dtype.
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-05-05 20:44:35 +05:30
Evan Han
ee1516e5c7
[train_dreambooth_lora_lumina2] Add LANCZOS as the default interpolation mode for image resizing ( #11491 )
...
[ADD] interpolation
2025-05-05 10:41:33 -04:00
MinJu-Ha
ec9323996b
[train_dreambooth_lora_sdxl] Add --image_interpolation_mode option for image resizing (default to lanczos) ( #11490 )
...
feat(train_dreambooth_lora_sdxl): support --image_interpolation_mode with default to lanczos
2025-05-05 10:19:30 -04:00
Parag Ekbote
fc5e906689
[train_text_to_image_sdxl]Add LANCZOS as default interpolation mode for image resizing ( #11455 )
...
* Add LANCZOS as default interpolation mode.
* update script
* Update as per code review.
* make style.
2025-05-05 09:52:19 -04:00
Connector Switch
8520d496f0
[Feature] Implement tiled VAE encoding/decoding for Wan model. ( #11414 )
...
* implement tiled encode/decode
* address review comments
2025-05-05 16:07:14 +05:30
Yao Matrix
a674914fd5
enable semantic diffusion and stable diffusion panorama cases on XPU ( #11459 )
...
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
2025-05-05 15:28:07 +05:30
Yash
ec3d58286d
[train_dreambooth_lora_flux_advanced] Add LANCZOS as the default interpolation mode for image resizing ( #11472 )
...
* [train_controlnet_sdxl] Add LANCZOS as the default interpolation mode for image resizing
* [train_dreambooth_lora_flux_advanced] Add LANCZOS as the default interpolation mode for image resizing
2025-05-02 18:14:41 -04:00
Yuanzhou
ed6cf52572
[train_dreambooth_lora_sdxl_advanced] Add LANCZOS as the default interpolation mode for image resizing ( #11471 )
2025-05-02 16:46:01 -04:00
Steven Liu
e23705e557
[docs] Adapters ( #11331 )
...
* refactor adapter docs
* ip-adapter
* ip adapter
* fix toctree
* fix toctree
* lora
* images
* controlnet
* feedback
* controlnet
* t2i
* fix typo
* feedback
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-05-02 08:08:33 +05:30
Steven Liu
b848d479b1
[docs] Memory optims ( #11385 )
...
* reformat
* initial
* fin
* review
* inference
* feedback
* feedback
* feedback
2025-05-01 11:22:00 -07:00
Vladimir Mandic
d0c02398b9
cache packages_distributions ( #11453 )
...
* cache packages_distributions
* remove unused exception reference
* make style
Signed-off-by: Vladimir Mandic <mandic00@live.com >
* change name to _package_map
---------
Signed-off-by: Vladimir Mandic <mandic00@live.com >
Co-authored-by: DN6 <dhruv.nair@gmail.com >
2025-05-01 21:47:52 +05:30
Sayak Paul
5dcdf4ac9a
[tests] xfail recent pipeline tests for specific methods. ( #11469 )
...
xfail recent pipeline tests for specific methods.
2025-05-01 18:33:52 +05:30
co63oc
86294d3c7f
Fix typos in docs and comments ( #11416 )
...
* Fix typos in docs and comments
* Apply style fixes
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-30 20:30:53 -10:00
Sayak Paul
d70f8ee18b
[WAN] fix recompilation issues ( #11475 )
...
* [tests] Add torch.compile() test for WanTransformer3DModel
* fix wan recompilation issues.
* style
---------
Co-authored-by: tongyu0924 <winnie920924@gmail.com >
2025-04-30 20:29:08 -10:00
Yao Matrix
06beecafc5
make autoencoders, controlnet_flux and wan_transformer3d_single_file pass on xpu ( #11461 )
...
* make autoencoders, controlnet_flux and wan_transformer3d_single_file pass on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* Apply style fixes
---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co >
2025-05-01 02:43:31 +05:30
Vaibhav Kumawat
daf0a23958
Add LANCZOS as default interpolation mode. ( #11463 )
...
* Add LANCZOS as default interpolation mode.
* LANCZOS as default interpolation
* LANCZOS as default interpolation mode
* Added LANCZOS as default interpolation mode
2025-04-30 14:22:38 -04:00
tongyu
38ced7ee59
[test_models_transformer_hunyuan_video] help us test torch.compile() for impactful models ( #11431 )
...
* Update test_models_transformer_hunyuan_video.py
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-04-30 19:11:42 +08:00
Yao Matrix
23c98025b3
make safe diffusion test cases pass on XPU and A100 ( #11458 )
...
* make safe diffusion test cases pass on XPU and A100
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* calibrate A100 expected values
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
2025-04-30 16:05:28 +05:30
captainzz
8cd7426e56
Add StableDiffusion3InstructPix2PixPipeline ( #11378 )
...
* upload StableDiffusion3InstructPix2PixPipeline
* Move to community
* Add readme
* Fix images
* remove images
* Change image url
* fix
* Apply style fixes
2025-04-30 06:13:12 -04:00
Daniel Socek
fbce7aeb32
Add generic support for Intel Gaudi accelerator (hpu device) ( #11328 )
...
* Add generic support for Intel Gaudi accelerator (hpu device)
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
Co-authored-by: Libin Tang <libin.tang@intel.com >
* Add loggers for generic HPU support
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
* Refactor hpu support with is_hpu_available() logic
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
* Fix style for hpu support update
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
* Decouple soft HPU check from hard device validation to support HPU migration
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
---------
Signed-off-by: Daniel Socek <daniel.socek@intel.com >
Co-authored-by: Libin Tang <libin.tang@intel.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-04-30 14:45:02 +05:30
Yao Matrix
35fada4169
enable unidiffuser test cases on xpu ( #11444 )
...
* enable unidiffuser cases on XPU
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* fix a typo
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* fix style
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
---------
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
2025-04-30 13:58:00 +05:30
Yao Matrix
fbe2fe5578
enable consistency test cases on XPU, all passed ( #11446 )
...
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
2025-04-30 12:41:29 +05:30
Aryan
c86511586f
torch.compile fullgraph compatibility for Hunyuan Video ( #11457 )
...
update
2025-04-30 11:21:17 +05:30
Yao Matrix
60892c55a4
enable marigold_intrinsics cases on XPU ( #11445 )
...
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
2025-04-30 11:07:37 +05:30
Aryan
8fe5a14d9b
Raise warning instead of error for block offloading with streams ( #11425 )
...
raise warning instead of error
2025-04-30 08:26:16 +05:30
Youlun Peng
58431f102c
Set LANCZOS as the default interpolation for image resizing in ControlNet training ( #11449 )
...
Set LANCZOS as the default interpolation for image resizing
2025-04-29 08:47:02 -04:00
urpetkov-amd
4a9ab650aa
Fixing missing provider options argument ( #11397 )
...
* Fixing missing provider options argument
* Adding if else for provider options
* Apply suggestions from code review
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Apply style fixes
* Update src/diffusers/pipelines/onnx_utils.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/pipelines/onnx_utils.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
---------
Co-authored-by: Uros Petkovic <urpektov@amd.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-28 10:23:05 -10:00
Linoy Tsaban
0ac1d5b482
[Hi-Dream LoRA] fix bug in validation ( #11439 )
...
remove unnecessary pipeline moving to cpu in validation
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-04-28 06:22:32 -10:00
Yao Matrix
7567adfc45
enable 28 GGUF test cases on XPU ( #11404 )
...
* enable gguf test cases on XPU
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* make SD35LargeGGUFSingleFileTests::test_pipeline_inference pass
Signed-off-by: root <root@a4bf01945cfe.jf.intel.com >
* make FluxControlLoRAGGUFTests::test_lora_loading pass
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* polish code
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
* Apply style fixes
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
Signed-off-by: root <root@a4bf01945cfe.jf.intel.com >
Signed-off-by: Yao Matrix <matrix.yao@intel.com >
Co-authored-by: root <root@a4bf01945cfe.jf.intel.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-28 21:32:04 +05:30
tongyu
3da98e7ee3
[train_text_to_image_lora] Better image interpolation in training scripts follow up ( #11427 )
...
* Update train_text_to_image_lora.py
* update_train_text_to_image_lora
2025-04-28 11:23:24 -04:00
tongyu
b3b04fefde
[train_text_to_image] Better image interpolation in training scripts follow up ( #11426 )
...
* Update train_text_to_image.py
* update
2025-04-28 10:50:33 -04:00
Sayak Paul
0e3f2713c2
[tests] fix import. ( #11434 )
...
fix import.
2025-04-28 13:32:28 +08:00