diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Sayak Paul	393aefcdc7	[tests] fix audioldm2 for transformers main. (#11522 ) fix audioldm2 for transformers main.	2025-05-08 21:13:42 +05:30
Aryan	6674a5157f	Conditionally import torchvision in Cosmos transformer (#11524 ) fix	2025-05-08 19:37:47 +05:30
scxue	784db0eaab	Add cross attention type for Sana-Sprint training in diffusers. (#11514 ) * test permission * Add cross attention type for Sana-Sprint. * Add Sana-Sprint training script in diffusers. * make style && make quality; * modify the attention processor with `set_attn_processor` and change `SanaAttnProcessor3_0` to `SanaVanillaAttnProcessor` * Add import for SanaVanillaAttnProcessor * Add README file. * Apply suggestions from code review * style * Update examples/research_projects/sana/README.md --------- Co-authored-by: lawrence-cj <cjs1020440147@icloud.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-08 18:55:29 +05:30
Linoy Tsaban	66e50d4e24	[LoRA] make lora alpha and dropout configurable (#11467 ) * add lora_alpha and lora_dropout * Apply style fixes * add lora_alpha and lora_dropout * Apply style fixes * revert lora_alpha until #11324 is merged * Apply style fixes * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-08 11:54:50 +03:00
sayakpaul	c5c34a4591	Revert "fix audioldm" This reverts commit `87e508f11f`.	2025-05-08 11:30:29 +05:30
sayakpaul	87e508f11f	fix audioldm	2025-05-08 11:30:11 +05:30
YiYi Xu	53bd367b03	clean up the __Init__ for stable_diffusion (#11500 ) up	2025-05-07 07:01:17 -10:00
Aryan	7b904941bc	Cosmos (#10660 ) * begin transformer conversion * refactor * refactor * refactor * refactor * refactor * refactor * update * add conversion script * add pipeline * make fix-copies * remove einops * update docs * gradient checkpointing * add transformer test * update * debug * remove prints * match sigmas * add vae pt. 1 * finish CV* vae * update * update * update * update * update * update * make fix-copies * update * make fix-copies * fix * update * update * make fix-copies * update * update tests * handle device and dtype for safety checker; required in latest diffusers * remove enable_gqa and use repeat_interleave instead * enforce safety checker; use dummy checker in fast tests * add review suggestion for ONNX export Co-Authored-By: Asfiya Baig <asfiyab@nvidia.com> * fix safety_checker issues when not passed explicitly We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker * use cosmos guardrail package * auto format docs * update conversion script to support 14B models * update name CosmosPipeline -> CosmosTextToWorldPipeline * update docs * fix docs * fix group offload test failing for vae --------- Co-authored-by: Asfiya Baig <asfiyab@nvidia.com>	2025-05-07 20:59:09 +05:30
Sayak Paul	fb29132b98	[docs] minor updates to bitsandbytes docs. (#11509 ) * minor updates to bitsandbytes docs. * Apply suggestions from code review	2025-05-06 18:52:18 +05:30
Valeriy Selitskiy	79371661d1	[lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441 ) (#11487 ) * [lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441) * Update src/diffusers/loaders/lora_conversion_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-06 18:44:58 +05:30
Yao Matrix	8c661ea586	enable lora cases on XPU (#11506 ) * enable lora cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * remove hunyuanvideo xpu expectation Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-06 14:59:50 +05:30
Aryan	d7ffe60166	Hunyuan Video Framepack (#11428 ) * add transformer * add pipeline * fixes * make fix-copies * update * add flux mu shift * update example snippet * debug * cleanup * batch_size=1 optimization * add pipeline test * fix for model cpu offloading' * add last_image support; credits: https://github.com/lllyasviel/FramePack/pull/167 * update example with flf2v * update penguin url * fix test * address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071032371 * address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071087689 * Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-05-06 14:59:38 +05:30
Sayak Paul	10bee525e7	[LoRA] use `removeprefix` to preserve sanity. (#11493 ) * use removeprefix to preserve sanity. * f-string.	2025-05-06 12:17:57 +05:30
Sayak Paul	d88ae1f52a	update dep table. (#11504 ) * update dep table. * fix	2025-05-06 11:14:07 +05:30
Sayak Paul	53f1043cbb	Update setup.py to pin min version of `peft` (#11502 )	2025-05-06 10:23:16 +05:30
Aryan	1fa5639438	Fix torchao docs typo for fp8 granular quantization (#11473 ) update	2025-05-06 07:54:28 +05:30
RogerSinghChugh	ed4efbd63d	Update training script for txt to img sdxl with lora supp with new interpolation. (#11496 ) * Update training script for txt to img sdxl with lora supp with new interpolation. * ran make style and make quality.	2025-05-05 12:33:28 -04:00
Yijun Lee	9c29e938d7	Set LANCZOS as the default interpolation method for image resizing. (#11492 ) * Set LANCZOS as the default interpolation method for image resizing. * style: run make style and quality checks	2025-05-05 12:18:40 -04:00
Sayak Paul	071807c853	[training] feat: enable quantization for hidream lora training. (#11494 ) * feat: enable quantization for hidream lora training. * better handle compute dtype. * finalize. * fix dtype. --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-05-05 20:44:35 +05:30
Evan Han	ee1516e5c7	[train_dreambooth_lora_lumina2] Add LANCZOS as the default interpolation mode for image resizing (#11491 ) [ADD] interpolation	2025-05-05 10:41:33 -04:00
MinJu-Ha	ec9323996b	[train_dreambooth_lora_sdxl] Add --image_interpolation_mode option for image resizing (default to lanczos) (#11490 ) feat(train_dreambooth_lora_sdxl): support --image_interpolation_mode with default to lanczos	2025-05-05 10:19:30 -04:00
Parag Ekbote	fc5e906689	[train_text_to_image_sdxl]Add LANCZOS as default interpolation mode for image resizing (#11455 ) * Add LANCZOS as default interplotation mode. * update script * Update as per code review. * make style.	2025-05-05 09:52:19 -04:00
Connector Switch	8520d496f0	[Feature] Implement tiled VAE encoding/decoding for Wan model. (#11414 ) * implement tiled encode/decode * address review comments	2025-05-05 16:07:14 +05:30
Yao Matrix	a674914fd5	enable semantic diffusion and stable diffusion panorama cases on XPU (#11459 ) Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-05 15:28:07 +05:30
Yash	ec3d58286d	[train_dreambooth_lora_flux_advanced] Add LANCZOS as the default interpolation mode for image resizing (#11472 ) * [train_controlnet_sdxl] Add LANCZOS as the default interpolation mode for image resizing * [train_dreambooth_lora_flux_advanced] Add LANCZOS as the default interpolation mode for image resizing	2025-05-02 18:14:41 -04:00
Yuanzhou	ed6cf52572	[train_dreambooth_lora_sdxl_advanced] Add LANCZOS as the default interpolation mode for image resizing (#11471 )	2025-05-02 16:46:01 -04:00
Steven Liu	e23705e557	[docs] Adapters (#11331 ) * refactor adapter docs * ip-adapter * ip adapter * fix toctree * fix toctree * lora * images * controlnet * feedback * controlnet * t2i * fix typo * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-02 08:08:33 +05:30
Steven Liu	b848d479b1	[docs] Memory optims (#11385 ) * reformat * initial * fin * review * inference * feedback * feedback * feedback	2025-05-01 11:22:00 -07:00
Vladimir Mandic	d0c02398b9	cache packages_distributions (#11453 ) * cache packages_distributions * remove unused exception reference * make style Signed-off-by: Vladimir Mandic <mandic00@live.com> * change name to _package_map --------- Signed-off-by: Vladimir Mandic <mandic00@live.com> Co-authored-by: DN6 <dhruv.nair@gmail.com>	2025-05-01 21:47:52 +05:30
Sayak Paul	5dcdf4ac9a	[tests] xfail recent pipeline tests for specific methods. (#11469 ) xfail recent pipeline tests for specific methods.	2025-05-01 18:33:52 +05:30
co63oc	86294d3c7f	Fix typos in docs and comments (#11416 ) * Fix typos in docs and comments * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-30 20:30:53 -10:00
Sayak Paul	d70f8ee18b	[WAN] fix recompilation issues (#11475 ) * [tests] Add torch.compile() test for WanTransformer3DModel * fix wan recompilation issues. * style --------- Co-authored-by: tongyu0924 <winnie920924@gmail.com>	2025-04-30 20:29:08 -10:00
Yao Matrix	06beecafc5	make autoencoders. controlnet_flux and wan_transformer3d_single_file pass on xpu (#11461 ) * make autoencoders. controlnet_flux and wan_transformer3d_single_file pass on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * Apply style fixes --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-05-01 02:43:31 +05:30
Vaibhav Kumawat	daf0a23958	Add LANCZOS as default interplotation mode. (#11463 ) * Add LANCZOS as default interplotation mode. * LANCZOS as default interplotation * LANCZOS as default interplotation mode * Added LANCZOS as default interplotation mode	2025-04-30 14:22:38 -04:00
tongyu	38ced7ee59	[test_models_transformer_hunyuan_video] help us test torch.compile() for impactful models (#11431 ) * Update test_models_transformer_hunyuan_video.py * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-30 19:11:42 +08:00
Yao Matrix	23c98025b3	make safe diffusion test cases pass on XPU and A100 (#11458 ) * make safe diffusion test cases pass on XPU and A100 Signed-off-by: Yao Matrix <matrix.yao@intel.com> * calibrate A100 expected values Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-04-30 16:05:28 +05:30
captainzz	8cd7426e56	Add StableDiffusion3InstructPix2PixPipeline (#11378 ) * upload StableDiffusion3InstructPix2PixPipeline * Move to community * Add readme * Fix images * remove images * Change image url * fix * Apply style fixes	2025-04-30 06:13:12 -04:00
Daniel Socek	fbce7aeb32	Add generic support for Intel Gaudi accelerator (hpu device) (#11328 ) * Add generic support for Intel Gaudi accelerator (hpu device) Signed-off-by: Daniel Socek <daniel.socek@intel.com> Co-authored-by: Libin Tang <libin.tang@intel.com> * Add loggers for generic HPU support Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Refactor hpu support with is_hpu_available() logic Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Fix style for hpu support update Signed-off-by: Daniel Socek <daniel.socek@intel.com> * Decouple soft HPU check from hard device validation to support HPU migration Signed-off-by: Daniel Socek <daniel.socek@intel.com> --------- Signed-off-by: Daniel Socek <daniel.socek@intel.com> Co-authored-by: Libin Tang <libin.tang@intel.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-30 14:45:02 +05:30
Yao Matrix	35fada4169	enable unidiffuser test cases on xpu (#11444 ) * enable unidiffuser cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix a typo Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 13:58:00 +05:30
Yao Matrix	fbe2fe5578	enable consistency test cases on XPU, all passed (#11446 ) Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 12:41:29 +05:30
Aryan	c86511586f	`torch.compile` fullgraph compatibility for Hunyuan Video (#11457 ) udpate	2025-04-30 11:21:17 +05:30
Yao Matrix	60892c55a4	enable marigold_intrinsics cases on XPU (#11445 ) Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-04-30 11:07:37 +05:30
Aryan	8fe5a14d9b	Raise warning instead of error for block offloading with streams (#11425 ) raise warning instead of error	2025-04-30 08:26:16 +05:30
Youlun Peng	58431f102c	Set LANCZOS as the default interpolation for image resizing in ControlNet training (#11449 ) Set LANCZOS as the default interpolation for image resizing	2025-04-29 08:47:02 -04:00
urpetkov-amd	4a9ab650aa	Fixing missing provider options argument (#11397 ) * Fixing missing provider options argument * Adding if else for provider options * Apply suggestions from code review Co-authored-by: YiYi Xu <yixu310@gmail.com> * Apply style fixes * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/onnx_utils.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: Uros Petkovic <urpektov@amd.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-28 10:23:05 -10:00
Linoy Tsaban	0ac1d5b482	[Hi-Dream LoRA] fix bug in validation (#11439 ) remove unnecessary pipeline moving to cpu in validation Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-28 06:22:32 -10:00
Yao Matrix	7567adfc45	enable 28 GGUF test cases on XPU (#11404 ) * enable gguf test cases on XPU Signed-off-by: YAO Matrix <matrix.yao@intel.com> * make SD35LargeGGUFSingleFileTests::test_pipeline_inference pas Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> * make FluxControlLoRAGGUFTests::test_lora_loading pass Signed-off-by: Yao Matrix <matrix.yao@intel.com> * polish code Signed-off-by: Yao Matrix <matrix.yao@intel.com> * Apply style fixes --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: root <root@a4bf01945cfe.jf.intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: root <root@a4bf01945cfe.jf.intel.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-28 21:32:04 +05:30
tongyu	3da98e7ee3	[train_text_to_image_lora] Better image interpolation in training scripts follow up (#11427 ) * Update train_text_to_image_lora.py * update_train_text_to_image_lora	2025-04-28 11:23:24 -04:00
tongyu	b3b04fefde	[train_text_to_image] Better image interpolation in training scripts follow up (#11426 ) * Update train_text_to_image.py * update	2025-04-28 10:50:33 -04:00
Sayak Paul	0e3f2713c2	[tests] fix import. (#11434 ) fix import.	2025-04-28 13:32:28 +08:00

1 2 3 4 5 ...

5457 Commits