diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Quentin Gallouédec	c8bb1ff53e	Use HF Papers (#11567 ) * Use HF Papers * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-19 06:22:33 -10:00
Dhruv Nair	799adf4a10	[Single File] Fix loading for LTX 0.9.7 transformer (#11578 ) update	2025-05-19 06:22:13 -10:00
Sayak Paul	00f9273da2	[WIP][LoRA] start supporting kijai wan lora. (#11579 ) * start supporting kijai wan lora. * diff_b keys. * Apply suggestions from code review Co-authored-by: Aryan <aryan@huggingface.co> * merge ready --------- Co-authored-by: Aryan <aryan@huggingface.co>	2025-05-19 20:47:44 +05:30
Linoy Tsaban	ceb7af277c	[LoRA] support non-diffusers LTX-Video loras (#11572 ) * support non diffusers loras for ltxv * Update src/diffusers/loaders/lora_conversion_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/loaders/lora_pipeline.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Apply style fixes * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-19 12:59:55 +03:00
Sayak Paul	6918f6d19a	[docs] tip for group offloding + quantization (#11576 ) * tip for group offloding + quantization Co-authored-by: Aryan VS <contact.aryanvs@gmail.com> * Apply suggestions from code review Co-authored-by: Aryan <aryan@huggingface.co> --------- Co-authored-by: Aryan VS <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-05-19 14:49:15 +05:30
apolinário	915c537891	Revert error to warning when loading LoRA from repo with multiple weights (#11568 )	2025-05-19 13:33:43 +05:30
space_samurai	8270fa58e4	Doc update (#11531 ) Update docs/source/en/using-diffusers/inpaint.md	2025-05-19 13:32:08 +05:30
Yao Matrix	1a10fa0c82	enhance value guard of _device_agnostic_dispatch (#11553 ) enhance value guard Signed-off-by: Matrix Yao <matrix.yao@intel.com>	2025-05-19 06:05:32 +05:30
Sayak Paul	9836f0e000	[docs] Regional compilation docs (#11556 ) * add regional compilation docs. * minor. * reviwer feedback. * Update docs/source/en/optimization/torch2.0.md Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com> --------- Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com>	2025-05-15 19:11:24 +05:30
Sayak Paul	20379d9d13	[tests] add tests for combining layerwise upcasting and groupoffloading. (#11558 ) * add tests for combining layerwise upcasting and groupoffloading. * feedback	2025-05-15 17:16:44 +05:30
Animesh Jain	3a6caba8e4	[gguf] Refactor __torch_function__ to avoid unnecessary computation (#11551 ) * [gguf] Refactor __torch_function__ to avoid unnecessary computation This helps with torch.compile compilation lantency. Avoiding unnecessary computation should also lead to a slightly improved eager latency. * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-15 14:38:18 +05:30
Dhruv Nair	4267d8f4eb	[Single File] GGUF/Single File Support for HiDream (#11550 ) * update * update * update * update * update * update * update	2025-05-15 12:25:18 +05:30
Seokhyeon Jeong	f4fa3beee7	[tests] Add torch.compile test for UNet2DConditionModel (#11537 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-14 11:26:12 +05:30
Anwesha Chowdhury	7e3353196c	Fix deprecation warnings in test_ltx_image2video.py (#11538 ) Fixed 2 warnings that were raised during running LTXImageToVideoPipelineFastTests Co-authored-by: achowdhury1211@gmail.com <anwesha@LAPTOP-E5QGFMOQ>	2025-05-13 08:05:52 -10:00
Meatfucker	8c249d1401	Update pipeline_flux_img2img.py to add missing vae_slicing and vae_tiling calls. (#11545 ) * Update pipeline_flux_img2img.py Adds missing vae_slicing and vae_tiling calls to FluxImage2ImagePipeline * Update src/diffusers/pipelines/flux/pipeline_flux_img2img.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Update src/diffusers/pipelines/flux/pipeline_flux_img2img.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Update src/diffusers/pipelines/flux/pipeline_flux_img2img.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Update src/diffusers/pipelines/flux/pipeline_flux_img2img.py Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2025-05-13 12:31:45 -04:00
Sayak Paul	b555a03723	[tests] Enable testing for HiDream transformer (#11478 ) * add tests for hidream transformer model. * fix * Update tests/models/transformers/test_models_transformer_hidream.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-05-13 21:01:20 +05:30
Aryan	06fee551e9	LTX Video 0.9.7 (#11516 ) * add upsampling pipeline * ltx upsample pipeline conversion; pipeline fixes * make fix-copies * remove print * add vae convenience methods * update * add tests * support denoising strength for upscaling & video-to-video * update docs * update doc checkpoints * update docs * fix --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-05-13 14:57:03 +05:30
Linoy Tsaban	8b99f7e157	[LoRA] small change to support Hunyuan LoRA Loading for FramePack (#11546 ) init	2025-05-13 10:15:06 +03:00
Kenneth Gerald Hamilton	07dd6f8c0e	[train_dreambooth.py] Fix the LR Schedulers when num_train_epochs is passed in a distributed training env (#11239 ) Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-13 07:34:01 +05:30
johannaSommer	f8d4a1e283	fix: remove `torch_dtype="auto"` option from docstrings (#11513 ) Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-05-12 15:42:35 -10:00
Abdellah Oumida	ddd0cfb497	Fix typo in train_diffusion_orpo_sdxl_lora_wds.py (#11541 )	2025-05-12 15:28:29 -10:00
Zhong-Yu Li	4f438de35a	Add VisualCloze (#11377 ) * VisualCloze * style quality * add docs * add docs * typo * Update docs/source/en/api/pipelines/visualcloze.md * delete einops * style quality * Update src/diffusers/pipelines/visualcloze/pipeline_visualcloze.py * reorg * refine doc * style quality * typo * typo * Update src/diffusers/image_processor.py * add comment * test * style * Modified based on review * style * restore image_processor * update example url * style * fix-copies * VisualClozeGenerationPipeline * combine * tests docs * remove VisualClozeUpsamplingPipeline * style * quality * test examples * quality style * typo * make fix-copies * fix test_callback_cfg and test_save_load_dduf in VisualClozePipelineFastTests * add EXAMPLE_DOC_STRING to VisualClozeGenerationPipeline * delete maybe_free_model_hooks from pipeline_visualcloze_combined * Apply suggestions from code review * fix test_save_load_local test; add reason for skipping cfg test * more save_load test fixes * fix tests in generation pipeline tests	2025-05-13 02:46:51 +05:30
Evan Han	98cc6d05e4	[test_models_transformer_ltx.py] help us test torch.compile() for impactful models (#11512 ) * Update test_models_transformer_ltx.py * Update test_models_transformer_ltx.py * Update test_models_transformer_ltx.py * Update test_models_transformer_ltx.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-12 19:26:15 +05:30
Yao Matrix	c3726153fd	enable several pipeline integration tests on XPU (#11526 ) * enable kandinsky2_2 integration test cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> * enable latent_diffusion, dance_diffusion, musicldm, shap_e integration uts on xpu Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-05-12 16:51:37 +05:30
Aryan	e48f6aeeb4	Hunyuan Video Framepack F1 (#11534 ) * support framepack f1 * update docs * update toctree * remove typo	2025-05-12 16:11:10 +05:30
Sayak Paul	01abfc8736	[tests] add tests for framepack transformer model. (#11520 ) * start. * add tests for framepack transformer model. * merge conflicts. * make to square. * fixes	2025-05-11 09:50:06 +05:30
Aryan	92fe689f06	Change Framepack transformer layer initialization order (#11535 ) update	2025-05-09 23:09:50 +05:30
Yao Matrix	0ba1f76d4d	enable print_env on xpu (#11507 ) * detect xpu in print_env Signed-off-by: YAO Matrix <matrix.yao@intel.com> * enhance code, test passed on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: YAO Matrix <matrix.yao@intel.com> Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-09 16:30:51 +05:30
Yao Matrix	d6bf268a4a	enable dit integration cases on xpu (#11523 ) * enable dit integration test on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * fix style Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-09 16:06:50 +05:30
James Xu	3c0a0129fe	[LTXPipeline] Update latents dtype to match VAE dtype (#11533 ) fix: update latents dtype to match vae	2025-05-09 16:05:21 +05:30
Yao Matrix	2d380895e5	enable 7 cases on XPU (#11503 ) * enable 7 cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * calibrate A100 expectations Signed-off-by: YAO Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com> Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-05-09 15:52:08 +05:30
Sayak Paul	0c47c954f3	[LoRA] support non-diffusers hidream loras (#11532 ) * support non-diffusers hidream loras * make fix-copies	2025-05-09 14:42:39 +05:30
Sayak Paul	7acf8345f6	[Tests] Enable more general testing for `torch.compile()` with LoRA hotswapping (#11322 ) * refactor hotswap tester. * fix seeds.. * add to nightly ci. * move comment. * move to nightly	2025-05-09 11:29:06 +05:30
Sayak Paul	599c887164	feat: pipeline-level quantization config (#11130 ) * feat: pipeline-level quant config. Co-authored-by: SunMarc <marc.sun@hotmail.fr> condition better. support mapping. improvements. [Quantization] Add Quanto backend (#10756) * update * updaet * update * update * update * update * update * update * update * update * update * update * Update docs/source/en/quantization/quanto.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * Update src/diffusers/quantizers/quanto/utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> [Single File] Add single file loading for SANA Transformer (#10947) * added support for from_single_file * added diffusers mapping script * added testcase * bug fix * updated tests * corrected code quality * corrected code quality --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> [LoRA] Improve warning messages when LoRA loading becomes a no-op (#10187) * updates * updates * updates * updates * notebooks revert * fix-copies. * seeing * fix * revert * fixes * fixes * fixes * remove print * fix * conflicts ii. * updates * fixes * better filtering of prefix. 
--------- Co-authored-by: hlky <hlky@hlky.ac> [LoRA] CogView4 (#10981) * update * make fix-copies * update [Tests] improve quantization tests by additionally measuring the inference memory savings (#11021) * memory usage tests * fixes * gguf [`Research Project`] Add AnyText: Multilingual Visual Text Generation And Editing (#8998) * Add initial template * Second template * feat: Add TextEmbeddingModule to AnyTextPipeline * feat: Add AuxiliaryLatentModule template to AnyTextPipeline * Add bert tokenizer from the anytext repo for now * feat: Update AnyTextPipeline's modify_prompt method This commit adds improvements to the modify_prompt method in the AnyTextPipeline class. The method now handles special characters and replaces selected string prompts with a placeholder. Additionally, it includes a check for Chinese text and translation using the trans_pipe. * Fill in the `forward` pass of `AuxiliaryLatentModule` * `make style && make quality` * `chore: Update bert_tokenizer.py with a TODO comment suggesting the use of the transformers library` * Update error handling to raise and logging * Add `create_glyph_lines` function into `TextEmbeddingModule` * make style * Up * Up * Up * Up * Remove several comments * refactor: Remove ControlNetConditioningEmbedding and update code accordingly * Up * Up * up * refactor: Update AnyTextPipeline to include new optional parameters * up * feat: Add OCR model and its components * chore: Update `TextEmbeddingModule` to include OCR model components and dependencies * chore: Update `AuxiliaryLatentModule` to include VAE model and its dependencies for masked image in the editing task * `make style` * refactor: Update `AnyTextPipeline`'s docstring * Update `AuxiliaryLatentModule` to include info dictionary so that text processing is done once * simplify * `make style` * Converting `TextEmbeddingModule` to ordinary `encode_prompt()` function * Simplify for now * `make style` * Up * feat: Add scripts to convert AnyText controlnet to 
diffusers * `make style` * Fix: Move glyph rendering to `TextEmbeddingModule` from `AuxiliaryLatentModule` * make style * Up * Simplify * Up * feat: Add safetensors module for loading model file * Fix device issues * Up * Up * refactor: Simplify * refactor: Simplify code for loading models and handling data types * `make style` * refactor: Update to() method in FrozenCLIPEmbedderT3 and TextEmbeddingModule * refactor: Update dtype in embedding_manager.py to match proj.weight * Up * Add attribution and adaptation information to pipeline_anytext.py * Update usage example * Will refactor `controlnet_cond_embedding` initialization * Add `AnyTextControlNetConditioningEmbedding` template * Refactor organization * style * style * Move custom blocks from `AuxiliaryLatentModule` to `AnyTextControlNetConditioningEmbedding` * Follow one-file policy * style * [Docs] Update README and pipeline_anytext.py to use AnyTextControlNetModel * [Docs] Update import statement for AnyTextControlNetModel in pipeline_anytext.py * [Fix] Update import path for ControlNetModel, ControlNetOutput in anytext_controlnet.py * Refactor AnyTextControlNet to use configurable conditioning embedding channels * Complete control net conditioning embedding in AnyTextControlNetModel * up * [FIX] Ensure embeddings use correct device in AnyTextControlNetModel * up * up * style * [UPDATE] Revise README and example code for AnyTextPipeline integration with DiffusionPipeline * [UPDATE] Update example code in anytext.py to use correct font file and improve clarity * down * [UPDATE] Refactor BasicTokenizer usage to a new Checker class for text processing * update pillow * [UPDATE] Remove commented-out code and unnecessary docstring in anytext.py and anytext_controlnet.py for improved clarity * [REMOVE] Delete frozen_clip_embedder_t3.py as it is in the anytext.py file * [UPDATE] Replace edict with dict for configuration in anytext.py and RecModel.py for consistency * 🆙 * style * [UPDATE] Revise README.md for 
clarity, remove unused imports in anytext.py, and add author credits in anytext_controlnet.py * style * Update examples/research_projects/anytext/README.md Co-authored-by: Aryan <contact.aryanvs@gmail.com> * Remove commented-out image preparation code in AnyTextPipeline * Remove unnecessary blank line in README.md [Quantization] Allow loading TorchAO serialized Tensor objects with torch>=2.6 (#11018) * update * update * update * update * update * update * update * update * update fix: mixture tiling sdxl pipeline - adjust gerating time_ids & embeddings (#11012) small fix on generating time_ids & embeddings [LoRA] support wan i2v loras from the world. (#11025) * support wan i2v loras from the world. * remove copied from. * upates * add lora. Fix SD3 IPAdapter feature extractor (#11027) chore: fix help messages in advanced diffusion examples (#10923) Fix missing *kwargs in lora_pipeline.py (#11011) Update lora_pipeline.py * Apply style fixes * fix-copies --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Fix for multi-GPU WAN inference (#10997) Ensure that hidden_state and shift/scale are on the same device when running with multiple GPUs Co-authored-by: Jimmy <39@🇺🇸.com> [Refactor] Clean up import utils boilerplate (#11026) * update * update * update Use `output_size` in `repeat_interleave` (#11030) [hybrid inference 🍯🐝] Add VAE encode (#11017) * [hybrid inference 🍯🐝] Add VAE encode * _toctree: add vae encode * Add endpoints, tests * vae_encode docs * vae encode benchmarks * api reference * changelog * Update docs/source/en/hybrid_inference/overview.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Wan Pipeline scaling fix, type hint warning, multi generator fix (#11007) * Wan Pipeline scaling fix, type hint warning, multi generator fix * Apply suggestions from code review [LoRA] change to warning from info when 
notifying the users about a LoRA no-op (#11044) * move to warning. * test related changes. Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline (#10827) * Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> making ```formatted_images``` initialization compact (#10801) compact writing Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Fix aclnnRepeatInterleaveIntWithDim error on NPU for get_1d_rotary_pos_embed (#10820) * get_1d_rotary_pos_embed support npu * Update src/diffusers/models/embeddings.py --------- Co-authored-by: Kai zheng <kaizheng@KaideMacBook-Pro.local> Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com> [Tests] restrict memory tests for quanto for certain schemes. (#11052) * restrict memory tests for quanto for certain schemes. * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fixes * style --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> [LoRA] feat: support non-diffusers wan t2v loras. (#11059) feat: support non-diffusers wan t2v loras. [examples/controlnet/train_controlnet_sd3.py] Fixes #11050 - Cast prompt_embeds and pooled_prompt_embeds to weight_dtype to prevent dtype mismatch (#11051) Fix: dtype mismatch of prompt embeddings in sd3 controlnet training Co-authored-by: Andreas Jörg <andreasjoerg@MacBook-Pro-von-Andreas-2.fritz.box> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> reverts accidental change that removes attn_mask in attn. Improves fl… (#11065) reverts accidental change that removes attn_mask in attn. Improves flux ptxla by using flash block sizes. Moves encoding outside the for loop. Co-authored-by: Juan Acevedo <jfacevedo@google.com> Fix deterministic issue when getting pipeline dtype and device (#10696) Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> [Tests] add requires peft decorator. (#11037) * add requires peft decorator. 
* install peft conditionally. * conditional deps. Co-authored-by: DN6 <dhruv.nair@gmail.com> --------- Co-authored-by: DN6 <dhruv.nair@gmail.com> CogView4 Control Block (#10809) * cogview4 control training --------- Co-authored-by: OleehyO <leehy0357@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com> [CI] pin transformers version for benchmarking. (#11067) pin transformers version for benchmarking. updates Fix Wan I2V Quality (#11087) * fix_wan_i2v_quality * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/pipelines/wan/pipeline_wan_i2v.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update pipeline_wan_i2v.py --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: hlky <hlky@hlky.ac> LTX 0.9.5 (#10968) * update --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: hlky <hlky@hlky.ac> make PR GPU tests conditioned on styling. (#11099) Group offloading improvements (#11094) update Fix pipeline_flux_controlnet.py (#11095) * Fix pipeline_flux_controlnet.py * Fix style update readme instructions. (#11096) Co-authored-by: Juan Acevedo <jfacevedo@google.com> Resolve stride mismatch in UNet's ResNet to support Torch DDP (#11098) Modify UNet's ResNet implementation to resolve stride mismatch in Torch's DDP Fix Group offloading behaviour when using streams (#11097) * update * update Quality options in `export_to_video` (#11090) * Quality options in `export_to_video` * make style improve more. add placeholders for docstrings. formatting. smol fix. solidify validation and annotation * Revert "feat: pipeline-level quant config." This reverts commit `316ff46b76`. * feat: implement pipeline-level quantization config Co-authored-by: SunMarc <marc@huggingface.co> * update * fixes * fix validation. * add tests and other improvements. 
* add tests * import quality * remove prints. * add docs. * fixes to docs. * doc fixes. * doc fixes. * add validation to the input quantization_config. * clarify recommendations. * docs * add to ci. * todo. --------- Co-authored-by: SunMarc <marc@huggingface.co>	2025-05-09 10:04:44 +05:30
Sayak Paul	393aefcdc7	[tests] fix audioldm2 for transformers main. (#11522 ) fix audioldm2 for transformers main.	2025-05-08 21:13:42 +05:30
Aryan	6674a5157f	Conditionally import torchvision in Cosmos transformer (#11524 ) fix	2025-05-08 19:37:47 +05:30
scxue	784db0eaab	Add cross attention type for Sana-Sprint training in diffusers. (#11514 ) * test permission * Add cross attention type for Sana-Sprint. * Add Sana-Sprint training script in diffusers. * make style && make quality; * modify the attention processor with `set_attn_processor` and change `SanaAttnProcessor3_0` to `SanaVanillaAttnProcessor` * Add import for SanaVanillaAttnProcessor * Add README file. * Apply suggestions from code review * style * Update examples/research_projects/sana/README.md --------- Co-authored-by: lawrence-cj <cjs1020440147@icloud.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-08 18:55:29 +05:30
Linoy Tsaban	66e50d4e24	[LoRA] make lora alpha and dropout configurable (#11467 ) * add lora_alpha and lora_dropout * Apply style fixes * add lora_alpha and lora_dropout * Apply style fixes * revert lora_alpha until #11324 is merged * Apply style fixes * empty commit --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-08 11:54:50 +03:00
sayakpaul	c5c34a4591	Revert "fix audioldm" This reverts commit `87e508f11f`.	2025-05-08 11:30:29 +05:30
sayakpaul	87e508f11f	fix audioldm	2025-05-08 11:30:11 +05:30
YiYi Xu	53bd367b03	clean up the __Init__ for stable_diffusion (#11500 ) up	2025-05-07 07:01:17 -10:00
Aryan	7b904941bc	Cosmos (#10660 ) * begin transformer conversion * refactor * refactor * refactor * refactor * refactor * refactor * update * add conversion script * add pipeline * make fix-copies * remove einops * update docs * gradient checkpointing * add transformer test * update * debug * remove prints * match sigmas * add vae pt. 1 * finish CV* vae * update * update * update * update * update * update * make fix-copies * update * make fix-copies * fix * update * update * make fix-copies * update * update tests * handle device and dtype for safety checker; required in latest diffusers * remove enable_gqa and use repeat_interleave instead * enforce safety checker; use dummy checker in fast tests * add review suggestion for ONNX export Co-Authored-By: Asfiya Baig <asfiyab@nvidia.com> * fix safety_checker issues when not passed explicitly We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker * use cosmos guardrail package * auto format docs * update conversion script to support 14B models * update name CosmosPipeline -> CosmosTextToWorldPipeline * update docs * fix docs * fix group offload test failing for vae --------- Co-authored-by: Asfiya Baig <asfiyab@nvidia.com>	2025-05-07 20:59:09 +05:30
Sayak Paul	fb29132b98	[docs] minor updates to bitsandbytes docs. (#11509 ) * minor updates to bitsandbytes docs. * Apply suggestions from code review	2025-05-06 18:52:18 +05:30
Valeriy Selitskiy	79371661d1	[lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441 ) (#11487 ) * [lora_conversion] Enhance key handling for OneTrainer components in LORA conversion utility (#11441) * Update src/diffusers/loaders/lora_conversion_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-05-06 18:44:58 +05:30
Yao Matrix	8c661ea586	enable lora cases on XPU (#11506 ) * enable lora cases on XPU Signed-off-by: Yao Matrix <matrix.yao@intel.com> * remove hunyuanvideo xpu expectation Signed-off-by: Yao Matrix <matrix.yao@intel.com> --------- Signed-off-by: Yao Matrix <matrix.yao@intel.com>	2025-05-06 14:59:50 +05:30
Aryan	d7ffe60166	Hunyuan Video Framepack (#11428 ) * add transformer * add pipeline * fixes * make fix-copies * update * add flux mu shift * update example snippet * debug * cleanup * batch_size=1 optimization * add pipeline test * fix for model cpu offloading' * add last_image support; credits: https://github.com/lllyasviel/FramePack/pull/167 * update example with flf2v * update penguin url * fix test * address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071032371 * address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071087689 * Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-05-06 14:59:38 +05:30
Sayak Paul	10bee525e7	[LoRA] use `removeprefix` to preserve sanity. (#11493 ) * use removeprefix to preserve sanity. * f-string.	2025-05-06 12:17:57 +05:30
Sayak Paul	d88ae1f52a	update dep table. (#11504 ) * update dep table. * fix	2025-05-06 11:14:07 +05:30
Sayak Paul	53f1043cbb	Update setup.py to pin min version of `peft` (#11502 )	2025-05-06 10:23:16 +05:30
Aryan	1fa5639438	Fix torchao docs typo for fp8 granular quantization (#11473 ) update	2025-05-06 07:54:28 +05:30

5491 Commits