diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
YiYi Xu	f112eab97e	[modular] fix a bug in mellon param & improve docstrings (#12980 ) * update mellonparams docstring to incude the acutal param definition render in mellon * style --------- Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>	2026-01-15 10:42:42 -10:00
YiYi Xu	61f175660a	Flux2 klein (#12982 ) * flux2-klein * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Klein tests (#2) * tests * up * tests * up * support step-distilled * Apply suggestions from code review Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Apply suggestions from code review Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * doc string etc * style * more * copies * klein lora training scripts (#3) * initial commit * initial commit * remove remote text encoder * initial commit * initial commit * initial commit * revert * img2img fix * text encoder + tokenizer * text encoder + tokenizer * update readme * guidance * guidance * guidance * test * test * revert changes not needed for the non klein model * Update examples/dreambooth/train_dreambooth_lora_flux2_klein.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix guidance * fix validation * fix validation * fix validation * fix path * space --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * style * Update src/diffusers/pipelines/flux2/pipeline_flux2_klein.py * Apply style fixes * auto pipeline --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2026-01-15 09:10:54 -10:00
DefTruth	7f43cb1d79	fix Qwen-Image series context parallel (#12970 ) * fix qwen-image cp * relax attn_mask limit for cp * CP plan compatible with zero_cond_t * move modulate_index plan to top level	2026-01-15 15:40:24 +05:30
Hameer Abbasi	5efb81fa71	Add `ChromaInpaintPipeline` (#12848 ) * Add `ChromaInpaintPipeline` * Set `attention_mask` to `dtype=torch.bool` for `ChromaInpaintPipeline`. * Revert `.gitignore`.	2026-01-15 12:58:50 +05:30
Yahweasel	b351be2379	LongCat Image pipeline: Allow offloading/quantization of text_encoder component (#12963 ) * Don't attempt to move the text_encoder. Just move the generated_ids. * The inputs to the text_encoder should be on its device	2026-01-14 21:10:57 -10:00
YiYi Xu	d8f4dd295f	[Modular] mellon utils (#12978 ) * up * style --------- Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>	2026-01-14 19:03:41 -10:00
hlky	1ecfbfe12b	`disable_mmap` in pipeline `from_pretrained` (#12854 ) * update * `disable_mmap` in `from_pretrained` --------- Co-authored-by: DN6 <dhruv.nair@gmail.com>	2026-01-14 21:29:36 +05:30
Marc Sun	d7fa445453	Remove 8bit device restriction (#12972 ) * allow to * update version * fix version again * again * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> * style * xfail * add pr --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-14 20:33:30 +05:30
Sayak Paul	7feb4fc791	[chore] make transformers version check stricter for glm image. (#12974 ) * make transformers version check stricter for glm image. * public checkpoint.	2026-01-14 10:29:48 +05:30
Sayak Paul	3c70440d26	Update distributed_inference.md to reposition sections (#12971 )	2026-01-13 11:07:39 -08:00
Sayak Paul	7299121413	Z rz rz rz rz rz rz r cogview (#12973 ) * init * add * add 1 * Update __init__.py * rename * 2 * update * init with encoder * merge2pipeline * Update pipeline_glm_image.py * remove sop * remove useless func * Update pipeline_glm_image.py * up (cherry picked from commit `cfe19a31b9`) * review for work only * change place * Update pipeline_glm_image.py * update * Update transformer_glm_image.py * 1 * no negative_prompt for GLM-Image * remove CogView4LoraLoaderMixin * refactor attention processor. * update * fix * use staticmethod * update * up * up * update * Update glm_image.md * 1 * Update pipeline_glm_image.py * Update transformer_glm_image.py * using new transformers impl * support * resolution change * fix-copies * Update src/diffusers/pipelines/glm_image/pipeline_glm_image.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update pipeline_glm_image.py * use cogview4 * Update pipeline_glm_image.py * Update pipeline_glm_image.py * revert * update * batch support * update * version guard glm image pipeline * validate prompt_embeds and prior_token_ids * try docs. * 4 * up * up * skip properly * fix tests * up * up --------- Co-authored-by: zRzRzRzRzRzRzR <2448370773@qq.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2026-01-13 06:39:22 -10:00
Álvaro Somoza	3114f6a796	[Modular] Changes for using WAN I2V (#12959 ) * initial * add kayers	2026-01-13 05:25:54 -10:00
Bissmella Bahaduri	9d68742214	Add Unified Sequence Parallel attention (#12693 ) * initial scheme of unified-sp * initial all_to_all_double * bug fixes, added cmnts * unified attention prototype done * remove raising value error in contextParallelConfig to enable unified attention * bug fix * feat: Adds Test for Unified SP Attention and Fixes a bug in Template Ring Attention * bug fix, lse calculation, testing bug fixes, lse calculation - switched to _all_to_all_single helper in _all_to_all_dim_exchange due contiguity issues bug fix bug fix bug fix * addressing comments * sequence parallelsim bug fixes * code format fixes * Apply style fixes * code formatting fix * added unified attention docs and removed test file * Apply style fixes * tip for unified attention in docs at distributed_inference.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update distributed_inference.md, adding benchmarks Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update docs/source/en/training/distributed_inference.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * function name fix * fixed benchmark in docs --------- Co-authored-by: KarthikSundar2002 <karthiksundar30092002@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-13 09:16:51 +05:30
dg845	f1a93c765f	Add Flag to `PeftLoraLoaderMixinTests` to Enable/Disable Text Encoder LoRA Tests (#12962 ) * Improve incorrect LoRA format error message * Add flag in PeftLoraLoaderMixinTests to disable text encoder LoRA tests * Apply changes to LTX2LoraTests * Further improve incorrect LoRA format error msg following review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-12 16:01:58 -08:00
Leo Jiang	29a930a142	Bugfix for flux2 img2img2 prediction (#12855 ) * Bugfix for dreambooth flux2 img2img2 * Bugfix for dreambooth flux2 img2img2 * Bugfix for dreambooth flux2 img2img2 * Bugfix for dreambooth flux2 img2img2 * Bugfix for dreambooth flux2 img2img2 * Bugfix for dreambooth flux2 img2img2 Co-authored-by: tcaimm <93749364+tcaimm@users.noreply.github.com> --------- Co-authored-by: tcaimm <93749364+tcaimm@users.noreply.github.com>	2026-01-12 20:07:02 +05:30
Kashif Rasul	dad5cb55e6	Fix QwenImage txt_seq_lens handling (#12702 ) * Fix QwenImage txt_seq_lens handling * formatting * formatting * remove txt_seq_lens and use bool mask * use compute_text_seq_len_from_mask * add seq_lens to dispatch_attention_fn * use joint_seq_lens * remove unused index_block * WIP: Remove seq_lens parameter and use mask-based approach - Remove seq_lens parameter from dispatch_attention_fn - Update varlen backends to extract seqlens from masks - Update QwenImage to pass 2D joint_attention_mask - Fix native backend to handle 2D boolean masks - Fix sage_varlen seqlens_q to match seqlens_k for self-attention Note: sage_varlen still producing black images, needs further investigation * fix formatting * undo sage changes * xformers support * hub fix * fix torch compile issues * fix tests * use _prepare_attn_mask_native * proper deprecation notice * add deprecate to txt_seq_lens * Update src/diffusers/models/transformers/transformer_qwenimage.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/transformers/transformer_qwenimage.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Only create the mask if there's actual padding * fix order of docstrings * Adds performance benchmarks and optimization details for QwenImage Enhances documentation with comprehensive performance insights for QwenImage pipeline: * rope_text_seq_len = text_seq_len * rename to max_txt_seq_len * removed deprecated args * undo unrelated change * Updates QwenImage performance documentation Removes detailed attention backend benchmarks and simplifies torch.compile performance description Focuses on key performance improvement with torch.compile, highlighting the specific speedup from 4.70s to 1.93s on an A100 GPU Streamlines the documentation to provide more concise and actionable performance insights * Updates deprecation warnings for txt_seq_lens parameter Extends deprecation timeline for txt_seq_lens from version 0.37.0 to 0.39.0 across multiple Qwen image-related models Adds a new unit test to verify the deprecation warning behavior for the txt_seq_lens parameter * fix compile * formatting * fix compile tests * rename helper * remove duplicate * smaller values * removed * use torch.cond for torch compile * Construct joint attention mask once * test different backends * construct joint attention mask once to avoid reconstructing in every block * Update src/diffusers/models/attention_dispatch.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * formatting * raising an error from the EditPlus pipeline when batch_size > 1 --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: cdutr <dutra_carlos@hotmail.com>	2026-01-12 13:45:09 +05:30
Francisco Kurucz	b86bd99eac	Fix link to diffedit implementation reference (#12708 )	2026-01-10 11:13:23 -08:00
omahs	5b202111bf	Fix typos (#12705 )	2026-01-10 11:11:15 -08:00
Sayak Paul	4ac2b4a521	[docs] polish caching docs. (#12684 ) * polish caching docs. * Update docs/source/en/optimization/cache.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/optimization/cache.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * up --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2026-01-10 10:09:05 -08:00
YiYi Xu	418313bbf6	[Modular] better docstring (#12932 ) add output to auto blocks + core denoising block for better doc string	2026-01-09 23:53:56 -10:00
Rafael Tvelov	2120c3096f	Fix: typo in autoencoder_dc.py (#12687 ) Fix typo in autoencoder_dc.py Fixing typo in `get_block` function's parameter name: "qkv_mutliscales" -> "qkv_multiscales" Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 22:01:54 -10:00
Sayak Paul	ed6e5ecf67	[LoRA] add LoRA support to LTX-2 (#12933 ) * up * fixes * tests * docs. * fix * change loading info. * up * up	2026-01-10 11:27:22 +05:30
Sayak Paul	d44b5f86e6	fix how `is_fsdp` is determined (#12960 ) up	2026-01-10 10:34:25 +05:30
Jay Wu	02c7adc356	[ChronoEdit] support multiple loras (#12679 ) Co-authored-by: wjay <wjay@nvidia.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2026-01-09 15:50:16 -10:00
Sayak Paul	a3cc0e7a52	[modular] error early in `enable_auto_cpu_offload` (#12578 ) error early in auto_cpu_offload	2026-01-09 15:30:52 -10:00
Daniel Socek	2a6cdc0b3e	Fix ftfy name error in Wan pipeline (#12314 ) Signed-off-by: Daniel Socek <daniel.socek@intel.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 14:02:40 -10:00
SahilCarterr	1791306739	[Fix] syntax in QwenImageEditPlusPipeline (#12371 ) * Fixes syntax for consistency among pipelines * Update test_qwenimage_edit_plus.py	2026-01-09 13:55:42 -10:00
Samu Tamminen	df6516a716	Align HunyuanVideoConditionEmbedding with CombinedTimestepGuidanceTextProjEmbeddings (#12316 ) conditioning additions inline with CombinedTimestepGuidanceTextProjEmbeddings Co-authored-by: Samu Tamminen <samutamm@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 13:51:04 -10:00
Steven Liu	5794ffffbe	[docs] Remote inference (#12372 ) * init * fix	2026-01-09 13:32:14 -10:00
Titong Jiang	4fb44bdf91	Fix wrong param types, docs, and handles noise=None in scale_noise of FlowMatching schedulers (#11669 ) * Bug: Fix wrong params, docs, and handles noise=None * make noise a required arg --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 11:42:33 -10:00
Linoy Tsaban	b7a81582ae	[LoRA] add lora_alpha to sana README (#11780 ) add lora alpha to readme	2026-01-09 11:28:39 -10:00
Bhavya Bahl	4b64b5603f	Change timestep device to cpu for xla (#11501 ) * Change timestep device to cpu for xla * Add all pipelines * ruff format * Apply style fixes --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2026-01-09 11:22:51 -10:00
Kashif Rasul	2bb640f8ea	[Research] Latent Perceptual Loss (LPL) for Stable Diffusion XL (#11573 ) * initial * added readme * fix formatting * added logging * formatting * use config * debug * better * handle SNR * floats have no item() * remove debug * formatting * add paper link * acknowledge reference source * rename script --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-09 10:24:21 -10:00
Fredy Rivera	2dc9d2af50	Add thread-safe wrappers for components in pipeline (examples/server-async/utils/requestscopedpipeline.py) (#12515 ) * Basic implementation of request scheduling * Basic editing in SD and Flux Pipelines * Small Fix * Fix * Update for more pipelines * Add examples/server-async * Add examples/server-async * Updated RequestScopedPipeline to handle a single tokenizer lock to avoid race conditions * Fix * Fix _TokenizerLockWrapper * Fix _TokenizerLockWrapper * Delete _TokenizerLockWrapper * Fix tokenizer * Update examples/server-async * Fix server-async * Optimizations in examples/server-async * We keep the implementation simple in examples/server-async * Update examples/server-async/README.md * Update examples/server-async/README.md for changes to tokenizer locks and backward-compatible retrieve_timesteps * The changes to the diffusers core have been undone and all logic is being moved to exmaples/server-async * Update examples/server-async/utils/* * Fix BaseAsyncScheduler * Rollback in the core of the diffusers * Update examples/server-async/README.md * Complete rollback of diffusers core files * Simple implementation of an asynchronous server compatible with SD3-3.5 and Flux Pipelines * Update examples/server-async/README.md * Fixed import errors in 'examples/server-async/serverasync.py' * Flux Pipeline Discard * Update examples/server-async/README.md * Apply style fixes * Add thread-safe wrappers for components in pipeline Refactor requestscopedpipeline.py to add thread-safe wrappers for tokenizer, VAE, and image processor. Introduce locking mechanisms to ensure thread safety during concurrent access. * Add wrappers.py * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2026-01-09 09:43:14 -10:00
Teriks	57e57cfae0	Store vae.config.scaling_factor to prevent missing attr reference (sdxl advanced dreambooth training script) (#12346 ) Store vae.config.scaling_factor to prevent missing attr reference In sdxl advanced dreambooth training script vae.config.scaling_factor becomes inaccessible after: del vae when: --cache_latents, and no --validation_prompt Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 09:42:30 -10:00
gapatron	644169433f	Laplace Scheduler for DDPM (#11320 ) * Add Laplace scheduler that samples more around mid-range noise levels (around log SNR=0), increasing performance (lower FID) with faster convergence speed, and robust to resolution and objective. Reference: https://arxiv.org/pdf/2407.03297. * Fix copies. * Apply style fixes --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2026-01-09 09:16:02 -10:00
Ishan Modi	632765a5ee	[Feature] MultiControlNet support for SD3Impainting (#11251 ) * update * update * addressed PR comments * update * Apply suggestions from code review --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2026-01-09 08:55:16 -10:00
David El Malih	d36564f06a	Improve docstrings and type hints in scheduling_consistency_models.py (#12931 ) docs: improve docstring scheduling_consistency_models.py	2026-01-09 09:56:56 -08:00
Sayak Paul	441b69eabf	[core] Handle progress bar and logging in distributed environments (#12806 ) * disable progressbar in distributed. * up * up * up * up * up * up	2026-01-09 22:23:13 +05:30
Sayak Paul	d568c9773f	[chore] remove controlnet implementations outside controlnet module. (#12152 ) * remove controlnet implementations outside controlnet module. * fix * fix * fix	2026-01-09 21:22:45 +05:30
Sayak Paul	3981c955ce	[modular] Tests for custom blocks in modular diffusers (#12557 ) * start custom block testing. * simplify modular workflow ci. * up * style. * up * up * up * up * up * up * Apply suggestions from code review * up	2026-01-09 15:57:23 +05:30
YiYi Xu	1903383e94	[Modular] qwen refactor (#12872 ) * 3 files * add conditoinal pipeline * refactor qwen modular * add layered * up up * u p * add to import * more refacotr, make layer work * clean up a bit git add src * more * style * style	2026-01-08 23:38:49 -10:00
Leo Jiang	08f8b7af9a	Bugfix for dreambooth flux2 img2img2 (#12825 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2026-01-09 12:34:44 +05:30
Howard Zhang	2f66edc880	Torchao floatx version guard (#12923 ) * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py Altered tests in test_torchao.py to version guard floatx Created new test to verify version guard of floatx support * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py Altered tests in test_torchao.py to version guard floatx Created new test to verify version guard of floatx support * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py Altered tests in test_torchao.py to version guard floatx Created new test to verify version guard of floatx support * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py Altered tests in test_torchao.py to version guard floatx Created new test to verify version guard of floatx support * Adding torchao version guard for floatx usage Summary: TorchAO removing floatx support, added version guard in quantization_config.py Altered tests in test_torchao.py to version guard floatx Created new test to verify version guard of floatx support --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-09 10:51:53 +05:30
TmacAaron	be38f41f9f	[NPU] npu attention enable ulysses (#12919 ) * npu attention enable ulysses * clean the format * register _native_npu_attention to _supports_context_parallel Signed-off-by: yyt <yangyit139@gmail.com> * change npu_fusion_attention's input_layout to BSND to eliminate redundant transpose Signed-off-by: yyt <yangyit139@gmail.com> * Update format --------- Signed-off-by: yyt <yangyit139@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2026-01-09 10:11:49 +05:30
MSD	91e5134175	fix the warning torch_dtype is deprecated (#12841 ) * fix the warning torch_dtype is deprecated * Add transformers version check (>= 4.56.0) for dtype parameter * Fix linting errors	2026-01-09 08:35:26 +05:30
Salman Chishti	a812c87465	Upgrade GitHub Actions for Node 24 compatibility (#12865 ) Signed-off-by: Salman Muin Kayser Chishti <13schishti@gmail.com>	2026-01-09 08:28:58 +05:30
Aditya Borate	8b9f817ef5	Fix: Remove hardcoded CUDA autocast in Kandinsky 5 to fix import warning (#12814 ) * Fix: Remove hardcoded CUDA autocast in Kandinsky 5 to fix import warning * Apply style fixes * Fix: Remove import-time autocast in Kandinsky to prevent warnings - Removed @torch.autocast decorator from Kandinsky classes. - Implemented manual F.linear casting to ensure numerical parity with FP32. - Verified bit-exact output matches main branch. Co-authored-by: hlky <hlky@hlky.ac> * Used _keep_in_fp32_modules to align with standards --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac>	2026-01-08 09:00:58 -10:00
David El Malih	b1f06b780a	Improve docstrings and type hints in scheduling_consistency_decoder.py (#12928 ) docs: improve docstring scheduling_consistency_decoder.py	2026-01-08 09:45:38 -08:00
Pauline Bailly-Masson	8600b4c10d	Add environment variables to checkout step (#12927 )	2026-01-08 13:38:06 +05:30

1 2 3 4 5 ...

6161 Commits