diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
toilaluan	d06c6bc6c2	fix taylor precision	2025-11-28 08:14:41 +00:00
toilaluan	309ce72140	quality & style	2025-11-28 07:28:44 +00:00
toilaluan	83b62531f8	add docs	2025-11-28 07:23:06 +00:00
toilaluan	24267c76de	chores: naming, remove redundancy	2025-11-28 07:23:01 +00:00
Tran Thanh Luan	656c7bc501	Merge branch 'main' into feat-taylorseer	2025-11-26 11:12:25 +07:00
Sayak Paul	b91e8c0d0b	[lora]: Fix Flux2 LoRA NaN test (#12714 ) * up * Update tests/lora/test_lora_layers_flux2.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> --------- Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>	2025-11-26 09:07:48 +05:30
Tran Thanh Luan	a644417835	Merge branch 'huggingface:main' into feat-taylorseer	2025-11-26 10:27:22 +07:00
Andrei Filatov	ac7864624b	Update script names in README for Flux2 training (#12713 )	2025-11-26 07:02:18 +05:30
Sayak Paul	5ffb73d4ae	let's go Flux2 🚀 (#12711 ) * add vae * Initial commit for Flux 2 Transformer implementation * add pipeline part * small edits to the pipeline and conversion * update conversion script * fix * up up * finish pipeline * Remove Flux IP Adapter logic for now * Remove deprecated 3D id logic * Remove ControlNet logic for now * Add link to ViT-22B paper as reference for parallel transformer blocks such as the Flux 2 single stream block * update pipeline * Don't use biases for input projs and output AdaNorm * up * Remove bias for double stream block text QKV projections * Add script to convert Flux 2 transformer to diffusers * make style and make quality * fix a few things. * allow sft files to go. * fix image processor * fix batch * style a bit * Fix some bugs in Flux 2 transformer implementation * Fix dummy input preparation and fix some test bugs * fix dtype casting in timestep guidance module. * resolve conflicts., * remove ip adapter stuff. * Fix Flux 2 transformer consistency test * Fix bug in Flux2TransformerBlock (double stream block) * Get remaining Flux 2 transformer tests passing * make style; make quality; make fix-copies * remove stuff. * fix type annotaton. * remove unneeded stuff from tests * tests * up * up * add sf support * Remove unused IP Adapter and ControlNet logic from transformer (#9) * copied from * Apply suggestions from code review Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: apolinário <joaopaulo.passos@gmail.com> * up * up * up * up * up * Refactor Flux2Attention into separate classes for double stream and single stream attention * Add _supports_qkv_fusion to AttentionModuleMixin to allow subclasses to disable QKV fusion * Have Flux2ParallelSelfAttention inherit from AttentionModuleMixin with _supports_qkv_fusion=False * Log debug message when calling fuse_projections on a AttentionModuleMixin subclass that does not support QKV fusion * Address review comments * Update src/diffusers/pipelines/flux2/pipeline_flux2.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * up * Remove maybe_allow_in_graph decorators for Flux 2 transformer blocks (#12) * up * support ostris loras. (#13) * up * update schdule * up * up (#17) * add training scripts (#16) * add training scripts Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com> * model cpu offload in validation. * add flux.2 readme * add img2img and tests * cpu offload in log validation * Apply suggestions from code review * fix * up * fixes * remove i2i training tests for now. --------- Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com> Co-authored-by: linoytsaban <linoy@huggingface.co> * up --------- Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: Daniel Gu <dgu8957@gmail.com> Co-authored-by: yiyi@huggingface.co <yiyi@ip-10-53-87-203.ec2.internal> Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: apolinário <joaopaulo.passos@gmail.com> Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal> Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com> Co-authored-by: linoytsaban <linoy@huggingface.co>	2025-11-25 21:49:04 +05:30
Jerry Wu	4088e8a851	Add Support for Z-Image Series (#12703 ) * Add Support for Z-Image. * Reformatting with make style, black & isort. * Remove init, Modify import utils, Merge forward in transformers block, Remove once func in pipeline. * modified main model forward, freqs_cis left * refactored to add B dim * fixed stack issue * fixed modulation bug * fixed modulation bug * fix bug * remove value_from_time_aware_config * styling * Fix neg embed and devide / bug; Reuse pad zero tensor; Turn cat -> repeat; Add hint for attn processor. * Replace padding with pad_sequence; Add gradient checkpointing. * Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that. * Fix Docstring and Make Style. * Revert "Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that." This reverts commit `fbf26b7ed1`. * update z-image docstring * Revert attention dispatcher * update z-image docstring * styling * Recover attention_dispatch.py with its origin impl, later would special commit for fa3 compatibility. * Fix prev bug, and support for prompt_embeds pass in args after prompt pre-encode as List of torch Tensor. * Remove einop dependency. * remove redundant imports & make fix-copies * fix import --------- Co-authored-by: liudongyang <liudongyang0114@gmail.com>	2025-11-25 05:50:00 -10:00
toilaluan	2be31f856e	fix format & doc	2025-11-25 06:02:13 +00:00
Tran Thanh Luan	b3217139f5	Merge branch 'main' into feat-taylorseer	2025-11-25 12:31:03 +07:00
toilaluan	a8ea383044	refractor to use state manager	2025-11-25 05:28:00 +00:00
Junsong Chen	d33d9f6715	fix typo in docs (#12675 ) * fix typo in docs * Update docs/source/en/api/pipelines/sana_video.md Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> --------- Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>	2025-11-24 19:42:16 -08:00
sq	dde8754ba2	Fix variable naming typos in community FluxControlNetFillInpaintPipeline (#12701 ) - Fixed variable naming typos (maskkk -> mask_fill, mask_imagee -> mask_image_fill, masked_imagee -> masked_image_fill, masked_image_latentsss -> masked_latents_fill) These changes improve code readability without affecting functionality.	2025-11-24 15:16:11 -08:00
cdutr	fbcd3ba6b2	[i8n-pt] Fix grammar and expand Portuguese documentation (#12598 ) * Updates Portuguese documentation for Diffusers library Enhances the Portuguese documentation with: - Restructured table of contents for improved navigation - Added placeholder page for in-translation content - Refined language and improved readability in existing pages - Introduced a new page on basic Stable Diffusion performance guidance Improves overall documentation structure and user experience for Portuguese-speaking users * Removes untranslated sections from Portuguese documentation Cleans up the Portuguese documentation table of contents by removing placeholder sections marked as "Em tradução" (In translation) Removes the in_translation.md file and associated table of contents entries for sections that are not yet translated, improving documentation clarity	2025-11-24 14:07:32 -08:00
Sayak Paul	d176f61fcf	[core] support sage attention + FA2 through `kernels` (#12439 ) * up * support automatic dispatch. * disable compile support for now./ * up * flash too. * document. * up * up * up * up	2025-11-24 16:58:07 +05:30
DefTruth	354d35adb0	bugfix: fix chrono-edit context parallel (#12660 ) * bugfix: fix chrono-edit context parallel * bugfix: fix chrono-edit context parallel * Update src/diffusers/models/transformers/transformer_chronoedit.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/models/transformers/transformer_chronoedit.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Clean up comments in transformer_chronoedit.py Removed unnecessary comments regarding parallelization in cross-attention. * fix style * fix qc --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-11-24 16:36:53 +05:30
SwayStar123	544ba677dd	Add FluxLoraLoaderMixin to Fibo pipeline (#12688 ) Update pipeline_bria_fibo.py	2025-11-24 13:31:31 +05:30
David El Malih	6f1042e36c	Improve docstrings and type hints in scheduling_lms_discrete.py (#12678 ) * Enhance type hints and docstrings in LMSDiscreteScheduler class Updated type hints for function parameters and return types to improve code clarity and maintainability. Enhanced docstrings for several methods, providing clearer descriptions of their functionality and expected arguments. Notable changes include specifying Literal types for certain parameters and ensuring consistent return type annotations across the class. * docs: Add specific paper reference to `_convert_to_karras` docstring. * Refactor `_convert_to_karras` docstring in DPMSolverSDEScheduler to include detailed descriptions and a specific paper reference, enhancing clarity and documentation consistency.	2025-11-21 10:18:09 -08:00
toilaluan	9083e1eba5	update to handle multple calls per timestep	2025-11-20 09:54:29 +00:00
Tran Thanh Luan	05f61a9cc3	Merge branch 'main' into feat-taylorseer	2025-11-20 14:21:11 +07:00
Pratim Dasude	d5da453de5	Community Pipeline: FluxFillControlNetInpaintPipeline for FLUX Fill-Based Inpainting with ControlNet (#12649 ) * new flux fill controlnet inpaint pipline * Delete src/diffusers/pipelines/flux/pipline_flux_fill_controlnet_Inpaint.py deleting from main flux pipeline * Fluc_fill_controlnet community pipline * Update README.md * Apply style fixes	2025-11-19 16:18:46 -03:00
David El Malih	15370f8412	Improve docstrings and type hints in scheduling_pndm.py (#12676 ) * Enhance docstrings and type hints in PNDMScheduler class - Updated parameter descriptions to include default values and specific types using Literal for better clarity. - Improved docstring formatting and consistency across methods, including detailed explanations for the `_get_prev_sample` method. - Added type hints for method return types to enhance code readability and maintainability. * Refactor docstring in PNDMScheduler class to enhance clarity - Simplified the explanation of the method for computing the previous sample from the current sample. - Updated the reference to the PNDM paper for better accessibility. - Removed redundant notation explanations to streamline the documentation.	2025-11-19 09:36:41 -08:00
Dhruv Nair	a96b145304	[CI] Fix failing Pipeline CPU tests (#12681 ) update Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-11-19 21:19:24 +05:30
Dhruv Nair	6d8973ffe2	[CI] Fix indentation issue in workflow files (#12685 ) update	2025-11-19 09:30:04 +05:30
Sayak Paul	ab71f3c864	[core] Refactor hub attn kernels (#12475 ) * refactor how attention kernels from hub are used. * up * refactor according to Dhruv's ideas. Co-authored-by: Dhruv Nair <dhruv@huggingface.co> * empty Co-authored-by: Dhruv Nair <dhruv@huggingface.co> * empty Co-authored-by: Dhruv Nair <dhruv@huggingface.co> * empty Co-authored-by: dn6 <dhruv@huggingface.co> * up --------- Co-authored-by: Dhruv Nair <dhruv@huggingface.co> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-11-19 08:19:00 +05:30
Dhruv Nair	b7df4a5387	[CI] Temporarily pin transformers (#12677 ) * update * update * update * update	2025-11-18 14:43:06 +05:30
dg845	67dc65e2e3	Revert `AutoencoderKLWan`'s `dim_mult` default value back to list (#12640 ) Revert dim_mult back to list and fix type annotation	2025-11-17 18:39:53 +05:30
Dhruv Nair	3579fdabf9	[CI] Make CI logs less verbose (#12674 ) update	2025-11-17 14:23:09 +05:30
Junsong Chen	1afc21855e	SANA-Video Image to Video pipeline `SanaImageToVideoPipeline` support (#12634 ) * move sana-video to a new dir and add `SanaImageToVideoPipeline` with no modify; * fix bug and run text/image-to-vidoe success; * make style; quality; fix-copies; * add sana image-to-video pipeline in markdown; * add test case for sana image-to-video; * make style; * add a init file in sana-video test dir; * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update tests/pipelines/sana_video/test_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update tests/pipelines/sana_video/test_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * minor update; * fix bug and skip fp16 save test; Co-authored-by: Yuyang Zhao <43061147+HeliosZhao@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update src/diffusers/pipelines/sana_video/pipeline_sana_video_i2v.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * add copied from for `encode_prompt` * Apply style fixes --------- Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> Co-authored-by: Yuyang Zhao <43061147+HeliosZhao@users.noreply.github.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-11-17 00:23:34 -08:00
toilaluan	d929ab28a7	apply ruff	2025-11-17 13:24:20 +07:00
Tran Thanh Luan	9290b5895f	Merge branch 'main' into feat-taylorseer	2025-11-17 13:21:41 +07:00
toilaluan	acfebfa3f3	update docs	2025-11-17 13:21:01 +07:00
David Bertoin	0c35b580fe	[PRX pipeline]: add 1024 resolution ratio bins (#12670 ) add 1024 ratio bins	2025-11-17 10:37:40 +05:30
toilaluan	7238d40dd9	add stop_predicts (cooldown)	2025-11-16 05:09:44 +00:00
David Bertoin	01a56927f1	Rope in float32 for mps or npu compatibility (#12665 ) rope in float32	2025-11-15 20:44:34 +05:30
toilaluan	51b4318a3e	allow special cache ids only	2025-11-15 05:13:33 +00:00
dg845	a9e4883b6a	Update Wan Animate Docs (#12658 ) * Update the Wan Animate docs to reflect the most recent code * Further explain input preprocessing and link to original Wan Animate preprocessing scripts	2025-11-14 16:06:22 -08:00
David El Malih	63dd601758	Improve docstrings and type hints in scheduling_euler_discrete.py (#12654 ) * refactor: enhance type hints and documentation in EulerDiscreteScheduler Updated type hints for function parameters and return types in the EulerDiscreteScheduler class to improve code clarity and maintainability. Enhanced docstrings for several methods to provide clearer descriptions of their functionality and expected arguments. This includes specifying Literal types for certain parameters and ensuring consistent return type annotations across the class. * refactor: enhance type hints and documentation across multiple schedulers Updated type hints and improved docstrings in various scheduler classes, including CMStochasticIterativeScheduler, CosineDPMSolverMultistepScheduler, and others. This includes specifying parameter types, return types, and providing clearer descriptions of method functionalities. Notable changes include the addition of default values in the begin_index argument and enhanced explanations for noise addition methods. These improvements aim to enhance code clarity and maintainability across the scheduling module. * refactor: update docstrings to clarify noise schedule construction Revised docstrings across multiple scheduler classes to enhance clarity regarding the construction of noise schedules. Updated references to relevant papers, ensuring accurate citations for the methodologies used. This includes changes in DEISMultistepScheduler, DPMSolverMultistepInverseScheduler, and others, improving documentation consistency and readability.	2025-11-14 15:12:24 -08:00
toilaluan	7b4ad2de63	add configurable cache, skip compute module	2025-11-14 09:09:46 +00:00
toilaluan	1099e493e6	refractor, add docs	2025-11-14 07:00:12 +00:00
Dhruv Nair	eeae0338e7	[Modular] Add Custom Blocks guide to doc (#12339 ) * update * update * Update docs/source/en/modular_diffusers/custom_blocks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/modular_diffusers/custom_blocks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/_toctree.yml Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/modular_diffusers/custom_blocks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update * update * update * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Apply suggestion from @stevhliu Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update * update * update * update * update * Update docs/source/en/modular_diffusers/custom_blocks.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-11-14 10:59:59 +05:30
David El Malih	3c1ca869d7	Improve docstrings and type hints in scheduling_ddpm.py (#12651 ) * Enhance type hints and docstrings in scheduling_ddpm.py - Added type hints for function parameters and return types across the DDPMScheduler class and related functions. - Improved docstrings for clarity, including detailed descriptions of parameters and return values. - Updated the alpha_transform_type and beta_schedule parameters to use Literal types for better type safety. - Refined the _get_variance and previous_timestep methods with comprehensive documentation. * Refactor docstrings and type hints in scheduling_ddpm.py - Cleaned up whitespace in the rescale_zero_terminal_snr function. - Enhanced the variance_type parameter in the DDPMScheduler class with improved formatting for better readability. - Updated the docstring for the compute_variance method to maintain consistency and clarity in parameter descriptions and return values. * Apply `make fix-copies` * Refactor type hints across multiple scheduler files - Updated type hints to include `Literal` for improved type safety in various scheduling files. - Ensured consistency in type hinting for parameters and return types across the affected modules. - This change enhances code clarity and maintainability.	2025-11-13 14:46:23 -08:00
David El Malih	6fe4a6ff8e	Improve docstrings and type hints in scheduling_ddim.py (#12622 ) * Improve docstrings and type hints in scheduling_ddim.py - Add complete type hints for all function parameters - Enhance docstrings to follow project conventions - Add missing parameter descriptions Fixes #9567 * Enhance docstrings and type hints in scheduling_ddim.py - Update parameter types and descriptions for clarity - Improve explanations in method docstrings to align with project standards - Add optional annotations for parameters where applicable * Refine type hints and docstrings in scheduling_ddim.py - Update parameter types to use Literal for specific string options - Enhance docstring descriptions for clarity and consistency - Ensure all parameters have appropriate type annotations and defaults * Apply review feedback on scheduling_ddim.py - Replace "prevent singularities" with "avoid numerical instability" for better clarity - Add backticks around `alpha_bar` variable name for consistent formatting - Convert Imagen Video paper URLs to Hugging Face papers references * Propagate changes using 'make fix-copies' * Add missing Literal	2025-11-13 14:45:58 -08:00
toilaluan	0602044da7	still update in warmup steps	2025-11-13 17:03:35 +00:00
Steven Liu	40de88af8c	[docs] AutoModel (#12644 ) * automodel * fix	2025-11-13 08:43:24 -08:00
Steven Liu	6a2309b98d	[utils] Update check_doc_toc (#12642 ) update Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-11-13 08:42:31 -08:00
toilaluan	8f80072618	use logger for printing, add warmup feature	2025-11-13 13:11:29 +00:00
toilaluan	8f495b607f	make compatible with any tuple size returned	2025-11-13 11:37:54 +00:00

1 2 3 4 5 ...

6041 Commits