mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00
Commit Graph

6060 Commits

Author SHA1 Message Date
YiYi Xu
404d3fa9a5 Update docs/source/en/api/pipelines/hunyuan_video15.md 2025-11-30 15:24:18 -10:00
YiYi Xu
5989014cfe Update docs/source/en/api/pipelines/hunyuan_video15.md 2025-11-30 15:21:17 -10:00
YiYi Xu
0dae8f956d Apply suggestions from code review 2025-11-30 15:10:16 -10:00
yiyixuxu
c715470709 add a note on changing guidance_scale on doc 2025-12-01 02:06:13 +01:00
YiYi Xu
2c018f8be6 Update docs/source/en/api/pipelines/hunyuan_video15.md 2025-11-30 14:43:51 -10:00
yiyi@huggingface.co
bdfab30766 up 2025-12-01 00:35:30 +00:00
yiyi@huggingface.co
d7f399d1b2 add a note on the doc about attention backend 2025-12-01 00:33:00 +00:00
YiYi Xu
237d318e85 Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-11-30 13:22:43 -10:00
YiYi Xu
54f008e30b Update docs/source/en/api/pipelines/hunyuan_video15.md 2025-11-30 13:17:31 -10:00
yiyi@huggingface.co
c3f45982b6 up 2025-11-30 20:40:31 +00:00
yiyi@huggingface.co
7aeab3f847 add to toctree 2025-11-30 20:39:49 +00:00
yiyi@huggingface.co
50abf504a1 add docs 2025-11-30 20:38:31 +00:00
yiyi@huggingface.co
8aa458ed46 copies 2025-11-30 20:16:59 +00:00
yiyi@huggingface.co
5029dbf763 style 2025-11-30 20:14:36 +00:00
YiYi Xu
3980f97d2f Merge branch 'main' into hunyuanvideo15 2025-11-30 09:54:37 -10:00
yiyi@huggingface.co
e1940341ff clean up pipelines a bit more 2025-11-30 19:46:38 +00:00
yiyi@huggingface.co
e319d7207a simplify transformer 2025-11-30 19:24:00 +00:00
yiyi@huggingface.co
f9cb82b64f a few small fixes: preprocess, cpu_offloading, attention backend 2025-11-30 18:38:33 +00:00
DefTruth
152f7ca357 fix type-check for z-image transformer (#12739)
* allow type-check for ZImageTransformer2DModel

* make fix-copies
2025-11-29 14:58:33 +05:30
yiyi@huggingface.co
c22915d6c4 up up 2025-11-29 05:36:33 +00:00
yiyi@huggingface.co
0687a40768 remove use_meanflow 2025-11-29 05:36:22 +00:00
yiyi@huggingface.co
753d4075f9 add image to video pipeline 2025-11-29 01:59:44 +00:00
yiyi@huggingface.co
e3301cbda4 add i2v pipeline 2025-11-29 00:49:18 +00:00
yiyi@huggingface.co
090ceb5d4f remove dtype from the _get_ encoding methods 2025-11-29 00:33:58 +00:00
Dhruv Nair
b010a8ce0c [Modular] Add single file support to Modular (#12383)
* update

* update

* update

* update

* Apply style fixes

* update

* update

* update

* update

* update

---------

Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-11-28 22:23:04 +05:30
Ayush Sur
1b91856d0e Fix examples not loading LoRA adapter weights from checkpoint (#12690)
* Fix examples not loading LoRA adapter weights from checkpoint

* Updated lora saving logic with accelerate save_model_hook and load_model_hook

* Formatted the changes using ruff

* import and upcasting changed

---------

Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
2025-11-28 11:56:39 +05:30
yiyi@huggingface.co
38c42b4de1 conversion scripts 2025-11-27 22:15:08 +00:00
Sayak Paul
01e355516b Enable regional compilation on z-image transformer model (#12736)
up
2025-11-27 07:18:00 -10:00
Sayak Paul
6bf668c4d2 [chore] remove torch.save from remnant code. (#12717)
remove torch.save from remnant code.
2025-11-27 13:04:09 +05:30
yiyi@huggingface.co
db0127cb9d fix 2025-11-27 05:25:28 +00:00
yiyi@huggingface.co
a0b2fe02b0 update conversion script: remove dtype, always keep same precision as original checkpoint 2025-11-27 05:24:49 +00:00
Jerry Wu
e6d4612309 Support unittest for Z-image (#12715)
* Add Support for Z-Image.

* Reformatting with make style, black & isort.

* Remove init, Modify import utils, Merge forward in transformers block, Remove once func in pipeline.

* modified main model forward, freqs_cis left

* refactored to add B dim

* fixed stack issue

* fixed modulation bug

* fixed modulation bug

* fix bug

* remove value_from_time_aware_config

* styling

* Fix neg embed and divide / bug; Reuse pad zero tensor; Turn cat -> repeat; Add hint for attn processor.

* Replace padding with pad_sequence; Add gradient checkpointing.

* Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that.

* Fix Docstring and Make Style.

* Revert "Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that."

This reverts commit fbf26b7ed1.

* update z-image docstring

* Revert attention dispatcher

* update z-image docstring

* styling

* Recover attention_dispatch.py with its original implementation; a later commit will handle fa3 compatibility.

* Fix previous bug, and support passing prompt_embeds in args as a List of torch Tensors after prompt pre-encoding.

* Remove einops dependency.

* remove redundant imports & make fix-copies

* fix import

* Support for num_images_per_prompt>1; Remove redundant unquote variables.

* Fix bugs for num_images_per_prompt with actual batch.

* Add unit tests for Z-Image.

* Refine unit tests and skip cases that need a separate test env; fix compatibility with unit tests in model, mostly precision formatting.

* Add clean env for test_save_load_float16 separate test; Add Note; Styling.

* Update dtype mentioned by yiyi.

---------

Co-authored-by: liudongyang <liudongyang0114@gmail.com>
2025-11-26 07:18:57 -10:00
David El Malih
a88a7b4f03 Improve docstrings and type hints in scheduling_dpmsolver_multistep.py (#12710)
* Improve docstrings and type hints in multiple diffusion schedulers

* docs: update Imagen Video paper link to Hugging Face Papers.
2025-11-26 08:38:41 -08:00
Sayak Paul
c8656ed73c [docs] put autopipeline after overview and hunyuanimage in images (#12548)
put autopipeline after overview and hunyuanimage in images
2025-11-26 15:34:22 +05:30
yiyi@huggingface.co
2f6914d57a up up 2025-11-26 07:38:30 +00:00
yiyi@huggingface.co
c739ee9ced update conversion script 2025-11-26 07:38:16 +00:00
Sayak Paul
94c9613f99 [docs] Correct flux2 links (#12716)
* fix links

* up
2025-11-26 10:46:51 +05:30
Sayak Paul
b91e8c0d0b [lora]: Fix Flux2 LoRA NaN test (#12714)
* up

* Update tests/lora/test_lora_layers_flux2.py

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

---------

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
2025-11-26 09:07:48 +05:30
Andrei Filatov
ac7864624b Update script names in README for Flux2 training (#12713) 2025-11-26 07:02:18 +05:30
Sayak Paul
5ffb73d4ae let's go Flux2 🚀 (#12711)
* add vae

* Initial commit for Flux 2 Transformer implementation

* add pipeline part

* small edits to the pipeline and conversion

* update conversion script

* fix

* up up

* finish pipeline

* Remove Flux IP Adapter logic for now

* Remove deprecated 3D id logic

* Remove ControlNet logic for now

* Add link to ViT-22B paper as reference for parallel transformer blocks such as the Flux 2 single stream block

* update pipeline

* Don't use biases for input projs and output AdaNorm

* up

* Remove bias for double stream block text QKV projections

* Add script to convert Flux 2 transformer to diffusers

* make style and make quality

* fix a few things.

* allow sft files to go.

* fix image processor

* fix batch

* style a bit

* Fix some bugs in Flux 2 transformer implementation

* Fix dummy input preparation and fix some test bugs

* fix dtype casting in timestep guidance module.

* resolve conflicts.

* remove ip adapter stuff.

* Fix Flux 2 transformer consistency test

* Fix bug in Flux2TransformerBlock (double stream block)

* Get remaining Flux 2 transformer tests passing

* make style; make quality; make fix-copies

* remove stuff.

* fix type annotation.

* remove unneeded stuff from tests

* tests

* up

* up

* add sf support

* Remove unused IP Adapter and ControlNet logic from transformer (#9)

* copied from

* Apply suggestions from code review

Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>

* up

* up

* up

* up

* up

* Refactor Flux2Attention into separate classes for double stream and single stream attention

* Add _supports_qkv_fusion to AttentionModuleMixin to allow subclasses to disable QKV fusion

* Have Flux2ParallelSelfAttention inherit from AttentionModuleMixin with _supports_qkv_fusion=False

* Log debug message when calling fuse_projections on a AttentionModuleMixin subclass that does not support QKV fusion

* Address review comments

* Update src/diffusers/pipelines/flux2/pipeline_flux2.py

Co-authored-by: YiYi Xu <yixu310@gmail.com>

* up

* Remove maybe_allow_in_graph decorators for Flux 2 transformer blocks (#12)

* up

* support ostris loras. (#13)

* up

* update schedule

* up

* up (#17)

* add training scripts (#16)

* add training scripts

Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>

* model cpu offload in validation.

* add flux.2 readme

* add img2img and tests

* cpu offload in log validation

* Apply suggestions from code review

* fix

* up

* fixes

* remove i2i training tests for now.

---------

Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
Co-authored-by: linoytsaban <linoy@huggingface.co>

* up

---------

Co-authored-by: yiyixuxu <yixu310@gmail.com>
Co-authored-by: Daniel Gu <dgu8957@gmail.com>
Co-authored-by: yiyi@huggingface.co <yiyi@ip-10-53-87-203.ec2.internal>
Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
Co-authored-by: apolinário <joaopaulo.passos@gmail.com>
Co-authored-by: yiyi@huggingface.co <yiyi@ip-26-0-160-103.ec2.internal>
Co-authored-by: Linoy Tsaban <linoytsaban@gmail.com>
Co-authored-by: linoytsaban <linoy@huggingface.co>
2025-11-25 21:49:04 +05:30
Jerry Wu
4088e8a851 Add Support for Z-Image Series (#12703)
* Add Support for Z-Image.

* Reformatting with make style, black & isort.

* Remove init, Modify import utils, Merge forward in transformers block, Remove once func in pipeline.

* modified main model forward, freqs_cis left

* refactored to add B dim

* fixed stack issue

* fixed modulation bug

* fixed modulation bug

* fix bug

* remove value_from_time_aware_config

* styling

* Fix neg embed and divide / bug; Reuse pad zero tensor; Turn cat -> repeat; Add hint for attn processor.

* Replace padding with pad_sequence; Add gradient checkpointing.

* Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that.

* Fix Docstring and Make Style.

* Revert "Fix flash_attn3 in dispatch attn backend by _flash_attn_forward, replace its origin implement; Add DocString in pipeline for that."

This reverts commit fbf26b7ed1.

* update z-image docstring

* Revert attention dispatcher

* update z-image docstring

* styling

* Recover attention_dispatch.py with its original implementation; a later commit will handle fa3 compatibility.

* Fix previous bug, and support passing prompt_embeds in args as a List of torch Tensors after prompt pre-encoding.

* Remove einops dependency.

* remove redundant imports & make fix-copies

* fix import

---------

Co-authored-by: liudongyang <liudongyang0114@gmail.com>
2025-11-25 05:50:00 -10:00
Junsong Chen
d33d9f6715 fix typo in docs (#12675)
* fix typo in docs

* Update docs/source/en/api/pipelines/sana_video.md

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>

---------

Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>
2025-11-24 19:42:16 -08:00
sq
dde8754ba2 Fix variable naming typos in community FluxControlNetFillInpaintPipeline (#12701)
- Fixed variable naming typos (maskkk -> mask_fill, mask_imagee -> mask_image_fill, masked_imagee -> masked_image_fill, masked_image_latentsss -> masked_latents_fill)

These changes improve code readability without affecting functionality.
2025-11-24 15:16:11 -08:00
cdutr
fbcd3ba6b2 [i18n-pt] Fix grammar and expand Portuguese documentation (#12598)
* Updates Portuguese documentation for Diffusers library

Enhances the Portuguese documentation with:
- Restructured table of contents for improved navigation
- Added placeholder page for in-translation content
- Refined language and improved readability in existing pages
- Introduced a new page on basic Stable Diffusion performance guidance

Improves overall documentation structure and user experience for Portuguese-speaking users

* Removes untranslated sections from Portuguese documentation

Cleans up the Portuguese documentation table of contents by removing placeholder sections marked as "Em tradução" (In translation)

Removes the in_translation.md file and associated table of contents entries for sections that are not yet translated, improving documentation clarity
2025-11-24 14:07:32 -08:00
Sayak Paul
d176f61fcf [core] support sage attention + FA2 through kernels (#12439)
* up

* support automatic dispatch.

* disable compile support for now.

* up

* flash too.

* document.

* up

* up

* up

* up
2025-11-24 16:58:07 +05:30
DefTruth
354d35adb0 bugfix: fix chrono-edit context parallel (#12660)
* bugfix: fix chrono-edit context parallel

* bugfix: fix chrono-edit context parallel

* Update src/diffusers/models/transformers/transformer_chronoedit.py

Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Update src/diffusers/models/transformers/transformer_chronoedit.py

Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>

* Clean up comments in transformer_chronoedit.py

Removed unnecessary comments regarding parallelization in cross-attention.

* fix style

* fix qc

---------

Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
2025-11-24 16:36:53 +05:30
SwayStar123
544ba677dd Add FluxLoraLoaderMixin to Fibo pipeline (#12688)
Update pipeline_bria_fibo.py
2025-11-24 13:31:31 +05:30
yiyixuxu
76bb607bc0 fix more, system prompt etc 2025-11-23 05:43:01 +01:00
yiyixuxu
5732d60db3 fix a bit more, remove print lines 2025-11-22 06:37:03 +01:00
yiyixuxu
b282ac1510 add text encoders to conversion script 2025-11-22 02:06:42 +01:00