diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
David Bertoin	dd07b19e27	Prx (#12525 ) * rename photon to prx * rename photon into prx * Revert .gitignore to state before commit `b7fb0fe9d6` * rename photon to prx * rename photon into prx * Revert .gitignore to state before commit `b7fb0fe9d6` * make fix-copies	2025-10-21 17:09:22 -07:00
David Bertoin	cefc2cf82d	Add Photon model and pipeline support (#12456 ) * Add Photon model and pipeline support This commit adds support for the Photon image generation model: - PhotonTransformer2DModel: Core transformer architecture - PhotonPipeline: Text-to-image generation pipeline - Attention processor updates for Photon-specific attention mechanism - Conversion script for loading Photon checkpoints - Documentation and tests * just store the T5Gemma encoder * enhance_vae_properties if vae is provided only * remove autocast for text encoder forwad * BF16 example * conditioned CFG * remove enhance vae and use vae.config directly when possible * move PhotonAttnProcessor2_0 in transformer_photon * remove einops dependency and now inherits from AttentionMixin * unify the structure of the forward block * update doc * update doc * fix T5Gemma loading from hub * fix timestep shift * remove lora support from doc * Rename EmbedND for PhotoEmbedND * remove modulation dataclass * put _attn_forward and _ffn_forward logic in PhotonBlock's forward * renam LastLayer for FinalLayer * remove lora related code * rename vae_spatial_compression_ratio for vae_scale_factor * support prompt_embeds in call * move xattention conditionning out computation out of the denoising loop * add negative prompts * Use _import_structure for lazy loading * make quality + style * add pipeline test + corresponding fixes * utility function that determines the default resolution given the VAE * Refactor PhotonAttention to match Flux pattern * built-in RMSNorm * Revert accidental .gitignore change * parameter names match the standard diffusers conventions * renaming and remove unecessary attributes setting * Update docs/source/en/api/pipelines/photon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * quantization example * added doc to toctree * Update docs/source/en/api/pipelines/photon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/photon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/photon.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * use dispatch_attention_fn for multiple attention backend support * naming changes * make fix copy * Update docs/source/en/api/pipelines/photon.md Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Add PhotonTransformer2DModel to TYPE_CHECKING imports * make fix-copies * Use Tuple instead of tuple Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * restrict the version of transformers Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update tests/pipelines/photon/test_pipeline_photon.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * Update tests/pipelines/photon/test_pipeline_photon.py Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> * change \| for Optional * fix nits. * use typing Dict --------- Co-authored-by: davidb <davidb@worker-10.soperator-worker-svc.soperator.svc.cluster.local> Co-authored-by: David Briand <david@photoroom.com> Co-authored-by: davidb <davidb@worker-8.soperator-worker-svc.soperator.svc.cluster.local> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2025-10-21 20:55:55 +05:30
Sayak Paul	b3e56e71fb	styling issues. (#12522 )	2025-10-21 20:04:54 +05:30
Steven Liu	5b5fa49a89	[docs] Organize toctree by modality (#12514 ) * reorganize * fix --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2025-10-21 10:18:54 +05:30
Lev Novitskiy	23ebbb4bc8	Kandinsky 5 is finally in Diffusers! (#12478 ) * add kandinsky5 transformer pipeline first version --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Charles <charles@huggingface.co>	2025-10-17 18:34:30 -10:00
Ali Imran	1b456bd5d5	docs: cleanup of runway model (#12503 ) * cleanup of runway model * quality fixes	2025-10-17 14:10:50 -07:00
Steven Liu	26475082cb	[docs] Attention checks (#12486 ) * checks * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-10-16 09:19:30 -07:00
YiYi Xu	f072c64bf2	ltx0.9.8 (without IC lora, autoregressive sampling) (#12493 ) update Co-authored-by: Aryan <aryan@huggingface.co>	2025-10-15 07:41:17 -10:00
Steven Liu	b4e6dc3037	[docs] Fix broken links (#12487 ) fix broken links	2025-10-15 06:42:10 +05:30
Steven Liu	3eb40786ca	[docs] Prompting (#12312 ) * init * fix * batch inf * feedback * update	2025-10-14 13:53:56 -07:00
Meatfucker	a4bc845478	Fix missing load_video documentation and load_video import in WanVideoToVideoPipeline example code (#12472 ) * Update utilities.md Update missing load_video documentation * Update pipeline_wan_video2video.py Fix missing load_video import in example code	2025-10-14 10:43:21 -07:00
Manith Ratnayake	fa468c5d57	docs: api-pipelines-qwenimage typo fix (#12461 )	2025-10-13 08:57:46 -07:00
Steven Liu	8abc7aeb71	[docs] Fix syntax (#12464 ) * fix syntax * fix * style * fix	2025-10-11 08:13:30 +05:30
Changseop Yeom	0974b4c606	[i18n-KO] Fix typo and update translation in ethical_guidelines.md (#12435 )	2025-10-06 14:24:05 -07:00
Steven Liu	cc5b31ffc9	[docs] Migrate syntax (#12390 ) * change syntax * make style	2025-09-30 10:11:19 -07:00
Steven Liu	d7a1a0363f	[docs] CP (#12331 ) * init * feedback * feedback * feedback * feedback * feedback * feedback	2025-09-30 09:33:41 -07:00
Steven Liu	c07fcf780a	[docs] Model formats (#12256 ) * init * config * lora metadata * feedback * fix * cache allocator warmup for from_single_file * feedback * feedback	2025-09-29 11:36:14 -07:00
Steven Liu	ccedeca96e	[docs] Distributed inference (#12285 ) * init * feedback --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-09-29 11:24:26 -07:00
Sayak Paul	64a5187d96	[quantization] feat: support aobaseconfig classes in `TorchAOConfig` (#12275 ) * feat: support aobaseconfig classes. * [docs] AOBaseConfig (#12302) init Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * up * replace with is_torchao_version * up * up --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-09-29 18:04:18 +05:30
Sayak Paul	9c0944581a	[docs] slight edits to the attention backends docs. (#12394 ) * slight edits to the attention backends docs. * Update docs/source/en/optimization/attention_backends.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-09-26 21:50:16 +05:30
DefTruth	310fdaf556	Introduce cache-dit to community optimization (#12366 ) * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * misc: update examples link * misc: update examples link * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * docs: introduce cache-dit to diffusers * Refine documentation for CacheDiT features Updated the wording for clarity and consistency in the documentation. Adjusted sections on cache acceleration, automatic block adapter, patch functor, and hybrid cache configuration.	2025-09-24 10:50:57 -07:00
Aryan	dcb6dd9b7a	Context Parallel w/ Ring & Ulysses & Unified Attention (#11941 ) * update * update * add coauthor Co-Authored-By: Dhruv Nair <dhruv.nair@gmail.com> * improve test * handle ip adapter params correctly * fix chroma qkv fusion test * fix fastercache implementation * fix more tests * fight more tests * add back set_attention_backend * update * update * make style * make fix-copies * make ip adapter processor compatible with attention dispatcher * refactor chroma as well * remove rmsnorm assert * minify and deprecate npu/xla processors * update * refactor * refactor; support flash attention 2 with cp * fix * support sage attention with cp * make torch compile compatible * update * refactor * update * refactor * refactor * add ulysses backward * try to make dreambooth script work; accelerator backward not playing well * Revert "try to make dreambooth script work; accelerator backward not playing well" This reverts commit `768d0ea6fa`. * workaround compilation problems with triton when doing all-to-all * support wan * handle backward correctly * support qwen * support ltx * make fix-copies * Update src/diffusers/models/modeling_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * apply review suggestions * update docs * add explanation * make fix-copies * add docstrings * support passing parallel_config to from_pretrained * apply review suggestions * make style * update * Update docs/source/en/api/parallel.md Co-authored-by: Aryan <aryan@huggingface.co> * up --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2025-09-24 19:03:25 +05:30
Steven Liu	a72bc0c4bb	[docs] Attention backends (#12320 ) * init * feedback * update * feedback * fixes	2025-09-23 10:59:46 -07:00
Steven Liu	76810eca2b	[docs] Schedulers (#12246 ) * init * toctree * scheduler suggestions * toctree	2025-09-23 10:29:16 -07:00
Sayak Paul	5796735015	add test and doc for QwenImageEdit Plus (#12363 ) * up * xfail some tests * up * up	2025-09-22 21:57:30 +05:30
Sayak Paul	eb7ef26736	[quant] allow `components_to_quantize` to be a non-list for single components (#12234 ) * allow non list components_to_quantize. * up * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * [docs] components_to_quantize (#12287) init Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-09-10 09:47:08 -10:00
Sayak Paul	43459079ab	[core] feat: support group offloading at the pipeline level (#12283 ) * feat: support group offloading at the pipeline level. * add tests * up * [docs] Pipeline group offloading (#12286) init Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-09-10 09:09:57 +05:30
YiYi Xu	f50b18eec7	[Modular] Qwen (#12220 ) * add qwen modular	2025-09-08 00:27:02 -10:00
Steven Liu	fc337d5853	[docs] Models (#12248 ) * init * fix * feedback * feedback	2025-09-05 11:52:09 -07:00
Steven Liu	32798bf242	[docs] Inference section cleanup (#12281 ) init Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-09-05 09:34:37 -07:00
Steven Liu	c2e5ece08b	[docs] Sharing pipelines/models (#12280 ) init	2025-09-04 11:43:47 -07:00
Ishan Modi	4acbfbf13b	[Quantization] Add TRT-ModelOpt as a Backend (#11173 ) * initial commit * update * updates * update * update * update * update * update * update * addressed PR comments * update * addressed PR comments * update * update * update * update * update * update * updates * update * update * addressed PR comments * updates * code formatting * update * addressed PR comments * addressed PR comments * addressed PR comments * addressed PR comments * fix docs and dependencies * fixed dependency test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-09-03 10:14:52 +05:30
Steven Liu	6549b04ec6	[docs] AutoPipeline (#12160 ) * refresh * feedback * feedback * supported models * fix	2025-09-02 21:06:26 -07:00
Ziheng Zhang	9e4a75b142	[docs] Fix VAE scale factor calculation in distributed inference docs (#12259 ) docs: Fix VAE scale factor calculation	2025-09-01 16:34:16 -10:00
Nguyễn Trọng Tuấn	67ffa7031e	Add Qwen-Image-Edit Inpainting pipeline (#12225 ) * add qwenimage-edit inpaint feature * stay up to date with main branch * fix style * fix docs * copies * fix * again * copies --------- Co-authored-by: “Trgtuan10” <“tuannguyentrong.402@gmail.com”> Co-authored-by: TuanNT-ZenAI <tuannt.zenai@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2025-08-30 19:49:15 -10:00
Dhruv Nair	ba0e732eb0	[Modular] Consolidate `load_default_components` into `load_components` (#12217 ) * update * Apply style fixes * update * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-08-28 19:55:02 +05:30
Steven Liu	cbecc33570	[docs] Reproducibility (#12237 ) * init * dupe * feedback	2025-08-27 11:35:31 -07:00
Steven Liu	5237a82a35	[docs] Remove Flax (#12244 ) * remove flax * toctree * feedback	2025-08-27 11:11:07 -07:00
Manith Ratnayake	552c127c05	docs: correct typos in using-diffusers/other-formats (#12243 )	2025-08-26 08:48:05 -07:00
Tianqi Tang	4b7fe044e3	Fix typos and inconsistencies (#12204 ) Fix typos and test assertions Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-08-26 07:58:08 -07:00
Tolga Cangöz	5fcd5f560f	Propose to update & upgrade SkyReels-V2 (#12167 ) * fix: update SkyReels-V2 documentation and moving into attn dispatcher * Refactors SkyReelsV2's attention implementation * style * up * Fixes formatting in SkyReels-V2 documentation Wraps the visual demonstration section in a Markdown code block. This change corrects the rendering of ASCII diagrams and examples, improving the overall readability of the document. * Docs: Condense example arrays in skyreels_v2 guide Improves the readability of the `step_matrix` examples by replacing long sequences of repeated numbers with a more compact `value×count` notation. This change makes the underlying data patterns in the examples easier to understand at a glance. * Add _repeated_blocks attribute to SkyReelsV2Transformer3DModel * Refactor rotary embedding calculations in SkyReelsV2 to separate cosine and sine frequencies * Enhance SkyReels-V2 documentation: update model loading for GPU support and remove outdated notes * up * up * Update model_id in SkyReels-V2 documentation * up * refactor: remove device_map parameter for model loading and add pipeline.to("cuda") for GPU allocation * fix: update copyright year to 2025 in skyreels_v2.md * docs: enhance parameter examples and formatting in skyreels_v2.md * docs: update example formatting and add notes on LoRA support in skyreels_v2.md * refactor: remove copied comments from transformer_wan in SkyReelsV2 classes * Clean up comments in skyreels_v2.md Removed comments about acceleration helpers and Flash Attention installation. * Add deprecation warning for `SkyReelsV2AttnProcessor2_0` class	2025-08-26 12:54:19 +05:30
Manith Ratnayake	8f8888a76e	[docs] typo : corrected 'compile regions' to 'compile_regions' (#12199 ) [docs] typo: corrected 'compile regions' to 'compile_regions'	2025-08-25 11:35:48 -07:00
Cyan	afc9721898	Fix typo in LoRA (#12228 ) Fix formatting in using_peft_for_inference.md	2025-08-25 11:19:55 -07:00
Steven Liu	2c4ee10b77	[docs] Diffusion pipeline (#12148 ) * init * refactor * refresh * fix? * fix? * fix * fix-copies * feedback * feedback * fix * feedback	2025-08-25 11:06:12 -07:00
Sayak Paul	cf1ca728ea	fix title for compile + offload quantized models (#12233 ) * up * up * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-08-25 17:42:06 +02:00
Sayak Paul	144e6e2540	[docs] change wan2.1 -> wan (#12230 ) * change wan2.1 -> wan * up	2025-08-25 17:30:12 +02:00
Frank (Haofan) Wang	561ab54de3	Support ControlNet for Qwen-Image (#12215 ) * support qwen-image-cn-union --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-08-22 11:00:01 -10:00
Steven Liu	b60faf456b	[docs] Pipeline callbacks (#12212 ) * init * review	2025-08-22 13:01:24 -07:00
Steven Liu	3e73dc24a4	[docs] Community pipelines (#12201 ) * refresh * feedback	2025-08-22 10:42:13 -07:00
Vương Đình Minh	d03240801f	[Docs] Add documentation for KontextInpaintingPipeline (#12197 ) * [Docs] Add documentation for KontextInpaintingPipeline * Update docs/source/en/api/pipelines/flux.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * update kontext inpaint docs with hfoption * Update docs/source/en/api/pipelines/flux.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/flux.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2025-08-22 00:04:28 -07:00

1 2 3 4 5 ...

1153 Commits