shm4r7
de043c6044
Update chroma.md ( #11891 )
...
Fix typo in Inference example code
2025-07-09 09:58:38 +05:30
Steven Liu
64a9210315
[docs] Deprecated pipelines ( #11838 )
...
add warning
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-07-01 14:02:54 -10:00
Sayak Paul
470458623e
[docs] fix single_file example. ( #11847 )
...
fix single_file example.
2025-07-01 21:23:27 +05:30
Aryan
a79c3af6bb
[single file] Cosmos ( #11801 )
...
* update
* update
* update docs
2025-07-01 18:02:58 +05:30
Aryan
d7dd924ece
Kontext fixes ( #11815 )
...
fix
2025-06-26 13:03:44 -10:00
Sayak Paul
00f95b9755
Kontext training ( #11813 )
...
* support flux kontext
* make fix-copies
* add example
* add tests
* update docs
* update
* add note on integrity checker
* initial commit
* initial commit
* add readme section and fixes in the training script.
* add test
* rectify ckpt_id
* fix ckpt
* fixes
* change id
* update
* Update examples/dreambooth/train_dreambooth_lora_flux_kontext.py
Co-authored-by: Aryan <aryan@huggingface.co >
* Update examples/dreambooth/README_flux.md
---------
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: linoytsaban <linoy@huggingface.co >
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-06-26 19:31:42 +03:00
Aryan
eea76892e8
Flux Kontext ( #11812 )
...
* support flux kontext
* make fix-copies
* add example
* add tests
* update docs
* update
* add note on integrity checker
* make fix-copies issue
* add copied froms
* make style
* update repository ids
* more copied froms
2025-06-26 21:29:59 +05:30
Dhruv Nair
195926bbdc
Update Chroma Docs ( #11753 )
...
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-06-19 19:33:19 +02:00
Aryan
a4df8dbc40
Update more licenses to 2025 ( #11746 )
...
update
2025-06-19 07:46:01 +05:30
Edna
8adc6003ba
Chroma Pipeline ( #11698 )
...
* working state from hameerabbasi and iddl
* working state from hameerabbasi and iddl (transformer)
* working state (normalization)
* working state (embeddings)
* add chroma loader
* add chroma to mappings
* add chroma to transformer init
* take out variant stuff
* get decently far in changing variant stuff
* add chroma init
* make chroma output class
* add chroma transformer to dummy tp
* add chroma to init
* add chroma to init
* fix single file
* update
* update
* add chroma to auto pipeline
* add chroma to pipeline init
* change to chroma transformer
* take out variant from blocks
* swap embedder location
* remove prompt_2
* work on swapping text encoders
* remove mask function
* dont modify mask (for now)
* wrap attn mask
* no attn mask (can't get it to work)
* remove pooled prompt embeds
* change to my own unpooled embedder
* fix load
* take pooled projections out of transformer
* ensure correct dtype for chroma embeddings
* update
* use dn6 attn mask + fix true_cfg_scale
* use chroma pipeline output
* use DN6 embeddings
* remove guidance
* remove guidance embed (pipeline)
* remove guidance from embeddings
* don't return length
* dont change dtype
* remove unused stuff, fix up docs
* add chroma autodoc
* add .md (oops)
* initial chroma docs
* undo don't change dtype
* undo arxiv change
unsure why that happened
* fix hf papers regression in more places
* Update docs/source/en/api/pipelines/chroma.md
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* do_cfg -> self.do_classifier_free_guidance
* Update docs/source/en/api/models/chroma_transformer.md
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* Update chroma.md
* Move chroma layers into transformer
* Remove pruned AdaLayerNorms
* Add chroma fast tests
* (untested) batch cond and uncond
* Add # Copied from for shift
* Update # Copied from statements
* update norm imports
* Revert cond + uncond batching
* Add transformer tests
* move chroma test (oops)
* chroma init
* fix chroma pipeline fast tests
* Update src/diffusers/models/transformers/transformer_chroma.py
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
* Move Approximator and Embeddings
* Fix auto pipeline + make style, quality
* make style
* Apply style fixes
* switch to new input ids
* fix # Copied from error
* remove # Copied from on protected members
* try to fix import
* fix import
* make fix-copies
* revert style fix
* update chroma transformer params
* update chroma transformer approximator init params
* update to pad tokens
* fix batch inference
* Make more pipeline tests work
* Make most transformer tests work
* fix docs
* make style, make quality
* skip batch tests
* fix test skipping
* fix test skipping again
* fix for tests
* Fix all pipeline test
* update
* push local changes, fix docs
* add encoder test, remove pooled dim
* default proj dim
* fix tests
* fix equal size list input
* update
* push local changes, fix docs
* add encoder test, remove pooled dim
* default proj dim
* fix tests
* fix equal size list input
* Revert "fix equal size list input"
This reverts commit 3fe4ad67d5 .
* update
* update
* update
* update
* update
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-06-14 06:52:56 +05:30
Aryan
9f91305f85
Cosmos Predict2 ( #11695 )
...
* support text-to-image
* update example
* make fix-copies
* support use_flow_sigmas in EDM scheduler instead of maintain cosmos-specific scheduler
* support video-to-world
* update
* rename text2image pipeline
* make fix-copies
* add t2i test
* add test for v2w pipeline
* support edm dpmsolver multistep
* update
* update
* update
* update tests
* fix tests
* safety checker
* make conversion script work without guardrail
2025-06-14 01:51:29 +05:30
Aryan
73a9d5856f
Wan VACE ( #11582 )
...
* initial support
* make fix-copies
* fix no split modules
* add conversion script
* refactor
* add pipeline test
* refactor
* fix bug with mask
* fix for reference images
* remove print
* update docs
* update slices
* update
* update
* update example
2025-06-06 17:53:10 +05:30
Steven Liu
c934720629
[docs] Model cards ( #11112 )
...
* initial
* update
* hunyuanvideo
* ltx
* fix
* wan
* gen guide
* feedback
* feedback
* pipeline-level quant config
* feedback
* ltx
2025-06-02 16:55:14 -07:00
VLT Media
d0ec6601df
Bug: Fixed Image 2 Image example ( #11619 )
...
Bug: Fixed Image 2 Image example where a PIL.Image was improperly being asked for an item via index.
2025-05-30 11:30:52 +05:30
Steven Liu
be2fb77dc1
[docs] PyTorch 2.0 ( #11618 )
...
* combine
* Update docs/source/en/optimization/fp16.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-05-28 09:42:41 -07:00
Linoy Tsaban
28ef0165b9
[Sana Sprint] add image-to-image pipeline ( #11602 )
...
* sana sprint img2img
* fix import
* fix name
* fix image encoding
* fix image encoding
* fix image encoding
* fix image encoding
* fix image encoding
* fix image encoding
* try w/o strength
* try scaling differently
* try with strength
* revert unnecessary changes to scheduler
* revert unnecessary changes to scheduler
* Apply style fixes
* remove comment
* add copy statements
* add copy statements
* add to doc
* add to doc
* add to doc
* add to doc
* Apply style fixes
* empty commit
* fix copies
* fix copies
* fix copies
* fix copies
* fix copies
* docs
* make fix-copies.
* fix doc building error.
* initial commit - add img2img test
* initial commit - add img2img test
* fix import
* fix imports
* Apply style fixes
* empty commit
* remove
* empty commit
* test vocab size
* fix
* fix prompt missing from last commits
* small changes
* fix image processing when input is tensor
* fix order
* Apply style fixes
* empty commit
* fix shape
* remove comment
* image processing
* remove comment
* skip vae tiling test for now
* Apply style fixes
* empty commit
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: sayakpaul <spsayakpaul@gmail.com >
2025-05-27 22:09:51 +03:00
Aryan
05c8b42b75
LTX 0.9.7-distilled; documentation improvements ( #11571 )
...
* add guidance rescale
* update docs
* support adaptive instance norm filter
* fix custom timesteps support
* add custom timestep example to docs
* add a note about best generation settings being available only in the original repository
* use original org hub ids instead of personal
* make fix-copies
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-05-20 02:29:16 +05:30
Quentin Gallouédec
c8bb1ff53e
Use HF Papers ( #11567 )
...
* Use HF Papers
* Apply style fixes
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-05-19 06:22:33 -10:00
Aryan
06fee551e9
LTX Video 0.9.7 ( #11516 )
...
* add upsampling pipeline
* ltx upsample pipeline conversion; pipeline fixes
* make fix-copies
* remove print
* add vae convenience methods
* update
* add tests
* support denoising strength for upscaling & video-to-video
* update docs
* update doc checkpoints
* update docs
* fix
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-05-13 14:57:03 +05:30
Zhong-Yu Li
4f438de35a
Add VisualCloze ( #11377 )
...
* VisualCloze
* style quality
* add docs
* add docs
* typo
* Update docs/source/en/api/pipelines/visualcloze.md
* delete einops
* style quality
* Update src/diffusers/pipelines/visualcloze/pipeline_visualcloze.py
* reorg
* refine doc
* style quality
* typo
* typo
* Update src/diffusers/image_processor.py
* add comment
* test
* style
* Modified based on review
* style
* restore image_processor
* update example url
* style
* fix-copies
* VisualClozeGenerationPipeline
* combine
* tests docs
* remove VisualClozeUpsamplingPipeline
* style
* quality
* test examples
* quality style
* typo
* make fix-copies
* fix test_callback_cfg and test_save_load_dduf in VisualClozePipelineFastTests
* add EXAMPLE_DOC_STRING to VisualClozeGenerationPipeline
* delete maybe_free_model_hooks from pipeline_visualcloze_combined
* Apply suggestions from code review
* fix test_save_load_local test; add reason for skipping cfg test
* more save_load test fixes
* fix tests in generation pipeline tests
2025-05-13 02:46:51 +05:30
Aryan
e48f6aeeb4
Hunyuan Video Framepack F1 ( #11534 )
...
* support framepack f1
* update docs
* update toctree
* remove typo
2025-05-12 16:11:10 +05:30
Aryan
7b904941bc
Cosmos ( #10660 )
...
* begin transformer conversion
* refactor
* refactor
* refactor
* refactor
* refactor
* refactor
* update
* add conversion script
* add pipeline
* make fix-copies
* remove einops
* update docs
* gradient checkpointing
* add transformer test
* update
* debug
* remove prints
* match sigmas
* add vae pt. 1
* finish CV* vae
* update
* update
* update
* update
* update
* update
* make fix-copies
* update
* make fix-copies
* fix
* update
* update
* make fix-copies
* update
* update tests
* handle device and dtype for safety checker; required in latest diffusers
* remove enable_gqa and use repeat_interleave instead
* enforce safety checker; use dummy checker in fast tests
* add review suggestion for ONNX export
Co-Authored-By: Asfiya Baig <asfiyab@nvidia.com >
* fix safety_checker issues when not passed explicitly
We could either do what's done in this commit, or update the Cosmos examples to explicitly pass the safety checker
* use cosmos guardrail package
* auto format docs
* update conversion script to support 14B models
* update name CosmosPipeline -> CosmosTextToWorldPipeline
* update docs
* fix docs
* fix group offload test failing for vae
---------
Co-authored-by: Asfiya Baig <asfiyab@nvidia.com >
2025-05-07 20:59:09 +05:30
Aryan
d7ffe60166
Hunyuan Video Framepack ( #11428 )
...
* add transformer
* add pipeline
* fixes
* make fix-copies
* update
* add flux mu shift
* update example snippet
* debug
* cleanup
* batch_size=1 optimization
* add pipeline test
* fix for model cpu offloading
* add last_image support; credits: https://github.com/lllyasviel/FramePack/pull/167
* update example with flf2v
* update penguin url
* fix test
* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071032371
* address review comment: https://github.com/huggingface/diffusers/pull/11428#discussion_r2071087689
* Update src/diffusers/pipelines/hunyuan_video/pipeline_hunyuan_video_framepack.py
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-05-06 14:59:38 +05:30
co63oc
86294d3c7f
Fix typos in docs and comments ( #11416 )
...
* Fix typos in docs and comments
* Apply style fixes
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-04-30 20:30:53 -10:00
co63oc
f00a995753
Fix typos in strings and comments ( #11407 )
2025-04-24 08:53:47 -10:00
Emiliano
7986834572
Fix Flux IP adapter argument in the pipeline example ( #11402 )
...
Fix Flux IP adapter argument in the example
IP-Adapter example had a wrong argument. Fix `true_cfg` -> `true_cfg_scale`
2025-04-24 08:41:12 -10:00
YiYi Xu
0021bfa1e1
support Wan-FLF2V ( #11353 )
...
* update transformer
---------
Co-authored-by: Aryan <aryan@huggingface.co >
2025-04-18 10:27:50 -10:00
Sayak Paul
b00a564dac
[docs] add note about use_duck_shape in auraflow docs. ( #11348 )
...
add note about use_duck_shape in auraflow docs.
2025-04-17 10:25:39 +05:30
Sayak Paul
ce1063acfa
[docs] add a snippet for compilation in the auraflow docs. ( #11327 )
...
* add a snippet for compilation in the auraflow docs.
* include speedups.
2025-04-16 11:12:09 +05:30
Ishan Modi
f1f38ffbee
[ControlNet] Adds controlnet for SanaTransformer ( #11040 )
...
* added controlnet for sana transformer
* improve code quality
* addressed PR comments
* bug fixes
* added test cases
* update
* added dummy objects
* addressed PR comments
* update
* Forcing update
* add to docs
* code quality
* addressed PR comments
* addressed PR comments
* update
* addressed PR comments
* added proper styling
* update
* Revert "added proper styling"
This reverts commit 344ee8a701 .
* manually ordered
* Apply suggestions from code review
---------
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
2025-04-13 19:19:39 +05:30
hlky
0ef29355c9
HiDream Image ( #11231 )
...
* HiDream Image
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2025-04-11 06:31:34 -10:00
Sayak Paul
b924251dd8
minor update to sana sprint docs. ( #11236 )
2025-04-09 08:17:45 +05:30
Steven Liu
fc7a867ae5
[docs] MPS update ( #11212 )
...
mps
2025-04-07 14:32:27 -10:00
Tolga Cangöz
13e48492f0
[LTX0.9.5] Refactor LTXConditionPipeline for text-only conditioning ( #11174 )
...
* Refactor `LTXConditionPipeline` to add text-only conditioning
* style
* up
* Refactor `LTXConditionPipeline` to streamline condition handling and improve clarity
* Improve condition checks
* Simplify latents handling based on conditioning type
* Refactor rope_interpolation_scale preparation for clarity and efficiency
* Update LTXConditionPipeline docstring to clarify supported input types
* Add LTX Video 0.9.5 model to documentation
* Clarify documentation to indicate support for text-only conditioning without passing `conditions`
* refactor: comment out unused parameters in LTXConditionPipeline
* fix: restore previously commented parameters in LTXConditionPipeline
* fix: remove unused parameters from LTXConditionPipeline
* refactor: remove unnecessary lines in LTXConditionPipeline
2025-04-04 16:43:15 +02:00
Dhruv Nair
df1d7b01f1
[WIP] Add Wan Video2Video ( #11053 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
2025-04-01 17:22:11 +05:30
Dhruv Nair
617c208bb4
[Docs] Update Wan Docs with memory optimizations ( #11089 )
...
* update
* update
2025-03-28 19:05:56 +05:30
Aryan
8907a70a36
New HunyuanVideo-I2V ( #11066 )
...
* update
* update
* update
* add tests
* update docs
* raise value error
* warning for true cfg and guidance scale
* fix test
2025-03-24 21:18:40 +05:30
YiYi Xu
8a63aa5e4f
add sana-sprint ( #11074 )
...
* add sana-sprint
---------
Co-authored-by: Junsong Chen <cjs1020440147@icloud.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2025-03-21 06:21:18 -10:00
Aryan
2e83cbbb6d
LTX 0.9.5 ( #10968 )
...
* update
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: hlky <hlky@hlky.ac >
2025-03-17 16:43:36 -10:00
hlky
5551506b29
Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline ( #10827 )
...
* Rename Lumina(2)Text2ImgPipeline -> Lumina(2)Pipeline
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-03-13 09:24:21 -10:00
Sayak Paul
e4b056fe65
[LoRA] support wan i2v loras from the world. ( #11025 )
...
* support wan i2v loras from the world.
* remove copied from.
* updates
* add lora.
2025-03-11 20:43:29 +05:30
Dhruv Nair
1357931d74
[Single File] Add single file support for Wan T2V/I2V ( #10991 )
...
* update
* update
* update
* update
* update
* update
* update
2025-03-07 22:13:25 +05:30
Aryan
2e5203be04
Hunyuan I2V ( #10983 )
...
* update
* update
* update
* add tests
* update
* add model tests
* update docs
* update
* update example
* fix defaults
* update
2025-03-07 12:52:48 +05:30
Bubbliiiing
5e3b7d2d8a
Add EasyAnimateV5.1 text-to-video, image-to-video, control-to-video generation model ( #10626 )
...
* Update EasyAnimate V5.1
* Add docs && add tests && Fix comments problems in transformer3d and vae
* delete comments and remove useless import
* delete process
* Update EXAMPLE_DOC_STRING
* rename transformer file
* make fix-copies
* make style
* refactor pt. 1
* update toctree.yml
* add model tests
* Update layer_norm for norm_added_q and norm_added_k in Attention
* Fix processor problem
* refactor vae
* Fix problem in comments
* refactor tiling; remove einops dependency
* fix docs path
* make fix-copies
* Update src/diffusers/pipelines/easyanimate/pipeline_easyanimate_control.py
* update _toctree.yml
* fix test
* update
* update
* update
* make fix-copies
* fix tests
---------
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com >
2025-03-03 18:37:19 +05:30
YiYi Xu
2d8a41cae8
[Alibaba Wan Team] continue on #10921 Wan2.1 ( #10922 )
...
* Add wanx pipeline, model and example
* wanx_merged_v1
* change WanX into Wan
* fix i2v fp32 oom error
Link: https://code.alibaba-inc.com/open_wanx2/diffusers/codereview/20607813
* support t2v load fp32 ckpt
* add example
* final merge v1
* Update autoencoder_kl_wan.py
* up
* update middle, test up_block
* up up
* one less nn.sequential
* up more
* up
* more
* [refactor] [wip] Wan transformer/pipeline (#10926)
* update
* update
* refactor rope
* refactor pipeline
* make fix-copies
* add transformer test
* update
* update
* make style
* update tests
* tests
* conversion script
* conversion script
* update
* docs
* remove unused code
* fix _toctree.yml
* update dtype
* fix test
* fix tests: scale
* up
* more
* Apply suggestions from code review
* Apply suggestions from code review
* style
* Update scripts/convert_wan_to_diffusers.py
* update docs
* fix
---------
Co-authored-by: Yitong Huang <huangyitong.hyt@alibaba-inc.com >
Co-authored-by: 亚森 <wangjiayu.wjy@alibaba-inc.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2025-03-02 17:24:26 +05:30
Anton Obukhov
3fab6624fd
Marigold Update: v1-1 models, Intrinsic Image Decomposition pipeline, documentation ( #10884 )
...
* minor documentation fixes of the depth and normals pipelines
* update license headers
* update model checkpoints in examples
fix missing prediction_type in register_to_config in the normals pipeline
* add initial marigold intrinsics pipeline
update comments about num_inference_steps and ensemble_size
minor fixes in comments of marigold normals and depth pipelines
* update uncertainty visualization to work with intrinsics
* integrate iid
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-02-25 14:13:02 -10:00
Steven Liu
db21c97043
[docs] Flux group offload ( #10847 )
...
* flux group-offload
* feedback
2025-02-24 08:47:08 -08:00
Steven Liu
64dec70e56
[docs] LoRA support ( #10844 )
...
* lora
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-02-22 08:53:02 +05:30
SahilCarterr
85fcbaf314
[Fix] Docs overview.md ( #10858 )
...
Fix docs
2025-02-21 08:03:22 -08:00
Aryan
e3bc4aab2e
SkyReels Hunyuan T2V & I2V ( #10837 )
...
* update
* make fix-copies
* update
* tests
* update
* update
* add co-author
Co-Authored-By: Langdx <82783347+Langdx@users.noreply.github.com >
* add co-author
Co-Authored-By: howe <howezhang2018@gmail.com >
* update
---------
Co-authored-by: Langdx <82783347+Langdx@users.noreply.github.com >
Co-authored-by: howe <howezhang2018@gmail.com >
2025-02-21 06:48:15 +05:30