Álvaro Somoza
87b9db644b
[Core] Add Kolors ( #8812 )
...
* initial draft
2024-07-11 06:09:17 -10:00
Xin Ma
b8cf84a3f9
Latte: Latent Diffusion Transformer for Video Generation ( #8404 )
...
* add Latte to diffusers
* remove print
* remove print
* remove print
* remove unuse codes
* remove layer_norm_latte and add a flag
* remove layer_norm_latte and add a flag
* update latte_pipeline
* update latte_pipeline
* remove unuse squeeze
* add norm_hidden_states.ndim == 2: # for Latte
* fixed test latte pipeline bugs
* fixed test latte pipeline bugs
* delete sh
* add doc for latte
* add licensing
* Move Transformer3DModelOutput to modeling_outputs
* give a default value to sample_size
* remove the einops dependency
* change norm2 for latte
* modify pipeline of latte
* update test for Latte
* modify some codes for latte
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* modify for Latte pipeline
* video_length -> num_frames; update prepare_latents copied from
* make fix-copies
* make style
* typo: videe -> video
* update
* modify for Latte pipeline
* modify latte pipeline
* modify latte pipeline
* modify latte pipeline
* modify latte pipeline
* modify for Latte pipeline
* Delete .vscode directory
* make style
* make fix-copies
* add latte transformer 3d to docs _toctree.yml
* update example
* reduce frames for test
* fixed bug of _text_preprocessing
* set num frame to 1 for testing
* remove unuse print
* add text = self._clean_caption(text) again
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
Co-authored-by: Aryan <aryan@huggingface.co >
2024-07-11 15:06:22 +05:30
PommesPeter
98388670d2
[Alpha-VLLM Team] Add Lumina-T2X to diffusers ( #8652 )
...
---------
Co-authored-by: zhuole1025 <zhuole1025@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-07-07 17:12:09 -10:00
Sayak Paul
984d340534
Revert "[LoRA] introduce LoraBaseMixin to promote reusability." ( #8773 )
...
Revert "[LoRA] introduce `LoraBaseMixin` to promote reusability. (#8670 )"
This reverts commit a2071a1837 .
2024-07-03 07:05:01 +05:30
Sayak Paul
a2071a1837
[LoRA] introduce LoraBaseMixin to promote reusability. ( #8670 )
...
* introduce to promote reusability.
* up
* add more tests
* up
* remove comments.
* fix fuse_nan test
* clarify the scope of fuse_lora and unfuse_lora
* remove space
2024-07-03 07:04:37 +05:30
Jiwook Han
dd4b731e68
Reflect few contributions on philosophy.md that were not reflected on #8294 ( #8690 )
...
* Update philosophy.md
Some contributions were not reflected previously, so I am resubmitting them.
* Update docs/source/ko/conceptual/philosophy.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/ko/conceptual/philosophy.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-07-02 08:43:56 -07:00
Dhruv Nair
31b211bfe3
Fix mistake in Single File Docs page ( #8765 )
...
update
2024-07-02 12:45:49 +05:30
Dhruv Nair
0368483b61
Remove legacy single file model loading mixins ( #8754 )
...
update
2024-07-01 07:20:19 -10:00
YiYi Xu
ddb9d8548c
[doc] add a tip about using SDXL refiner with hunyuan-dit and pixart ( #8735 )
...
* up
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-07-01 06:30:09 -10:00
Shauray Singh
8690e8b9d6
add PAG support for SD architecture ( #8725 )
...
* add pag to sd pipelines
2024-06-29 09:26:11 -10:00
Sayak Paul
10b4e354b6
[Chore] remove deprecation from transformer2d regarding the output class. ( #8698 )
...
* remove deprecation from transformer2d regarding the output class.
* up
* deprecate more
2024-06-26 07:35:36 -10:00
XCL
fa2abfdb03
[Tencent Hunyuan Team] Add Hunyuan-DiT ControlNet Inference ( #8694 )
...
* add controlnet support
---------
Co-authored-by: xingchaoliu <xingchaoliu@tencent.com >
Co-authored-by: yiyixuxu <yixu310@gmail,com>
2024-06-26 00:43:03 -10:00
YiYi Xu
1d3ef67b09
[doc] add more about from_pipe API for PAG doc ( #8701 )
...
* add more about from_pipe API
* Update docs/source/en/using-diffusers/pag.md
* Update docs/source/en/using-diffusers/pag.md
---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
2024-06-25 22:26:12 -10:00
Sayak Paul
e8284281c1
add docs on model sharding ( #8658 )
...
* add docs on model sharding
* add entry to _toctree.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* simplify wording
* add a note on transformer library handling
* move device placement section
* Update docs/source/en/training/distributed_inference.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-06-26 07:35:11 +05:30
Álvaro Somoza
14d224d4e6
[Docs] SD3 T5 Token limit doc ( #8654 )
...
* doc for max_sequence_length
* better position and changed note to tip
* apply suggestions
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-25 14:41:27 -04:00
YiYi Xu
540399f540
add PAG support ( #7944 )
...
* first draft
---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Junhwa Song <ethan9867@gmail.com >
Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-06-25 08:40:02 -10:00
Steven Liu
675be88f00
[docs] Add note for float8 ( #8685 )
...
add note
2024-06-24 10:13:34 -07:00
Sayak Paul
bc90c28bc9
[Docs] add note on caching in fast diffusion ( #8675 )
...
* add note on caching in fast diffusion
* formatting
* Update docs/source/en/tutorials/fast_diffusion.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-06-24 10:10:45 -07:00
Tolga Cangöz
f040c27d4c
Errata - Fix typos and improve style ( #8571 )
...
* Fix typos
* Fix typos & up style
* chore: Update numbers
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-24 10:07:22 -07:00
Tolga Cangöz
138fac703a
Discourage using deprecated revision parameter ( #8573 )
...
* Discourage using `revision`
* `make style && make quality`
* Refactor code to use 'variant' instead of 'revision'
* `revision="bf16"` -> `variant="bf16"`
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-24 10:06:49 -07:00
Tolga Cangöz
468ae09ed8
Errata - Trim trailing white space in the whole repo ( #8575 )
...
* Trim all the trailing white space in the whole repo
* Remove unnecessary empty places
* make style && make quality
* Trim trailing white space
* trim
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-24 18:39:15 +05:30
Tolga Cangöz
c375903db5
Errata - Fix typos & improve contributing page ( #8572 )
...
* Fix typos & improve contributing page
* `make style && make quality`
* fix typos
* Fix typo
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-24 14:13:03 +05:30
Sayak Paul
8eb17315c8
[LoRA] get rid of the legacy lora remnants and make our codebase lighter ( #8623 )
...
* get rid of the legacy lora remnants and make our codebase lighter
* fix depcrecated lora argument
* fix
* empty commit to trigger ci
* remove print
* empty
2024-06-21 16:36:05 +05:30
王奇勋
e5564d45bf
Support SD3 ControlNet and Multi-ControlNet. ( #8566 )
...
* sd3 controlnet
---------
Co-authored-by: haofanwang <haofanwang.ai@gmail.com >
2024-06-18 14:59:22 -10:00
Vasco Ramos
34fab8b511
[SD3 Docs] Corrected title about loading model with T5 "without" -> "with" ( #8602 )
...
[SD3 Docs] Corrected title about loading model with T5
Corrected the documentation title to "Loading the single file checkpoint with T5" Previously, it incorrectly stated "Loading the single file checkpoint without T5" which contradicted the code snippet showing how to load the SD3 checkpoint with the T5 model
2024-06-18 11:33:43 -10:00
Sayak Paul
cd3082008e
[Core] Add shift_factor to SD3 tiny autoencoder ( #8618 )
...
* shift factor argument to tiny
* remove shift factor rejigging from the sd3 docs
2024-06-18 18:28:02 +01:00
MaoXianXin
10d3220abe
A backslash is missing from the run command ( #8471 )
2024-06-18 16:44:34 +01:00
Álvaro Somoza
d2b10b1f4f
[SD3] TAESD3 docs ( #8607 )
...
* tased3 docs
* apply suggestion
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-06-18 15:56:38 +01:00
AmosDinh
eeb70033a6
Syntax error in readme example "pipe" -> "pipeline" ( #8601 )
...
Update controlnet.md
Syntax error pipe -> pipeline
2024-06-17 11:02:07 -07:00
Dhruv Nair
b1a2c0d577
Expand Single File support in SD3 Pipeline ( #8517 )
...
* update
* update
2024-06-13 18:29:19 +05:30
Radamés Ajna
95e0c3757d
Fix small typo ( #8498 )
2024-06-12 15:30:58 -07:00
Dhruv Nair
04717fd861
Add Stable Diffusion 3 ( #8483 )
...
* up
* add sd3
* update
* update
* add tests
* fix copies
* fix docs
* update
* add dreambooth lora
* add LoRA
* update
* update
* update
* update
* import fix
* update
* Update src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* import fix 2
* update
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* update
* update
* update
* fix ckpt id
* fix more ids
* update
* missing doc
* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update src/diffusers/schedulers/scheduling_flow_match_euler_discrete.py
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Update docs/source/en/api/pipelines/stable_diffusion/stable_diffusion_3.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* update'
* fix
* update
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
* Update src/diffusers/models/autoencoders/autoencoder_kl.py
* note on gated access.
* requirements
* licensing
---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2024-06-12 20:44:00 +01:00
Jiwook Han
6fd458e99d
🌐 [i18n-KO] Translated conceptual/philosophy.md and 3 other documents to Korean ( #8294 )
...
* translation about 3 documents into Korean
* evaluation doc korean translation
* _toctree.yml modify
* doc title fix : philosopy->philosophy
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/ethical_guidelines.md
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update docs/source/ko/conceptual/evaluation.md
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
* Update philosophy.md (from jungnerd)
---------
Co-authored-by: Jihun Lim <31366038+heuristicwave@users.noreply.github.com >
Co-authored-by: Chulhwa (Evan) Han <cjfghk5697@ajou.ac.kr >
2024-06-12 09:40:37 -07:00
Tolga Cangöz
ec1aded12e
Optimize test files by fixing CPU-offloading usage ( #8409 )
...
* Refactor code to remove unnecessary calls to `to(torch_device)`
* Refactor code to remove unnecessary calls to `to("cuda")`
* Update pipeline_stable_diffusion_diffedit.py
2024-06-06 09:51:26 -10:00
Steven Liu
151a56b80e
[docs] Single file usage ( #8412 )
...
* single file usage
* edit
2024-06-06 12:40:34 -07:00
Sayak Paul
867a2b0cf9
[Hunyuan] add optimization related sections to the hunyuan dit docs. ( #8402 )
...
* optimizations to the hunyuan dit docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Update docs/source/en/api/pipelines/hunyuandit.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-06-06 05:41:38 +05:30
Tolga Cangöz
98730c5dd7
Errata ( #8322 )
...
* Fix typos
* Trim trailing whitespaces
* Remove a trailing whitespace
* chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0
* Revert "chore: Update MarigoldDepthPipeline checkpoint to prs-eth/marigold-lcm-v1-0"
This reverts commit fd742b30b4 .
* pokemon -> naruto
* `DPMSolverMultistep` -> `DPMSolverMultistepScheduler`
* Improve Markdown stylization
* Improve style
* Improve style
* Refactor pipeline variable names for consistency
* up style
2024-06-05 13:59:09 -07:00
Sayak Paul
2f6f426f66
[Hunyuan] allow Hunyuan DiT to run under 6GB for GPU VRAM ( #8399 )
...
* allow hunyuan dit to run under 6GB for GPU VRAM
* add section in the docs/
2024-06-05 08:24:19 +04:00
Sayak Paul
3ff39e8e86
[HunyuanDiT] minor docs changes in hunyuandit ( #8395 )
...
minor docs changes in hunyuandit
2024-06-04 12:18:53 +04:00
Marçal Comajoan Cara
dc89434bdc
Update transformer2d.md title ( #8375 )
...
* Update transformer2d.md title
For the other classes (e.g., UNet2DModel) the title of the documentation coincides with the name of the class, but that was not the case for Transformer2DModel.
* Update model docs titles for consistency with class names
2024-06-03 17:01:21 -07:00
XCL
174cf868ea
Tencent Hunyuan Team - Updated Doc for HunyuanDiT ( #8383 )
...
* add hunyuandit doc
* update hunyuandit doc
* update hunyuandit 2d model
* update toctree.yml for hunyuandit
2024-06-03 14:02:46 +04:00
Anton Obukhov
86555c9f59
Fix marigold documentation ( #8372 )
...
* rename prs-eth/marigold-lcm-v1-0 into prs-eth/marigold-depth-lcm-v1-0
* update image paths in https://huggingface.co/datasets/huggingface/documentation-images to use main branch
* fix relative paths to other diffusers pages
* Update docs/source/en/using-diffusers/marigold_usage.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2024-05-31 12:10:05 -10:00
Sayak Paul
983dec3bf7
[Core] Introduce class variants for Transformer2DModel ( #7647 )
...
* init for patches
* finish patched model.
* continuous transformer
* vectorized transformer2d.
* style.
* inits.
* fix-copies.
* introduce DiTTransformer2DModel.
* fixes
* use REMAPPING as suggested by @DN6
* better logging.
* add pixart transformer model.
* inits.
* caption_channels.
* attention masking.
* fix use_additional_conditions.
* remove print.
* debug
* flatten
* fix: assertion for sigma
* handle remapping for modeling_utils
* add tests for dit transformer2d
* quality
* placeholder for pixart tests
* pixart tests
* add _no_split_modules
* add docs.
* check
* check
* check
* check
* fix tests
* fix tests
* move Transformer output to modeling_output
* move errors better and bring back use_additional_conditions attribute.
* add unnecessary things from DiT.
* clean up pixart
* fix remapping
* fix device_map things in pixart2d.
* replace Transformer2DModel with appropriate classes in dit, pixart tests
* empty
* legacy mixin classes./
* use a remapping dict for fetching class names.
* change to specifc model types in the pipeline implementations.
* move _fetch_remapped_cls_from_config to modeling_loading_utils.py
* fix dependency problems.
* add deprecation note.
2024-05-31 13:40:27 +05:30
Steven Liu
9e00b727ad
[docs] Files and formats ( #7874 )
...
* files and formats
* fix callout
* feedback
* code sample
* feedback
2024-05-29 09:31:32 -07:00
Steven Liu
f7a4626f4b
[docs] DeepFloyd training ( #8224 )
...
deepfloyd training
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2024-05-29 09:27:37 -07:00
Sayak Paul
5edd0b34fa
move vqmodel to models.autoencoders. ( #8292 )
...
move vqmodel to models.autoencoders.
2024-05-29 06:30:35 +05:30
Steven Liu
1fa8dbc63a
[docs] Outpaint ( #7964 )
...
* first draft
* edits
2024-05-28 14:42:03 -07:00
Steven Liu
0ab6dc0f23
[docs] Scheduler features ( #7990 )
...
* noise schedule
* sigmas and zero snr
* feedback
* feedback
2024-05-28 14:41:22 -07:00
Jiwook Han
80cfaebaa1
Fix typo in philosophy.md ( #8303 )
...
fix typo in philosophy.md
2024-05-28 10:38:48 -07:00
Álvaro Somoza
ba82414106
[docs] Add controlnet example to marigold ( #8289 )
...
* initial doc
* fix wrong LCM sentence
* implement binary colormap without requiring matplotlib
update section about Marigold for ControlNet
update formatting of marigold_usage.md
* fix indentation
---------
Co-authored-by: anton <anton.obukhov@gmail.com >
2024-05-28 11:58:06 -04:00