Steven Liu
b4e6dc3037
[docs] Fix broken links (#12487)
...
fix broken links
2025-10-15 06:42:10 +05:30
Steven Liu
cc5b31ffc9
[docs] Migrate syntax (#12390)
...
* change syntax
* make style
2025-09-30 10:11:19 -07:00
Steven Liu
5237a82a35
[docs] Remove Flax (#12244)
...
* remove flax
* toctree
* feedback
2025-08-27 11:11:07 -07:00
Steven Liu
b60faf456b
[docs] Pipeline callbacks (#12212)
...
* init
* review
2025-08-22 13:01:24 -07:00
galbria
7993be9e7f
Bria 3 2 pipeline (#12010)
...
* Add Bria model and pipeline to diffusers
- Introduced `BriaTransformer2DModel` and `BriaPipeline` for enhanced image generation capabilities.
- Updated import structures across various modules to include the new Bria components.
- Added utility functions and output classes specific to the Bria pipeline.
- Implemented tests for the Bria pipeline to ensure functionality and output integrity.
* with working tests
* style and quality pass
* adding docs
* add to overview
* fixes from "make fix-copies"
* Refactor transformer_bria.py and pipeline_bria.py: Introduce new EmbedND class for rotary position embedding, and enhance Timestep and TimestepProjEmbeddings classes. Add utility functions for handling negative prompts and generating original sigmas in pipeline_bria.py.
* remove redundant and duplicate tests and fix bf16 slow test
* style fixes
* small doc update
* Enhance Bria 3.2 documentation and implementation
- Updated the GitHub repository link for Bria 3.2.
- Added usage instructions for the gated model access.
- Introduced the BriaTransformerBlock and BriaAttention classes to the model architecture.
- Refactored existing classes to integrate Bria-specific components, including BriaEmbedND and BriaPipeline.
- Updated the pipeline output class to reflect Bria-specific functionality.
- Adjusted test cases to align with the new Bria model structure.
* Refactor Bria model components and update documentation
- Removed outdated inference example from Bria 3.2 documentation.
- Introduced the BriaTransformerBlock class to enhance model architecture.
- Updated attention handling to use `attention_kwargs` instead of `joint_attention_kwargs`.
- Improved import structure in the Bria pipeline to handle optional dependencies.
- Adjusted test cases to reflect changes in model dtype assertions.
* Update Bria model reference in documentation to reflect new file naming convention
* Update docs/source/en/_toctree.yml
* Refactor BriaPipeline to inherit from DiffusionPipeline instead of FluxPipeline, updating imports accordingly.
* move the __call__ func to the end of file
* Update BriaPipeline example to use bfloat16 for precision sensitivity for better result
* make style && make quality && make fix-copies
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>
Co-authored-by: Aryan <contact.aryanvs@gmail.com>
2025-08-20 14:57:39 +05:30
Aryan
a4df8dbc40
Update more licenses to 2025 (#11746)
...
update
2025-06-19 07:46:01 +05:30
Zhong-Yu Li
4f438de35a
Add VisualCloze (#11377)
...
* VisualCloze
* style quality
* add docs
* add docs
* typo
* Update docs/source/en/api/pipelines/visualcloze.md
* delete einops
* style quality
* Update src/diffusers/pipelines/visualcloze/pipeline_visualcloze.py
* reorg
* refine doc
* style quality
* typo
* typo
* Update src/diffusers/image_processor.py
* add comment
* test
* style
* Modified based on review
* style
* restore image_processor
* update example url
* style
* fix-copies
* VisualClozeGenerationPipeline
* combine
* tests docs
* remove VisualClozeUpsamplingPipeline
* style
* quality
* test examples
* quality style
* typo
* make fix-copies
* fix test_callback_cfg and test_save_load_dduf in VisualClozePipelineFastTests
* add EXAMPLE_DOC_STRING to VisualClozeGenerationPipeline
* delete maybe_free_model_hooks from pipeline_visualcloze_combined
* Apply suggestions from code review
* fix test_save_load_local test; add reason for skipping cfg test
* more save_load test fixes
* fix tests in generation pipeline tests
2025-05-13 02:46:51 +05:30
Anton Obukhov
3fab6624fd
Marigold Update: v1-1 models, Intrinsic Image Decomposition pipeline, documentation (#10884)
...
* minor documentation fixes of the depth and normals pipelines
* update license headers
* update model checkpoints in examples
fix missing prediction_type in register_to_config in the normals pipeline
* add initial marigold intrinsics pipeline
update comments about num_inference_steps and ensemble_size
minor fixes in comments of marigold normals and depth pipelines
* update uncertainty visualization to work with intrinsics
* integrate iid
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2025-02-25 14:13:02 -10:00
SahilCarterr
85fcbaf314
[Fix] Docs overview.md (#10858)
...
Fix docs
2025-02-21 08:03:22 -08:00
Steven Liu
bbcf2a8589
[docs] Add pipelines to table (#9282)
...
update pipelines
2024-08-27 12:15:30 +05:30
Yoach Lacombe
69e72b1dd1
Stable Audio integration (#8716)
...
* WIP modeling code and pipeline
* add custom attention processor + custom activation + add to init
* correct ProjectionModel forward
* add stable audio to __init__
* add autoencoder and update pipeline and modeling code
* add half Rope
* add partial rotary v2
* add temporary modifs to scheduler
* add EDM DPM Solver
* remove TODOs
* clean GLU
* remove att.group_norm to attn processor
* revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py
* refactor GLU -> SwiGLU
* remove redundant args
* add channel multiples in autoencoder docstrings
* changes in docstrings and copyright headers
* clean pipeline
* further cleaning
* remove peft and lora and fromoriginalmodel
* Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace
* make style
* dummy models
* fix copied from
* add fast oobleck tests
* add brownian tree
* oobleck autoencoder slow tests
* remove TODO
* fast stable audio pipeline tests
* add slow tests
* make style
* add first version of docs
* wrap is_torchsde_available to the scheduler
* fix slow test
* test with input waveform
* add input waveform
* remove some todos
* create stableaudio gaussian projection + make style
* add pipeline to toctree
* fix copied from
* make quality
* refactor timestep_features->time_proj
* refactor joint_attention_kwargs->cross_attention_kwargs
* remove forward_chunk
* move StableAudioDitModel to transformers folder
* correct convert + remove partial rotary embed
* apply suggestions from yiyixuxu -> removing attn.kv_heads
* remove temb
* remove cross_attention_kwargs
* further removal of cross_attention_kwargs
* remove text encoder autocast to fp16
* continue removing autocast
* make style
* refactor how text and audio are embedded
* add paper
* update example code
* make style
* unify projection model forward + fix device placement
* make style
* remove fuse qkv
* apply suggestions from review
* Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py
Co-authored-by: YiYi Xu <yixu310@gmail.com>
* make style
* smaller models in fast tests
* pass sequential offloading fast tests
* add docs for vae and autoencoder
* make style and update example
* remove useless import
* add cosine scheduler
* dummy classes
* cosine scheduler docs
* better description of scheduler
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com>
2024-07-30 15:29:06 +05:30
Steven Liu
fa750a15bd
[docs] Refactor image quality docs (#7758)
...
* refactor
* code snippets
* fix path
* fix path in guide
* code outputs
* align toctree title
* title
* fix title
2024-04-25 16:55:35 -07:00
Manuel Brack
00eca4b887
[Pipeline] Add LEDITS++ pipelines (#6074)
...
* Setup LEdits++ file structure
* Fix import
* LEditsPP Stable Diffusion pipeline
* Include variable image aspect ratios
* Implement LEDITS++ for SDXL
* clean up LEditsPPPipelineStableDiffusion
* Adjust inversion output
* Added docu, more cleanup for LEditsPPPipelineStableDiffusion
* clean up LEditsPPPipelineStableDiffusionXL
* Update documentation
* Fix documentation import
* Add skeleton IF implementation
* Fix documentation typo
* Add LEDITS docu to toctree
* Add missing title
* Finalize SD documentation
* Finalize SD-XL documentation
* Fix code style and quality
* Fix typo
* Fix return types
* added LEditsPPPipelineIF; minor changes for LEditsPPPipelineStableDiffusion and LEditsPPPipelineStableDiffusionXL
* Fix copy reference
* add documentation for IF
* Add first tests
* Fix batching for SD-XL
* Fix text encoding and perfect reconstruction for SD-XL
* Add tests for SD-XL, minor changes
* move user_mask to correct device, use cross_attention_kwargs also for inversion
* Example docstring
* Fix attention resolution for non-square images
* Refactoring for PR review
* Safely remove ledits_utils.py
* Style fixes
* Replace assertions with ValueError
* Remove LEditsPPPipelineIF
* Remove unnecessary input checks
* Refactoring of CrossAttnProcessor
* Revert unnecessary changes to scheduler
* Remove first progress-bar in inversion
* Refactor scheduler usage and reset
* Use imageprocessor instead of custom logic
* Fix scheduler init warning
* Fix error when running the pipeline in fp16
* Update documentation wrt perfect inversion
* Update tests
* Fix code quality and copy consistency
* Update LEditsPP import
* Remove enable/disable methods that are now in StableDiffusionMixin
* Change import in docs
* Revert import structure change
* Fix ledits imports
---------
Co-authored-by: Katharina Kornmeier <katharina.kornmeier@stud.tu-darmstadt.de>
2024-03-13 12:43:47 +02:00
Sayak Paul
30e5e81d58
change to 2024 in the license (#6902)
...
change to 2024
2024-02-08 08:19:31 -10:00
UmerHA
e192ae08d3
Add ControlNet-XS support (#5827)
...
* Check in 23-10-05
* check-in 23-10-06
* check-in 23-10-07 2pm
* check-in 23-10-08
* check-in 231009T1200
* check-in 230109
* checkin 231010
* init + forward run
* checkin
* checkin
* ControlNetXSModel is now saveable+loadable
* Forward works
* checkin
* Pipeline works with `no_control=True`
* checkin
* debug: save intermediate outputs of resnet
* checkin
* Understood time error + fixed connection error
* checkin
* checkin 231106T1600
* turned off detailed debug prints
* time debug logs
* small fix
* Separated control_scale for connections/time
* simplified debug logging
* Full denoising works with control scale = 0
* aligned logs
* Added control_attention_head_dim param
* Passing n_heads instead of dim_head into ctrl unet
* Fixed ctrl midblock bug
* Cleanup
* Fixed time dtype bug
* checkin
* 1. from_unet, 2. base passed, 3. all unet params
* checkin
* Finished docstrings
* cleanup
* make style
* checkin
* more tests pass
* Fixed tests
* removed debug logs
* make style + quality
* make fix-copies
* fixed documentation
* added cnxs to doc toc
* added control start/end param
* Update controlnetxs_sdxl.md
* tried to fix copies..
* Fixed norm_num_groups in from_unet
* added sdxl-depth test
* created SD2.1 controlnet-xs pipeline
* re-added debug logs
* Adjusting group norm ; readded logs
* Added debug log statements
* removed debug logs ; started tests for sd2.1
* updated sd21 tests
* fixed tests
* fixed tests
* slightly increased error tolerance for 1 test
* make style & quality
* Added docs for CNXS-SD
* make fix-copies
* Fixed sd compile test ; fixed gradient ckpointing
* vae downs = cnxs conditioning downs; removed guess
* make style & quality
* Fixed tests
* fixed test
* Incorporated review feedback
* simplified control model surgery
* fixed tests & make style / quality
* Updated docs; deleted pip & cursor files
* Rolled back minimal change to resnet
* Update resnet.py
* Update resnet.py
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
* Incorporated review feedback
* Update docs/source/en/api/pipelines/controlnetxs_sdxl.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/models/controlnetxs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/controlnetxs.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update src/diffusers/pipelines/controlnet_xs/pipeline_controlnet_xs_sd_xl.py
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Incorporated doc feedback
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
2023-12-06 23:33:47 +01:00
Steven Liu
87a09d66f3
[docs] SDXL Turbo (#6065)
...
api docs
2023-12-06 14:33:14 -08:00
Steven Liu
172acc98b9
[docs] Update pipeline list (#5952)
...
add to list
2023-11-29 14:08:39 +01:00
estelleafl
5ae3c3a56b
[ldm3d] Ldm3d upscaler to community pipeline (#5870)
...
---------
Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
Co-authored-by: YiYi Xu <yixu310@gmail.com>
2023-11-28 09:00:39 -10:00
M. Tolga Cangöz
8092017d3f
[Docs] Fix typos and update files at API's Pipelines page 1 (#5744)
...
* Fix typos, update, add Copyright info, and trim trailing whitespace
* Update alt_diffusion.md
* Remove nonoperational demo
* Update docs/source/en/api/pipelines/consistency_models.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
* Update docs/source/en/api/pipelines/latent_consistency_models.md
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-11-14 10:36:20 -08:00
Steven Liu
7c3a75a1ce
[docs] General updates (#5378)
...
* first draft
* feedback
* feedback
2023-10-24 11:51:55 -07:00
Sayak Paul
a7508a76f0
add: pushtohubmixin to pipelines and schedulers docs overview. (#4607)
...
* add: pushtohubmixin to pipelines and schedulers docs overview.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>
2023-08-15 22:23:17 +05:30
camenduru
c6ae9b7df6
Where did this 'x' come from, Elon? (#4277)
...
* why mdx?
* why mdx?
* why mdx?
* no x for kandinsky either
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
2023-07-26 18:18:14 +02:00