Will Berman
2fd46405cd
consistency decoder ( #5694 )
...
* consistency decoder
* rename
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Update src/diffusers/pipelines/consistency_models/pipeline_consistency_models.py
* uP
* Apply suggestions from code review
* uP
* uP
* uP
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-11-09 12:21:41 +01:00
Sayak Paul
d61889fc17
[Feat] PixArt-Alpha ( #5642 )
...
* init pixart alpha pipeline
* fix: import
* script
* script
* script
* add: vae to the pipeline
* add: vae_scale_factor
* add: checkpoint_path
* clean conversion script a bit.
* size embeddings.
* fix: size embedding
* update scrip
* support for interpolation of position embedding.
* support for conditioning.
* ..
* ..
* ..
* final layer
* final layer
* align if encode_prompt
* support for caption embedding
* refactor
* refactor
* refactor
* start cross attention
* start cross attention
* cross_attention_dim
* cross
* cross
* support for resolution and aspect_ratio
* support for caption projection
* refactor patch embeddings
* batch_size
* up
* commit
* commit
* commit.
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze
* squeeze.
* squeeze.
* fix final block./
* fix final block./
* fix final block./
* clean
* fix: interpolation scale.
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging'
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* make --checkpoint_path non-required.
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* remove num_tokens
* timesteps -> timestep
* timesteps -> timestep
* timesteps -> timestep
* timesteps -> timestep
* timesteps -> timestep
* timesteps -> timestep
* debug
* debug
* update conversion script.
* update conversion script.
* update conversion script.
* debug
* debug
* debug
* clean
* debug
* debug
* debug
* debug
* debug
* debug
* debug
* debug
* deug
* debug
* debug
* debug
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* fix
* clean
* fix
* fix
* boom
* boom
* some changes
* boom
* save
* up
* remove i
* fix more tests
* DPMSolverMultistepScheduler
* fix
* offloading
* fix conversion script
* fix conversion script
* remove print
* remove support for negative prompt embeds.
* typo.
* remove extra kwargs
* bring conversion script to where it was
* fix
* trying mu luck
* trying my luck again
* again
* again
* again
* clean up
* up
* up
* update example
* support for 512
* remove spacing
* finalize docs.
* test debug
* fix: assertion values.
* debug
* debug
* debug
* fix: repeat
* remove prints.
* Apply suggestions from code review
* Apply suggestions from code review
* Correct more
* Apply suggestions from code review
* Change all
* Clean more
* fix more
* Fix more
* Fix more
* Correct more
* address patrick's comments.
* remove unneeded args
* clean up pipeline.
* sty;e
* make the use of additional conditions better conditioned.
* None better
* dtype
* height and width validation
* add a note about size brackets.
* fix
* spit out slow test outputs.
* fix?
* fix optional test
* fix more
* remove unneeded comment
* debug
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-11-06 08:40:04 +01:00
Sayak Paul
60c5eb5877
[Easy] clean up the LCM docstrings. ( #5637 )
...
* clean up the LCM docstrings.
* clean up
* fix: examples
* Apply suggestions from code review
2023-11-03 12:14:48 +01:00
Sayak Paul
c84982a804
[Easy] Minor AnimateDiff Doc nits ( #5640 )
...
minor
2023-11-03 16:27:54 +05:30
Dhruv Nair
84e7bb875d
Update animatediff docs to include section on Motion LoRAs ( #5639 )
...
update animatediff docs
2023-11-03 15:53:59 +05:30
Patrick von Platen
072e00897a
[LCM] Make sure img2img works ( #5632 )
...
* [LCM] Clean up implementations
* Add all
* correct more
* correct more
* finish
* up
2023-11-02 19:50:47 +01:00
Dhruv Nair
2a8cf8e39f
Animatediff Proposal ( #5413 )
...
* draft design
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* clean up
* update pipeline
* clean up
* clean up
* clean up
* add tests
* change motion block
* clean up
* clean up
* clean up
* update
* update
* update
* update
* update
* update
* update
* update
* clean up
* update
* update
* update model test
* update
* update
* update
* update
* make style
* update
* fix embeddings
* update
* merge upstream
* max fix copies
* fix bug
* fix mistake
* add docs
* update
* clean up
* update
* clean up
* clean up
* fix docstrings
* fix docstrings
* update
* update
* clean up
* update
2023-11-02 15:04:03 +01:00
Steven Liu
75ea54a151
[docs] Kandinsky guide ( #4555 )
...
* kandinsky 2.1 first draft
* add kandinsky 2.2
* fix identical section headers
* try hfoptions syntax
* add img2img
* add inpaint
* add interpolate
* fix tag
* more cleanups
* typo
* update hfoptions id
* align hfoptions tags
2023-11-01 15:36:22 -07:00
Steven Liu
d1eb14bc35
[docs] Lu lambdas ( #5602 )
...
lu lambdas
2023-11-01 11:47:11 -07:00
M. Tolga Cangöz
442017ccc8
[Docs] Fix typos ( #5583 )
...
* Add Copyright info
* Fix typos, improve, update
* Update deepfloyd_if.md
* Update ldm3d_diffusion.md
* Update opt_overview.md
2023-10-31 10:04:08 -07:00
Steven Liu
595ba6f786
[docs] Internal classes API ( #5513 )
...
* internal classes api
* add internal class overview
* fix toctree
2023-10-27 09:48:41 -07:00
YiYi Xu
f912f39b50
correct checkpoint in kandinsky2.2 doc page ( #5550 )
...
update checkpoint
Co-authored-by: yiyixuxu <yixu310@gmail,com>
2023-10-27 08:49:15 +05:30
Chengxi Guo
dcbfe662ef
fix typo ( #5505 )
...
Signed-off-by: mymusise <mymusise1@gmail.com >
2023-10-24 17:14:05 -07:00
dg845
958e17dada
Add Latent Consistency Models Pipeline ( #5448 )
...
* initial commit for LatentConsistencyModelPipeline and LCMScheduler based on the community pipeline
* Add callback and freeu support.
* apply suggestions from review
* Clean up LCMScheduler
* Remove timeindex argument to LCMScheduler.step.
* Add support for clipping or thresholding the predicted original sample.
* Remove unused methods and arguments in LCMScheduler.
* Improve comment about (lack of) negative prompt support.
* Change input guidance_scale to match the StableDiffusionPipeline (Imagen) CFG formulation.
* Move lcm_origin_steps from pipeline __call__ to LCMScheduler.__init__/config (as origin_steps).
* Fix typo when clipping/thresholding in LCMScheduler.
* Add some initial LCMScheduler tests.
* add type annotations from review
* Fix type annotation bug.
* Override test_add_noise_device in LCMSchedulerTest since hardcoded timesteps doesn't work under default settings.
* Add generator argument pipeline prepare_latents call.
* Cast LCMScheduler.timesteps to long in set_timesteps.
* Add onestep and multistep full loop scheduler tests.
* Set default height/width to None and don't hardcode guidance scale embedding dim.
* Add initial LatentConsistencyPipeline fast and slow tests.
* Add initial documentation for LatentConsistencyModelPipeline and LCMScheduler.
* Make remaining failing fast tests pass.
* make style
* Make original_inference_steps configurable from pipeline __call__ again.
* make style
* Remove guidance_rescale arg from pipeline __call__ since LCM currently doesn't support CFG.
* Make LCMScheduler defaults match config of LCM_Dreamshaper_v7 checkpoint.
* Fix LatentConsistencyPipeline slow tests and add dummy expected slices.
* Add checks for original_steps in LCMScheduler.set_timesteps.
* make fix-copies
* Improve LatentConsistencyModelPipeline docs.
* Apply suggestions from code review
Co-authored-by: Aryan V S <avs050602@gmail.com >
* Apply suggestions from code review
Co-authored-by: Aryan V S <avs050602@gmail.com >
* Apply suggestions from code review
Co-authored-by: Aryan V S <avs050602@gmail.com >
* Update src/diffusers/schedulers/scheduling_lcm.py
* Apply suggestions from code review
Co-authored-by: Aryan V S <avs050602@gmail.com >
* finish
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Aryan V S <avs050602@gmail.com >
2023-10-24 21:06:02 +02:00
Steven Liu
7c3a75a1ce
[docs] General updates ( #5378 )
...
* first draft
* feedback
* feedback
2023-10-24 11:51:55 -07:00
Sayak Paul
77241c48af
[Core] Refactor activation and normalization layers ( #5493 )
...
* move out the activations.
* move normalization layers.
* add doc.
* add doc.
* fix: paths
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* style
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2023-10-24 08:49:43 +05:30
YiYi Xu
9e1edfc1ad
fix a few issues in controlnet inpaint pipelines ( #5470 )
...
* add
* Update docs/source/en/api/pipelines/controlnet_sdxl.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-10-23 09:24:51 -10:00
Steven Liu
6b06c30a65
[docs] Fix links ( #5499 )
...
fix links
2023-10-23 20:39:29 +02:00
Heinz-Alexander Fuetterer
0ea78f9707
chore: fix typos ( #5386 )
...
* chore: fix typos
* Update src/diffusers/pipelines/shap_e/renderer.py
Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com >
---------
Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com >
2023-10-16 15:23:37 +02:00
Jonathan Whitaker
35952e61c1
Fix links in docs to adapter code ( #5323 )
...
Update adapter.md to fix links to adapter pipelines
2023-10-09 17:20:12 +02:00
Patrick von Platen
a91a273d0b
[Docs] Try to fix doc builder ( #5180 )
...
* try to fix docs
* try to fix docs
2023-09-25 20:24:50 +02:00
Patrick von Platen
d70944bf7f
fix docs
2023-09-25 19:55:49 +02:00
MLRichter
0bc6be6960
Update wuerstchen.md ( #5156 )
2023-09-25 18:43:08 +02:00
Patrick von Platen
144c3a8b7c
[Imports] Fix many import bugs and make sure that doc builder CI test works correctly ( #5176 )
...
* [Doc builder] Ensure slow import for doc builder
* Apply suggestions from code review
* env for doc builder
* fix more
* [Diffusers] Set import to slow as env variable
* fix docs
* fix docs
* Apply suggestions from code review
* Apply suggestions from code review
* fix docs
* fix docs
2023-09-25 18:06:51 +02:00
Ayush Mangal
157c9011d8
Add BLIP Diffusion ( #4388 )
...
* Add BLIP Diffusion skeleton
* Add other model components
* Add BLIP2, need to change it for now
* Fix pipeline imports
* Load pretrained ViT
* Make qformer fwd pass same
* Replicate fwd passes
* Fix device bug
* Add accelerate functions
* Remove extra functions from Blip2
* Minor bug
* Integrate initial review changes
* Refactoring
* Refactoring
* Refactor
* Add controlnet
* Refactor
* Update conversion script
* Add image processor
* Shift postprocessing to ImageProcessor
* Refactor
* Fix device
* Add fast tests
* Update conversion script
* Fix checkpoint conversion script
* Integrate review changes
* Integrate reivew changes
* Remove unused functions from test
* Reuse HF image processor in Cond image
* Create new BlipImageProcessor based on transfomers
* Fix image preprocessor
* Minor
* Minor
* Add canny preprocessing
* Fix controlnet preprocessing
* Fix blip diffusion test
* Add controlnet test
* Add initial doc strings
* Integrate review changes
* Refactor
* Update examples
* Remove DDIM comments
* Add copied from for prepare_latents
* Add type anotations
* Add docstrings
* Do black formatting
* Add batch support
* Make tests pass
* Make controlnet tests pass
* Black formatting
* Fix progress bar
* Fix some licensing comments
* Fix imports
* Refactor controlnet
* Make tests faster
* Edit examples
* Black formatting/Ruff
* Add doc
* Minor
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Move controlnet pipeline
* Make tests faster
* Fix imports
* Fix formatting
* Fix make errors
* Fix make errors
* Minor
* Add suggested doc changes
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Edit docs
* Fix 16 bit loading
* Update examples
* Edit toctree
* Update docs/source/en/api/pipelines/blip_diffusion.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Minor
* Add tips
* Edit examples
* Update model paths
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-09-21 17:05:35 +01:00
Ruoxi
16b9a57d29
Implement CustomDiffusionAttnProcessor2_0. ( #4604 )
...
* Implement `CustomDiffusionAttnProcessor2_0`
* Doc-strings and type annotations for `CustomDiffusionAttnProcessor2_0`. (#1 )
* Update attnprocessor.md
* Update attention_processor.py
* Interops for `CustomDiffusionAttnProcessor2_0`.
* Formatted `attention_processor.py`.
* Formatted doc-string in `attention_processor.py`
* Conditional CustomDiffusion2_0 for training example.
* Remove unnecessary reference impl in comments.
* Fix `save_attn_procs`.
2023-09-18 14:49:00 +02:00
Kashif Rasul
427feb5359
[Wuerstchen] fix typos in docs ( #5051 )
...
* fix typos in docs
* fix for issue #5023
2023-09-15 12:53:25 +02:00
Lucain
b954c22a44
Fix broken link in docs ( #5015 )
...
fix broken link
2023-09-13 15:40:25 +02:00
Kashif Rasul
77373c5eb1
[Wuerstchen] fix compel usage ( #4999 )
...
* fix compel usage
* minor changes in documentation
* fix tests
* fix more
* fix more
* typos
* fix tests
* formatting
---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-13 14:54:59 +02:00
Patrick von Platen
324aef6d14
[SDXL] Add LoRA to all pipelines ( #4896 )
...
* [SDXL] Add LoRA to all pipelines
* fix all
* fix all
* fix all
* fix more docs
* make style
2023-09-13 11:05:20 +02:00
Kashif Rasul
16a056a7b5
Wuerstchen fixes ( #4942 )
...
* fix arguments and make example code work
* change arguments in combined test
* Add default timesteps
* style
* fixed test
* fix broken test
* formatting
* fix docstrings
* fix num_images_per_prompt
* fix doc styles
* please dont change this
* fix tests
* rename to DEFAULT_STAGE_C_TIMESTEPS
---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com >
2023-09-11 15:47:53 +02:00
Dhruv Nair
b6e0b016ce
Lazy Import for Diffusers ( #4829 )
...
* initial commit
* move modules to import struct
* add dummy objects and _LazyModule
* add lazy import to schedulers
* clean up unused imports
* lazy import on models module
* lazy import for schedulers module
* add lazy import to pipelines module
* lazy import altdiffusion
* lazy import audio diffusion
* lazy import audioldm
* lazy import consistency model
* lazy import controlnet
* lazy import dance diffusion ddim ddpm
* lazy import deepfloyd
* lazy import kandinksy
* lazy imports
* lazy import semantic diffusion
* lazy imports
* lazy import stable diffusion
* move sd output to its own module
* clean up
* lazy import t2iadapter
* lazy import unclip
* lazy import versatile and vq diffsuion
* lazy import vq diffusion
* helper to fetch objects from modules
* lazy import sdxl
* lazy import txt2vid
* lazy import stochastic karras
* fix model imports
* fix bug
* lazy import
* clean up
* clean up
* fixes for tests
* fixes for tests
* clean up
* remove import of torch_utils from utils module
* clean up
* clean up
* fix mistake import statement
* dedicated modules for exporting and loading
* remove testing utils from utils module
* fixes from merge conflicts
* Update src/diffusers/pipelines/kandinsky2_2/__init__.py
* fix docs
* fix alt diffusion copied from
* fix check dummies
* fix more docs
* remove accelerate import from utils module
* add type checking
* make style
* fix check dummies
* remove torch import from xformers check
* clean up error message
* fixes after upstream merges
* dummy objects fix
* fix tests
* remove unused module import
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-09-11 09:56:22 +02:00
Sayak Paul
88735249da
[Docs] fix: minor formatting in the Würstchen docs ( #4965 )
...
fix: minor formatting in the docs
2023-09-11 09:12:53 +02:00
Sayak Paul
9800cc5ece
[InstructPix2Pix] Fix pipeline implementation and add docs ( #4844 )
...
* initial evident fixes.
* instructpix2pix fixes.
* add: entry to doc.
* address PR feedback.
* make fix-copies
2023-09-07 15:34:19 +05:30
Kashif Rasul
541bb6ee63
Würstchen model ( #3849 )
...
* initial
* initial
* added initial convert script for paella vqmodel
* initial wuerstchen pipeline
* add LayerNorm2d
* added modules
* fix typo
* use model_v2
* embed clip caption amd negative_caption
* fixed name of var
* initial modules in one place
* WuerstchenPriorPipeline
* inital shape
* initial denoising prior loop
* fix output
* add WuerstchenPriorPipeline to __init__.py
* use the noise ratio in the Prior
* try to save pipeline
* save_pretrained working
* Few additions
* add _execution_device
* shape is int
* fix batch size
* fix shape of ratio
* fix shape of ratio
* fix output dataclass
* tests folder
* fix formatting
* fix float16 + started with generator
* Update pipeline_wuerstchen.py
* removed vqgan code
* add WuerstchenGeneratorPipeline
* fix WuerstchenGeneratorPipeline
* fix docstrings
* fix imports
* convert generator pipeline
* fix convert
* Work on Generator Pipeline. WIP
* Pipeline works with our diffuzz code
* apply scale factor
* removed vqgan.py
* use cosine schedule
* redo the denoising loop
* Update src/diffusers/models/resnet.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* use torch.lerp
* use warp-diffusion org
* clip_sample=False,
* some refactoring
* use model_v3_stage_c
* c_cond size
* use clip-bigG
* allow stage b clip to be None
* add dummy
* würstchen scheduler
* minor changes
* set clip=None in the pipeline
* fix attention mask
* add attention_masks to text_encoder
* make fix-copies
* add back clip
* add text_encoder
* gen_text_encoder and tokenizer
* fix import
* updated pipeline test
* undo changes to pipeline test
* nip
* fix typo
* fix output name
* set guidance_scale=0 and remove diffuze
* fix doc strings
* make style
* nip
* removed unused
* initial docs
* rename
* toc
* cleanup
* remvoe test script
* fix-copies
* fix multi images
* remove dup
* remove unused modules
* undo changes for debugging
* no new line
* remove dup conversion script
* fix doc string
* cleanup
* pass default args
* dup permute
* fix some tests
* fix prepare_latents
* move Prior class to modules
* offload only the text encoder and vqgan
* fix resolution calculation for prior
* nip
* removed testing script
* fix shape
* fix argument to set_timesteps
* do not change .gitignore
* fix resolution calculations + readme
* resolution calculation fix + readme
* small fixes
* Add combined pipeline
* rename generator -> decoder
* Update .gitignore
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* removed efficient_net
* create combined WuerstchenPipeline
* make arguments consistent with VQ model
* fix var names
* no need to return text_encoder_hidden_states
* add latent_dim_scale to config
* split model into its own file
* add WuerschenPipeline to docs
* remove unused latent_size
* register latent_dim_scale
* update script
* update docstring
* use Attention preprocessor
* concat with normed input
* fix-copies
* add docs
* fix test
* fix style
* add to cpu_offloaded_model
* updated type
* remove 1-line func
* updated type
* initial decoder test
* formatting
* formatting
* fix autodoc link
* num_inference_steps is int
* remove comments
* fix example in docs
* Update src/diffusers/pipelines/wuerstchen/diffnext.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* rename layernorm to WuerstchenLayerNorm
* rename DiffNext to WuerstchenDiffNeXt
* added comment about MixingResidualBlock
* move paella vq-vae to pipelines' folder
* initial decoder test
* increased test_float16_inference expected diff
* self_attn is always true
* more passing decoder tests
* batch image_embeds
* fix failing tests
* set the correct dtype
* relax inference test
* update prior
* added combined pipeline test
* faster test
* faster test
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* fix issues from review
* update wuerstchen.md + change generator name
* resolve issues
* fix copied from usage and add back batch_size
* fix API
* fix arguments
* fix combined test
* Added timesteps argument + fixes
* Update tests/pipelines/test_pipelines_common.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update tests/pipelines/wuerstchen/test_wuerstchen_prior.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py
* up
* Fix more
* failing tests
* up
* up
* correct naming
* correct docs
* correct docs
* fix test params
* correct docs
* fix classifier free guidance
* fix classifier free guidance
* fix more
* fix all
* make tests faster
---------
Co-authored-by: Dominic Rampas <d6582533@gmail.com >
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Dominic Rampas <61938694+dome272@users.noreply.github.com >
2023-09-06 16:15:51 +02:00
Steven Liu
946bb53c56
[docs] Add stronger warning for SDXL height/width ( #4867 )
...
* add size warning
* feedback
2023-09-05 10:50:42 -07:00
Steven Liu
2c45a53aef
[docs] Shap-E guide ( #4700 )
...
* first draft
* fixes
* more fixes
* fix toctree
2023-09-01 19:52:41 -07:00
Steven Liu
22ea35cf23
[docs] DiffEdit guide ( #4722 )
...
* first draft
* minor edits
2023-09-01 14:18:41 -07:00
Pedro Cuenca
60d259add1
Fix link from API to using-diffusers ( #4856 )
...
* Fix link from API to using-diffusers
* Fix link
2023-09-01 15:05:01 +02:00
Nguyễn Công Tú Anh
38466c369f
Add GLIGEN Text Image implementation ( #4777 )
...
* Add GLIGEN Text Image implementation
* add style transfer from image
* fix check_repository_consistency
* add convert script GLIGEN model to Diffusers
* rename attention type
* fix style code
* remove PositionNetTextImage
* Revert "fix check_repository_consistency"
This reverts commit 15f098c96e .
* change attention type name
* update docs for GLIGEN
* change examples with hf-document-image
* fix style
* add CLIPImageProjection for GLIGEN
* Add new encode_prompt, load project matrix in pipe init
* move CLIPImageProjection to stable_diffusion
* add comment
2023-09-01 15:48:01 +05:30
Steven Liu
aedd78767c
[docs] ControlNet guide ( #4640 )
...
* first draft
* finish first draft
* feedback and remove sections from API pages
* clean docstrings
* add full code example
2023-08-31 10:02:02 -04:00
Steven Liu
a1fdfca36f
[docs] SDXL ( #4428 )
...
* first draft
* reorg toctree
* note about minsdxl
* feedback
* fix
* micro-conditionings
* add tip
* fix section levels
* d'oh fix pipeline names
* feedback
* remove old section
2023-08-30 11:34:55 -04:00
Chong Mou
12358b986f
add models for T2I-Adapter-XL ( #4696 )
...
* T2I-Adapter-XL
* update
* update
* add pipeline
* modify pipeline
* modify pipeline
* modify pipeline
* modify pipeline
* modify pipeline
* modify modeling_text_unet
* fix styling.
* fix: copies.
* adapter settings
* new test case
* new test case
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* debugging
* revert prints.
* new test case
* remove print
* org test case
* add test_pipeline
* styling.
* fix copies.
* modify test parameter
* style.
* add adapter-xl doc
* double quotes in docs
* Fix potential type mismatch
* style.
---------
Co-authored-by: sayakpaul <spsayakpaul@gmail.com >
2023-08-29 10:34:07 +05:30
Sayak Paul
3be0ff9056
[Core] Support negative conditions in SDXL ( #4774 )
...
* add: support negative conditions.
* fix: key
* add: tests
* address PR feedback.
* add documentation
* add img2img support.
* add inpainting support.
* ad controlnet support
* Apply suggestions from code review
* modify wording in the doc.
2023-08-26 09:13:44 +05:30
Sanchit Gandhi
b1290d3fb8
Convert MusicLDM ( #4579 )
...
* from audioldm
* fix vae
* move to new pipeline
* copied from audioldm
* remove redundant control flow
* iterate
* fix docstring
* finish pipeline
* tests: from audioldm2
* iterate
* finish fast tests
* finish slow integration tests
* add docs
* remove dtype test
* update toctree
* "copied from" in conversion (where possible)
* Update docs/source/en/api/pipelines/musicldm.md
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* fix docstring
* make nightly
* style
* fix dtype test
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-08-25 13:31:00 +01:00
Sanchit Gandhi
24c5e7708b
[AudioLDM2] Doc fixes ( #4739 )
...
* [AudioLDM2] Doc fixes
* update docstrings
* fix unet docstring
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-08-24 07:20:27 +05:30
Sanchit Gandhi
05b0ec63bc
[AudioLDM Docs] Fix docs for output ( #4737 )
2023-08-23 18:02:11 +02:00
dg845
f75b8aa9dd
[docs] Add note in UniDiffusers Doc about PyTorch 1.X numerical stability issue ( #4703 )
...
* Add note regarding UniDiffuser pipeline numerical stability issues on PyTorch 1.X
* Use the doc-builder warning tag.
2023-08-22 07:12:06 +05:30
Sanchit Gandhi
7a24977ce3
Add AudioLDM 2 ( #4549 )
...
* from audioldm
* unet down + mid
* vae, clap, flan-t5
* start sequence audio mae
* iterate on audioldm encoder
* finish encoder
* finish weight conversion
* text pre-processing
* gpt2 pre-processing
* fix projection model
* working
* unet equivalence
* finish in base
* add unet cond
* finish unet
* finish custom unet
* start clean-up
* revert base unet changes
* refactor pre-processing
* tests: from audioldm
* fix some tests
* more fixes
* iterate on tests
* make fix copies
* harden fast tests
* slow integration tests
* finish tests
* update checkpoint
* update copyright
* docs
* remove outdated method
* add docstring
* make style
* remove decode latents
* enable cpu offload
* (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer)
* more clean up
* more refactor
* build pr docs
* Update docs/source/en/api/pipelines/audioldm2.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* small clean
* tidy conversion
* update for large checkpoint
* generate -> generate_language_model
* full clap model
* shrink clap-audio in tests
* fix large integration test
* fix fast tests
* use generation config
* make style
* update docs
* finish docs
* finish doc
* update tests
* fix last test
* syntax
* finalise tests
* refactor projection model in prep for TTS
* fix fast tests
* style
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-08-21 12:34:21 +01:00
Sayak Paul
5333f4c0ec
make things clear in the controlnet sdxl doc. ( #4644 )
2023-08-17 09:04:28 +05:30