Susung Hong
0df47efee2
[Docs] update Self-Attention Guidance docs ( #2952 )
...
* Update index.mdx
* Edit docs & add HF space link
* Only change equation numbers in comments
2023-04-12 10:14:32 +01:00
Sayak Paul
a89a14fa7a
[LoRA] Enabling limited LoRA support for text encoder ( #2918 )
...
* add: first draft for a better LoRA enabler.
* make fix-copies.
* feat: backward compatibility.
* add: entry to the docs.
* add: tests.
* fix: docs.
* fix: norm group test for UNet3D.
* feat: add support for flat dicts.
* add depcrcation message instead of warning.
2023-04-12 08:29:04 +05:30
Steven Liu
cb9d77af23
[docs] Reusing components ( #3000 )
...
* reuse-components
* format
2023-04-11 15:34:34 +01:00
Patrick von Platen
8b451eb63b
Fix config prints and save, load of pipelines ( #2849 )
...
* [Config] Fix config prints and save, load
* Only use potential nn.Modules for dtype and device
* Correct vae image processor
* make sure in_channels is not accessed directly
* make sure in channels is only accessed via config
* Make sure schedulers only access config attributes
* Make sure to access config in SAG
* Fix vae processor and make style
* add tests
* uP
* make style
* Fix more naming issues
* Final fix with vae config
* change more
2023-04-11 13:35:42 +02:00
Mishig
4f48476dd6
Update contribution.mdx ( #3054 )
...
* Update contribution.mdx
hotfix for doc-builder parsing quote in heading bug
* quoteation replace
2023-04-11 09:23:58 +02:00
Andranik Movsisyan
ba49272db8
[Pipeline] Add TextToVideoZeroPipeline ( #2954 )
...
* add TextToVideoZeroPipeline and CrossFrameAttnProcessor
* add docs for text-to-video zero
* add teaser image for text-to-video zero docs
* Fix review changes. Add Documentation. Add test
* clean up the codes in pipeline_text_to_video.py. Add descriptive comments and docstrings
* make style && make quality
* make fix-copies
* make requested changes to docs. use huggingface server links for resources, delete res folder
* make style && make quality && make fix-copies
* make style && make quality
* Apply suggestions from code review
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-04-10 22:09:53 +02:00
Pedro Cuenca
983a7fbfd8
Initial draft of Core ML docs ( #2987 )
...
* Initial draft of Core ML docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Fix Core ML spelling
* Apply the rest of suggestions.
* Attempt to fix hyperlink inside Tip.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Apply suggestions from code review
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2023-04-10 21:09:04 +02:00
Guspan Tanadi
ce144d6dd0
docs: Link Navigation Path API Pipelines ( #2976 )
...
* docs: link navigation Safe Stable Diffusion
Link navigation API pipelines text2img and using diffusers Conditional Image Generation.
* docs: link navigation Versatile Diffusion
Removing exceeding path Stable Diffusion Overview.
* docs: Python extension Spectrogram Diffusion
Link navigation Spectrogram Diffusion Pipeline source code
* docs: Link navigation AltDiffusion Pipelines
Stable Diffusion Overview and Using Diffusers path.
2023-04-07 14:07:42 -07:00
Sayak Paul
24947317a6
[Examples] Add support for Min-SNR weighting strategy for better convergence ( #2899 )
...
* improve stable unclip doc.
* feat: support for applying min-snr weighting for faster convergence.
* add: support for validation logging with wandb
* make not a required arg.
* fix: arg name.
* fix: cli args.
* fix: tracker config.
* fix: loss calculation.
* fix: validation logging.
* fix: unwrap call.
* fix: validation logging.
* fix: internval.
* fix: checkpointing push to hub.
* fix: c8a2856c6d \#commitcomment-106913193
* fix: norm group test for UNet3D.
* address PR comments.
* remove unneeded code.
* add: entry in the readme and docs.
* Apply suggestions from code review
Co-authored-by: Suraj Patil <surajp815@gmail.com >
---------
Co-authored-by: Suraj Patil <surajp815@gmail.com >
2023-04-06 19:08:40 +05:30
Steven Liu
0d0fa2a3e1
[docs] Simplify loading guide ( #2694 )
...
* simplify loading guide
* apply feedbacks
* clarify variants
* clarify torch_dtype and variant
* remove conceptual pipeline doc
2023-04-04 14:08:21 -07:00
Guspan Tanadi
f3e72e9e57
Removing explicit markdown extension ( #2944 )
...
Trigger from previous PR. Build the page once again.
2023-04-04 14:15:19 +01:00
M. Tolga Cangöz
4fd7e97f33
Update ddpm.mdx ( #2929 )
2023-04-04 14:02:30 +01:00
M. Tolga Cangöz
4a1eae07c7
Update ddim.mdx ( #2926 )
2023-04-04 14:01:55 +01:00
M. Tolga Cangöz
e329edff7e
Update score_sde_vp.mdx ( #2938 )
2023-04-04 14:00:43 +01:00
M. Tolga Cangöz
3e2d1af867
Update score_sde_ve.mdx ( #2937 )
2023-04-04 14:00:15 +01:00
M. Tolga Cangöz
715c25d344
Update unipc.mdx ( #2936 )
2023-04-04 13:59:53 +01:00
M. Tolga Cangöz
4274a3a915
Update euler_ancestral.mdx ( #2932 )
2023-04-04 13:58:58 +01:00
M. Tolga Cangöz
c43356267b
Update controlnet.mdx ( #2912 )
...
.
2023-03-31 14:32:36 +01:00
M. Tolga Cangöz
89b23d9869
Update image_variation.mdx ( #2911 )
...
.
2023-03-31 14:31:43 +01:00
Guspan Tanadi
419660c99b
Have fix current pipeline link ( #2910 )
...
Also capitalization notebook provider name
2023-03-31 14:31:14 +01:00
YiYi Xu
51d970d60d
[docs] add the Stable diffusion with Jax/Flax Guide into the docs ( #2487 )
...
* add stable diffusion jax guide
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-03-30 16:22:40 -10:00
Sayak Paul
b2021273eb
[Docs] add an example use for StableUnCLIPPipeline in the pipeline docs ( #2897 )
...
* improve stable unclip doc.
* add: entry of StableUnCLIPPipeline to the docs
* Apply suggestions from code review
Co-authored-by: apolinario <joaopaulo.passos@gmail.com >
---------
Co-authored-by: apolinario <joaopaulo.passos@gmail.com >
2023-03-30 17:14:04 +05:30
Steven Liu
e47459c80f
[docs] Performance tutorial ( #2773 )
...
* update performance tutorial
* fix divs
* oops forgot to close tag
* apply feedback
* apply feedback
* apply feedback
* align doc title
2023-03-29 12:48:14 -07:00
Patrick von Platen
40a7b8629e
[Docs] Correct phrasing ( #2873 )
2023-03-28 17:32:18 +01:00
M. Tolga Cangöz
628fefb232
Update stable_diffusion_safe.mdx ( #2870 )
...
Fix typos
2023-03-28 17:23:54 +01:00
M. Tolga Cangöz
03fe36f183
Update paint_by_example.mdx ( #2869 )
...
.
2023-03-28 17:23:39 +01:00
M. Tolga Cangöz
ef4c2fa4f1
Update alt_diffusion.mdx ( #2865 )
...
Fix typos
2023-03-28 17:17:53 +01:00
M. Tolga Cangöz
3980858ad4
Update overview.mdx ( #2864 )
...
Fix typos
2023-03-28 17:17:33 +01:00
M. Tolga Cangöz
37c82480bb
Update evaluation.mdx ( #2862 )
...
Fix typos
2023-03-28 17:15:37 +01:00
dg845
663c654577
[WIP][Docs] Use DiffusionPipeline Instead of Child Classes when Loading Pipeline ( #2809 )
...
* Change the docs to use the parent DiffusionPipeline class when loading a checkpoint using from_pretrained() instead of a child class (e.g. StableDiffusionPipeline) where possible.
* Run make style to fix style issues.
* Change more docs to use DiffusionPipeline rather than a subclass.
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-03-28 16:44:34 +01:00
John HU
920a15cf70
Fix link to LoRA training guide in DreamBooth training guide ( #2836 )
...
Fix link to LoRA training guide
2023-03-28 16:35:41 +01:00
Sandeep
b76d9fde8d
Remove suggestion to use cuDNN benchmark in docs ( #2793 )
...
* Remove suggestion to use cuDNN benchmark in docs
* removing the wrong line
2023-03-28 16:01:30 +01:00
Sayak Paul
fab4f3d6e4
improve stable unclip doc. ( #2823 )
2023-03-28 08:18:29 +05:30
Sayak Paul
5883d8d4d1
[Docs] update docs (Stable unCLIP) to reflect the updated ckpts. ( #2815 )
...
* update docs to reflect the updated ckpts.
* update: point about prompt.
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* emove image resizing.
* Apply suggestions from code review
* Apply suggestions from code review
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-03-24 17:24:19 +01:00
Bahjat Kawar
37a44bb283
Add ModelEditing pipeline ( #2721 )
...
* TIME first commit
* styling.
* styling 2.
* fixes; tests
* apply styling and doc fix.
* remove sups.
* fixes
* remove temp file
* move augmentations to const
* added doc entry
* code quality
* customize augmentations
* quality
* quality
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2023-03-24 13:01:39 +05:30
Sanchit Gandhi
b94880e536
Add AudioLDM ( #2232 )
...
* Add AudioLDM
* up
* add vocoder
* start unet
* unconditional unet
* clap, vocoder and vae
* clean-up: conversion scripts
* fix: conversion script token_type_ids
* clean-up: pipeline docstring
* tests: from SD
* clean-up: cpu offload vocoder instead of safety checker
* feat: adapt tests to audioldm
* feat: add docs
* clean-up: amend pipeline docstrings
* clean-up: make style
* clean-up: make fix-copies
* fix: add doc path to toctree
* clean-up: args for conversion script
* clean-up: paths to checkpoints
* fix: use conditional unet
* clean-up: make style
* fix: type hints for UNet
* clean-up: docstring for UNet
* clean-up: make style
* clean-up: remove duplicate in docstring
* clean-up: make style
* clean-up: make fix-copies
* clean-up: move imports to start in code snippet
* fix: pass cross_attention_dim as a list/tuple to unet
* clean-up: make fix-copies
* fix: update checkpoint path
* fix: unet cross_attention_dim in tests
* film embeddings -> class embeddings
* Apply suggestions from code review
Co-authored-by: Will Berman <wlbberman@gmail.com >
* fix: unet film embed to use existing args
* fix: unet tests to use existing args
* fix: make style
* fix: transformers import and version in init
* clean-up: make style
* Revert "clean-up: make style"
This reverts commit 5d6d1f8b32 .
* clean-up: make style
* clean-up: use pipeline tester mixin tests where poss
* clean-up: skip attn slicing test
* fix: add torch dtype to docs
* fix: remove conversion script out of src
* fix: remove .detach from 1d waveform
* fix: reduce default num inf steps
* fix: swap height/width -> audio_length_in_s
* clean-up: make style
* fix: remove nightly tests
* fix: imports in conversion script
* clean-up: slim-down to two slow tests
* clean-up: slim-down fast tests
* fix: batch consistent tests
* clean-up: make style
* clean-up: remove vae slicing fast test
* clean-up: propagate changes to doc
* fix: increase test tol to 1e-2
* clean-up: finish docs
* clean-up: make style
* feat: vocoder / VAE compatibility check
* feat: possibly expand / cut audio waveform
* fix: pipeline call signature test
* fix: slow tests output len
* clean-up: make style
* make style
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: William Berman <WLBberman@gmail.com >
2023-03-23 19:00:21 +01:00
Steven Liu
1870fb05a9
[docs] Add Colab notebooks and Spaces ( #2713 )
...
* add colab notebook and spaces
* fix image link
2023-03-23 09:48:58 -07:00
YiYi Xu
df91c44712
Flax controlnet ( #2727 )
...
* add contronet flax
---------
Co-authored-by: yiyixuxu <yixu310@gmail,com>
2023-03-23 05:46:23 -10:00
Sayak Paul
0d7aac3e8d
[Docs] small fixes to the text to video doc. ( #2787 )
...
* small fixes to the text to video doc.
* add: Spaces link.
* add: warning on research-only model.
2023-03-23 18:57:02 +05:30
Kashif Rasul
2ef9bdd76f
Music Spectrogram diffusion pipeline ( #1044 )
...
* initial TokenEncoder and ContinuousEncoder
* initial modules
* added ContinuousContextTransformer
* fix copy paste error
* use numpy for get_sequence_length
* initial terminal relative positional encodings
* fix weights keys
* fix assert
* cross attend style: concat encodings
* make style
* concat once
* fix formatting
* Initial SpectrogramPipeline
* fix input_tokens
* make style
* added mel output
* ignore weights for config
* move mel to numpy
* import pipeline
* fix class names and import
* moved models to models folder
* import ContinuousContextTransformer and SpectrogramDiffusionPipeline
* initial spec diffusion converstion script
* renamed config to t5config
* added weight loading
* use arguments instead of t5config
* broadcast noise time to batch dim
* fix call
* added scale_to_features
* fix weights
* transpose laynorm weight
* scale is a vector
* scale the query outputs
* added comment
* undo scaling
* undo depth_scaling
* inital get_extended_attention_mask
* attention_mask is none in self-attention
* cleanup
* manually invert attention
* nn.linear need bias=False
* added T5LayerFFCond
* remove to fix conflict
* make style and dummy
* remove unsed variables
* remove predict_epsilon
* Move accelerate to a soft-dependency (#1134 )
* finish
* finish
* Update src/diffusers/modeling_utils.py
* Update src/diffusers/pipeline_utils.py
Co-authored-by: Anton Lozhkov <anton@huggingface.co >
* more fixes
* fix
Co-authored-by: Anton Lozhkov <anton@huggingface.co >
* fix order
* added initial midi to note token data pipeline
* added int to int tokenizer
* remove duplicate
* added logic for segments
* add melgan to pipeline
* move autoregressive gen into pipeline
* added note_representation_processor_chain
* fix dtypes
* remove immutabledict req
* initial doc
* use np.where
* require note_seq
* fix typo
* update dependency
* added note-seq to test
* added is_note_seq_available
* fix import
* added toc
* added example usage
* undo for now
* moved docs
* fix merge
* fix imports
* predict first segment
* avoid un-needed copy to and from cpu
* make style
* Copyright
* fix style
* add test and fix inference steps
* remove bogus files
* reorder models
* up
* remove transformers dependency
* make work with diffusers cross attention
* clean more
* remove @
* improve further
* up
* uP
* Apply suggestions from code review
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
* loop over all tokens
* make style
* Added a section on the model
* fix formatting
* grammer
* formatting
* make fix-copies
* Update src/diffusers/pipelines/__init__.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/pipelines/spectrogram_diffusion/pipeline_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* added callback ad optional ionnx
* do not squeeze batch dim
* clean up more
* upload
* convert jax to nnumpy
* make style
* fix warning
* make fix-copies
* fix warning
* add initial fast tests
* add initial pipeline_params
* eval mode due to dropout
* skip batch tests as pipeline runs on a single file
* make style
* fix relative path
* fix doc tests
* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update src/diffusers/models/t5_film_transformer.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update docs/source/en/api/pipelines/spectrogram_diffusion.mdx
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* add MidiProcessor
* format
* fix org
* Apply suggestions from code review
* Update tests/pipelines/spectrogram_diffusion/test_spectrogram_diffusion.py
* make style
* pin protobuf to <4
* fix formatting
* white space
* tensorboard needs protobuf
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
Co-authored-by: Anton Lozhkov <anton@huggingface.co >
2023-03-23 14:06:17 +01:00
Naoki Ainoya
14e3a28c12
Rename 'CLIPFeatureExtractor' class to 'CLIPImageProcessor' ( #2732 )
...
The 'CLIPFeatureExtractor' class name has been renamed to 'CLIPImageProcessor' in order to comply with future deprecation. This commit includes the necessary changes to the affected files.
2023-03-23 13:49:22 +01:00
Mishig
8e35ef0142
[doc wip] literalinclude ( #2718 )
2023-03-23 13:42:54 +01:00
Sayak Paul
0d633a42f4
deduplicate training section in the docs. ( #2788 )
2023-03-23 11:21:53 +05:30
Sayak Paul
9dc84448ac
[Examples] InstructPix2Pix instruct training script ( #2478 )
...
* add: initial implementation of the pix2pix instruct training script.
* shorten cli arg.
* fix: main process check.
* fix: dataset column names.
* simplify tokenization.
* proper placement of null conditions.
* apply styling.
* remove debugging message for conditioning do.
* complete license.
* add: requirements.tzt
* wandb column name order.
* fix: augmentation.
* change: dataset_id.
* fix: convert_to_np() call.
* fix: reshaping.
* fix: final ema copy.
* Apply suggestions from code review
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
* address PR comments.
* add: readme details.
* config fix.
* downgrade version.
* reduce image width in the readme.
* note on hyperparameters during generation.
* add: output images.
* update readme.
* minor edits to readme.
* debugging statement.
* explicitly placement of the pipeline.
* bump minimum diffusers version.
* fix: device attribute error.
* weight dtype.
* debugging.
* add dtype inform.
* add seoarate te and vae.
* add: explicit casting/
* remove casting.
* up.
* up 2.
* up 3.
* autocast.
* disable mixed-precision in the final inference.
* debugging information.
* autocasting.
* add: instructpix2pix training section to the docs.
* Empty-Commit
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com >
2023-03-23 10:15:01 +05:30
Sayak Paul
c681ad1af2
add: section on multiple controlnets. ( #2762 )
...
* add: section on multiple controlnets.
Co-authored-by: William Berman <WLBberman@gmail.com >
* fix: docs.
* fix: docs.
---------
Co-authored-by: William Berman <WLBberman@gmail.com >
2023-03-23 09:55:25 +05:30
Pedro Cuenca
92e1164e2e
mps: remove warmup passes (#2771 )
...
* Remove warmup passes in mps tests.
* Update mps docs: no warmup pass in PyTorch 2
* Update imports.
2023-03-22 19:29:27 +01:00
Patrick von Platen
ca1a22296d
[MS Text To Video] Add first text to video ( #2738 )
...
* [MS Text To Video} Add first text to video
* upload
* make first model example
* match unet3d params
* make sure weights are correcctly converted
* improve
* forward pass works, but diff result
* make forward work
* fix more
* finish
* refactor video output class.
* feat: add support for a video export utility.
* fix: opencv availability check.
* run make fix-copies.
* add: docs for the model components.
* add: standalone pipeline doc.
* edit docstring of the pipeline.
* add: right path to TransformerTempModel
* add: first set of tests.
* complete fast tests for text to video.
* fix bug
* up
* three fast tests failing.
* add: note on slow tests
* make work with all schedulers
* apply styling.
* add slow tests
* change file name
* update
* more correction
* more fixes
* finish
* up
* Apply suggestions from code review
* up
* finish
* make copies
* fix pipeline tests
* fix more tests
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* apply suggestions
* up
* revert
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
2023-03-22 18:39:33 +01:00
Steven Liu
7fe88613fa
[docs] Clarify purpose of reproducibility docs ( #2756 )
...
* clarify purpose of repro docs
* apply feedback
2023-03-21 17:35:21 -07:00
Pedro Cuenca
a39d42b91d
[docs] update torch 2 benchmark ( #2764 )
...
* Update benchmark for A100, 3090, 3090 Ti, 4090.
* Link to PyTorch blog.
* Update install instructions.
2023-03-21 17:41:13 +00:00
Patrick von Platen
2120b4eee3
Improve Contribution Doc ( #2043 )
...
* first refactor
* more text
* improve
* finish
* up
* up
* up
* up
* finish
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
* up
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* finished
* Apply suggestions from code review
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
* finished
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: YiYi Xu <yixu310@gmail.com >
Co-authored-by: Pedro Cuenca <pedro@huggingface.co >
2023-03-21 13:41:29 +01:00