diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Steven Liu	7c3a75a1ce	[docs] General updates (#5378 ) * first draft * feedback * feedback	2023-10-24 11:51:55 -07:00
Isamu Isozaki	b8896a154a	Japanese docs (#5478 ) * Finished _toctree.yml and index.md * Finished installation.md * Properly finished installation.md and almost finished quicktour * Finished quicktour * Finished stable diffusion doc * Fixed _toctree.yml * Fixed requests * Fix country code * Properly push	2023-10-24 11:30:04 -07:00
Sayak Paul	77241c48af	[Core] Refactor activation and normalization layers (#5493 ) * move out the activations. * move normalization layers. * add doc. * add doc. * fix: paths * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-24 08:49:43 +05:30
Abhishar Sinha	096f84b05f	Fixed autoencoder typo (#5500 )	2023-10-23 13:59:00 -07:00
YiYi Xu	9e1edfc1ad	fix a few issues in controlnet inpaint pipelines (#5470 ) * add * Update docs/source/en/api/pipelines/controlnet_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-10-23 09:24:51 -10:00
Steven Liu	6b06c30a65	[docs] Fix links (#5499 ) fix links	2023-10-23 20:39:29 +02:00
Kyunghwan Kim	6e608d8a35	Fix typo in controlnet docs (#5486 )	2023-10-23 20:36:35 +02:00
Susheel Thapa	324d18fba2	Chore: Typo fixed in multiple files (#5422 )	2023-10-17 08:17:03 -07:00
Sayak Paul	b4cbbd5ed2	[Examples] Follow up of #5393 (#5420 ) * fix: create_repo() * Empty-Commit	2023-10-17 12:07:39 +05:30
Sayak Paul	cc12f3ec92	[Examples] Update with HFApi (#5393 ) * update training examples to use HFAPI. * update training example. * reflect the changes in the korean version too. * Empty-Commit	2023-10-16 19:34:46 +05:30
Heinz-Alexander Fuetterer	0ea78f9707	chore: fix typos (#5386 ) * chore: fix typos * Update src/diffusers/pipelines/shap_e/renderer.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-10-16 15:23:37 +02:00
Sayak Paul	5495073faf	[Docs] add docs on peft diffusers integration (#5359 ) * add docs on peft diffusers integration/ Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: pacman100 <13534540+pacman100@users.noreply.github.com> * update URLs. * Apply suggestions from code review Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * minor changes * Update docs/source/en/tutorials/using_peft_for_inference.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * reflect the latest changes. * note about update. --------- Co-authored-by: Younes Belkada <49240599+younesbelkada@users.noreply.github.com> Co-authored-by: pacman100 <13534540+pacman100@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-16 18:41:37 +05:30
Steven Liu	cf03f5b718	[docs] Minor fixes (#5369 ) minor fixes	2023-10-11 17:18:29 -07:00
Soumik Rakshit	e00df25aee	Fix `StableDiffusionXLImg2ImgPipeline` creation in sdxl tutorial (#5367 ) fix: StableDiffusionXLImg2ImgPipeline creation in sdxl tutorial	2023-10-11 13:07:53 +02:00
Steven Liu	a139213578	[docs] Create a mask for inpainting (#5322 ) add mask making section	2023-10-10 08:29:27 -07:00
Jonathan Whitaker	35952e61c1	Fix links in docs to adapter code (#5323 ) Update adapter.md to fix links to adapter pipelines	2023-10-09 17:20:12 +02:00
Shubham S Jagtap	306dc6e751	Update README.md (#5267 ) Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-10-06 14:50:18 -07:00
TimothyAlexisVass	016866792d	Minor fixes (#5309 ) tiny fixes	2023-10-06 11:20:06 -07:00
Steven Liu	f0a2c63753	[docs] Improved inpaint docs (#5210 ) * start * finish draft * add section * edits * feedback * make fix-copies * rebase	2023-10-06 09:44:24 -07:00
Sayak Paul	e6faf607f7	add: entry for DDPO support. (#5250 ) * add: entry for DDPO support. * move to training * address steven's comments./	2023-10-05 14:29:00 +02:00
Kadir Nar	84b82a6cb7	✨ [Core] Add FreeU mechanism (#5164 ) * ✨ Added Fourier filter function to upsample blocks * 🔧 Update Fourier_filter for float16 support * ✨ Added UNetFreeUConfig to UNet model for FreeU adaptation 🛠️ * move unet to its original form and add fourier_filter to torch_utils. * implement freeU enable mechanism * implement disable mechanism * resolution index. * correct resolution idx condition. * fix copies. * no need to use resolution_idx in vae. * spell out the kwargs * proper config property * fix attribution setting * place unet hasattr properly. * fix: attribute access. * proper disable * remove validation method. * debug * debug * debug * debug * debug * debug * potential fix. * add: doc. * fix copies * add: tests. * add: support freeU in SDXL. * set default value of resolution idx. * set default values for resolution_idx. * fix copies * fix rest. * fix copies * address PR comments. * run fix-copies * move apply_free_u to utils and other minors. * introduce support for video (unet3D) * minor ups * consistent fix-copies. * consistent stuff * fix-copies * add: rest * add: docs. * fix: tests * fix: doc path * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style up * move to techniques. * add: slow test for sd freeu. * add: slow test for sd freeu. * add: slow test for sd freeu. * add: slow test for sd freeu. * add: slow test for sd freeu. * add: slow test for sd freeu. * add: slow test for video with freeu * add: slow test for video with freeu * add: slow test for video with freeu * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-05 10:37:04 +02:00
1toTree	e46ec5f88f	Zh doc (#4807 ) * Update _toctree.yml * Add files via upload * Update docs/source/zh/stable_diffusion.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-04 08:54:13 -07:00
Mishig	84e5cc596c	Fix doc KO unconditional_image_generation.md (#5236 ) Fix indent issue	2023-09-29 18:49:40 +02:00
Patrick von Platen	ad06e5106e	[Docs] Improve xformers page (#5196 ) [Docs] Improve	2023-09-27 16:02:15 +05:30
Steven Liu	fd1c54abf2	[docs] Improved text-to-image guide (#4938 ) * first draft * edits * feedback	2023-09-26 09:20:19 -07:00
Patrick von Platen	a91a273d0b	[Docs] Try to fix doc builder (#5180 ) * try to fix docs * try to fix docs	2023-09-25 20:24:50 +02:00
Patrick von Platen	d70944bf7f	fix docs	2023-09-25 19:55:49 +02:00
MLRichter	0bc6be6960	Update wuerstchen.md (#5156 )	2023-09-25 18:43:08 +02:00
Patrick von Platen	144c3a8b7c	[Imports] Fix many import bugs and make sure that doc builder CI test works correctly (#5176 ) * [Doc builder] Ensure slow import for doc builder * Apply suggestions from code review * env for doc builder * fix more * [Diffusers] Set import to slow as env variable * fix docs * fix docs * Apply suggestions from code review * Apply suggestions from code review * fix docs * fix docs	2023-09-25 18:06:51 +02:00
Mishig	48664d62b8	Delete duplicatd doc file (#5169 )	2023-09-24 19:58:13 +02:00
Steven Liu	06b316ef5c	[docs] Improved image-to-image guide (#5020 ) * finish first draft * feedback * feedback	2023-09-22 13:20:30 -07:00
Ayush Mangal	157c9011d8	Add BLIP Diffusion (#4388 ) * Add BLIP Diffusion skeleton * Add other model components * Add BLIP2, need to change it for now * Fix pipeline imports * Load pretrained ViT * Make qformer fwd pass same * Replicate fwd passes * Fix device bug * Add accelerate functions * Remove extra functions from Blip2 * Minor bug * Integrate initial review changes * Refactoring * Refactoring * Refactor * Add controlnet * Refactor * Update conversion script * Add image processor * Shift postprocessing to ImageProcessor * Refactor * Fix device * Add fast tests * Update conversion script * Fix checkpoint conversion script * Integrate review changes * Integrate reivew changes * Remove unused functions from test * Reuse HF image processor in Cond image * Create new BlipImageProcessor based on transfomers * Fix image preprocessor * Minor * Minor * Add canny preprocessing * Fix controlnet preprocessing * Fix blip diffusion test * Add controlnet test * Add initial doc strings * Integrate review changes * Refactor * Update examples * Remove DDIM comments * Add copied from for prepare_latents * Add type anotations * Add docstrings * Do black formatting * Add batch support * Make tests pass * Make controlnet tests pass * Black formatting * Fix progress bar * Fix some licensing comments * Fix imports * Refactor controlnet * Make tests faster * Edit examples * Black formatting/Ruff * Add doc * Minor Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Move controlnet pipeline * Make tests faster * Fix imports * Fix formatting * Fix make errors * Fix make errors * Minor * Add suggested doc changes Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Edit docs * Fix 16 bit loading * Update examples * Edit toctree * Update docs/source/en/api/pipelines/blip_diffusion.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Minor * Add tips * Edit examples * Update model paths --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-21 17:05:35 +01:00
Younes Belkada	914586f5b6	[`core`] Use python 3.8 in workflow and setup file (#5122 ) * use python 3.7 instead * Update setup.py	2023-09-20 20:57:06 +02:00
Ruoxi	16b9a57d29	Implement `CustomDiffusionAttnProcessor2_0`. (#4604 ) * Implement `CustomDiffusionAttnProcessor2_0` * Doc-strings and type annotations for `CustomDiffusionAttnProcessor2_0`. (#1) * Update attnprocessor.md * Update attention_processor.py * Interops for `CustomDiffusionAttnProcessor2_0`. * Formatted `attention_processor.py`. * Formatted doc-string in `attention_processor.py` * Conditional CustomDiffusion2_0 for training example. * Remove unnecessary reference impl in comments. * Fix `save_attn_procs`.	2023-09-18 14:49:00 +02:00
Sayak Paul	bfc606301f	add doc around fusing multiple loras. (#5056 ) * add doc around fusing multiple loras. * Apply suggestions from code review Co-authored-by: apolinário <joaopaulo.passos@gmail.com> * address poli's comments. --------- Co-authored-by: apolinário <joaopaulo.passos@gmail.com>	2023-09-18 12:42:58 +01:00
Kashif Rasul	427feb5359	[Wuerstchen] fix typos in docs (#5051 ) * fix typos in docs * fix for issue #5023	2023-09-15 12:53:25 +02:00
Patrick von Platen	abc47dece6	[SDXL, Docs] Textual inversion (#5039 ) * [SDXL, Docs] Textual inversion * Update docs/source/en/using-diffusers/sdxl.md * finish * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-15 12:51:36 +02:00
YiYi Xu	e70cb1243f	[WIP] adding Kandinsky training scripts (#4890 ) * Add files via upload Co-authored-by: Shahmatov Arseniy <62886550+cene555@users.noreply.github.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-09-14 06:58:20 -10:00
Steven Liu	19edca82f1	[docs] Create clearer optimization sections (#4870 ) * refactor * update general optim sections * update more sections * few more updates * benchmark code	2023-09-13 15:21:15 -07:00
Lucain	b954c22a44	Fix broken link in docs (#5015 ) fix broken link	2023-09-13 15:40:25 +02:00
Kashif Rasul	77373c5eb1	[Wuerstchen] fix compel usage (#4999 ) * fix compel usage * minor changes in documentation * fix tests * fix more * fix more * typos * fix tests * formatting --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-13 14:54:59 +02:00
Patrick von Platen	324aef6d14	[SDXL] Add LoRA to all pipelines (#4896 ) * [SDXL] Add LoRA to all pipelines * fix all * fix all * fix all * fix more docs * make style	2023-09-13 11:05:20 +02:00
Sayak Paul	8009272f48	[Tests and Docs] Add a test on serializing pipelines with components containing fused LoRA modules (#4962 ) * add: test to ensure pipelines can be saved with fused lora modules. * add docs about serialization with fused lora. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Empty-Commit * Update docs/source/en/training/lora.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-13 10:01:37 +01:00
Kashif Rasul	16a056a7b5	Wuerstchen fixes (#4942 ) * fix arguments and make example code work * change arguments in combined test * Add default timesteps * style * fixed test * fix broken test * formatting * fix docstrings * fix num_images_per_prompt * fix doc styles * please dont change this * fix tests * rename to DEFAULT_STAGE_C_TIMESTEPS --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com>	2023-09-11 15:47:53 +02:00
Dhruv Nair	b6e0b016ce	Lazy Import for Diffusers (#4829 ) * initial commit * move modules to import struct * add dummy objects and _LazyModule * add lazy import to schedulers * clean up unused imports * lazy import on models module * lazy import for schedulers module * add lazy import to pipelines module * lazy import altdiffusion * lazy import audio diffusion * lazy import audioldm * lazy import consistency model * lazy import controlnet * lazy import dance diffusion ddim ddpm * lazy import deepfloyd * lazy import kandinksy * lazy imports * lazy import semantic diffusion * lazy imports * lazy import stable diffusion * move sd output to its own module * clean up * lazy import t2iadapter * lazy import unclip * lazy import versatile and vq diffsuion * lazy import vq diffusion * helper to fetch objects from modules * lazy import sdxl * lazy import txt2vid * lazy import stochastic karras * fix model imports * fix bug * lazy import * clean up * clean up * fixes for tests * fixes for tests * clean up * remove import of torch_utils from utils module * clean up * clean up * fix mistake import statement * dedicated modules for exporting and loading * remove testing utils from utils module * fixes from merge conflicts * Update src/diffusers/pipelines/kandinsky2_2/__init__.py * fix docs * fix alt diffusion copied from * fix check dummies * fix more docs * remove accelerate import from utils module * add type checking * make style * fix check dummies * remove torch import from xformers check * clean up error message * fixes after upstream merges * dummy objects fix * fix tests * remove unused module import --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-11 09:56:22 +02:00
Sayak Paul	88735249da	[Docs] fix: minor formatting in the Würstchen docs (#4965 ) fix: minor formatting in the docs	2023-09-11 09:12:53 +02:00
Sayak Paul	914c513ee0	[Docs] add t2i adapter entry to overview of training scripts. (#4946 ) add t2i adapter entry to overview of training scripts.	2023-09-09 06:52:11 +05:30
Suraj Patil	dfec61f4b3	[examples] T2IAdapter training script (#4934 ) * add t2i_example script * remove in channels logic * remove comments * remove use_euler arg * add requirements * only use canny example * use datasets * comments * make log_validation consistent with other scripts * add readme * fix title in readme * update check_min_version * change a few minor things. * add doc entry * add: test for t2i adapter training * remove use_auth_token * fix: logged info. * remove tests for now. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-08 10:03:02 +05:30
Sayak Paul	9800cc5ece	[InstructPix2Pix] Fix pipeline implementation and add docs (#4844 ) * initial evident fixes. * instructpix2pix fixes. * add: entry to doc. * address PR feedback. * make fix-copies	2023-09-07 15:34:19 +05:30
Kashif Rasul	541bb6ee63	Würstchen model (#3849 ) * initial * initial * added initial convert script for paella vqmodel * initial wuerstchen pipeline * add LayerNorm2d * added modules * fix typo * use model_v2 * embed clip caption amd negative_caption * fixed name of var * initial modules in one place * WuerstchenPriorPipeline * inital shape * initial denoising prior loop * fix output * add WuerstchenPriorPipeline to __init__.py * use the noise ratio in the Prior * try to save pipeline * save_pretrained working * Few additions * add _execution_device * shape is int * fix batch size * fix shape of ratio * fix shape of ratio * fix output dataclass * tests folder * fix formatting * fix float16 + started with generator * Update pipeline_wuerstchen.py * removed vqgan code * add WuerstchenGeneratorPipeline * fix WuerstchenGeneratorPipeline * fix docstrings * fix imports * convert generator pipeline * fix convert * Work on Generator Pipeline. WIP * Pipeline works with our diffuzz code * apply scale factor * removed vqgan.py * use cosine schedule * redo the denoising loop * Update src/diffusers/models/resnet.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * use torch.lerp * use warp-diffusion org * clip_sample=False, * some refactoring * use model_v3_stage_c * c_cond size * use clip-bigG * allow stage b clip to be None * add dummy * würstchen scheduler * minor changes * set clip=None in the pipeline * fix attention mask * add attention_masks to text_encoder * make fix-copies * add back clip * add text_encoder * gen_text_encoder and tokenizer * fix import * updated pipeline test * undo changes to pipeline test * nip * fix typo * fix output name * set guidance_scale=0 and remove diffuze * fix doc strings * make style * nip * removed unused * initial docs * rename * toc * cleanup * remvoe test script * fix-copies * fix multi images * remove dup * remove unused modules * undo changes for debugging * no new line * remove dup conversion script * fix doc string * cleanup * pass default args * dup permute * fix some tests * fix prepare_latents * move Prior class to modules * offload only the text encoder and vqgan * fix resolution calculation for prior * nip * removed testing script * fix shape * fix argument to set_timesteps * do not change .gitignore * fix resolution calculations + readme * resolution calculation fix + readme * small fixes * Add combined pipeline * rename generator -> decoder * Update .gitignore Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * removed efficient_net * create combined WuerstchenPipeline * make arguments consistent with VQ model * fix var names * no need to return text_encoder_hidden_states * add latent_dim_scale to config * split model into its own file * add WuerschenPipeline to docs * remove unused latent_size * register latent_dim_scale * update script * update docstring * use Attention preprocessor * concat with normed input * fix-copies * add docs * fix test * fix style * add to cpu_offloaded_model * updated type * remove 1-line func * updated type * initial decoder test * formatting * formatting * fix autodoc link * num_inference_steps is int * remove comments * fix example in docs * Update src/diffusers/pipelines/wuerstchen/diffnext.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * rename layernorm to WuerstchenLayerNorm * rename DiffNext to WuerstchenDiffNeXt * added comment about MixingResidualBlock * move paella vq-vae to pipelines' folder * initial decoder test * increased test_float16_inference expected diff * self_attn is always true * more passing decoder tests * batch image_embeds * fix failing tests * set the correct dtype * relax inference test * update prior * added combined pipeline test * faster test * faster test * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix issues from review * update wuerstchen.md + change generator name * resolve issues * fix copied from usage and add back batch_size * fix API * fix arguments * fix combined test * Added timesteps argument + fixes * Update tests/pipelines/test_pipelines_common.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/wuerstchen/test_wuerstchen_prior.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py * up * Fix more * failing tests * up * up * correct naming * correct docs * correct docs * fix test params * correct docs * fix classifier free guidance * fix classifier free guidance * fix more * fix all * make tests faster --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Dominic Rampas <61938694+dome272@users.noreply.github.com>	2023-09-06 16:15:51 +02:00

541 Commits