diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Will Berman	2fd46405cd	consistency decoder (#5694 ) * consistency decoder * rename * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update src/diffusers/pipelines/consistency_models/pipeline_consistency_models.py * uP * Apply suggestions from code review * uP * uP * uP --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-11-09 12:21:41 +01:00
Sayak Paul	d61889fc17	[Feat] PixArt-Alpha (#5642 ) * init pixart alpha pipeline * fix: import * script * script * script * add: vae to the pipeline * add: vae_scale_factor * add: checkpoint_path * clean conversion script a bit. * size embeddings. * fix: size embedding * update scrip * support for interpolation of position embedding. * support for conditioning. * .. * .. * .. * final layer * final layer * align if encode_prompt * support for caption embedding * refactor * refactor * refactor * start cross attention * start cross attention * cross_attention_dim * cross * cross * support for resolution and aspect_ratio * support for caption projection * refactor patch embeddings * batch_size * up * commit * commit * commit. * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze * squeeze. * squeeze. * fix final block./ * fix final block./ * fix final block./ * clean * fix: interpolation scale. * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging' * debugging * debugging * debugging * debugging * debugging * debugging * debugging * make --checkpoint_path non-required. * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * remove num_tokens * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * timesteps -> timestep * debug * debug * update conversion script. * update conversion script. * update conversion script. * debug * debug * debug * clean * debug * debug * debug * debug * debug * debug * debug * debug * deug * debug * debug * debug * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * fix * clean * fix * fix * boom * boom * some changes * boom * save * up * remove i * fix more tests * DPMSolverMultistepScheduler * fix * offloading * fix conversion script * fix conversion script * remove print * remove support for negative prompt embeds. * typo. * remove extra kwargs * bring conversion script to where it was * fix * trying mu luck * trying my luck again * again * again * again * clean up * up * up * update example * support for 512 * remove spacing * finalize docs. * test debug * fix: assertion values. * debug * debug * debug * fix: repeat * remove prints. * Apply suggestions from code review * Apply suggestions from code review * Correct more * Apply suggestions from code review * Change all * Clean more * fix more * Fix more * Fix more * Correct more * address patrick's comments. * remove unneeded args * clean up pipeline. * sty;e * make the use of additional conditions better conditioned. * None better * dtype * height and width validation * add a note about size brackets. * fix * spit out slow test outputs. * fix? * fix optional test * fix more * remove unneeded comment * debug --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-11-06 08:40:04 +01:00
Sayak Paul	60c5eb5877	[Easy] clean up the LCM docstrings. (#5637 ) * clean up the LCM docstrings. * clean up * fix: examples * Apply suggestions from code review	2023-11-03 12:14:48 +01:00
Sayak Paul	c84982a804	[Easy] Minor AnimateDiff Doc nits (#5640 ) minor	2023-11-03 16:27:54 +05:30
Dhruv Nair	84e7bb875d	Update animatediff docs to include section on Motion LoRAs (#5639 ) update animatediff docs	2023-11-03 15:53:59 +05:30
Patrick von Platen	072e00897a	[LCM] Make sure img2img works (#5632 ) * [LCM] Clean up implementations * Add all * correct more * correct more * finish * up	2023-11-02 19:50:47 +01:00
Dhruv Nair	2a8cf8e39f	Animatediff Proposal (#5413 ) * draft design * clean up * clean up * clean up * clean up * clean up * clean up * clean up * clean up * clean up * update pipeline * clean up * clean up * clean up * add tests * change motion block * clean up * clean up * clean up * update * update * update * update * update * update * update * update * clean up * update * update * update model test * update * update * update * update * make style * update * fix embeddings * update * merge upstream * max fix copies * fix bug * fix mistake * add docs * update * clean up * update * clean up * clean up * fix docstrings * fix docstrings * update * update * clean up * update	2023-11-02 15:04:03 +01:00
Steven Liu	75ea54a151	[docs] Kandinsky guide (#4555 ) * kandinsky 2.1 first draft * add kandinsky 2.2 * fix identical section headers * try hfoptions syntax * add img2img * add inpaint * add interpolate * fix tag * more cleanups * typo * update hfoptions id * align hfoptions tags	2023-11-01 15:36:22 -07:00
Steven Liu	d1eb14bc35	[docs] Lu lambdas (#5602 ) lu lambdas	2023-11-01 11:47:11 -07:00
M. Tolga Cangöz	442017ccc8	[Docs] Fix typos (#5583 ) * Add Copyright info * Fix typos, improve, update * Update deepfloyd_if.md * Update ldm3d_diffusion.md * Update opt_overview.md	2023-10-31 10:04:08 -07:00
Steven Liu	595ba6f786	[docs] Internal classes API (#5513 ) * internal classes api * add internal class overview * fix toctree	2023-10-27 09:48:41 -07:00
YiYi Xu	f912f39b50	correct checkpoint in kandinsky2.2 doc page (#5550 ) update checkpoint Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-10-27 08:49:15 +05:30
Chengxi Guo	dcbfe662ef	fix typo (#5505 ) Signed-off-by: mymusise <mymusise1@gmail.com>	2023-10-24 17:14:05 -07:00
dg845	958e17dada	Add Latent Consistency Models Pipeline (#5448 ) * initial commit for LatentConsistencyModelPipeline and LCMScheduler based on the community pipeline * Add callback and freeu support. * apply suggestions from review * Clean up LCMScheduler * Remove timeindex argument to LCMScheduler.step. * Add support for clipping or thresholding the predicted original sample. * Remove unused methods and arguments in LCMScheduler. * Improve comment about (lack of) negative prompt support. * Change input guidance_scale to match the StableDiffusionPipeline (Imagen) CFG formulation. * Move lcm_origin_steps from pipeline __call__ to LCMScheduler.__init__/config (as origin_steps). * Fix typo when clipping/thresholding in LCMScheduler. * Add some initial LCMScheduler tests. * add type annotations from review * Fix type annotation bug. * Override test_add_noise_device in LCMSchedulerTest since hardcoded timesteps doesn't work under default settings. * Add generator argument pipeline prepare_latents call. * Cast LCMScheduler.timesteps to long in set_timesteps. * Add onestep and multistep full loop scheduler tests. * Set default height/width to None and don't hardcode guidance scale embedding dim. * Add initial LatentConsistencyPipeline fast and slow tests. * Add initial documentation for LatentConsistencyModelPipeline and LCMScheduler. * Make remaining failing fast tests pass. * make style * Make original_inference_steps configurable from pipeline __call__ again. * make style * Remove guidance_rescale arg from pipeline __call__ since LCM currently doesn't support CFG. * Make LCMScheduler defaults match config of LCM_Dreamshaper_v7 checkpoint. * Fix LatentConsistencyPipeline slow tests and add dummy expected slices. * Add checks for original_steps in LCMScheduler.set_timesteps. * make fix-copies * Improve LatentConsistencyModelPipeline docs. * Apply suggestions from code review Co-authored-by: Aryan V S <avs050602@gmail.com> * Apply suggestions from code review Co-authored-by: Aryan V S <avs050602@gmail.com> * Apply suggestions from code review Co-authored-by: Aryan V S <avs050602@gmail.com> * Update src/diffusers/schedulers/scheduling_lcm.py * Apply suggestions from code review Co-authored-by: Aryan V S <avs050602@gmail.com> * finish --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Aryan V S <avs050602@gmail.com>	2023-10-24 21:06:02 +02:00
Steven Liu	7c3a75a1ce	[docs] General updates (#5378 ) * first draft * feedback * feedback	2023-10-24 11:51:55 -07:00
Sayak Paul	77241c48af	[Core] Refactor activation and normalization layers (#5493 ) * move out the activations. * move normalization layers. * add doc. * add doc. * fix: paths * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-10-24 08:49:43 +05:30
YiYi Xu	9e1edfc1ad	fix a few issues in controlnet inpaint pipelines (#5470 ) * add * Update docs/source/en/api/pipelines/controlnet_sdxl.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-10-23 09:24:51 -10:00
Steven Liu	6b06c30a65	[docs] Fix links (#5499 ) fix links	2023-10-23 20:39:29 +02:00
Heinz-Alexander Fuetterer	0ea78f9707	chore: fix typos (#5386 ) * chore: fix typos * Update src/diffusers/pipelines/shap_e/renderer.py Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com> --------- Co-authored-by: psychedelicious <4822129+psychedelicious@users.noreply.github.com>	2023-10-16 15:23:37 +02:00
Jonathan Whitaker	35952e61c1	Fix links in docs to adapter code (#5323 ) Update adapter.md to fix links to adapter pipelines	2023-10-09 17:20:12 +02:00
Patrick von Platen	a91a273d0b	[Docs] Try to fix doc builder (#5180 ) * try to fix docs * try to fix docs	2023-09-25 20:24:50 +02:00
Patrick von Platen	d70944bf7f	fix docs	2023-09-25 19:55:49 +02:00
MLRichter	0bc6be6960	Update wuerstchen.md (#5156 )	2023-09-25 18:43:08 +02:00
Patrick von Platen	144c3a8b7c	[Imports] Fix many import bugs and make sure that doc builder CI test works correctly (#5176 ) * [Doc builder] Ensure slow import for doc builder * Apply suggestions from code review * env for doc builder * fix more * [Diffusers] Set import to slow as env variable * fix docs * fix docs * Apply suggestions from code review * Apply suggestions from code review * fix docs * fix docs	2023-09-25 18:06:51 +02:00
Ayush Mangal	157c9011d8	Add BLIP Diffusion (#4388 ) * Add BLIP Diffusion skeleton * Add other model components * Add BLIP2, need to change it for now * Fix pipeline imports * Load pretrained ViT * Make qformer fwd pass same * Replicate fwd passes * Fix device bug * Add accelerate functions * Remove extra functions from Blip2 * Minor bug * Integrate initial review changes * Refactoring * Refactoring * Refactor * Add controlnet * Refactor * Update conversion script * Add image processor * Shift postprocessing to ImageProcessor * Refactor * Fix device * Add fast tests * Update conversion script * Fix checkpoint conversion script * Integrate review changes * Integrate reivew changes * Remove unused functions from test * Reuse HF image processor in Cond image * Create new BlipImageProcessor based on transfomers * Fix image preprocessor * Minor * Minor * Add canny preprocessing * Fix controlnet preprocessing * Fix blip diffusion test * Add controlnet test * Add initial doc strings * Integrate review changes * Refactor * Update examples * Remove DDIM comments * Add copied from for prepare_latents * Add type anotations * Add docstrings * Do black formatting * Add batch support * Make tests pass * Make controlnet tests pass * Black formatting * Fix progress bar * Fix some licensing comments * Fix imports * Refactor controlnet * Make tests faster * Edit examples * Black formatting/Ruff * Add doc * Minor Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Move controlnet pipeline * Make tests faster * Fix imports * Fix formatting * Fix make errors * Fix make errors * Minor * Add suggested doc changes Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Edit docs * Fix 16 bit loading * Update examples * Edit toctree * Update docs/source/en/api/pipelines/blip_diffusion.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Minor * Add tips * Edit examples * Update model paths --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-09-21 17:05:35 +01:00
Ruoxi	16b9a57d29	Implement `CustomDiffusionAttnProcessor2_0`. (#4604 ) * Implement `CustomDiffusionAttnProcessor2_0` * Doc-strings and type annotations for `CustomDiffusionAttnProcessor2_0`. (#1) * Update attnprocessor.md * Update attention_processor.py * Interops for `CustomDiffusionAttnProcessor2_0`. * Formatted `attention_processor.py`. * Formatted doc-string in `attention_processor.py` * Conditional CustomDiffusion2_0 for training example. * Remove unnecessary reference impl in comments. * Fix `save_attn_procs`.	2023-09-18 14:49:00 +02:00
Kashif Rasul	427feb5359	[Wuerstchen] fix typos in docs (#5051 ) * fix typos in docs * fix for issue #5023	2023-09-15 12:53:25 +02:00
Lucain	b954c22a44	Fix broken link in docs (#5015 ) fix broken link	2023-09-13 15:40:25 +02:00
Kashif Rasul	77373c5eb1	[Wuerstchen] fix compel usage (#4999 ) * fix compel usage * minor changes in documentation * fix tests * fix more * fix more * typos * fix tests * formatting --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-13 14:54:59 +02:00
Patrick von Platen	324aef6d14	[SDXL] Add LoRA to all pipelines (#4896 ) * [SDXL] Add LoRA to all pipelines * fix all * fix all * fix all * fix more docs * make style	2023-09-13 11:05:20 +02:00
Kashif Rasul	16a056a7b5	Wuerstchen fixes (#4942 ) * fix arguments and make example code work * change arguments in combined test * Add default timesteps * style * fixed test * fix broken test * formatting * fix docstrings * fix num_images_per_prompt * fix doc styles * please dont change this * fix tests * rename to DEFAULT_STAGE_C_TIMESTEPS --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com>	2023-09-11 15:47:53 +02:00
Dhruv Nair	b6e0b016ce	Lazy Import for Diffusers (#4829 ) * initial commit * move modules to import struct * add dummy objects and _LazyModule * add lazy import to schedulers * clean up unused imports * lazy import on models module * lazy import for schedulers module * add lazy import to pipelines module * lazy import altdiffusion * lazy import audio diffusion * lazy import audioldm * lazy import consistency model * lazy import controlnet * lazy import dance diffusion ddim ddpm * lazy import deepfloyd * lazy import kandinksy * lazy imports * lazy import semantic diffusion * lazy imports * lazy import stable diffusion * move sd output to its own module * clean up * lazy import t2iadapter * lazy import unclip * lazy import versatile and vq diffsuion * lazy import vq diffusion * helper to fetch objects from modules * lazy import sdxl * lazy import txt2vid * lazy import stochastic karras * fix model imports * fix bug * lazy import * clean up * clean up * fixes for tests * fixes for tests * clean up * remove import of torch_utils from utils module * clean up * clean up * fix mistake import statement * dedicated modules for exporting and loading * remove testing utils from utils module * fixes from merge conflicts * Update src/diffusers/pipelines/kandinsky2_2/__init__.py * fix docs * fix alt diffusion copied from * fix check dummies * fix more docs * remove accelerate import from utils module * add type checking * make style * fix check dummies * remove torch import from xformers check * clean up error message * fixes after upstream merges * dummy objects fix * fix tests * remove unused module import --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-09-11 09:56:22 +02:00
Sayak Paul	88735249da	[Docs] fix: minor formatting in the Würstchen docs (#4965 ) fix: minor formatting in the docs	2023-09-11 09:12:53 +02:00
Sayak Paul	9800cc5ece	[InstructPix2Pix] Fix pipeline implementation and add docs (#4844 ) * initial evident fixes. * instructpix2pix fixes. * add: entry to doc. * address PR feedback. * make fix-copies	2023-09-07 15:34:19 +05:30
Kashif Rasul	541bb6ee63	Würstchen model (#3849 ) * initial * initial * added initial convert script for paella vqmodel * initial wuerstchen pipeline * add LayerNorm2d * added modules * fix typo * use model_v2 * embed clip caption amd negative_caption * fixed name of var * initial modules in one place * WuerstchenPriorPipeline * inital shape * initial denoising prior loop * fix output * add WuerstchenPriorPipeline to __init__.py * use the noise ratio in the Prior * try to save pipeline * save_pretrained working * Few additions * add _execution_device * shape is int * fix batch size * fix shape of ratio * fix shape of ratio * fix output dataclass * tests folder * fix formatting * fix float16 + started with generator * Update pipeline_wuerstchen.py * removed vqgan code * add WuerstchenGeneratorPipeline * fix WuerstchenGeneratorPipeline * fix docstrings * fix imports * convert generator pipeline * fix convert * Work on Generator Pipeline. WIP * Pipeline works with our diffuzz code * apply scale factor * removed vqgan.py * use cosine schedule * redo the denoising loop * Update src/diffusers/models/resnet.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * use torch.lerp * use warp-diffusion org * clip_sample=False, * some refactoring * use model_v3_stage_c * c_cond size * use clip-bigG * allow stage b clip to be None * add dummy * würstchen scheduler * minor changes * set clip=None in the pipeline * fix attention mask * add attention_masks to text_encoder * make fix-copies * add back clip * add text_encoder * gen_text_encoder and tokenizer * fix import * updated pipeline test * undo changes to pipeline test * nip * fix typo * fix output name * set guidance_scale=0 and remove diffuze * fix doc strings * make style * nip * removed unused * initial docs * rename * toc * cleanup * remvoe test script * fix-copies * fix multi images * remove dup * remove unused modules * undo changes for debugging * no new line * remove dup conversion script * fix doc string * cleanup * pass default args * dup permute * fix some tests * fix prepare_latents * move Prior class to modules * offload only the text encoder and vqgan * fix resolution calculation for prior * nip * removed testing script * fix shape * fix argument to set_timesteps * do not change .gitignore * fix resolution calculations + readme * resolution calculation fix + readme * small fixes * Add combined pipeline * rename generator -> decoder * Update .gitignore Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * removed efficient_net * create combined WuerstchenPipeline * make arguments consistent with VQ model * fix var names * no need to return text_encoder_hidden_states * add latent_dim_scale to config * split model into its own file * add WuerschenPipeline to docs * remove unused latent_size * register latent_dim_scale * update script * update docstring * use Attention preprocessor * concat with normed input * fix-copies * add docs * fix test * fix style * add to cpu_offloaded_model * updated type * remove 1-line func * updated type * initial decoder test * formatting * formatting * fix autodoc link * num_inference_steps is int * remove comments * fix example in docs * Update src/diffusers/pipelines/wuerstchen/diffnext.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * rename layernorm to WuerstchenLayerNorm * rename DiffNext to WuerstchenDiffNeXt * added comment about MixingResidualBlock * move paella vq-vae to pipelines' folder * initial decoder test * increased test_float16_inference expected diff * self_attn is always true * more passing decoder tests * batch image_embeds * fix failing tests * set the correct dtype * relax inference test * update prior * added combined pipeline test * faster test * faster test * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix issues from review * update wuerstchen.md + change generator name * resolve issues * fix copied from usage and add back batch_size * fix API * fix arguments * fix combined test * Added timesteps argument + fixes * Update tests/pipelines/test_pipelines_common.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/wuerstchen/test_wuerstchen_prior.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/pipelines/wuerstchen/pipeline_wuerstchen_combined.py * up * Fix more * failing tests * up * up * correct naming * correct docs * correct docs * fix test params * correct docs * fix classifier free guidance * fix classifier free guidance * fix more * fix all * make tests faster --------- Co-authored-by: Dominic Rampas <d6582533@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Dominic Rampas <61938694+dome272@users.noreply.github.com>	2023-09-06 16:15:51 +02:00
Steven Liu	946bb53c56	[docs] Add stronger warning for SDXL height/width (#4867 ) * add size warning * feedback	2023-09-05 10:50:42 -07:00
Steven Liu	2c45a53aef	[docs] Shap-E guide (#4700 ) * first draft * fixes * more fixes * fix toctree	2023-09-01 19:52:41 -07:00
Steven Liu	22ea35cf23	[docs] DiffEdit guide (#4722 ) * first draft * minor edits	2023-09-01 14:18:41 -07:00
Pedro Cuenca	60d259add1	Fix link from API to using-diffusers (#4856 ) * Fix link from API to using-diffusers * Fix link	2023-09-01 15:05:01 +02:00
Nguyễn Công Tú Anh	38466c369f	Add GLIGEN Text Image implementation (#4777 ) * Add GLIGEN Text Image implementation * add style transfer from image * fix check_repository_consistency * add convert script GLIGEN model to Diffusers * rename attention type * fix style code * remove PositionNetTextImage * Revert "fix check_repository_consistency" This reverts commit `15f098c96e`. * change attention type name * update docs for GLIGEN * change examples with hf-document-image * fix style * add CLIPImageProjection for GLIGEN * Add new encode_prompt, load project matrix in pipe init * move CLIPImageProjection to stable_diffusion * add comment	2023-09-01 15:48:01 +05:30
Steven Liu	aedd78767c	[docs] ControlNet guide (#4640 ) * first draft * finish first draft * feedback and remove sections from API pages * clean docstrings * add full code example	2023-08-31 10:02:02 -04:00
Steven Liu	a1fdfca36f	[docs] SDXL (#4428 ) * first draft * reorg toctree * note about minsdxl * feedback * fix * micro-conditionings * add tip * fix section levels * d'oh fix pipeline names * feedback * remove old section	2023-08-30 11:34:55 -04:00
Chong Mou	12358b986f	add models for T2I-Adapter-XL (#4696 ) * T2I-Adapter-XL * update * update * add pipeline * modify pipeline * modify pipeline * modify pipeline * modify pipeline * modify pipeline * modify modeling_text_unet * fix styling. * fix: copies. * adapter settings * new test case * new test case * debugging * debugging * debugging * debugging * debugging * debugging * debugging * debugging * revert prints. * new test case * remove print * org test case * add test_pipeline * styling. * fix copies. * modify test parameter * style. * add adapter-xl doc * double quotes in docs * Fix potential type mismatch * style. --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2023-08-29 10:34:07 +05:30
Sayak Paul	3be0ff9056	[Core] Support negative conditions in SDXL (#4774 ) * add: support negative conditions. * fix: key * add: tests * address PR feedback. * add documentation * add img2img support. * add inpainting support. * ad controlnet support * Apply suggestions from code review * modify wording in the doc.	2023-08-26 09:13:44 +05:30
Sanchit Gandhi	b1290d3fb8	Convert MusicLDM (#4579 ) * from audioldm * fix vae * move to new pipeline * copied from audioldm * remove redundant control flow * iterate * fix docstring * finish pipeline * tests: from audioldm2 * iterate * finish fast tests * finish slow integration tests * add docs * remove dtype test * update toctree * "copied from" in conversion (where possible) * Update docs/source/en/api/pipelines/musicldm.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix docstring * make nightly * style * fix dtype test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-25 13:31:00 +01:00
Sanchit Gandhi	24c5e7708b	[AudioLDM2] Doc fixes (#4739 ) * [AudioLDM2] Doc fixes * update docstrings * fix unet docstring * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-24 07:20:27 +05:30
Sanchit Gandhi	05b0ec63bc	[AudioLDM Docs] Fix docs for output (#4737 )	2023-08-23 18:02:11 +02:00
dg845	f75b8aa9dd	[docs] Add note in UniDiffusers Doc about PyTorch 1.X numerical stability issue (#4703 ) * Add note regarding UniDiffuser pipeline numerical stability issues on PyTorch 1.X * Use the doc-builder warning tag.	2023-08-22 07:12:06 +05:30
Sanchit Gandhi	7a24977ce3	Add AudioLDM 2 (#4549 ) * from audioldm * unet down + mid * vae, clap, flan-t5 * start sequence audio mae * iterate on audioldm encoder * finish encoder * finish weight conversion * text pre-processing * gpt2 pre-processing * fix projection model * working * unet equivalence * finish in base * add unet cond * finish unet * finish custom unet * start clean-up * revert base unet changes * refactor pre-processing * tests: from audioldm * fix some tests * more fixes * iterate on tests * make fix copies * harden fast tests * slow integration tests * finish tests * update checkpoint * update copyright * docs * remove outdated method * add docstring * make style * remove decode latents * enable cpu offload * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer) * more clean up * more refactor * build pr docs * Update docs/source/en/api/pipelines/audioldm2.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * small clean * tidy conversion * update for large checkpoint * generate -> generate_language_model * full clap model * shrink clap-audio in tests * fix large integration test * fix fast tests * use generation config * make style * update docs * finish docs * finish doc * update tests * fix last test * syntax * finalise tests * refactor projection model in prep for TTS * fix fast tests * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-08-21 12:34:21 +01:00
Sayak Paul	5333f4c0ec	make things clear in the controlnet sdxl doc. (#4644 )	2023-08-17 09:04:28 +05:30

1 2 3 4 5

202 Commits