diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Steven Liu	b64f835ea7	[docs] Add Kandinsky 3 (#5988 ) * add * fix api docs * edits	2023-12-04 10:11:15 -08:00
Linoy Tsaban	880c0fdd36	[advanced dreambooth lora training script][bug_fix] change token_abstraction type to str (#6040 ) * improve help tags * style fix * changes token_abstraction type to string. support multiple concepts for pivotal using a comma separated string. * style fixup * changed logger to warning (not yet available) * moved the token_abstraction parsing to be in the same block as where we create the mapping of identifier to token --------- Co-authored-by: Linoy <linoy@huggingface.co>	2023-12-04 18:38:44 +01:00
RuoyiDu	c36f1c3160	[Community Pipeline] DemoFusion: Democratising High-Resolution Image Generation With No $$$ (#6022 ) * Add files via upload * Update README.md * Update pipeline_demofusion_sdxl.py * Update pipeline_demofusion_sdxl.py * Update examples/community/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-04 19:44:57 +05:30
takuoko	0a08d41961	[Feature] Support IP-Adapter Plus (#5915 ) * Support IP-Adapter Plus * fix format * restore before black format * restore before black format * generic * Refactor PerceiverAttention * format * fix test and refactor PerceiverAttention * generic encode_image * keep attention implementation * merge tests * encode_image backward compatible * code quality * fix controlnet inpaint pipeline * refactor FFN * refactor FFN --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-12-04 12:43:34 +01:00
Levi McCallum	e185084a5d	Add variant argument to dreambooth lora sdxl advanced (#6021 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-04 12:04:15 +01:00
Dhruv Nair	b21729225a	Update Tests Fetcher (#5950 ) * update setup and deps table * update * update * update * up * up * update * up * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * quality fix * fix failure reporting --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-04 12:59:41 +05:30
Parth38	8a812e4e14	Update value_guided_sampling.py (#6027 ) * Update value_guided_sampling.py Changed the scheduler step function as predict_epsilon parameter is not there in latest DDPM Scheduler * Update value_guided_sampling.md Updated a link to a working notebook --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-04 10:36:25 +05:30
gujing	bf92e746c0	fix StableDiffusionTensorRT super args error (#6009 )	2023-12-04 10:06:23 +05:30
Linoy Tsaban	b785a155d6	[advanced dreambooth lora sdxl training script] improve help tags (#6035 ) * improve help tags * style fix --------- Co-authored-by: Linoy <linoy@huggingface.co>	2023-12-04 09:41:25 +05:30
Sayak Paul	d486f0e846	[LoRA serialization] fix: duplicate unet prefix problem. (#5991 ) * fix: duplicate unet prefix problem. * Update src/diffusers/loaders/lora.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-02 21:35:16 +05:30
Sayak Paul	3351270627	[PixArt Tests] remove fast tests from slow suite (#5945 ) remove fast tests from slow suite	2023-12-02 20:58:27 +05:30
Junsong Chen	4520e1221a	adapt PixArtAlphaPipeline for pixart-lcm model (#5974 ) * adapt PixArtAlphaPipeline for pixart-lcm model * remove original_inference_steps from __call__ --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-12-02 13:30:40 +05:30
Long(Tony) Lian	618260409f	LLMGroundedDiffusionPipeline: inherit from DiffusionPipeline and fix peft (#6023 ) * LLMGroundedDiffusionPipeline: inherit from DiffusionPipeline and fix peft * Use main in the revision in the examples * Add "Copied from" statements in comments * Fix formatting with ruff	2023-12-01 09:58:25 -10:00
Patrick von Platen	dadd55fb36	Post Release: v0.24.0 (#5985 ) * Post Release: v0.24.0 * post pone deprecation * post pone deprecation * Add model_index.json	2023-12-01 18:43:44 +01:00
YiYi Xu	1b6c7ea74e	[schedulers] create `self.sigmas` during __init__ (#6006 ) * fix dpm * all scheulers	2023-12-01 07:15:37 -10:00
YiYi Xu	b41f809a4e	[Kandinsky 3.0] Follow-up TODOs (#5944 ) clean-up kendinsky 3.0	2023-12-01 07:14:22 -10:00
Patrick von Platen	0f55c17e17	fix style	2023-12-01 15:59:34 +00:00
Charchit Sharma	5058d27f12	added attention_head_dim, attention_type, resolution_idx (#6011 )	2023-12-01 16:26:58 +01:00
M. Tolga Cangöz	748c1b3ec7	[`Docs`] Update a link (#6014 ) * Update the location of Python's version * Trim trailing whitespace	2023-12-01 16:26:25 +01:00
M. Tolga Cangöz	523507034f	[`logging`] Fix assertion bug (#6012 ) Fix assertion bug	2023-12-01 16:26:04 +01:00
hako-mikan	46c751e970	[Community Pipeline] Regional Prompting Pipeline (#6015 ) * Update README.md * Update README.md * Add files via upload * Update README.md * Update examples/community/README.md --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-12-01 16:22:59 +01:00
Patrick von Platen	bc1d28c888	[From Single File] Allow Text Encoder to be passed (#6020 ) Allow text encoder to be passed	2023-12-01 16:19:04 +01:00
Sayak Paul	af378c1dd1	[Easy] minor edits to setup.py (#5996 ) minor edits to setup	2023-12-01 20:38:46 +05:30
Steven Liu	6ba4c5395f	[docs] Fix SVD video (#6004 ) Update svd.md	2023-12-01 16:07:47 +01:00
Linoy Tsaban	c1e4529541	[advanced_dreambooth_lora_sdxl_tranining_script] readme fix (#6019 ) readme	2023-12-01 15:14:57 +01:00
Linoy Tsaban	d29d97b616	[examples/advanced_diffusion_training] bug fixes and improvements for LoRA Dreambooth SDXL advanced training script (#5935 ) * imports and readme bug fixes * bug fix - ensures text_encoder params are dtype==float32 (when using pivotal tuning) even if the rest of the model is loaded in fp16 * added pivotal tuning to readme * mapping token identifier to new inserted token in validation prompt (if used) * correct default value of --train_text_encoder_frac * change default value of --adam_weight_decay_text_encoder * validation prompt generations when using pivotal tuning bug fix * style fix * textual inversion embeddings name change * style fix * bug fix - stopping text encoder optimization halfway * readme - will include token abstraction and new inserted tokens when using pivotal tuning - added type to --num_new_tokens_per_abstraction * style fix --------- Co-authored-by: Linoy Tsaban <linoy@huggingface.co>	2023-12-01 14:18:43 +01:00
Jongho Choi	7d4a257c7f	Remove a duplicated line? (#6010 ) Update __init__.py	2023-12-01 15:49:36 +05:30
Kristian Mischke	141cd52d56	Fix LLMGroundedDiffusionPipeline super class arguments (#5993 ) * make `requires_safety_checker` a kwarg instead of a positional argument as it's more future-proof * apply `make style` formatting edits * add image_encoder to arguments and pass to super constructor	2023-11-30 10:15:14 -10:00
Steven Liu	f72b28c75b	[docs] Fix video link (#5986 ) Update svd.md	2023-11-29 20:52:25 +01:00
Suraj Patil	ada8109d5b	Fix SVD doc (#5983 ) fix url	2023-11-29 19:55:05 +01:00
Patrick von Platen	b34acbdcbc	[SDXL Turbo] Add some docs (#5982 ) * add diffusers example * add diffusers example * Comment about making it faster * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-11-29 19:52:07 +01:00
Suraj Patil	63f767ef15	Add SVD (#5895 ) * begin model * finish blocks * add_embedding * addition_time_embed_dim * use TimestepEmbedding * fix temporal res block * fix time_pos_embed * fix add_embedding * add conversion script * fix model * up * add new resnet blocks * make forward work * return sample in original shape * fix temb shape in TemporalResnetBlock * add spatio temporal transformers * add vae blocks * fix blocks * update * update * fix shapes in Alphablender and add time activation in res blcok * use new blocks * style * fix temb shape * fix SpatioTemporalResBlock * reuse TemporalBasicTransformerBlock * fix TemporalBasicTransformerBlock * use TransformerSpatioTemporalModel * fix TransformerSpatioTemporalModel * fix time_context dim * clean up * make temb optional * add blocks * rename model * update conversion script * remove UNetMidBlockSpatioTemporal * add in init * remove unused arg * remove unused arg * remove more unsed args * up * up * check for None * update vae * update up/mid blocks for decoder * begin pipeline * adapt scheduler * add guidance scalings * fix norm eps in temporal transformers * add temporal autoencoder * make pipeline run * fix frame decodig * decode in float32 * decode n frames at a time * pass decoding_t to decode_latents * fix decode_latents * vae encode/decode in fp32 * fix dtype in TransformerSpatioTemporalModel * type image_latents same as image_embeddings * allow using differnt eps in temporal block for video decoder * fix default values in vae * pass num frames in decode * switch spatial to temporal for mixing in VAE * fix num frames during split decoding * cast alpha to sample dtype * fix attention in MidBlockTemporalDecoder * fix typo * fix guidance_scales dtype * fix missing activation in TemporalDecoder * skip_post_quant_conv * add vae conversion * style * take guidance scale as input * up * allow passing PIL to export_video * accept fps as arg * add pipeline and vae in init * remove hack * use AutoencoderKLTemporalDecoder * don't scale image latents * add unet tests * clean up unet * clean TransformerSpatioTemporalModel * add slow svd test * clean up * make temb optional in Decoder mid block * fix norm eps in TransformerSpatioTemporalModel * clean up temp decoder * clean up * clean up * use c_noise values for timesteps * use math for log * update * fix copies * doc * upcast vae * update forward pass for gradient checkpointing * make added_time_ids is tensor * up * fix upcasting * remove post quant conv * add _resize_with_antialiasing * fix _compute_padding * cleanup model * more cleanup * more cleanup * more cleanup * remove freeu * remove attn slice * small clean * up * up * remove extra step kwargs * remove eta * remove dropout * remove callback * remove merge factor args * clean * clean up * move to dedicated folder * remove attention_head_dim * docstr and small fix * update unet doc strings * rename decoding_t * correct linting * store c_skip and c_out * cleanup * clean TemporalResnetBlock * more cleanup * clean up vae * clean up * begin doc * more cleanup * up * up * doc * Improve * better naming * better naming * better naming * better naming * better naming * better naming * better naming * better naming * Apply suggestions from code review * Default chunk size to None * add example * Better * Apply suggestions from code review * update doc * Update src/diffusers/pipelines/stable_diffusion_video/pipeline_stable_diffusion_video.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * style * Get torch compile working * up * rename * fix doc * add chunking * torch compile * torch compile * add modelling outputs * torch compile * Improve chunking * Apply suggestions from code review * Update docs/source/en/using-diffusers/svd.md * Close diff tag * remove slicing * resnet docstr * add docstr in resnet * rename * Apply suggestions from code review * update tests * Fix output type latents * fix more * fix more * Update docs/source/en/using-diffusers/svd.md * fix more * add pipeline tests * remove unused arg * clean up * make sure get_scaling receives tensors * fix euler scheduler * fix get_scalings * simply euler for now * remove old test file * use randn_tensor to create noise * fix device for rand tensor * increase expected_max_difference * fix test_inference_batch_single_identical * actually fix test_inference_batch_single_identical * disable test_save_load_float16 * skip test_float16_inference * skip test_inference_batch_single_identical * fix test_xformers_attention_forwardGenerator_pass * Apply suggestions from code review * update StableVideoDiffusionPipelineSlowTests * update image * add diffusers example * fix more --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: apolinário <joaopaulo.passos@gmail.com>	2023-11-29 19:13:36 +01:00
PENGUINLIONG	d1b2a1a957	Fixed custom module importing on Windows (#5891 ) * Fixed custom module importing on Windows Windows use back slash and `os.path.join()` follows that convention. * Apply suggestions from code review Co-authored-by: Lucain <lucainp@gmail.com> * Update pipeline_utils.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Lucain <lucainp@gmail.com>	2023-11-29 16:33:04 +01:00
Kashif Rasul	01782c220e	[Wuerstchen] Adapt lora training example scripts to use PEFT (#5959 ) * Adapt lora example scripts to use PEFT * add to_out.0	2023-11-29 16:18:20 +01:00
vahramtadevosyan	d63a498c3b	[Pipeline] Add TextToVideoZeroSDXLPipeline (#4695 ) * integrated sdxl for the text2video-zero pipeline * make fix-copies * fixed CI issues * make fix-copies * added docs and `copied from` statements * added fast tests * made a small change in docs * quality+style check fix * updated docs. added controlnet inference with sdxl * added device compatibility for fast tests * fixed docstrings * changing vae upcasting * remove torch.empty_cache to speed up inference Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * made fast tests to run on dummy models only, fixed copied from statements * fixed testing utils imports * Added bullet points for SDXL support * fixed formatting & quality * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/text_to_video/test_text_to_video_zero_sdxl.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fixed minor error for merging * fixed updates of sdxl * made fast tests inherit from `PipelineTesterMixin` and run in 3-4secs on CPU * make style && make quality * reimplemented fast tests w/o default attn processor * make style & make quality * make fix-copies * make fix-copies * fixed docs * make style & make quality & make fix-copies * bug fix in cross attention * make style && make quality * make fix-copies * fix gpu issues * make fix-copies * updated pipeline signature --------- Co-authored-by: Vahram <vahram.tadevosyan@lambda-loginnode02.cm.cluster> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2023-11-29 16:10:43 +01:00
Marko Kostiv	6a4aad43dc	Controlnet ssd 1b support (#5779 ) * Add SSD-1B support for controlnet model * Add conditioning_channels into ControlNet init from unet * Fix black formatting * Isort fixes * Adds SSD-1B controlnet pipeline test with UNetMidBlock2D as mid block * Overrides failing ssd-1b tests * Fixes tests after main branch update * Fixes code quality checks --------- Co-authored-by: Marko Kostiv <marko@linearity.io> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-11-29 16:10:01 +01:00
Steven Liu	ddd8bd53ed	[docs] LCM training (#5796 ) * first draft * feedback	2023-11-29 16:08:05 +01:00
JuanCarlosPi	9f7b2cf2dc	Support of ip-adapter to the StableDiffusionControlNetInpaintPipeline (#5887 ) * Change pipeline_controlnet_inpaint.py to add ip-adapter support. Changes are similar to those in pipeline_controlnet * Change tests for the StableDiffusionControlNetInpaintPipeline by adding image_encoder: None * Update src/diffusers/pipelines/controlnet/pipeline_controlnet_inpaint.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-11-29 16:00:24 +01:00
Sayak Paul	895c4b704b	[LoRA refactor] move several state dict conversion utils out of lora.py (#5955 ) * move several state dict conversion utils out of lora.py * check * check * check * check * check * check * check * revert back * check * check * again check * maybe fix? * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-11-29 20:24:21 +05:30
Linh Nguyen	636feba552	Rename output_dir argument (#5916 ) Fix typo in output_dir argument: "text-inversion-model" → "dreambooth-model"	2023-11-29 15:47:16 +01:00
Andrés Romero	79dc7df03e	[bug fix] Inpainting for MultiAdapter (#5922 ) * bug in MultiAdapter for Inpainting * adapter_input is a list for MultiAdapter --------- Co-authored-by: andres <andres@hax.ai> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-11-29 15:46:26 +01:00
Charchit Sharma	6031ecbd23	added doc for Kandinsky3.0 (#5937 ) * added en doc for Kandinsky3.0 * required changes * Update docs/source/en/api/pipelines/kandinsky3.md * Update docs/source/en/api/pipelines/kandinsky3.md * Update docs/source/en/api/pipelines/kandinsky3.md --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-11-29 15:32:00 +01:00
Sayak Paul	fdd003d8e2	[Tests] Refactor `test_examples.py` for better readability (#5946 ) * control and custom diffusion * dreambooth * instructpix2pix and dreambooth ckpting * t2i adapters. * text to image ft * textual inversion * unconditional * workflows * import fix * fix import	2023-11-29 18:43:59 +05:30
Steven Liu	172acc98b9	[docs] Update pipeline list (#5952 ) add to list	2023-11-29 14:08:39 +01:00
estelleafl	5ae3c3a56b	[ldm3d] Ldm3d upscaler to community pipeline (#5870 ) --------- Co-authored-by: Aflalo <estellea@isl-gpu27.rr.intel.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-11-28 09:00:39 -10:00
Soumik Rakshit	21bc59ab24	fix: minor typo in docstring (#5961 )	2023-11-28 18:18:34 +05:30
Steven Liu	50a749e909	[docs] Fix space (#5898 ) * fix * minor edits	2023-11-27 11:50:59 -08:00
YiYi Xu	d9075be494	[load_textual_inversion]: allow multiple tokens (#5837 ) Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-11-27 06:52:36 -10:00
Patrick von Platen	b135b6e905	[From_pretrained] Fix warning (#5948 )	2023-11-27 14:35:19 +01:00
T. Xu	14a0d21d2e	[Community Pipeline] Diffusion Posterior Sampling for General Noisy Inverse Problems (#5939 ) * [community pipeline] dps impl * add type checking * pass ruff check * ruff formatter	2023-11-27 14:29:42 +01:00

1 2 3 4 5 ...

3334 Commits