* add
* clean
* up
* clean up more
* fix more tests
* Improve docs further
* improve
* more docs fixes
* Improve docs more
* Update src/diffusers/models/unet_2d_condition.py
* fix
* up
* update doc links
* make fix-copies
* add safety checker and watermarker to stage 3 doc page code snippets
* speed optimizations docs
* memory optimization docs
* make style
* add watermarking snippets to doc string examples
* make style
* use pt_to_pil helper functions in doc strings
* skip mps tests
* Improve safety
* make style
* new logic
* fix
* fix bad onnx design
* make new stable diffusion upscale pipeline model arguments optional
* define has_nsfw_concept when non-pil output type
* lowercase the linked notebook name
---------
Co-authored-by: William Berman <WLBberman@gmail.com>
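For reference, a minimal sketch of the doc-snippet pattern these commits touch: the stage 1 safety checker and watermarker are reused for the stage 3 upscaler, and intermediate torch tensors are converted to PIL with the `pt_to_pil` helper. Model IDs and keyword arguments here are assumptions based on the IF doc pages, not copied from them.

```python
import torch
from diffusers import DiffusionPipeline
from diffusers.utils import pt_to_pil

# Stage 1 text-to-image pipeline (model ID assumed).
stage_1 = DiffusionPipeline.from_pretrained("DeepFloyd/IF-I-XL-v1.0", torch_dtype=torch.float16)

# Stage 3 is a Stable Diffusion x4 upscaler; pass the IF safety modules through
# so the safety checker and watermarker also run on the upscaled output.
safety_modules = {
    "feature_extractor": stage_1.feature_extractor,
    "safety_checker": stage_1.safety_checker,
    "watermarker": stage_1.watermarker,
}
stage_3 = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-x4-upscaler", **safety_modules, torch_dtype=torch.float16
)

# Convert stage 1's torch output to PIL for saving/inspection.
image = stage_1(prompt="a photo of a corgi", output_type="pt").images
pt_to_pil(image)[0].save("if_stage_1.png")
```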
When the token used for textual inversion does not have any special symbols (e.g. it is not surrounded by <>), the tokenizer does not properly split the replacement tokens. Adding a space for the padding tokens fixes this.
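An illustrative sketch of the problem and the fix (the helper name and expansion scheme here are hypothetical, mirroring how multi-vector placeholder tokens are expanded):

```python
# Without the separating space, a token with no <> delimiters such as "grug"
# would be expanded into "gruggrug_1grug_2", which the tokenizer cannot split
# back into the individual placeholder tokens.
def expand_placeholder(prompt: str, token: str, num_vectors: int) -> str:
    replacement = token + "".join(f" {token}_{i}" for i in range(1, num_vectors))
    return prompt.replace(token, replacement)

print(expand_placeholder("a photo of grug", "grug", 3))
# -> "a photo of grug grug_1 grug_2"
```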
* Add karras pattern to discrete heun scheduler
* Add integration test
* Fix failing PyTorch CI test on M1 (mps)
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
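A short sketch of what the Karras pattern enables on the Heun scheduler; the flag name matches this change, the model choice is just an example:

```python
import torch
from diffusers import StableDiffusionPipeline, HeunDiscreteScheduler

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
# Re-create the scheduler with the Karras sigma schedule enabled.
pipe.scheduler = HeunDiscreteScheduler.from_config(
    pipe.scheduler.config, use_karras_sigmas=True
)
image = pipe("an astronaut riding a horse", num_inference_steps=25).images[0]
```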
* add: LoRA text encoder support for DreamBooth example.
* fix initialization.
* fix: modification call.
* add: entry in the readme.
* use dog dataset from hub.
* fix: params to clip.
* add entry to the LoRA doc.
* add: tests for lora.
* remove unnecessary list comprehension.
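A hedged sketch of consuming the trained weights: when the text encoder is also trained with LoRA, `load_lora_weights` applies both the UNet and text encoder layers. The output directory and prompt are placeholders.

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained(
    "runwayml/stable-diffusion-v1-5", torch_dtype=torch.float16
).to("cuda")
# Loads the UNet LoRA layers and, when present, the text encoder LoRA layers.
pipe.load_lora_weights("path/to/dreambooth-lora-output")
image = pipe("a photo of sks dog in a bucket", num_inference_steps=25).images[0]
```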
* Update Pix2PixZero Auto-correlation Loss
* Add fast inversion tests
* Clarify purpose and mark as deprecated
Fix inversion prompt broadcasting
* Register modules set to `None` in config for `test_save_load_optional_components`
* Update new tests to coordinate with #2953
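A hedged sketch of the kind of auto-correlation regularization updated here: the predicted noise is rolled by one pixel along each spatial axis at several pyramid levels and the resulting correlation is penalized, pushing the inverted noise toward being spatially uncorrelated. The actual loss in the pipeline may differ in scales and normalization.

```python
import torch
import torch.nn.functional as F

def auto_corr_loss(noise: torch.Tensor, levels: int = 3) -> torch.Tensor:
    loss = noise.new_zeros(())
    x = noise
    for _ in range(levels):
        # Correlation of the noise with a one-pixel shift along width and height.
        loss = loss + (x * torch.roll(x, shifts=1, dims=3)).mean() ** 2
        loss = loss + (x * torch.roll(x, shifts=1, dims=2)).mean() ** 2
        x = F.avg_pool2d(x, kernel_size=2)  # move to the next pyramid level
    return loss

reg = auto_corr_loss(torch.randn(1, 4, 64, 64))
```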
* Added distillation for quantization example on textual inversion.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* refined readme and code style.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* Update text2images.py
* refined code of model load and added compatibility check.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* fixed code style.
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
* fix C403 [*] Unnecessary `list` comprehension (rewrite as a `set` comprehension)
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
---------
Signed-off-by: Ye, Xinyu <xinyu.ye@intel.com>
controlnet training center crop input images to multiple of 8
The pipeline code resizes inputs to multiples of 8.
Not doing this resizing in the training script causes
the encoded image to have different height/width dimensions
than the encoded conditioning image (which uses a separate
encoder that is part of the ControlNet model).
We resize and center crop the inputs so that they, and all
other images in the batch, end up the same size, and we also
check that the initial resolution is a multiple of 8.
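A sketch of that preprocessing (variable names are illustrative): both the target image and the conditioning image go through the same resize + center crop so their latents line up, and the resolution is checked up front.

```python
from torchvision import transforms

resolution = 512
assert resolution % 8 == 0, "resolution must be a multiple of 8"

# Applied to the images encoded by the VAE.
image_transforms = transforms.Compose([
    transforms.Resize(resolution, interpolation=transforms.InterpolationMode.BILINEAR),
    transforms.CenterCrop(resolution),
    transforms.ToTensor(),
    transforms.Normalize([0.5], [0.5]),
])
# Applied to the conditioning images encoded by the ControlNet's own encoder.
conditioning_image_transforms = transforms.Compose([
    transforms.Resize(resolution, interpolation=transforms.InterpolationMode.BILINEAR),
    transforms.CenterCrop(resolution),
    transforms.ToTensor(),
])
```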
* Modified the AltDiffusion pipeline to support AltDiffusion-m18
---------
Co-authored-by: root <fulong_ye@163.com>
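A hedged usage sketch; the `BAAI/AltDiffusion-m18` model ID is an assumption about the multilingual checkpoint this change targets.

```python
import torch
from diffusers import AltDiffusionPipeline

pipe = AltDiffusionPipeline.from_pretrained(
    "BAAI/AltDiffusion-m18", torch_dtype=torch.float16
).to("cuda")
# A non-English prompt to exercise the multilingual (18-language) text encoder.
image = pipe("黑暗精灵公主，非常详细，幻想，数字绘画").images[0]
```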
* Add SD/txt2img Community Pipeline to diffusers along with TensorRT utils
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
* update installation command
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
* update tensorrt installation
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
* changes
1. Update setting of cache directory
2. Address comments: merge utils and pipeline code.
3. Address comments: Add section in README
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
* apply make style
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
---------
Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
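A hedged sketch of loading the community pipeline through the `custom_pipeline` mechanism; the pipeline id and base model are assumptions based on the community pipelines README.

```python
import torch
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained(
    "stabilityai/stable-diffusion-2-1",
    custom_pipeline="stable_diffusion_tensorrt_txt2img",
    torch_dtype=torch.float16,
).to("cuda")
# Engine building and caching are handled inside the community pipeline code.
image = pipe("a beach at sunset, photorealistic", num_inference_steps=30).images[0]
```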
* add mixin class for pipeline from original sd ckpt
* Improve
* make style
* merge main into
* Improve more
* fix more
* up
* Apply suggestions from code review
* finish docs
* rename
* make style
---------
Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
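A sketch of the entry point the mixin provides: building a pipeline directly from an original Stable Diffusion checkpoint file rather than a diffusers-format repository. The checkpoint path is a placeholder.

```python
import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_ckpt(
    "path/to/v1-5-pruned-emaonly.ckpt", torch_dtype=torch.float16
).to("cuda")
image = pipe("a photo of an astronaut riding a horse").images[0]
```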
* [ckpt loader] Allow loading the Inpaint and Img2Img pipelines when loading from a ckpt model
* Address review comment from PR
* PyLint formatting
* Some more pylint fixes, unrelated to our change
* Another pylint fix
* Styling fix
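The same loader applied to the img2img and inpaint pipelines, which is what this change enables; paths are placeholders.

```python
from diffusers import StableDiffusionImg2ImgPipeline, StableDiffusionInpaintPipeline

img2img = StableDiffusionImg2ImgPipeline.from_ckpt("path/to/model.ckpt")
inpaint = StableDiffusionInpaintPipeline.from_ckpt("path/to/inpainting-model.ckpt")
```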
Adding act fn config to the unet timestep class embedding and conv
activation.
The custom activation defaults to silu, which is the default
activation function for both the conv act and the timestep class
embeddings, so default behavior is not changed.
The only unet that uses the custom activation is the stable diffusion
latent upscaler https://huggingface.co/stabilityai/sd-x2-latent-upscaler/blob/main/unet/config.json
(I ran a script against the hub to confirm).
The latent upscaler does not use the conv activation or the timestep
class embeddings, so its behavior does not change either.
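A small sketch to check the point above: the latent upscaler's UNet is the one hub config with a non-default `act_fn`, and since it does not exercise the conv or class-embedding activation paths its behavior is unchanged.

```python
from diffusers import UNet2DConditionModel

unet = UNet2DConditionModel.from_pretrained(
    "stabilityai/sd-x2-latent-upscaler", subfolder="unet"
)
print(unet.config.act_fn)  # the custom activation read from the hub config
```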