diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Jason C.H	2de9e2df36	Fix from_ckpt for Stable Diffusion 2.x (#3662 )	2023-06-06 22:39:11 +01:00
Isotr0py	11b3002b48	Support views batch for panorama (#3632 ) * support views batch for panorama * add entry for the new argument * format entry for the new argument * add view_batch_size test * fix batch test and a boundary condition * add more docstrings * fix a typos * fix typos * add: entry to the doc about view_batch_size. * Revert "add: entry to the doc about view_batch_size." This reverts commit `a36aeaa9ed`. * add a tip on . --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-07 02:50:02 +05:30
stano	10f4ecd177	Fix the Kandinsky docstring examples (#3695 ) - use the correct Prior hub model id - use the new names in KandinskyPriorPipelineOutput	2023-06-06 22:18:14 +01:00
Sayak Paul	de16f64667	feat: when using PT 2.0 use LoRAAttnProcessor2_0 for text enc LoRA. (#3691 )	2023-06-06 21:20:53 +01:00
YiYi Xu	017ee1609b	refactor Image processor for x4 upscaler (#3692 ) * refactor x4 upscaler * style * copies --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-06-06 21:08:36 +01:00
Sayak Paul	8669e8313d	[LoRA] feat: add lora attention processor for pt 2.0. (#3594 ) * feat: add lora attention processor for pt 2.0. * explicit context manager for SDPA. * switch to flash attention * make shapes compatible to work optimally with SDPA. * fix: circular import problem. * explicitly specify the flash attention kernel in sdpa * fall back to efficient attention context manager. * remove explicit dispatch. * fix: removed processor. * fix: remove optional from type annotation. * feat: make changes regarding LoRAAttnProcessor2_0. * remove confusing warning. * formatting. * relax tolerance for PT 2.0 * fix: loading message. * remove unnecessary logging. * add: entry to the docs. * add: network_alpha argument. * relax tolerance.	2023-06-06 14:56:05 +05:30
Takuma Mori	b45204ea5a	Add function to remove monkey-patch for text encoder LoRA (#3649 ) * merge undoable-monkeypatch * remove TEXT_ENCODER_TARGET_MODULES, refactoring * move create_lora_weight_file	2023-06-06 14:06:13 +05:30
Steven Liu	a8b0f42c38	[docs] Fix link to loader method (#3680 ) fix link to load_lora_weights	2023-06-06 13:37:47 +05:30
Will Berman	41ae670828	move activation dispatches into helper function (#3656 ) * move activation dispatches into helper function * tests	2023-06-05 12:30:48 -07:00
Will Berman	462956be7b	small tweaks for parsing thibaudz controlnet checkpoints (#3657 )	2023-06-05 10:24:31 -07:00
YiYi Xu	5990014700	[WIP]Vae preprocessor refactor (PR1) (#3557 ) VaeImageProcessor.preprocess refactor * refactored VaeImageProcessor - allow passing optional height and width argument to resize() - add convert_to_rgb * refactored prepare_latents method for img2img pipelines so that if we pass latents directly as image input, it will not encode it again * added a test in test_pipelines_common.py to test latents as image inputs * refactored img2img pipelines that accept latents as image: - controlnet img2img, stable diffusion img2img , instruct_pix2pix --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-05 07:11:00 -10:00
Steven Liu	1a6a647e06	[docs] More API fixes (#3640 ) * part 2 of api fixes * move randn_tensor * add to toctree * apply feedback * more feedback	2023-06-05 09:47:26 -07:00
Sayak Paul	995bbcb9aa	[UniDiffuser test] fix one test so that it runs correctly on V100 (#3675 ) * fix: assertion. * assertion fix.	2023-06-05 17:42:31 +05:30
pdoane	d0416ab090	Update Compel documentation for textual inversions (#3663 ) * Update Compel documentation for textual inversions * Fix typo	2023-06-05 16:46:27 +05:30
Vladislav Lyubimov	1994dbcb5e	Fix from_ckpt not working properly on windows (#3666 )	2023-06-05 11:55:37 +01:00
Patrick von Platen	262d539a8a	Correct multi gpu dreambooth (#3673 ) Correct multi gpu	2023-06-05 11:03:11 +01:00
Will Berman	0fc2fb71c1	dreambooth upscaling fix added latents (#3659 )	2023-06-05 10:32:16 +01:00
Steven Liu	523a50a8eb	[docs] Load A1111 LoRA (#3629 ) * load a1111 lora * fix * apply feedback * fix	2023-06-05 11:05:42 +05:30
0x1355	de45af4a46	Allow setting num_cycles for cosine_with_restarts lr scheduler (#3606 ) Expose num_cycles kwarg of get_schedule() through args.lr_num_cycles.	2023-06-05 10:18:29 +05:30
0x1355	b95cbdf6fc	Set step_rules correctly for piecewise_constant scheduler (#3605 ) So that schedule_func() calls get_piecewise_constant_schedule() with correctly named kwarg.	2023-06-05 10:16:26 +05:30
Will Berman	7a39691362	linting fix (#3653 )	2023-06-02 13:33:19 -07:00
Will Berman	5911a3aa47	dreambooth if docs - stage II, more info (#3628 ) * dreambooth if docs - stage II, more info * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update docs/source/en/training/dreambooth.mdx Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * download instructions for downsized images * update source README to match docs --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 10:37:13 -07:00
Will Berman	b7af946138	set config from original module but set compiled module on class (#3650 ) * set config from original module but set compiled module on class * add test	2023-06-02 10:26:41 -07:00
asfiyab-nvidia	d3717e6368	add Stable Diffusion TensorRT Inpainting pipeline (#3642 ) * add tensorrt inpaint pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * run make style Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-06-02 18:14:31 +01:00
Kadir Nar	0dbdc0cbae	[Community Doc] Updated the filename and readme file. (#3634 ) * Updated the filename and readme file. * reformatter * reformetter	2023-06-02 17:53:09 +01:00
YiYi Xu	0e8688113a	fix inpainting pipeline when providing initial latents (#3641 ) * fix latents * fix copies --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-06-02 17:03:15 +01:00
Kashif Rasul	f1d4743394	fixed typo in example train_text_to_image.py (#3608 ) fixed typo	2023-06-02 20:54:54 +05:30
Lachlan Nicholson	a6c7b5b6b7	Iterate over unique tokens to avoid duplicate replacements for multivector embeddings (#3588 ) * iterate over unique tokens to avoid duplicate replacements * added test for multiple references to multi embedding * adhere to black formatting * reorder test post-rebase	2023-06-02 16:10:22 +01:00
Takuma Mori	8e552bb4fe	Support Kohya-ss style LoRA file format (in a limited capacity) (#3437 ) * add _convert_kohya_lora_to_diffusers * make style * add scaffold * match result: unet attention only * fix monkey-patch for text_encoder * with CLIPAttention While the terrible images are no longer produced, the results do not match those from the hook ver. This may be due to not setting the network_alpha value. * add to support network_alpha * generate diff image * fix monkey-patch for text_encoder * add test_text_encoder_lora_monkey_patch() * verify that it's okay to release the attn_procs * fix closure version * add comment * Revert "fix monkey-patch for text_encoder" This reverts commit `bb9c61e6fa`. * Fix to reuse utility functions * make LoRAAttnProcessor targets to self_attn * fix LoRAAttnProcessor target * make style * fix split key * Update src/diffusers/loaders.py * remove TEXT_ENCODER_TARGET_MODULES loop * add print memory usage * remove test_kohya_loras_scaffold.py * add: doc on LoRA civitai * remove print statement and refactor in the doc. * fix state_dict test for kohya-ss style lora * Apply suggestions from code review Co-authored-by: Takuma Mori <takuma104@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-06-02 17:40:24 +05:30
Patrick von Platen	32ea2142c0	[Kandinsky] Improve kandinsky API a bit (#3636 ) * Improve docs * up * Update docs/source/en/api/pipelines/kandinsky.mdx * up * up * correct more * further improve * Update docs/source/en/api/pipelines/kandinsky.mdx Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2023-06-02 08:57:20 +01:00
Sayak Paul	55dbfa0229	[Docs] include the instruction-tuning blog link in the InstructPix2Pix docs (#3644 ) include the instruction-tuning blog link.	2023-06-02 08:04:35 +05:30
Will Berman	4f14b36329	Full Dreambooth IF stage II upscaling (#3561 ) * update dreambooth lora to work with IF stage II * Update dreambooth script for IF stage II upscaler	2023-05-31 09:39:31 -07:00
Will Berman	f751b8844e	update dreambooth lora to work with IF stage II (#3560 )	2023-05-31 09:39:03 -07:00
Prathik Rao	abb89da4de	update code to reflect latest changes as of May 30th (#3616 ) * update code to reflect latest changes as of May 30th * update text to image example * reflect changes to textual inversion * make style * fix typo * Revert unnecessary readme changes --------- Co-authored-by: root <root@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net> Co-authored-by: Prathik Rao <prathikrao@microsoft.com@orttrainingdev8.d32nl1ml4oruzj4qz3bqlggovf.px.internal.cloudapp.net>	2023-05-31 11:29:04 +02:00
Will Berman	7d0ac4eeab	goodbye frog (#3617 )	2023-05-30 23:18:01 +01:00
Patrick von Platen	0cc3a7a123	Make sure we also change the config when setting `encoder_hid_dim_type=="text_proj"` and allow xformers (#3615 ) * fix if * make style * make style * add tests for xformers * make style * update	2023-05-30 20:47:14 +01:00
Patrick von Platen	9d3ff0794d	fix tests (#3614 )	2023-05-30 18:59:07 +01:00
Patrick von Platen	a359ab4e29	Update README.md	2023-05-30 18:26:32 +01:00
Patrick von Platen	160c377ddc	Make style	2023-05-30 13:14:09 +01:00
Denis	bb22d546c0	[Community] CLIP Guided Images Mixing with Stable DIffusion Pipeline (#3587 ) * added clip_guided_images_mixing_stable_diffusion file and readme description * apply pre-commit --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-30 13:13:45 +01:00
Greg Hunkins	799f5b4e12	[Feat] Enable State Dict For Textual Inversion Loader (#3439 ) * enable state dict for textual inversion loader * Empty-Commit | restart CI * Empty-Commit | restart CI * Empty-Commit | restart CI * Empty-Commit | restart CI * add tests * fix tests * fix tests * fix tests --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-30 13:13:34 +01:00
takuoko	07ef4855cd	[Community, Enhancement] Add reference tricks in README (#3589 ) add reference tricks	2023-05-30 12:38:16 +01:00
Kadir Nar	6cbddf558a	[Community] Support StableDiffusionTilingPipeline (#3586 ) * added mixture pipeline * added docstring * update docstring	2023-05-30 12:24:15 +01:00
Rupert Menneer	35a740427e	#3487 Fix inpainting strength for various samplers (#3532 ) * Throw error if strength adjusted num_inference_steps < 1 * Added new fast test to check ValueError raised when num_inference_steps < 1 when strength adjusts the num_inference_steps then the inpainting pipeline should fail * fix #3487 initial latents are now only scaled by init_noise_sigma when pure noise updated this commit w.r.t the latest merge here: https://github.com/huggingface/diffusers/pull/3533 * fix --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-30 12:17:42 +01:00
Sayak Paul	0612f48cd0	[UniDiffuser Tests] Fix some tests (#3609 ) * fix: unidiffuser test failures. * living room.	2023-05-30 12:07:18 +01:00
Kadir Nar	c059cc0992	[docs] update the broken links (#3577 )	2023-05-30 11:44:53 +01:00
Patrick von Platen	c0f867afd1	Fix temb attention (#3607 ) * Fix temb attention * Apply suggestions from code review * make style * Add tests and fix docker * Apply suggestions from code review	2023-05-30 11:26:23 +01:00
Sayak Paul	c6ae883751	remove print statements from attention processor. (#3592 )	2023-05-29 09:20:31 +05:30
Steven Liu	5559d04237	[docs] Working with different formats (#3534 ) * add ckpt * fix format * apply feedback * fix * include pb * rename file	2023-05-26 14:37:51 -07:00
Brandon	9917c32916	[docs] update the broken links (#3568 ) update the broken links update the broken links for training folder doc	2023-05-26 12:10:32 -07:00

2431 Commits