diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Patrick von Platen	51843fd7d0	Refactor full determinism (#3485 ) * up * fix more * Apply suggestions from code review * fix more * fix more * Check it * Remove 16:8 * fix more * fix more * fix more * up * up * Test only stable diffusion * Test only two files * up * Try out spinning up processes that can be killed * up * Apply suggestions from code review * up * up	2023-05-22 11:15:11 +01:00
Sayak Paul	4bbc51d94d	[Attention processor] Better warning message when shifting to `AttnProcessor2_0` (#3457 ) * add: debugging to enabling memory efficient processing * add: better warning message.	2023-05-21 15:26:47 +05:30
Will Berman	85eff637aa	[{Up,Down}sample1d] explicit view kernel size as number elements in flattened indices (#3479 ) explicit view kernel size as number elements in flattened indices	2023-05-19 10:45:56 -07:00
Will Berman	c9f939bf98	Update full dreambooth script to work with IF (#3425 )	2023-05-17 10:42:20 -07:00
Patrick von Platen	2858d7e15e	[From ckpt] Fix from_ckpt (#3466 ) * Correct from_ckpt * make style	2023-05-17 13:26:53 +01:00
Glaceon-Hyy	88295f92d9	Add inpaint lora scale support (#3460 ) * add inpaint lora scale support * add inpaint lora scale test --------- Co-authored-by: yueyang.hyy <yueyang.hyy@alibaba-inc.com>	2023-05-17 16:58:19 +05:30
cmdr2	bd78f63a54	Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463 ) Release large tensors in attention (as soon as they're no longer required). Reduces peak VRAM by nearly 2 GB for 1024x1024 (even after slicing), and the savings scale up with image size.	2023-05-17 11:24:59 +01:00
7eu7d7	15f1bab13b	Fix gradient checkpointing bugs in freezing part of models (requires_grad=False) (#3404 ) * gradient checkpointing bug fix * bug fix; changes for reviews * reformat * reformat --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-17 11:06:04 +01:00
Vimarsh Chaturvedi	415c616712	[WIP] Bugfix - Pipeline.from_pretrained is broken when the pipeline is partially downloaded (#3448 ) Added bugfix using f strings.	2023-05-17 11:05:33 +01:00
Rupert Menneer	c09c4f3ab7	Adding 'strength' parameter to StableDiffusionInpaintingPipeline (#3424 ) * Added explanation of 'strength' parameter * Added get_timesteps function which relies on new strength parameter * Added `strength` parameter which defaults to 1. * Swapped ordering so `noise_timestep` can be calculated before masking the image this is required when you aren't applying 100% noise to the masked region, e.g. strength < 1. * Added strength to check_inputs, throws error if out of range * Changed `prepare_latents` to initialise latents w.r.t strength inspired from the stable diffusion img2img pipeline, init latents are initialised by converting the init image into a VAE latent and adding noise (based upon the strength parameter passed in), e.g. random when strength = 1, or the init image at strength = 0. * WIP: Added a unit test for the new strength parameter in the StableDiffusionInpaintingPipeline still need to add correct regression values * Created a is_strength_max to initialise from pure random noise * Updated unit tests w.r.t new strength parameter + fixed new strength unit test * renamed parameter to avoid confusion with variable of same name * Updated regression values for new strength test - now passes * removed 'copied from' comment as this method is now different and divergent from the cpy * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Ensure backwards compatibility for prepare_mask_and_masked_image created a return_image boolean and initialised to false * Ensure backwards compatibility for prepare_latents * Fixed copy check typo * Fixes w.r.t backward compibility changes * make style * keep function argument ordering same for backwards compatibility in callees with copied from statements * make fix-copies --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com>	2023-05-17 11:05:16 +01:00
Dev Aggarwal	6070b32fcf	Allow arbitrary aspect ratio in IFSuperResolutionPipeline (#3298 ) * Update pipeline_if_superresolution.py Allow arbitrary aspect ratio in IFSuperResolutionPipeline by using the input image shape * IFSuperResolutionPipeline: allow the user to override the height and width through the arguments * update IFSuperResolutionPipeline width/height doc string to match StableDiffusionInpaintPipeline conventions --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-16 19:21:07 -07:00
superlabs-dev	92ea5baca2	fix tiled vae blend extent range (#3384 ) fix tiled vae bleand extent range	2023-05-16 19:33:47 +01:00
Laureηt	754fac82d2	[Docs] Fix incomplete docstring for resnet.py (#3438 ) Fix incomplete docstrings for resnet.py	2023-05-16 19:33:34 +01:00
clarencechen	17f9aed79c	[Scheduler] DPM-Solver (++) Inverse Scheduler (#3335 ) * Add DPM-Solver Multistep Inverse Scheduler * Add draft tests for DiffEdit * Add inverse sde-dpmsolver steps to tune image diversity from inverted latents * Fix tests --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-16 19:26:53 +01:00
Patrick von Platen	886575ee43	Refactor controlnet and add img2img and inpaint (#3386 ) * refactor controlnet and add img2img and inpaint * First draft to get pipelines to work * make style * Fix more * Fix more * More tests * Fix more * Make inpainting work * make style and more tests * Apply suggestions from code review * up * make style * Fix imports * Fix more * Fix more * Improve examples * add test * Make sure import is correctly deprecated * Make sure everything works in compile mode * make sure authorship is correctly attributed	2023-05-16 19:07:21 +01:00
Patrick von Platen	d2285f5158	fix warning message pipeline loading (#3446 )	2023-05-16 12:58:24 +01:00
Will Berman	29b1325a5a	unCLIP scheduler do not use note (#3417 )	2023-05-15 09:47:14 -06:00
Will Berman	909742dbd6	attention refactor: the trilogy (#3387 ) * Replace `AttentionBlock` with `Attention` * use _from_deprecated_attn_block check re: @patrickvonplaten	2023-05-12 08:54:09 -06:00
Laureηt	7f6373d264	[Docs] Add `sigmoid` beta_scheduler to docstrings of relevant Schedulers (#3399 ) * Add `sigmoid` beta scheduler to `DDPMScheduler` docstring * Add `sigmoid` beta scheduler to `RePaintScheduler` docstring --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-12 12:48:26 +01:00
Patrick von Platen	f92253015c	Fix various bugs with LoRA Dreambooth and Dreambooth script (#3353 ) * Improve checkpointing lora * fix more * Improve doc string * Update src/diffusers/loaders.py * make stytle * Apply suggestions from code review * Update src/diffusers/loaders.py * Apply suggestions from code review * Apply suggestions from code review * better * Fix all * Fix multi-GPU dreambooth * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix all * make style * make style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-11 19:28:09 +01:00
Patrick von Platen	58c6f9cb71	Add omegaconf for tests (#3400 ) Add omegaconfg	2023-05-11 18:03:27 +01:00
Stas Bekman	af2a237676	[deepspeed] partial ZeRO-3 support (#3076 ) * [deepspeed] partial ZeRO-3 support * cleanup * improve deepspeed fixes * Improve * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-11 16:59:20 +01:00
Takuma Mori	01c056f094	Support ControlNet v1.1 shuffle properly (#3340 ) * add inferring_controlnet_cond_batch * Revert "add inferring_controlnet_cond_batch" This reverts commit `abe8d6311d`. * set guess_mode to True whenever global_pool_conditions is True Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * nit * add integration test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-11 14:58:07 +01:00
Patrick von Platen	82e6fa56f0	make style	2023-05-10 20:16:18 +02:00
Rupert Menneer	edb087a217	StableDiffusionInpaintingPipeline - resize image w.r.t height and width (#3322 ) * StableDiffusionInpaintingPipeline now resizes input images and masks w.r.t to passed input height and width. Default is already set to 512. This addresses the common tensor mismatch error. Also moved type check into relevant funciton to keep main pipeline body tidy. * Fixed StableDiffusionInpaintingPrepareMaskAndMaskedImageTests Due to previous commit these tests were failing as height and width need to be passed into the prepare_mask_and_masked_image function, I have updated the code and added a height/width variable per unit test as it seemed more appropriate than the current hard coded solution * Added a resolution test to StableDiffusionInpaintPipelineSlowTests this unit test simply gets the input and resizes it into some that would fail (e.g. would throw a tensor mismatch error/not a mult of 8). Then passes it through the pipeline and verifies it produces output with correct dims w.r.t the passed height and width --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-10 19:14:25 +01:00
Sayak Paul	94a0c644a8	add: a warning message when using xformers in a PT 2.0 env. (#3365 ) * add: a warning message when using xformers in a PT 2.0 env. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-10 07:22:04 +05:30
Steven Liu	26832aa5ef	[docs] Improve safetensors docstring (#3368 ) * clarify safetensor docstring * fix typo * apply feedback	2023-05-09 16:15:05 -07:00
YiYi Xu	c559479592	Postprocessing refactor all others (#3337 ) * add text2img * fix-copies * add * add all other pipelines * add * add * add * add * add * make style * style + fix copies --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2023-05-09 22:28:30 +01:00
Will Berman	a757b2db6e	if dreambooth lora (#3360 ) * update IF stage I pipelines add fixed variance schedulers and lora loading * added kv lora attn processor * allow loading into alternative lora attn processor * make vae optional * throw away predicted variance * allow loading into added kv lora layer * allow load T5 * allow pre compute text embeddings * set new variance type in schedulers * fix copies * refactor all prompt embedding code class prompts are now included in pre-encoding code max tokenizer length is now configurable embedding attention mask is now configurable * fix for when variance type is not defined on scheduler * do not pre compute validation prompt if not present * add example test for if lora dreambooth * add check for train text encoder and pre compute text embeddings	2023-05-09 10:24:36 -07:00
Steven Liu	571bc1ea11	[docs] Fix docstring (#3334 ) fix docstring Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-08 12:08:23 -07:00
Patrick von Platen	f381402ec8	make fix-copies	2023-05-08 10:55:02 +02:00
pdoane	3d8b3d7cd8	Batched load of textual inversions (#3277 ) * Batched load of textual inversions - Only call resize_token_embeddings once per batch as it is the most expensive operation - Allow pretrained_model_name_or_path and token to be an optional list - Remove Dict from type annotation pretrained_model_name_or_path as it was not supported in this function - Add comment that single files (e.g. .pt/.safetensors) are supported - Add comment for token parameter - Convert token override log message from warning to info * Update src/diffusers/loaders.py Check for duplicate tokens Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update condition for None tokens --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-08 09:54:30 +01:00
Isotr0py	0ffac97933	Add `use_Karras_sigmas` to LMSDiscreteScheduler (#3351 ) * add karras sigma to lms discrete scheduler * add test for lms_scheduler karras * reformat test lms	2023-05-06 12:19:27 +01:00
At-sushi	7ce3fa010a	Fix TypeError when using prompt_embeds and negative_prompt (#2982 ) * test: Added test case * fix: fixed type checking issue on _encode_prompt * fix: fixed copies consistency * fix: one copy was not sufficient	2023-05-06 12:04:07 +01:00
Will Rice	36f43ea75a	Add upsample_size to AttnUpBlock2D, AttnDownBlock2D (#3275 ) The argument `upsample_size` needs to be added to these modules to allow compatibility with other blocks that require this argument.	2023-05-05 19:50:41 +01:00
Cheng Lu	27522b585b	Add the SDE variant of DPM-Solver and DPM-Solver++ (#3344 ) * add SDE variant of DPM-Solver and DPM-Solver++ * add test * fix typo * fix typo	2023-05-05 16:03:47 +01:00
Patrick von Platen	8d4c7d0ea0	Fix config dpm (#3343 )	2023-05-05 12:02:33 +01:00
Patrick von Platen	29ad75dc3b	[Quality] Make style (#3341 )	2023-05-05 10:06:09 +01:00
Cheng Lu	022479416f	Fix multistep dpmsolver for cosine schedule (suitable for deepfloyd-if) (#3314 ) * fix multistep dpmsolver for cosine schedule (deepfloy-if) * fix a typo * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update all dpmsolver (singlestep, multistep, dpm, dpm++) for cosine noise schedule * add test, fix style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-03 18:00:59 +01:00
Mylo	63a8ef7b73	Fix missing variable assign in DeepFloyd-IF-II (#3315 ) Fix missing variable assign lol	2023-05-03 17:31:04 +01:00
Patrick von Platen	5c7a35a259	[Torch 2.0 compile] Fix more torch compile breaks (#3313 ) * Fix more torch compile breaks * add tests * Fix all * fix controlnet * fix more * Add Horace He as co-author. > > Co-authored-by: Horace He <horacehe2007@yahoo.com> * Add Horace He as co-author. Co-authored-by: Horace He <horacehe2007@yahoo.com> --------- Co-authored-by: Horace He <horacehe2007@yahoo.com>	2023-05-02 18:51:00 +01:00
YiYi Xu	a7f25b4a88	Postprocessing refactor img2img (#3268 ) * refactor img2img VaeImageProcessor.postprocess * remove copy from for init, run_safety_checker, decode_latents Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: yiyixuxu <yixu@yis-macbook-pro.lan> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-05-01 07:54:09 -10:00
Patrick von Platen	0e82fb19e1	Torch compile graph fix (#3286 ) * fix more * Fix more * fix more * Apply suggestions from code review * fix * make style * make fix-copies * fix * make sure torch compile * Clean * fix test	2023-05-01 16:45:43 +02:00
Ilia Larchenko	536684eb2f	Changed sample[0] to images[0] (#3304 ) A pipeline object stores the results in `images` not in `sample`. Current code blocks don't work.	2023-05-01 15:33:51 +02:00
Will Berman	384c83aa9a	temp disable spectogram diffusion tests (#3278 ) The note-seq package throws an error on import because the default installed version of Ipython is not compatible with python 3.8 which we run in the CI. https://github.com/huggingface/diffusers/actions/runs/4830121056/jobs/8605954838#step:7:9	2023-04-28 12:05:53 -07:00
Patrick von Platen	4d35d7fea3	Allow disabling torch 2_0 attention (#3273 ) * Allow disabling torch 2_0 attention * make style * Update src/diffusers/models/attention.py	2023-04-28 13:31:11 +02:00
Jason Kuan	a7b0671c07	add constant learning rate with custom rule (#3133 ) * add constant lr with rules * add constant with rules in TYPE_TO_SCHEDULER_FUNCTION * add constant lr rate with rule * hotfix code quality * fix doc style * change name constant_with_rules to piecewise constant	2023-04-28 16:29:56 +05:30
clarencechen	be0bfcec4d	Diffedit Zero-Shot Inpainting Pipeline (#2837 ) * Update Pix2PixZero Auto-correlation Loss * Add Stable Diffusion DiffEdit pipeline * Add draft documentation and import code * Bugfixes and refactoring * Add option to not decode latents in the inversion process * Harmonize preprocessing * Revert "Update Pix2PixZero Auto-correlation Loss" This reverts commit `b218062fed`. * Update annotations * rename `compute_mask` to `generate_mask` * Update documentation * Update docs * Update Docs * Fix copy * Change shape of output latents to batch first * Update docs * Add first draft for tests * Bugfix and update tests * Add `cross_attention_kwargs` support for all pipeline methods * Fix Copies * Add support for PIL image latents Add support for mask broadcasting Update docs and tests Align `mask` argument to `mask_image` Remove height and width arguments * Enable MPS Tests * Move example docstrings * Fix test * Fix test * fix pipeline inheritance * Harmonize `prepare_image_latents` with StableDiffusionPix2PixZeroPipeline * Register modules set to `None` in config for `test_save_load_optional_components` * Move fixed logic to specific test class * Clean changes to other pipelines * Update new tests to coordinate with #2953 * Update slow tests for better results * Safety to avoid potential problems with torch.inference_mode * Add reference in SD Pipeline Overview * Fix tests again * Enforce determinism in noise for generate_mask * Fix copies * Widen test tolerance for fp16 based on `test_stable_diffusion_upscale_pipeline_fp16` * Add LoraLoaderMixin and update `prepare_image_latents` * clean up repeat and reg * bugfix * Remove invalid args from docs Suppress spurious warning by repeating image before latent to mask gen	2023-04-28 16:28:26 +05:30
Sayak Paul	71de5b7051	[LoRA] quality of life improvements in the loading semantics and docs (#3180 ) * 👽 qol improvements for LoRA. * better function name? * fix: LoRA weight loading with the new format. * address Patrick's comments. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * change wording around encouraging the use of load_lora_weights(). * fix: function name. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-04-28 11:36:49 +05:30
Patrick von Platen	364d59d13b	Fix community pipelines (#3266 )	2023-04-27 17:12:08 +01:00

1 2 3 4 5 ...

1322 Commits