diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Hari Krishna	f3d570c273	feat: allow disk offload for diffuser models (#3285 ) * allow disk offload for diffuser models * sort import * add max_memory argument * Changed sample[0] to images[0] (#3304) A pipeline object stores the results in `images` not in `sample`. Current code blocks don't work. * Typo in tutorial (#3295) * Torch compile graph fix (#3286) * fix more * Fix more * fix more * Apply suggestions from code review * fix * make style * make fix-copies * fix * make sure torch compile * Clean * fix test * Postprocessing refactor img2img (#3268) * refactor img2img VaeImageProcessor.postprocess * remove copy from for init, run_safety_checker, decode_latents Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: yiyixuxu <yixu@yis-macbook-pro.lan> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * [Torch 2.0 compile] Fix more torch compile breaks (#3313) * Fix more torch compile breaks * add tests * Fix all * fix controlnet * fix more * Add Horace He as co-author. > > Co-authored-by: Horace He <horacehe2007@yahoo.com> * Add Horace He as co-author. Co-authored-by: Horace He <horacehe2007@yahoo.com> --------- Co-authored-by: Horace He <horacehe2007@yahoo.com> * fix: scale_lr and sync example readme and docs. (#3299) * fix: scale_lr and sync example readme and docs. * fix doc link. * Update stable_diffusion.mdx (#3310) fixed import statement * Fix missing variable assign in DeepFloyd-IF-II (#3315) Fix missing variable assign lol * Correct doc build for patch releases (#3316) Update build_documentation.yml * Add Stable Diffusion RePaint to community pipelines (#3320) * Add Stable Diffsuion RePaint to community pipelines - Adds Stable Diffsuion RePaint to community pipelines - Add Readme enty for pipeline * Fix: Remove wrong import - Remove wrong import - Minor change in comments * Fix: Code formatting of stable_diffusion_repaint * Fix: ruff errors in stable_diffusion_repaint * Fix multistep dpmsolver for cosine schedule (suitable for deepfloyd-if) (#3314) * fix multistep dpmsolver for cosine schedule (deepfloy-if) * fix a typo * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update src/diffusers/schedulers/scheduling_dpmsolver_multistep.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * update all dpmsolver (singlestep, multistep, dpm, dpm++) for cosine noise schedule * add test, fix style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * [docs] Improve LoRA docs (#3311) * update docs * add to toctree * apply feedback * Added input pretubation (#3292) * Added input pretubation * Fixed spelling * Update write_own_pipeline.mdx (#3323) * update controlling generation doc with latest goodies. (#3321) * [Quality] Make style (#3341) * Fix config dpm (#3343) * Add the SDE variant of DPM-Solver and DPM-Solver++ (#3344) * add SDE variant of DPM-Solver and DPM-Solver++ * add test * fix typo * fix typo * Add upsample_size to AttnUpBlock2D, AttnDownBlock2D (#3275) The argument `upsample_size` needs to be added to these modules to allow compatibility with other blocks that require this argument. * Rename --only_save_embeds to --save_as_full_pipeline (#3206) * Set --only_save_embeds to False by default Due to how the option is named, it makes more sense to behave like this. * Refactor only_save_embeds to save_as_full_pipeline * [AudioLDM] Generalise conversion script (#3328) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Fix TypeError when using prompt_embeds and negative_prompt (#2982) * test: Added test case * fix: fixed type checking issue on _encode_prompt * fix: fixed copies consistency * fix: one copy was not sufficient * Fix pipeline class on README (#3345) Update README.md * Inpainting: typo in docs (#3331) Typo in docs Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add `use_Karras_sigmas` to LMSDiscreteScheduler (#3351) * add karras sigma to lms discrete scheduler * add test for lms_scheduler karras * reformat test lms * Batched load of textual inversions (#3277) * Batched load of textual inversions - Only call resize_token_embeddings once per batch as it is the most expensive operation - Allow pretrained_model_name_or_path and token to be an optional list - Remove Dict from type annotation pretrained_model_name_or_path as it was not supported in this function - Add comment that single files (e.g. .pt/.safetensors) are supported - Add comment for token parameter - Convert token override log message from warning to info * Update src/diffusers/loaders.py Check for duplicate tokens Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update condition for None tokens --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make fix-copies * [docs] Fix docstring (#3334) fix docstring Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * if dreambooth lora (#3360) * update IF stage I pipelines add fixed variance schedulers and lora loading * added kv lora attn processor * allow loading into alternative lora attn processor * make vae optional * throw away predicted variance * allow loading into added kv lora layer * allow load T5 * allow pre compute text embeddings * set new variance type in schedulers * fix copies * refactor all prompt embedding code class prompts are now included in pre-encoding code max tokenizer length is now configurable embedding attention mask is now configurable * fix for when variance type is not defined on scheduler * do not pre compute validation prompt if not present * add example test for if lora dreambooth * add check for train text encoder and pre compute text embeddings * Postprocessing refactor all others (#3337) * add text2img * fix-copies * add * add all other pipelines * add * add * add * add * add * make style * style + fix copies --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> * [docs] Improve safetensors docstring (#3368) * clarify safetensor docstring * fix typo * apply feedback * add: a warning message when using xformers in a PT 2.0 env. (#3365) * add: a warning message when using xformers in a PT 2.0 env. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * StableDiffusionInpaintingPipeline - resize image w.r.t height and width (#3322) * StableDiffusionInpaintingPipeline now resizes input images and masks w.r.t to passed input height and width. Default is already set to 512. This addresses the common tensor mismatch error. Also moved type check into relevant funciton to keep main pipeline body tidy. * Fixed StableDiffusionInpaintingPrepareMaskAndMaskedImageTests Due to previous commit these tests were failing as height and width need to be passed into the prepare_mask_and_masked_image function, I have updated the code and added a height/width variable per unit test as it seemed more appropriate than the current hard coded solution * Added a resolution test to StableDiffusionInpaintPipelineSlowTests this unit test simply gets the input and resizes it into some that would fail (e.g. would throw a tensor mismatch error/not a mult of 8). Then passes it through the pipeline and verifies it produces output with correct dims w.r.t the passed height and width --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style * [docs] Adapt a model (#3326) * first draft * apply feedback * conv_in.weight thrown away * [docs] Load safetensors (#3333) * safetensors * apply feedback * apply feedback * Apply suggestions from code review --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * make style * [Docs] Fix stable_diffusion.mdx typo (#3398) Fix typo in last code block. Correct "prommpts" to "prompt" * Support ControlNet v1.1 shuffle properly (#3340) * add inferring_controlnet_cond_batch * Revert "add inferring_controlnet_cond_batch" This reverts commit `abe8d6311d`. * set guess_mode to True whenever global_pool_conditions is True Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * nit * add integration test --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * [Tests] better determinism (#3374) * enable deterministic pytorch and cuda operations. * disable manual seeding. * make style && make quality for unet_2d tests. * enable determinism for the unet2dconditional model. * add CUBLAS_WORKSPACE_CONFIG for better reproducibility. * relax tolerance (very weird issue, though). * revert to torch manual_seed() where needed. * relax more tolerance. * better placement of the cuda variable and relax more tolerance. * enable determinism for 3d condition model. * relax tolerance. * add: determinism to alt_diffusion. * relax tolerance for alt diffusion. * dance diffusion. * dance diffusion is flaky. * test_dict_tuple_outputs_equivalent edit. * fix two more tests. * fix more ddim tests. * fix: argument. * change to diff in place of difference. * fix: test_save_load call. * test_save_load_float16 call. * fix: expected_max_diff * fix: paint by example. * relax tolerance. * add determinism to 1d unet model. * torch 2.0 regressions seem to be brutal * determinism to vae. * add reason to skipping. * up tolerance. * determinism to vq. * determinism to cuda. * determinism to the generic test pipeline file. * refactor general pipelines testing a bit. * determinism to alt diffusion i2i * up tolerance for alt diff i2i and audio diff * up tolerance. * determinism to audioldm * increase tolerance for audioldm lms. * increase tolerance for paint by paint. * increase tolerance for repaint. * determinism to cycle diffusion and sd 1. * relax tol for cycle diffusion 🚲 * relax tol for sd 1.0 * relax tol for controlnet. * determinism to img var. * relax tol for img variation. * tolerance to i2i sd * make style * determinism to inpaint. * relax tolerance for inpaiting. * determinism for inpainting legacy * relax tolerance. * determinism to instruct pix2pix * determinism to model editing. * model editing tolerance. * panorama determinism * determinism to pix2pix zero. * determinism to sag. * sd 2. determinism * sd. tolerance * disallow tf32 matmul. * relax tolerance is all you need. * make style and determinism to sd 2 depth * relax tolerance for depth. * tolerance to diffedit. * tolerance to sd 2 inpaint. * up tolerance. * determinism in upscaling. * tolerance in upscaler. * more tolerance relaxation. * determinism to v pred. * up tol for v_pred * unclip determinism * determinism to unclip img2img * determinism to text to video. * determinism to last set of tests * up tol. * vq cumsum doesn't have a deterministic kernel * relax tol * relax tol * [docs] Add transformers to install (#3388) add transformers to install * [deepspeed] partial ZeRO-3 support (#3076) * [deepspeed] partial ZeRO-3 support * cleanup * improve deepspeed fixes * Improve * make style --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add omegaconf for tests (#3400) Add omegaconfg * Fix various bugs with LoRA Dreambooth and Dreambooth script (#3353) * Improve checkpointing lora * fix more * Improve doc string * Update src/diffusers/loaders.py * make stytle * Apply suggestions from code review * Update src/diffusers/loaders.py * Apply suggestions from code review * Apply suggestions from code review * better * Fix all * Fix multi-GPU dreambooth * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix all * make style * make style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix docker file (#3402) * up * up * fix: deepseepd_plugin retrieval from accelerate state (#3410) * [Docs] Add `sigmoid` beta_scheduler to docstrings of relevant Schedulers (#3399) * Add `sigmoid` beta scheduler to `DDPMScheduler` docstring * Add `sigmoid` beta scheduler to `RePaintScheduler` docstring --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Don't install accelerate and transformers from source (#3415) * Don't install transformers and accelerate from source (#3414) * Improve fast tests (#3416) Update pr_tests.yml * attention refactor: the trilogy (#3387) * Replace `AttentionBlock` with `Attention` * use _from_deprecated_attn_block check re: @patrickvonplaten * [Docs] update the PT 2.0 optimization doc with latest findings (#3370) * add: benchmarking stats for A100 and V100. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address patrick's comments. * add: rtx 4090 stats * ⚔ benchmark reports done * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * 3313 pr link. * add: plots. Co-authored-by: Pedro <pedro@huggingface.co> * fix formattimg * update number percent. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix style rendering (#3433) * Fix style rendering. * Fix typo * unCLIP scheduler do not use note (#3417) * Replace deprecated command with environment file (#3409) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix warning message pipeline loading (#3446) * add stable diffusion tensorrt img2img pipeline (#3419) * add stable diffusion tensorrt img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update docstrings Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * Refactor controlnet and add img2img and inpaint (#3386) * refactor controlnet and add img2img and inpaint * First draft to get pipelines to work * make style * Fix more * Fix more * More tests * Fix more * Make inpainting work * make style and more tests * Apply suggestions from code review * up * make style * Fix imports * Fix more * Fix more * Improve examples * add test * Make sure import is correctly deprecated * Make sure everything works in compile mode * make sure authorship is correctly attributed * [Scheduler] DPM-Solver (++) Inverse Scheduler (#3335) * Add DPM-Solver Multistep Inverse Scheduler * Add draft tests for DiffEdit * Add inverse sde-dpmsolver steps to tune image diversity from inverted latents * Fix tests --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * [Docs] Fix incomplete docstring for resnet.py (#3438) Fix incomplete docstrings for resnet.py * fix tiled vae blend extent range (#3384) fix tiled vae bleand extent range * Small update to "Next steps" section (#3443) Small update to "Next steps" section: - PyTorch 2 is recommended. - Updated improvement figures. * Allow arbitrary aspect ratio in IFSuperResolutionPipeline (#3298) * Update pipeline_if_superresolution.py Allow arbitrary aspect ratio in IFSuperResolutionPipeline by using the input image shape * IFSuperResolutionPipeline: allow the user to override the height and width through the arguments * update IFSuperResolutionPipeline width/height doc string to match StableDiffusionInpaintPipeline conventions --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Adding 'strength' parameter to StableDiffusionInpaintingPipeline (#3424) * Added explanation of 'strength' parameter * Added get_timesteps function which relies on new strength parameter * Added `strength` parameter which defaults to 1. * Swapped ordering so `noise_timestep` can be calculated before masking the image this is required when you aren't applying 100% noise to the masked region, e.g. strength < 1. * Added strength to check_inputs, throws error if out of range * Changed `prepare_latents` to initialise latents w.r.t strength inspired from the stable diffusion img2img pipeline, init latents are initialised by converting the init image into a VAE latent and adding noise (based upon the strength parameter passed in), e.g. random when strength = 1, or the init image at strength = 0. * WIP: Added a unit test for the new strength parameter in the StableDiffusionInpaintingPipeline still need to add correct regression values * Created a is_strength_max to initialise from pure random noise * Updated unit tests w.r.t new strength parameter + fixed new strength unit test * renamed parameter to avoid confusion with variable of same name * Updated regression values for new strength test - now passes * removed 'copied from' comment as this method is now different and divergent from the cpy * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Ensure backwards compatibility for prepare_mask_and_masked_image created a return_image boolean and initialised to false * Ensure backwards compatibility for prepare_latents * Fixed copy check typo * Fixes w.r.t backward compibility changes * make style * keep function argument ordering same for backwards compatibility in callees with copied from statements * make fix-copies --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com> * [WIP] Bugfix - Pipeline.from_pretrained is broken when the pipeline is partially downloaded (#3448) Added bugfix using f strings. * Fix gradient checkpointing bugs in freezing part of models (requires_grad=False) (#3404) * gradient checkpointing bug fix * bug fix; changes for reviews * reformat * reformat --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Make dreambooth lora more robust to orig unet (#3462) * Make dreambooth lora more robust to orig unet * up * Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463) Release large tensors in attention (as soon as they're no longer required). Reduces peak VRAM by nearly 2 GB for 1024x1024 (even after slicing), and the savings scale up with image size. * Add min snr to text2img lora training script (#3459) add min snr to text2img lora training script * Add inpaint lora scale support (#3460) * add inpaint lora scale support * add inpaint lora scale test --------- Co-authored-by: yueyang.hyy <yueyang.hyy@alibaba-inc.com> * [From ckpt] Fix from_ckpt (#3466) * Correct from_ckpt * make style * Update full dreambooth script to work with IF (#3425) * Add IF dreambooth docs (#3470) * parameterize pass single args through tuple (#3477) * attend and excite tests disable determinism on the class level (#3478) * dreambooth docs torch.compile note (#3471) * dreambooth docs torch.compile note * Update examples/dreambooth/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * add: if entry in the dreambooth training docs. (#3472) * [docs] Textual inversion inference (#3473) * add textual inversion inference to docs * add to toctree --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * [docs] Distributed inference (#3376) * distributed inference * move to inference section * apply feedback * update with split_between_processes * apply feedback * [{Up,Down}sample1d] explicit view kernel size as number elements in flattened indices (#3479) explicit view kernel size as number elements in flattened indices * mps & onnx tests rework (#3449) * Remove ONNX tests from PR. They are already a part of push_tests.yml. * Remove mps tests from PRs. They are already performed on push. * Fix workflow name for fast push tests. * Extract mps tests to a workflow. For better control/filtering. * Remove --extra-index-url from mps tests * Increase tolerance of mps test This test passes in my Mac (Ventura 13.3) but fails in the CI hardware (Ventura 13.2). I ran the local tests following the same steps that exist in the CI workflow. * Temporarily run mps tests on pr So we can test. * Revert "Temporarily run mps tests on pr" Tests passed, go back to running on push. --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: Ilia Larchenko <41329713+IliaLarchenko@users.noreply.github.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: yiyixuxu <yixu@yis-macbook-pro.lan> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Horace He <horacehe2007@yahoo.com> Co-authored-by: Umar <55330742+mu94-csl@users.noreply.github.com> Co-authored-by: Mylo <36931363+gitmylo@users.noreply.github.com> Co-authored-by: Markus Pobitzer <markuspobitzer@gmail.com> Co-authored-by: Cheng Lu <lucheng.lc15@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Isamu Isozaki <isamu.website@gmail.com> Co-authored-by: Cesar Aybar <csaybar@gmail.com> Co-authored-by: Will Rice <will@spokestack.io> Co-authored-by: Adrià Arrufat <1671644+arrufat@users.noreply.github.com> Co-authored-by: Sanchit Gandhi <93869735+sanchit-gandhi@users.noreply.github.com> Co-authored-by: At-sushi <dkahw210@kyoto.zaq.ne.jp> Co-authored-by: Lucca Zenóbio <luccazen@gmail.com> Co-authored-by: Lysandre Debut <lysandre@huggingface.co> Co-authored-by: Isotr0py <41363108+Isotr0py@users.noreply.github.com> Co-authored-by: pdoane <pdoane2@gmail.com> Co-authored-by: Will Berman <wlbberman@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Rupert Menneer <71332436+rupertmenneer@users.noreply.github.com> Co-authored-by: sudowind <wfpkueecs@163.com> Co-authored-by: Takuma Mori <takuma104@gmail.com> Co-authored-by: Stas Bekman <stas00@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Laureηt <laurentfainsin@protonmail.com> Co-authored-by: Jongwoo Han <jongwooo.han@gmail.com> Co-authored-by: asfiyab-nvidia <117682710+asfiyab-nvidia@users.noreply.github.com> Co-authored-by: clarencechen <clarencechenct@gmail.com> Co-authored-by: Laureηt <laurent@fainsin.bzh> Co-authored-by: superlabs-dev <133080491+superlabs-dev@users.noreply.github.com> Co-authored-by: Dev Aggarwal <devxpy@gmail.com> Co-authored-by: Vimarsh Chaturvedi <vimarsh.c@gmail.com> Co-authored-by: 7eu7d7 <31194890+7eu7d7@users.noreply.github.com> Co-authored-by: cmdr2 <shashank.shekhar.global@gmail.com> Co-authored-by: wfng92 <43742196+wfng92@users.noreply.github.com> Co-authored-by: Glaceon-Hyy <ffheyy0017@gmail.com> Co-authored-by: yueyang.hyy <yueyang.hyy@alibaba-inc.com>	2023-05-22 16:11:08 +01:00
Patrick von Platen	2b56e8ca68	make style	2023-05-22 16:49:46 +02:00
Ambrosiussen	b8b5daaee3	DataLoader respecting EXIF data in Training Images (#3465 ) * DataLoader will now bake in any transforms or image manipulations contained in the EXIF Images may have rotations stored in EXIF. Training using such images will cause those transforms to be ignored while training and thus produce unexpected results * Fixed the Dataloading EXIF issue in main DreamBooth training as well * Run make style (black & isort)	2023-05-22 15:49:35 +01:00
Seongsu Park	229fd8cbca	[Docs] Korean translation (optimization, training) (#3488 ) * feat) optimization kr translation * fix) typo, italic setting * feat) dreambooth, text2image kr * feat) lora kr * fix) LoRA * fix) fp16 fix * fix) doc-builder style * fix) fp16 일부 단어 수정 * fix) fp16 style fix * fix) opt, training docs update * feat) toctree update * feat) toctree update --------- Co-authored-by: Chanran Kim <seriousran@gmail.com>	2023-05-22 15:46:16 +01:00
Patrick von Platen	a2874af297	make style	2023-05-22 16:44:48 +02:00
w4ffl35	0160e5146f	Adds local_files_only bool to prevent forced online connection (#3486 )	2023-05-22 15:44:36 +01:00
Isotr0py	194b0a425d	Add `use_Karras_sigmas` to DPMSolverSinglestepScheduler (#3476 ) * add use_karras_sigmas * add karras test * add doc	2023-05-22 15:43:56 +01:00
Patrick von Platen	6dd3871ae0	Fix DPM single (#3413 ) * Fix DPM single * add test * fix one more bug * Apply suggestions from code review Co-authored-by: StAlKeR7779 <stalkek7779@yandex.ru> --------- Co-authored-by: StAlKeR7779 <stalkek7779@yandex.ru>	2023-05-22 14:32:39 +01:00
Patrick von Platen	51843fd7d0	Refactor full determinism (#3485 ) * up * fix more * Apply suggestions from code review * fix more * fix more * Check it * Remove 16:8 * fix more * fix more * fix more * up * up * Test only stable diffusion * Test only two files * up * Try out spinning up processes that can be killed * up * Apply suggestions from code review * up * up	2023-05-22 11:15:11 +01:00
Sayak Paul	49ad61c204	[Docs] add note on local directory path. (#3397 ) add note on local directory path. Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-21 15:26:56 +05:30
Sayak Paul	4bbc51d94d	[Attention processor] Better warning message when shifting to `AttnProcessor2_0` (#3457 ) * add: debugging to enabling memory efficient processing * add: better warning message.	2023-05-21 15:26:47 +05:30
Pedro Cuenca	f7b4f51cc2	mps & onnx tests rework (#3449 ) * Remove ONNX tests from PR. They are already a part of push_tests.yml. * Remove mps tests from PRs. They are already performed on push. * Fix workflow name for fast push tests. * Extract mps tests to a workflow. For better control/filtering. * Remove --extra-index-url from mps tests * Increase tolerance of mps test This test passes in my Mac (Ventura 13.3) but fails in the CI hardware (Ventura 13.2). I ran the local tests following the same steps that exist in the CI workflow. * Temporarily run mps tests on pr So we can test. * Revert "Temporarily run mps tests on pr" Tests passed, go back to running on push.	2023-05-20 13:43:07 +02:00
Will Berman	85eff637aa	[{Up,Down}sample1d] explicit view kernel size as number elements in flattened indices (#3479 ) explicit view kernel size as number elements in flattened indices	2023-05-19 10:45:56 -07:00
Steven Liu	e589bdb956	[docs] Distributed inference (#3376 ) * distributed inference * move to inference section * apply feedback * update with split_between_processes * apply feedback	2023-05-19 10:07:33 -07:00
Steven Liu	00c76f6ff1	[docs] Textual inversion inference (#3473 ) * add textual inversion inference to docs * add to toctree --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-05-19 09:47:27 -07:00
Sayak Paul	e343443565	add: if entry in the dreambooth training docs. (#3472 )	2023-05-19 07:47:28 +05:30
Will Berman	8d646f2294	dreambooth docs torch.compile note (#3471 ) * dreambooth docs torch.compile note * Update examples/dreambooth/README.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/dreambooth/README.md Co-authored-by: Pedro Cuenca <pedro@huggingface.co> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-19 07:40:14 +05:30
Will Berman	8917769499	attend and excite tests disable determinism on the class level (#3478 )	2023-05-18 10:24:49 -07:00
Will Berman	49b7ccfb96	parameterize pass single args through tuple (#3477 )	2023-05-18 10:14:29 -07:00
Will Berman	7200985eab	Add IF dreambooth docs (#3470 )	2023-05-17 11:56:10 -07:00
Will Berman	c9f939bf98	Update full dreambooth script to work with IF (#3425 )	2023-05-17 10:42:20 -07:00
Patrick von Platen	2858d7e15e	[From ckpt] Fix from_ckpt (#3466 ) * Correct from_ckpt * make style	2023-05-17 13:26:53 +01:00
Glaceon-Hyy	88295f92d9	Add inpaint lora scale support (#3460 ) * add inpaint lora scale support * add inpaint lora scale test --------- Co-authored-by: yueyang.hyy <yueyang.hyy@alibaba-inc.com>	2023-05-17 16:58:19 +05:30
wfng92	2faf91dbde	Add min snr to text2img lora training script (#3459 ) add min snr to text2img lora training script	2023-05-17 16:37:45 +05:30
cmdr2	bd78f63a54	Reduce peak VRAM by releasing large attention tensors (as soon as they're unnecessary) (#3463 ) Release large tensors in attention (as soon as they're no longer required). Reduces peak VRAM by nearly 2 GB for 1024x1024 (even after slicing), and the savings scale up with image size.	2023-05-17 11:24:59 +01:00
Patrick von Platen	3ebd2d1f9e	Make dreambooth lora more robust to orig unet (#3462 ) * Make dreambooth lora more robust to orig unet * up	2023-05-17 11:20:13 +01:00
7eu7d7	15f1bab13b	Fix gradient checkpointing bugs in freezing part of models (requires_grad=False) (#3404 ) * gradient checkpointing bug fix * bug fix; changes for reviews * reformat * reformat --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-17 11:06:04 +01:00
Vimarsh Chaturvedi	415c616712	[WIP] Bugfix - Pipeline.from_pretrained is broken when the pipeline is partially downloaded (#3448 ) Added bugfix using f strings.	2023-05-17 11:05:33 +01:00
Rupert Menneer	c09c4f3ab7	Adding 'strength' parameter to StableDiffusionInpaintingPipeline (#3424 ) * Added explanation of 'strength' parameter * Added get_timesteps function which relies on new strength parameter * Added `strength` parameter which defaults to 1. * Swapped ordering so `noise_timestep` can be calculated before masking the image this is required when you aren't applying 100% noise to the masked region, e.g. strength < 1. * Added strength to check_inputs, throws error if out of range * Changed `prepare_latents` to initialise latents w.r.t strength inspired from the stable diffusion img2img pipeline, init latents are initialised by converting the init image into a VAE latent and adding noise (based upon the strength parameter passed in), e.g. random when strength = 1, or the init image at strength = 0. * WIP: Added a unit test for the new strength parameter in the StableDiffusionInpaintingPipeline still need to add correct regression values * Created a is_strength_max to initialise from pure random noise * Updated unit tests w.r.t new strength parameter + fixed new strength unit test * renamed parameter to avoid confusion with variable of same name * Updated regression values for new strength test - now passes * removed 'copied from' comment as this method is now different and divergent from the cpy * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion_inpaint.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Ensure backwards compatibility for prepare_mask_and_masked_image created a return_image boolean and initialised to false * Ensure backwards compatibility for prepare_latents * Fixed copy check typo * Fixes w.r.t backward compibility changes * make style * keep function argument ordering same for backwards compatibility in callees with copied from statements * make fix-copies --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com>	2023-05-17 11:05:16 +01:00
Dev Aggarwal	6070b32fcf	Allow arbitrary aspect ratio in IFSuperResolutionPipeline (#3298 ) * Update pipeline_if_superresolution.py Allow arbitrary aspect ratio in IFSuperResolutionPipeline by using the input image shape * IFSuperResolutionPipeline: allow the user to override the height and width through the arguments * update IFSuperResolutionPipeline width/height doc string to match StableDiffusionInpaintPipeline conventions --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-16 19:21:07 -07:00
Pedro Cuenca	0392eceba8	Small update to "Next steps" section (#3443 ) Small update to "Next steps" section: - PyTorch 2 is recommended. - Updated improvement figures.	2023-05-16 19:35:47 +01:00
superlabs-dev	92ea5baca2	fix tiled vae blend extent range (#3384 ) fix tiled vae bleand extent range	2023-05-16 19:33:47 +01:00
Laureηt	754fac82d2	[Docs] Fix incomplete docstring for resnet.py (#3438 ) Fix incomplete docstrings for resnet.py	2023-05-16 19:33:34 +01:00
clarencechen	17f9aed79c	[Scheduler] DPM-Solver (++) Inverse Scheduler (#3335 ) * Add DPM-Solver Multistep Inverse Scheduler * Add draft tests for DiffEdit * Add inverse sde-dpmsolver steps to tune image diversity from inverted latents * Fix tests --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-16 19:26:53 +01:00
Patrick von Platen	886575ee43	Refactor controlnet and add img2img and inpaint (#3386 ) * refactor controlnet and add img2img and inpaint * First draft to get pipelines to work * make style * Fix more * Fix more * More tests * Fix more * Make inpainting work * make style and more tests * Apply suggestions from code review * up * make style * Fix imports * Fix more * Fix more * Improve examples * add test * Make sure import is correctly deprecated * Make sure everything works in compile mode * make sure authorship is correctly attributed	2023-05-16 19:07:21 +01:00
asfiyab-nvidia	9d44e2fb66	add stable diffusion tensorrt img2img pipeline (#3419 ) * add stable diffusion tensorrt img2img pipeline Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> * update docstrings Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> --------- Signed-off-by: Asfiya Baig <asfiyab@nvidia.com>	2023-05-16 14:28:01 +01:00
Patrick von Platen	d2285f5158	fix warning message pipeline loading (#3446 )	2023-05-16 12:58:24 +01:00
Jongwoo Han	326f326e17	Replace deprecated command with environment file (#3409 ) Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-16 12:51:10 +01:00
Will Berman	29b1325a5a	unCLIP scheduler do not use note (#3417 )	2023-05-15 09:47:14 -06:00
Pedro Cuenca	7a32b6beeb	Fix style rendering (#3433 ) * Fix style rendering. * Fix typo	2023-05-15 14:32:34 +05:30
Sayak Paul	bdefabd1a8	[Docs] update the PT 2.0 optimization doc with latest findings (#3370 ) * add: benchmarking stats for A100 and V100. * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * address patrick's comments. * add: rtx 4090 stats * ⚔ benchmark reports done * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * 3313 pr link. * add: plots. Co-authored-by: Pedro <pedro@huggingface.co> * fix formattimg * update number percent. --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-13 15:12:01 +05:30
Will Berman	909742dbd6	attention refactor: the trilogy (#3387 ) * Replace `AttentionBlock` with `Attention` * use _from_deprecated_attn_block check re: @patrickvonplaten	2023-05-12 08:54:09 -06:00
Patrick von Platen	28f404349d	Improve fast tests (#3416 ) Update pr_tests.yml	2023-05-12 14:01:03 +01:00
Patrick von Platen	03e5126978	Don't install transformers and accelerate from source (#3414 )	2023-05-12 13:15:23 +01:00
Patrick von Platen	b1b92f4a98	Don't install accelerate and transformers from source (#3415 )	2023-05-12 13:14:04 +01:00
Laureηt	7f6373d264	[Docs] Add `sigmoid` beta_scheduler to docstrings of relevant Schedulers (#3399 ) * Add `sigmoid` beta scheduler to `DDPMScheduler` docstring * Add `sigmoid` beta scheduler to `RePaintScheduler` docstring --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-05-12 12:48:26 +01:00
Sayak Paul	3a237f4fa2	fix: deepseepd_plugin retrieval from accelerate state (#3410 )	2023-05-12 10:02:22 +01:00
Patrick von Platen	1a5797c6d4	Fix docker file (#3402 ) * up * up	2023-05-11 20:28:37 +01:00
Patrick von Platen	f92253015c	Fix various bugs with LoRA Dreambooth and Dreambooth script (#3353 ) * Improve checkpointing lora * fix more * Improve doc string * Update src/diffusers/loaders.py * make stytle * Apply suggestions from code review * Update src/diffusers/loaders.py * Apply suggestions from code review * Apply suggestions from code review * better * Fix all * Fix multi-GPU dreambooth * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Fix all * make style * make style --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-05-11 19:28:09 +01:00
Patrick von Platen	58c6f9cb71	Add omegaconf for tests (#3400 ) Add omegaconfg	2023-05-11 18:03:27 +01:00

1 2 3 4 5 ...

2351 Commits