diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Files

MatthieuTPHR 98c42134a5 Up to 2x speedup on GPUs using memory efficient attention (#532 )

* 2x speedup using memory efficient attention

* remove einops dependency

* Swap K, M in op instantiation

* Simplify code, remove unnecessary maybe_init call and function, remove unused self.scale parameter

* make xformers a soft dependency

* remove one-liner functions

* change one letter variable to appropriate names

* Remove Env variable dependency, remove MemoryEfficientCrossAttention class and use enable_xformers_memory_efficient_attention method

* Add memory efficient attention toggle to img2img and inpaint pipelines

* Clearer management of xformers' availability

* update optimizations markdown to add info about memory efficient attention

* add benchmarks for TITAN RTX

* More detailed explanation of how the mem eff benchmark were ran

* Removing autocast from optimization markdown

* import_utils: import torch only if is available

Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>

2022-11-02 10:29:06 +01:00

fp16.mdx

Up to 2x speedup on GPUs using memory efficient attention (#532 )

2022-11-02 10:29:06 +01:00

mps.mdx

mps changes for PyTorch 1.13 (#926 )

2022-10-25 16:41:51 +02:00

onnx.mdx

v1-5 docs updates (#921 )

2022-10-24 22:50:23 +02:00

open_vino.mdx

[Docs] Minor fixes in optimization section (#420 )

2022-09-08 13:13:46 +02:00