diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Suraj Patil	7482178162	default fast model loading 🔥 (#1115 ) * make accelerate hard dep * default fast init * move params to cpu when device map is None * handle device_map=None * handle torch < 1.9 * remove device_map="auto" * style * add accelerate in torch extra * remove accelerate from extras["test"] * raise an error if torch is available but not accelerate * update installation docs * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * improve defautl loading speed even further, allow disabling fats loading * address review comments * adapt the tests * fix test_stable_diffusion_fast_load * fix test_read_init * temp fix for dummy checks * Trigger Build * Apply suggestions from code review Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-11-03 17:25:57 +01:00
Will Berman	ef2ea33c3b	VQ-diffusion (#658 ) * Changes for VQ-diffusion VQVAE Add specify dimension of embeddings to VQModel: `VQModel` will by default set the dimension of embeddings to the number of latent channels. The VQ-diffusion VQVAE has a smaller embedding dimension, 128, than number of latent channels, 256. Add AttnDownEncoderBlock2D and AttnUpDecoderBlock2D to the up and down unet block helpers. VQ-diffusion's VQVAE uses those two block types. * Changes for VQ-diffusion transformer Modify attention.py so SpatialTransformer can be used for VQ-diffusion's transformer. SpatialTransformer: - Can now operate over discrete inputs (classes of vector embeddings) as well as continuous. - `in_channels` was made optional in the constructor so two locations where it was passed as a positional arg were moved to kwargs - modified forward pass to take optional timestep embeddings ImagePositionalEmbeddings: - added to provide positional embeddings to discrete inputs for latent pixels BasicTransformerBlock: - norm layers were made configurable so that the VQ-diffusion could use AdaLayerNorm with timestep embeddings - modified forward pass to take optional timestep embeddings CrossAttention: - now may optionally take a bias parameter for its query, key, and value linear layers FeedForward: - Internal layers are now configurable ApproximateGELU: - Activation function in VQ-diffusion's feedforward layer AdaLayerNorm: - Norm layer modified to incorporate timestep embeddings * Add VQ-diffusion scheduler * Add VQ-diffusion pipeline * Add VQ-diffusion convert script to diffusers * Add VQ-diffusion dummy objects * Add VQ-diffusion markdown docs * Add VQ-diffusion tests * some renaming * some fixes * more renaming * correct * fix typo * correct weights * finalize * fix tests * Apply suggestions from code review Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * finish * finish * up Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-11-03 16:10:28 +01:00
Pedro Cuenca	269109dbfb	Continuation of #1035 (#1120 ) * remove batch size from repeat * repeat empty string if uncond_tokens is none * fix inpaint pipes * return back whitespace to pass code quality * Apply suggestions from code review * Fix typos. Co-authored-by: Had <had-95@yandex.ru>	2022-11-03 15:49:20 +01:00
Revist	d38c804320	feat: add repaint (#974 ) * feat: add repaint * fix: fix quality check with `make fix-copies` * fix: remove old unnecessary arg * chore: change default to DDPM (looks better in experiments) * ".to(device)" changed to "device=" Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * make generator device-specific Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * make generator device-specific and change shape Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * fix: add preprocessing for image and mask Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * fix: update test Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update src/diffusers/pipelines/repaint/pipeline_repaint.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Add docs and examples * Fix toctree Co-authored-by: fja <fja@zurich.ibm.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-11-03 15:42:46 +01:00
Anton Lozhkov	4a38166afe	Allow saving `None` pipeline components (#1118 ) * Allow saving `None` pipeline components * support flax as well * style	2022-11-03 15:41:33 +01:00
Anton Lozhkov	0edf9ca082	Fix hub-dependent tests for PRs (#1119 ) * Remove the hub token * replace repos * style	2022-11-03 15:24:32 +01:00
Patrick von Platen	c39a511b5f	[Loading] Ignore unneeded files (#1107 ) * [Loading] Ignore unneeded files * up	2022-11-02 19:20:42 +01:00
Denis	cbcd0512f0	Training to predict x0 in training example (#1031 ) * changed training example to add option to train model that predicts x0 (instead of eps), changed DDPM pipeline accordingly * Revert "changed training example to add option to train model that predicts x0 (instead of eps), changed DDPM pipeline accordingly" This reverts commit `c5efb52564`. * changed training example to add option to train model that predicts x0 (instead of eps), changed DDPM pipeline accordingly * fixed code style Co-authored-by: lukovnikov <lukovnikov@users.noreply.github.com>	2022-11-02 17:43:40 +01:00
Kashif Rasul	0b61cea347	[Flax] time embedding (#1081 ) * initial get_sinusoidal_embeddings * added asserts * better var name * fix docs	2022-11-02 16:54:30 +01:00
Yuta Hayashibe	33c487455e	Fix padding in dreambooth (#1030 )	2022-11-02 16:37:05 +01:00
Grigory Sizov	5cd29d623a	Fix tests for equivalence of DDIM and DDPM pipelines (#1069 ) * Fix equality test for ddim and ddpm * add docs for use_clipped_model_output in DDIM * fix inline comment * reorder imports in test_pipelines.py * Ignore use_clipped_model_output if scheduler doesn't take it	2022-11-02 14:50:32 +01:00
Omiita	1216a3b122	Fix a small typo of a variable name (#1063 ) Fix a small typo fix a typo in `models/attention.py`. weight -> width	2022-11-02 14:46:52 +01:00
Anton Lozhkov	4e59bcc680	[CI] Framework and hardware-specific CI tests (#997 ) * [WIP][CI] Framework and hardware-specific docker images for CI tests * username * fix cpu * try out the image * push latest * update workspace * no root isolation for actions * add a flax image * flax and onnx matrix * fix runners * add reports * onnxruntime image * retry tpu * fix * fix * build onnxruntime * naming * onnxruntime-gpu image * onnxruntime-gpu image, slow tests * latest jax version * trigger flax * run flax tests in one thread * fast flax tests on cpu * fast flax tests on cpu * trigger slow tests * rebuild torch cuda * force cuda provider * fix onnxruntime tests * trigger slow * don't specify gpu for tpu * optimize * memory limit * fix flax tests * disable docker cache	2022-11-02 14:07:07 +01:00
Suraj Patil	b1ec61ee45	fix model card url in text inversion readme. (#1103 ) Update README.md	2022-11-02 14:02:52 +01:00
Jonathan Rahn	0025626cd9	fix typo in examples dreambooth README.md (#1073 ) Update README.md fixed typo	2022-11-02 13:15:30 +01:00
Patrick von Platen	d53ffbbdf4	Rename latent (#1102 ) * Rename latent * uP	2022-11-02 11:59:00 +01:00
rafael	bdbcaa9852	lpw_stable_diffusion: Add is_cancelled_callback (#1053 ) * [Community Pipelines] lpw_stable_diffusion: Add is_cancelled_callback * [Community pipelines] lpw_stable_diffusion_onnx: Add is_cancelled_callback	2022-11-02 11:51:18 +01:00
Lewington-pitsos	8ee21915bf	Integration tests precision improvement for inpainting (#1052 ) * improve test precision get tests passing with greater precision using lewington images * make old numpy load function a wrapper around a more flexible numpy loading function * adhere to black formatting * add more black formatting * adhere to isort * loosen precision and replace path Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-11-02 11:47:26 +01:00
Suraj Patil	8608795711	[docs] add euler scheduler in docs, how to use differnet schedulers (#1089 ) * add euler scheduler in docs * add a section for how to use different scheds * address patrck's comments	2022-11-02 11:32:46 +01:00
MatthieuTPHR	98c42134a5	Up to 2x speedup on GPUs using memory efficient attention (#532 ) * 2x speedup using memory efficient attention * remove einops dependency * Swap K, M in op instantiation * Simplify code, remove unnecessary maybe_init call and function, remove unused self.scale parameter * make xformers a soft dependency * remove one-liner functions * change one letter variable to appropriate names * Remove Env variable dependency, remove MemoryEfficientCrossAttention class and use enable_xformers_memory_efficient_attention method * Add memory efficient attention toggle to img2img and inpaint pipelines * Clearer management of xformers' availability * update optimizations markdown to add info about memory efficient attention * add benchmarks for TITAN RTX * More detailed explanation of how the mem eff benchmark were ran * Removing autocast from optimization markdown * import_utils: import torch only if is available Co-authored-by: Nouamane Tazi <nouamane98@gmail.com>	2022-11-02 10:29:06 +01:00
MarkRich	a793b1fe7e	Add imagic to community pipelines (#958 ) * initial commit to add imagic to stable diffusion community pipelines * remove some testing changes * comments from PR review for imagic stable diffusion * remove changes from pipeline_stable_diffusion as part of imagic pipeline * clean up example code and add line back in to pipeline_stable_diffusion for imagic pipeline * remove unused functions * small code quality changes for imagic pipeline * clean up readme * remove hardcoded logging values for imagic community example * undo change for DDIMScheduler	2022-11-01 11:17:51 +01:00
Laurent Mazare	7fb4b882b9	Remove some unused parameter in CrossAttnUpBlock2D (#1034 ) Remove some unused parameter The `downsample_padding` parameter does not seem to be used in `CrossAttnUpBlock2D` (or by any up block for that matter) so removing it.	2022-10-31 19:15:15 +01:00
Patrick von Platen	888468dd90	Remove nn sequential (#1086 ) * Remove nn sequential * up	2022-10-31 19:01:42 +01:00
Patrick von Platen	17c2c0600b	[Tests] Fix slow tests (#1087 )	2022-10-31 18:59:58 +01:00
Patrick von Platen	010bc4ea19	incorrect model id	2022-10-31 16:35:59 +00:00
Patrick von Platen	c18941b01a	[Better scheduler docs] Improve usage examples of schedulers (#890 ) * [Better scheduler docs] Improve usage examples of schedulers * finish * fix warnings and add test * finish * more replacements * adapt fast tests hf token * correct more * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Integrate compatibility with euler Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-10-31 17:26:30 +01:00
hlky	a1ea8c01c3	k-diffusion-euler (#1019 ) * k-diffusion-euler * make style make quality * make fix-copies * fix tests for euler a * Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update src/diffusers/schedulers/scheduling_euler_ancestral_discrete.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update src/diffusers/schedulers/scheduling_euler_discrete.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * Update src/diffusers/schedulers/scheduling_euler_discrete.py Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * remove unused arg and method * update doc * quality * make flake happy * use logger instead of warn * raise error instead of deprication * don't require scipy * pass generator in step * fix tests * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Update tests/test_scheduler.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * remove unused generator * pass generator as extra_step_kwargs * update tests * pass generator as kwarg * pass generator as kwarg * quality * fix test for lms * fix tests Co-authored-by: patil-suraj <surajp815@gmail.com> Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-10-31 16:20:38 +01:00
Pedro Cuenca	bf7b0bc25b	Allow `safety_checker` to be `None` when using CPU offload (#1078 ) Allow None safety_checker when using CPU offload.	2022-10-31 15:03:33 +01:00
Patrick von Platen	e4d264e4eb	[GitBot] Automatically close issues after inactivitiy (#1079 ) * [GitBot] Automatically close issues after inactivitiy * improve * Add unstale * typo Co-authored-by: anton-l <anton@huggingface.co>	2022-10-31 14:06:03 +01:00
Anton Lozhkov	1606eb994a	Fix pipelines user_agent, ignore CI requests (#1058 ) * Fix pipelines user_agent, ignore CI requests * fix circular import * N/A versions * N/A versions	2022-10-31 13:38:43 +01:00
Patrick von Platen	82d56cf192	Merge branch 'main' of https://github.com/huggingface/diffusers into main	2022-10-31 09:13:40 +00:00
Patrick von Platen	707b8684b3	fix slow test	2022-10-31 09:13:37 +00:00
Jonatan Kłosko	8e4fd686e0	Move safety detection to model call in Flax safety checker (#1023 ) * Move safety detection to model call in Flax safety checker * Update src/diffusers/pipelines/stable_diffusion/safety_checker_flax.py	2022-10-30 20:07:55 +01:00
Pedro Cuenca	95414bd6bf	Experimental: allow fp16 in `mps` (#961 ) * Docs: refer to pre-RC version of PyTorch 1.13.0. * Remove temporary workaround for unavailable op. * Update comment to make it less ambiguous. * Remove use of contiguous in mps. It appears to not longer be necessary. * Special case: use einsum for much better performance in mps * Update mps docs. * MPS: make pipeline work in half precision.	2022-10-29 21:09:32 +02:00
Pedro Cuenca	a59f9990fc	Tests: upgrade PyTorch cuda to 11.7 to fix examples tests. (#1048 ) Tests: upgrade PyTorch cuda to 11.7. Otherwise the cuda versions of torch and torchvision mismatch, and examples tests fail. We were requesting cuda 11.6 for PyTorch, and the default torchvision (via setup.py). Another option would be to include torchvision in the same pip install line as torch.	2022-10-29 20:27:00 +02:00
MarkRich	1fc208825d	Add seed resizing to community pipelines (#1011 ) * add seed resizing to community examples * actually add the file responsible for seed resizing	2022-10-29 09:31:42 +02:00
Nathan Lambert	12fd0736dc	clean incomplete pages (#1008 )	2022-10-29 09:28:26 +02:00
Minwoo Byeon	fc0ca47456	Fix speedup ratio in fp16.mdx (#837 )	2022-10-29 09:26:23 +02:00
Pedro Cuenca	6b185b6acd	Update training and fine-tuning docs (#1020 ) * Update training and fine-tuning docs. * Update examples README. * Update README. * Add Flax fine-tuning section. * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> * Accept suggestion Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co>	2022-10-28 21:02:08 +02:00
Patrick von Platen	81b6fbf19d	higher precision for vae	2022-10-28 18:19:06 +00:00
Patrick von Platen	a7ae808ee2	increase tolerance	2022-10-28 17:50:22 +00:00
Patrick von Platen	ea01a4c7f9	fix	2022-10-28 16:55:43 +00:00
Patrick von Platen	cbbb29398a	hot fix	2022-10-28 16:55:21 +00:00
Patrick von Platen	d37f08da72	[Tests] no random latents anymore (#1045 )	2022-10-28 18:52:25 +02:00
Patrick von Platen	c4ef1efe46	[Tests] Better prints (#1043 )	2022-10-28 17:38:31 +02:00
Patrick von Platen	8d6487f3cb	Fix some failing tests (#1041 ) * up * up * up * Update src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py * Apply suggestions from code review	2022-10-28 17:05:00 +02:00
Patrick von Platen	d2d9764f35	[Tests] Speed up slow tests (#1040 ) * [Tests] Speed up slow tests * Up * up	2022-10-28 14:46:39 +02:00
Patrick von Platen	a80480f0f2	[Tests] Improve unet / vae tests (#1018 ) * improve tests * up * finish * upload * add init * up * finish vae * finish * reduce loading time with device_map * remove device_map from CPU * uP	2022-10-28 13:43:26 +02:00
Nouamane Tazi	ab079f27cf	fix `F.interpolate()` for large batch sizes (#1006 ) * fix `upsample_nearest_nhwc` for large bsz * fix `upsample_nearest_nhwc` for large bsz	2022-10-28 11:25:21 +02:00
Duong A. Nguyen	1e07b6b334	[Flax SD finetune] Fix dtype (#1038 ) fix jnp dtype	2022-10-28 11:21:34 +02:00

... 2 3 4 5 6 ...

1372 Commits