diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Patrick von Platen	69caa96472	fix slow test	2023-01-31 07:39:30 +00:00
Patrick von Platen	0c39f53cbb	Allow lora from pipeline (#2129 ) * [LoRA] All to use in inference with pipeline * [LoRA] allow cross attention kwargs passed to pipeline * finish	2023-01-27 08:19:46 +01:00
Patrick von Platen	09779cbb40	[Bump version] 0.13.0dev0 & Deprecate `predict_epsilon` (#2109 ) * [Bump version] 0.13 * Bump model up * up	2023-01-25 17:59:02 +01:00
Patrick von Platen	6ba2231d72	Reproducibility 3/3 (#1924 ) * make tests deterministic * run slow tests * prepare for testing * finish * refactor * add print statements * finish more * correct some test failures * more fixes * set up to correct tests * more corrections * up * fix more * more prints * add * up * up * up * uP * uP * more fixes * uP * up * up * up * up * fix more * up * up * clean tests * up * up * up * more fixes * Apply suggestions from code review Co-authored-by: Suraj Patil <surajp815@gmail.com> * make * correct * finish * finish Co-authored-by: Suraj Patil <surajp815@gmail.com>	2023-01-25 13:44:22 +01:00
Patrick von Platen	b562b6611f	Allow directly passing text embeddings to Stable Diffusion Pipeline for prompt weighting (#2071 ) * add text embeds to sd * add text embeds to sd * finish tests * finish * finish * make style * fix tests * make style * make style * up * better docs * fix * fix * new try * up * up * finish	2023-01-25 12:29:49 +01:00
Patrick von Platen	69c76173fa	fix tests	2023-01-22 14:31:05 +02:00
Patrick von Platen	926b34b40c	improve tests	2023-01-22 14:30:15 +02:00
Suraj Patil	aa265f74bd	[StableDiffusionInstructPix2Pix] use cpu generator in slow tests (#2051 ) * use cpu generator in slow tests * ifx get_inputs	2023-01-20 21:43:00 +02:00
Suraj Patil	e5ff75540c	Add InstructPix2Pix pipeline (#2040 ) * being pix2pix * ifx * cfg image_latents * fix some docstr * fix * fix * hack * fix * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * add comments to explain the hack * move __call__ to the top * doc * remove height and width * remove depreications * fix doc str * quality * fast tests * chnage model id * fast tests * fix test * address Pedro's comments * copyright * Simple doc page. * Apply suggestions from code review * style * Remove import * address some review comments * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * style Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-20 16:25:46 +01:00
Patrick von Platen	013955b5a7	[Dit] Fix dit tests (#2034 ) * [Dit] Fix dit tests * up	2023-01-19 01:50:22 +01:00
Kashif Rasul	37d113cce7	DiT Pipeline (#1806 ) * added dit model * import * initial pipeline * initial convert script * initial pipeline * make style * raise valueerror * single function * rename classes * use DDIMScheduler * timesteps embedder * samples to cpu * fix var names * fix numpy type * use timesteps class for proj * fix typo * fix arg name * flip_sin_to_cos and better var names * fix C shape cal * make style * remove unused imports * cleanup * add back patch_size * initial dit doc * typo * Update docs/source/api/pipelines/dit.mdx Co-authored-by: Suraj Patil <surajp815@gmail.com> * added copyright license headers * added example usage and toc * fix variable names asserts * remove comment * added docs * fix typo * upstream changes * set proper device for drop_ids * added initial dit pipeline test * update docs * fix imports * make fix-copies * isort * fix imports * get rid of more magic numbers * fix code when guidance is off * remove block_kwargs * cleanup script * removed to_2tuple * use FeedForward class instead of another MLP * style * work on mergint DiTBlock with BasicTransformerBlock * added missing final_dropout and args to BasicTransformerBlock * use norm from block * fix arg * remove unused arg * fix call to class_embedder * use timesteps * make style * attn_output gets multiplied * removed commented code * use Transformer2D * use self.is_input_patches * fix flags * fixed conversion to use Transformer2DModel * fixes for pipeline * remove dit.py * fix timesteps device * use randn_tensor and fix fp16 inf. * timesteps_emb already the right dtype * fix dit test class * fix test and style * fix norm2 usage in vq-diffusion * added author names to pipeline and lmagenet labels link * fix tests * use norm_type as string * rename dit to transformer * fix name * fix test * set norm_type = "layer" by default * fix tests * do not skip common tests * Update src/diffusers/models/attention.py Co-authored-by: Suraj Patil <surajp815@gmail.com> * revert AdaLayerNorm API * fix norm_type name * make sure all components are in eval mode * revert norm2 API * compact * finish deprecation * add slow tests * remove @ * refactor some stuff * upload * Update src/diffusers/pipelines/dit/pipeline_dit.py * finish more * finish docs * improve docs * finish docs Co-authored-by: Suraj Patil <surajp815@gmail.com> Co-authored-by: William Berman <WLBberman@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-17 23:09:29 +01:00
Jerry Jiarui XU	a43bdd01cd	[Flax] Add Flax inpainting impl (#1966 ) * [Flax] Add Flax inpainting impl * fixed copies, add README.md * fixed README.md * add test * format * update README.md	2023-01-17 10:42:04 +01:00
Will Berman	07c0fe4b87	Use pipeline tests mixin for UnCLIP pipeline tests + unCLIP MPS fixes (#1908 ) re: https://github.com/huggingface/diffusers/issues/1857 We relax some of the checks to deal with unclip reproducibility issues. Mainly by checking the average pixel difference (measured w/in 0-255) instead of the max pixel difference (measured w/in 0-1). - [x] add mixin to UnCLIPPipelineFastTests - [x] add mixin to UnCLIPImageVariationPipelineFastTests - [x] Move UnCLIPPipeline flags in mixin to base class - [x] Small MPS fixes for F.pad and F.interpolate - [x] Made test unCLIP model's dimensions smaller to run tests faster	2023-01-16 15:21:58 +01:00
Vladimir Sotnikov	9b37ed33b5	[SD Img2Img] resize source images to multiple of 8 instead of 32 (#1571 ) * [Stable Diffusion Img2Img] resize source images to integer multiple of 8 instead of 32 * [Alt Diffusion Img2Img] resize source images to multiple of 8 instead of 32 * [Img2Img] fix AltDiffusion Img2Img resolution test * [Img2Img] add Stable Diffusion Img2Img resolution test * [Cycle Diffusion] round resolution to multiplies of 8 instead of 32 * [ONNX SD Img2Img] round resolution to multiplies of 64 instead of 32 * [SD Depth2Img] round resolution to multiplies of 8 instead of 32 * [Repaint] round resolution to multiplies of 8 instead of 32 * fix make style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-01-13 16:02:22 +01:00
Patrick von Platen	6d3adf6570	Fix slow tests (#1983 ) * [Slow tests] Fix tests * Update tests/pipelines/karras_ve/test_karras_ve.py	2023-01-12 18:24:51 +01:00
Patrick von Platen	9b63854886	Improve reproduceability 2/3 (#1906 ) * [Repro] Correct reproducability * up * up * uP * up * need better image * allow conversion from no state dict checkpoints * up * up * up * up * check tensors * check tensors * check tensors * check tensors * next try * up * up * better name * up * up * Apply suggestions from code review * correct more * up * replace all torch randn * fix * correct * correct * finish * fix more * up	2023-01-04 23:51:17 +01:00
Patrick von Platen	8ed08e4270	[Deterministic torch randn] Allow tensors to be generated on CPU (#1902 ) * [Deterministic torch randn] Allow tensors to be generated on CPU * fix more * up * fix more * up * Update src/diffusers/utils/torch_utils.py Co-authored-by: Anton Lozhkov <anton@huggingface.co> * Apply suggestions from code review * up * up * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Anton Lozhkov <anton@huggingface.co> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2023-01-03 18:22:40 +01:00
Robert Dargavel Smith	4a7e4cec38	Add condtional generation to AudioDiffusionPipeline (#1826 ) * Add condtional generation * add fast test for conditional audio generation	2023-01-03 14:09:14 +01:00
Patrick von Platen	b28ab30215	[Unclip] Make sure text_embeddings & image_embeddings can directly be passed to enable interpolation tasks. (#1858 ) * [Unclip] Make sure latents can be reused * allow one to directly pass embeddings * up * make unclip for text work * finish allowing to pass embeddings * correct more * make style	2022-12-30 12:18:19 +01:00
Patrick von Platen	29b2c93c90	Make repo structure consistent (#1862 ) * move files a bit * more refactors * fix more * more fixes * fix more onnx * make style * upload * fix * up * fix more * up again * up * small fix * Update src/diffusers/__init__.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * correct Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-30 11:51:08 +01:00
Patrick von Platen	03bf877bf4	[StableDiffusionInpaint] Correct test (#1859 )	2022-12-29 14:47:56 +01:00
Will Berman	53c8147afe	unCLIP image variation (#1781 ) * unCLIP image variation * remove prior comment re: @pcuenca * stable diffusion -> unCLIP re: @pcuenca * add copy froms re: @patil-suraj	2022-12-28 14:17:09 +01:00
Pedro Cuenca	df2b548e89	Make safety_checker optional in more pipelines (#1796 ) * Make safety_checker optional in more pipelines. * Remove inappropriate comment in inpaint pipeline. * InPaint Test: set feature_extractor to None. * Remove import * img2img test: set feature_extractor to None. * inpaint sd2 test: set feature_extractor to None. Co-authored-by: Suraj Patil <surajp815@gmail.com>	2022-12-25 21:58:45 +01:00
Anton Lozhkov	8331da4683	Bump to 0.12.0.dev0 (#1771 )	2022-12-19 18:44:08 +01:00
Anton Lozhkov	f1a32203aa	[Tests] Fix UnCLIP cpu offload tests (#1769 )	2022-12-19 18:25:08 +01:00
Patrick von Platen	ce1c27adc8	[Revision] Don't recommend using revision (#1764 )	2022-12-19 16:25:41 +01:00
Anton Lozhkov	c7b4acfb37	Add CPU offloading to UnCLIP (#1761 ) * Add CPU offloading to UnCLIP * use fp32 for testing the offload	2022-12-19 14:44:08 +01:00
Will Berman	830a9d1f01	[fix] pipeline_unclip generator (#1751 ) * [fix] pipeline_unclip generator pass generator to all schedulers * fix fast tests test data	2022-12-19 10:27:18 +01:00
Will Berman	2dcf64b72a	kakaobrain unCLIP (#1428 ) * [wip] attention block updates * [wip] unCLIP unet decoder and super res * [wip] unCLIP prior transformer * [wip] scheduler changes * [wip] text proj utility class * [wip] UnCLIPPipeline * [wip] kakaobrain unCLIP convert script * [unCLIP pipeline] fixes re: @patrickvonplaten remove callbacks move denoising loops into call function * UNCLIPScheduler re: @patrickvonplaten Revert changes to DDPMScheduler. Make UNCLIPScheduler, a modified DDPM scheduler with changes to support karlo * mask -> attention_mask re: @patrickvonplaten * [DDPMScheduler] remove leftover change * [docs] PriorTransformer * [docs] UNet2DConditionModel and UNet2DModel * [nit] UNCLIPScheduler -> UnCLIPScheduler matches existing unclip naming better * [docs] SchedulingUnCLIP * [docs] UnCLIPTextProjModel * refactor * finish licenses * rename all to attention_mask and prep in models * more renaming * don't expose unused configs * final renaming fixes * remove x attn mask when not necessary * configure kakao script to use new class embedding config * fix copies * [tests] UnCLIPScheduler * finish x attn * finish * remove more * rename condition blocks * clean more * Apply suggestions from code review * up * fix * [tests] UnCLIPPipelineFastTests * remove unused imports * [tests] UnCLIPPipelineIntegrationTests * correct * make style Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-18 15:15:30 -08:00
Anton Lozhkov	c2a38ef9df	Fix/update the LDM pipeline and tests (#1743 ) * Fix/update LDM tests * batched generators	2022-12-18 11:49:53 +01:00
Anton Lozhkov	086c7f9ea8	Nightly integration tests (#1664 ) * [WIP] Nightly integration tests * initial SD tests * update SD slow tests * style * repaint * ImageVariations * style * finish imgvar * img2img tests * debug * inpaint 1.5 * inpaint legacy * torch isn't happy about deterministic ops * allclose -> max diff for shorter logs * add SD2 * debug * Update tests/pipelines/stable_diffusion_2/test_stable_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * Update tests/pipelines/stable_diffusion/test_stable_diffusion.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * fix refs * Update src/diffusers/utils/testing_utils.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * fix refs * remove debug Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-16 18:51:11 +01:00
Patrick von Platen	c6d0dff4a3	Fix ldm tests on master by not running the CPU tests on GPU (#1729 )	2022-12-16 15:28:40 +01:00
Anton Lozhkov	a40095dd22	Fix ONNX img2img preprocessing and add fast tests coverage (#1727 ) * Fix ONNX img2img preprocessing and add fast tests coverage * revert * disable progressbars	2022-12-16 15:24:16 +01:00
Anton Lozhkov	13994b2d3f	RePaint fast tests and API conforming (#1701 ) * add fast tests * better tests and fp16 * batch fixes * Reuse preprocessing * quickfix	2022-12-15 18:35:31 +01:00
Patrick von Platen	244e16a7ab	[Version] Bump to 0.11.0.dev0 (#1682 ) upgrade version	2022-12-13 13:51:36 +01:00
Patrick von Platen	b345c74d4d	Make sure all pipelines can run with batched input (#1669 ) * [SD] Make sure batched input works correctly * uP * uP * up * up * uP * up * fix mask stuff * up * uP * more up * up * uP * up * finish * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-13 12:50:15 +01:00
Suraj Patil	5383188c7e	StableDiffusionDepth2ImgPipeline (#1531 ) * begin depth pipeline * add depth estimation model * fix prepare_depth_mask * add a comment about autocast * copied from, quality, cleanup * begin tests * handle tensors * norm image tensor * fix batch size * fix tests * fix enable_sequential_cpu_offload * fix save load * fix test_save_load_float16 * fix test_save_load_optional_components * fix test_float16_inference * fix test_cpu_offload_forward_pass * fix test_dict_tuple_outputs_equivalent * up * fix fast tests * fix test_stable_diffusion_img2img_multiple_init_images * fix few more fast tests * don't use device map for DPT * fix test_stable_diffusion_pipeline_with_sequential_cpu_offloading * accept external depth maps * prepare_depth_mask -> prepare_depth_map * fix file name * fix file name * quality * check transformers version * fix test names * use skipif * fix import * add docs * skip tests on mps * correct version * uP * Update docs/source/api/pipelines/stable_diffusion_2.mdx * fix fix-copies * fix fix-copies Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: anton- <anton@huggingface.co>	2022-12-08 18:25:12 +01:00
Patrick von Platen	a643c6300e	[K Diffusion] Add k diffusion sampler natively (#1603 ) * uP * uP	2022-12-08 12:48:37 +01:00
Anton Lozhkov	eb1abee693	[ONNX] Fix flaky tests (#1593 ) * [ONNX] Fix flaky tests * revert	2022-12-07 19:53:13 +01:00
Randolph-zeng	ca68ab3eef	Update scheduling_repaint.py (#1582 ) * Update scheduling_repaint.py * update the expected image Co-authored-by: anton- <anton@huggingface.co>	2022-12-07 17:41:07 +01:00
Suraj Patil	ced7c9601a	fix upcast in slice attention (#1591 ) * fix upcast in slice attention * fix dtype * add test * fix test	2022-12-07 15:14:34 +01:00
Anton Lozhkov	dc87f526d4	Fix common tests for FP16 (#1588 ) * Fix common tests for FP16 * revert	2022-12-07 14:09:51 +01:00
Patrick von Platen	896c98a2ae	Add paint by example (#1533 ) * add paint by example * mkae loading possibel * up * Update src/diffusers/models/attention.py * up * finalize weight structure * make example work * make it work * up * up * fix * del * add * update * Apply suggestions from code review * correct transformer 2d * finish * up * up * up * up * fix * Apply suggestions from code review Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * Apply suggestions from code review * up * finish Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-07 11:06:30 +01:00
Anton Lozhkov	02d83c9ff1	Standardize fast pipeline tests with PipelineTestMixin (#1526 ) * [WIP] Standardize fast pipeline tests with PipelineTestMixin * refactor the sd tests a bit * add more common tests * add xformers * add progressbar test * cleanup * upd fp16 * CycleDiffusionPipelineFastTests * DanceDiffusionPipelineFastTests * AltDiffusionPipelineFastTests * StableDiffusion2PipelineFastTests * StableDiffusion2InpaintPipelineFastTests * StableDiffusionImageVariationPipelineFastTests * StableDiffusionImg2ImgPipelineFastTests * StableDiffusionInpaintPipelineFastTests * remove unused mixins * quality * add missing inits * try to fix mps tests * fix mps tests * add mps warmups * skip for some pipelines * style * Update tests/test_pipelines_common.py Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-06 18:35:30 +01:00
Robert Dargavel Smith	48d0123f0f	add AudioDiffusionPipeline and LatentAudioDiffusionPipeline #1334 (#1426 ) * add AudioDiffusionPipeline and LatentAudioDiffusionPipeline * add docs to toc * fix tests * fix tests * fix tests * fix tests * fix tests * Update pr_tests.yml Fix tests * parent `499ff34b3e` author teticio <teticio@gmail.com> 1668765652 +0000 committer teticio <teticio@gmail.com> 1669041721 +0000 parent `499ff34b3e` author teticio <teticio@gmail.com> 1668765652 +0000 committer teticio <teticio@gmail.com> 1669041704 +0000 add colab notebook [Flax] Fix loading scheduler from subfolder (#1319) [FLAX] Fix loading scheduler from subfolder Fix/Enable all schedulers for in-painting (#1331) * inpaint fix k lms * onnox as well * up Correct path to schedlure (#1322) * [Examples] Correct path * uP Avoid nested fix-copies (#1332) * Avoid nested `# Copied from` statements during `make fix-copies` * style Fix img2img speed with LMS-Discrete Scheduler (#896) Casting `self.sigmas` into a different dtype (the one of original_samples) is not advisable. In my img2img pipeline this leads to a long running time in the `integrate.quad` call later on- by long I mean more than 10x slower. Co-authored-by: Anton Lozhkov <anton@huggingface.co> Fix the order of casts for onnx inpainting (#1338) Legacy Inpainting Pipeline for Onnx Models (#1237) * Add legacy inpainting pipeline compatibility for onnx * remove commented out line * Add onnx legacy inpainting test * Fix slow decorators * pep8 styling * isort styling * dummy object * ordering consistency * style * docstring styles * Refactor common prompt encoding pattern * Update tests to permanent repository home * support all available schedulers until ONNX IO binding is available Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> * updated styling from PR suggested feedback Co-authored-by: Anton Lozhkov <aglozhkov@gmail.com> Jax infer support negative prompt (#1337) * support negative prompts in sd jax pipeline * pass batched neg_prompt * only encode when negative prompt is None Co-authored-by: Juan Acevedo <jfacevedo@google.com> Update README.md: Minor change to Imagic code snippet, missing dir error (#1347) Minor change to Imagic Readme Missing dir causes an error when running the example code. make style change the sample model (#1352) * Update alt_diffusion.mdx * Update alt_diffusion.mdx Add bit diffusion [WIP] (#971) * Create bit_diffusion.py Bit diffusion based on the paper, arXiv:2208.04202, Chen2022AnalogBG * adding bit diffusion to new branch ran tests * tests * tests * tests * tests * removed test folders + added to README * Update README.md Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> * move Mel to module in pipeline construction, make librosa optional * fix imports * fix copy & paste error in comment * fix style * add missing register_to_config * fix class docstrings * fix class docstrings * tweak docstrings * tweak docstrings * update slow test * put trailing commas back * respect alphabetical order * remove LatentAudioDiffusion, make vqvae optional * move Mel from models back to pipelines :-) * allow loading of pretrained audiodiffusion models * fix tests * fix dummies * remove reference to latent_audio_diffusion in docs * unused import * inherit from SchedulerMixin to make loadable * Apply suggestions from code review * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2022-12-05 18:06:30 +01:00
Patrick von Platen	7932971542	[Upscaling] Fix batch size (#1525 )	2022-12-05 13:28:55 +01:00
Patrick von Platen	513fc68104	[Stable Diffusion Inpaint] Allow tensor as input image & mask (#1527 ) up	2022-12-05 12:18:02 +01:00
Anton Lozhkov	cc22bda5f6	[CI] Add slow MPS tests (#1104 ) * [CI] Add slow MPS tests * fix yml * temporarily resolve caching * Tests: fix mps crashes. * Skip test_load_pipeline_from_git on mps. Not compatible with float16. * Increase tolerance, use CPU generator, alt. slices. * Move to nightly * style Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2022-12-05 11:50:24 +01:00
Patrick von Platen	cf4664e885	fix tests	2022-12-02 17:27:58 +00:00
fboulnois	52eb0348e5	Standardize on using `image` argument in all pipelines (#1361 ) * feat: switch core pipelines to use image arg * test: update tests for core pipelines * feat: switch examples to use image arg * docs: update docs to use image arg * style: format code using black and doc-builder * fix: deprecate use of init_image in all pipelines	2022-12-01 16:55:22 +01:00

... 13 14 15 16 17

815 Commits