diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Fabio Rigano	a0cf607667	Multi-image masking for single IP Adapter (#7499 ) * Support multiimage masking --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-09 09:20:57 -10:00
YiYi Xu	a341b536a8	disable test_conversion_when_using_device_map (#7620 ) * disable test * update --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-04-09 09:01:19 -10:00
Christopher Beckham	8e46d97cd8	Add missing restore() EMA call in train SDXL script (#7599 ) * Restore unet params back to normal from EMA when validation call is finished * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-09 18:07:55 +05:30
Junjie	7e808e768a	[Docs] fix bugs in callback docs (#7594 )	2024-04-08 08:46:30 -10:00
w4ffl35	7e39516627	Allow more arguments to be passed to convert_from_ckpt (#7222 ) Allow safety and feature extractor arguments to be passed to convert_from_ckpt Allows management of safety checker and feature extractor from outside of the convert ckpt class. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-08 10:13:48 +05:30
Nguyễn Công Tú Anh	56a76082ed	Add AudioLDM2 TTS (#5381 ) * add audioldm2 tts * change gpt2 max new tokens * remove unnecessary pipeline and class * add TTS to AudioLDM2Pipeline * add TTS docs * delete unnecessary file * remove unnecessary import * add audioldm2 slow testcase * fix code quality * remove AudioLDMLearnablePositionalEmbedding * add variable check vits encoder * add use_learned_position_embedding --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-08 10:11:24 +05:30
YiYi Xu	6133d98ff7	[IF\| add set_begin_index for all IF pipelines (#7577 ) add set_begin_index for all if pipelines	2024-04-05 06:54:07 -10:00
Sayak Paul	1c60e094de	[Tests] reduce block sizes of UNet and VAE tests (#7560 ) * reduce block sizes for unet1d. * reduce blocks for unet_2d. * reduce block size for unet_motion * increase channels. * correctly increase channels. * reduce number of layers in unet2dconditionmodel tests. * reduce block sizes for unet2dconditionmodel tests * reduce block sizes for unet3dconditionmodel. * fix: test_feed_forward_chunking * fix: test_forward_with_norm_groups * skip spatiotemporal tests on MPS. * reduce block size in AutoencoderKL. * reduce block sizes for vqmodel. * further reduce block size. * make style. * Empty-Commit * reduce sizes for ConsistencyDecoderVAETests * further reduction. * further block reductions in AutoencoderKL and AssymetricAutoencoderKL. * massively reduce the block size in unet2dcontionmodel. * reduce sizes for unet3d * fix tests in unet3d. * reduce blocks further in motion unet. * fix: output shape * add attention_head_dim to the test configuration. * remove unexpected keyword arg * up a bit. * groups. * up again * fix	2024-04-05 10:08:32 +05:30
UmerHA	71f49a5d2a	Skip `test_freeu_enabled` on MPS (#7570 ) * Skip `test_freeu_enabled ` on MPS * Small fixes - import skip_mps correctly - disable all instances of test_freeu_enabled * Empty commit to trigger tests * Empty commit to trigger CI	2024-04-04 12:16:04 +02:00
Abhinav Gopal	35db2fdea9	Update pipeline_animatediff_video2video.py (#7457 ) * Update pipeline_animatediff_video2video.py * commit with test for whether latent input can be passed into animatediffvid2vid	2024-04-03 19:34:28 +05:30
Sayak Paul	ad55ce6100	[Chore] increase number of workers for the tests. (#7558 ) * increase number of workers for the tests. * move to beefier runner. * improve the fast push tests too. * use a beefy machine for pytorch pipeline tests * up the number of workers further.	2024-04-03 17:11:42 +05:30
Sayak Paul	a9a5b14f35	[Core] refactor transformers 2d into multiple init variants. (#7491 ) * refactor transformers 2d into multiple legacy variants. * fix: init. * fix recursive init. * add inits. * make transformer block creation more modular. * complete refactor. * remove forward * debug * remove legacy blocks and refactor within the module itself. * remove print * guard caption projection * remove fetcher. * reduce the number of args. * fix: norm_type * group variables that are shared. * remove _get_transformer_blocks * harmonize the init function signatures. * transformer_blocks to common * repeat .	2024-04-03 12:56:17 +05:30
Beinsezii	aa19025989	UniPC Multistep add `rescale_betas_zero_snr` (#7531 ) * UniPC Multistep add `rescale_betas_zero_snr` Same patch as DPM and Euler with the patched final alpha cumprod BF16 doesn't seem to break down, I think cause UniPC upcasts during some phases already? We could still force an upcast since it only loses ≈ 0.005 it/s for me but the difference in output is very small. A better endeavor might upcasting in step() and removing all the other upcasts elsewhere? * UniPC ZSNR UT * Re-add `rescale_betas_zsnr` doc oops	2024-04-02 17:23:55 -10:00
Beinsezii	19ab04ff56	UniPC Multistep fix tensor dtype/device on order=3 (#7532 ) * UniPC UTs iterate solvers on FP16 It wasn't catching errs on order==3. Might be excessive? * UniPC Multistep fix tensor dtype/device on order=3 * UniPC UTs Add v_pred to fp16 test iter For completions sake. Probably overkill?	2024-04-02 15:41:29 -10:00
Sayak Paul	4a34307702	add: utility to format our docs too 📜 (#7314 ) * add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.	2024-04-02 20:49:43 +05:30
Bagheera	8e963d1c2a	7529 do not disable autocast for cuda devices (#7530 ) * 7529 do not disable autocast for cuda devices * Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue * add autocast fix to other training examples * disable native_amp for dreambooth (sdxl) * disable native_amp for pix2pix (sdxl) * remove tests from remaining files * disable native_amp on huggingface accelerator for every training example that uses it * convert more usages of autocast to nullcontext, make style fixes * make style fixes * style. * Empty-Commit --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-02 20:15:06 +05:30
Sayak Paul	2b04ec2ff7	[Tests] Speed up fast pipelines part II (#7521 ) * start printing the tensors. * print full throttle * set static slices for 7 tests. * remove printing. * flatten * disable test for controlnet * what happens when things are seeded properly? * set the right value * style./ * make pia test fail to check things * print. * fix pia. * checking for animatediff. * fix: animatediff. * video synthesis * final piece. * style. * print guess. * fix: assertion for control guess. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-02 13:24:56 +05:30
Sayak Paul	000fa82a1e	[Chore] remove class assignments for linear and conv. (#7553 ) * remove class assignments for linear and conv. * fix: self.nn	2024-04-02 13:01:04 +05:30
Sayak Paul	5d83f50c23	[Release tests] make nightly workflow dispatchable. (#7541 ) * make nightly workflow dispatchable. * add a note about running the release tests to setup.py	2024-04-02 12:21:17 +05:30
Dhruv Nair	5d21d4a204	Fix FreeU tests (#7540 ) update	2024-04-02 11:05:50 +05:30
Álvaro Somoza	73ba81090e	[Community pipeline] SDXL Differential Diffusion Img2Img Pipeline (#7550 ) * initial-commit pipeline created * updated README.md	2024-04-01 18:15:30 -10:00
YiYi Xu	7956c36aaa	add a `from_pipe` method to `DiffusionPipeline` (#7241 ) * add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 13:02:00 -10:00
haikmanukyan	5266ab7935	add HD-Painter pipeline (#7520 ) * add HD-Painter pipeline * style fixing * refactor, change doc, fix ruff * fix docs * used correct ruff version --------- Co-authored-by: Hayk Manukyan <youremail@yourdomain.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-01 15:10:44 +05:30
YiYi Xu	7f724a930e	fix the cpu offload tests (#7544 ) fix	2024-04-01 14:27:14 +05:30
Jianbing Wu	9bef9f4be7	Fix SVD bug (shape of `time_context`) (#7268 ) * Fix SVD bug (shape of `time_context`) * Formatting code * Formatting src/diffusers/models/transformers/transformer_temporal.py by `make style && make quality` --------- Co-authored-by: kevinkhwu <kevinkhwu@tencent.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 14:05:52 +05:30
Dhruv Nair	7aa4514260	Fix typo in CPU offload test (#7542 ) update	2024-03-31 22:07:17 -10:00
Bingxin Ke	c2e87869be	[Community pipeline] Marigold depth estimation update -- align with marigold v0.1.5 (#7524 ) * add resample option; check denoise_step; update ckpt path * Add seeding in pipeline to increase reproducibility * fix typo * fix typo	2024-03-30 07:09:02 -10:00
Stephen	ca61287daa	Fix IP Adapter Support for SAG Pipeline (#7260 ) * fix ip adapter support * Update sag pipelines tests, adjust sag pipeline to pass tests --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-03-30 06:15:29 -10:00
Beinsezii	f0c81562a4	Add `final_sigma_zero` to UniPCMultistep (#7517 ) * Add `final_sigma_zero` to UniPCMultistep Effectively the same trick as DDIM's `set_alpha_to_one` and DPM's `final_sigma_type='zero'`. Currently False by default but maybe this should be True? * `final_sigma_zero: bool` -> `final_sigmas_type: str` Should 1:1 match DPM Multistep now. * Set `final_sigmas_type='sigma_min'` in UniPC UTs	2024-03-29 22:23:45 -10:00
Hyoungwon Cho	9d20ed37a2	Perturbed-Attention Guidance (#7512 ) * pag_initial * pag_docs * edit_docs * custom * typo * delete_docs * whitespace * make style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-30 10:52:51 +05:30
Linoy Tsaban	bda1d4faf8	add Instant id sdxl image2image pipeline (#7507 ) * initial commit - instantid img2img * adapting to img2img * change add_time_ids * change add_time_ids * WIP changes * add strength to timesteps * check insightface import * style * check insightface import changed to warning * check insightface import changed to warning * style --------- Co-authored-by: apolinário <joaopaulo.passos@gmail.com>	2024-03-30 10:25:21 +05:30
UmerHA	77103d71ca	Quick-Fix for #7352 block-lora (#7523 ) Fixed important typo	2024-03-30 06:42:28 +05:30
UmerHA	0302446819	Implements Blockwise lora (#7352 ) * Initial commit * Implemented block lora - implemented block lora - updated docs - added tests * Finishing up * Reverted unrelated changes made by make style * Fixed typo * Fixed bug + Made text_encoder_2 scalable * Integrated some review feedback * Incorporated review feedback * Fix tests * Made every module configurable * Adapter to new lora test structure * Final cleanup * Some more final fixes - Included examples in `using_peft_for_inference.md` - Added hint that only attns are scaled - Removed NoneTypes - Added test to check mismatching lens of adapter names / weights raise error * Update using_peft_for_inference.md * Update using_peft_for_inference.md * Make style, quality, fix-copies * Updated tutorial;Warning if scale/adapter mismatch * floats are forwarded as-is; changed tutorial scale * make style, quality, fix-copies * Fixed typo in tutorial * Moved some warnings into `lora_loader_utils.py` * Moved scale/lora mismatch warnings back * Integrated final review suggestions * Empty commit to trigger CI * Reverted emoty commit to trigger CI --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-29 21:15:57 +05:30
Dhruv Nair	4d39b7483d	Memory clean up on all Slow Tests (#7514 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-29 14:23:28 +05:30
Sayak Paul	fac761694a	[Tests] Speed up some fast pipeline tests (#7477 ) * speed up test_vae_slicing in animatediff * speed up test_karras_schedulers_shape for attend and excite. * style. * get the static slices out. * specify torch print options. * modify * test run with controlnet * specify kwarg * fix: things * not None * flatten * controlnet img2img * complete controlet sd * finish more * finish more * finish more * finish more * finish the final batch * add cpu check for expected_pipe_slice. * finish the rest * remove print * style * fix ssd1b controlnet test * checking ssd1b * disable the test. * make the test_ip_adapter_single controlnet test more robust * fix: simple inpaint * multi * disable panorama * enable again * panorama is shaky so leave it for now * remove print * raise tolerance.	2024-03-29 14:11:38 +05:30
YiYi Xu	34c90dbb31	fix OOM for test_vae_tiling (#7510 ) use float16 and add torch.no_grad()	2024-03-29 08:22:39 +05:30
Lvkesheng Shen	e49c04d5d6	Bug fix for controlnetpipeline check_image (#7103 ) * Bug fix for controlnetpipeline check_image Bug fix for controlnetpipeline check_image when using multicontrolnet and prompt list * Update test_inference_multiple_prompt_input function * Update test_controlnet.py add test for multiple prompts and multiple image conditioning * Update test_controlnet.py Fix format error --------- Co-authored-by: Lvkesheng Shen <45848260+Fantast416@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-28 08:25:18 -10:00
YiYi Xu	f238cb0736	cpu_offload: remove all hooks before offload (#7448 ) * add remove_all_hooks * a few more fix and tests * up * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * split tests * add --------- Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2024-03-28 08:23:02 -10:00
Bagheera	d78acdedc1	apple mps: training support for SDXL (ControlNet, LoRA, Dreambooth, T2I) (#7447 ) * apple mps: training support for SDXL LoRA * sdxl: support training lora, dreambooth, t2i, pix2pix, and controlnet on apple mps --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-28 14:26:18 +05:30
Sayak Paul	6df103deba	add: a helpful message when quality and repo consistency checks fail. (#7475 )	2024-03-28 13:51:56 +05:30
Sayak Paul	73f28708be	Improve nightly tests (#7385 ) * flesh out the nightly tests * address feedback.	2024-03-28 13:26:34 +05:30
Sayak Paul	0cbc78f04c	[Modeling utils chore] import load_model_dict_into_meta only once (#7437 ) import load_model_dict_into_meta only once	2024-03-28 13:01:53 +05:30
Thomas Liang	0cc5630945	[Chore] Fix Colab notebook links in README.md (#7495 )	2024-03-27 12:36:36 -10:00
UmerHA	0b8e29289d	Skip `test_lora_fuse_nan` on mps (#7481 ) Skipping test_lora_fuse_nan on mps Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-27 14:35:59 +05:30
Sayak Paul	ab38ddf64f	[chore] make the istructions on fetching all commits clearer. (#7474 ) * make the istructions on fetching all commits clearer. * Update setup.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-03-27 08:16:46 +05:30
YiYi Xu	ead82fedea	fix torch.compile for multi-controlnet of sdxl inpaint (#7476 ) fix Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-27 08:08:32 +05:30
Disty0	45b42d1203	Add device arg to offloading with combined pipelines (#7471 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-26 13:45:16 -10:00
Long(Tony) Lian	5199ee4f7b	Fix missing raise statements in check_inputs (#7473 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-26 13:34:28 -10:00
Bagheera	544710ef0f	diffusers#7426 fix stable diffusion xl inference on MPS when dtypes shift unexpectedly due to pytorch bugs (#7446 ) * mps: fix XL pipeline inference at training time due to upstream pytorch bug * Update src/diffusers/pipelines/stable_diffusion_xl/pipeline_stable_diffusion_xl.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * apply the safe-guarding logic elsewhere. --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-26 20:05:49 +05:30
M. Tolga Cangöz	443aa14e41	Fix Tiling in `ConsistencyDecoderVAE` (#7290 ) * Fix typos * Add docstring to `decode` method in `ConsistencyDecoderVAE` * Fix tiling * Enable tiled VAE decoding with customizable tile sample size and overlap factor * Revert "Enable tiled VAE decoding with customizable tile sample size and overlap factor" This reverts commit `181049675e`. * Add VAE tiling test for `ConsistencyDecoderVAE` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-03-26 17:59:08 +05:30

1 2 3 4 5 ...

3962 Commits