diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
UmerHA	fda1531d8a	Fixing implementation of ControlNet-XS (#6772 ) * CheckIn - created DownSubBlocks * Added extra channels, implemented subblock fwd * Fixed connection sizes * checkin * Removed iter, next in forward * Models for SD21 & SDXL run through * Added back pipelines, cleared up connections * Cleaned up connection creation * added debug logs * updated logs * logs: added input loading * Update umer_debug_logger.py * log: Loading hint * Update umer_debug_logger.py * added logs * Changed debug logging * debug: added more logs * Fixed num_norm_groups * Debug: Logging all of SDXL input * Update umer_debug_logger.py * debug: updated logs * checkim * Readded tests * Removed debug logs * Fixed Slow Tests * Added value ckecks \| Updated model_cpu_offload_seq * accelerate-offloading works ; fast tests work * Made unet & addon explicit in controlnet * Updated slow tests * Added dtype/device to ControlNetXS * Filled in test model paths * Added image_encoder/feature_extractor to XL pipe * Fixed fast tests * Added comments and docstrings * Fixed copies * Added docs ; Updates slow tests * Moved changes to UNetMidBlock2DCrossAttn * tiny cleanups * Removed stray prints * Removed ip adapters + freeU - Removed ip adapters + freeU as they don't make sense for ControlNet-XS - Fixed imports of UNet components * Fixed test_save_load_float16 * Make style, quality, fix-copies * Changed loading/saving API for ControlNetXS - Changed loading/saving API for ControlNetXS - other small fixes * Removed ControlNet-XS from research examples * Make style, quality, fix-copies * Small fixes - deleted ControlNetXSModel.init_original - added time_embedding_mix to StableDiffusionControlNetXSPipeline .from_pretrained / StableDiffusionXLControlNetXSPipeline.from_pretrained - fixed copy hints * checkin May 11 '23 * CheckIn Mar 12 '24 * Fixed tests for SD * Added tests for UNetControlNetXSModel * Fixed SDXL tests * cleanup * Delete Pipfile * CheckIn Mar 20 Started replacing sub blocks by `ControlNetXSCrossAttnDownBlock2D` and `ControlNetXSCrossAttnUplock2D` * check-in Mar 23 * checkin 24 Mar * Created init for UNetCnxs and CnxsAddon * CheckIn * Made from_modules, from_unet and no_control work * make style,quality,fix-copies & small changes * Fixed freezing * Added gradient ckpt'ing; fixed tests * Fix slow tests(+compile) ; clear naming confusion * Don't create UNet in init ; removed class_emb * Incorporated review feedback - Deleted get_base_pipeline / get_controlnet_addon for pipes - Pipes inherit from StableDiffusionXLPipeline - Made module dicts for cnxs-addon's down/mid/up classes - Added support for qkv fusion and freeU * Make style, quality, fix-copies * Implemented review feedback * Removed compatibility check for vae/ctrl embedding * make style, quality, fix-copies * Delete Pipfile * Integrated review feedback - Importing ControlNetConditioningEmbedding now - get_down/mid/up_block_addon now outside class - renamed `do_control` to `apply_control` * Reduced size of test tensors For this, added `norm_num_groups` as parameter everywhere * Renamed cnxs-`Addon` to cnxs-`Adapter` - `ControlNetXSAddon` -> `ControlNetXSAdapter` - `ControlNetXSAddonDownBlockComponents` -> `DownBlockControlNetXSAdapter`, and similarly for mid/up - `get_mid_block_addon` -> `get_mid_block_adapter`, and similarly for mid/up * Fixed save_pretrained/from_pretrained bug * Removed redundant code --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-16 21:56:20 +05:30
Sayak Paul	cf6e0407e0	don't install peft from the source with uv for now. (#7679 )	2024-04-15 09:33:02 +05:30
Sayak Paul	1c000d46e1	fix: metadata token (#7631 )	2024-04-15 08:32:27 +05:30
Sayak Paul	08bf754507	make docker-buildx mandatory. (#7652 )	2024-04-13 07:26:34 +05:30
kabachuha	2f23437618	Add (Scheduled) Pseudo-Huber Loss training scripts to research projects (#7527 ) * add scheduled pseudo-huber loss training scripts See #7488 * add reduction modes to huber loss * [DB Lora] 2 multiplier to huber loss cause of 1/2 a^2 conv. pairing of `c6495def1f` [DB Lora] add option for smooth l1 (huber / delta) Pairing of `dd22958caa` * [DB Lora] unify huber scheduling Pairing of `19a834c3ab` * [DB Lora] add snr huber scheduler Pairing of `47fb1a6854` * fixup examples link * use snr schedule by default in DB * update all huber scripts with snr * code quality * huber: make style && make quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-13 07:26:08 +05:30
Benjamin Bossan	2523390c26	FIX Setting device for DoRA parameters (#7655 ) Fix a bug that causes the the call to set_lora_device to ignore the DoRA parameters.	2024-04-12 13:55:46 +02:00
Sai-Suraj-27	279de3c3ff	fix: Replaced deprecated `logger.warn` with `logger.warning` (#7643 ) Fixed deprecated logger.warn with logger.warning.	2024-04-11 09:43:01 -10:00
Yiqin Zhao	8e14535708	Fixed YAML loading. (#7579 )	2024-04-11 09:08:42 -10:00
dg845	0bee4d336b	LCM Distill Scripts Fix Bug when Initializing Target U-Net (#6848 ) * Initialize target_unet from unet rather than teacher_unet so that we correctly add time_embedding.cond_proj if necessary. * Use UNet2DConditionModel.from_config to initialize target_unet from unet's config. --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-11 07:52:12 -10:00
Steven Munn	42f25d601a	Skip PEFT LoRA Scaling if the scale is 1.0 (#7576 ) * Skip scaling if scale is identity * move check for weight one to scale and unscale lora * fix code style/quality * Empty-Commit --------- Co-authored-by: Steven Munn <stevenjmunn@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Munn <5297082+stevenjlm@users.noreply.github.com>	2024-04-11 11:02:31 +05:30
Sayak Paul	33c5d125cb	[Core] fix img2img pipeline for Playground (#7627 ) * playground vae encoding should use std and mean of the vae. * style. * fix-copies.	2024-04-11 09:07:38 +05:30
YiYi Xu	aa1f00fd01	Fix cpu offload related slow tests (#7618 ) * fix * up --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-04-10 14:53:45 -10:00
Steven Liu	d95b993427	[docs] T2I (#7623 ) * refactor t2i * add code snippets	2024-04-10 17:10:41 -07:00
Steven Liu	1d480298c1	[docs] Prompt enhancer (#7565 ) * prompt enhance * edits * align titles * feedback * feedback * feedback * link to style	2024-04-10 16:09:06 -07:00
Sayak Paul	b2323aa2b7	[Tests] reduce the model sizes in the SD fast tests (#7580 ) * give it a shot. * print. * correct assertion. * gather results from the rest of the tests. * change the assertion values where needed. * remove print statements.	2024-04-10 11:36:28 -10:00
satani99	37e9d695af	Modularize instruct_pix2pix SD inferencing during and after training in examples (#7603 ) * Modularize instruct_pix2pix code * quality check * quality check --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-10 11:19:16 +05:30
Sayak Paul	a402431de0	[docs] remove duplicate tip block. (#7625 ) remove duplicate tip block.	2024-04-10 10:31:11 +05:30
IDKiro	b99b1617cf	add the option of upsample function for tiny vae (#7604 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-10 09:27:39 +05:30
Sayak Paul	3e4a6bd2d4	[Core] add "balanced" `device_map` support to pipelines (#6857 ) * get device <-> component mapping when using multiple gpus. * condition the device_map bits. * relax condition * device_map progress. * device_map enhancement * some cleaning up and debugging * Apply suggestions from code review Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> * incorporate suggestions from PR. * remove multi-gpu condition for now. * guard check the component -> device mapping * fix: device_memory variable * dispatching transformers model to have force_hooks=True * better guarding for transformers device_map * introduce support balanced_low_memory and balanced_ultra_low_memory. * remove device_map patch. * fix: intermediate variable scoping. * fix: condition in cpu offload. * fix: flax class restrictions. * remove modifications from cpu_offload and model_offload * incorporate changes. * add a simple forward pass test * add: torch_device in get_inputs() * add: tests * remove print * safe-guard to(), model offloading and cpu offloading when balanced is used as a device_map. * style * remove . * safeguard device_map with more checks and remove invalid device_mapping strategues. * make a class attribute and adjust tests accordingly. * fix device_map check * fix test * adjust comment * fix: device_map attribute * fix: dispatching. * max_memory test for pipeline * version guard the tests * fix guard. * address review feedback. * reset_device_map method. * add: test for reset_hf_device_map * fix a couple things. * add reset_device_map() in the error message. * add tests for checking reset_device_map doesn't have unintended consequences. * fix reset_device_map and offloading tests. * create _get_final_device_map utility. * hf_device_map -> _hf_device_map * add documentation * add notes suggested by Marc. * styling. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co> * move updates within gpu condition. * other docs related things * note on ignore a device not specified in . * provide a suggestion if device mapping errors out. * fix: typo. * _hf_device_map -> hf_device_map * Empty-Commit * add: example hf_device_map. --------- Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Pedro Cuenca <pedro@huggingface.co>	2024-04-10 08:59:05 +05:30
Sayak Paul	c827e94da0	[Workflows] remove installation of `libsndfile1-dev` and `libgl1` from workflows (#7543 ) * remove libsndfile1-dev and libgl1 from workflows and ensure that re present in the respective dockerfiles. * change to self-hosted runner; let's see 🤞 * add libsndfile1-dev libgl1 for now * use self-hosted runners for building and push too.	2024-04-10 08:34:56 +05:30
Sayak Paul	44f6b859bf	[Core] refactor `transformer_2d` forward logic into meaningful conditions. (#7489 ) * refactor transformer_2d forward logic into meaningful conditions. * Empty-Commit * fix: _operate_on_patched_inputs * fix: _operate_on_patched_inputs * check * fix: patch output computation block. * fix: _operate_on_patched_inputs. * remove print. * move operations to blocks. * more readability neats. * empty commit * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Revert "Apply suggestions from code review" This reverts commit `12178b1aa0`. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-10 08:33:19 +05:30
Sayak Paul	ac7ff7d4a3	add utilities for updating diffusers pipeline metadata. (#7573 ) * add utilities for updating diffusers pipeline metadata. * style * remove first empty line	2024-04-10 08:28:49 +05:30
Fabio Rigano	a0cf607667	Multi-image masking for single IP Adapter (#7499 ) * Support multiimage masking --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-04-09 09:20:57 -10:00
YiYi Xu	a341b536a8	disable test_conversion_when_using_device_map (#7620 ) * disable test * update --------- Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-04-09 09:01:19 -10:00
Christopher Beckham	8e46d97cd8	Add missing restore() EMA call in train SDXL script (#7599 ) * Restore unet params back to normal from EMA when validation call is finished * empty commit --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-09 18:07:55 +05:30
Junjie	7e808e768a	[Docs] fix bugs in callback docs (#7594 )	2024-04-08 08:46:30 -10:00
w4ffl35	7e39516627	Allow more arguments to be passed to convert_from_ckpt (#7222 ) Allow safety and feature extractor arguments to be passed to convert_from_ckpt Allows management of safety checker and feature extractor from outside of the convert ckpt class. Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-08 10:13:48 +05:30
Nguyễn Công Tú Anh	56a76082ed	Add AudioLDM2 TTS (#5381 ) * add audioldm2 tts * change gpt2 max new tokens * remove unnecessary pipeline and class * add TTS to AudioLDM2Pipeline * add TTS docs * delete unnecessary file * remove unnecessary import * add audioldm2 slow testcase * fix code quality * remove AudioLDMLearnablePositionalEmbedding * add variable check vits encoder * add use_learned_position_embedding --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-08 10:11:24 +05:30
YiYi Xu	6133d98ff7	[IF\| add set_begin_index for all IF pipelines (#7577 ) add set_begin_index for all if pipelines	2024-04-05 06:54:07 -10:00
Sayak Paul	1c60e094de	[Tests] reduce block sizes of UNet and VAE tests (#7560 ) * reduce block sizes for unet1d. * reduce blocks for unet_2d. * reduce block size for unet_motion * increase channels. * correctly increase channels. * reduce number of layers in unet2dconditionmodel tests. * reduce block sizes for unet2dconditionmodel tests * reduce block sizes for unet3dconditionmodel. * fix: test_feed_forward_chunking * fix: test_forward_with_norm_groups * skip spatiotemporal tests on MPS. * reduce block size in AutoencoderKL. * reduce block sizes for vqmodel. * further reduce block size. * make style. * Empty-Commit * reduce sizes for ConsistencyDecoderVAETests * further reduction. * further block reductions in AutoencoderKL and AssymetricAutoencoderKL. * massively reduce the block size in unet2dcontionmodel. * reduce sizes for unet3d * fix tests in unet3d. * reduce blocks further in motion unet. * fix: output shape * add attention_head_dim to the test configuration. * remove unexpected keyword arg * up a bit. * groups. * up again * fix	2024-04-05 10:08:32 +05:30
UmerHA	71f49a5d2a	Skip `test_freeu_enabled` on MPS (#7570 ) * Skip `test_freeu_enabled ` on MPS * Small fixes - import skip_mps correctly - disable all instances of test_freeu_enabled * Empty commit to trigger tests * Empty commit to trigger CI	2024-04-04 12:16:04 +02:00
Abhinav Gopal	35db2fdea9	Update pipeline_animatediff_video2video.py (#7457 ) * Update pipeline_animatediff_video2video.py * commit with test for whether latent input can be passed into animatediffvid2vid	2024-04-03 19:34:28 +05:30
Sayak Paul	ad55ce6100	[Chore] increase number of workers for the tests. (#7558 ) * increase number of workers for the tests. * move to beefier runner. * improve the fast push tests too. * use a beefy machine for pytorch pipeline tests * up the number of workers further.	2024-04-03 17:11:42 +05:30
Sayak Paul	a9a5b14f35	[Core] refactor transformers 2d into multiple init variants. (#7491 ) * refactor transformers 2d into multiple legacy variants. * fix: init. * fix recursive init. * add inits. * make transformer block creation more modular. * complete refactor. * remove forward * debug * remove legacy blocks and refactor within the module itself. * remove print * guard caption projection * remove fetcher. * reduce the number of args. * fix: norm_type * group variables that are shared. * remove _get_transformer_blocks * harmonize the init function signatures. * transformer_blocks to common * repeat .	2024-04-03 12:56:17 +05:30
Beinsezii	aa19025989	UniPC Multistep add `rescale_betas_zero_snr` (#7531 ) * UniPC Multistep add `rescale_betas_zero_snr` Same patch as DPM and Euler with the patched final alpha cumprod BF16 doesn't seem to break down, I think cause UniPC upcasts during some phases already? We could still force an upcast since it only loses ≈ 0.005 it/s for me but the difference in output is very small. A better endeavor might upcasting in step() and removing all the other upcasts elsewhere? * UniPC ZSNR UT * Re-add `rescale_betas_zsnr` doc oops	2024-04-02 17:23:55 -10:00
Beinsezii	19ab04ff56	UniPC Multistep fix tensor dtype/device on order=3 (#7532 ) * UniPC UTs iterate solvers on FP16 It wasn't catching errs on order==3. Might be excessive? * UniPC Multistep fix tensor dtype/device on order=3 * UniPC UTs Add v_pred to fp16 test iter For completions sake. Probably overkill?	2024-04-02 15:41:29 -10:00
Sayak Paul	4a34307702	add: utility to format our docs too 📜 (#7314 ) * add: utility to format our docs too 📜 * debugging saga * fix: message * checking * should be fixed. * revert pipeline_fixture * remove empty line * make style * fix: setup.py * style.	2024-04-02 20:49:43 +05:30
Bagheera	8e963d1c2a	7529 do not disable autocast for cuda devices (#7530 ) * 7529 do not disable autocast for cuda devices * Remove typecasting error check for non-mps platforms, as a correct autocast implementation makes it a non-issue * add autocast fix to other training examples * disable native_amp for dreambooth (sdxl) * disable native_amp for pix2pix (sdxl) * remove tests from remaining files * disable native_amp on huggingface accelerator for every training example that uses it * convert more usages of autocast to nullcontext, make style fixes * make style fixes * style. * Empty-Commit --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-02 20:15:06 +05:30
Sayak Paul	2b04ec2ff7	[Tests] Speed up fast pipelines part II (#7521 ) * start printing the tensors. * print full throttle * set static slices for 7 tests. * remove printing. * flatten * disable test for controlnet * what happens when things are seeded properly? * set the right value * style./ * make pia test fail to check things * print. * fix pia. * checking for animatediff. * fix: animatediff. * video synthesis * final piece. * style. * print guess. * fix: assertion for control guess. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-02 13:24:56 +05:30
Sayak Paul	000fa82a1e	[Chore] remove class assignments for linear and conv. (#7553 ) * remove class assignments for linear and conv. * fix: self.nn	2024-04-02 13:01:04 +05:30
Sayak Paul	5d83f50c23	[Release tests] make nightly workflow dispatchable. (#7541 ) * make nightly workflow dispatchable. * add a note about running the release tests to setup.py	2024-04-02 12:21:17 +05:30
Dhruv Nair	5d21d4a204	Fix FreeU tests (#7540 ) update	2024-04-02 11:05:50 +05:30
Álvaro Somoza	73ba81090e	[Community pipeline] SDXL Differential Diffusion Img2Img Pipeline (#7550 ) * initial-commit pipeline created * updated README.md	2024-04-01 18:15:30 -10:00
YiYi Xu	7956c36aaa	add a `from_pipe` method to `DiffusionPipeline` (#7241 ) * add from_pipe --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 13:02:00 -10:00
haikmanukyan	5266ab7935	add HD-Painter pipeline (#7520 ) * add HD-Painter pipeline * style fixing * refactor, change doc, fix ruff * fix docs * used correct ruff version --------- Co-authored-by: Hayk Manukyan <youremail@yourdomain.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-04-01 15:10:44 +05:30
YiYi Xu	7f724a930e	fix the cpu offload tests (#7544 ) fix	2024-04-01 14:27:14 +05:30
Jianbing Wu	9bef9f4be7	Fix SVD bug (shape of `time_context`) (#7268 ) * Fix SVD bug (shape of `time_context`) * Formatting code * Formatting src/diffusers/models/transformers/transformer_temporal.py by `make style && make quality` --------- Co-authored-by: kevinkhwu <kevinkhwu@tencent.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-01 14:05:52 +05:30
Dhruv Nair	7aa4514260	Fix typo in CPU offload test (#7542 ) update	2024-03-31 22:07:17 -10:00
Bingxin Ke	c2e87869be	[Community pipeline] Marigold depth estimation update -- align with marigold v0.1.5 (#7524 ) * add resample option; check denoise_step; update ckpt path * Add seeding in pipeline to increase reproducibility * fix typo * fix typo	2024-03-30 07:09:02 -10:00
Stephen	ca61287daa	Fix IP Adapter Support for SAG Pipeline (#7260 ) * fix ip adapter support * Update sag pipelines tests, adjust sag pipeline to pass tests --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-03-30 06:15:29 -10:00

1 2 3 4 5 ...

3984 Commits