diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Tolga Cangöz	56c21150d8	[`Community Pipeline`] Add 🪆Matryoshka Diffusion Models (#9157 )	2024-10-14 11:38:44 -10:00
Leo Jiang	5956b68a69	Improve the performance and suitable for NPU computing (#9642 ) * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU computing * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-14 21:39:33 +05:30
Yuxuan.Zhang	8d81564b27	CogView3Plus DiT (#9570 ) * merge 9588 * max_shard_size="5GB" for colab running * conversion script updates; modeling test; refactor transformer * make fix-copies * Update convert_cogview3_to_diffusers.py * initial pipeline draft * make style * fight bugs 🐛🪳 * add example * add tests; refactor * make style * make fix-copies * add co-author YiYi Xu <yixu310@gmail.com> * remove files * add docs * add co-author Co-Authored-By: YiYi Xu <yixu310@gmail.com> * fight docs * address reviews * make style * make model work * remove qkv fusion * remove qkv fusion tets * address review comments * fix make fix-copies error * remove None and TODO * for FP16(draft) * make style * remove dynamic cfg * remove pooled_projection_dim as a parameter * fix tests --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 19:30:36 +05:30
Ryan Lin	68d16f7806	Flux - soft inpainting via differential diffusion (#9268 ) * Flux - soft inpainting via differential diffusion * . * track changes to FluxInpaintPipeline * make mask arrangement simplier * make style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: asomoza <somoza.alvaro@gmail.com>	2024-10-14 10:07:48 -03:00
Sayak Paul	86bcbc389e	[Tests] increase transformers version in `test_low_cpu_mem_usage_with_loading` (#9662 ) increase transformers version in test_low_cpu_mem_usage_with_loading	2024-10-13 22:39:38 +05:30
Jinzhe Pan	6a5f06488c	[docs] Fix xDiT doc image damage (#9655 ) * docs: fix xDiT doc image damage * doc: move xdit images to hf dataset --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-12 13:05:07 +05:30
Sayak Paul	c7a6d77b5f	[CI] replace ubuntu version to 22.04. (#9656 ) replace ubuntu version to 22.04.	2024-10-12 11:55:36 +05:30
hlky	0f8fb75c7b	FluxMultiControlNetModel (#9647 )	2024-10-11 14:39:19 -03:00
M Saqlain	3033f08201	Add Differential Diffusion to Kolors (#9423 ) * Added diff diff support for kolors img2img * Fized relative imports * Fized relative imports * Added diff diff support for Kolors * Fized import issues * Added map * Fized import issues * Fixed naming issues * Added diffdiff support for Kolors img2img pipeline * Removed example docstrings * Added map input * Updated latents Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Updated `original_with_noise` Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Improved code quality --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2024-10-11 10:47:31 -03:00
GSSun	164ec9f423	fix IsADirectoryError when running the training code for sd3_dreambooth_lora_16gb.ipynb (#9634 ) Add files via upload fix IsADirectoryError when running the training code	2024-10-11 13:33:39 +05:30
Subho Ghosh	38a3e4df92	flux controlnet control_guidance_start and control_guidance_end implement (#9571 ) * flux controlnet control_guidance_start and control_guidance_end implement * minor fix - added docstrings, consistent controlnet scale flux and SD3	2024-10-10 09:29:02 -03:00
Sayak Paul	e16fd93d0a	[LoRA] fix dora test to catch the warning properly. (#9627 ) fix dora test.	2024-10-10 11:47:49 +05:30
Pakkapon Phongthawee	07bd2fabb6	make controlnet support interrupt (#9620 ) * make controlnet support interrupt * remove white space in controlnet interrupt	2024-10-09 12:03:13 -10:00
SahilCarterr	af28ae2d5b	add PAG support for SD Img2Img (#9463 ) * added pag to sd img2img pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-09 10:40:58 -10:00
Sayak Paul	31058cdaef	[LoRA] allow loras to be loaded with low_cpu_mem_usage. (#9510 ) * allow loras to be loaded with low_cpu_mem_usage. * add flux support but note https://github.com/huggingface/diffusers/pull/9510\#issuecomment-2378316687 * low_cpu_mem_usage. * fix-copies * fix-copies again * tests * _LOW_CPU_MEM_USAGE_DEFAULT_LORA * _peft_version default. * version checks. * version check. * version check. * version check. * require peft 0.13.1. * explicitly specify low_cpu_mem_usage=False. * docs. * transformers version 4.45.2. * update * fix * empty * better name initialize_dummy_state_dict. * doc todos. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style * fix-copies --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-09 10:57:16 +05:30
Yijun Lee	ec9e5264c0	refac/pipeline_output (#9582 )	2024-10-08 16:11:13 -10:00
sanaka	acd6d2c42f	Fix the bug that `joint_attention_kwargs` is not passed to the FLUX's transformer attention processors (#9517 ) * Update transformer_flux.py	2024-10-08 11:25:48 -10:00
v2ray	86bd991ee5	Fixed noise_pred_text referenced before assignment. (#9537 ) * Fixed local variable noise_pred_text referenced before assignment when using PAG with guidance scale and guidance rescale at the same time. * Fixed style. * Made returning text pred noise an argument.	2024-10-08 09:27:10 -10:00
Sayak Paul	02eeb8e77e	[LoRA] Handle DoRA better (#9547 ) * handle dora. * print test * debug * fix * fix-copies * update logits * add warning in the test. * make is_dora check consistent. * fix-copies	2024-10-08 21:47:44 +05:30
glide-the	66eef9a6dc	fix: CogVideox train dataset _preprocess_data crop video (#9574 ) * Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure. Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none']. * The number 127.5 may experience precision loss during division operations. * wandb request pil image Type * Resizing bug * del jupyter * make style * Update examples/cogvideo/README.md * make style --------- Co-authored-by: --unset <--unset> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-08 12:52:52 +05:30
Sayak Paul	63a5c8742a	Update distributed_inference.md to include `transformer.device_map` (#9553 ) * Update distributed_inference.md to include `transformer.device_map` * Update docs/source/en/training/distributed_inference.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-08 08:03:51 +05:30
Eliseu Silva	1287822973	Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576 ) (#9587 ) * Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576)	2024-10-07 10:41:32 -10:00
Yijun Lee	a80f689200	refac: docstrings in import_utils.py (#9583 ) * refac: docstrings in import_utils.py * Update import_utils.py	2024-10-07 13:27:35 -07:00
captainzz	2cb383f591	fix vae dtype when accelerate config using --mixed_precision="fp16" (#9601 ) * fix vae dtype when accelerate config using --mixed_precision="fp16" * Add param for upcast vae	2024-10-07 21:00:25 +05:30
Sayak Paul	31010ecc45	[Chore] add a note on the versions in Flux LoRA integration tests (#9598 ) add a note on the versions.	2024-10-07 17:43:48 +05:30
Clem	3159e60d59	fix xlabs FLUX lora conversion typo (#9581 ) * fix startswith syntax in xlabs lora conversion * Trigger CI https://github.com/huggingface/diffusers/pull/9581#issuecomment-2395530360	2024-10-07 10:47:54 +05:30
YiYi Xu	99f608218c	[sd3] make sure height and size are divisible by `16` (#9573 ) * check size * up	2024-10-03 08:36:26 -10:00
Xiangchendong	7f323f0f31	fix cogvideox autoencoder decode (#9569 ) Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-02 09:07:06 -10:00
Darren Hsu	61d37640ad	Support bfloat16 for Upsample2D (#9480 ) * Support bfloat16 for Upsample2D * Add test and use is_torch_version * Resolve comments and add decorator * Simplify require_torch_version_greater_equal decorator * Run make style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-01 16:08:12 -10:00
JuanCarlosPi	33fafe3d14	Add PAG support to StableDiffusionControlNetPAGInpaintPipeline (#8875 ) * Add pag to controlnet inpainting pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-30 20:04:42 -10:00
hlky	c4a8979f30	Add beta sigmas to other schedulers and update docs (#9538 )	2024-09-30 09:00:54 -10:00
Sayak Paul	f9fd511466	[LoRA] support Kohya Flux LoRAs that have text encoders as well (#9542 ) * support kohya flux loras that have tes.	2024-09-30 07:59:39 -10:00
Sayak Paul	8e7d6c03a3	[chore] fix: retain memory utility. (#9543 ) * fix: retain memory utility. * fix * quality * free_memory.	2024-09-28 21:08:45 +05:30
Anand Kumar	b28675c605	[train_instruct_pix2pix.py]Fix the LR schedulers when `num_train_epochs` is passed in a distributed training env (#9316 ) Fixed pix2pix lr scheduler Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-28 21:01:37 +05:30
Aryan	bd4df2856a	[refactor] remove conv_cache from CogVideoX VAE (#9524 ) * remove conv cache from the layer and pass as arg instead * make style * yiyi's cleaner implementation Co-Authored-By: YiYi Xu <yixu310@gmail.com> * sayak's compiled implementation Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-28 17:09:30 +05:30
Sayak Paul	11542431a5	[Core] fix variant-identification. (#9253 ) * fix variant-idenitification. * fix variant * fix sharded variant checkpoint loading. * Apply suggestions from code review * fixes. * more fixes. * remove print. * fixes * fixes * comments * fixes * apply suggestions. * hub_utils.py * fix test * updates * fixes * fixes * Apply suggestions from code review Co-authored-by: YiYi Xu <yixu310@gmail.com> * updates. * removep patch file. --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-28 09:57:31 +05:30
Sayak Paul	81cf3b2f15	[Tests] [LoRA] clean up the serialization stuff. (#9512 ) * clean up the serialization stuff. * better	2024-09-27 07:57:09 -10:00
PromeAI	534848c370	[examples] add train flux-controlnet scripts in example. (#9324 ) * add train flux-controlnet scripts in example. * fix error * fix subfolder error * fix preprocess error * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * fix readme * fix note error * add some Tutorial for deepspeed * fix some Format Error * add dataset_path example * remove print, add guidance_scale CLI, readable apply * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update,push_to_hub,save_weight_dtype,static method,clear_objs_and_retain_memory,report_to=wandb * add push to hub in readme * apply weighting schemes * add note * Update examples/controlnet/README_flux.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * make code style and quality * fix some unnoticed error * make code style and quality * add example controlnet in readme * add test controlnet * rm Remove duplicate notes * Fix formatting errors * add new control image * add model cpu offload * update help for adafactor * make quality & style * make quality and style * rename flux_controlnet_model_name_or_path * fix back src/diffusers/pipelines/flux/pipeline_flux_controlnet.py * fix dtype error by pre calculate text emb * rm image save * quality fix * fix test * fix tiny flux train error * change report to to tensorboard * fix save name error when test * Fix shrinking errors --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Your Name <you@example.com>	2024-09-27 13:31:47 +05:30
Sayak Paul	2daedc0ad3	[LoRA] make set_adapters() method more robust. (#9535 ) * make set_adapters() method more robust. * remove patch * better and concise code. * Update src/diffusers/loaders/lora_base.py Co-authored-by: YiYi Xu <yixu310@gmail.com> --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-27 07:32:43 +05:30
Aryan	665c6b47a2	[bug] Precedence of operations in VAE should be slicing -> tiling (#9342 ) * bugfix: precedence of operations should be slicing -> tiling * fix typo * fix another typo * deprecate current implementation of tiled_encode and use new impl * Update src/diffusers/models/autoencoders/autoencoder_kl.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/autoencoders/autoencoder_kl.py --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-26 22:12:07 +05:30
Álvaro Somoza	066ea374c8	[Tests] Fix ChatGLMTokenizer (#9536 ) fix	2024-09-25 22:10:15 -10:00
YiYi Xu	9cd37557d5	flux controlnet fix (control_modes batch & others) (#9507 ) * flux controlnet mode to take into account batch size * incorporate yiyixuxu's suggestions (cleaner logic) as well as clean up control mode handling for multi case * fix * fix use_guidance when controlnet is a multi and does not have config --------- Co-authored-by: Christopher Beckham <christopher.j.beckham@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-25 19:09:54 -10:00
hlky	1c6ede9371	[Schedulers] Add beta sigmas / beta noise schedule (#9509 ) Add beta sigmas / beta noise schedule	2024-09-25 13:30:32 -10:00
v2ray	aa3c46d99a	[Doc] Improved level of clarity for latents_to_rgb. (#9529 ) Fixed latents_to_rgb doc. Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2024-09-25 19:26:58 -03:00
YiYi Xu	c76e88405c	update get_parameter_dtype (#9526 ) * up * Update src/diffusers/models/modeling_utils.py Co-authored-by: Aryan <aryan@huggingface.co> --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-09-25 11:00:57 -10:00
Steven Liu	d9c969172d	[docs] Model sharding (#9521 ) * flux shard * feedback	2024-09-25 09:33:54 -07:00
Lee Penkman	065ce07ac3	Update community_projects.md (#9266 )	2024-09-25 08:54:36 -07:00
Sayak Paul	6ca5a58e43	[Community Pipeline] Batched implementation of Flux with CFG (#9513 ) * batched implementation of flux cfg. * style. * readme * remove comments.	2024-09-25 15:25:15 +05:30
hlky	b52684c3ed	Add exponential sigmas to other schedulers and update docs (#9518 )	2024-09-24 14:50:12 -10:00
YiYi Xu	bac8a2412d	a few fix for SingleFile tests (#9522 ) * update sd15 repo * update more	2024-09-24 13:36:53 -10:00

... 20 21 22 23 24 ...

5665 Commits