diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Aryan	2ffbb88f1c	[training] CogVideoX-I2V LoRA (#9482 ) * update * update * update * update * update * add coauthor Co-Authored-By: yuan-shenghai <963658029@qq.com> * add coauthor Co-Authored-By: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com> * update Co-Authored-By: yuan-shenghai <963658029@qq.com> * update --------- Co-authored-by: yuan-shenghai <963658029@qq.com> Co-authored-by: Shenghai Yuan <140951558+SHYuanBest@users.noreply.github.com>	2024-10-16 02:07:07 +05:30
Ahnjj_DEV	d40da7b68a	Fix some documentation in ./src/diffusers/models/adapter.py (#9591 ) * Fix some documentation in ./src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update src/diffusers/models/adapter.py Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * run make style * make style & fix * make style : 0.1.5 version ruff * revert changes to examples --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-15 10:27:39 -07:00
wony617	a3e8d3f7de	[docs] refactoring docstrings in `models/embeddings_flax.py` (#9592 ) * [docs] refactoring docstrings in `models/embeddings_flax.py` * Update src/diffusers/models/embeddings_flax.py * make style --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-15 19:15:14 +05:30
wony617	fff4be8e23	[docs] refactoring docstrings in `community/hd_painter.py` (#9593 ) * [docs] refactoring docstrings in community/hd_painter.py * Update examples/community/hd_painter.py Co-authored-by: Aryan <contact.aryanvs@gmail.com> * make style --------- Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-15 18:50:12 +05:30
Jiwook Han	355bb641e3	[doc] Fix some docstrings in `src/diffusers/training_utils.py` (#9606 ) * refac: docstrings in training_utils.py * fix: manual edits * run make style * add docstring at cast_training_params --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-15 18:47:52 +05:30
Charchit Sharma	92d2baf643	refactor image_processor.py file (#9608 ) * refactor image_processor file * changes as requested * +1 edits * quality fix * indent issue --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-15 17:20:33 +05:30
0x名無し	dccf39f01e	Dreambooth lora flux bug 3dtensor to 2dtensor (#9653 ) * fixed issue #9350, Tensor is deprecated * ran make style	2024-10-15 17:18:13 +05:30
Sayak Paul	99d87474fd	[Chore] fix import of EntryNotFoundError. (#9676 ) fix import of EntryNotFoundError.	2024-10-15 14:07:08 +05:30
Robin	79b118e863	[Fix] when run load pretain with local_files_only, local variable 'cached_folder' referenced before assignment (#9376 ) Fix local variable 'cached_folder' referenced before assignment in hub_utils.py Fix when use `local_files_only=True` with `subfolder`, local variable 'cached_folder' referenced before assignment issue. Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 20:49:36 -10:00
hlky	9d0616189e	Slight performance improvement to `Euler`, `EDMEuler`, `FlowMatchHeun`, `KDPM2Ancestral` (#9616 ) * Slight performance improvement to Euler * Slight performance improvement to EDMEuler * Slight performance improvement to FlowMatchHeun * Slight performance improvement to KDPM2Ancestral * Update KDPM2AncestralDiscreteSchedulerTest --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 19:34:25 -10:00
hlky	5f0df17703	Refactor SchedulerOutput and add pred_original_sample in `DPMSolverSDE`, `Heun`, `KDPM2Ancestral` and `KDPM2` (#9650 ) Refactor SchedulerOutput and add pred_original_sample Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 18:11:01 -10:00
hlky	957e5cabff	Convert list/tuple of `HunyuanDiT2DControlNetModel` to `HunyuanDiT2DMultiControlNetModel` (#9651 ) Convert list/tuple of HunyuanDiT2DControlNetModel to HunyuanDiT2DMultiControlNetModel Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 18:09:30 -10:00
hlky	3e4c5707c3	Convert list/tuple of `SD3ControlNetModel` to `SD3MultiControlNetModel` (#9652 ) Convert list/tuple of SD3ControlNetModel to SD3MultiControlNetModel Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 17:57:34 -10:00
hlky	1bcd19e4d0	Add pred_original_sample to `if not return_dict` path (#9649 ) Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 17:56:54 -10:00
SahilCarterr	22ed39f571	Added Lora Support to SD3 Img2Img Pipeline (#9659 ) * add lora	2024-10-14 11:39:20 -10:00
Tolga Cangöz	56c21150d8	[`Community Pipeline`] Add 🪆Matryoshka Diffusion Models (#9157 )	2024-10-14 11:38:44 -10:00
Leo Jiang	5956b68a69	Improve the performance and suitable for NPU computing (#9642 ) * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU computing * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU * Improve the performance and suitable for NPU --------- Co-authored-by: 蒋硕 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-14 21:39:33 +05:30
Yuxuan.Zhang	8d81564b27	CogView3Plus DiT (#9570 ) * merge 9588 * max_shard_size="5GB" for colab running * conversion script updates; modeling test; refactor transformer * make fix-copies * Update convert_cogview3_to_diffusers.py * initial pipeline draft * make style * fight bugs 🐛🪳 * add example * add tests; refactor * make style * make fix-copies * add co-author YiYi Xu <yixu310@gmail.com> * remove files * add docs * add co-author Co-Authored-By: YiYi Xu <yixu310@gmail.com> * fight docs * address reviews * make style * make model work * remove qkv fusion * remove qkv fusion tets * address review comments * fix make fix-copies error * remove None and TODO * for FP16(draft) * make style * remove dynamic cfg * remove pooled_projection_dim as a parameter * fix tests --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-14 19:30:36 +05:30
Ryan Lin	68d16f7806	Flux - soft inpainting via differential diffusion (#9268 ) * Flux - soft inpainting via differential diffusion * . * track changes to FluxInpaintPipeline * make mask arrangement simplier * make style --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> Co-authored-by: asomoza <somoza.alvaro@gmail.com>	2024-10-14 10:07:48 -03:00
Sayak Paul	86bcbc389e	[Tests] increase transformers version in `test_low_cpu_mem_usage_with_loading` (#9662 ) increase transformers version in test_low_cpu_mem_usage_with_loading	2024-10-13 22:39:38 +05:30
Jinzhe Pan	6a5f06488c	[docs] Fix xDiT doc image damage (#9655 ) * docs: fix xDiT doc image damage * doc: move xdit images to hf dataset --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-10-12 13:05:07 +05:30
Sayak Paul	c7a6d77b5f	[CI] replace ubuntu version to 22.04. (#9656 ) replace ubuntu version to 22.04.	2024-10-12 11:55:36 +05:30
hlky	0f8fb75c7b	FluxMultiControlNetModel (#9647 )	2024-10-11 14:39:19 -03:00
M Saqlain	3033f08201	Add Differential Diffusion to Kolors (#9423 ) * Added diff diff support for kolors img2img * Fized relative imports * Fized relative imports * Added diff diff support for Kolors * Fized import issues * Added map * Fized import issues * Fixed naming issues * Added diffdiff support for Kolors img2img pipeline * Removed example docstrings * Added map input * Updated latents Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Updated `original_with_noise` Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com> * Improved code quality --------- Co-authored-by: Álvaro Somoza <asomoza@users.noreply.github.com>	2024-10-11 10:47:31 -03:00
GSSun	164ec9f423	fix IsADirectoryError when running the training code for sd3_dreambooth_lora_16gb.ipynb (#9634 ) Add files via upload fix IsADirectoryError when running the training code	2024-10-11 13:33:39 +05:30
Subho Ghosh	38a3e4df92	flux controlnet control_guidance_start and control_guidance_end implement (#9571 ) * flux controlnet control_guidance_start and control_guidance_end implement * minor fix - added docstrings, consistent controlnet scale flux and SD3	2024-10-10 09:29:02 -03:00
Sayak Paul	e16fd93d0a	[LoRA] fix dora test to catch the warning properly. (#9627 ) fix dora test.	2024-10-10 11:47:49 +05:30
Pakkapon Phongthawee	07bd2fabb6	make controlnet support interrupt (#9620 ) * make controlnet support interrupt * remove white space in controlnet interrupt	2024-10-09 12:03:13 -10:00
SahilCarterr	af28ae2d5b	add PAG support for SD Img2Img (#9463 ) * added pag to sd img2img pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-09 10:40:58 -10:00
Sayak Paul	31058cdaef	[LoRA] allow loras to be loaded with low_cpu_mem_usage. (#9510 ) * allow loras to be loaded with low_cpu_mem_usage. * add flux support but note https://github.com/huggingface/diffusers/pull/9510#issuecomment-2378316687 * low_cpu_mem_usage. * fix-copies * fix-copies again * tests * _LOW_CPU_MEM_USAGE_DEFAULT_LORA * _peft_version default. * version checks. * version check. * version check. * version check. * require peft 0.13.1. * explicitly specify low_cpu_mem_usage=False. * docs. * transformers version 4.45.2. * update * fix * empty * better name initialize_dummy_state_dict. * doc todos. * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * style * fix-copies --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-09 10:57:16 +05:30
Yijun Lee	ec9e5264c0	refac/pipeline_output (#9582 )	2024-10-08 16:11:13 -10:00
sanaka	acd6d2c42f	Fix the bug that `joint_attention_kwargs` is not passed to the FLUX's transformer attention processors (#9517 ) * Update transformer_flux.py	2024-10-08 11:25:48 -10:00
v2ray	86bd991ee5	Fixed noise_pred_text referenced before assignment. (#9537 ) * Fixed local variable noise_pred_text referenced before assignment when using PAG with guidance scale and guidance rescale at the same time. * Fixed style. * Made returning text pred noise an argument.	2024-10-08 09:27:10 -10:00
Sayak Paul	02eeb8e77e	[LoRA] Handle DoRA better (#9547 ) * handle dora. * print test * debug * fix * fix-copies * update logits * add warning in the test. * make is_dora check consistent. * fix-copies	2024-10-08 21:47:44 +05:30
glide-the	66eef9a6dc	fix: CogVideox train dataset _preprocess_data crop video (#9574 ) * Removed int8 to float32 conversion (`* 2.0 - 1.0`) from `train_transforms` as it caused image overexposure. Added `_resize_for_rectangle_crop` function to enable video cropping functionality. The cropping mode can be configured via `video_reshape_mode`, supporting options: ['center', 'random', 'none']. * The number 127.5 may experience precision loss during division operations. * wandb request pil image Type * Resizing bug * del jupyter * make style * Update examples/cogvideo/README.md * make style --------- Co-authored-by: --unset <--unset> Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-08 12:52:52 +05:30
Sayak Paul	63a5c8742a	Update distributed_inference.md to include `transformer.device_map` (#9553 ) * Update distributed_inference.md to include `transformer.device_map` * Update docs/source/en/training/distributed_inference.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-10-08 08:03:51 +05:30
Eliseu Silva	1287822973	Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576 ) (#9587 ) * Fix for use_safetensors parameters, allow use of parameter on loading submodels (#9576)	2024-10-07 10:41:32 -10:00
Yijun Lee	a80f689200	refac: docstrings in import_utils.py (#9583 ) * refac: docstrings in import_utils.py * Update import_utils.py	2024-10-07 13:27:35 -07:00
captainzz	2cb383f591	fix vae dtype when accelerate config using --mixed_precision="fp16" (#9601 ) * fix vae dtype when accelerate config using --mixed_precision="fp16" * Add param for upcast vae	2024-10-07 21:00:25 +05:30
Sayak Paul	31010ecc45	[Chore] add a note on the versions in Flux LoRA integration tests (#9598 ) add a note on the versions.	2024-10-07 17:43:48 +05:30
Clem	3159e60d59	fix xlabs FLUX lora conversion typo (#9581 ) * fix startswith syntax in xlabs lora conversion * Trigger CI https://github.com/huggingface/diffusers/pull/9581#issuecomment-2395530360	2024-10-07 10:47:54 +05:30
YiYi Xu	99f608218c	[sd3] make sure height and size are divisible by `16` (#9573 ) * check size * up	2024-10-03 08:36:26 -10:00
Xiangchendong	7f323f0f31	fix cogvideox autoencoder decode (#9569 ) Co-authored-by: Aryan <aryan@huggingface.co>	2024-10-02 09:07:06 -10:00
Darren Hsu	61d37640ad	Support bfloat16 for Upsample2D (#9480 ) * Support bfloat16 for Upsample2D * Add test and use is_torch_version * Resolve comments and add decorator * Simplify require_torch_version_greater_equal decorator * Run make style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-10-01 16:08:12 -10:00
JuanCarlosPi	33fafe3d14	Add PAG support to StableDiffusionControlNetPAGInpaintPipeline (#8875 ) * Add pag to controlnet inpainting pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-30 20:04:42 -10:00
hlky	c4a8979f30	Add beta sigmas to other schedulers and update docs (#9538 )	2024-09-30 09:00:54 -10:00
Sayak Paul	f9fd511466	[LoRA] support Kohya Flux LoRAs that have text encoders as well (#9542 ) * support kohya flux loras that have tes.	2024-09-30 07:59:39 -10:00
Sayak Paul	8e7d6c03a3	[chore] fix: retain memory utility. (#9543 ) * fix: retain memory utility. * fix * quality * free_memory.	2024-09-28 21:08:45 +05:30
Anand Kumar	b28675c605	[train_instruct_pix2pix.py]Fix the LR schedulers when `num_train_epochs` is passed in a distributed training env (#9316 ) Fixed pix2pix lr scheduler Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-28 21:01:37 +05:30
Aryan	bd4df2856a	[refactor] remove conv_cache from CogVideoX VAE (#9524 ) * remove conv cache from the layer and pass as arg instead * make style * yiyi's cleaner implementation Co-Authored-By: YiYi Xu <yixu310@gmail.com> * sayak's compiled implementation Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-28 17:09:30 +05:30

4630 Commits