diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Juan Acevedo	45aa8bb187	Ptxla sd training (#9381 ) * enable pxla training of stable diffusion 2.x models. * run linter/style and run pipeline test for stable diffusion and fix issues. * update xla libraries * fix read me newline. * move files to research folder. * update per comments. * rename readme. --------- Co-authored-by: Juan Acevedo <jfacevedo@google.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-12 08:35:06 +05:30
Aryan	5e1427a7da	[docs] AnimateDiff FreeNoise (#9414 ) * update docs * apply suggestions from review * Update docs/source/en/api/pipelines/animatediff.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/animatediff.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/animatediff.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * apply suggestions from review --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-09-11 12:59:58 -07:00
asfiyab-nvidia	b9e2f886cd	FluxPosEmbed: Remove Squeeze No-op (#9409 ) Remove Squeeze op Signed-off-by: Asfiya Baig <asfiyab@nvidia.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-10 19:12:36 -10:00
dianyo	b19827f6b4	Migrate the BrownianTree to BrownianInterval in DPM solver (#9335 ) migrate the BrownianTree to BrownianInterval Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-09-10 18:29:15 -10:00
Yu Zheng	c002731d93	[examples] add controlnet sd3 example (#9249 ) * add controlnet sd3 example * add controlnet sd3 example * update controlnet sd3 example * add controlnet sd3 example test * fix quality and style * update test * update test --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-11 07:04:37 +05:30
Sayak Paul	adf1f911f0	[Tests] fix some fast gpu tests. (#9379 ) fix some fast gpu tests.	2024-09-11 06:50:02 +05:30
captainzz	f28a8c257a	fix from_transformer() with extra conditioning channels (#9364 ) * fix from_transformer() with extra conditioning channels * style fix --------- Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Álvaro Somoza <somoza.alvaro@gmail.com>	2024-09-09 07:51:48 -10:00
Jinzhe Pan	2c6a6c97b3	[docs] Add xDiT in section optimization (#9365 ) * docs: add xDiT to optimization methods * fix: picture layout problem * docs: add more introduction about xdit & apply suggestions * Apply suggestions from code review Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-09-09 10:31:07 -07:00
Igor Filippov	a7361dccdc	[Pipeline] animatediff + vid2vid + controlnet (#9337 ) * add animatediff + vid2vide + controlnet * post tests fixes * PR discussion fixes * update docs * change input video to links on HF + update an example * make quality fix * fix ip adapter test * fix ip adapter test input * update ip adapter test	2024-09-09 22:48:21 +05:30
YiYi Xu	485b8bb000	refactor `get_timesteps` for SDXL img2img + add set_begin_index (#9375 ) * refator + add begin_index * add kolors img2img to doc	2024-09-09 06:38:22 -10:00
Sayak Paul	d08ad65819	modify benchmarks to replace sdv1.5 with dreamshaper. (#9334 )	2024-09-09 20:54:56 +05:30
YiYi Xu	8cdcdd9e32	add flux inpaint + img2img + controlnet to auto pipeline (#9367 )	2024-09-06 07:14:48 -10:00
Dhruv Nair	d269cc8a4e	[CI] Quick fix for Cog Video Test (#9373 ) update	2024-09-06 15:25:53 +05:30
Aryan	6dfa49963c	[core] Freenoise memory improvements (#9262 ) * update * implement prompt interpolation * make style * resnet memory optimizations * more memory optimizations; todo: refactor * update * update animatediff controlnet with latest changes * refactor chunked inference changes * remove print statements * update * chunk -> split * remove changes from incorrect conflict resolution * remove changes from incorrect conflict resolution * add explanation of SplitInferenceModule * update docs * Revert "update docs" This reverts commit `c55a50a271`. * update docstring for freenoise split inference * apply suggestions from review * add tests * apply suggestions from review	2024-09-06 12:51:20 +05:30
Haruya Ishikawa	5249a2666e	fix one uncaught deprecation warning for accessing vae_latent_channels in VaeImagePreprocessor (#9372 ) deprecation warning vae_latent_channels	2024-09-05 07:32:27 -10:00
Linoy Tsaban	55ac421f7b	improve README for flux dreambooth lora (#9290 ) * improve readme * improve readme * improve readme * improve readme	2024-09-05 17:53:23 +05:30
Dhruv Nair	53051cf282	[CI] Update Single file Nightly Tests (#9357 ) * update * update	2024-09-05 14:33:44 +05:30
Tolga Cangöz	3000551729	Update `UNet2DConditionModel`'s error messages (#9230 ) * refactor	2024-09-04 10:49:56 -10:00
Vishnu V Jaddipal	249a9e48e8	Add Flux inpainting and Flux Img2Img (#9135 ) --------- Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-09-04 10:31:43 -10:00
Fanli Lin	2ee3215949	[tests] make 2 tests device-agnostic (#9347 ) * enabel on xpu * fix style	2024-09-03 16:34:03 -10:00
Eduardo Escobar	8ecf499d8b	Enable `load_lora_weights` for `StableDiffusion3InpaintPipeline` (#9330 ) Enable load_lora_weights for StableDiffusion3InpaintPipeline Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-03 15:19:37 -10:00
YiYi Xu	dcf320f293	small update on rotary embedding (#9354 ) * update * fix --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-03 07:18:33 -10:00
Sayak Paul	8ba90aa706	chore: add a cleaning utility to be useful during training. (#9240 )	2024-09-03 15:00:17 +05:30
Aryan	9d49b45b19	[refactor] move positional embeddings to patch embed layer for CogVideoX (#9263 ) * remove frame limit in cogvideox * remove debug prints * Update src/diffusers/models/transformers/cogvideox_transformer_3d.py * revert pipeline; remove frame limitation * revert transformer changes * address review comments * add error message * apply suggestions from review	2024-09-03 14:45:12 +05:30
Dhruv Nair	81da2e1c95	[CI] Add option to dispatch Fast GPU tests on main (#9355 ) update	2024-09-03 14:35:13 +05:30
Aryan	24053832b5	[tests] remove/speedup some low signal tests (#9285 ) * remove 2 shapes from SDFunctionTesterMixin::test_vae_tiling * combine freeu enable/disable test to reduce many inference runs * remove low signal unet test for signature * remove low signal embeddings test * remove low signal progress bar test from PipelineTesterMixin * combine ip-adapter single and multi tests to save many inferences * fix broken tests * Update tests/pipelines/test_pipelines_common.py * Update tests/pipelines/test_pipelines_common.py * add progress bar tests	2024-09-03 13:59:18 +05:30
Dhruv Nair	f6f16a0c11	[CI] More Fast GPU Test Fixes (#9346 ) * update * update * update * update	2024-09-03 13:22:38 +05:30
Vishnu V Jaddipal	1c1ccaa03f	Xlabs lora fix (#9348 ) * Fix ```from_single_file``` for xl_inpaint * Add basic flux inpaint pipeline * style, quality, stray print * Fix stray changes * Add inpainting model support * Change lora conversion for xlabs * Fix stray changes * Apply suggestions from code review * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-09-03 10:43:43 +05:30
Dhruv Nair	007ad0e2aa	[CI] More fixes for Fast GPU Tests on main (#9300 ) update	2024-09-02 17:51:48 +05:30
Aryan	0e6a8403f6	[core] Support VideoToVideo with CogVideoX (#9333 ) * add vid2vid pipeline for cogvideox * make fix-copies * update docs * fake context parallel cache, vae encode tiling * add test for cog vid2vid * use video link from HF docs repo * add copied from comments; correctly rename test class	2024-09-02 16:54:58 +05:30
Aryan	af6c0fb766	[core] CogVideoX memory optimizations in VAE encode (#9340 ) fake context parallel cache, vae encode tiling (cherry picked from commit `bf890bca0e`)	2024-09-02 15:48:37 +05:30
YiYi Xu	d8a16635f4	update runway repo for single_file (#9323 ) update to a place holder	2024-08-30 08:51:21 -10:00
Aryan	e417d02811	[docs] Add a note on torchao/quanto benchmarks for CogVideoX and memory-efficient inference (#9296 ) * add a note on torchao/quanto benchmarks and memory-efficient inference * apply suggestions from review * update * Update docs/source/en/api/pipelines/cogvideox.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/cogvideox.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * add note on enable sequential cpu offload --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-08-30 13:53:25 +05:30
Dhruv Nair	1d4d71875b	[CI] Update Hub Token on nightly tests (#9318 ) update	2024-08-30 10:23:50 +05:30
YiYi Xu	61d96c3ae7	refactor rotary embedding 3: so it is not on cpu (#9307 ) change get_1d_rotary to accept pos as torch tensors	2024-08-30 01:07:15 +05:30
YiYi Xu	4f495b06dc	rotary embedding refactor 2: update comments, fix dtype for use_real=False (#9312 ) fix notes and dtype	2024-08-28 23:31:47 -10:00
Anand Kumar	40c13fe5b4	[train_custom_diffusion.py] Fix the LR schedulers when `num_train_epochs` is passed in a distributed training env (#9308 ) * Update train_custom_diffusion.py to fix the LR schedulers for `num_train_epochs` * Fix saving text embeddings during safe serialization * Fixed formatting	2024-08-29 14:23:36 +05:30
Sayak Paul	2a3fbc2cc2	[LoRA] support kohya and xlabs loras for flux. (#9295 ) * support kohya lora in flux. * format * support xlabs * diffusion_model prefix. * Apply suggestions from code review Co-authored-by: apolinário <joaopaulo.passos@gmail.com> * empty commit. Co-authored-by: Leommm-byte <leom20031@gmail.com> --------- Co-authored-by: apolinário <joaopaulo.passos@gmail.com> Co-authored-by: Leommm-byte <leom20031@gmail.com>	2024-08-29 07:41:46 +05:30
apolinário	089cf798eb	Change default for `guidance_scale`in FLUX (#9305 ) To match the original code, 7.0 is too high	2024-08-28 07:39:45 -10:00
Aryan	cbc2ec8f44	AnimateDiff prompt travel (#9231 ) * update * implement prompt interpolation * make style * resnet memory optimizations * more memory optimizations; todo: refactor * update * update animatediff controlnet with latest changes * refactor chunked inference changes * remove print statements * undo memory optimization changes * update docstrings * fix tests * fix pia tests * apply suggestions from review * add tests * update comment	2024-08-28 14:48:12 +05:30
Frank (Haofan) Wang	b5f591fea8	Update __init__.py (#9286 )	2024-08-27 07:57:25 -10:00
Dhruv Nair	05b38c3c0d	Fix Flux CLIP prompt embeds repeat for num_images_per_prompt > 1 (#9280 ) update	2024-08-27 07:41:12 -10:00
Dhruv Nair	8f7fde5701	[CI] Update Release Tests (#9274 ) * update * update	2024-08-27 18:34:00 +05:30
Dhruv Nair	a59672655b	Fix Freenoise for AnimateDiff V3 checkpoint. (#9288 ) update	2024-08-27 18:30:39 +05:30
Marçal Comajoan Cara	9aca79f2b8	Replace transformers.deepspeed with transformers.integrations.deepspeed (#9281 ) to avoid "FutureWarning: transformers.deepspeed module is deprecated and will be removed in a future version. Please import deepspeed modules directly from transformers.integrations" Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-08-27 18:08:23 +05:30
Steven Liu	bbcf2a8589	[docs] Add pipelines to table (#9282 ) update pipelines	2024-08-27 12:15:30 +05:30
Álvaro Somoza	4cfb2164fb	[IP Adapter] Fix `cache_dir` and `local_files_only` for image encoder (#9272 ) initial fix	2024-08-26 09:03:08 -10:00
Linoy Tsaban	c977966502	[Dreambooth flux] bug fix for dreambooth script (align with dreambooth lora) (#9257 ) * fix shape * fix prompt encoding * style * fix device * add comment	2024-08-26 17:29:58 +05:30
YiYi Xu	1ca0a75567	refactor 3d rope for cogvideox (#9269 ) * refactor 3d rope * repeat -> expand	2024-08-25 11:57:12 -10:00
王奇勋	c1e6a32ae4	[Flux] Support Union ControlNet (#9175 ) * refactor --------- Co-authored-by: haofanwang <haofanwang.ai@gmail.com>	2024-08-25 00:24:21 -10:00

... 32 33 34 35 36 ...

6185 Commits