diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
satani99	9003d75f20	Add StableDiffusionXLControlNetPAGImg2ImgPipeline (#8990 ) * Added pad controlnet sdxl img2img pipeline --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-08-21 07:24:22 -10:00
YiYi Xu	214372aa99	fix a regression in `is_safetensors_compatible` (#9234 ) fix	2024-08-21 18:56:55 +05:30
Vinh H. Pham	867e0c919e	StableDiffusionLatentUpscalePipeline - positive/negative prompt embeds support (#8947 ) * make latent upscaler accept prompt embeds --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-08-20 18:00:55 -10:00
Dhruv Nair	940b8e0358	[CI] Multiple Slow Test fixes. (#9198 ) * update * update * update * update	2024-08-19 13:31:09 +05:30
Dhruv Nair	b2add10d13	Update `is_safetensors_compatible` check (#8991 ) * update * update * update * update * update	2024-08-19 11:35:22 +05:30
Aryan	a85b34e7fd	[refactor] CogVideoX followups + tiled decoding support (#9150 ) * refactor context parallel cache; update torch compile time benchmark * add tiling support * make style * remove num_frames % 8 == 0 requirement * update default num_frames to original value * add explanations + refactor * update torch compile example * update docs * update * clean up if-statements * address review comments * add test for vae tiling * update docs * update docs * update docstrings * add modeling test for cogvideox transformer * make style	2024-08-14 03:53:21 +05:30
王奇勋	5ffbe14c32	[FLUX] Support ControlNet (#9126 ) * cnt model * cnt model * cnt model * fix Loader "Copied" * format * txt_ids for multiple images * add test and format * typo * Update pipeline_flux_controlnet.py * remove * make quality * fix copy * Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/pipelines/flux/pipeline_flux_controlnet.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/models/controlnet_flux.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fix * make copies * test * bs --------- Co-authored-by: haofanwang <haofanwang.ai@gmail.com> Co-authored-by: haofanwang <haofan@HaofandeMBP.lan> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-08-13 18:17:40 +05:30
林金鹏	cc0513091a	Support SD3 controlnet inpainting (#9099 ) * add controlnet inpainting pipeline * [SD3] add controlnet inpaint example * update example and fix code style * fix code style with ruff * Update controlnet_sd3.md : add control inpaint pipeline * Update docs/source/en/api/pipelines/controlnet_sd3.md Co-authored-by: Aryan <contact.aryanvs@gmail.com> * Update docs/source/en/api/pipelines/controlnet_sd3.md Co-authored-by: Aryan <contact.aryanvs@gmail.com> * Update docs/source/en/api/pipelines/controlnet_sd3.md Co-authored-by: Aryan <contact.aryanvs@gmail.com> * Update src/diffusers/pipelines/controlnet_sd3/pipeline_stable_diffusion_3_controlnet_inpainting.py Co-authored-by: Aryan <contact.aryanvs@gmail.com> * Update __init__.py : add sd3 control pipelines * Update pipeline : add new param doc & check input reference. * fix typo * make style & make quality * add unittest for sd3 controlnet inpaint --------- Co-authored-by: 鹏徙 <linjinpeng.ljp@alibaba-inc.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com>	2024-08-13 17:30:46 +05:30
zR	2dad462d9b	Add CogVideoX text-to-video generation model (#9082 ) * add CogVideoX --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: sayakpaul <spsayakpaul@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-08-06 21:23:57 -10:00
Aryan	16a93f1a25	[core] FreeNoise (#8948 ) * initial work draft for freenoise; needs massive cleanup * fix freeinit bug * add animatediff controlnet implementation * revert attention changes * add freenoise * remove old helper functions * add decode batch size param to all pipelines * make style * fix copied from comments * make fix-copies * make style * copy animatediff controlnet implementation from #8972 * add experimental support for num_frames not perfectly fitting context length, ocntext stride * make unet motion model lora work again based on #8995 * copy load video utils from #8972 * copied from AnimateDiff::prepare_latents * address the case where last batch of frames does not match length of indices in prepare latents * decode_batch_size->vae_batch_size; batch vae encode support in animatediff vid2vid * revert sparsectrl and sdxl freenoise changes * revert pia * add freenoise tests * make fix-copies * improve docstrings * add freenoise tests to animatediff controlnet * update tests * Update src/diffusers/models/unets/unet_motion_model.py * add freenoise to animatediff pag * address review comments * make style * update tests * make fix-copies * fix error message * remove copied from comment * fix imports in tests * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-08-07 10:35:18 +05:30
Álvaro Somoza	39e1f7eaa4	[Kolors] Add PAG (#8934 ) * txt2img pag added * autopipe added, fixed case * style * apply suggestions * added fast tests, added todo tests * revert dummy objects for kolors * fix pag dummies * fix test imports * update pag tests * add kolor pag to docs --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-08-07 09:29:52 +05:30
Ahn Donghoon (안동훈 / suno)	926daa30f9	add PAG support for Stable Diffusion 3 (#8861 ) add pag sd3 --------- Co-authored-by: HyoungwonCho <jhw9811@korea.ac.kr> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: crepejung00 <jaewoojung00@naver.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-08-06 09:11:35 -10:00
Sayak Paul	52f1378e64	[Core] add QKV fusion to AuraFlow and PixArt Sigma (#8952 ) * add fusion support to pixart * add to auraflow. * add tests * apply review feedback. * add back args and kwargs * style	2024-08-05 14:09:37 -10:00
Tolga Cangöz	3dc97bd148	Update `CLIPFeatureExtractor` to `CLIPImageProcessor` and `DPTFeatureExtractor` to `DPTImageProcessor` (#9002 ) * fix: update `CLIPFeatureExtractor` to `CLIPImageProcessor` in codebase * `make style && make quality` * Update `DPTFeatureExtractor` to `DPTImageProcessor` in codebase * `make style` --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-08-05 09:20:29 -10:00
YiYi Xu	bc3c73ad0b	add sentencepiece as a soft dependency (#9065 ) * add sentencepiece as soft dependency for kolors * up --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-08-05 08:04:51 -10:00
Aryan	b7058d142c	PAG variant for HunyuanDiT, PAG refactor (#8936 ) * copy hunyuandit pipeline * pag variant of hunyuan dit * add tests * update docs * make style * make fix-copies * Update src/diffusers/pipelines/pag/pag_utils.py * remove incorrect copied from * remove pag hunyuan attn procs to resolve conflicts * add pag attn procs again * new implementation for pag_utils * revert pag changes * add pag refactor back; update pixart sigma * update pixart pag tests * apply suggestions from review Co-Authored-By: yixu310@gmail.com * make style * update docs, fix tests * fix tests * fix test_components_function since list not accepted as valid __init__ param * apply patch to fix broken tests Co-Authored-By: Sayak Paul <spsayakpaul@gmail.com> * make style * fix hunyuan tests --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-08-05 17:56:09 +05:30
Sayak Paul	0e460675e2	[Flux] allow tests to run (#9050 ) * fix tests * fix * float64 skip * remove sample_size. * remove * remove more * default_sample_size. * credit black forest for flux model. * skip * fix: tests * remove OriginalModelMixin * add transformer model test * add: transformer model tests	2024-08-02 11:49:59 +05:30
Sayak Paul	7b98c4cc67	[Core] Add PAG support for PixArtSigma (#8921 ) * feat: add pixart sigma pag. * inits. * fixes * fix * remove print. * copy paste methods to the pixart pag mixin * fix-copies * add documentation. * add tests. * remove correction file. * remove pag_applied_layers * empty	2024-08-02 07:12:41 +05:30
Sayak Paul	27637a5402	Flux pipeline (#9043 ) add flux! Signed-off-by: Adrien <adrien@huggingface.co> Co-authored-by: Adrien <adrien.69740@gmail.com> Co-authored-by: Anatoly Belikov <abelikov@singularitynet.io> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com>	2024-08-01 11:30:52 -10:00
Aryan	05b706c003	PAG variant for AnimateDiff (#8789 ) * add animatediff pag pipeline * remove unnecessary print * make fix-copies * fix ip-adapter bug * update docs * add fast tests and fix bugs * update * update * address review comments * update ip adapter single test expected slice * implement test_from_pipe_consistent_config; fix expected slice values * LoraLoaderMixin->StableDiffusionLoraLoaderMixin; add latest freeinit test	2024-08-01 12:39:39 +05:30
Yoach Lacombe	ea1b4ea7ca	Fix Stable Audio repository id (#9016 ) Fix Stable Audio repo id	2024-07-30 23:17:44 +05:30
Aryan	e5b94b4c57	[core] Move community AnimateDiff ControlNet to core (#8972 ) * add animatediff controlnet to core * make style; remove unused method * fix copied from comment * add tests * changes to make tests work * add utility function to load videos * update docs * update pipeline example * make style * update docs with example * address review comments * add latest freeinit test from #8969 * LoraLoaderMixin -> StableDiffusionLoraLoaderMixin * fix docs * Update src/diffusers/utils/loading_utils.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * fix: variable out of scope --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-07-30 17:10:37 +05:30
Yoach Lacombe	69e72b1dd1	Stable Audio integration (#8716 ) * WIP modeling code and pipeline * add custom attention processor + custom activation + add to init * correct ProjectionModel forward * add stable audio to __initèè * add autoencoder and update pipeline and modeling code * add half Rope * add partial rotary v2 * add temporary modfis to scheduler * add EDM DPM Solver * remove TODOs * clean GLU * remove att.group_norm to attn processor * revert back src/diffusers/schedulers/scheduling_dpmsolver_multistep.py * refactor GLU -> SwiGLU * remove redundant args * add channel multiples in autoencoder docstrings * changes in docsrtings and copyright headers * clean pipeline * further cleaning * remove peft and lora and fromoriginalmodel * Delete src/diffusers/pipelines/stable_audio/diffusers.code-workspace * make style * dummy models * fix copied from * add fast oobleck tests * add brownian tree * oobleck autoencoder slow tests * remove TODO * fast stable audio pipeline tests * add slow tests * make style * add first version of docs * wrap is_torchsde_available to the scheduler * fix slow test * test with input waveform * add input waveform * remove some todos * create stableaudio gaussian projection + make style * add pipeline to toctree * fix copied from * make quality * refactor timestep_features->time_proj * refactor joint_attention_kwargs->cross_attention_kwargs * remove forward_chunk * move StableAudioDitModel to transformers folder * correct convert + remove partial rotary embed * apply suggestions from yiyixuxu -> removing attn.kv_heads * remove temb * remove cross_attention_kwargs * further removal of cross_attention_kwargs * remove text encoder autocast to fp16 * continue removing autocast * make style * refactor how text and audio are embedded * add paper * update example code * make style * unify projection model forward + fix device placement * make style * remove fuse qkv * apply suggestions from review * Update src/diffusers/pipelines/stable_audio/pipeline_stable_audio.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * make style * smaller models in fast tests * pass sequential offloading fast tests * add docs for vae and autoencoder * make style and update example * remove useless import * add cosine scheduler * dummy classes * cosine scheduler docs * better description of scheduler --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-30 15:29:06 +05:30
Álvaro Somoza	73acebb8cf	[Kolors] Add IP Adapter (#8901 ) * initial draft * apply suggestions * fix failing test * added ipa to img2img * add docs * apply suggestions	2024-07-26 14:25:44 -04:00
Aryan	5c53ca5ed8	[core] AnimateDiff SparseCtrl (#8897 ) * initial sparse control model draft * remove unnecessary implementation * copy animatediff pipeline * remove deprecated callbacks * update * update pipeline implementation progress * make style * make fix-copies * update progress * add partially working pipeline * remove debug prints * add model docs * dummy objects * improve motion lora conversion script * fix bugs * update docstrings * remove unnecessary model params; docs * address review comment * add copied from to zero_module * copy animatediff test * add fast tests * update docs * update * update pipeline docs * fix expected slice values * fix license * remove get_down_block usage * remove temporal_double_self_attention from get_down_block * update * update docs with org and documentation images * make from_unet work in sparsecontrolnetmodel * add latest freeinit test from #8969 * make fix-copies * LoraLoaderMixin -> StableDiffsuionLoraLoaderMixin	2024-07-26 17:46:05 +05:30
Aryan	57a021d5e4	[fix] FreeInit step index out of bounds (#8969 ) * fix step index out of bounds * add test for free_init with different schedulers * add test to vid2vid and pia	2024-07-26 16:45:55 +05:30
Aryan	3ae0ee88d3	[tests] speed up animatediff tests (#8846 ) * speed up animatediff tests * fix pia test_ip_adapter_single * fix tests/pipelines/pia/test_pia.py::PIAPipelineFastTests::test_dict_tuple_outputs_equivalent * update * fix ip adapter tests * skip test_from_pipe_consistent_config tests * fix prompt_embeds test * update test_from_pipe_consistent_config tests * fix expected_slice values * remove temporal_norm_num_groups from UpBlockMotion --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-07-25 17:35:43 +05:30
Sayak Paul	d8bcb33f4b	[Tests] fix slices of 26 tests (first half) (#8959 ) * check for assertions. * update with correct slices. * okay * style * get it ready * update * update * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-07-25 14:56:49 +05:30
Dhruv Nair	93983b6780	[CI] Skip flaky download tests in PR CI (#8945 ) update	2024-07-24 09:25:06 +05:30
Sayak Paul	41b705f42d	remove residual i from auraflow. (#8949 ) * remove residual i. * rename to aura_flow in pipeline test	2024-07-24 07:31:54 +05:30
Sayak Paul	50d21f7c6a	[Core] fix QKV fusion for attention (#8829 ) * start debugging the problem, * start * fix * fix * fix imports. * handle hunyuan * remove residuals. * add a check for making sure there's appropriate procs. * add more rigor to the tests. * fix test * remove redundant check * fix-copies * move check_qkv_fusion_matches_attn_procs_length and check_qkv_fusion_processors_exist.	2024-07-24 06:52:19 +05:30
Aritra Roy Gosthipaty	8b21feed42	[Tests] reduce the model size in the audioldm2 fast test (#7846 ) * chore: initial model size reduction * chore: fixing expected values for failing tests * requested edits --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-23 14:34:07 +05:30
shinetzh	3b04cdc816	fix loop bug in SlicedAttnProcessor (#8836 ) * fix loop bug in SlicedAttnProcessor --------- Co-authored-by: neoshang <neoshang@tencent.com>	2024-07-19 18:14:29 -10:00
Aryan	bbd2f9d4e9	[tests] fix typo in pag tests (#8845 ) * fix typo in pag tests * fix typo	2024-07-12 17:41:34 +05:30
Nguyễn Công Tú Anh	d704b3bf8c	add PAG support sd15 controlnet (#8820 ) * add pag support sd15 controlnet * fix quality import * remove unecessary import * remove if state * fix tests * remove useless function * add sd1.5 controlnet pag docs --------- Co-authored-by: anhnct8 <anhnct8@fpt.com>	2024-07-12 15:42:56 +05:30
Sayak Paul	2261510bbc	[Core] Add AuraFlow (#8796 ) * add lavender flow transformer --------- Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-11 08:50:19 -10:00
Álvaro Somoza	87b9db644b	[Core] Add Kolors (#8812 ) * initial draft	2024-07-11 06:09:17 -10:00
Xin Ma	b8cf84a3f9	Latte: Latent Diffusion Transformer for Video Generation (#8404 ) * add Latte to diffusers * remove print * remove print * remove print * remove unuse codes * remove layer_norm_latte and add a flag * remove layer_norm_latte and add a flag * update latte_pipeline * update latte_pipeline * remove unuse squeeze * add norm_hidden_states.ndim == 2: # for Latte * fixed test latte pipeline bugs * fixed test latte pipeline bugs * delete sh * add doc for latte * add licensing * Move Transformer3DModelOutput to modeling_outputs * give a default value to sample_size * remove the einops dependency * change norm2 for latte * modify pipeline of latte * update test for Latte * modify some codes for latte * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * modify for Latte pipeline * video_length -> num_frames; update prepare_latents copied from * make fix-copies * make style * typo: videe -> video * update * modify for Latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify latte pipeline * modify for Latte pipeline * Delete .vscode directory * make style * make fix-copies * add latte transformer 3d to docs _toctree.yml * update example * reduce frames for test * fixed bug of _text_preprocessing * set num frame to 1 for testing * remove unuse print * add text = self._clean_caption(text) again --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com> Co-authored-by: Aryan <contact.aryanvs@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2024-07-11 15:06:22 +05:30
Xu Cao	35cc66dc4c	Add pipeline_stable_diffusion_3_inpaint.py for SD3 Inference (#8709 ) * Add pipeline_stable_diffusion_3_inpaint --------- Co-authored-by: Xu Cao <xucao2@jrehg-work-01.cs.illinois.edu> Co-authored-by: IrohXu <irohcao@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-08 15:53:02 -10:00
Tolga Cangöz	57084dacc5	Remove unnecessary lines (#8569 ) * Remove unused line --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-07-08 10:42:02 -10:00
PommesPeter	98388670d2	[Alpha-VLLM Team] Add Lumina-T2X to diffusers (#8652 ) --------- Co-authored-by: zhuole1025 <zhuole1025@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-07-07 17:12:09 -10:00
Aryan	a7b9634e95	Fix minor bug in SD3 img2img test (#8779 ) fix minor bug in sd3 img2img	2024-07-03 07:45:37 -10:00
Shauray Singh	8690e8b9d6	add PAG support for SD architecture (#8725 ) * add pag to sd pipelines	2024-06-29 09:26:11 -10:00
Dhruv Nair	150142c537	[Tests] Fix precision related issues in slow pipeline tests (#8720 ) update	2024-06-28 08:13:46 +05:30
Dhruv Nair	effe4b9784	Update xformers SD3 test (#8712 ) update	2024-06-26 10:24:27 -10:00
XCL	fa2abfdb03	[Tencent Hunyuan Team] Add Hunyuan-DiT ControlNet Inference (#8694 ) * add controlnet support --------- Co-authored-by: xingchaoliu <xingchaoliu@tencent.com> Co-authored-by: yiyixuxu <yixu310@gmail,com>	2024-06-26 00:43:03 -10:00
Dhruv Nair	0f0b531827	Add decorator for compile tests (#8703 ) * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-26 11:26:47 +05:30
YiYi Xu	540399f540	add PAG support (#7944 ) * first draft --------- Co-authored-by: yiyixuxu <yixu310@gmail,com> Co-authored-by: Junhwa Song <ethan9867@gmail.com> Co-authored-by: Ahn Donghoon (안동훈 / suno) <suno.vivid@gmail.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2024-06-25 08:40:02 -10:00
Sayak Paul	f088027e93	[Marigold tests] add `is_flaky` decorator to some Marigold tests (#8696 ) okay	2024-06-25 06:27:28 -10:00
Tolga Cangöz	138fac703a	Discourage using deprecated `revision` parameter (#8573 ) * Discourage using `revision` * `make style && make quality` * Refactor code to use 'variant' instead of 'revision' * `revision="bf16"` -> `variant="bf16"` --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 10:06:49 -07:00

... 3 4 5 6 7 ...

815 Commits