diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Omar Awile	df355ea2c6	Fix documentation for FluxPipeline (#10563 ) Fix argument name in 8bit quantized example Found a tiny mistake in the documentation where the text encoder model was passed to the wrong argument in the FluxPipeline.from_pretrained function.	2025-01-13 11:56:32 -08:00
Junsong Chen	ae019da9e3	[Sana] add Sana to auto-text2image-pipeline; (#10538 ) add Sana to auto-text2image-pipeline;	2025-01-13 09:54:37 -10:00
Sayak Paul	329771e542	[LoRA] improve failure handling for peft. (#10551 ) * improve failure handling for peft. * emppty * Update src/diffusers/loaders/peft.py Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com> --------- Co-authored-by: Benjamin Bossan <BenjaminBossan@users.noreply.github.com>	2025-01-13 09:20:49 -10:00
Dhruv Nair	f7cb595428	[Single File] Fix loading Flux Dev finetunes with Comfy Prefix (#10545 ) * update * update * update * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-13 21:25:07 +05:30
hlky	c3478a42b9	Fix Nightly AudioLDM2PipelineFastTests (#10556 ) * Fix Nightly AudioLDM2PipelineFastTests * add phonemizer to setup extras test * fix * make style	2025-01-13 13:54:06 +00:00
hlky	980736b792	Fix train_dreambooth_lora_sd3_miniature (#10554 )	2025-01-13 13:47:27 +00:00
hlky	50c81df4e7	Fix StableDiffusionInstructPix2PixPipelineSingleFileSlowTests (#10557 )	2025-01-13 13:47:10 +00:00
Aryan	e1c7269720	Fix Latte output_type (#10558 ) update	2025-01-13 19:15:59 +05:30
Sayak Paul	edb8c1bce6	[Flux] Improve true cfg condition (#10539 ) * improve flux true cfg condition * add test	2025-01-12 18:33:34 +05:30
Sayak Paul	0785dba4df	[Docs] Add negative prompt docs to FluxPipeline (#10531 ) * add negative_prompt documentation. * add proper docs for negative prompts * fix-copies * remove comment. * Apply suggestions from code review Co-authored-by: hlky <hlky@hlky.ac> * fix-copies --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-12 18:02:46 +05:30
Muyang Li	5cda8ea521	Use `randn_tensor` to replace `torch.randn` (#10535 ) `torch.randn` requires `generator` and `latents` on the same device, while the wrapped function `randn_tensor` does not have this issue.	2025-01-12 11:41:41 +05:30
Sayak Paul	36acdd7517	[Tests] skip tests properly with `unittest.skip()` (#10527 ) * skip tests properly. * more * more	2025-01-11 08:46:22 +05:30
Junyu Chen	e7db062e10	[DC-AE] support tiling for DC-AE (#10510 ) * autoencoder_dc tiling * add tiling and slicing support in SANA pipelines * create variables for padding length because the line becomes too long * add tiling and slicing support in pag SANA pipelines * revert changes to tile size * make style * add vae tiling test --------- Co-authored-by: Aryan <aryan@huggingface.co>	2025-01-11 07:15:26 +05:30
andreabosisio	1b0fe63656	Typo fix in the table number of a referenced paper (#10528 ) Correcting a typo in the table number of a referenced paper (in scheduling_ddim_inverse.py) Changed the number of the referenced table from 1 to 2 in a comment of the set_timesteps() method of the DDIMInverseScheduler class (also according to the description of the 'timestep_spacing' attribute of its __init__ method).	2025-01-10 17:15:25 -08:00
chaowenguo	d6c030fd37	add the xm.mark_step for the first denosing loop (#10530 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> * Update rerender_a_video.py * make style --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-10 21:03:41 +00:00
Sayak Paul	9f06a0d1a4	[CI] Match remaining assertions from big runner (#10521 ) * print * remove print. * print * update slice. * empty	2025-01-10 16:37:36 +05:30
Daniel Hipke	52c05bd4cd	Add a `disable_mmap` option to the `from_single_file` loader to improve load performance on network mounts (#10305 ) * Add no_mmap arg. * Fix arg parsing. * Update another method to force no mmap. * logging * logging2 * propagate no_mmap * logging3 * propagate no_mmap * logging4 * fix open call * clean up logging * cleanup * fix missing arg * update logging and comments * Rename to disable_mmap and update other references. * [Docs] Update ltx_video.md to remove generator from `from_pretrained()` (#10316) Update ltx_video.md to remove generator from `from_pretrained()` * docs: fix a mistake in docstring (#10319) Update pipeline_hunyuan_video.py docs: fix a mistake * [BUG FIX] [Stable Audio Pipeline] Resolve torch.Tensor.new_zeros() TypeError in function prepare_latents caused by audio_vae_length (#10306) [BUG FIX] [Stable Audio Pipeline] TypeError: new_zeros(): argument 'size' failed to unpack the object at pos 3 with error "type must be tuple of ints,but got float" torch.Tensor.new_zeros() takes a single argument size (int...) – a list, tuple, or torch.Size of integers defining the shape of the output tensor. in function prepare_latents: audio_vae_length = self.transformer.config.sample_size * self.vae.hop_length audio_shape = (batch_size // num_waveforms_per_prompt, audio_channels, audio_vae_length) ... audio = initial_audio_waveforms.new_zeros(audio_shape) audio_vae_length evaluates to float because self.transformer.config.sample_size returns a float Co-authored-by: hlky <hlky@hlky.ac> * [docs] Fix quantization links (#10323) Update overview.md * [Sana]add 2K related model for Sana (#10322) add 2K related model for Sana * Update src/diffusers/loaders/single_file_model.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * Update src/diffusers/loaders/single_file.py Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * make style --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Leojc <liao_junchao@outlook.com> Co-authored-by: Aditya Raj <syntaxticsugr@gmail.com> Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> Co-authored-by: Junsong Chen <cjs1020440147@icloud.com> Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-01-10 15:41:04 +05:30
Sayak Paul	a6f043a80f	[LoRA] allow big CUDA tests to run properly for LoRA (and others) (#9845 ) * allow big lora tests to run on the CI. * print * print. * print * print * print * print * more * print * remove print. * remove print * directly place on cuda. * remove pipeline. * remove * fix * fix * spaces * quality * updates * directly place flux controlnet pipeline on cuda. * torch_device instead of cuda. * style * device placement. * fixes * add big gpu marker for mochi; rename test correctly * address feedback * fix --------- Co-authored-by: Aryan <aryan@huggingface.co>	2025-01-10 12:50:24 +05:30
hlky	12fbe3f7dc	Use Pipelines without unet (#10440 ) * Use Pipelines without unet * unet.config.in_channels * default_sample_size * is_unet_version_less_0_9_0 --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 04:45:42 +00:00
Linoy Tsaban	83ba01a38d	small readme changes for advanced training examples (#10473 ) add to readme about hf login and wandb installation to address https://github.com/huggingface/diffusers/issues/10142#issuecomment-2571655570 Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 07:35:19 +05:30
Zehuan Huang	7116fd24e5	Support pass kwargs to cogvideox custom attention processor (#10456 ) * Support pass kwargs to cogvideox custom attention processor * remove args in cogvideox attn processor * remove unused kwargs	2025-01-09 11:57:03 -10:00
Sayak Paul	553b13845f	[LoRA] clean up `load_lora_into_text_encoder()` and `fuse_lora()` copied from (#10495 ) * factor out text encoder loading. * make fix-copies * remove copied from fuse_lora and unfuse_lora as needed. * remove unused imports	2025-01-09 11:29:16 -10:00
chaowenguo	7bc8b92384	add callable object to convert frame into control_frame to reduce cpu memory usage. (#10501 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-09 11:25:53 -10:00
Vladimir Mandic	f0c6d9784b	flux: make scheduler config params optional (#10384 ) * dont assume scheduler has optional config params * make style, make fix-copies * calculate_shift * fix-copies, usage in pipelines --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-09 10:44:26 -10:00
Steven Liu	d006f0769b	[docs] Fix missing parameters in docstrings (#10419 ) * fix docstrings * add	2025-01-09 10:54:39 -08:00
geronimi73	a26d57097a	AutoModel instead of AutoModelForCausalLM (#10507 )	2025-01-09 16:28:04 +05:30
Sayak Paul	daf9d0f119	[chore] remove prints from tests. (#10505 ) remove prints from tests.	2025-01-09 14:19:43 +05:30
hlky	95c5ce4e6f	PyTorch/XLA support (#10498 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-08 12:31:27 -10:00
Junsong Chen	c0964571fc	[Sana 4K] (#10493 ) add 4K support for Sana	2025-01-08 11:58:11 -10:00
hlky	b13cdbb294	UNet2DModel mid_block_type (#10469 )	2025-01-08 10:50:29 -10:00
Bagheera	a0acbdc989	fix for #7365 , prevent pipelines from overriding provided prompt embeds (#7926 ) * fix for #7365, prevent pipelines from overriding provided prompt embeds * fix-copies * fix implementation * update --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2025-01-08 10:12:12 -10:00
Parag Ekbote	5655b22ead	Notebooks for Community Scripts-5 (#10499 ) Add 5 Notebooks for Diffusers Community Pipelines.	2025-01-08 08:56:17 -08:00
hlky	4df9d49218	Fix tokenizers install from main in LoRA tests (#10494 ) * Fix tokenizers install from main in LoRA tests * @ * rust * -e * uv * just update tokenizers	2025-01-08 16:14:25 +00:00
Dhruv Nair	9731773d39	[CI] Torch Min Version Test Fix (#10491 ) update	2025-01-08 19:43:38 +05:30
Marc Sun	e2deb82e69	Fix compatibility with pipeline when loading model with device_map on single gpu (#10390 ) * fix device issue in single gpu case * Update src/diffusers/pipelines/pipeline_utils.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-08 11:35:00 +01:00
hlky	1288c8560a	Update tokenizers in `pr_test_peft_backend` (#10132 ) Update tokenizers	2025-01-08 10:09:32 +00:00
AstraliteHeart	cb342b745a	Add AuraFlow GGUF support (#10463 ) * Add support for loading AuraFlow models from GGUF https://huggingface.co/city96/AuraFlow-v0.3-gguf * Update AuraFlow documentation for GGUF, add GGUF tests and model detection. * Address code review comments. * Remove unused config. --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-08 13:23:12 +05:30
Junsong Chen	80fd9260bb	[Sana][bug fix]change clean_caption from True to False. (#10481 ) change clean_caption from True to False.	2025-01-07 15:31:23 -10:00
Aryan	71ad16b463	Add `_no_split_modules` to some models (#10308 ) * set supports gradient checkpointing to true where necessary; add missing no split modules * fix cogvideox tests * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-01-08 06:34:19 +05:30
hlky	ee7e141d80	Use pipelines without vae (#10441 ) * Use pipelines without vae * getattr * vqvae --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 13:26:51 -10:00
hlky	01bd79649e	Fix HunyuanVideo produces NaN on PyTorch<2.5 (#10482 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 13:13:55 -10:00
Teriks	03bcf5aefe	RFInversionFluxPipeline, small fix for enable_model_cpu_offload & enable_sequential_cpu_offload compatibility (#10480 ) RFInversionFluxPipeline.encode_image, device fix Use self._execution_device instead of self.device when selecting a device for the input image tensor. This allows for compatibility with enable_model_cpu_offload & enable_sequential_cpu_offload Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-07 15:47:28 +01:00
dependabot[bot]	e0b96ba7b0	Bump jinja2 from 3.1.4 to 3.1.5 in /examples/research_projects/realfill (#10377 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.4 to 3.1.5. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/3.1.4...3.1.5) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-01-07 19:59:41 +05:30
Dhruv Nair	854a04659c	[CI] Add minimal testing for legacy Torch versions (#10479 ) * update * update	2025-01-07 18:51:41 +05:30
hlky	628f2c544a	Use Pipelines without scheduler (#10439 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:07:08 +00:00
Aryan	811560b1d7	[LoRA] Support original format loras for HunyuanVideo (#10376 ) * update * fix make copies * update * add relevant markers to the integration test suite. * add copied. * fox-copies * temporarily add print. * directly place on CUDA as CPU isn't that big on the CIO. * fixes to fuse_lora, aryan was right. * fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 13:18:57 +05:30
Rahul Raman	f1e0c7ce4a	Refactor instructpix2pix lora to support peft (#10205 ) * make base code changes referred from train_instructpix2pix script in examples * change code to use PEFT as discussed in issue 10062 * update README training command * update README training command * refactor variable name and freezing unet * Update examples/research_projects/instructpix2pix_lora/train_instruct_pix2pix_lora.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update README installation instructions. * cleanup code using make style and quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:00:45 +05:30
Sayak Paul	b94cfd7937	[Training] QoL improvements in the Flux Control training scripts (#10461 ) * qol improvements to the Flux script. * propagate the dataloader changes.	2025-01-07 11:56:17 +05:30
Aryan	661bde0ff2	Fix style (#10478 ) fix	2025-01-07 11:06:36 +05:30
Ameer Azam	4f5e3e35d2	Regarding the RunwayML path for V1.5 did change to stable-diffusion-v1-5/[stable-diffusion-v1-5/ stable-diffusion-inpainting] (#10476 ) * Update pipeline_controlnet.py * Update pipeline_controlnet_img2img.py runwayml Take-down so change all from to this stable-diffusion-v1-5/stable-diffusion-v1-5 * Update pipeline_controlnet_inpaint.py * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * Update convert_blipdiffusion_to_diffusers.py style change	2025-01-06 15:01:52 -08:00

5008 Commits