diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Inigo Goiri	841504bb1a	Add support to pass image embeddings to the WAN I2V pipeline. (#11175 ) * Add support to pass image embeddings to the pipeline. --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-04-07 15:47:06 -10:00
Steven Liu	fc7a867ae5	[docs] MPS update (#11212 ) mps	2025-04-07 14:32:27 -10:00
alex choi	5ded26cdc7	ensure dtype match between diffused latents and vae weights (#8391 )	2025-04-07 12:59:10 -10:00
Yao Matrix	506f39af3a	enable 1 case on XPU (#11219 ) enable case on XPU: 1. tests/quantization/bnb/test_mixed_int8.py::BnB8bitTrainingTests::test_training Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-04-07 08:24:21 +01:00
Mikko Tukiainen	8ad68c1393	Add missing MochiEncoder3D.gradient_checkpointing attribute (#11146 ) * Add missing 'gradient_checkpointing = False' attr * Add (limited) tests for Mochi autoencoder * Apply style fixes * pass 'conv_cache' as arg instead of kwarg --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-06 02:46:45 +05:30
Edna	41afb6690c	Add Wan with STG as a community pipeline (#11184 ) * Add stg wan to community pipelines * remove debug prints * remove unused comment * Update doc * Add credit + fix typo * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-05 04:00:40 +02:00
Tolga Cangöz	13e48492f0	[LTX0.9.5] Refactor `LTXConditionPipeline` for text-only conditioning (#11174 ) * Refactor `LTXConditionPipeline` to add text-only conditioning * style * up * Refactor `LTXConditionPipeline` to streamline condition handling and improve clarity * Improve condition checks * Simplify latents handling based on conditioning type * Refactor rope_interpolation_scale preparation for clarity and efficiency * Update LTXConditionPipeline docstring to clarify supported input types * Add LTX Video 0.9.5 model to documentation * Clarify documentation to indicate support for text-only conditioning without passing `conditions` * refactor: comment out unused parameters in LTXConditionPipeline * fix: restore previously commented parameters in LTXConditionPipeline * fix: remove unused parameters from LTXConditionPipeline * refactor: remove unnecessary lines in LTXConditionPipeline	2025-04-04 16:43:15 +02:00
Suprhimp	94f2c48d58	[feat]Add strength in flux_fill pipeline (denoising strength for fluxfill) (#10603 ) * [feat]add strength in flux_fill pipeline * Update src/diffusers/pipelines/flux/pipeline_flux_fill.py * Update src/diffusers/pipelines/flux/pipeline_flux_fill.py * Update src/diffusers/pipelines/flux/pipeline_flux_fill.py * [refactor] refactor after review * [fix] change comment * Apply style fixes * empty * fix * update prepare_latents from flux.img2img pipeline * style * Update src/diffusers/pipelines/flux/pipeline_flux_fill.py ---------	2025-04-04 11:23:30 -03:00
Dhruv Nair	aabf8ce20b	Fix Single File loading for LTX VAE (#11200 ) update	2025-04-04 18:02:39 +05:30
Kenneth Gerald Hamilton	f10775b1b5	Fixed requests.get function call by adding timeout parameter. (#11156 ) * Fixed requests.get function call by adding timeout parameter. * declare DIFFUSERS_REQUEST_TIMEOUT in constants and import when needed * remove unneeded os import * Apply style fixes --------- Co-authored-by: Sai-Suraj-27 <sai.suraj.27.729@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-04-04 07:23:14 +01:00
célina	6edb774b5e	Update Style Bot workflow (#11202 ) update style bot workflow	2025-04-03 19:31:49 +02:00
Basile Lewandowski	480510ada9	Change KolorsPipeline LoRA Loader to StableDiffusion (#11198 ) Change LoRA Loader to StableDiffusion Replace the SDXL LoRA Loader Mixin inheritance with the StableDiffusion one	2025-04-03 11:21:11 -03:00
Abhipsha Das	d9023a671a	[Model Card] standardize advanced diffusion training sdxl lora (#7615 ) * model card gen code * push modelcard creation * remove optional from params * add import * add use_dora check * correct lora var use in tags * make style && make quality --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-03 07:43:01 +05:30
Eliseu Silva	c4646a3931	feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline (#11188 ) * feat: [Community Pipeline] - FaithDiff Stable Diffusion XL Pipeline for Image SR. * added pipeline	2025-04-02 11:33:19 -10:00
Dhruv Nair	c97b709afa	Add CacheMixin to Wan and LTX Transformers (#11187 ) * update * update * update	2025-04-02 10:16:31 -10:00
lakshay sharma	b0ff822ed3	Update import_utils.py (#10329 ) added onnxruntime-vitisai for custom build onnxruntime pkg	2025-04-02 20:47:10 +01:00
hlky	78c2fdc52e	SchedulerMixin from_pretrained and ConfigMixin Self type annotation (#11192 )	2025-04-02 08:24:02 -10:00
hlky	54dac3a87c	Fix enable_sequential_cpu_offload in CogView4Pipeline (#11195 ) * Fix enable_sequential_cpu_offload in CogView4Pipeline * make fix-copies	2025-04-02 16:51:23 +01:00
hlky	e5c6027ef8	[docs] `torch_dtype` map (#11194 )	2025-04-02 12:46:28 +01:00
hlky	da857bebb6	Revert `save_model` in ModelMixin save_pretrained and use safe_serialization=False in test (#11196 )	2025-04-02 12:45:36 +01:00
Fanli Lin	52b460feb9	[tests] HunyuanDiTControlNetPipeline inference precision issue on XPU (#11197 ) * add xpu part * fix more cases * remove some cases * no canny * format fix	2025-04-02 12:45:02 +01:00
hlky	d8c617ccb0	allow models to run with a user-provided dtype map instead of a single dtype (#10301 ) * allow models to run with a user-provided dtype map instead of a single dtype * make style * Add warning, change `_` to `default` * make style * add test * handle shared tensors * remove warning --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-04-02 09:05:46 +01:00
Bruno Magalhaes	fe2b397426	remove unnecessary call to `F.pad` (#10620 ) * rewrite memory count without implicitly using dimensions by @ic-synth * replace F.pad by built-in padding in Conv3D * in-place sums to reduce memory allocations * fixed trailing whitespace * file reformatted * in-place sums * simpler in-place expressions * removed in-place sum, may affect backward propagation logic * removed in-place sum, may affect backward propagation logic * removed in-place sum, may affect backward propagation logic * reverted change	2025-04-02 08:19:51 +01:00
Eliseu Silva	be0b7f55cc	fix: for checking mandatory and optional pipeline components (#11189 ) fix: optional componentes verification on load	2025-04-02 08:07:24 +01:00
jiqing-feng	4d5a96e40a	fix autocast (#11190 ) Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2025-04-02 07:26:27 +01:00
Yao Matrix	a7f07c1ef5	map BACKEND_RESET_MAX_MEMORY_ALLOCATED to reset_peak_memory_stats on XPU (#11191 ) Signed-off-by: YAO Matrix <matrix.yao@intel.com>	2025-04-02 07:25:48 +01:00
Dhruv Nair	df1d7b01f1	[WIP] Add Wan Video2Video (#11053 ) * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update * update	2025-04-01 17:22:11 +05:30
Fanli Lin	5a6edac087	[tests] no hard-coded cuda (#11186 ) no cuda only	2025-04-01 12:14:31 +01:00
kakukakujirori	e8fc8b1f81	Bug fix in LTXImageToVideoPipeline.prepare_latents() when latents is already set (#10918 ) * Bug fix in ltx * Assume packed latents. --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-03-31 12:15:43 -10:00
hlky	d6f4774c1c	Add `latents_mean` and `latents_std` to `SDXLLongPromptWeightingPipeline` (#11034 )	2025-03-31 11:32:29 -10:00
Mark	eb50defff2	[Docs] Fix environment variables in `installation.md` (#11179 )	2025-03-31 09:15:25 -07:00
Aryan	2c59af7222	Raise warning and round down if Wan num_frames is not 4k + 1 (#11167 ) * update * raise warning and round to nearest multiple of scale factor	2025-03-31 13:33:28 +05:30
hlky	75d7e5cc45	Fix LatteTransformer3DModel dtype mismatch with enable_temporal_attentions (#11139 )	2025-03-29 15:52:56 +01:00
Dhruv Nair	617c208bb4	[Docs] Update Wan Docs with memory optimizations (#11089 ) * update * update	2025-03-28 19:05:56 +05:30
hlky	5d970a4aa9	WanI2V encode_image (#11164 ) * WanI2V encode_image	2025-03-28 18:05:34 +05:30
kentdan3msu	de6a88c2d7	Set self._hf_peft_config_loaded to True when LoRA is loaded using `load_lora_adapter` in PeftAdapterMixin class (#11155 ) set self._hf_peft_config_loaded to True on successful lora load Sets the `_hf_peft_config_loaded` flag if a LoRA is successfully loaded in `load_lora_adapter`. Fixes bug huggingface/diffusers/issues/11148 Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-26 18:31:18 +01:00
Dhruv Nair	7dc52ea769	[Quantization] dtype fix for GGUF + fix BnB tests (#11159 ) * update * update * update * update	2025-03-26 22:22:16 +05:30
Junsong Chen	739d6ec731	add a timestep scale for sana-sprint teacher model (#11150 )	2025-03-25 08:47:39 -10:00
Aryan	1ddf3f3a19	Improve information about group offloading and layerwise casting (#11101 ) * update * Update docs/source/en/optimization/memory.md * Apply suggestions from code review Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com> * apply review suggestions * update --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2025-03-24 23:25:59 +05:30
Jun Yeop Na	7aac77affa	[doc] Fix Korean Controlnet Train doc (#11141 ) * remove typo from korean controlnet train doc * removed more paragraphs to remain in sync with the english document	2025-03-24 09:38:21 -07:00
Aryan	8907a70a36	New HunyuanVideo-I2V (#11066 ) * update * update * update * add tests * update docs * raise value error * warning for true cfg and guidance scale * fix test	2025-03-24 21:18:40 +05:30
Junsong Chen	5dbe4f5de6	[fix SANA-Sprint] (#11142 ) * fix bug in sana conversion script; * add more model paths; --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-23 23:38:14 -10:00
Yuxuan Zhang	1d37f42055	Modify the implementation of retrieve_timesteps in CogView4-Control. (#11125 ) * 1 * change to channel 1 * cogview4 control training * add CacheMixin * 1 * remove initial_input_channels change for val * 1 * update * use 3.5 * new loss * 1 * use imagetoken * for megatron convert * 1 * train con and uc * 2 * remove guidance_scale * Update pipeline_cogview4_control.py * fix * use cogview4 pipeline with timestep * update shift_factor * remove the uncond * add max length * change convert and use GLMModel instead of GLMForCasualLM * fix * [cogview4] Add attention mask support to transformer model * [fix] Add attention mask for padded token * update * remove padding type * Update train_control_cogview4.py * resolve conflicts with #10981 * add control convert * use control format * fix * add missing import * update with cogview4 formate * make style * Update pipeline_cogview4_control.py * Update pipeline_cogview4_control.py * remove * Update pipeline_cogview4_control.py * put back * Apply style fixes --------- Co-authored-by: OleehyO <leehy0357@gmail.com> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-23 21:17:14 +05:30
Tolga Cangöz	0213179ba8	Update README and example code for AnyText usage (#11028 ) * [Documentation] Update README and example code with additional usage instructions for AnyText * [Documentation] Update README for AnyTextPipeline and improve logging in code * Remove wget command for font file from example docstring in anytext.py	2025-03-23 21:15:57 +05:30
hlky	a7d53a5939	Don't override `torch_dtype` and don't use when `quantization_config` is set (#11039 ) * Don't use `torch_dtype` when `quantization_config` is set * up * djkajka * Apply suggestions from code review --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-03-21 21:58:38 +05:30
YiYi Xu	8a63aa5e4f	add sana-sprint (#11074 ) * add sana-sprint --------- Co-authored-by: Junsong Chen <cjs1020440147@icloud.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: Aryan <aryan@huggingface.co>	2025-03-21 06:21:18 -10:00
Aryan	844221ae4e	[core] FasterCache (#10163 ) * init * update * update * update * make style * update * fix * make it work with guidance distilled models * update * make fix-copies * add tests * update * apply_faster_cache -> apply_fastercache * fix * reorder * update * refactor * update docs * add fastercache to CacheMixin * update tests * Apply suggestions from code review * make style * try to fix partial import error * Apply style fixes * raise warning * update --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-21 09:35:04 +05:30
CyberVy	9b2c0a7dbe	fix _callback_tensor_inputs of sd controlnet inpaint pipeline missing some elements (#11073 ) * Update pipeline_controlnet_inpaint.py * Apply style fixes	2025-03-20 23:56:12 -03:00
Parag Ekbote	f424b1b062	Notebooks for Community Scripts-8 (#11128 ) Add 4 Notebooks and update the missing links for the example README.	2025-03-20 12:24:46 -07:00
YiYi Xu	e9fda3924f	remove F.rms_norm for now (#11126 ) up	2025-03-20 07:55:01 -10:00

... 3 4 5 6 7 ...

5507 Commits