diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Thanh Le	5d2d23986e	Fix inconsistent random transform in instruct pix2pix (#10698 ) * Update train_instruct_pix2pix.py Fix inconsistent random transform in instruct_pix2pix * Update train_instruct_pix2pix_sdxl.py	2025-01-31 08:29:29 -10:00
Dimitri Barbot	196aef5a6f	Fix pipeline dtype unexpected change when using SDXL reference community pipelines in float16 mode (#10670 ) Fix pipeline dtype unexpected change when using SDXL reference community pipelines	2025-01-28 10:46:41 -03:00
Aryan	c4d4ac21e7	Refactor gradient checkpointing (#10611 ) * update * remove unused fn * apply suggestions based on review * update + cleanup 🧹 * more cleanup 🧹 * make fix-copies * update test	2025-01-28 06:51:46 +05:30
hlky	41571773d9	[training] Convert to ImageFolder script (#10664 ) * [training] Convert to ImageFolder script * make	2025-01-27 09:43:51 -10:00
Marlon May	f7f36c7d3d	Add community pipeline for semantic guidance for FLUX (#10610 ) * add community pipeline for semantic guidance for flux * fix imports in community pipeline for semantic guidance for flux * Update examples/community/pipeline_flux_semantic_guidance.py Co-authored-by: hlky <hlky@hlky.ac> * fix community pipeline for semantic guidance for flux --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-01-27 16:19:46 +02:00
Yuqian Hong	4fa24591a3	create a script to train autoencoderkl (#10605 ) * create a script to train vae * update main.py * update train_autoencoderkl.py * update train_autoencoderkl.py * add a check of --pretrained_model_name_or_path and --model_config_name_or_path * remove the comment, remove diffusers in requiremnets.txt, add validation_image ote * update autoencoderkl.py * quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-27 16:41:34 +05:30
Leo Jiang	07860f9916	NPU Adaption for Sanna (#10409 ) * NPU Adaption for Sanna --------- Co-authored-by: J石页 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-24 09:08:52 -10:00
Wenhao Sun	87252d80c3	Add pipeline_stable_diffusion_xl_attentive_eraser (#10579 ) * add pipeline_stable_diffusion_xl_attentive_eraser * add pipeline_stable_diffusion_xl_attentive_eraser_make_style * make style and add example output * update Docs Co-authored-by: Other Contributor <a457435687@126.com> * add Oral Co-authored-by: Other Contributor <a457435687@126.com> * update_review Co-authored-by: Other Contributor <a457435687@126.com> * update_review_ms Co-authored-by: Other Contributor <a457435687@126.com> --------- Co-authored-by: Other Contributor <a457435687@126.com>	2025-01-24 13:52:45 +00:00
Yaniv Galron	a451c0ed14	removing redundant requires_grad = False (#10628 ) We already set the unet to requires grad false at line 506 Co-authored-by: Aryan <aryan@huggingface.co>	2025-01-24 03:25:33 +05:30
Muyang Li	158a5a87fb	Remove the FP32 Wrapper when evaluating (#10617 ) Remove the FP32 Wrapper Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-21 16:16:54 +05:30
jiqing-feng	012d08b1bc	Enable dreambooth lora finetune example on other devices (#10602 ) * enable dreambooth_lora on other devices Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * enable xpu Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * check cuda device before empty cache Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix comment Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * import free_memory Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2025-01-21 14:09:45 +05:30
Sayak Paul	4ace7d0483	[chore] change licensing to 2025 from 2024. (#10615 ) change licensing to 2025 from 2024.	2025-01-20 16:57:27 -10:00
baymax591	75a636da48	bugfix for npu not support float64 (#10123 ) * bugfix for npu not support float64 * is_mps is_npu --------- Co-authored-by: 白超 <baichao19@huawei.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-01-20 09:35:24 -10:00
Sayak Paul	328e0d20a7	[training] set rest of the blocks with `requires_grad` False. (#10607 ) set rest of the blocks with requires_grad False.	2025-01-19 19:34:53 +05:30
Juan Acevedo	aeac0a00f8	implementing flux on TPUs with ptxla (#10515 ) * implementing flux on TPUs with ptxla * add xla flux attention class * run make style/quality * Update src/diffusers/models/attention_processor.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/attention_processor.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * run style and quality --------- Co-authored-by: Juan Acevedo <jfacevedo@google.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-16 08:46:02 -10:00
Leo Jiang	b0c8973834	[Sana 4K] Add vae tiling option to avoid OOM (#10583 ) Co-authored-by: J石页 <jiangshuo9@h-partners.com>	2025-01-16 02:06:07 +05:30
hlky	980736b792	Fix train_dreambooth_lora_sd3_miniature (#10554 )	2025-01-13 13:47:27 +00:00
chaowenguo	d6c030fd37	add the xm.mark_step for the first denosing loop (#10530 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> * Update rerender_a_video.py * make style --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-10 21:03:41 +00:00
hlky	12fbe3f7dc	Use Pipelines without unet (#10440 ) * Use Pipelines without unet * unet.config.in_channels * default_sample_size * is_unet_version_less_0_9_0 --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 04:45:42 +00:00
Linoy Tsaban	83ba01a38d	small readme changes for advanced training examples (#10473 ) add to readme about hf login and wandb installation to address https://github.com/huggingface/diffusers/issues/10142#issuecomment-2571655570 Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 07:35:19 +05:30
chaowenguo	7bc8b92384	add callable object to convert frame into control_frame to reduce cpu memory usage. (#10501 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-09 11:25:53 -10:00
Vladimir Mandic	f0c6d9784b	flux: make scheduler config params optional (#10384 ) * dont assume scheduler has optional config params * make style, make fix-copies * calculate_shift * fix-copies, usage in pipelines --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-09 10:44:26 -10:00
Bagheera	a0acbdc989	fix for #7365 , prevent pipelines from overriding provided prompt embeds (#7926 ) * fix for #7365, prevent pipelines from overriding provided prompt embeds * fix-copies * fix implementation * update --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2025-01-08 10:12:12 -10:00
Parag Ekbote	5655b22ead	Notebooks for Community Scripts-5 (#10499 ) Add 5 Notebooks for Diffusers Community Pipelines.	2025-01-08 08:56:17 -08:00
hlky	ee7e141d80	Use pipelines without vae (#10441 ) * Use pipelines without vae * getattr * vqvae --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 13:26:51 -10:00
Teriks	03bcf5aefe	RFInversionFluxPipeline, small fix for enable_model_cpu_offload & enable_sequential_cpu_offload compatibility (#10480 ) RFInversionFluxPipeline.encode_image, device fix Use self._execution_device instead of self.device when selecting a device for the input image tensor. This allows for compatibility with enable_model_cpu_offload & enable_sequential_cpu_offload Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-07 15:47:28 +01:00
dependabot[bot]	e0b96ba7b0	Bump jinja2 from 3.1.4 to 3.1.5 in /examples/research_projects/realfill (#10377 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.4 to 3.1.5. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/3.1.4...3.1.5) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-01-07 19:59:41 +05:30
hlky	628f2c544a	Use Pipelines without scheduler (#10439 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:07:08 +00:00
Rahul Raman	f1e0c7ce4a	Refactor instructpix2pix lora to support peft (#10205 ) * make base code changes referred from train_instructpix2pix script in examples * change code to use PEFT as discussed in issue 10062 * update README training command * update README training command * refactor variable name and freezing unet * Update examples/research_projects/instructpix2pix_lora/train_instruct_pix2pix_lora.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update README installation instructions. * cleanup code using make style and quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:00:45 +05:30
Sayak Paul	b94cfd7937	[Training] QoL improvements in the Flux Control training scripts (#10461 ) * qol improvements to the Flux script. * propagate the dataloader changes.	2025-01-07 11:56:17 +05:30
Ameer Azam	4f5e3e35d2	Regarding the RunwayML path for V1.5 did change to stable-diffusion-v1-5/[stable-diffusion-v1-5/ stable-diffusion-inpainting] (#10476 ) * Update pipeline_controlnet.py * Update pipeline_controlnet_img2img.py runwayml Take-down so change all from to this stable-diffusion-v1-5/stable-diffusion-v1-5 * Update pipeline_controlnet_inpaint.py * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * Update convert_blipdiffusion_to_diffusers.py style change	2025-01-06 15:01:52 -08:00
chaowenguo	4e44534845	Update rerender_a_video.py fix dtype error (#10451 ) Update rerender_a_video.py	2025-01-04 14:52:50 +00:00
chaowenguo	a17832b2d9	add pythor_xla support for render a video (#10443 ) * Update rerender_a_video.py * Update rerender_a_video.py * make style --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-03 16:00:02 +00:00
Doug J	f7822ae4bf	Update train_text_to_image_sdxl.py (#8830 ) Enable VAE hash to be able to change with args change. If not, train_dataset_with_embeddiings may have row number inconsistency with train_dataset_with_vae. Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-02 10:41:18 -10:00
Dev Rajput	4b9f1c7d8c	Add correct number of channels when resuming from checkpoint for Flux Control LoRa training (#10422 ) * Add correct number of channels when resuming from checkpoint * Fix Formatting	2025-01-02 15:51:44 +05:30
Sayak Paul	5f72473543	[training] add ds support to lora sd3. (#10378 ) * add ds support to lora sd3. Co-authored-by: leisuzz <jiangshuonb@gmail.com> * style. --------- Co-authored-by: leisuzz <jiangshuonb@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-12-30 19:31:05 +05:30
Sayak Paul	825979ddc3	[training] fix: registration of out_channels in the control flux scripts. (#10367 ) * fix: registration of out_channels in the control flux scripts. * free memory.	2024-12-24 21:44:44 +05:30
Sayak Paul	92933ec36a	[chore] post release 0.32.0 (#10361 ) * post release 0.32.0 * stylew	2024-12-23 10:03:34 -10:00
Sayak Paul	76e2727b5c	[SANA LoRA] sana lora training tests and misc. (#10296 ) * sana lora training tests and misc. * remove push to hub * Update examples/dreambooth/train_dreambooth_lora_sana.py Co-authored-by: Aryan <aryan@huggingface.co> --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-23 12:35:13 +05:30
Sayak Paul	9c0e20de61	[chore] Update README_sana.md to update the default model (#10285 ) Update README_sana.md to update the default model	2024-12-19 10:24:57 +05:30
Sayak Paul	63cdf9c0ba	[chore] fix: reamde -> readme (#10276 ) fix: reamde -> readme	2024-12-18 10:56:08 +05:30
hlky	0ac52d6f09	Use `torch` in `get_2d_rotary_pos_embed` (#10155 ) * Use `torch` in `get_2d_rotary_pos_embed` * Add deprecation	2024-12-17 18:26:52 -10:00
Sayak Paul	9408aa2dfc	[LoRA] feat: lora support for SANA. (#10234 ) * feat: lora support for SANA. * make fix-copies * rename test class. * attention_kwargs -> cross_attention_kwargs. * Revert "attention_kwargs -> cross_attention_kwargs." This reverts commit `23433bf9bc`. * exhaust 119 max line limit * sana lora fine-tuning script. * readme * add a note about the supported models. * Apply suggestions from code review Co-authored-by: Aryan <aryan@huggingface.co> * style * docs for attention_kwargs. * remove lora_scale from pag pipeline. * copy fix --------- Co-authored-by: Aryan <aryan@huggingface.co>	2024-12-18 08:22:31 +05:30
cjkangme	9c68c945e9	[Community Pipeline] Fix typo that cause error on regional prompting pipeline (#10251 ) fix: fix typo that cause error	2024-12-17 21:09:50 +00:00
Junjie	96a9097445	Add offload option in flux-control training (#10225 ) * Add offload option in flux-control training * Update examples/flux-control/train_control_flux.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * modify help message * fix format --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-12-15 20:49:17 +05:30
Linoy Tsaban	cef0e3677e	[RF inversion community pipeline] add eta_decay (#10199 ) * add decay * add decay * style	2024-12-13 11:04:26 +02:00
hlky	f2d348d904	Remove `negative_` from SDXL callback (#10203 ) Remove `negative_` from SDXL callback Change example and add XL version	2024-12-12 20:58:50 +00:00
Sayak Paul	8170dc368d	[WIP][Training] Flux Control LoRA training script (#10130 ) * update * add * update * add control-lora conversion script; make flux loader handle norms; fix rank calculation assumption * control lora updates * remove copied-from * create separate pipelines for flux control * make fix-copies * update docs * add tests * fix * Apply suggestions from code review Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * remove control lora changes * apply suggestions from review * Revert "remove control lora changes" This reverts commit `73cfc519c9`. * update * update * improve log messages * updates. * updates * support register_config. * fix * fix * fix * updates * updates * updates * fix-copies * fix * apply suggestions from review * add tests * remove conversion script; enable on-the-fly conversion * bias -> lora_bias. * fix-copies * peft.py * fix lora conversion * changes Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com> * fix-copies * updates for tests * fix * alpha_pattern. * add a test for varied lora ranks and alphas. * revert changes in num_channels_latents = self.transformer.config.in_channels // 8 * revert moe * add a sanity check on unexpected keys when loading norm layers. * contro lora. * fixes * fixes * fixes * tests * reviewer feedback * fix * proper peft version for lora_bias * fix-copies * updates * updates * updates * remove debug code * update docs * integration tests * nis * fuse and unload. * fix * add slices. * more updates. * button up readme * train() * add full fine-tuning version. * fixes * Apply suggestions from code review Co-authored-by: Aryan <aryan@huggingface.co> * set_grads_to_none remove. * readme --------- Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: yiyixuxu <yixu310@gmail.com> Co-authored-by: a-r-r-o-w <contact.aryanvs@gmail.com>	2024-12-12 15:34:57 +05:30
Ethan Smith	26e80e0143	fix min-snr implementation (#8466 ) * fix min-snr implementation https://github.com/kohya-ss/sd-scripts/blob/main/library/custom_train_functions.py#L66 * Update train_dreambooth.py fix variable name mse_loss_weights * fix divisor * make style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2024-12-12 09:55:59 +05:30
Linoy Tsaban	43534a8d1f	[community pipeline rf-inversion] - fix example in doc (#10179 ) * fix example in doc * remove redundancies * change param	2024-12-11 00:30:05 +02:00

... 2 3 4 5 6 ...

1269 Commits