diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-27 17:22:53 +03:00

Author	SHA1	Message	Date
Eliseu Silva	66bf7ea5be	feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL (#10951 ) * feat: add Mixture-of-Diffusers ControlNet Tile upscaler Pipeline for SDXL * make style make quality	2025-03-04 17:17:36 -03:00
Alexey Zolotenkov	b8215b1c06	Fix incorrect seed initialization when args.seed is 0 (#10964 ) * Fix seed initialization to handle args.seed = 0 correctly * Apply style fixes --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-03-04 10:09:52 -10:00
SahilCarterr	170833c22a	[Fix] fp16 unscaling in train_dreambooth_lora_sdxl (#10889 ) Fix fp16 bug Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-02-24 06:49:23 -10:00
hlky	6f74ef550d	Fix `torch_dtype` in Kolors text encoder with `transformers` v4.49 (#10816 ) * Fix `torch_dtype` in Kolors text encoder with `transformers` v4.49 * Default torch_dtype and warning	2025-02-24 13:37:54 +05:30
Parag Ekbote	51941387dc	Notebooks for Community Scripts-7 (#10846 ) Add 5 Notebooks, improve their example scripts and update the missing links for the example README.	2025-02-20 09:02:09 -08:00
Sayak Paul	f10d3c6d04	[LoRA] add LoRA support to Lumina2 and fine-tuning script (#10818 ) * feat: lora support for Lumina2. * fix-copies. * updates * updates * docs. * fix * add: training script. * tests * updates * updates * major updates. * updates * fixes * docs. * updates * updates	2025-02-20 09:41:51 +05:30
puhuk	b75b204a58	Fix max_shift value in flux and related functions to 1.15 (issue #10675 ) (#10807 ) This PR updates the max_shift value in flux to 1.15 for consistency across the codebase. In addition to modifying max_shift in flux, all related functions that copy and use this logic, such as calculate_shift in `src/diffusers/pipelines/stable_diffusion_3/pipeline_stable_diffusion_3_img2img.py`, have also been updated to ensure uniform behavior.	2025-02-18 06:54:56 +00:00
Yaniv Galron	952b9131a2	typo fix (#10802 )	2025-02-16 20:56:54 +05:30
Eliseu Silva	051ebc3c8d	fix: [Community pipeline] Fix flattened elements on image (#10774 ) * feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-diffusers support * fix use of variable latents to tile_latents * removed references to modules that are not being used in this pipeline * make style, make quality * fixfeat: added _get_crops_coords_list function to pipeline to automatically define ctop,cleft coord to focus on image generation, helps to better harmonize the image and corrects the problem of flattened elements.	2025-02-12 19:50:41 -03:00
Eliseu Silva	c470274865	feat: new community mixture_tiling_sdxl pipeline for SDXL (#10759 ) * feat: new community mixture_tiling_sdxl pipeline for SDXL mixture-of-diffusers support * fix use of variable latents to tile_latents * removed references to modules that are not being used in this pipeline * make style, make quality	2025-02-11 18:01:42 -03:00
Leo Jiang	cd0a4a82cf	[bugfix] NPU Adaption for Sana (#10724 ) * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * NPU Adaption for Sanna * [bugfix]NPU Adaption for Sanna --------- Co-authored-by: J石页 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-02-06 19:29:58 +05:30
suzukimain	145522cbb7	[Community] Enhanced `Model Search` (#10417 ) * Added `auto_load_textual_inversion` and `auto_load_lora_weights` * update README.md * fix * make quality * Fix and `make style`	2025-02-05 14:43:53 -10:00
Parag Ekbote	dbe0094e86	Notebooks for Community Scripts-6 (#10713 ) * Fix Doc Tutorial. * Add 4 Notebooks and improve their example scripts.	2025-02-04 10:12:17 -08:00
Nicolas	f63d32233f	Fix train_text_to_image.py --help (#10711 )	2025-02-04 11:26:23 +05:30
Thanh Le	5d2d23986e	Fix inconsistent random transform in instruct pix2pix (#10698 ) * Update train_instruct_pix2pix.py Fix inconsistent random transform in instruct_pix2pix * Update train_instruct_pix2pix_sdxl.py	2025-01-31 08:29:29 -10:00
Dimitri Barbot	196aef5a6f	Fix pipeline dtype unexpected change when using SDXL reference community pipelines in float16 mode (#10670 ) Fix pipeline dtype unexpected change when using SDXL reference community pipelines	2025-01-28 10:46:41 -03:00
Aryan	c4d4ac21e7	Refactor gradient checkpointing (#10611 ) * update * remove unused fn * apply suggestions based on review * update + cleanup 🧹 * more cleanup 🧹 * make fix-copies * update test	2025-01-28 06:51:46 +05:30
hlky	41571773d9	[training] Convert to ImageFolder script (#10664 ) * [training] Convert to ImageFolder script * make	2025-01-27 09:43:51 -10:00
Marlon May	f7f36c7d3d	Add community pipeline for semantic guidance for FLUX (#10610 ) * add community pipeline for semantic guidance for flux * fix imports in community pipeline for semantic guidance for flux * Update examples/community/pipeline_flux_semantic_guidance.py Co-authored-by: hlky <hlky@hlky.ac> * fix community pipeline for semantic guidance for flux --------- Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-01-27 16:19:46 +02:00
Yuqian Hong	4fa24591a3	create a script to train autoencoderkl (#10605 ) * create a script to train vae * update main.py * update train_autoencoderkl.py * update train_autoencoderkl.py * add a check of --pretrained_model_name_or_path and --model_config_name_or_path * remove the comment, remove diffusers in requiremnets.txt, add validation_image ote * update autoencoderkl.py * quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-27 16:41:34 +05:30
Leo Jiang	07860f9916	NPU Adaption for Sanna (#10409 ) * NPU Adaption for Sanna --------- Co-authored-by: J石页 <jiangshuo9@h-partners.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-24 09:08:52 -10:00
Wenhao Sun	87252d80c3	Add pipeline_stable_diffusion_xl_attentive_eraser (#10579 ) * add pipeline_stable_diffusion_xl_attentive_eraser * add pipeline_stable_diffusion_xl_attentive_eraser_make_style * make style and add example output * update Docs Co-authored-by: Other Contributor <a457435687@126.com> * add Oral Co-authored-by: Other Contributor <a457435687@126.com> * update_review Co-authored-by: Other Contributor <a457435687@126.com> * update_review_ms Co-authored-by: Other Contributor <a457435687@126.com> --------- Co-authored-by: Other Contributor <a457435687@126.com>	2025-01-24 13:52:45 +00:00
Yaniv Galron	a451c0ed14	removing redundant requires_grad = False (#10628 ) We already set the unet to requires grad false at line 506 Co-authored-by: Aryan <aryan@huggingface.co>	2025-01-24 03:25:33 +05:30
Muyang Li	158a5a87fb	Remove the FP32 Wrapper when evaluating (#10617 ) Remove the FP32 Wrapper Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-21 16:16:54 +05:30
jiqing-feng	012d08b1bc	Enable dreambooth lora finetune example on other devices (#10602 ) * enable dreambooth_lora on other devices Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * enable xpu Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * check cuda device before empty cache Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * fix comment Signed-off-by: jiqing-feng <jiqing.feng@intel.com> * import free_memory Signed-off-by: jiqing-feng <jiqing.feng@intel.com> --------- Signed-off-by: jiqing-feng <jiqing.feng@intel.com>	2025-01-21 14:09:45 +05:30
Sayak Paul	4ace7d0483	[chore] change licensing to 2025 from 2024. (#10615 ) change licensing to 2025 from 2024.	2025-01-20 16:57:27 -10:00
baymax591	75a636da48	bugfix for npu not support float64 (#10123 ) * bugfix for npu not support float64 * is_mps is_npu --------- Co-authored-by: 白超 <baichao19@huawei.com> Co-authored-by: hlky <hlky@hlky.ac>	2025-01-20 09:35:24 -10:00
Sayak Paul	328e0d20a7	[training] set rest of the blocks with `requires_grad` False. (#10607 ) set rest of the blocks with requires_grad False.	2025-01-19 19:34:53 +05:30
Juan Acevedo	aeac0a00f8	implementing flux on TPUs with ptxla (#10515 ) * implementing flux on TPUs with ptxla * add xla flux attention class * run make style/quality * Update src/diffusers/models/attention_processor.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * Update src/diffusers/models/attention_processor.py Co-authored-by: YiYi Xu <yixu310@gmail.com> * run style and quality --------- Co-authored-by: Juan Acevedo <jfacevedo@google.com> Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-16 08:46:02 -10:00
Leo Jiang	b0c8973834	[Sana 4K] Add vae tiling option to avoid OOM (#10583 ) Co-authored-by: J石页 <jiangshuo9@h-partners.com>	2025-01-16 02:06:07 +05:30
hlky	980736b792	Fix train_dreambooth_lora_sd3_miniature (#10554 )	2025-01-13 13:47:27 +00:00
chaowenguo	d6c030fd37	add the xm.mark_step for the first denosing loop (#10530 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> * Update rerender_a_video.py * make style --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-10 21:03:41 +00:00
hlky	12fbe3f7dc	Use Pipelines without unet (#10440 ) * Use Pipelines without unet * unet.config.in_channels * default_sample_size * is_unet_version_less_0_9_0 --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 04:45:42 +00:00
Linoy Tsaban	83ba01a38d	small readme changes for advanced training examples (#10473 ) add to readme about hf login and wandb installation to address https://github.com/huggingface/diffusers/issues/10142#issuecomment-2571655570 Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-10 07:35:19 +05:30
chaowenguo	7bc8b92384	add callable object to convert frame into control_frame to reduce cpu memory usage. (#10501 ) * Update rerender_a_video.py * Update rerender_a_video.py * Update examples/community/rerender_a_video.py Co-authored-by: hlky <hlky@hlky.ac> --------- Co-authored-by: hlky <hlky@hlky.ac> Co-authored-by: YiYi Xu <yixu310@gmail.com>	2025-01-09 11:25:53 -10:00
Vladimir Mandic	f0c6d9784b	flux: make scheduler config params optional (#10384 ) * dont assume scheduler has optional config params * make style, make fix-copies * calculate_shift * fix-copies, usage in pipelines --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-09 10:44:26 -10:00
Bagheera	a0acbdc989	fix for #7365 , prevent pipelines from overriding provided prompt embeds (#7926 ) * fix for #7365, prevent pipelines from overriding provided prompt embeds * fix-copies * fix implementation * update --------- Co-authored-by: bghira <bghira@users.github.com> Co-authored-by: Aryan <aryan@huggingface.co> Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2025-01-08 10:12:12 -10:00
Parag Ekbote	5655b22ead	Notebooks for Community Scripts-5 (#10499 ) Add 5 Notebooks for Diffusers Community Pipelines.	2025-01-08 08:56:17 -08:00
hlky	ee7e141d80	Use pipelines without vae (#10441 ) * Use pipelines without vae * getattr * vqvae --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 13:26:51 -10:00
Teriks	03bcf5aefe	RFInversionFluxPipeline, small fix for enable_model_cpu_offload & enable_sequential_cpu_offload compatibility (#10480 ) RFInversionFluxPipeline.encode_image, device fix Use self._execution_device instead of self.device when selecting a device for the input image tensor. This allows for compatibility with enable_model_cpu_offload & enable_sequential_cpu_offload Co-authored-by: Teriks <Teriks@users.noreply.github.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-07 15:47:28 +01:00
dependabot[bot]	e0b96ba7b0	Bump jinja2 from 3.1.4 to 3.1.5 in /examples/research_projects/realfill (#10377 ) Bumps [jinja2](https://github.com/pallets/jinja) from 3.1.4 to 3.1.5. - [Release notes](https://github.com/pallets/jinja/releases) - [Changelog](https://github.com/pallets/jinja/blob/main/CHANGES.rst) - [Commits](https://github.com/pallets/jinja/compare/3.1.4...3.1.5) --- updated-dependencies: - dependency-name: jinja2 dependency-type: direct:production ... Signed-off-by: dependabot[bot] <support@github.com> Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>	2025-01-07 19:59:41 +05:30
hlky	628f2c544a	Use Pipelines without scheduler (#10439 ) Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:07:08 +00:00
Rahul Raman	f1e0c7ce4a	Refactor instructpix2pix lora to support peft (#10205 ) * make base code changes referred from train_instructpix2pix script in examples * change code to use PEFT as discussed in issue 10062 * update README training command * update README training command * refactor variable name and freezing unet * Update examples/research_projects/instructpix2pix_lora/train_instruct_pix2pix_lora.py Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * update README installation instructions. * cleanup code using make style and quality --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2025-01-07 12:00:45 +05:30
Sayak Paul	b94cfd7937	[Training] QoL improvements in the Flux Control training scripts (#10461 ) * qol improvements to the Flux script. * propagate the dataloader changes.	2025-01-07 11:56:17 +05:30
Ameer Azam	4f5e3e35d2	Regarding the RunwayML path for V1.5 did change to stable-diffusion-v1-5/[stable-diffusion-v1-5/ stable-diffusion-inpainting] (#10476 ) * Update pipeline_controlnet.py * Update pipeline_controlnet_img2img.py runwayml Take-down so change all from to this stable-diffusion-v1-5/stable-diffusion-v1-5 * Update pipeline_controlnet_inpaint.py * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * runwayml take-down make change to sd-legacy * Update convert_blipdiffusion_to_diffusers.py style change	2025-01-06 15:01:52 -08:00
chaowenguo	4e44534845	Update rerender_a_video.py fix dtype error (#10451 ) Update rerender_a_video.py	2025-01-04 14:52:50 +00:00
chaowenguo	a17832b2d9	add pythor_xla support for render a video (#10443 ) * Update rerender_a_video.py * Update rerender_a_video.py * make style --------- Co-authored-by: hlky <hlky@hlky.ac>	2025-01-03 16:00:02 +00:00
Doug J	f7822ae4bf	Update train_text_to_image_sdxl.py (#8830 ) Enable VAE hash to be able to change with args change. If not, train_dataset_with_embeddiings may have row number inconsistency with train_dataset_with_vae. Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2025-01-02 10:41:18 -10:00
Dev Rajput	4b9f1c7d8c	Add correct number of channels when resuming from checkpoint for Flux Control LoRa training (#10422 ) * Add correct number of channels when resuming from checkpoint * Fix Formatting	2025-01-02 15:51:44 +05:30
Sayak Paul	5f72473543	[training] add ds support to lora sd3. (#10378 ) * add ds support to lora sd3. Co-authored-by: leisuzz <jiangshuonb@gmail.com> * style. --------- Co-authored-by: leisuzz <jiangshuonb@gmail.com> Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com>	2024-12-30 19:31:05 +05:30

1 2 3 4 5 ...

1133 Commits