sayakpaul
cc605aa33e
remove syncs before denoising in Kontext
2025-06-27 10:52:57 +05:30
Aryan
d7dd924ece
Kontext fixes ( #11815 )
...
fix
2025-06-26 13:03:44 -10:00
Sayak Paul
00f95b9755
Kontext training ( #11813 )
...
* support flux kontext
* make fix-copies
* add example
* add tests
* update docs
* update
* add note on integrity checker
* initial commit
* initial commit
* add readme section and fixes in the training script.
* add test
* rectify ckpt_id
* fix ckpt
* fixes
* change id
* update
* Update examples/dreambooth/train_dreambooth_lora_flux_kontext.py
Co-authored-by: Aryan <aryan@huggingface.co >
* Update examples/dreambooth/README_flux.md
---------
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: linoytsaban <linoy@huggingface.co >
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-06-26 19:31:42 +03:00
Aryan
eea76892e8
Flux Kontext ( #11812 )
...
* support flux kontext
* make fix-copies
* add example
* add tests
* update docs
* update
* add note on integrity checker
* make fix-copies issue
* add copied froms
* make style
* update repository ids
* more copied froms
2025-06-26 21:29:59 +05:30
kaixuanliu
27bf7fcd0e
adjust tolerance criteria for test_float16_inference in unit test ( #11809 )
...
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com >
2025-06-26 13:19:59 +05:30
Sayak Paul
a185e1ab91
[tests] add a test on torch compile for varied resolutions ( #11776 )
...
* add test for checking compile on different shapes.
* update
* update
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-06-26 10:07:03 +05:30
Animesh Jain
d93381cd41
[rfc][compile] compile method for DiffusionPipeline ( #11705 )
...
* [rfc][compile] compile method for DiffusionPipeline
* Apply suggestions from code review
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* Apply style fixes
* Update docs/source/en/optimization/fp16.md
* check
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-06-26 08:41:38 +05:30
Dhruv Nair
3649d7b903
Follow up for Group Offload to Disk ( #11760 )
...
* update
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-06-26 07:24:24 +05:30
Sayak Paul
10c36e0b78
[chore] post release v0.34.0 ( #11800 )
...
* post release v0.34.0
* code quality
---------
Co-authored-by: YiYi Xu <yixu310@gmail.com >
2025-06-26 06:56:46 +05:30
Sayak Paul
8846635873
fix deprecation in lora after 0.34.0 release ( #11802 )
2025-06-25 08:48:20 -10:00
kaixuanliu
dd285099eb
adjust to get CI test cases passed on XPU ( #11759 )
...
* adjust to get CI test cases passed on XPU
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com >
* fix format issue
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com >
* Apply style fixes
---------
Signed-off-by: Liu, Kaixuan <kaixuan.liu@intel.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co >
2025-06-25 14:02:17 +05:30
Sayak Paul
80f27d7e8d
[tests] skip instead of returning. ( #11793 )
...
skip instead of returning.
2025-06-25 08:59:36 +05:30
Sayak Paul
d3e27e05f0
guard omnigen processor. ( #11799 )
2025-06-24 19:15:34 +05:30
Aryan
5df02fc171
[tests] Fix group offloading and layerwise casting test interaction ( #11796 )
...
* update
* update
* update
2025-06-24 17:33:32 +05:30
Sayak Paul
7392c8ff5a
[chore] raise as early as possible in group offloading ( #11792 )
...
* raise as early as possible in group offloading
* remove check from ModuleGroup
2025-06-24 15:05:23 +05:30
Aryan
474a248f10
[tests] Fix HunyuanVideo Framepack device tests ( #11789 )
...
update
2025-06-24 13:49:37 +05:30
YiYi Xu
7bc0a07b19
[lora] only remove hooks that we add back ( #11768 )
...
up
2025-06-23 16:49:19 -10:00
Sayak Paul
92542719ed
[docs] minor cleanups in the lora docs. ( #11770 )
...
* minor cleanups in the lora docs.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* format docs
* fix copies
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-06-24 08:10:07 +05:30
imbr92
6760300202
Add --lora_alpha and metadata handling to train_dreambooth_lora_sana.py ( #11744 )
...
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-06-23 15:46:44 +03:00
Yuanchen Guo
798265f2b6
[Wan] Fix mask padding in Wan VACE pipeline. ( #11778 )
2025-06-23 16:28:21 +05:30
Dhruv Nair
cd813499be
[CI] Skip ONNX Upscale tests ( #11774 )
...
update
2025-06-23 12:14:01 +05:30
Sayak Paul
fbddf02807
[tests] properly skip tests instead of return ( #11771 )
...
model test updates
2025-06-23 11:59:59 +05:30
Yao Matrix
f20b83a04f
enable cpu offloading of new pipelines on XPU & use device agnostic empty to make pipelines work on XPU ( #11671 )
...
* commit 1
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* patch 2
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* Update pipeline_pag_sana.py
* Update pipeline_sana.py
* Update pipeline_sana_controlnet.py
* Update pipeline_sana_sprint_img2img.py
* Update pipeline_sana_sprint.py
* fix style
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* fix fat-thumb while merge conflict
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
* fix ci issues
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com >
Co-authored-by: Ilyas Moutawwakil <57442720+IlyasMoutawwakil@users.noreply.github.com >
2025-06-23 09:44:16 +05:30
jiqing-feng
ee40088fe5
enable deterministic in bnb 4 bit tests ( #11738 )
...
* enable deterministic in bnb 4 bit tests
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
* fix 8bit test
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
---------
Signed-off-by: jiqing-feng <jiqing.feng@intel.com >
2025-06-23 08:17:36 +05:30
Tolga Cangöz
7fc53b5d66
Fix dimensionalities in apply_rotary_emb functions' comments ( #11717 )
...
Fix dimensionality in `apply_rotary_emb` functions' comments.
2025-06-21 12:09:28 -10:00
Steven Liu
0874dd04dc
[docs] LoRA scale scheduling ( #11727 )
...
draft
2025-06-20 10:15:29 -07:00
Steven Liu
6184d8a433
[docs] device_map ( #11711 )
...
draft
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-06-20 10:14:48 -07:00
Steven Liu
5a6e386464
[docs] Quantization + torch.compile + offloading ( #11703 )
...
* draft
* feedback
* update
* feedback
* fix
* feedback
* feedback
* fix
* feedback
2025-06-20 10:11:39 -07:00
Dhruv Nair
42077e6c73
Fix failing cpu offload test for LTX Latent Upscale ( #11755 )
...
update
2025-06-20 06:07:34 +02:00
Sayak Paul
3d8d8485fc
fix invalid component handling behaviour in PipelineQuantizationConfig ( #11750 )
...
* start
* updates
2025-06-20 07:54:12 +05:30
Dhruv Nair
195926bbdc
Update Chroma Docs ( #11753 )
...
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-06-19 19:33:19 +02:00
Sayak Paul
85a916bb8b
make group offloading work with disk/nvme transfers ( #11682 )
...
* start implementing disk offloading in group.
* delete diff file.
* updates.patch
* offload_to_disk_path
* check if safetensors already exist.
* add test and clarify.
* updates
* update todos.
* update more docs.
* update docs
2025-06-19 18:09:30 +05:30
Dhruv Nair
3287ce2890
Fix HiDream pipeline test module ( #11754 )
...
update
2025-06-19 17:06:14 +05:30
Dhruv Nair
0c11c8c1ac
[CI] Fix SANA tests ( #11756 )
...
update
2025-06-19 17:06:02 +05:30
Dhruv Nair
fc51583c8a
[CI] Fix WAN VACE tests ( #11757 )
...
update
2025-06-19 17:03:12 +05:30
Sayak Paul
fb57c76aa1
[LoRA] refactor lora loading at the model-level ( #11719 )
...
* factor out stuff from load_lora_adapter().
* simplifying text encoder lora loading.
* fix peft.py
* fix logging locations.
* formatting
* fix
* update
* update
* update
2025-06-19 13:06:25 +05:30
dependabot[bot]
7251bb4fd0
Bump urllib3 from 2.2.3 to 2.5.0 in /examples/server ( #11748 )
...
Bumps [urllib3](https://github.com/urllib3/urllib3 ) from 2.2.3 to 2.5.0.
- [Release notes](https://github.com/urllib3/urllib3/releases )
- [Changelog](https://github.com/urllib3/urllib3/blob/main/CHANGES.rst )
- [Commits](https://github.com/urllib3/urllib3/compare/2.2.3...2.5.0 )
---
updated-dependencies:
- dependency-name: urllib3
dependency-version: 2.5.0
dependency-type: indirect
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-19 11:09:33 +05:30
Aryan
3fba74e153
Add missing HiDream license ( #11747 )
...
update
2025-06-19 08:07:47 +05:30
Aryan
a4df8dbc40
Update more licenses to 2025 ( #11746 )
...
update
2025-06-19 07:46:01 +05:30
Sayak Paul
48eae6f420
[Quantizers] add is_compileable property to quantizers. ( #11736 )
...
add is_compileable property to quantizers.
2025-06-19 07:45:06 +05:30
Dhruv Nair
66394bf6c7
Chroma Follow Up ( #11725 )
...
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* updte
* update
* update
* update
2025-06-18 22:24:41 +05:30
Sayak Paul
62cce3045d
[chore] change to 2025 licensing for remaining ( #11741 )
...
change to 2025 licensing for remaining
2025-06-18 20:56:00 +05:30
Sayak Paul
05e867784d
[tests] device_map tests for all models. ( #11708 )
...
* device_map tests for all models.
* updates
* Update tests/models/test_modeling_common.py
Co-authored-by: Aryan <aryan@huggingface.co >
* fix device_map in test
---------
Co-authored-by: Aryan <aryan@huggingface.co >
2025-06-18 10:52:06 +05:30
Leo Jiang
d72184eba3
[training] add ds support to lora hidream ( #11737 )
...
* [training] add ds support to lora hidream
* Apply style fixes
---------
Co-authored-by: J石页 <jiangshuo9@h-partners.com >
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
2025-06-18 09:26:02 +05:30
Saurabh Misra
5ce4814af1
⚡ ️ Speed up method AutoencoderKLWan.clear_cache by 886% ( #11665 )
...
* ⚡ ️ Speed up method `AutoencoderKLWan.clear_cache` by 886%
**Key optimizations:**
- Compute the number of `WanCausalConv3d` modules in each model (`encoder`/`decoder`) **only once during initialization**, store in `self._cached_conv_counts`. This removes unnecessary repeated tree traversals at every `clear_cache` call, which was the main bottleneck (from profiling).
- The internal helper `_count_conv3d_fast` is optimized via a generator expression with `sum` for efficiency.
All comments from the original code are preserved, except for updated or removed local docstrings/comments relevant to changed lines.
**Function signatures and outputs remain unchanged.**
* Apply style fixes
* Apply suggestions from code review
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
* Apply style fixes
---------
Co-authored-by: codeflash-ai[bot] <148906541+codeflash-ai[bot]@users.noreply.github.com>
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Aryan <aryan@huggingface.co >
Co-authored-by: Aryan <contact.aryanvs@gmail.com >
Co-authored-by: Aseem Saxena <aseem.bits@gmail.com >
2025-06-18 08:46:03 +05:30
Linoy Tsaban
1bc6f3dc0f
[LoRA training] update metadata use for lora alpha + README ( #11723 )
...
* lora alpha
* Apply style fixes
* Update examples/advanced_diffusion_training/README_flux.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
* fix readme format
---------
Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com >
2025-06-17 12:19:27 +03:00
Aryan
79bd7ecc78
Support more Wan loras (VACE) ( #11726 )
...
update
2025-06-17 10:39:18 +05:30
David Berenstein
9b834f8710
Add Pruna optimization framework documentation ( #11688 )
...
* Add Pruna optimization framework documentation
- Introduced a new section for Pruna in the table of contents.
- Added comprehensive documentation for Pruna, detailing its optimization techniques, installation instructions, and examples for optimizing and evaluating models
* Enhance Pruna documentation with image alt text and code block formatting
- Added alt text to images for better accessibility and context.
- Changed code block syntax from diff to python for improved clarity.
* Add installation section to Pruna documentation
- Introduced a new installation section in the Pruna documentation to guide users on how to install the framework.
- Enhanced the overall clarity and usability of the documentation for new users.
* Update pruna.md
* Update pruna.md
* Update Pruna documentation for model optimization and evaluation
- Changed section titles for consistency and clarity, from "Optimizing models" to "Optimize models" and "Evaluating and benchmarking optimized models" to "Evaluate and benchmark models".
- Enhanced descriptions to clarify the use of `diffusers` models and the evaluation process.
- Added a new example for evaluating standalone `diffusers` models.
- Updated references and links for better navigation within the documentation.
* Refactor Pruna documentation for clarity and consistency
- Removed outdated references to FLUX-juiced and streamlined the explanation of benchmarking.
- Enhanced the description of evaluating standalone `diffusers` models.
- Cleaned up code examples by removing unnecessary imports and comments for better readability.
* Apply suggestions from code review
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
* Enhance Pruna documentation with new examples and clarifications
- Added an image to illustrate the optimization process.
- Updated the explanation for sharing and loading optimized models on the Hugging Face Hub.
- Clarified the evaluation process for optimized models using the EvaluationAgent.
- Improved descriptions for defining metrics and evaluating standalone diffusers models.
---------
Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com >
2025-06-16 12:25:05 -07:00
Carl Thomé
81426b0f19
Fix misleading comment ( #11722 )
2025-06-16 08:47:00 -10:00
Sayak Paul
f0dba33d82
[training] show how metadata stuff should be incorporated in training scripts. ( #11707 )
...
* show how metadata stuff should be incorporated in training scripts.
* typing
* fix
---------
Co-authored-by: Linoy Tsaban <57615435+linoytsaban@users.noreply.github.com >
2025-06-16 16:42:34 +05:30