Dhruv Nair
|
bcd4d77ba6
|
[CI] Remove big accelerator requirements from Quanto Tests (#12266)
update
|
2025-09-03 08:29:31 +05:30 |
|
Dhruv Nair
|
7aa6af1138
|
[Refactor] Move testing utils out of src (#12238)
* update
* update
* update
* update
* update
* merge main
* Revert "merge main"
This reverts commit 65efbcead5.
|
2025-08-28 19:53:02 +05:30 |
|
Yao Matrix
|
9ce89e2efa
|
enable group_offload cases and quanto cases on XPU (#11405)
* enable group_offload cases and quanto cases on XPU
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
* use backend APIs
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
* fix style
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
---------
Signed-off-by: YAO Matrix <matrix.yao@intel.com>
Signed-off-by: Yao Matrix <matrix.yao@intel.com>
|
2025-04-28 06:37:16 +05:30 |
|
Sayak Paul
|
2f0f281b0d
|
[Tests] restrict memory tests for quanto for certain schemes. (#11052)
* restrict memory tests for quanto for certain schemes.
* Apply suggestions from code review
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
* fixes
* style
---------
Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>
|
2025-03-14 10:35:19 +05:30 |
|
Sayak Paul
|
e7e6d85282
|
[Tests] improve quantization tests by additionally measuring the inference memory savings (#11021)
* memory usage tests
* fixes
* gguf
|
2025-03-10 21:42:24 +05:30 |
|
Dhruv Nair
|
f5edaa7894
|
[Quantization] Add Quanto backend (#10756)
* update
* updaet
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update docs/source/en/quantization/quanto.md
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* update
* Update src/diffusers/quantizers/quanto/utils.py
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
* update
* update
---------
Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>
|
2025-03-10 08:33:05 +05:30 |
|