mirror of https://github.com/vladmandic/sdnext.git synced 2026-01-27 15:02:48 +03:00

89 Commits

Author SHA1 Message Date
Disty0
5d01cb5c2c Unify compile Upscaler compile command 2025-09-03 18:51:21 +03:00
Vladimir Mandic
287c3600d7 torch compile for llm
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-07-20 12:07:30 -04:00
Vladimir Mandic
27ce0dea9a torch.compile use repeated blocks
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-07-20 08:44:25 -04:00
Disty0
1acbabb276 Update OpenVINO to PyTorch 2.6 and fix mismatched shapes error on too many resolution changes 2025-02-09 01:17:06 +03:00
Disty0
62e1826faf OpenVINO safety check for compiled_model_state 2025-01-31 18:35:36 +03:00
Vladimir Mandic
d0d9759840 compile traceback
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-01-31 09:11:11 -05:00
Vladimir Mandic
06ba03cf80 settings option to disable reference models
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2025-01-23 15:19:43 -05:00
Disty0
9b579bfd96 Move quant functions to model_quant.py 2025-01-23 21:50:26 +03:00
Vladimir Mandic
935cac62a8 lint fixes
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-12-31 12:28:53 -05:00
Disty0
1998997189 OpenVINO fix shapes resolution change and disable re-compile 2024-12-31 17:45:23 +03:00
Vladimir Mandic
ed3e5f06d6 linting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-12-20 18:48:06 -05:00
Disty0
468c7d6bc8 Use apply_compile_model with torchao 2024-12-17 22:24:25 +03:00
Vladimir Mandic
fd7fe8cea5 add torchao
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-12-17 13:29:36 -05:00
Vladimir Mandic
164ce252dc add sd35 controlnets
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-11-28 08:46:10 -05:00
Vladimir Mandic
ae4591ac0b reimplement torchao quantization
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-18 09:34:04 -04:00
Vladimir Mandic
6bb688c371 add set_accelerate
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-14 13:57:05 -04:00
Vladimir Mandic
ea0dfebe2d better handle any quant lib requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
2024-10-12 13:36:16 -04:00
Disty0
012a7f3572 Update OpenVINO to 2024.3.0 2024-09-13 03:57:06 +03:00
Vladimir Mandic
f2c5cbbb36 lint updates and diffusers installer 2024-09-06 14:10:53 -04:00
Vladimir Mandic
85b26e03ff minor updates 2024-09-06 10:13:32 -04:00
Vladimir Mandic
5ed58ac7cc end-to-end update flux, see changelog and wiki 2024-08-28 08:04:24 -04:00
Disty0
963940b9ae Fix no half vae 2024-08-21 22:45:02 +03:00
Disty0
b706083541 Quanto Activations fix Diffuser's model offload bug 2024-08-21 20:48:32 +03:00
Disty0
e40e13a330 Quanto fix Flux activations 2024-08-21 20:04:05 +03:00
Disty0
c3ff21c15e Quanto freeze the model before calibration 2024-08-21 19:18:57 +03:00
Disty0
694d25c161 Fix quanto 2024-08-21 19:17:04 +03:00
Disty0
16d6c03d45 Optimum Quanto activations support 2024-08-21 17:30:45 +03:00
Disty0
3f5c3ba0d8 Add warning to Quanto with balanced and sequential offload 2024-08-16 02:58:43 +03:00
Disty0
f3f721e39a Quanto disable gemm kernels 2024-08-14 20:26:46 +03:00
Disty0
e3b087b6c0 Add balanced offload mode and make offload modes a single choice list 2024-08-11 17:27:30 +03:00
Disty0
7eacec4c39 Quant send to gpu with shuffle option on high vram systems 2024-08-04 23:01:58 +03:00
Disty0
dc9e60aa67 Quant add shuffle models option 2024-08-04 04:46:06 +03:00
Disty0
bb707e4509 FLUX support 2024-08-02 18:22:06 +03:00
Disty0
9965ef75e7 De-dupe Cascade 2024-08-01 18:12:02 +03:00
Disty0
b50a8601fe Fix T5 INT8 and add QINT8 2024-07-30 18:23:21 +03:00
Disty0
6c75bcca0a Optimum Quanto support 2024-07-30 17:35:56 +03:00
Disty0
9c1c8feeb8 NNCF fix AuraFlow 2024-07-22 23:02:30 +03:00
Vladimir Mandic
7a163a34f2 check deepcache 2024-06-28 10:37:43 -04:00
Disty0
0aaabfc2e6 NNCF fix Lora support without reloading 2024-06-21 15:18:17 +03:00
Disty0
bf9565cb46 NNCF compression support on CPU and add INT8 option for T5 2024-06-19 21:23:47 +03:00
Disty0
77a3f0ab2f Cleanup 2024-06-16 21:49:41 +03:00
Disty0
4c7b4f382e Fix NNCF with T5 2024-06-16 21:47:20 +03:00
Disty0
042cac8846 Stable Cascade fix NNCF compress 2024-05-29 16:48:41 +03:00
Vladimir Mandic
9a7a5ba81c lint cleanup 2024-05-28 10:48:27 -04:00
Disty0
47806837e9 Cleanup compile code 2024-05-20 01:18:01 +03:00
Disty0
5ae658d91a Cleanup 2024-05-19 23:32:15 +03:00
Disty0
b7246ef4e6 Stable Cascade compile fixes 2024-05-19 23:20:04 +03:00
Vladimir Mandic
b137f67edc lint changes 2024-05-07 09:56:32 -04:00
Disty0
29e5d88e37 Add migraphx compile backend 2024-04-05 18:13:20 +03:00
Vladimir Mandic
25bc3c9bb6 Merge pull request #3000 from aifartist/dev
Partial support for onediff
2024-03-25 15:00:43 -04:00