Disty0
|
5d01cb5c2c
|
Unify compile Upscaler compile command
|
2025-09-03 18:51:21 +03:00 |
|
Vladimir Mandic
|
287c3600d7
|
torch compile for llm
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-07-20 12:07:30 -04:00 |
|
Vladimir Mandic
|
27ce0dea9a
|
torch.compile use repeated blocks
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-07-20 08:44:25 -04:00 |
|
Disty0
|
1acbabb276
|
Update OpenVINO to PyTorch 2.6 and fix mismatched shapes error on too many resolution changes
|
2025-02-09 01:17:06 +03:00 |
|
Disty0
|
62e1826faf
|
OpenVINO safety check for compiled_model_state
|
2025-01-31 18:35:36 +03:00 |
|
Vladimir Mandic
|
d0d9759840
|
compile traceback
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-01-31 09:11:11 -05:00 |
|
Vladimir Mandic
|
06ba03cf80
|
settings option to disable reference models
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2025-01-23 15:19:43 -05:00 |
|
Disty0
|
9b579bfd96
|
Move quant functions to model_quant.py
|
2025-01-23 21:50:26 +03:00 |
|
Vladimir Mandic
|
935cac62a8
|
lint fixes
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-12-31 12:28:53 -05:00 |
|
Disty0
|
1998997189
|
OpenVINO fix shapes resolution change and disable re-compile
|
2024-12-31 17:45:23 +03:00 |
|
Vladimir Mandic
|
ed3e5f06d6
|
linting
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-12-20 18:48:06 -05:00 |
|
Disty0
|
468c7d6bc8
|
Use apply_compile_model with torchao
|
2024-12-17 22:24:25 +03:00 |
|
Vladimir Mandic
|
fd7fe8cea5
|
add torchao
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-12-17 13:29:36 -05:00 |
|
Vladimir Mandic
|
164ce252dc
|
add sd35 controlnets
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-11-28 08:46:10 -05:00 |
|
Vladimir Mandic
|
ae4591ac0b
|
reimplement torchao quantization
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-10-18 09:34:04 -04:00 |
|
Vladimir Mandic
|
6bb688c371
|
add set_accelerate
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-10-14 13:57:05 -04:00 |
|
Vladimir Mandic
|
ea0dfebe2d
|
better handle any quant lib requirements
Signed-off-by: Vladimir Mandic <mandic00@live.com>
|
2024-10-12 13:36:16 -04:00 |
|
Disty0
|
012a7f3572
|
Update OpenVINO to 2024.3.0
|
2024-09-13 03:57:06 +03:00 |
|
Vladimir Mandic
|
f2c5cbbb36
|
lint updates and diffusers installer
|
2024-09-06 14:10:53 -04:00 |
|
Vladimir Mandic
|
85b26e03ff
|
minor updates
|
2024-09-06 10:13:32 -04:00 |
|
Vladimir Mandic
|
5ed58ac7cc
|
end-to-end update flux, see changelog and wiki
|
2024-08-28 08:04:24 -04:00 |
|
Disty0
|
963940b9ae
|
Fix no half vae
|
2024-08-21 22:45:02 +03:00 |
|
Disty0
|
b706083541
|
Quanto Activations fix Diffuser's model offload bug
|
2024-08-21 20:48:32 +03:00 |
|
Disty0
|
e40e13a330
|
Quanto fix Flux activations
|
2024-08-21 20:04:05 +03:00 |
|
Disty0
|
c3ff21c15e
|
Quanto freeze the model before calibration
|
2024-08-21 19:18:57 +03:00 |
|
Disty0
|
694d25c161
|
Fix quanto
|
2024-08-21 19:17:04 +03:00 |
|
Disty0
|
16d6c03d45
|
Optimum Quanto activations support
|
2024-08-21 17:30:45 +03:00 |
|
Disty0
|
3f5c3ba0d8
|
Add warning to Quanto with balanced and sequential offload
|
2024-08-16 02:58:43 +03:00 |
|
Disty0
|
f3f721e39a
|
Quanto disable gemm kernels
|
2024-08-14 20:26:46 +03:00 |
|
Disty0
|
e3b087b6c0
|
Add balanced offload mode and make offload modes a single choice list
|
2024-08-11 17:27:30 +03:00 |
|
Disty0
|
7eacec4c39
|
Quant send to gpu with shuffle option on high vram systems
|
2024-08-04 23:01:58 +03:00 |
|
Disty0
|
dc9e60aa67
|
Quant add shuffle models option
|
2024-08-04 04:46:06 +03:00 |
|
Disty0
|
bb707e4509
|
FLUX support
|
2024-08-02 18:22:06 +03:00 |
|
Disty0
|
9965ef75e7
|
De-dupe Cascade
|
2024-08-01 18:12:02 +03:00 |
|
Disty0
|
b50a8601fe
|
Fix T5 INT8 and add QINT8
|
2024-07-30 18:23:21 +03:00 |
|
Disty0
|
6c75bcca0a
|
Optimum Quanto support
|
2024-07-30 17:35:56 +03:00 |
|
Disty0
|
9c1c8feeb8
|
NNCF fix AuraFlow
|
2024-07-22 23:02:30 +03:00 |
|
Vladimir Mandic
|
7a163a34f2
|
check deepcache
|
2024-06-28 10:37:43 -04:00 |
|
Disty0
|
0aaabfc2e6
|
NNCF fix Lora support without reloading
|
2024-06-21 15:18:17 +03:00 |
|
Disty0
|
bf9565cb46
|
NNCF compression support on CPU and add INT8 option for T5
|
2024-06-19 21:23:47 +03:00 |
|
Disty0
|
77a3f0ab2f
|
Cleanup
|
2024-06-16 21:49:41 +03:00 |
|
Disty0
|
4c7b4f382e
|
Fix NNCF with T5
|
2024-06-16 21:47:20 +03:00 |
|
Disty0
|
042cac8846
|
Stable Cascade fix NNCF compress
|
2024-05-29 16:48:41 +03:00 |
|
Vladimir Mandic
|
9a7a5ba81c
|
lint cleanup
|
2024-05-28 10:48:27 -04:00 |
|
Disty0
|
47806837e9
|
Cleanup compile code
|
2024-05-20 01:18:01 +03:00 |
|
Disty0
|
5ae658d91a
|
Cleanup
|
2024-05-19 23:32:15 +03:00 |
|
Disty0
|
b7246ef4e6
|
Stable Cascade compile fixes
|
2024-05-19 23:20:04 +03:00 |
|
Vladimir Mandic
|
b137f67edc
|
lint changes
|
2024-05-07 09:56:32 -04:00 |
|
Disty0
|
29e5d88e37
|
Add migraphx compile backend
|
2024-04-05 18:13:20 +03:00 |
|
Vladimir Mandic
|
25bc3c9bb6
|
Merge pull request #3000 from aifartist/dev
Partial support for onediff
|
2024-03-25 15:00:43 -04:00 |
|