Mirror of https://github.com/vladmandic/sdnext.git, synced 2026-01-29 05:02:09 +03:00

Commit Graph

  • 3f3a986c0e SDNQ fix scale staying in fp32 with tensorwise fp8 matmul Disty0 2025-06-02 23:18:10 +03:00
  • 3ca9f29c8f Merge pull request #3957 from vladmandic/dev Vladimir Mandic 2025-06-02 15:52:22 +02:00
  • 7d05bed459 update changelog Vladimir Mandic 2025-06-02 15:50:42 +02:00
  • f147bca20f Merge branch 'master' into dev Vladimir Mandic 2025-06-02 15:45:38 +02:00
  • b1d6897621 update changelog Vladimir Mandic 2025-06-02 15:44:52 +02:00
  • 4e3795a0a5 SDNQ fix packed int8 matmul Disty0 2025-06-02 03:31:51 +03:00
  • 82f5634d53 SDNQ use torch.bool for uint1 Disty0 2025-06-02 01:39:51 +03:00
  • ea7fe3bb73 OpenVINO update dtype_mapping Disty0 2025-06-02 01:11:41 +03:00
  • e8588c91ea SDNQ enable matmul support for float8_e5m2 Disty0 2025-06-02 00:53:10 +03:00
  • 8f1a1d7311 SDNQ expand quantized_matmul_dtypes for CPU Disty0 2025-06-02 00:28:29 +03:00
  • b146025a5e SDNQ add int2 Disty0 2025-06-02 00:17:39 +03:00
  • 766aec32d5 Update changelog Disty0 2025-06-01 23:35:00 +03:00
  • 9669b36010 SDNQ fix older PyTorch with FP8 matmul Disty0 2025-06-01 23:29:16 +03:00
  • acefa58834 SDNQ don't force fp32 with fp8 tensorwise matmul Disty0 2025-06-01 23:16:00 +03:00
  • 839295f79a Add fp8 fnuz to sdnq options Disty0 2025-06-01 23:10:08 +03:00
  • c77162fb82 update wiki and changelog Vladimir Mandic 2025-06-01 21:31:43 +02:00
  • 539fae3234 Update naming Disty0 2025-06-01 21:01:56 +03:00
  • cefe460052 SDNQ skip FP8 matmul for input len < 32 Disty0 2025-05-31 01:27:59 +03:00
  • 046840c8be Fix HiDream sampling Disty0 2025-05-31 00:52:56 +03:00
  • 109c0d7e49 SDNQ use tensorwise FP8 matmul on CPU Disty0 2025-05-30 21:09:53 +03:00
  • 959b759721 Cleanup Disty0 2025-05-30 16:45:59 +03:00
  • b5d588fa45 SDNQ remove unnecessary bitwise ands Disty0 2025-05-30 16:29:59 +03:00
  • db816d7088 Cleanup Disty0 2025-05-30 16:02:26 +03:00
  • c85cc6b397 SDNQ enable quant with GPU by default and don't do unnecessary clones Disty0 2025-05-30 15:21:29 +03:00
  • 4654acde3c SDNQ re-enable memory fix for diffusers Disty0 2025-05-30 14:59:45 +03:00
  • 87a801e24d SDNQ remove memory fix hijack Disty0 2025-05-30 13:54:49 +03:00
  • f81cb22c00 SDNQ fix new transformers Disty0 2025-05-30 13:32:03 +03:00
  • 36febda6e6 SDNQ update supported dtypes Disty0 2025-05-30 13:07:23 +03:00
  • 29bd2af779 SDNQ add 6-bit support Disty0 2025-05-30 12:20:13 +03:00
  • 98a11fc86c fix gallery duplicate entries Vladimir Mandic 2025-05-30 11:04:56 +02:00
  • 9168a66fd2 update requirements Vladimir Mandic 2025-05-30 08:53:39 +02:00
  • 4e184f41af update changelog Vladimir Mandic 2025-05-30 08:44:30 +02:00
  • d1491962d9 One bit Disty0 2025-05-30 05:41:02 +03:00
  • 3c8be0f55f SDNQ add uint2 Disty0 2025-05-30 04:47:29 +03:00
  • 599224d392 SDNQ reduce 5 reshape ops to 2 with quantized input Disty0 2025-05-30 01:31:41 +03:00
  • d8dea9031f SDNQ do FP8 matmul shape check only once Disty0 2025-05-30 01:13:37 +03:00
  • b4e615e760 SDNQ add FP8 row wise scaling workaround for SM89 on Windows Disty0 2025-05-30 00:16:54 +03:00
  • 54154cf698 Cleanup Disty0 2025-05-29 20:22:49 +03:00
  • 90324f9c8c SDNQ fix lora with quant matmul Disty0 2025-05-29 18:25:12 +03:00
  • df8b31fcfc Don't downcast scale with fp8 matmul Disty0 2025-05-29 16:35:40 +03:00
  • 2351efb8f7 Remove redundant shape check Disty0 2025-05-29 14:58:00 +03:00
  • 14893b7617 Don't make the weights contiguous with int8 matmul Disty0 2025-05-29 03:43:57 +03:00
  • cf2d1e56a6 Update changelog Disty0 2025-05-29 03:32:11 +03:00
  • 2cc5a58b0f Update changelog Disty0 2025-05-29 03:26:47 +03:00
  • 67e0f4d833 Cleanup Disty0 2025-05-29 03:22:40 +03:00
  • 3698f8bb84 SDNQ add experimental FP8 matmul Disty0 2025-05-29 03:11:59 +03:00
  • dd33c4d583 Fix scale and zero_point not being moved by tensor.to Disty0 2025-05-28 17:46:06 +03:00
  • dd0dbc476f SDNQ fix asym quant formula for dtypes with non zero minimums Disty0 2025-05-28 17:25:38 +03:00
  • e06cbea7aa Cleanup Disty0 2025-05-28 15:55:08 +03:00
  • d8e8f47ce5 SDNQ add an option to toggle quantize with GPU Disty0 2025-05-28 15:18:39 +03:00
  • 1961e88c13 Set SDPA as the default on all backends and enable Dyn SDPA on ROCm, DML, CPU and MPS Disty0 2025-05-28 13:42:29 +03:00
  • 569e9099d7 Use torch.amax instead of torch.max Disty0 2025-05-28 12:44:07 +03:00
  • 0b564e2373 Cleanup Disty0 2025-05-28 04:07:45 +03:00
  • 1433dfe3de SDNQ fix high RAM usage with pre mode Disty0 2025-05-28 03:16:29 +03:00
  • 4ed15f5cce SDNQ revert device_map = gpu Disty0 2025-05-27 23:32:58 +03:00
  • d3e3fb98b0 Don't override user set device_map Disty0 2025-05-27 21:45:52 +03:00
  • b1b29e9001 SDNQ disable device_map = gpu with TE and LLM Disty0 2025-05-27 21:32:32 +03:00
  • b724cd7c57 Update changelog Disty0 2025-05-27 21:21:42 +03:00
  • 5d3c1832b2 SDNQ add FP8 quants Disty0 2025-05-27 20:29:15 +03:00
  • 3618e39cff SDNQ use device_map = gpu Disty0 2025-05-27 19:46:30 +03:00
  • 73999ac710 Add soft gc to nncf quant layer Disty0 2025-05-27 16:24:04 +03:00
  • e94128a02e SDNQ add force torch_gc to pre load mode Disty0 2025-05-27 16:11:04 +03:00
  • dece497f10 Refactor SDNQ to use weights_dtype and rename decompress_int8_matmul to use_quantized_matmul Disty0 2025-05-27 15:49:21 +03:00
  • 79bb348927 SDNQ sort quant schemes by recommended order Disty0 2025-05-27 13:06:17 +03:00
  • dec460e665 SDNQ use torch.bitwise ops instead of python Disty0 2025-05-27 03:02:36 +03:00
  • 280be31883 SDNQ fix Lora change Disty0 2025-05-27 00:08:32 +03:00
  • 4d9c2a8608 Cleanup Disty0 2025-05-26 22:41:12 +03:00
  • 84ddfb2868 SDNQ fix lora apply Disty0 2025-05-26 22:39:20 +03:00
  • 6dee9f5ac7 Fix HiDream teacache not resetting Disty0 2025-05-26 21:21:01 +03:00
  • 742cd61d1f Add TeaCache for HiDream Disty0 2025-05-26 19:59:43 +03:00
  • 687c50dcc8 SDNQ fix Lora Disty0 2025-05-26 19:48:45 +03:00
  • ccf9deaf28 Move SDNQ to the top of the settings list Disty0 2025-05-26 18:30:50 +03:00
  • 02f15b28cc Cleanup Disty0 2025-05-26 15:57:17 +03:00
  • 91bb07f650 SDNQ remove unused args and simplify decompressors Disty0 2025-05-26 15:51:53 +03:00
  • d2159af10e cleanup Disty0 2025-05-26 04:24:28 +03:00
  • 4ad404182d cleanup Disty0 2025-05-26 04:17:22 +03:00
  • 3f8ae754a0 Update readme Disty0 2025-05-26 03:35:25 +03:00
  • 46e9a9a631 IPEX disable Dynamic Attention by default on PyTorch 2.7 Disty0 2025-05-26 03:03:17 +03:00
  • 5fcd0be79c Update changelog Disty0 2025-05-26 02:50:05 +03:00
  • e314a7ca19 Update changelog Disty0 2025-05-26 02:49:12 +03:00
  • 17df7ba83b Cleanup whitespace Disty0 2025-05-26 02:41:29 +03:00
  • 4453efee76 Rename NNCF to SDNQ and rename quant schemes Disty0 2025-05-26 02:39:51 +03:00
  • 9c2e15433e NNCF set required_packages to None Disty0 2025-05-26 01:39:09 +03:00
  • cbc1bfe710 Cleanup Disty0 2025-05-26 01:24:06 +03:00
  • 2d79380bd7 NNCF implement better layer hijacks and remove all NNCF imports Disty0 2025-05-26 01:12:28 +03:00
  • af3a44ccbe optional skimage Vladimir Mandic 2025-05-24 08:58:04 +02:00
  • bfc5c7c457 installer version check Vladimir Mandic 2025-05-24 08:42:13 +02:00
  • 85f00f9edb Enable dyn atten by default for ROCm Disty0 2025-05-23 18:24:47 +03:00
  • 50e3a134ca Merge pull request #3941 from hypercryptoman/patch-1 Vladimir Mandic 2025-05-23 10:31:01 +02:00
  • abc081d242 Merge branch 'dev' into patch-1 Vladimir Mandic 2025-05-23 10:30:33 +02:00
  • ac05b96838 Update prompt_enhance.py hypercryptoman 2025-05-19 13:55:16 +10:00
  • 05fced7395 Update prompt_enhance.py hypercryptoman 2025-05-19 13:53:24 +10:00
  • ba2eaaf295 Fix: Correct model_file parameter usage for custom load button hypercryptoman 2025-05-19 13:45:05 +10:00
  • 2b824daf64 Revert MIOPEN_FIND_ENFORCE Disty0 2025-05-18 21:28:44 +03:00
  • 7f2d77e956 ROCm set MIOPEN_FIND_ENFORCE to SEARCH Disty0 2025-05-18 16:36:27 +03:00
  • b23162a36b Fix: Correct arguments for prompt_enhance.py apply method hypercryptoman 2025-05-18 23:35:55 +10:00
  • 3d8390de9b IPEX return devices.dtype instead of bf16 Disty0 2025-05-18 04:44:51 +03:00
  • d0e6f01286 IPEX remove GradScaler and use torch.amp instead Disty0 2025-05-18 04:39:48 +03:00
  • a009e17d2b NNCF use per token input quantization with int8 matmul Disty0 2025-05-17 19:46:47 +03:00
  • 12ebadccd4 Merge pull request #3940 from vladmandic/dev 2025-05-16 Vladimir Mandic 2025-05-16 10:40:17 -04:00