Docs
.to('cuda')
.enable_model_cpu_offload()
Remove .to('cuda') before cpu_offload, trim trailing whitespaces