1
0
mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00
Files
diffusers/docs/source/en/api/models/autoencoderkl_cogvideox.md
2025-06-19 07:46:01 +05:30

1.4 KiB

AutoencoderKLCogVideoX

The 3D variational autoencoder (VAE) model with KL loss used in CogVideoX was introduced in CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer by Tsinghua University & ZhipuAI.

The model can be loaded with the following code snippet.

from diffusers import AutoencoderKLCogVideoX

vae = AutoencoderKLCogVideoX.from_pretrained("THUDM/CogVideoX-2b", subfolder="vae", torch_dtype=torch.float16).to("cuda")

AutoencoderKLCogVideoX

autodoc AutoencoderKLCogVideoX - decode - encode - all

AutoencoderKLOutput

autodoc models.autoencoders.autoencoder_kl.AutoencoderKLOutput

DecoderOutput

autodoc models.autoencoders.vae.DecoderOutput