diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Daniel Gu	ef199118e2	Point original checkpoint to LTX 2.0 official checkpoint	2026-01-06 06:35:51 +01:00
Sayak Paul	93a417f24a	Tests for T2V and I2V (#6 ) * add ltx2 pipeline tests. * up * up * up * up * remove content * style * Denormalize audio latents in I2V pipeline (analogous to T2V change) * Initial refactor to put video and audio text encoder connectors in transformer * Get LTX 2 transformer tests working after connector refactor * up * up * i2v tests. * up * Address review comments * Calculate RoPE double precisions freqs using torch instead of np * Further simplify LTX 2 RoPE freq calc * revert unneded changes. * up * up * update to split style rope. * up --------- Co-authored-by: Daniel Gu <dgu8957@gmail.com>	2026-01-06 08:05:30 +05:30
Sayak Paul	c5b52d6c9f	address initial feedback from lightricks team (#16 ) * cross_attn_timestep_scale_multiplier to 1000 * implement split rope type. * up * propagate rope_type to rope embed classes as well. * up	2026-01-05 21:13:10 +05:30
Sayak Paul	0be4f31620	up (#19 )	2026-01-05 21:13:01 +05:30
dg845	caae16768a	Move Video and Audio Text Encoder Connectors to Transformer (#12 ) * Denormalize audio latents in I2V pipeline (analogous to T2V change) * Initial refactor to put video and audio text encoder connectors in transformer * Get LTX 2 transformer tests working after connector refactor * precompute run_connectors,. * fixes * Address review comments * Calculate RoPE double precisions freqs using torch instead of np * Further simplify LTX 2 RoPE freq calc * Make connectors a separate module (#18) * remove text_encoder.py * address yiyi's comments. * up * up * up * up --------- Co-authored-by: sayakpaul <spsayakpaul@gmail.com>	2026-01-05 20:11:13 +05:30
Daniel Gu	6a236a27fb	Merge branch 'ltx-2-transformer' into make-scheduler-consistent	2025-12-30 20:25:59 +01:00
Sayak Paul	280e347814	Refactor Audio VAE to be simpler and remove helpers (#7 ) * remove resolve causality axes stuff. * remove a bunch of helpers. * remove adjust output shape helper. * remove the use of audiolatentshape. * move normalization and patchify out of pipeline. * fix * up * up * Remove unpatchify and patchify ops before audio latents denormalization (#9) --------- Co-authored-by: dg845 <58458699+dg845@users.noreply.github.com>	2025-12-30 08:05:56 +05:30
Daniel Gu	581f21c431	Make LTX 2.0 scheduler more consistent with original code	2025-12-29 23:44:52 +01:00
Daniel Gu	b5891b19b1	Get LTX 2 T2V pipeline to produce reasonable outputs	2025-12-24 06:07:38 +01:00
Daniel Gu	595f485ad8	LTX 2.0 scheduler and full pipeline conversion	2025-12-23 07:41:28 +01:00
Daniel Gu	ae3b6e7cc2	Merge branch 'ltx-2-transformer' into ltx-2-t2v-pipeline	2025-12-23 02:59:33 +01:00
Daniel Gu	d303e2a6ff	Conversion script for LTX 2.0 Audio VAE Decoder	2025-12-23 02:48:15 +01:00
Daniel Gu	0028955c37	Initial LTX 2.0 text encoder implementation	2025-12-22 10:06:01 +01:00
Daniel Gu	c6a11a5530	Initial LTX 2.0 vocoder implementation	2025-12-19 12:17:10 +01:00
Daniel Gu	a748975a7c	Get diffusers implementation on par with official LTX 2.0 video VAE implementation	2025-12-19 07:02:38 +01:00
Daniel Gu	baf23e2da3	Explicitly specify temporal and spatial VAE scale factors when converting	2025-12-17 11:14:45 +01:00
Daniel Gu	269cf7b40d	Initial implementation of LTX 2.0 video VAE	2025-12-17 10:51:34 +01:00
Daniel Gu	57a8b9c330	Allow LTX 2 transformer to be loaded from local path for conversion	2025-12-16 10:38:03 +01:00
Daniel Gu	a5f2d2da6c	Initial script to convert LTX 2 transformer to diffusers	2025-12-15 07:09:42 +01:00

19 Commits