diffusers

mirror of https://github.com/huggingface/diffusers.git synced 2026-01-29 07:22:12 +03:00

Author	SHA1	Message	Date
Steven Liu	cc5b31ffc9	[docs] Migrate syntax (#12390 ) * change syntax * make style	2025-09-30 10:11:19 -07:00
Aryan	a4df8dbc40	Update more licenses to 2025 (#11746 ) update	2025-06-19 07:46:01 +05:30
Quentin Gallouédec	c8bb1ff53e	Use HF Papers (#11567 ) * Use HF Papers * Apply style fixes --------- Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com>	2025-05-19 06:22:33 -10:00
SahilCarterr	6da6406529	[Fix] broken links in docs (#10434 ) * Fix broken links in docs * fix parenthesis	2025-01-06 10:07:38 -08:00
Tolga Cangöz	468ae09ed8	Errata - Trim trailing white space in the whole repo (#8575 ) * Trim all the trailing white space in the whole repo * Remove unnecessary empty places * make style && make quality * Trim trailing white space * trim --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2024-06-24 18:39:15 +05:30
Nguyễn Công Tú Anh	56a76082ed	Add AudioLDM2 TTS (#5381 ) * add audioldm2 tts * change gpt2 max new tokens * remove unnecessary pipeline and class * add TTS to AudioLDM2Pipeline * add TTS docs * delete unnecessary file * remove unnecessary import * add audioldm2 slow testcase * fix code quality * remove AudioLDMLearnablePositionalEmbedding * add variable check vits encoder * add use_learned_position_embedding --------- Co-authored-by: Dhruv Nair <dhruv.nair@gmail.com>	2024-04-08 10:11:24 +05:30
Sayak Paul	30e5e81d58	change to 2024 in the license (#6902 ) change to 2024	2024-02-08 08:19:31 -10:00
M. Tolga Cangöz	8092017d3f	[`Docs`] Fix typos and update files at API's Pipelines page 1 (#5744 ) * Fix typos, update, add Copyright info, and trim trailing whitespace * Update alt_diffusion.md * Remove nonoperational demo * Update docs/source/en/api/pipelines/consistency_models.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> * Update docs/source/en/api/pipelines/latent_consistency_models.md Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com> --------- Co-authored-by: Steven Liu <59462357+stevhliu@users.noreply.github.com>	2023-11-14 10:36:20 -08:00
Steven Liu	6b06c30a65	[docs] Fix links (#5499 ) fix links	2023-10-23 20:39:29 +02:00
Sanchit Gandhi	24c5e7708b	[AudioLDM2] Doc fixes (#4739 ) * [AudioLDM2] Doc fixes * update docstrings * fix unet docstring * Apply suggestions from code review Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com> --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>	2023-08-24 07:20:27 +05:30
Sanchit Gandhi	7a24977ce3	Add AudioLDM 2 (#4549 ) * from audioldm * unet down + mid * vae, clap, flan-t5 * start sequence audio mae * iterate on audioldm encoder * finish encoder * finish weight conversion * text pre-processing * gpt2 pre-processing * fix projection model * working * unet equivalence * finish in base * add unet cond * finish unet * finish custom unet * start clean-up * revert base unet changes * refactor pre-processing * tests: from audioldm * fix some tests * more fixes * iterate on tests * make fix copies * harden fast tests * slow integration tests * finish tests * update checkpoint * update copyright * docs * remove outdated method * add docstring * make style * remove decode latents * enable cpu offload * (text_encoder_1, tokenizer_1) -> (text_encoder, tokenizer) * more clean up * more refactor * build pr docs * Update docs/source/en/api/pipelines/audioldm2.md Co-authored-by: Sayak Paul <spsayakpaul@gmail.com> * small clean * tidy conversion * update for large checkpoint * generate -> generate_language_model * full clap model * shrink clap-audio in tests * fix large integration test * fix fast tests * use generation config * make style * update docs * finish docs * finish doc * update tests * fix last test * syntax * finalise tests * refactor projection model in prep for TTS * fix fast tests * style --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>	2023-08-21 12:34:21 +01:00

11 Commits