mirror of
https://github.com/huggingface/diffusers.git
synced 2026-01-29 07:22:12 +03:00
* why mdx? * why mdx? * why mdx? * no x for kandinksy either --------- Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
1.5 KiB
1.5 KiB
Audio Diffusion
Audio Diffusion is by Robert Dargavel Smith, and it leverages the recent advances in image generation from diffusion models by converting audio samples to and from Mel spectrogram images.
The original codebase, training scripts and example notebooks can be found at teticio/audio-diffusion.
Make sure to check out the Schedulers guide to learn how to explore the tradeoff between scheduler speed and quality, and see the reuse components across pipelines section to learn how to efficiently load the same components into multiple pipelines.
AudioDiffusionPipeline
autodoc AudioDiffusionPipeline - all - call
AudioPipelineOutput
autodoc pipelines.AudioPipelineOutput
ImagePipelineOutput
autodoc pipelines.ImagePipelineOutput
Mel
autodoc Mel