Stability AI, a company mostly known for AI-generated visuals, launched a text-to-audio generative AI platform called Stable Audio.
Stable Audio uses a diffusion model, the same AI model that powers the company’s more popular image platform, Stable Diffusion, but trained with audio rather than images. Users can use it to generate songs or background audio for any project.
Audio diffusion models tend to generate a fixed length of audio, which is terrible for music production as songs can vary in length. Stability AI’s new platform lets users make sounds at different lengths, requiring the company to train on music and add text metadata around a song’s start and end time.
Previously, audio taught on a 30-second clip can only generate 30…