AI Music Generation - Model Explorer

Name:

Description: Open source text-to-audio model for generating samples and sound effects from text descriptions. The model enables audio variations and style transfer of audio samples. The creators claim it is ideal for creating drum beats, instrument riffs, ambient sounds, foley recordings and other audio samples for music production and sound design. Generates stereo audio at 44.1kHz.

Year:

Website:

Input types:

Output types:

Output length:

Technology:

Dataset:

License type:

Has real time inference:

Is free:

Is open source:

Are checkpoints available:

Can finetune:

Can train from scratch:

Tags:

Guide: ### Running the model The Stable Audio Open model weights are available on [Hugging Face](https://huggingface.co/stabilityai/stable-audio-open-1.0). The Hugging Face page provides instructions and example Python scripts demonstrating how to use the model locally. You can also use the readily-available Jupyter notebooks and immediately try the model in [Google Colab](https://huggingface.co/stabilityai/stable-audio-open-1.0/colab) or [Kaggle](https://huggingface.co/stabilityai/stable-audio-open-1.0/kaggle). ### Fine-tuning Stable Audio Open Here is a comprehensive video tutorial on fine-tuning the model, contributed by an active community member lyraaaa: [Finetuning Stable Audio Open on YouTube](https://www.youtube.com/watch?v=ex4OBD_lrds). lyraaaa's Jupyter notebook for fine-tuning the model is available on [Google Drive](https://drive.google.com/file/d/1EG2faHovvfU6SJyn-3tl9dKUZLASrwri/view) and can either be dowloaded for running locally or opened directly in Google Collab. ### Community There is a lively community of practitioners using the model, communicating on a dedicated [Discord server](https://discord.gg/stablediffusion). The authors of the model who work for Stability.ai are active there as well and frequently join discussions and answer questions. ### Running Stable Audio Open in MaxMSP/PureData This is an option to export the autoencoder in Stable Audio Open 1.0 for realtime continuous inference in MaxMSP/PureData: [Streaming Stable Audio Open](https://github.com/jasper-zheng/streamable-stable-audio-open). This field renders Markdown

Captcha: captcha

Edit Stable Audio Open