Name:
Description: SongGen is an open-source music generation model. The user supplies lyrics and a text description, plus an optional voice clip of about 3 seconds for voice cloning. SongGen outputs the generated music as an audio file of up to 30 seconds.
Year:
Website:
Input types: Audio MIDI Text None Genre Metadata Image
Output types: Audio MIDI
Output length:
Technology: Not Specified Latent Consistency Model Latent Diffusion LSTM VAE Sequence-to-sequence neural network Transformer Suite of AI tools Diffusion Hierarchical Recurrent Neural Network (RNN) Autoregressive Convolutional Neural Network
Dataset:
License type:
Has real time inference: Yes / No / Not known
Is free: Yes / No / Yes and No, depending on the plan / Not known
Is open source: Yes / No / Not known
Are checkpoints available: Yes / No / Not known
Can finetune: Yes / No / Not known
Can train from scratch: Yes / No / Not known
Tags: text-to-audio MIDI text-prompt small-dataset open-source low-resource free checkpoints proprietary no-input image-to-audio API
Guide: To run the model locally, follow the steps given [here](https://github.com/LiuZH-19/SongGen).