Music2Latent

Encode and decode audio samples to/from compressed representations! Useful for efficient generative modeling applications and for other downstream tasks.

Year: 2024

Website: https://github.com/SonyCSLParis/music2latent

Input types: Audio

Output types: Audio

Output length: Variable / Audio input length

AI Technique: Latent Consistency Model

Dataset: MTG Jamendo and DNS Challenge 4

License type: CC-BY-NC

Real time:

Free:

Open source:

Checkpoints:

Fine-tune:

Train from scratch:

#open-source #free #checkpoints

Guide to using the model

Option to export the model to torchscript to be used in MaxMSP or PureData: https://github.com/jasper-zheng/music2latent-scripted

Edit