Encode and decode audio samples to/from compressed representations! Useful for efficient generative modeling applications and for other downstream tasks.
Year: 2024
Website: https://github.com/SonyCSLParis/music2latent
Input types: Audio
Output types: Audio
Output length: Variable / Audio input length
AI Technique: Latent Consistency Model
Dataset: MTG Jamendo and DNS Challenge 4
License type: CC-BY-NC
Real time:
Free:
Open source:
Checkpoints:
Fine-tune:
Train from scratch:
Option to export the model to torchscript to be used in MaxMSP or PureData: https://github.com/jasper-zheng/music2latent-scripted