top | item 36389302

(no title)

fagerhult | 2 years ago

Here is the MusicGen paper from Facebook research: https://arxiv.org/abs/2306.05284

MusicGen is an LLM on top of EnCodec tokens, instead of working directly with audio. EnCodec is neural audio compression algorithm that encodes audio as tokens from a codebook. It's a really clever trick!

discuss

order