Toward a Sparse Interpretable Audio Codec

19 points | cochlear | 1 year ago | blog.cochlea.xyz

2 comments

rasz|1 year ago

An article about an audio codec with no mention of compression rates, and the "speech results" audio samples are presented as animated gifs.

cochlear|1 year ago

These are both great points and I'll use them to refine my writing on the subject. I appreciate the feedback!

Apologies if it isn't clear, but the animated gifs are meant as an illustration of the iterative encoding process, where the encoder decomposes the signal step-by-step, as in matching pursuit. I'll be sure to clarify that point.
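For anyone unfamiliar with matching pursuit, here is a minimal sketch of the step-by-step decomposition in plain Python. It is not the codec from the article: the dictionary of unit-norm sine atoms, the toy signal, and the names `matching_pursuit` and `unit` are all assumptions made up for illustration. Each iteration picks the atom most correlated with the residual, records it as an event, and subtracts its contribution.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def unit(v):
    """Scale a vector to unit norm."""
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def matching_pursuit(signal, atoms, n_steps):
    """Greedy sparse decomposition. Atoms are assumed to be unit-norm."""
    residual = list(signal)
    events = []  # (atom_index, coefficient) pairs, one per iteration
    for _ in range(n_steps):
        # Correlate every atom with what is left of the signal.
        scores = [dot(residual, atom) for atom in atoms]
        best = max(range(len(atoms)), key=lambda i: abs(scores[i]))
        coeff = scores[best]
        if abs(coeff) < 1e-12:
            break  # nothing meaningful left to explain
        events.append((best, coeff))
        # Remove the chosen atom's contribution from the residual.
        residual = [r - coeff * a for r, a in zip(residual, atoms[best])]
    return events, residual

# Hypothetical toy dictionary: eight sine atoms over 64 samples.
n = 64
atoms = [unit([math.sin(2 * math.pi * k * t / n) for t in range(n)])
         for k in range(1, 9)]

# Toy signal built from two atoms, so two events should explain it.
signal = [3.0 * a + 1.5 * b for a, b in zip(atoms[1], atoms[4])]

events, residual = matching_pursuit(signal, atoms, n_steps=4)
for step, (idx, coeff) in enumerate(events):
    print(f"step {step}: atom {idx}, coefficient {coeff:.3f}")
```

Each frame of an animation like the ones in the post would correspond to one pass through that loop: one more event added, a bit less energy left in the residual.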

I'll add a paragraph on compression rates/ratios, although that isn't necessarily the main focus here; codecs may compress a signal, but they might also transform it into a more useful, easy-to-understand and easy-to-manipulate representation.