Never had the pleasure of a Sennheiser but when working in radio I got
my hands on a lot of rack vocoders for doing branding, stings and
idents. Funny how the number 9000 comes up a lot, like Roland VP9000
and Eventide H9000. 80 and 90s vintage ones like Korg VC-10 or
Elektronik EM-26 had unique sounds, but tbh the modern digital
recreations are amazing models. There's not a world of difference
between vocoding, autotune, shifting, harmonising etc once you realise
how all the fx are now based in FFT, convolution etc - just different
variations on processing and control graphs - and so it's fun to
create your own vocal effects in things like
Max/MSP/PureData. Technically there's a distinction between "effects"
and "processing" in terms of how much of the direct (parallel) signal
is put through. Chers Believe is a yardstick for "effect", whereas a
lot of what I hear with Daft Punk (and Air, Kraftwerk) is quite
heavily processed as to disguise the original voice entirely - just
letting a bit of top/sibilant through to define the stops and
fricatives.
TheOtherHobbes|10 months ago
The shape of the filters, the smoothing between the filters and the synthesis section, and (on some models) the patchability all create a very different result.
The reason the best analog vocoders are so expensive is because the filter for each band is much more complex than a plain old bandpass filter, with a much higher component count. Typically there's a flatter passband and a steeper slope than you'd expect.
You can do digital convolution with thousands of bins and it sounds nothing like analog vocoding. It's much cleaner, doesn't have those lovely harmonically spaced filter resonances, and creates sounds that can feel more acoustic than electronic.
marcedwards|10 months ago
Oh, it also might be of interest that the IVL algorithm isn’t FFT-based. I think their harmonizers sound better than the rest, so maybe FFT isn’t the best way to go.
jschafer|10 months ago
If you look at the code of (phone/voice) codecs GSM/Speex/Opus you can see that you can estimate the spectral envelope (or the configuration of a physical tube model for the vocal tract) in time domain with linear prediction coefficients (LPC).
And it is simple, e.g. the often used Levinson-Durbin algorithm is just 22 lines of C code. It is an interesting exercise to build your own vocoder from scratch that fits in a single screen page.
Many of the code snippets I have seen (which likely have already processed your voice) are just translations of the Fortran code of the book "Linear Prediction of Speech" by Markel and Gray (1976).
nonrandomstring|10 months ago
Has anyone got more details?