I remember attending a conference presentation back in the early 2000's by one of the FFTW authors. They claimed that the mapping between architecture and optimal FT algorithm was complex enough that the only sensible approach was implementing several and empirically determining the best one at runtime.
magicalhippo|2 years ago
blitzar|2 years ago