item 44890898 | briansm | 6 months ago

I believe YouTube still uses 40 mel-scale vectors as feature data, while Whisper uses 80. The larger count gives finer spectral detail but is naturally more computationally intensive to process, though modern hardware allows for that.
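To make the 40 vs 80 comparison concrete, here is a rough sketch of a triangular mel filterbank in plain Python. It is only an illustration: the HTK-style mel formula, the 16 kHz sample rate, the 400-point FFT, and the helper names (`hz_to_mel`, `mel_filterbank`, `mel_frame`) are my own assumptions for the example, not a description of what YouTube or Whisper actually run.

```python
import math

def hz_to_mel(hz):
    # HTK-style mel formula (an assumption; other mel variants exist).
    return 2595.0 * math.log10(1.0 + hz / 700.0)

def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)

def mel_filterbank(n_mels, n_fft=400, sample_rate=16000):
    """Return n_mels triangular filters, each a list of n_fft // 2 + 1 weights."""
    n_bins = n_fft // 2 + 1
    max_mel = hz_to_mel(sample_rate / 2.0)
    # n_mels + 2 edge frequencies, equally spaced on the mel scale.
    edges_hz = [mel_to_hz(max_mel * i / (n_mels + 1)) for i in range(n_mels + 2)]
    # Centre frequency of each FFT bin in Hz.
    bin_hz = [k * sample_rate / n_fft for k in range(n_bins)]
    filters = []
    for m in range(n_mels):
        lo, mid, hi = edges_hz[m], edges_hz[m + 1], edges_hz[m + 2]
        row = []
        for f in bin_hz:
            rising = (f - lo) / (mid - lo)
            falling = (hi - f) / (hi - mid)
            # Triangle: zero outside [lo, hi], peaking at mid.
            row.append(max(0.0, min(rising, falling)))
        filters.append(row)
    return filters

def mel_frame(power_spectrum, filters):
    # One feature vector per audio frame: a weighted sum per mel band.
    return [sum(w * p for w, p in zip(row, power_spectrum)) for row in filters]

if __name__ == "__main__":
    flat_spectrum = [1.0] * (400 // 2 + 1)  # hypothetical flat power spectrum
    for n_mels in (40, 80):
        features = mel_frame(flat_spectrum, mel_filterbank(n_mels))
        print(n_mels, "bands ->", len(features), "features per frame")
```

With these settings each frame yields a vector of 40 or 80 values, so going to 80 bands doubles the feature size per frame that any downstream model has to consume, in exchange for narrower bands.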