(no title)
noahkay13 | 3 days ago
What it does: - Runs 7 model families: offline transcription (CTC, RNNT, TDT, TDT-CTC), streaming (EOU, Nemotron), and speaker diarization (Sortformer) - Word-level timestamps - Streaming transcription from microphone input - Speaker diarization detecting up to 4 speakers
aaronbrethorst|3 days ago
noahkay13|2 days ago
computerex|2 days ago
pdyc|2 days ago