(no title)
atty | 2 years ago
Insanely fast whisper (god I hate the name) is really a CLI around Transformers’ whisper pipeline, so you can just use that and use any of the settings Transformers exposes, which includes beam size.
We also deal with very poor audio, which is one of the reasons we went with faster whisper. However, we have identified failure modes in faster whisper that are only present because of the conditioning on the previous segment, so everything is really a trade off.
sanchit-gandhi|2 years ago
Just call the pipeline with:
result = pipe(sample, generate_kwargs={"num_beams": 5})