top | item 44711348

(no title)

conjecTech | 7 months ago

Very nice work. Training these from scratch is a big undertaking.

- Did you train the encoder & decoder together or separately? It would be nice to have the encoder representation be compatible with the existing whisper implementation since it would mean you could swap your implementation into models where its used as a component, like in the recent Voxtral model. I'd imagine it also might make training a bit faster as well.

- Did you consider training the turbo model as well?

discuss

order

No comments yet.