top | item 46040884

(no title)

ArcaneMoose | 3 months ago

I've been really impressed with this model specifically because of how insanely cheap it is: https://replicate.com/ibm-granite/granite-vision-3.3-2b

I didn't expect IBM to be making relevant AI models but this thing is priced at $1 per 4,000,000 output tokens... I'm using it to transcribe handwritten input text and it works very well and super fast.

discuss

order

rubikscubeguy|3 months ago

I'm the dev who made this:) We are looking into adding granite!

intalentive|3 months ago

IBM and Nvidia speech to text models are also SOTA (according to HF leaderboard) and relatively lightweight. Replicate hosts those too, although some (like Parakeet) run easily on consumer GPU.

nicman23|3 months ago

English only :( . it seems only 2 orders of magnitude larger models have support for ie greek :/

irjustin|3 months ago

Thanks for this! Will test this model out because we do a lot of in between steps to get around the output token limits.

Super nice if it worked for our use case to simply get full output.