WingNews

d4rkp4ttern|13 days ago

Big fan of handy and it’s cross platform as well. Parakeet V3 gives the best experience with very fast and accurate-enough transcriptions when talking to AIs that can read between the lines. It does have stuttering issues though. My primary use of these is when talking to coding agents.

But a few weeks ago someone on HN pointed me to Hex, which also supports Parakeet-V3 , and incredibly enough, is even faster than Handy because it’s a native MacOS-only app that leverages CoreML/Neural Engine for extremely quick transcriptions. Long ramblings transcribed in under a second!

It’s now my favorite fully local STT for MacOS:

https://github.com/kitlangton/Hex

Barbing|13 days ago

I installed a few different STT apps at the same time that used Parakeet and I think they disagreed with each other. But Hex otherwise would’ve won for me I think. Wanna reformat the Mac & try again (been a while anyway).

My comment on this from a month back: https://news.ycombinator.com/item?id=46637040

james2doyle|13 days ago

I was having the same journey but landed on https://github.com/hoomanaskari/mac-dictate-anywhere

zachlatta|14 days ago

I just learned about Handy in this thread and it looks great!

I think the biggest difference between FreeFlow and Handy is that FreeFlow implements what Monologue calls "deep context", where it post-processes the raw transcription with context from your currently open window.

This fixes misspelled names if you're replying to an email / makes sure technical terms are spelled right / etc.

The original hope for FreeFlow was for it to use all local models like Handy does, but with the post-processing step the pipeline took 5-10 seconds instead of <1 second with Groq.

sipjca|13 days ago

There's an open PR in the repo which will be merged which adds this support. Post processing is an optional feature if you want to use it, and when using it, end to end latency can still be under 3 seconds easily

lemming|13 days ago

Could you go into a little more detail about the deep context - what does it grab, and which model is used to process it? Are you also using a groq model for the transcription?

k9294|13 days ago

You can try ottex for this use case - it has both context capture (app screenshots), native LLMs support, meaning it can send audio AND screenshot directly to gemini 3 flash to produce the bespoke result.

stavros|14 days ago

As a very happy Handy user, it doesn't do that indeed. It will be interesting to see if it works better, I'll give FreeFlow a shot, thanks!

hendersoon|14 days ago

Yes, I also use Handy. It supports local transcription via Nvidia Parakeet TDT2, which is extremely fast and accurate. I also use gemini 2.5 flash lite for post-processing via the free AI studio API (post-processing is optional and can also use a locally-hosted LM).

jimmySixDOF|13 days ago

I didn't try Handy but been using Whisper-Key its super simple get out of your way all local single file executable (portable so zero install too) -- thats for Windows idk about the Mac version

[1] https://github.com/PinW/whisper-key-local

hackernewds|13 days ago

the astroturfing here off topic of op post is unbearable

vogtb|14 days ago

Handy rocks. I recently had minor surgery on my shoulder that required me to be in a sling for about a month, and I thought I'd give Handy a try for dictating notes and so on. It works phenomenally well for most text-to-speech use cases - homonyms included.

gurjeet|13 days ago

Thanks for the recommendation! I picked the smallest model (Moonshine Base @ 58MB), and it works great for transcribing English.

Surprisingly, it produced a better output (at least I liked its version) than the recommended but heavy model (Parakeet V3 @ 478 MB).

sipjca|13 days ago

Great feedback :) also support for the v2 versions of the moonshine models should be out today!

arach|13 days ago

Handy's great! I find the latency to be just a bit too much for my taste. Like half the people on this thread, built my own but with a bit more emphasis on speed

https://usetalkie.com

odiroot|13 days ago

Not sure if it's just me but Handy crashes on my Arch setup. Never mind which version I run. Could be something with Wayland or Pipewire but didn't see anything obvious in the logs.

arach|13 days ago

https://github.com/goodroot/hyprwhspr have you tried this? I have a nice 64GB new linux machine waiting to be set up for me to kick the tires on this

pretty sure it's awesome - sorry OP about mentioning another project, we're all learning here :)

irrationalfab|13 days ago

Handy is genuinely great and it supports Parakeet V3. It’s starting to change how I "type" on my computer.

stavros|14 days ago

I use handy as well, and love it.

smcleod|13 days ago

Handy is nothing short of fantastic, really brilliant when combined with Parakeet v2!

(no title)

discuss