WingNews logo WingNews
top | new | best | ask | show | jobs
top | item 46130230

(no title)

logankeenan | 2 months ago

It’s been about a year since I looked into this sort of thing, but molmo will give you x,y coordinates. I hacked together a project about it. I also think Microsoft’s omniparser is good at finding coordinates too.

https://huggingface.co/allenai/Molmo-7B-D-0924

https://github.com/logankeenan/george

https://github.com/microsoft/OmniParser

discuss

order

chhxdjsj|2 months ago

Thanks ill try this!
powered by hn/api // news.ycombinator.com