Show HN: Gemini Cursor – A Multimodal AI Cursor for Your Desktop (Open Source)
22 points| 13point5 | 1 year ago |github.com
It leverages Gemini 2.0 Flash and Google's live multimodal API to analyze what's on screen and provide real-time assistance.
In this demo, my friend tries to add a payment method to Amazon, and the AI cursor walks them through the entire process with visual cues and spoken instructions.
I've also used it to interpret diagrams from research papers—curious to see what other use cases people find this useful for!
wifipunk|1 year ago
Hope you keep working on the project. Different cursor settings for size, shape, and color would be nice.
jcmp|1 year ago