Show HN: I taught GPT-OSS-120B to see using Google Lens and OpenCV
43 points| vkaufmann | 19 days ago
The latest feature: google_lens_detect uses OpenCV to find objects in an image, crops each one, and sends them to Google Lens for identification. GPT-OSS-120B, a text-only model with
zero vision support, correctly identified an NVIDIA DGX Spark and a SanDisk USB drive from a desk photo.
Also includes Google Search, News, Shopping, Scholar, Maps, Finance, Weather, Flights, Hotels, Translate, Images, Trends, and more. 17 tools total.
Two commands: pip install noapi-google-search-mcp && playwright install chromium
GitHub: https://github.com/VincentKaufmann/noapi-google-search-mcp
PyPI: https://pypi.org/project/noapi-google-search-mcp/
Booyah!
l1am0|19 days ago
Why do I need gpt-oss-120B at all in this scenario? Couldn't I just directly call e.g. gemini-3-pro api from the python script?
unknown|19 days ago
[deleted]
unknown|18 days ago
[deleted]
reedf1|19 days ago
What part here is the knowing or understanding? Does solving an integral symbolically provide more knowledge than numerically or otherwise?
Understanding the underlying functions themselves and the areas they sweep; has substitution or by-parts, actually provided you with this?
villgax|19 days ago
leumon|19 days ago
vkaufmann|19 days ago
vkaufmann|19 days ago
magic_hamster|19 days ago
But wasn't it Google Lens that actually identified them?
vessenes|19 days ago
vkaufmann|19 days ago
N_Lens|19 days ago
embedding-shape|19 days ago
If something was built by violating TOS' and you use that to do more TOS violations against the ones who initially did the TOS violations to build the thing, do they cancel out each other?
Not about GPT-OSS specifically, but say you used Gemma for the same purpose instead for this hypothetical.
vkaufmann|19 days ago
Easiest and fastest way and the impact is massive
speedgoose|19 days ago
vkaufmann|19 days ago
TZubiri|19 days ago
embedding-shape|19 days ago
tanduv|19 days ago