But can you get this to run in Linux? From the article I can see it's a VSCode extension or something like that but the name kind of implies it's Windows only.
Anyone knows when Microsoft will release a local OCR model officially? I haven't seen anyone talking about this, but the one they're shipping with Snipping Tool (OneOcr) is top-tier and beats everything out there like Tesseract, easyOCR etc.
The model is technically in everyone's Windows installs, but we don't have the C++ projected WinRT headers to use the Microsoft.Windows.Vision library.
That's the weird thing. They seem to have the best hardware for the price, for individuals to develop and use local LLMs, but so far they have been pretty quiet on all this.
Companies of the size of Microsoft usually function more like a country with various companies within, rather than one company with a unified view and unified goals.
So probably, the group wasn't even aware of that technology because it's far away by either professional connection, or by personal/relationship connections, or they knew about it but had another goal than "maximize use of own stuff" and made the call that the tradeoffs wasn't worth it.
Installed this, and it didn't let me do anything. Despite the fact the readme says stuff runs locally, first it asked me to link to my github account, and then all models required me to ask someone (I think meta?) for permission on github or use a huggingface token or whatever.
So I uninstalled it and now my wsl prompt starts with (base) and I don't know how to disable it and all my python scripts are broken because they can't find all the libraries I've installed from pip throughout the years.
[+] [-] blackoil|2 years ago|reply
[+] [-] severino|2 years ago|reply
[+] [-] mmis1000|2 years ago|reply
[+] [-] troupo|2 years ago|reply
[+] [-] Jayakumark|2 years ago|reply
https://marketplace.visualstudio.com/items?itemName=ms-windo...
[+] [-] davikr|2 years ago|reply
The model is technically in everyone's Windows installs, but we don't have the C++ projected WinRT headers to use the Microsoft.Windows.Vision library.
[+] [-] capableweb|2 years ago|reply
Image-to-text models, filtered by "Microsoft": https://huggingface.co/models?pipeline_tag=image-to-text&sor...
[+] [-] andruby|2 years ago|reply
I do hope they are considering going in that direction though.
[+] [-] paradite|2 years ago|reply
[+] [-] TheRoque|2 years ago|reply
[+] [-] m3kw9|2 years ago|reply
[+] [-] lostmsu|2 years ago|reply
[+] [-] Const-me|2 years ago|reply
I wonder why Microsoft helps nVidia, instead of using their own technology?
Here’s an example: https://github.com/Const-me/Cgml
[+] [-] pjmlp|2 years ago|reply
https://blogs.windows.com/windowsdeveloper/2023/11/15/elevat...
https://blogs.windows.com/windowsdeveloper/2023/12/14/direct...
[+] [-] capableweb|2 years ago|reply
So probably, the group wasn't even aware of that technology because it's far away by either professional connection, or by personal/relationship connections, or they knew about it but had another goal than "maximize use of own stuff" and made the call that the tradeoffs wasn't worth it.
[+] [-] lostmsu|2 years ago|reply
[+] [-] DeathArrow|2 years ago|reply
[+] [-] a1o|2 years ago|reply
[+] [-] ourmandave|2 years ago|reply
Coming soon!
=(
[+] [-] franzb|2 years ago|reply
[+] [-] Mashimo|2 years ago|reply
[+] [-] mngdtt|2 years ago|reply
So I uninstalled it and now my wsl prompt starts with (base) and I don't know how to disable it and all my python scripts are broken because they can't find all the libraries I've installed from pip throughout the years.
0/10 would not recommend.
[+] [-] ZunarJ5|2 years ago|reply
Try: conda config --set auto_activate_base false
[+] [-] dazzlefruit|2 years ago|reply