kenarsa
|
7 months ago
|
on: Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model
[flagged]
kenarsa
|
7 months ago
|
on: Show HN: Kitten TTS – 25MB CPU-Only, Open-Source TTS Model
kenarsa
|
1 year ago
kenarsa
|
1 year ago
|
on: The importance of initialization and momentum in deep learning [pdf]
Reflecting on Ilya Sutskever's early contribution to DL: this paper saved my job a decade ago.
kenarsa
|
1 year ago
|
on: Why is WebAssembly not supporting 256/512 SIMD registers?
Every CPU with AVX/AVX2 has 256-bit SIMD registers, and Arm also has non-NEON registers, which are 256/512 bits wide. Of course, this requires runtime detection, but that is OK because people need to do that even for native code.
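To illustrate the runtime detection mentioned above, here is a minimal sketch of widest-first dispatch. It assumes a Linux host that exposes CPU features through /proc/cpuinfo; the preference table and the function names `read_cpu_flags` and `pick_simd_width` are hypothetical, not taken from any WebAssembly runtime or library.

```python
from pathlib import Path

# Widest-first preference list: (cpuinfo flag, register width in bits).
# Flag names follow Linux /proc/cpuinfo; the table itself is an assumption
# made for illustration and is not exhaustive.
PREFERENCE = [
    ("avx512f", 512),
    ("avx2", 256),
    ("avx", 256),
    ("sse2", 128),
    ("asimd", 128),  # NEON on AArch64
]


def read_cpu_flags(path="/proc/cpuinfo"):
    """Return the set of CPU feature flags, or an empty set if unavailable."""
    try:
        text = Path(path).read_text()
    except OSError:
        return set()
    flags = set()
    for line in text.splitlines():
        key, _, value = line.partition(":")
        # x86 kernels label the line "flags", Arm kernels label it "Features".
        if key.strip().lower() in ("flags", "features"):
            flags.update(value.split())
    return flags


def pick_simd_width(flags):
    """Return the widest register width found in flags, or 0 for a scalar fallback."""
    for flag, bits in PREFERENCE:
        if flag in flags:
            return bits
    return 0


if __name__ == "__main__":
    print(pick_simd_width(read_cpu_flags()))
```

A native library would do the same check once at startup and then bind a function pointer to the matching kernel, so a wider-SIMD WebAssembly proposal would presumably need a comparable feature query plus a 128-bit fallback.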
kenarsa
|
1 year ago
|
on: How Many Languages a Developer Should Know?
"Know" is a vague word. Everyone should know one language inside out, well enough to code in Notepad without IDE support. I think if you know three languages really well, that puts you in the top 1% of devs. I would go for C, Python, and JS.
kenarsa
|
1 year ago
|
on: OpenAI Launches Voice Assistant Inspired by Hollywood Vision of AI
It's interesting to see the cycle. Alexa said everyone wanted to toggle the lights and get the weather forecast. Hence, they folded. Now, we are back with "stronger" tech and want people to change their behavior.
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
Have you noticed that the board [1] has no connectivity chip? If I had a way to connect to the internet without the required chip, I would have a better story to tell. Your snippet of the FAQ is correct for all other platforms we support aside from microcontrollers ...
[1] https://www.st.com/en/evaluation-tools/stm32f4discovery.html
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
Picovoice runs on almost anything: web browsers, mobile, desktop, single-board computers, and microcontrollers. For the platforms that have connectivity (i.e. almost anything aside from microcontrollers), we do call home for license management. This helps us keep the `Free Tier` free for personal users, hackers, and skunkworks projects, while making sure we get paid by enterprise customers with deployments at scale [1]. On a microcontroller like the one in this tutorial, there is NO connectivity option. Hence, in this specific case it is 100% offline with no license management. In other cases, voice recognition is 100% offline, but the call home for license management needs connectivity.
[1] https://picovoice.ai/pricing/
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
Would love to check out what you build with it :) I've been constantly and pleasantly surprised by what people can build given this tech.
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
Yeah, I post about the company that I founded, and I am super into it. Which part of this is MISLEADING? The fact that I care, or are you saying Picovoice's tech doesn't work? I made the latter easy to check, because you can now go and try it without me in your way. Your comment is misleading.
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
Have you noticed that there is a `Free Tier` that costs you $0? You can train using that. For this tutorial, the cost of the board ($20) is all you need to pay. The same goes for personal projects and even small skunkworks projects within companies. Picovoice makes money from large-scale deployments done by device makers [1].
[1] https://picovoice.ai/pricing/
kenarsa
|
3 years ago
|
on: Offline Voice Assistant on a Microcontroller with 192KB RAM
That board doesn't have connectivity! If we had a way to connect to the internet without a connectivity chip I would have had a more exciting post!
kenarsa
|
4 years ago
|
on: DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
kenarsa
|
4 years ago
|
on: DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
Mycroft already integrated our wake word engine (Porcupine). We remain neutral, and anyone can use our tech :)
kenarsa
|
4 years ago
|
on: DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
Clarification: Google STT has two services, standard and enhanced. We are slightly better than enhanced and much better than standard. Enhanced is much pricier than standard, if you wonder what the difference is.
It runs in real time on NVIDIA Jetson Nano and RPi 3/4.
If you think we should consider other embedded platforms, we would love to hear which ones and why.
kenarsa
|
4 years ago
|
on: DeepSpeech 60x Smaller, 9x faster, and 2x accuracy
Thank you for the clarification. The `[1]` and `[2]` hyperlinks in the intro section point to the benchmark.