top | item 46959555 (no title) explosion-s | 20 days ago Just curious, is there any smaller version of this model capable of running on edge devices? Even my Mac M1 with 8gb ram couldn't run the C version. discuss order hn newest guskel|20 days ago This semi-quantized version targets the Jetson Orin Nano, but only comes with a simple inference engine.https://huggingface.co/Teaspoon-AI/Voxtral-Mini-4B-INT4-Jets... sofixa|20 days ago https://kyutai.org/stt has an implementation for MLX and mentions iPhones, so it should work on edge devices, Macs and iPhones. adefa|19 days ago I'm curious to see if you are able to run the model now from the CLI?
guskel|20 days ago This semi-quantized version targets the Jetson Orin Nano, but only comes with a simple inference engine.https://huggingface.co/Teaspoon-AI/Voxtral-Mini-4B-INT4-Jets...
sofixa|20 days ago https://kyutai.org/stt has an implementation for MLX and mentions iPhones, so it should work on edge devices, Macs and iPhones.
guskel|20 days ago
https://huggingface.co/Teaspoon-AI/Voxtral-Mini-4B-INT4-Jets...
sofixa|20 days ago
adefa|19 days ago