top | item 40145750 Show HN: Bark.cpp, fast TTS model for multilingual realistic audio generation 3 points| el_pa_b | 1 year ago |github.com 3 comments order hn newest el_pa_b|1 year ago Hello!I ported Suno AI's Bark text-to-speech model in C/C++ to allow fast, realistic, multilingual audio generation on the CPU.Generating a 5-second audio with vanilla Bark takes 1 minute on a M1 Pro CPU. Using my port in C++ with ggml, it goes down to 15 seconds.I aim to bring it down to a second to allow on-device real-time audio generation. jilijeanlouis|1 year ago Congrats that's pretty impressive ! load replies (1)
el_pa_b|1 year ago Hello!I ported Suno AI's Bark text-to-speech model in C/C++ to allow fast, realistic, multilingual audio generation on the CPU.Generating a 5-second audio with vanilla Bark takes 1 minute on a M1 Pro CPU. Using my port in C++ with ggml, it goes down to 15 seconds.I aim to bring it down to a second to allow on-device real-time audio generation. jilijeanlouis|1 year ago Congrats that's pretty impressive ! load replies (1)
el_pa_b|1 year ago
I ported Suno AI's Bark text-to-speech model in C/C++ to allow fast, realistic, multilingual audio generation on the CPU.
Generating a 5-second audio with vanilla Bark takes 1 minute on a M1 Pro CPU. Using my port in C++ with ggml, it goes down to 15 seconds.
I aim to bring it down to a second to allow on-device real-time audio generation.
jilijeanlouis|1 year ago