top | item 41839686

Ichigo: Local real-time voice AI

217 points| egnehots | 1 year ago |github.com

There was an announcement on LocalLLaMA: https://www.reddit.com/r/LocalLLaMA/comments/1g38e9s/ichigol...

There were several links:

- Blog for details: https://homebrew.ltd/blog/llama-learns-to-talk

- Code: https://github.com/homebrewltd/ichigo

- Run locally: https://github.com/homebrewltd/ichigo-demo/tree/docker

- Demo on a single 3090: https://ichigo.homebrew.ltd/

40 comments

order

emreckartal|1 year ago

Emre here from Homebrew Research. It's great to see Ichigo on HN!

A quick intro: We're a Local AI company building local AI tools and training open-source models.

Ichigo is our training method that enables LLMs to understand human speech and talk back with low latency - thanks to FishSpeech integration. It is open data, open weights, and weight initialized with Llama 3.1, extending its reasoning ability.

Plus, we are the creators and lead maintainers of: https://jan.ai/, Local AI Assistant - an alternative to ChatGPT & https://cortex.so/, Local AI Toolkit (soft launch coming soon)

Everything we build and train is done out in the open - we share our progress on:

https://x.com/homebrewltd https://discord.gg/hTmEwgyrEg

You can check out all our products on our simple website: https://homebrew.ltd/

gnuly|1 year ago

any plans to share progress on open channels like matrix.org or even irc?

cassepipe|1 year ago

Finally I can use one of the random facts that have entered my brain for decades now even though I can't remember where my keys are.

If I remember correctly, "ichigo" means strawberry in japanese. You are welcome.

SapporoChris|1 year ago

Sorry, you're wrong. It means 1 5. Just kidding, it is strawberry but it can also be read as one and five. However, it is not fifteen.

d3w3y|1 year ago

There are strawberries all over the readme so I reck you're right.

AtlasBarfed|1 year ago

Getsuga tenshou!!

adammarples|1 year ago

From the book tomorrow and tomorrow and tomorrow?

zarmin|1 year ago

Your keys are in the fridge with the remote control.

greydius|1 year ago

I think it's a bit of word play. 苺 (strawberry) and 一語 (one word) are both read "Ichigo".

thruflo|1 year ago

Great stuff. Voice AI is great to run locally not just for privacy / access to personal data but also because of the low latency requirement. If there's a delay in conversation caused by a network call, it just feels weird, like an old satellite phone call.

tmshapland|1 year ago

This is a really cool project! What have people built with it? I'd love to learn about what local apps people are building on this.

emreckartal|1 year ago

Thanks! We've received feedback on use cases like live translation, safe and untrackable educational tools for kids, and language-learning apps. There are so many possibilities, and hope to see guys building amazing products on top of Ichigo.

famahar|1 year ago

Looks impressive. I'm guessing the demo isn't representative of the full possibilities of this? Tried to have a basic conversation in Japanese and it kept on sticking with English. When it did eventually speak Japanese the pronunciation was completely off. I'm really excited about the possibility of local language learning with near realtime conversation practice. Will keep an eye on this.

mentalgear|1 year ago

Kudos to the team, this is truly impressive work! It's exciting to see how AI connects with the local-first movement, which is also really exploding in popularity. (The idea of local-first, where data processing and functionality are prioritized on users' own devices, aligns perfectly with emerging privacy concerns and the push for decentralization.)

Bringing AI into this space enhances user experience while respecting their autonomy over data. It feels like a promising step toward a future where we can leverage the power of AI without compromising on privacy or control. Really looking forward to seeing how this evolves!

cchance|1 year ago

its amazing to see cool projects like this really REALLY based in opensource and open training like this wow

emreckartal|1 year ago

Thanks! It's all open research, source code, data, and weights.

frankensteins|1 year ago

Great initiative! before adding more comments, I'm trying to set up on my local Mac M3 machine. I'm having a hard time to install dependencies. Anyone here have the same issue?

emreckartal|1 year ago

Thanks! You can't run Ichigo on a Mac M3 just yet. It'll be possible to run it locally on Mac once we integrate it with Jan.ai

lostmsu|1 year ago

Very cool, but a bit less practical than some alternatives because it does not seem to do request transcription.

emreckartal|1 year ago

Actually, it does. You can turn on the transcription feature from the bottom right corner and even type to Ichigo if you want. We didn’t show it in the launch video since we were focusing on the verbal interaction side of things.

p0larboy|1 year ago

Tried demo but all I got was "I'm sorry, I can't quite catch that".