top | item 36793160

Petals runs Llama 2 (70B) from Colab at 5 tokens/sec

5 points| borzunov | 2 years ago |github.com

3 comments

borzunov|2 years ago

borzunov|2 years ago

We've moved to a new domain, the chat is now at https://chat.petals.dev

amrb|2 years ago

Great project and I'm happy to see it expand to more models!