top | item 36793160

Petals runs Llama 2 (70B) from Colab at 5 tokens/sec

5 points| borzunov | 2 years ago |github.com

3 comments

order

amrb|2 years ago

Great project and I'm happy to see it expand to more models!