davej|1 year ago
Llama 3 is tuned very nicely for English answers. What surprises me most is that the 8B model performs similarly to Mistral's large model and the original GPT-4 model (in English answers). Easily the most efficient model currently available.
swalsh|1 year ago
I suspect the future is going to be owned by lots of smaller, more specialized models, possibly trained by much larger models.
These smaller models have the advantage of faster and cheaper inference.
theLiminator|1 year ago