(no title)
pico_creator | 1 year ago
This is a full drop in replacement for any transformer model use cases on model sizes 32B and under, as it has equal performance to existing open 32B models in most benchmarks
We are in works on a 70B, which will be a full drop in replacement for most text use cases
lostmsu|1 year ago
pico_creator|1 year ago
swyx|1 year ago
pico_creator|1 year ago
It's definitely something we are tracking to do as well =)