(no title)
lgbr | 2 years ago
However, given that the usefulness of chatbots depends more on the model being used, what I would find a lot more useful is a ranking of the various models that are available. Currently I'm having to rely on comments on the internet to find out if Alpaca 7B or LlaMA 65B is genuinely productive to use. As new models come out, I'd love it if I knew how well it tells jokes, answers complicated questions, or generates code.
LASR|2 years ago
Short answer: none of them do as well as the OG Davinci-003. Not even close. Even the 3.5 Turbo models from OpenAI don’t do as well.
We throw some sophisticated prompts at them to attempt chain of thought reasoning.
WinstonSmith84|2 years ago
inciampati|2 years ago
simonw|2 years ago
dr_dshiv|2 years ago
dotancohen|2 years ago
This actually sounds fascinating. Not unlike birdwatching! ))
joenot443|2 years ago