(no title)
chaxor | 1 year ago
As it has been for decades now, the 'Nan' type of answer in NLP is important, adds great capability, and is often glossed over.
chaxor | 1 year ago
As it has been for decades now, the 'Nan' type of answer in NLP is important, adds great capability, and is often glossed over.
bcherry|1 year ago
They don't really describe what "success" would look like but it seems to me like the primary goal is to minimize "incorrect", rather than to maximize "correct". the mini models would get there by maximizing "not attempted" with the larger models having much higher "correct". Then both model sizes could hopefully reach 90%+ "correct" when given access to external lookup tools.