top | item 44174261 (no title) attemptone | 9 months ago I checked the BlEU-Score and Perplexity of popular models and both have stagnated around 2021. As a disclaimer this was a cursory check and I didn't dive into the details of how individuals scores were evaluated. discuss order hn newest mountainriver|9 months ago on what benchmarks? pretty much every major one is linear improvement
mountainriver|9 months ago