top | item 36901452

(no title)

byt143 | 2 years ago

What tasks?

treprinum|2 years ago

For processing a trillion documents, for example, NER can be done much better.

chaxor|2 years ago

This tradeoff is ridiculous, even if it is "better" by 0.01% F-score. I would much rather have a dataset created in 1 day from BERT at 98% F-score than in 1000 years at 98.01% F-score from a 540B-parameter model, or even a 33B-parameter model. The performance of million-parameter models for NER is still excellent, and they work at speeds that are usable. Running things through OpenAI is also useless, as it would cost a few million dollars.

byt143|2 years ago

It's really depressing that a handful of big corporations will be able to exert such control over labor and productivity.

jerrygenser|2 years ago

Are you literally using a trillion documents? Or are you exaggerating?