acuozzo | 12 days ago

> an LLM with such little data

There is a mountain of data from before 1905. Certainly enough to train a decent 30B-parameter model.

Now, digitizing & OCRing all of that data... THAT is a challenge.
