throwaway2214 | 3 years ago
You assume they just reproduce the training set, but when they get big enough they start to "understand" things. When the input is really big, the model can never just "guess" the next word in the batch, so it has to "learn" concepts.