top | item 26434742 (no title) desku | 5 years ago Here's a paper on how BERT (a large Transformer model trained using self-supervised learning) implicitly learns the traditional NLP pipeline: https://arxiv.org/abs/1905.05950 discuss order hn newest No comments yet.
No comments yet.