Long-range transformers in NLP: existing approaches, assumptions and trade-offs (huggingface.co) 1 pts|5 years ago|discuss