top | item 29455559 (no title) rajansaini | 4 years ago You should check out the VOLT paper, I think it would work well. It's a new technique for splitting up a vocabulary into subwords while minimizing entropy. These subwords could then be mixed and matched, maybe by a neural model, for better results. discuss order hn newest lioeters|4 years ago Thank you for the reference. To save others a search, I believe this is the paper:Vocabulary Learning via Optimal Transport for Neural Machine Translation - https://arxiv.org/abs/2012.15671https://jingjing-nlp.github.io/volt-blog/https://github.com/Jingjing-NLP/VOLT
lioeters|4 years ago Thank you for the reference. To save others a search, I believe this is the paper:Vocabulary Learning via Optimal Transport for Neural Machine Translation - https://arxiv.org/abs/2012.15671https://jingjing-nlp.github.io/volt-blog/https://github.com/Jingjing-NLP/VOLT
lioeters|4 years ago
Vocabulary Learning via Optimal Transport for Neural Machine Translation - https://arxiv.org/abs/2012.15671
https://jingjing-nlp.github.io/volt-blog/
https://github.com/Jingjing-NLP/VOLT