top | item 14394448

How we built Tagger News: machine learning on a tight schedule

71 points| var_explained | 8 years ago |varianceexplained.org | reply

8 comments

order
[+] minimaxir|8 years ago|reply
Link to original HN submission: https://news.ycombinator.com/item?id=14337275

It's worth noting for future reference that in terms of supervised learning of labels given a text document input, fasttext (https://github.com/facebookresearch/fastText) is leagues ahead of conventional approaches in both accuracy and training speed, and there is a Python interface (https://github.com/salestock/fastText.py) for use with Django/Flask (unfortunately, recent fasttext changes have broken the interface for now).

[+] snissn|8 years ago|reply
Can you suggest any unsupervised learning? I want to take a body of text associated with users and come up with keywords/topics with each user. Thanks! :)
[+] projectorlochsa|8 years ago|reply
How does fasttext compare to vowpal wabbit?
[+] _eht|8 years ago|reply
I noticed some things were pretty terribly tagged earlier this week. Robots have a ways to go, this is good news.