vkhuc | 4 years ago | on: Google Dataset Search
vkhuc's comments
vkhuc | 10 years ago | on: Obstacles on the Path to AI
vkhuc | 10 years ago | on: Obstacles on the Path to AI
vkhuc | 10 years ago | on: Curated list of speech and natural language processing resources
vkhuc | 11 years ago | on: SpaCy: Industrial-strength NLP with Python and Cython
Also, it would be great if you included SENNA in your benchmark: http://ml.nec-labs.com/senna/
vkhuc | 11 years ago | on: Nvidia's demo of real-time object recognition using deep learning [video]
vkhuc | 11 years ago | on: Twitter Natural Language Processing
I planned to replace the optimizer in CMU's POS tagger with the one implemented in OpenNLP to make the tagger fully Apache-licensed. Unfortunately, I'm too busy right now. Currently, I'm running the tagger on AWS, so the GPL doesn't hurt me much.
BTW, besides the POS tagger, CMU's TweeboParser depends on Turbo Parser, which is also licensed under the GPL.
vkhuc | 11 years ago | on: Twitter Natural Language Processing
vkhuc | 11 years ago | on: Twitter Natural Language Processing
However, it's possible to remove the GPL libraries from the POS tagger, as mentioned here: https://github.com/brendano/ark-tweet-nlp/blob/master/LICENS...
vkhuc | 11 years ago | on: Twitter Natural Language Processing
vkhuc | 11 years ago | on: Level-Up Your Machine Learning
1) "Data Mining and Analysis: Fundamental Concepts and Algorithms" by Zaki and Meira: http://www.cs.rpi.edu/~zaki/PaperDir/DMABOOK.pdf
This book covers many ML topics with concrete examples.
2) "Computer Vision: Models, Learning, and Inference" by Simon Prince: http://web4.cs.ucl.ac.uk/staff/s.prince/book/book.pdf
Despite being a CV book, its first half reads like a statistics book, with CV examples that are very easy to follow.
vkhuc | 11 years ago | on: Level-Up Your Machine Learning
For the mathematical foundations of ML, I would recommend the book "Understanding Machine Learning: From Theory to Algorithms" by Shai Shalev-Shwartz and Shai Ben-David.
A brief version of the book is available to download on the author's website: http://www.cs.huji.ac.il/~shais/Handouts.pdf
vkhuc | 12 years ago | on: Parsing English with 500 lines of Python (2013)
It's super fast (thanks to C) and very accurate (thanks to its deep learning approach). The license does not permit commercial use, though.
SENNA can be used with NLTK: http://pydoc.net/Python/nltk/2.0.2/nltk.tag.senna/
vkhuc | 12 years ago | on: How slow is Python really? Or how fast is your language?
The learning curve for Scala may be steep, though.
vkhuc | 12 years ago | on: Parsing English with 500 lines of Python (2013)
You can also check out the great online NLP course taught by the author and Prof. Chris Manning from Stanford: https://www.youtube.com/watch?v=nfoudtpBV68&list=PL6397E4B26...