This is great. I particularly like that they also automatically generated dirty versions for their training set, because that's exactly what I ended up doing for my dissertation project (a computer vision system [1] that automatically referees Scrabble boards). I also used dictionary analysis and the classifier's own confusion matrix to boost its accuracy.If you're also interested in real time OCR like this, I did a write up [2] of the approach that worked well for my project. It only needed to recognize Scrabble fonts, but it could be extended to more fonts by using more training examples.
[1] http://brm.io/kwyjibo/
[2] http://brm.io/real-time-ocr/
joe_the_user|10 years ago
Can't get [1]https://www.dcs.shef.ac.uk/intranet/teaching/campus/projects...
liabru|10 years ago
JabavuAdams|10 years ago
More generally, I really like the idea of generating controlled synthetic images and then messing them up for regularization.
megalodon|10 years ago
liabru|10 years ago
I've not tried it on anything else, but I remember thinking that it has a lot of potential uses. Also I only used it on gray-scale features, but I'm sure it could make use of full RGB too. I'll have to try it some time!
zem|10 years ago