I think it must be getting slammed; I was able to get a couple of descriptions out of it, but that was balanced by probably 2 times as many instances of the above error.
Rekognition API released similar image to text API and it's much more reliable than this. At least the demo works smooth and response fast.
https://rekognition.com/demo/concept
Even leaving aside the reliability issue (which can be chalked up to the fact that this one is a demo of a non-commercial project that got overloaded), you're comparing two entirely different things.
For this image, the University of Toronto software generates sentences like "a cow is standing in the grass by a car", whereas Rekognition only produces a ranked list of categories. ("sports_car", "car_wheel", etc.)
The demo is clearly designed for the small community of machine learning researchers to play around with it to better evaluate the papers they wrote. They aren't selling a product and probably have a hard time justifying using a lot of computing resources to host the demo. Furthermore, the models are probably optimized for result quality, not speed.
[+] [-] JacobEdelman|11 years ago|reply
[+] [-] YoukaiCountry|11 years ago|reply
Anyone having any luck?
[+] [-] bootynuke|11 years ago|reply
[+] [-] finin|11 years ago|reply
A picture of a rabbit in a wooden box => "a cat looking into a bin full of apples"
Mistaking a rabbit for a cat is not too bad. A bin is like a box, I suppose. I'm not sure where the apples came from.
[+] [-] thomasahle|11 years ago|reply
[+] [-] tly_alex|11 years ago|reply
[+] [-] teraflop|11 years ago|reply
Check out the "static demo" pages, e.g. http://www.cs.toronto.edu/~nitish/nips2014demo/results/79133...
For this image, the University of Toronto software generates sentences like "a cow is standing in the grass by a car", whereas Rekognition only produces a ranked list of categories. ("sports_car", "car_wheel", etc.)
EDIT: this is an even better example: http://www.cs.toronto.edu/~nitish/nips2014demo/results/89407... I'm cherry-picking the cases where the algorithm does well, of course. But even if it's unreliable, the fact that this works at all is impressive.
[+] [-] CardinalAgnelo|11 years ago|reply
[+] [-] tonydiv|11 years ago|reply
Take a look here: http://learnimmersive.com
[+] [-] CardinalAgnelo|11 years ago|reply
[+] [-] misiti3780|11 years ago|reply
Comment: If you click on source code right now it gives me to javascript alerts that were trying to print out JSON objects.
[+] [-] vonnik|11 years ago|reply
[+] [-] cmyr|11 years ago|reply
"a man and a girl are learning to play with a small pool", while poetic, is a stretch in this case.
[+] [-] JacobEdelman|11 years ago|reply