Seems like natural language processing would be an interesting direction for captchas.
- A man is running. A dog is behind him barking and growling. What does the man think might happen?
- A man goes up the stairs to the roof. He walks to the very edge of the building. He takes one more step. What is the man trying to do?
The correct answer should be pretty easy to parse out. And I'd expect a better success rate for humans than some of the captchas today that increasingly are looking more like magic eye puzzles than character recognition. But of course the big question is generation. Can these sort of implication based stories be generated in a way such that the final text can not trivially be reversed to the answer (without even considering the 'meaning' of the question)? And for that matter can these even be realistically generated in mass?
You're in a desert walking along the sand when all of a sudden you look down and see a tortise. You reach down and flip the tortise on it's back. The tortise lays on it's back, it's belly baking in the hot sun but you're not helping. Why is that leon?
People always come at this from an angle of "what can I do that computers can't?". You need to take into account the incredible diversity of people who use the internet, and what they can and can't do. There's already a viral article written by an old lady who can't pass the current captchas. Add to this, people who don't speak english, or don't speak it well; people who battle to read and comprehend text in any language; people who battle with logical reasoning; etc, etc, etc. The lowest common denominator for a task that be easily solved by any human is pretty low.
When writing such captcha questions for a forum, I generally use google as a validation to see that google can't answer the question in the top listed links. This allow me to easily adjust questions to the point where natural language processing should not be able to answer the question but a human person would.
Yep, Question-Answering Semantic Role labeling is an interesting research project around crowdsourcing NLP datasets. https://dada.cs.washington.edu/qasrl/
This [1] is the article they're citing. Note that a cursory search turns up similar claims from back in 2013; it might be worth waiting for someone with more experience and less bias to express their opinions before dumping your captcha-related stocks.
Since when was captcha not broken? Sites like http://www.deathbycaptcha.com/user/order have been around for ages. Yes, a mere $6.95 gets you 5000 captchas solved by OCR and humans in an avg of 6 seconds. Imagine that job.
Sure, AI can break captcha, but it can be done at scale for far less than an AI research and GPU rig costs.
Google's approach to bot recognition is training their own bots incidentally, so even an adversarial network attempting to bypass it would give it a ton of training along the way to breaking in.
I don't believe it's a job. Isn't this the thing where captchas on target sites are simply mirrored on other sites like sketchy filehosts? Real human users are solving captchas to access some content hosted by this service, and the solution they enter is passed through to the target site.
I don't see how captchas are "fundamentally cracked" if they only claim a success rate at best around 2/3rds. Nor do they give an explanation for what they mean by fundamentally cracked.
A captcha is cracked if it becomes economical to try to pass it over and over again. If you have a script that succeeds in spamming a forum 2/3 times it tries, you've got a successful spamming system.
What they mean by fundamentally cracked is that this method seems to be more robust against minor variations of spacing, font, etc. than CNN-based models.
Many types of CAPTCHA systems can be defeated with machine learning models and OCR. Google provides its own called Google Vision API. Here is a brief example how this is done in practice: https://blog.websecurify.com/2017/10/cracking-captchas.html
Perhaps this is an old news as this technique has been out for a while but I find that it is still relevant in the many cases I have encountered.
Furthermore, in my experience, I attribute Google's failure to improve reCAPTCHA's "I am not a robot" visual appeal as one of the key factors why many organisations are simply not using it.
I think rather than being broken, captcha models are just going to be made more complex. Maybe they'll start asking you to write a poem or play a mini problem solving game.
a lot of services use facebook to verify that someone is a human. there should be a service that exists only to manage peoples identities online. sign up, provide some id, an address and last four of your social. later, maybe a letter is sent to the address and returned with a verification code. then, every other service on the internet could use that service to prevent bots, spam and other things.
Here in Norway we use something like that for access to banks, tax, pensions, social services, etc. All of these services allow you to log in with what is called BankID. You apply for BankID and supply an ID like your passport then all the other banks and institutions accept that. It uses a two factor scheme with SMS, code cards, apps in SIM cards, etc.
But of course this isn't available to be used by some random kitten video trading site.
Also why would I want to give up my real identity to a lot of the sites that use a captcha?
[+] [-] indubitable|8 years ago|reply
- A man is running. A dog is behind him barking and growling. What does the man think might happen?
- A man goes up the stairs to the roof. He walks to the very edge of the building. He takes one more step. What is the man trying to do?
The correct answer should be pretty easy to parse out. And I'd expect a better success rate for humans than some of the captchas today that increasingly are looking more like magic eye puzzles than character recognition. But of course the big question is generation. Can these sort of implication based stories be generated in a way such that the final text can not trivially be reversed to the answer (without even considering the 'meaning' of the question)? And for that matter can these even be realistically generated in mass?
[+] [-] dmead|8 years ago|reply
[+] [-] sixhobbits|8 years ago|reply
[+] [-] RobertoG|8 years ago|reply
[+] [-] belorn|8 years ago|reply
[+] [-] Hyperbolic|8 years ago|reply
[+] [-] olegkikin|8 years ago|reply
So what's the correct answer here?
* That's a mean dog
* I hope it's on a leash
* I hope it's not going to start running after me
* I hope I don't get bitten
* Where can I hide?
* OMG, I will get rabies!
[+] [-] DalasNoin|8 years ago|reply
[+] [-] taesis|8 years ago|reply
[1]: http://science.sciencemag.org/content/early/2017/10/26/scien...
[+] [-] thisisit|8 years ago|reply
Are there companies relying only only selling captcha for revenues?
[+] [-] vonnik|8 years ago|reply
http://www.slate.com/blogs/future_tense/2013/10/28/captcha_c...
I thought the world moved on.
[+] [-] reilly3000|8 years ago|reply
Sure, AI can break captcha, but it can be done at scale for far less than an AI research and GPU rig costs.
Google's approach to bot recognition is training their own bots incidentally, so even an adversarial network attempting to bypass it would give it a ton of training along the way to breaking in.
[+] [-] notatoad|8 years ago|reply
I don't believe it's a job. Isn't this the thing where captchas on target sites are simply mirrored on other sites like sketchy filehosts? Real human users are solving captchas to access some content hosted by this service, and the solution they enter is passed through to the target site.
[+] [-] _pdp_|8 years ago|reply
[+] [-] partycoder|8 years ago|reply
Dileep George, cofounder of Vicarious, is the former Numenta CTO, and claimed to use probabilistic graphical models as a basis for their tech.
https://www.youtube.com/watch?v=-H185jPf-7o
[+] [-] habitue|8 years ago|reply
[+] [-] ComputerGuru|8 years ago|reply
[+] [-] sobellian|8 years ago|reply
What they mean by fundamentally cracked is that this method seems to be more robust against minor variations of spacing, font, etc. than CNN-based models.
[+] [-] willchang|8 years ago|reply
[+] [-] taneq|8 years ago|reply
[+] [-] reacweb|8 years ago|reply
[+] [-] gruez|8 years ago|reply
[+] [-] Sir_Cmpwn|8 years ago|reply
[+] [-] trophycase|8 years ago|reply
[+] [-] taneq|8 years ago|reply
It's right there in the name: "Completely Automated Public Turing test to tell Computers and Humans Apart"
[+] [-] nabla9|8 years ago|reply
Computers try to figure out who is human and who is not. In Turing test humans try to who is human and who is not.
[+] [-] mitchty|8 years ago|reply
At a certain point I just give up and refuse to use the worst sites that use this junk.
[+] [-] _pdp_|8 years ago|reply
Perhaps this is an old news as this technique has been out for a while but I find that it is still relevant in the many cases I have encountered.
Furthermore, in my experience, I attribute Google's failure to improve reCAPTCHA's "I am not a robot" visual appeal as one of the key factors why many organisations are simply not using it.
[+] [-] briga|8 years ago|reply
[+] [-] taeric|8 years ago|reply
[+] [-] wheresmyusern|8 years ago|reply
[+] [-] pinum|8 years ago|reply
[+] [-] sitepodmatt|8 years ago|reply
[+] [-] kwhitefoot|8 years ago|reply
But of course this isn't available to be used by some random kitten video trading site.
Also why would I want to give up my real identity to a lot of the sites that use a captcha?
[+] [-] lsseckman|8 years ago|reply
[+] [-] HealthGoth|8 years ago|reply
[deleted]