Top

Google has algorithm to crack CAPTCHA codes

Google’s Street View algorithm accidentally stumbled on cracking CAPTCHA

Google’s Street View cameras are scanning the globe from all positions and making sense of all alphabets in the images. The image-to-text conversion algorithm (similar to OCR or optical character recognition) can convert alphabets and numbers from name plates and street signs into plain text for use on the Google Map and Google Street View data.

However, the CAPTCHA used to protect numerous websites from bots also works on the similar method, but in reverse. The CAPTCHA is meant to be hard for bots to scan to text while being easier on humans to read it. If Google’s algorithm was to be put in this place, any CAPTCHA word can be easily converted into text and a bot can be created easily.

Google has acknowledged this discovery and are working closely with the same to implement stronger and smarter reCAPTCHA codes. We refer to you the information from an article (below) posted on the Google Blog.

Have you ever wondered how Google Maps knows the exact location of your neighborhood coffee shop? Or of the hotel you’re staying at next month? Translating a street address to an exact location on a map is harder than it seems. To take on this challenge and make Google Maps even more useful, we’ve been working on a new system to help locate addresses even more accurately, using some of the technology from the Street View and reCAPTCHA teams.

This technology finds and reads street numbers in Street View, and correlates those numbers with existing addresses to pinpoint their exact location on Google Maps. We’ve described these findings in a scientific paper at the International Conference on Learning Representations (ICLR). In this paper, we show that this system is able to accurately detect and read difficult numbers in Street View with 90% accuracy.

These findings have surprising implications for spam and abuse protection on the Internet as well. For more than a decade, CAPTCHAs have used visual puzzles in the form of distorted text to help webmasters prevent automated software from engaging in abusive activities on their sites. Turns out that this new algorithm can also be used to read CAPTCHA puzzles—we found that it can decipher the hardest distorted text puzzles from reCAPTCHA with over 99% accuracy. This shows that the act of typing in the answer to a distorted image should not be the only factor when it comes to determining a human versus a machine.

Fortunately, Google’s reCAPTCHA has taken this into consideration, and reCAPTCHA is more secure today than ever before. Last year, we announced that we’ve significantly reduced our dependence on text distortions as the main differentiator between human and machine, and instead perform advanced risk analysis. This has also allowed us to simplify both our text CAPTCHAs as well as our audio CAPTCHAs, so that getting through this security measure is easy for humans, but still keeps websites protected.

Thanks to this research, we know that relying on distorted text alone isn’t enough. However, it’s important to note that simply identifying the text in CAPTCHA puzzles correctly doesn’t mean that reCAPTCHA itself is broken or ineffective. On the contrary, these findings have helped us build additional safeguards against bad actors in reCAPTCHA.

As the Street View and reCAPTCHA teams continue to work closely together, both will continue to improve, making Maps more precise and useful and reCAPTCHA safer and more effective.

Next Story