Is It Possible to Bypass the CAPTCHA When Extracting Data From Web pages

4/15/2016 3:39:18 AM




Have you ever been asked to read blurred letters and type them into a box? That’s a CAPTCHA.

CAPTCHA (Completely Automated Public Turing test to tell Computers and Humans Apart) is a method that websites use to tell the difference between robots and humans accessing their pages. CAPTCHAs are there to actually stop you for automating the login. This is an ongoing struggle between CAPTCHAs providers and the ones who want to beat the system by bypassing them.




There are many websites that use CAPTCHA to prevent robots from visiting their websites. So it’ll be very tricky for you to extract data from these websites. Well, is it possible to bypass the CAPTCHA when extracting data From web pages?

There are ways to get around CAPTCHA. By using some artificial technique , it can bypass the verification code. The most common way is to hook your program up to a service in an offshore center where someone sits before a screen all day filling in those little authentication screens.

So far Octoparse does not handle captchas. But we will catch it up.




Author: The Octoparse Team




Download Octoparse Today



For more information about Octoparse, please click here.

Sign up today.



Author's Picks


About Octoparse

Octoparse 6.0 is Now Available

What A Price Monitor Can Help you?

Examples of Businesses Who Use Data Scraping

Collect Data from Facebook

Collect Data from Craigslist

Collect Data from LinkedIn




Recent Posts


Leave us a message

Your name*

Your email*




Attach file
Attach file
Please enter details of your issue and we will get back to you ASAP.