Octoparse
Learn about what's new on Octoparse, check out new features, and never miss the latest upgrade.
93 posts

A Full Guide to Build A Web Crawler with Python
This article will talk about 2 methods to build a web crawler with Pythod coding language. Also, you can find the best alternative to create web crawlers without any coding skills.
September 20, 2022 · 5 min read

How to Scrape Web Pages with Load More Button
In this article, we will tell you how to scrape data from a website with the "Load More" button using Octoparse web scraping tool and the Python method.
September 6, 2022 · 5 min read

How to Scrape Websites at Large Scale
There are challenges to scrape data from websites, especially web scraping at large scale. This article reveals the challenges and how we can deal with it.
August 30, 2022 · 4 min read

RegEx: How to Extract All Phone Numbers from Strings
This article shows you how to extract phone numbers from large strings by using regular expressions.
July 10, 2022 · 5 min read

Top 20 Web Crawling Tools to Scrape the Websites Quickly
Here are the top 20 web crawling tools that may fit your needs to extract news, blogs, product data, or URLs from any website. Web scraping is a perfect way to automate your data collection process and boost productivity.
June 23, 2022 · 5 min read

Best Web Scraper for Mac: Scrape Data from Any Website
If you're looking for an easy-to-use web scraper for your macOS devices, then you can find the answer on this page. Octoparse can help you scrape any websites easily and quickly on your Mac.
May 27, 2022 · 5 min read

Use Octoparse to Download Web Data Easily – User Guide
If you want to download structured data from a set of websites, you should try a web scraping tool like Octoparse. Just click the data you want in the built-in browser, and the robot will do the jobs for you.
March 29, 2022 · 4 min read

Using This RegEx Tool to Match HTML Tags
Octoparse provides a RegEx tool for generating regular expressions. It can easily generate some simple regular expressions to meet your different needs to extract content in HTML documents.
March 9, 2022 · 5 min read

Cloud Extraction Works 24/7 with Speed 3-10 Times Faster than Local Extraction
This article introduces Cloud extraction, Ip ban, Octoparse API.
March 7, 2022 · 3 min read

Octoparse 8.5: Empowering Local Scraping and More
Octoparse now makes web scraping on local devices faster and easier. Learn what's new about Octoparse 8.5 and how it improves your efficiency.
February 16, 2022 · 6 min read

What is Web Harvesting?
Web harvesting, also known as web scraping, is the process of data collection from target web pages on the Internet by specialized programs or software. Data is further exported to the database of your choice. Web Harvesting still mainly focus on web content pages that are based on HTML / XML. You may need to grasp some technical terms like XQuery and RegEx (Regular Expression) that can help you screen the content of text / XML documents and thus to collect the exact information.
February 7, 2022 · 2 min read

What is a task in Octoparse?
This blog explain that the concept of a task in Octoparse so that you can use Octoparse better.
January 18, 2022 · 4 min read