Octoparse

Learn about what's new on Octoparse, check out new features, and never miss the latest upgrade.
93 posts

A Full Guide to Build A Web Crawler with Python

avatarAnsel Barrett
This article will talk about 2 methods to build a web crawler with Pythod coding language. Also, you can find the best alternative to create web crawlers without any coding skills.
September 20, 2022 · 5 min read

How to Scrape Web Pages with Load More Button

avatarAnsel Barrett
In this article, we will tell you how to scrape data from a website with the "Load More" button using Octoparse web scraping tool and the Python method.
September 6, 2022 · 5 min read

How to Scrape Websites at Large Scale

avatarAnsel Barrett
There are challenges to scrape data from websites, especially web scraping at large scale. This article reveals the challenges and how we can deal with it.
August 30, 2022 · 4 min read

RegEx: How to Extract All Phone Numbers from Strings

avatarAnsel Barrett
This article shows you how to extract phone numbers from large strings by using regular expressions.
July 10, 2022 · 5 min read

Top 20 Web Crawling Tools to Scrape the Websites Quickly

avatarAnsel Barrett
Here are the top 20 web crawling tools that may fit your needs to extract news, blogs, product data, or URLs from any website. Web scraping is a perfect way to automate your data collection process and boost productivity.
June 23, 2022 · 5 min read

Best Web Scraper for Mac: Scrape Data from Any Website

avatarAbigail Jones
If you're looking for an easy-to-use web scraper for your macOS devices, then you can find the answer on this page. Octoparse can help you scrape any websites easily and quickly on your Mac.
May 27, 2022 · 5 min read

Use Octoparse to Download Web Data Easily – User Guide

avatarAnsel Barrett
If you want to download structured data from a set of websites, you should try a web scraping tool like Octoparse. Just click the data you want in the built-in browser, and the robot will do the jobs for you.
March 29, 2022 · 4 min read

Using This RegEx Tool to Match HTML Tags

avatarAnsel Barrett
Octoparse provides a RegEx tool for generating regular expressions. It can easily generate some simple regular expressions to meet your different needs to extract content in HTML documents.
March 9, 2022 · 5 min read

Cloud Extraction Works 24/7 with Speed 3-10 Times Faster than Local Extraction

avatarAnsel Barrett
This article introduces Cloud extraction, Ip ban, Octoparse API.
March 7, 2022 · 3 min read

Octoparse 8.5: Empowering Local Scraping and More

avatarAbigail Jones
Octoparse now makes web scraping on local devices faster and easier. Learn what's new about Octoparse 8.5 and how it improves your efficiency.
February 16, 2022 · 6 min read

What is Web Harvesting?

avatarAnsel Barrett
Web harvesting, also known as web scraping, is the process of data collection from target web pages on the Internet by specialized programs or software. Data is further exported to the database of your choice. Web Harvesting still mainly focus on web content pages that are based on HTML / XML. You may need to grasp some technical terms like XQuery and RegEx (Regular Expression) that can help you screen the content of text / XML documents and thus to collect the exact information.
February 7, 2022 · 2 min read

What is a task in Octoparse?

avatarAbigail Jones
This blog explain that the concept of a task in Octoparse so that you can use Octoparse better.
January 18, 2022 · 4 min read