Regular expressions are patterns used to match character combinations in strings.
In this tutorial I will take glassdoor for example to show you how to use regular expressions to scrape data from we...
Thursday, October 13, 2016
Data re-format in Octoparse is a useful tool when the extracted data are not in the form you want. It mainly has 8 different functions(Replace, Replace with regular expression, Match with regular expr...
Friday, August 18, 2017
In this document, we want to extract some stock data of a company. I’m going to show you how to use Octoparse to get financial data from the web. Let’s get started.
Sunday, May 29, 2016
Thursday, May 26, 2016
Octoparse 7.X has simplified the operations of scraping data from directory websites. In this tutorial, we will show you how to extract data from directories by using yelp.com as an example.
Sunday, April 8, 2018
In this tutorial, we will show you how to extract data from source code (extract data from inner HTML and extract data from HTML). We will also match HTML transcoding and use Regular Expression to fur...
Sunday, April 8, 2018
In this tutorial, I will show you how to extract information from Ebay and how to re-format extracted data with regular expressions. You can grab exact data from Ebay with Octoparse.
Wednesday, April 27, 2016
With Octoparse, you can easily scrape any data you want such as top news, hot topics, worldwide trends etc. from a variety of social media websites. In this tutorial, we would show you how to extract ...
Sunday, April 8, 2018
During your web scraping project, some data might not be the format you wanted. In this case, Octoparse offers 8 types of re-formatting features for you to further process or clean the data extracted ...
Sunday, April 8, 2018
You may doubt whether you could extract text from an HTML document as it usually contains tags that you don’t need. Or you may have queries whether you could scrape the hidden text behind the website ...
Thursday, September 29, 2016