Creating a loop for pagination manually by dragging a Loop Item into the workflow and clicking the Next Page button to create a Click Item action. Then copy the XPath of the Next Page into the Loop It...
Monday, November 14, 2016
The extraction does however stop after 3 pages and does not cycle through all the pages. You need to modify the XPath for the pagination loop to correctly find the web elements in order to scrape mult...
Friday, November 18, 2016
Monday, December 12, 2016
In this tutorial, you will learn how to speed up Cloud Extraction by telling the Octoparse to split up one task into multiple sub tasks.
Thursday, January 19, 2017
In this tutorial, we will show you how to make judgement about whether a specific image is within a particular web page or not. To execute the branch judgement, we need to modify and edit the XPath o...
Wednesday, March 8, 2017
Many users have encountered such case that Octoparse skips some pages when scraping. For example, after it successfully scrapes the first two pages, it directly jumps to the page 5, then maybe page 10...
Thursday, August 23, 2018
When you set a completed task to run locally or in the cloud, you may have data extracted to the wrong "columns" or not being extracted at all. This is likely due to incorrect XPath failing to locate ...
Friday, August 24, 2018
Paginated content exists throughout the web. To scrape data from the whole category, you would need to configure pagination in your task to complete your data extraction project. This tutorial cover...
Sunday, April 8, 2018
With Octoparse, you can easily implement data acquisition and web scraping from different kinds of websites to analyze industry advantages and shortcomings. In this tutorial, we will guide you to scra...
Sunday, April 8, 2018
Thursday, December 29, 2016