Create a pagination loop manually by dragging a Loop Item into the workflow and clicking the Next Page button to create a Click Item action. Then copy the XPath of the Next Page into the Loop It...
Monday, November 14, 2016
However, the extraction stops after 3 pages and does not cycle through all the pages. You need to modify the XPath for the pagination loop to correctly find the web elements in order to scrape mult...
Friday, November 18, 2016
Monday, December 12, 2016
Paginated content exists throughout the web. To scrape data from the whole category, you would need to configure pagination in your task to complete your data extraction project. This tutorial cover...
Sunday, April 08, 2018
With Octoparse, you can easily implement data acquisition and web scraping from different kinds of websites to analyze industry advantages and shortcomings. In this tutorial, we will guide you to scra...
Sunday, April 08, 2018
Many users have encountered cases where Octoparse skips some pages when scraping. For example, after it successfully scrapes the first two pages, it jumps directly to page 5, then maybe page 10...
Thursday, August 23, 2018
When you set a completed task to run locally or in the cloud, you may find data extracted to the wrong "columns" or not extracted at all. This is likely due to an incorrect XPath failing to locate ...
Friday, August 24, 2018
What should you do if you only want to scrape the non-ad items?
You need to modify the XPath of the “Loop Item” so that it locates only the non-ad items.
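To illustrate the idea of narrowing a loop XPath outside Octoparse, here is a minimal Python sketch. The markup, the class names `item` and `item ad`, and the helper `titles` are all hypothetical; a real page will mark its ads differently. Python's standard `xml.etree.ElementTree` supports only a subset of XPath, so the sketch uses an exact class match. In a full XPath 1.0 engine, a predicate such as `//li[not(contains(@class, 'ad'))]` is another common way to express the same filter.

```python
import xml.etree.ElementTree as ET

# Hypothetical listing markup: sponsored entries carry class "item ad".
PAGE = """
<ul>
  <li class="item ad"><a>Sponsored: Widget Pro</a></li>
  <li class="item"><a>Widget A</a></li>
  <li class="item"><a>Widget B</a></li>
  <li class="item ad"><a>Sponsored: Widget Max</a></li>
  <li class="item"><a>Widget C</a></li>
</ul>
"""

root = ET.fromstring(PAGE)

# Too broad: matches every list entry, ads included.
all_items = root.findall(".//li")

# Narrowed: the exact class match skips the "item ad" entries.
organic_items = root.findall(".//li[@class='item']")


def titles(nodes):
    """Return the link text of each matched list entry."""
    return [node.find("a").text for node in nodes]


print(len(all_items), "items matched by the broad XPath")
print(titles(organic_items))
```

The broad expression picks up all five entries, while the narrowed one returns only the three that are not marked as ads. The same kind of narrowed expression is what you would paste into the XPath field of the “Loop Item”.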
Friday, August 31, 2018
In this tutorial, we will show you how to scrape restaurant info from Grubhub. Details like name, price, address, review, menu and such can be easily extracted using Octoparse, no coding needed at all...
Thursday, January 31, 2019
In this web scraping tutorial we will show you how to deal with a pagination issue. Whenever a pagination issue is observed, it is very likely due to the fact that the auto-generated XPath for 'Next' ...
Tuesday, March 07, 2017