When the XPath of the "Loop Item" box only collect the first item from each page, you need to modify the XPath by using our Octoparse XPath tool or other tools like Firepath to get the correct one.
...
Wednesday, December 07, 2016
Loop Item is really frequently used in Octoparse and is very useful when dealing with pagination button or load more button. But many users have questions about when would a loop end. Does the loop en...
Thursday, December 14, 2017
Extracting data from multiple pages through pagination is a very common case since most of the time you need more than one page of data for your project. But you might find that Octoparse sometimes ke...
Monday, July 30, 2018
“Next” button for pagination is not always available and pagination on some websites may be done by page numbers. So in this case, to extract multiple pages of data, we will need to modify the XPath o...
Tuesday, August 07, 2018
Octoparse detects for items belonging to a list via their coding pattern in the underlying HTML source code. To make Octoparse be able to detect and select the missing items, we need to replace the ol...
Wednesday, August 15, 2018
Once you select an element on the web page in the built-in browser, Octoparse intelligently figures out the specific pattern (that is, XPath) which represents it in the source code of the page. To loc...
Wednesday, August 22, 2018
In this tutorial, you will learn how to speed up Cloud Extraction by telling the Octoparse to split up one task into multiple sub tasks.
Thursday, January 19, 2017
If there is one situation that the website you want to scraped contains a “Load More” button, Octoparse will normally extract data after all the content are displayed by clicking the “Load More” butto...
Thursday, March 02, 2017
In this tutorial, we will show you how to make judgement about whether a specific image is within a particular web page or not. To execute the branch judgement, we need to modify and edit the XPath o...
Wednesday, March 08, 2017
In this tutorial, we will walk through the detailed steps to crawl data from retail website, rakuten.com.
List of features covered:
Set up pagination
Build a loop list
Modify XPath
Wednesday, May 10, 2017