When the XPath of the "Loop Item" box only collect the first item from each page, you need to modify the XPath by using our Octoparse XPath tool or other tools like Firepath to get the correct one.
...
Wednesday, December 7, 2016
Loop Item is really frequently used in Octoparse and is very useful when dealing with pagination button or load more button. But many users have questions about when would a loop end. Does the loop en...
Thursday, December 14, 2017
“Next” button for pagination is not always available and pagination on some websites may be done by page numbers. So in this case, to extract multiple pages of data, we will need to modify the XPath o...
Tuesday, August 7, 2018
Once you select an element on the web page in the built-in browser, Octoparse intelligently figures out the specific pattern (that is, XPath) which represents it in the source code of the page. To loc...
Wednesday, August 22, 2018
Extracting data from multiple pages through pagination is a very common case since most of the time you need more than one page of data for your project. But you might find that Octoparse sometimes ke...
Monday, July 30, 2018
Octoparse detects for items belonging to a list via their coding pattern in the underlying HTML source code. To make Octoparse be able to detect and select the missing items, we need to replace the ol...
Wednesday, August 15, 2018
In this web scraping tutorial we will show you how to deal with a pagination issue. Whenever a pagination issue is observed, it is very likely due to the fact that the auto-generated XPath for 'Next' ...
Tuesday, March 7, 2017
In this tutorial, we will walk through the detailed steps to crawl data from retail website, rakuten.com.
List of features covered:
Set up pagination
Build a loop list
Modify XPath
Wednesday, May 10, 2017
What should you do if you only want to scrape the non-ads items?
You need to modify the XPath of the “Loop Item” to make it only locate the non-ads items.
Friday, August 31, 2018
In this tutorial, we will show you how to scrape restaurant info from Grubhub. Details like name, price, address, review, menu and such can be easily extracted using Octoparse, no coding needed at all...
Thursday, January 31, 2019