XPath is a language that allows you to locate specific elements from a page. Modifying XPath in Octoparse works very well with more flexibility and accuracy than the XPath auto-generated by clicking e...
Sunday, April 8, 2018
What should you do if you only want to scrape the non-ads items?
You need to modify the XPath of the “Loop Item” to make it only locate the non-ads items.
Friday, August 31, 2018
The extraction does however stop after 3 pages and does not cycle through all the pages. You need to modify the XPath for the pagination loop to correctly find the web elements in order to scrape mult...
Friday, November 18, 2016
Tuesday, December 13, 2016
Monday, December 12, 2016
Wednesday, April 5, 2017
Octoparse Cloud-based Service has added more featured services so that users can crawl or scrape data with increasingly high speed and large scale. Octoparse featured Cloud Service differs from the Lo...
Wednesday, March 1, 2017
In this web scraping tutorial we will teach you how to scrape a site that required login with Octoparse. The examples of websites which required login we'd like to use are Facebook, Twitter and Linked...
Thursday, January 12, 2017
In this web scraping tutorial we will scrape the article information from the Google search results of “cancer”. We will scrape latest articles from this website to get the abstract of latest articles...
Wednesday, January 11, 2017
This tutorial shows you how to set scheduled data extraction in Octoparse. You can collect the data/scrape the website at some specific times on selected days. Octoparse enables you to do the scheduli...
Monday, November 28, 2016