Step-by-step tutorials for you to get started with web scraping
The latest version of this tutorial is available here. Go check it out now!
In this tutorial, we are going to show you how to scrape information from Craigslist.
To follow along, you may want to use the URL in this tutorial:
Here are the main steps in this tutorial: [Download task file here]
1) "Go To Web Page" - to open the targeted web page
2) Create a pagination loop - to scrape all the results from multiple pages
3) Create a "Loop Item" - to click into each item in the list on each page
We are now on the second page. When creating a "Loop Item", we should always start with the first item on the first page, so we'd better go back to the first page.
By doing this, we can help Octoparse decide the execution order and generate the Loop Item at the appropriate position in the workflow.
The first item is highlighted in green while the others are highlighted in red.
All of the items are highlighted in green.
4) Extract data - to select data you need to scrape
5) Run extraction - to run your task and get data
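For readers who think in code, the five steps above can be sketched as two nested loops: an outer pagination loop and an inner "Loop Item". This is only an illustration of the execution order, not how Octoparse works internally. The names `ListingPage`, `FAKE_PAGES`, `FAKE_ITEMS`, and `run_task`, along with the sample titles and prices, are all hypothetical and stand in for real web pages so the sketch runs without network access.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class ListingPage:
    """One page of results: links to items plus a link to the next page."""
    item_urls: List[str]
    next_url: Optional[str]  # None on the last page


# Hypothetical in-memory "site"; these are made-up records, not Craigslist data.
FAKE_PAGES: Dict[str, ListingPage] = {
    "page-1": ListingPage(["item-a", "item-b"], "page-2"),
    "page-2": ListingPage(["item-c"], None),
}
FAKE_ITEMS: Dict[str, Dict[str, str]] = {
    "item-a": {"title": "Bike", "price": "$50"},
    "item-b": {"title": "Desk", "price": "$30"},
    "item-c": {"title": "Lamp", "price": "$10"},
}


def run_task(start_url: str) -> List[Dict[str, str]]:
    """Walk every page from start_url and collect one row per item."""
    rows: List[Dict[str, str]] = []
    page_url: Optional[str] = start_url       # 1) "Go To Web Page"
    while page_url is not None:               # 2) pagination loop
        page = FAKE_PAGES[page_url]
        for item_url in page.item_urls:       # 3) "Loop Item"
            detail = FAKE_ITEMS[item_url]     #    click into the item
            rows.append({                     # 4) extract data
                "title": detail["title"],
                "price": detail["price"],
            })
        page_url = page.next_url              # follow the "next page" link
    return rows                               # 5) result of running the task


rows = run_task("page-1")
for row in rows:
    print(row["title"], row["price"])
```

Calling `run_task("page-1")` visits the items page by page, in list order. Calling it with `"page-2"` in this toy data would only reach the last page, which mirrors why the walkthrough goes back to the first page before building the "Loop Item".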
Here is the sample output from the Octoparse task: