undefined

Web Scraping Case Study | Scraping Yelp Reviews

Saturday, December 31, 2016 12:39 AM

For the latest tutorials, visit our new self-service portal. Sharpen your skills and explore new ways to use Octoparse.

 

Yelp is one of the largest business directory websites on the Internet. In this tutorial, we are going to show you how to scrape customer reviews from Yelp.

For Yelp scraping, you can use our ready-to-use Task Template available on the home page or follow this tutorial to build the task from scratch.

cr_yelp1

To demonstrate, we will use this URL as an example: https://www.yelp.com/biz/pike-place-chowder-seattle

 

Here are the main steps in this tutorial: [Download demo task file here]

1. Go to Web Page - to open the target web page

2. Create Pagination - to scrape from multiple pages

3. Extract review information

4. Check Data Preview and Workflow

5. Run Task and Export Data

 

1. Go to Web Page - to open the target web page

Paste the URL on the home screen and click Start

 

2. Create Pagination - to scrape from multiple pages

  a. Scroll down to find the paging button for the review section (>), click on it

  b. Select Loop click next page on the Tips 

cr_yelp2

  c. Adjust Set AJAX timeout to 10s

 

 

3. Extract review information

  a. Click on Pagination in the workflow

  b. Click on 2 random review blocks - Select all sub-elements - Extract data

You will see a Loop Item created inside the Pagination.

 

4. Check Data Preview and workflow

  a. Go to Data Preview, double click the field header to rename it 

  b. Click ... to delete it

Below is what the final workflow looks like. Once everything is in place, you can continue to run the task

cr_yelp3

 

5. Run Task - to get the data

    Run the task on the top right corner: Run task on your device to run the task on your local device, or select Run task in the cloud to run the      task on the Cloud (for premium users only)

 

Here is the sample output:

cr_yelp4

 

Happy Data Hunting!

Author: The Octoparse Team

Download Octoparse Today

 

For more information about Octoparse, please click here.

Sign up today. 

We use cookies to enhance your browsing experience. Read about how we use cookies and how you can control them by clicking cookie settings. If you continue to use this site, you consent to our use of cookies.
Accept decline