To get data from Yelp, it only takes 5 minutes to build a crawler with Octoparse.
Step 1: Input the Yelp website URL to build a web crawler
You can do this by simple copy-and-paste. Give it a few seconds. Octoparse will detect the webpage data automatically. Once the detection is done, you can see data fields highlighted in red. This means that all the highlighted data is preselected by the bot.
Step 2: Check the pagination setting and the data preview
Usually, the pagination button is auto-detected, and you can check its position. But if that’s not the case, you can easily select the button manually by clicking on “edit” on the Tips panel and confirm your selection.
There is a data preview section below that allows you to preview your data at the bottom and choose how you'd like the data to appear. For example, you can edit the names of the data fields, change the sequence or delete them.
Step 3: Create your workflow and execute the Yelp crawler
Once you’ve made sure the data columns look perfect, simply hit “create workflow” and Octoparse will auto-generate a scraping workflow for you on the left-hand side. The workflow tells us that our crawler will extract the listing data one by one on the first page, and then head to the following pages to repeat the extraction on each page.
You can choose to run your crawler on your computer or on Octoparse cloud servers. We usually recommend the latter as it allows you to schedule your extractions and can get data for you while you are sleeping. But the local extraction also works great for a one-time project. It is totally up to you.
When you’ve got enough data, just stop the extraction and export it into formats of your choice such as Excel, JSON, CSV, HTML, etc. Octoparse also allows you to export the data to your database or data visualization tools via Octoparse APIs.
Besides Yelp, Octoparse can also crawl Yellowpages, Google Maps, Amazon, eBay, Twitter, Instagram, and many other websites. You can check out our web scraping tutorials and shoot us an email at firstname.lastname@example.org. If you are in need of data, we are so ready to help!