Scrape Data from Airbnb into Excel
Friday, October 07, 2016 10:09 PM
(Download my extraction task for this tutorial HERE in case you need it.)
Sometimes we need to scrape data from websites and export it to an Excel spreadsheet. Scraping tools handle this easily because they can export data in many formats. In this tutorial, I will use Airbnb as an example to show you how to scrape data from a website into Excel.
Step 1. Configure a rule first. Choose “Advanced Mode” ➜ Complete the basic information.
Enter the target Airbnb URL in the built-in browser. ➜ Click the “Go” icon to open the webpage.
Step 2. Click the pagination link. Click “Expand the selection area” until “Loop click in the element” appears. ➜ Choose “Loop click in the element” to turn the page.
(Note: If you want to extract information from every page of the search results, you need to add a page navigation action.)
Step 3. Move your cursor over one of the sections with similar layout from which you want to extract data.
Click the first highlighted link ➜ Click “Create a list of items” (sections with similar layout) ➜ Click “Add current item to the list”. The first highlighted link is now in the list. ➜ Click “Continue to edit the list”.
Click the second highlighted link ➜ Click “Add current item to the list” again. Now all the links with similar layout are in the list. ➜ Click “Finish Creating List” ➜ Click “loop” to process the list and extract the elements from each page.
Step 4. Extract the title of the first section ➜ Click the title. ➜ Select “Extract text”. Other content can be extracted in the same way.
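For readers who want to see what “Extract text” from sections with similar layout amounts to outside the tool, here is a minimal Python sketch. It is not how Octoparse works internally. The sample markup, the `listing-title` class name, and the `extract_titles` helper are all hypothetical; the real Airbnb page structure is different and changes over time.

```python
from html.parser import HTMLParser

# Hypothetical markup standing in for one page of search results.
SAMPLE_HTML = """
<div class="listing"><a class="listing-title" href="/rooms/1">Sunny loft</a><span class="price">$80</span></div>
<div class="listing"><a class="listing-title" href="/rooms/2">Quiet studio</a><span class="price">$65</span></div>
"""


class TitleExtractor(HTMLParser):
    """Collect the text of every <a> whose class is `listing-title`."""

    def __init__(self):
        super().__init__()
        self.titles = []
        self._in_title = False

    def handle_starttag(self, tag, attrs):
        # Each matching link starts a new entry in the list of items.
        if tag == "a" and dict(attrs).get("class") == "listing-title":
            self._in_title = True
            self.titles.append("")

    def handle_data(self, data):
        # Text may arrive in several chunks, so append to the current entry.
        if self._in_title:
            self.titles[-1] += data

    def handle_endtag(self, tag):
        if tag == "a":
            self._in_title = False


def extract_titles(html):
    """Return the stripped text of every matching title link."""
    parser = TitleExtractor()
    parser.feed(html)
    parser.close()
    return [title.strip() for title in parser.titles]
```

Calling `extract_titles(SAMPLE_HTML)` returns one title per section, which mirrors what the list of items built in Step 3 feeds into the “Extract text” action.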
Step 5. All the selected content will be listed under Data Fields. ➜ Click a “Field Name” to rename it.
Step 6. Drag the second “Loop Item” so that it sits before the “Click to paginate” action in the Workflow Designer. This way the elements of every section are extracted on each page before the workflow moves to the next one.
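The ordering in this step can be pictured as a loop over items nested inside a pagination loop. The Python sketch below is only a simplified model of that idea, not Octoparse's implementation. The `PAGES` data and the `run_workflow` function are hypothetical, and the pages are held in memory so nothing is fetched from the web.

```python
# Hypothetical in-memory "site": each inner list is one page of results.
PAGES = [
    ["Sunny loft", "Quiet studio"],
    ["Garden cottage"],
    ["City apartment", "Beach house"],
]


def run_workflow(pages):
    """Extract every item on a page, then move on to the next page."""
    rows = []
    page_index = 0
    while page_index < len(pages):           # pagination loop
        for title in pages[page_index]:      # "Loop Item": each section on this page
            rows.append({"title": title, "page": page_index + 1})
        page_index += 1                      # "Click to paginate": go to the next page
    return rows
```

Because the item loop runs before the page advances, `run_workflow(PAGES)` collects rows from all three pages rather than from a single one.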
Step 7. Click “Next” ➜ Click “Next” ➜ Click “Local Extraction”. Octoparse will automatically extract all the data selected.
Step 8. The extracted data will be shown in the “Data Extracted” pane. Click the “View Data” button to view it. You can then export the results to an Excel file: click the “Export” button and save the file to your computer in Excel format.
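If you ever need to produce a similar spreadsheet by hand, the rows-and-fields shape of the exported data can be sketched in Python. Octoparse performs its own Excel export, so this is only an illustration under stated assumptions: it writes CSV (which Excel can open) because the Python standard library has no Excel writer, and the `ROWS` data, the field names, and both helper functions are hypothetical.

```python
import csv
import io

# Hypothetical extracted rows, keyed by the field names chosen in Step 5.
ROWS = [
    {"title": "Sunny loft", "price": "$80"},
    {"title": "Quiet studio", "price": "$65"},
]


def rows_to_csv_text(rows, field_names):
    """Render the rows as CSV text with a header line."""
    buffer = io.StringIO(newline="")  # keep the csv module's own line endings
    writer = csv.DictWriter(buffer, fieldnames=field_names)
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


def export_to_file(rows, path, field_names):
    """Save the rows to `path` as a CSV file that Excel can open."""
    # "utf-8-sig" adds a byte order mark, which helps Excel detect UTF-8.
    with open(path, "w", newline="", encoding="utf-8-sig") as handle:
        handle.write(rows_to_csv_text(rows, field_names))
```

For example, `export_to_file(ROWS, "airbnb.csv", ["title", "price"])` would write a two-column file with one line per listing; the file name here is just a placeholder.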
Author: The Octoparse Team