How to Extract Data from eBay
Thursday, March 31, 2016 11:45 PM
Step 1. Download and install Octoparse. Register a new account at www.octoparse.com, or click the “Sign up” option on the Login interface.
Step 2. Click “start” to build a new task, or hit the “Quick start” button in the Navigation Panel to create a new task. (Here we use Advanced Mode.)
Step 3. Complete basic information. ➜ Click “Next”.
Step 4. Design the workflow to configure the extraction rule. If something goes wrong, you can check your configured rule in Workflow Designer.
2) Then choose a “loop mode”. ➜ Select “List of URL” under Advanced Options.
I’m going to take four television brands on eBay as an example.
3) Click the “List of URL” box. ➜ Enter all the URLs. (Note: All the pages must share a similar layout.)
After choosing “OK”, the gray “a” turns into a blue “A”. (Note: Don’t forget to click the “Save” button to save the task.)
Once a “Go to the webpage” action is created inside the “Loop Item” action, the program starts navigating to each URL just added.
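To picture what the “List of URL” loop does, here is a minimal Python sketch that builds four search URLs from one template and visits them in order. The brand names, the `build_search_url` helper and the search URL pattern are assumptions for illustration only; the tutorial does not list the actual URLs, so copy the real ones from your browser.

```python
from urllib.parse import urlencode

# Hypothetical brands; the tutorial does not name the four it uses.
BRANDS = ["Samsung", "LG", "Sony", "Vizio"]

# Assumed search endpoint; check the URL your browser actually shows.
BASE_URL = "https://www.ebay.com/sch/i.html"


def build_search_url(brand: str) -> str:
    """Return a search URL for '<brand> television' (illustrative only)."""
    query = urlencode({"_nkw": f"{brand} television"})
    return f"{BASE_URL}?{query}"


urls = [build_search_url(brand) for brand in BRANDS]

for url in urls:
    # In Octoparse, the "Go to the webpage" action would open each URL in turn.
    print(url)
```

Because every URL comes from the same template, the pages share one layout, which is what the note about similar layout asks for.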
4) Scroll down to the bottom of the page. Click “Next page”. ➜ Choose “Loop click next page” to create a loop action that processes all the result pages.
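The “Loop click next page” action can be thought of as a loop that keeps following the “Next page” link until there is none left. The sketch below shows that idea on a fake in-memory site so it runs offline; `FAKE_SITE`, `fetch` and `iterate_pages` are made-up names for illustration, not part of Octoparse.

```python
from typing import Callable, Iterator, Optional

# A fake three-page site held in memory so the sketch runs offline.
FAKE_SITE = {
    "page1": {"items": ["TV A", "TV B"], "next": "page2"},
    "page2": {"items": ["TV C"], "next": "page3"},
    "page3": {"items": ["TV D"], "next": None},  # no "Next page" link here
}


def fetch(url: str) -> dict:
    """Stand-in for loading a page; a real run would download and parse HTML."""
    return FAKE_SITE[url]


def iterate_pages(
    start_url: str, load: Callable[[str], dict], max_pages: int = 50
) -> Iterator[dict]:
    """Yield pages by following 'next' until it is missing or max_pages is hit."""
    url: Optional[str] = start_url
    visited = 0
    while url is not None and visited < max_pages:
        page = load(url)
        yield page
        visited += 1
        url = page.get("next")


titles = [item for page in iterate_pages("page1", fetch) for item in page["items"]]
print(titles)
```

The `max_pages` cap is a safety choice in this sketch so a page that always links to a next page cannot loop forever.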
5) Go back to the first product section. If you need more information from the detail page, click the detail link to open it. If not, just extract the information you need from this page.
To extract information from the detail page: Click the first product title. ➜ Click “Create a list of items”. ➜ “Add current item to the list”.
Click “Continue to edit the list”.
Then click the second product title. ➜ Click “Add current item to the list” again.
As you can see, all the detail links on the first page are now in the box. ➜ Then click “Finish creating list”.
Click “Loop” to process the list.
Then we’re on the detail page.
6) Extract data. You can extract any data you need on the page. Click on the product title. ➜ Click “Extract text”.
The selected content will be shown in Data Fields. ➜ Click the “Field Name” to modify it. ➜ Click “Next”.
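For readers curious about what “Extract text” plus a custom field name amounts to, here is a small sketch using Python’s standard `html.parser`. The HTML snippet, the element ids and the `FIELD_NAMES` mapping are all invented for illustration; real eBay markup is different and changes over time.

```python
from html.parser import HTMLParser

# Made-up detail-page snippet; not real eBay markup.
SAMPLE_HTML = (
    '<div><h1 id="itemTitle">Sample 40-inch TV</h1>'
    '<span id="price">US $199.99</span></div>'
)

# Hypothetical mapping from element id to the field name wanted in the output.
FIELD_NAMES = {"itemTitle": "title", "price": "price"}


class FieldExtractor(HTMLParser):
    """Collect the text of elements whose id is listed in FIELD_NAMES.

    Only handles flat elements (no nested tags inside a field).
    """

    def __init__(self) -> None:
        super().__init__()
        self.current_field = None
        self.record = {}

    def handle_starttag(self, tag, attrs):
        element_id = dict(attrs).get("id")
        if element_id in FIELD_NAMES:
            # Start a new field under its renamed key.
            self.current_field = FIELD_NAMES[element_id]
            self.record[self.current_field] = ""

    def handle_endtag(self, tag):
        # Any closing tag ends the field being collected.
        self.current_field = None

    def handle_data(self, data):
        if self.current_field is not None:
            self.record[self.current_field] += data.strip()


extractor = FieldExtractor()
extractor.feed(SAMPLE_HTML)
record = extractor.record
print(record)
```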
7) Drag the second “Loop Item” before the “Click to paginate” action, so that the items on each page are processed before the program moves to the next page. Once all data fields are created, click “Next”.
Step 5. Now we are done configuring the extraction rule. ➜ Click “Next” to process the configured rule. If images are not needed, you can choose not to load them to speed up the extraction.
Step 6. Now the task is completed! Choose “Local extraction” to run the task on your computer.
Step 7. The extracted data will be shown in the “Data Extracted” pane. Click the export button to export the results to an Excel file, a database or another format, and save the file to your computer.
1) When the export is finished, click “open folder” to check the result.
2) The result looks awesome! If a field has no data to extract, it is left blank in the result.
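If you post-process the results yourself, the same “leave it blank” behavior is easy to reproduce. The sketch below writes two made-up records to CSV with Python’s `csv.DictWriter`; the field names and values are placeholders, not real extracted data.

```python
import csv
import io

# Hypothetical field names, in the column order wanted in the export.
FIELDS = ["title", "price", "shipping"]

# Placeholder records; the second one has no shipping value.
rows = [
    {"title": "TV A", "price": "US $199.99", "shipping": "Free"},
    {"title": "TV B", "price": "US $349.00"},
]

buffer = io.StringIO()
# restval="" fills any missing field with an empty cell.
writer = csv.DictWriter(buffer, fieldnames=FIELDS, restval="")
writer.writeheader()
writer.writerows(rows)

csv_text = buffer.getvalue()
print(csv_text)
```

Writing to an in-memory buffer keeps the sketch self-contained; to save a real file, open it with `newline=""` and pass the file object to `DictWriter` instead.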
Happy Data Hunting!
Author: The Octoparse Team