List & Detail Web Page - Wizard Mode
Wednesday, March 9, 2016 9:23 PM
Extract List & Detail Webpage
Step 1. Download and install Octoparse. Register a new account at www.octoparse.com, or click the “Sign up” option on the Login interface.
If this is the first time you log in, you’ll see the User Guide interface.
Step 2. In Wizard Mode: Hit “Start” ➜ Go to “List&Detail Extraction” ➜ “+Create” to start building a task.
(Note: You need to complete the training section at least once. You’ll automatically enter training mode after you click “+Create”.)
Step 3. Complete basic information. ➜ Click “Continue” ➜ Click “Next”.
Step 4. In the “List of URL” box: Copy and paste all the URLs. ➜ Click “Continue” ➜ Click “Next”.
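Before pasting a long list into the “List of URL” box, it can help to tidy it first. The following is a minimal sketch, not part of Octoparse: it assumes the URLs are kept one per line in plain text, and the helper name `clean_url_list` is hypothetical.

```python
from urllib.parse import urlparse


def clean_url_list(pasted):
    """Return the HTTP(S) URLs found in pasted text, one per line, without duplicates."""
    seen = set()
    urls = []
    for line in pasted.splitlines():
        url = line.strip()
        if not url:
            continue  # skip blank lines
        parts = urlparse(url)
        # Keep only entries that look like absolute web addresses.
        if parts.scheme not in ("http", "https") or not parts.netloc:
            continue
        if url not in seen:
            seen.add(url)
            urls.append(url)  # preserve the original order
    return urls


# Hypothetical pasted text with a blank line, stray spaces, a duplicate, and a non-URL.
pasted_text = (
    "https://example.com/a\n\n  https://example.com/b  \n"
    "https://example.com/a\nnot a url\n"
)
cleaned = clean_url_list(pasted_text)
```

The cleaned list can then be joined with newlines and pasted into the box.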
Step 5. Octoparse will automatically open the first webpage and display the content in the built-in browser. (Note: You can select any content you want to extract.)
All the links will be selected. Click the "Field Name" to modify it. ➜ Click “Continue” ➜ Click “Next”.
Step 6. Pagination: Choose the “Pagination” option. ➜ Click “Next Page” in the built-in browser. ➜ Click "Next".
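Conceptually, Steps 5 and 6 describe a common pattern: collect the item links on a list page, then follow the “Next Page” link until none remains. The sketch below illustrates that pattern with Python's standard library only. It is not how Octoparse is implemented; the names `LinkCollector`, `crawl_list_pages`, and `SAMPLE_PAGES` are hypothetical, and the pages are in-memory strings rather than a real site.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin


class LinkCollector(HTMLParser):
    """Collects (href, link text) pairs for every <a href=...> on a page."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None
        self._text = []

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append((self._href, "".join(self._text).strip()))
            self._href = None


def crawl_list_pages(start_url, fetch, next_label="Next Page", max_pages=50):
    """Follow the pagination link from start_url and return all other links found.

    fetch is any callable that maps a URL to its HTML text.
    """
    detail_urls = []
    visited = set()
    url = start_url
    while url and url not in visited and len(visited) < max_pages:
        visited.add(url)
        parser = LinkCollector()
        parser.feed(fetch(url))
        next_url = None
        for href, text in parser.links:
            absolute = urljoin(url, href)  # resolve relative links
            if text == next_label:
                next_url = absolute
            elif absolute not in detail_urls:
                detail_urls.append(absolute)
        url = next_url  # None ends the loop when there is no next page
    return detail_urls


# Hypothetical two-page listing used in place of real HTTP requests.
SAMPLE_PAGES = {
    "https://example.com/list?page=1": (
        '<a href="/item/1">One</a><a href="/item/2">Two</a>'
        '<a href="/list?page=2">Next Page</a>'
    ),
    "https://example.com/list?page=2": '<a href="/item/3">Three</a>',
}
detail_urls = crawl_list_pages("https://example.com/list?page=1", SAMPLE_PAGES.__getitem__)
```

The `visited` set and the `max_pages` cap guard against pagination links that loop back on themselves.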
Step 7. Octoparse will automatically open the first link and display the detailed web page content in the built-in browser.
Step 8. Select the content you want to extract. (Note: You can select any content.) All the data selected will be listed in "Define Fields". Click the "Field Name" to modify it. ➜ Click “Continue” ➜ Click “Next”.
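Defining fields on a detail page amounts to mapping parts of the page to named columns. As a rough illustration only, the sketch below maps CSS class names to field names and gathers the text inside matching elements. The class names, field names, and the `FieldExtractor` and `extract_fields` helpers are all hypothetical, and Octoparse's own selection mechanism may work differently.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not be pushed on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class FieldExtractor(HTMLParser):
    """Collects the text of elements whose class maps to a field name."""

    def __init__(self, class_to_field):
        super().__init__()
        self.class_to_field = class_to_field
        self.record = {name: "" for name in class_to_field.values()}
        self._stack = []  # field name (or None) for each currently open tag

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        field = next(
            (self.class_to_field[c] for c in classes if c in self.class_to_field),
            None,
        )
        self._stack.append(field)

    def handle_endtag(self, tag):
        if tag in VOID_TAGS:
            return
        if self._stack:
            self._stack.pop()

    def handle_data(self, data):
        # Credit the text to the nearest enclosing element that maps to a field.
        for field in reversed(self._stack):
            if field is not None:
                self.record[field] += data
                break


def extract_fields(html, class_to_field):
    """Return {field name: text} for one detail page, with whitespace normalized."""
    parser = FieldExtractor(class_to_field)
    parser.feed(html)
    return {name: " ".join(text.split()) for name, text in parser.record.items()}


# Hypothetical detail page and field definitions.
sample_html = (
    '<div><h1 class="title">Blue <b>Widget</b></h1><br>'
    '<span class="price big"> $9.99 </span><p>ignored</p></div>'
)
record = extract_fields(sample_html, {"title": "Title", "price": "Price"})
```

Running the same field definitions over every detail URL yields one record per page, which matches the row-per-item layout described for the extracted data.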
Step 9. Choose “Local Extraction” ➜ “OK”.
Octoparse will automatically extract all the data selected. The data extracted will be shown in the "Data Extracted" pane.
It’s done! Click “OK” in the pop-up “Extraction Completed” window.
Step 10. Click the export button to export the results to an Excel file, a database, or another format, and save the file to your computer.
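For readers who want to post-process the results outside Octoparse, CSV is one widely supported format that Excel can open. The following is a small sketch of writing extracted records to CSV text with Python's standard `csv` module; the `records_to_csv` helper and the sample records are hypothetical and do not describe Octoparse's own export.

```python
import csv
import io


def records_to_csv(records, field_names):
    """Serialize a list of dict records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(
        buffer,
        fieldnames=field_names,
        extrasaction="ignore",  # drop keys that are not listed as columns
        restval="",             # leave missing fields empty
    )
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


# Hypothetical extracted records: one has an extra key, one lacks a price.
sample_records = [
    {"Title": "Widget, large", "Price": "$9.99", "Extra": "x"},
    {"Title": "Gadget"},
]
csv_text = records_to_csv(sample_records, ["Title", "Price"])
```

To save the result to disk, open the target file with `newline=""` as the `csv` module documentation recommends, so that line endings are not doubled on Windows.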
Happy Data Hunting!
Author: The Octoparse Team