List & Detail Web Page - Advanced Mode

Thursday, March 24, 2016 5:47 AM

Extract List & Detail Content from Webpages

 

 

Step 1. Download Octoparse and install it. Register a new account at www.octoparse.com. Or directly click the “Sign up” option the Login interface.

 

 

Step 2. Advanced Mode: Go to “Advanced Mode” ➜ “List & Detail” ➜ “Start”.    

 

Step 3. Complete basic information. ➜ Click “Continue” ➜ Click “Next”.

 

Step 4. Enter the target URL in the built-in browser. ➜ Click “Continue”. ➜ “Go” icon to open the webpage.

 

Step 5. Click "Next" ➜ “Loop click next page”( Note: Create a loop action to process all the web pages. ) The action of pagination has been added to the extraction rule.

 

 

Step 6. Click the first highlighted link.

 

Create a list of sections with similar layout. Click "create a list of items”. ➜ "Add current item to the list".  Then the first highlighted link has been added to the list. ➜ Click "Continue to edit the list".

 

 

 

Click the second highlighted link.

 

Click "Add current item to the list" again. Now we get all the links with similar layout. 

 

Then click "Finish Creating List".

 

Click“loop” to process the list for extracting the elements in each page

 

Step 7. Extract the title of the first section. ➜ Click the title. Select "Extract text".

 

 

Step 8. All the content will be selected in Data Fields.

Before executing the extraction rules, drag the second “Loop Item” before "Click to paginate" action in the Workflow Designer so that we can grab all the elements of sections from multiple pages.

 

Click “Next” ➜ Click “Next”. 

 

 

 

Step 9. Click “Local Extraction”.  “OK” to run the task on your computer. Octoparse will automatically extract all the data selected.

 

The data extracted will be shown in "Data Extracted" pane. Click button to export the results to Excel file, databases or other formats and save the file to your computer.

 

 

Happy Data Hunting!

 

 

 

 

 

 

Author: The Octoparse Team

 

 

 

Download Octoparse Today

 

 

 

 

For more information about Octoparse, please click here.

Sign up today.

 

 

Author's Pick

 

About Octoparse

Collect Data from LinkedIn

Collect Data from Amazon

Collect Data from Yelp

Collect Data from eBay

Collect Data from Gumtree.com

Collect Data from Facebook

Get Started with Octoparse in 2 minutes

A Comparison among Three Editions of Octoparse

 

 

 

 

Contact
us

Leave us a message

Your name*

Your email*

Subject*

Description*

Attachment(s)

Attach file
Attach file
Please enter details of your issue and we will get back to you ASAP.