Scrape Influencers from LinkedIn

Wednesday, September 28, 2016 9:26 AM

(Download my extraction task for this tutorial HERE, just in case you need it.)

If you are interested in influencers in different fields, you can browse them on LinkedIn. If you want to dig deeper, you can scrape the data from LinkedIn, export it to a tabular format such as Excel, and then analyze it.

In this tutorial, I will use LinkedIn as an example to show you how to scrape data from pages with a similar layout by using a “Loop Item” over a list of URLs.

Step 1. Set up basic information.

Choose “Advanced Mode” ➜ Choose “List of URLs” ➜ Complete the basic information.

 

Step 2. Design Workflow.

Drag a “Loop” action into the Workflow Designer. ➜ Paste the list of URLs you want to scrape into “List of URLs”. ➜ Click “OK” ➜ Click “Save”.

Octoparse will then go to the website automatically.
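
For readers who think in code, the idea behind looping over a pasted URL list can be sketched in Python as follows. This is only an illustration of the concept, not how Octoparse works internally: the function names parse_url_list and loop_over_urls are made up for this example, and the example.com addresses are placeholders rather than real LinkedIn pages.

```python
from urllib.parse import urlparse


def parse_url_list(pasted_text):
    """Turn a pasted block of text (one URL per line) into a clean list."""
    urls = []
    seen = set()
    for line in pasted_text.splitlines():
        url = line.strip()
        if not url:
            continue  # skip blank lines
        parts = urlparse(url)
        # Keep only absolute http(s) URLs that name a host.
        if parts.scheme not in ("http", "https") or not parts.netloc:
            continue
        if url in seen:
            continue  # drop exact duplicates, keeping the first occurrence
        seen.add(url)
        urls.append(url)
    return urls


def loop_over_urls(urls, visit):
    """Call visit(url) for each URL in order and collect the results."""
    return [visit(url) for url in urls]


# Hypothetical placeholder URLs, not real LinkedIn pages.
pasted = """
https://example.com/influencers/technology
https://example.com/influencers/finance

https://example.com/influencers/technology
not-a-url
"""
urls = parse_url_list(pasted)
pages = loop_over_urls(urls, lambda url: {"url": url})
```

Here the visit callback simply records the URL. In a real script it would be the place to load the page and run the extraction steps.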

 

Step 3. Extract data.

Since the elements you want to scrape share a similar layout, you can create a list of items and loop the data extraction process over it.

Click the first highlighted link, then create a list of sections with a similar layout:

Click “Create a list of items” (sections with similar layout) ➜ Click “Add current item to the list”. The first highlighted link is now added to the list. ➜ Click “Continue to edit the list”.

Click the second highlighted link ➜ Click “Add current item to the list” again. The list now contains all the links with a similar layout.

Then click “Finish Creating List” ➜ Click “Loop” to process the list and extract the elements on each page.
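
Why does adding just two items give you the whole list? This tutorial does not describe how Octoparse does it internally, so the Python sketch below is only one plausible explanation: if the two clicked links sit at paths that differ only by a position index, dropping that index gives a pattern that matches every sibling. The function name generalize_paths and the sample paths are assumptions made up for this illustration.

```python
import re

# Matches a trailing positional index such as "[2]" in "li[2]".
_INDEX = re.compile(r"\[\d+\]$")


def generalize_paths(path_a, path_b):
    """Return a path matching both inputs, or None if they differ in shape.

    Where the two paths differ only by a positional index (li[1] versus
    li[2]), the index is dropped so that the step matches every sibling.
    """
    steps_a = path_a.strip("/").split("/")
    steps_b = path_b.strip("/").split("/")
    if len(steps_a) != len(steps_b):
        return None
    merged = []
    for a, b in zip(steps_a, steps_b):
        if a == b:
            merged.append(a)  # identical step, keep it as-is
        elif _INDEX.sub("", a) == _INDEX.sub("", b):
            merged.append(_INDEX.sub("", a))  # same tag, different index
        else:
            return None  # different tags, so the layouts are not alike
    return "/" + "/".join(merged)


# Hypothetical paths of the first and second highlighted links.
first = "/html/body/div[2]/ul/li[1]/a"
second = "/html/body/div[2]/ul/li[2]/a"
pattern = generalize_paths(first, second)
```

With these sample paths, pattern is "/html/body/div[2]/ul/li/a", which would match the link in every list item rather than only the two that were clicked.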

 

Step 4. Extract the search results.

Extract the title of the first section: click the title ➜ Select “Extract text”. Other content can be extracted in the same way.
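
If you are curious what “Extract text” amounts to, here is a rough Python equivalent built on the standard library's html.parser. It is a sketch under assumptions, not Octoparse's implementation: the class name TextByClass, the CSS class "title" and the sample HTML are all invented for this example, and the code assumes reasonably well-nested markup.

```python
from html.parser import HTMLParser

# Elements that never have a closing tag, so they must not affect nesting depth.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class TextByClass(HTMLParser):
    """Collect the visible text of every element carrying a given CSS class."""

    def __init__(self, class_name):
        super().__init__()
        self.class_name = class_name
        self.depth = 0      # greater than 0 while inside a matching element
        self.chunks = []    # text pieces of the element being read
        self.results = []   # one cleaned-up string per matching element

    def handle_starttag(self, tag, attrs):
        if tag in VOID_TAGS:
            return
        classes = (dict(attrs).get("class") or "").split()
        if self.depth or self.class_name in classes:
            self.depth += 1

    def handle_endtag(self, tag):
        if tag in VOID_TAGS or not self.depth:
            return
        self.depth -= 1
        if self.depth == 0:
            # Join the pieces and collapse runs of whitespace.
            self.results.append(" ".join("".join(self.chunks).split()))
            self.chunks = []

    def handle_data(self, data):
        if self.depth:
            self.chunks.append(data)


# Made-up sample page with two sections that share the same layout.
sample = """
<div class="post">
  <h3 class="title">Why <b>data</b> matters</h3>
  <span class="author">A. Example</span>
</div>
<div class="post">
  <h3 class="title">Hiring in 2016</h3>
  <span class="author">B. Example</span>
</div>
"""
parser = TextByClass("title")
parser.feed(sample)
titles = parser.results
```

Running the same parser with "author" instead of "title" would collect the second field, which mirrors extracting the other content in the same way.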

 

Step 5. Modify the field names.

All the selected content will be listed in Data Fields. ➜ Click a “Field Name” to modify it.
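
In code terms, renaming fields is just a mapping from old column names to new ones. The short Python sketch below shows the idea. The names "Field1" and "Field2" are assumed placeholders for whatever default names the tool assigns, and rename_fields is a helper invented for this example.

```python
def rename_fields(rows, new_names):
    """Return copies of the rows with keys renamed according to new_names.

    Keys that are not listed in new_names are left unchanged.
    """
    return [
        {new_names.get(key, key): value for key, value in row.items()}
        for row in rows
    ]


# Made-up extracted rows with assumed default field names.
rows = [
    {"Field1": "Why data matters", "Field2": "A. Example"},
    {"Field1": "Hiring in 2016", "Field2": "B. Example"},
]
renamed = rename_fields(rows, {"Field1": "title", "Field2": "author"})
```

Descriptive names such as "title" and "author" make the exported file much easier to read than the defaults.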

 

Step 6. Run the extraction task.

Click “Next” ➜ Click “Next” ➜ Click “Local Extraction” ➜ Click “OK” to run the task on your computer. Octoparse will automatically extract all the selected data.

 

Step 7. Export the extracted data.

The extracted data will be shown in the “Data Extracted” pane. Click the “Export” button to export the results to an Excel file, a database, or another format, and save the file to your computer.
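
If you later want to reproduce the export step in a script, writing the rows to a CSV file (which Excel can open) is a simple option. The sketch below uses Python's standard csv module. The helper name write_csv and the sample rows are made up for this illustration, and it writes to an in-memory buffer so that it runs without touching your disk.

```python
import csv
import io


def write_csv(rows, fieldnames, out):
    """Write a header line followed by one CSV line per row to out."""
    # extrasaction="ignore" skips keys that are not listed in fieldnames.
    writer = csv.DictWriter(out, fieldnames=fieldnames, extrasaction="ignore")
    writer.writeheader()
    writer.writerows(rows)


# Made-up rows standing in for the extracted data.
rows = [
    {"title": "Why data matters", "author": "A. Example"},
    {"title": "Hiring in 2016", "author": "B. Example"},
]
buffer = io.StringIO()
write_csv(rows, ["title", "author"], buffer)
csv_text = buffer.getvalue()
```

To save a real file, you could pass open("influencers.csv", "w", newline="", encoding="utf-8-sig") instead of the buffer. The file name is only an example, and the utf-8-sig encoding is a common choice that helps Excel display non-English characters correctly.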

 

 

Author: The Octoparse Team


 
