Scrape Influencers from LinkedinWednesday, September 28, 2016 9:26 AM
(Download my extraction task of this tutorial HERE just in case you need it.)
If you are interested in the influencers in different areas, you could browse it from Linkedin. If you want to dig more information behind that, you could scrape the data from Linkedin and export it in visual format like Excel and then analyze it.
In this tutorial, I will take Linkedin for example to show you how to scrape data from similar layout of the page by using the “Loop Item” of a URLs list.
Step 1. Set up basic information.
Choose “Advanced Mode” ➜ Choose “List of URLs” ➜Complete basic information.
Step 2. Design Workflow.
Drag a “Loop” action into the Workflow Designer. ➜ Then paste the URL list you want to scrape in the “List of URLs”. ➜ Click “OK” ➜ Click “Save”.
Then it will automatically go to the website.
Step 3. Extract data.
Since the elements you want to scrape is in the similar layout, you could create a list of items to loop the data extraction process.
Click the first highlighted link ➜ Create a list of sections with similar layout.
Click “Create a list of items” (sections with similar layout). ➜ “Add current item to the list”. Then the first highlighted link has been added to the list. ➜ Click “Continue to edit the list”.
Click the second highlighted link ➜ Click “Add current item to the list” again. Now we get all the links with similar layout.
Then click “Finish Creating List” ➜ Click “Loop” to process the list for extracting the elements in each page.
Step 4. Extract the search results.
Extract the title of the first section. ➜ Click the title. ➜ Select “Extract text”. Other contents can be extracted in the same way.
Step 5. Modify
All the content will be selected in Data Fields. ➜ Click the “Field Name” to modify.
Step 6. Run the extraction task.
Click “Next” ➜ Click “Next” ➜ Click “Local Extraction” ➜ “OK” to run the task on your computer. Octoparse will automatically extract all the data selected.
Step 7. Export the extracted data.
The data extracted will be shown in “Data Extracted” pane. Click “Export” button to export the results to Excel file, databases or other formats and save the file to your computer.
Author: The Octoparse Team
For more information about Octoparse, please click here.
Sign up today!