URLs - Wizard Mode

Wednesday, March 9, 2016 9:23 PM

Extract data from a list of URLs with similar layouts


Step 1. Download Octoparse and install it. Register a new account at www.octoparse.com. Or directly click the “Sign up” option the Login interface.


Start interface.


If it's the first time you log in, you’ll see the User Guide interface.


Step 2. Wizard Mode: Click “Start” ➜ Go to “URL List Extraction” ➜  “+Create” to start a task.

(Note: You need to learn the training section at least one time. you’ll automatically enter into its training section mode after you click “+Create”.)


Step 3. Complete basic information. ➜ Click “Continue” ➜ Click “Next”.


Step 4. In the “List of URL” box: Copy and paste all the URLs. ➜  Click “Continue” ➜ Click “Next”.


Step 5. Octoparse will automatically open the first webpage and display the content in the built-in browser. (Note: You can select any content you want to extract.)


All the content selected is  in the Data Fields. Click the "Field Name" to modify.    Click “Continue” ➜ Click “Next”.


Step 6. Run the task on your computer (Local Extraction) ➜ Click “OK” .


Octoparse will automatically extract all the data selected. The data extracted will be shown in "Data Extracted" pane.


It’s done! Click “OK” in the pop-up “Extraction Completed” window.

Step 7. Click  button to export the results to Excel file, databases or other formats and save the file to your computer.


Happy Data Hunting!







Author: The Octoparse Team




Download Octoparse Today





For more information about Octoparse, please click here.

Sign up today.



Author's Pick


About Octoparse

Collect Data from LinkedIn

Collect Data from Amazon

Collect Data from Yelp

Collect Data from eBay

Collect Data from Gumtree.com

Collect Data from Facebook

Get Started with Octoparse in 2 minutes

A Comparison among Three Editions of Octoparse





We use cookies to enhance your browsing experience. Read about how we use cookies and how you can control them by clicking cookie settings. If you continue to use this site, you consent to our use of cookies.
Accept decline