2. Get Started

 

 

2.1. Octoparse Basics

       2.1.1. User Interfaces

                  . Operation Panel (Start Interface)

                  . Navigation Panel

                  . Complete Basic Information

                  . Operation Panel (Design Workflow)

                  . Extraction Options

                  . Extraction Options (Done)

                  . Data Extraction Panel (Local Extraction)

                  . Data Extraction Panel (Cloud Extraction)

                  . Workflow Designer (Extraction Rule)

                  . Tasks Manager

       2.1.2. Two Modes in Octoparse

                  . Wizard Mode

                  . Advanced Mode

 

2.1.1. User Interfaces

. Operation Panel (Start Interface)

Currently Octoparse has two modes in the Operation panel: the Wizard mode and the Advanced mode. In the Operation panel, you can choose one of these two modes to start an extraction task.

. Wizard mode is more suitable for beginners. You can grab data from simple web pages by just following the instructions step by step to configure your own task.

. Advanced mode is most commonly used. With advanced mode, you can easily deal with any complex page structures by more powerful features like scheduling feature and cloud servers.

. There are four types of web pages in the learning section of each mode. Before building your very first task, you can click one of these to learn a little bit.

 

. Navigation Panel

Use the Navigation panel to quickly find and open your tasks. In the Navigation panel, you can quickly start a task, manage all the task and check the task status.

 

. Complete Basic Information

Give your task a name. Save it to a category.

Enter any additional notes for the task.

 

. Operation Panel (Design Workflow)

In this Operation Panel, you can start to configure a rule for your task.

Octoparse has a built-in browser that allows you to open the website you plan to crawl. There are many different action icons list on the left hand side of the workflow designer. You can see each action of the rule in the Workflow Designer. These icons are different actions that you may use to configure a task.

 

 

. Extraction Options

You can choose not to load images to speed up the extraction. But sometimes may cause problems on certain websites. 

 

 

. Extraction Options (Done)

Once the Task is completed, you can choose the Local extraction to run the task on your computer/choose Cloud Extraction to run the task in the cloud/create API.

 

 

. Data Extraction Panel (Local Extraction)

The data extracted will be showed in the data extracted pane. You can also see the configured rule of the task. You can also check out the built-in browser to see if the task runs as expected.

Then export the results to Excel files, or other formats and save the file to the computer.

 

 

. Data Extraction Panel (Cloud Extraction)

This is the Cloud Extraction screen. When you choose Cloud Extraction, Octoparse would automatically extract any data you want with high speed and productivity. Cloud Extraction means data extraction tasks running in the cloud. You need to configure a rule and upload it to our cloud platform. Then your task will be reasonably assigned to one or several cloud servers to extract data simultaneously via central control commands.

 

 

. Workflow Designer (Extraction Rule)

Rule: Extraction rule is one of the most important features of Octoparse. The rule configured would tell Octoparse: which website is to be open; where is the data you plan to crawl; what kind of data you want, etc. There are 10 actions for you to make a rule. Just drag and drop these actions to configure your rule and make sure the information that you plan to crawl from the website, then Octoparse would collect the data fro you automatically.

You do not need to write any code in Octoparse. Just tell Octoparse what you want it to do by dragging actions into the workflow designer and selecting options to optimize the process.

 

. Tasks Manager

You can manage all the task you create in the Navigation Panel. Right click one of you task and choose one option in the pop-up menu.

 

2.1.2. Two Modes in Octoparse

 

. Wizard Mode

Wizard mode is more suitable for beginners. You can grab data from simple web pages by just following the instructions step by step to configure your own task.

 

. Advanced Mode

Advanced mode is is most commonly used. With advanced mode, you can easily deal with any complex page structures by more powerful features like scheduling feature and cloud servers.

 

 

Download Octoparse Today

 

 

 

Contact
us

Leave us a message

Your name*

Your email*

Subject*

Description*

Attachment(s)

Attach file
Attach file
Please enter details of your issue and we will get back to you ASAP.