undefined

Scheduled Data Extraction - Octoparse Cloud Web Scraping Service

Monday, November 28, 2016 3:44 AM

For the latest tutorials, visit our new self-service portal. Sharpen your skills and explore new ways to use Octoparse.

 

By now, you've finished building your first scraping task and know how to run the task to get the data you need. Let's take it to the next level and find out how you can make your daily scraping routines more effective and efficient using task scheduling, auto-data export, and API. 

1. Task Scheduling

2. Auto-data export (for Cloud data)

3. Connect using API

4. Connect using Zapier

 

Task scheduling

If you are planning on getting data extracted on any regular basis, task scheduling is exactly what you need and can save you a lot of time. You can schedule your task to run once, on a recurring schedule, or even run repeatedly, such as every 1 min, 5 mins, 10 mins, or 30 mins. 

1. Find your task on the Dashboard, click the show more icon then choose "Cloud runs" and select "Set schedule".

 schedule1

You can also set up local schedules to run the tasks on your device on a regular basis.

2. Choose how often you would like to run the task. 

3. For recurring crawls, select the day of the week/day of the month and the time of the day to run your task.

For repeating crawls, select the desired time interval.

 schedule2

4. You can also save the settings for later use. Give the settings a name and click "Save". This way, you can always select the saved schedule setting and apply it directly to any other tasks. 

5. After everything's done, click "Save and Run" to start running the task on the schedule right away.  

6. Once you have the schedule set up, you can easily turn it ON and OFF by clicking the show more icon on the Dashboard, then select "Cloud runs", there you can choose "Schedule ON" or "Schedule OFF". 

7. When a task is scheduled, you'll see the next run time on the Dashboard. Click the + sign on the Dashboard, then select "Next Run". This way, you'll have a clear picture of the tasks that are scheduled and when the next run is expected. 

schedule3 

 

Auto-data export (for Cloud data)

Data export to the database can also be automated and scheduled. If you need to export data to your databases on a regular basis, data export scheduling can save you tons of work. 

1. Load the cloud data for your task. 

2. Click on "Export Data" 

3. Click open "Auto-export to database", then select the type of database you have.

4. Complete the information to connect to your database. Click "Test connection" to test if the database is connected successfully. Then, click "Next" to proceed and map the data fields, and choose the desired time interval for the export. 

 

5. Lastly, click "Next" to finish the process.

 

Connect using API

With the Octoparse API, you can run scraping tasks, retrieve the extracted data, and even edit your tasks programmatically via coordinating with your own application. 

Check this step-by-step tutorial on how to connect to the Octoparse API or see the API documentation for details. 

Connect using Zapier

You can connect the data scraped with other applications (e.g., Google Drive, Google Sheet, Dropbox) easily without coding by using Zapier. Check this to find out more details: How to Connect Octoparse with Zapier

 

Happy Data Hunting!

Author: The Octoparse Team

Download Octoparse Today

 

For more information about Octoparse, please click here.

Sign up today. 

We use cookies to enhance your browsing experience. Read about how we use cookies and how you can control them by clicking cookie settings. If you continue to use this site, you consent to our use of cookies.
Accept decline