undefined

How to avoid data being marked as duplicates in the cloud?

The updated version of this tutorial (based on the latest webpage) is available now. Go to have a check here! 

 

Octoparse cloud extraction would identify and delete duplicates automatically. It is convenient when users only need the newly updated data but also causes some problems when they try to compare the results with the last extraction.

Here is one tip for those who need to keep duplicates in the cloud: simply use "Add Pre-defined Fields" to add the extraction time. As the extraction time is always different, every data row would be identified as a new row in the cloud.

 

Want to know more about Pre-defined Fields in Octoparse? We have some related tutorials for you:

How to add a fixed value when scraping in Octoparse?

How to add a blank field when scraping in Octoparse?

How to get current page URL when scraping in Octoparse?