Re-format Captured Data (Add prefix, replace text,etc.) in Octoparse
Thursday, May 26, 2016 6:27 AMWelcome to Octoparse’ s tutorial. Sometimes you want to replace the captured content with something else you need. In this video, I’m going to show you how to reformat captured data. Let’s get started.
I’m going to take a single page on www. realtor. com.
First, open the page in the built-in browser.
If you extract the price here, there will be a dollar sign in the data extracted. If you export the data with dollar sign to your database, things will get complicated if you do the statistical work. So we need to get rid of it.
First, choose the field you want to reformat > Select the “Customize Field” button > Choose “Reformat extracted data” > Click “Add step”.Select “Replace Strings”.
Copy the dollar sign and paste it into the “Replace” box.
Don’t type anything in the “With” box. Leave it blank. Click “Calculate”. And the dollar sign will be removed. Then click “OK”. Now the final output data has no dollar sign. Click “done”.
Now there’s no dollar sign in the data you captured.
You’ve know how to reformat the captured data now. Why not try it now?
Another article you can't miss that teach you how to re-format extract data with RegEx: How to Extract information from Yelp
Download Octoparse at www.octoparse.com.
Join us on Facebook, Spaces, Pinterest and share your ideas with us!
If this video tutorial is not available for you, you can click hereto see the corresponding graphic tutorial.