undefined

What is HTML Transcoding?

The updated version of this tutorial (based on the latest webpage) is available now. Go to have a check here! 

 

HTML transcoding is a kind of data re-format, which converts some HTML tags into plain text to help users to observe the source code easily after they extract the HTML of a web. For example, it can transcode “&gt” into “>” or “&lt;” into a “<”.

It is easy to find it if you follow these steps:

Choose a data field ➜Click "Customize Field". ➜Click "Re-format extracted data".➜Click “Add steps”.➜ Choose “Html transcoding”.

All the conversion will be automatically done well after you click OK.

This function actually is seldom used compared to other data re-format functions such as “Replace with Regular Expression”. Click here to know more about the powerful functions of data re-format, helping to make your data clearer!

 

 

Author: The Octoparse Team

Download Octoparse Today

For more information about Octoparse, please click here.

Sign up today!