What is HTML Transcoding?

The updated version of this tutorial (based on the latest webpage) is available now. Go to have a check here! 


HTML transcoding is a kind of data re-format, which converts some HTML tags into plain text to help users to observe the source code easily after they extract the HTML of a web. For example, it can transcode “&gt” into “>” or “&lt;” into a “<”.

It is easy to find it if you follow these steps:

Choose a data field ➜Click "Customize Field". ➜Click "Re-format extracted data".➜Click “Add steps”.➜ Choose “Html transcoding”.

All the conversion will be automatically done well after you click OK.


This function actually is seldom used compared to other data re-format functions such as “Replace with Regular Expression”. Click here to know more about the powerful functions of data re-format, helping to make your data clearer!



Author: The Octoparse Team

Download Octoparse Today

For more information about Octoparse, please click here.

Sign up today!

We use cookies to enhance your browsing experience. Read about how we use cookies and how you can control them by clicking cookie settings. If you continue to use this site, you consent to our use of cookies.
Accept decline