This project has moved and is read-only. For the latest updates, please go here.

How to handle free text containg characters such as '<' or '>'?

Sep 9, 2013 at 2:11 PM
Edited Sep 9, 2013 at 2:16 PM
The example below does not work:
<span class='ocrx_word' title='bbox 741 2162 915 2291'><>¢f,/‘;/.</span>
Whenever a html file is processed using the HTML Agility Pack (HAP) libraries that I am using the above free text found above between the span tags fails and throws an exception.

The below works:
<span class='ocrx_word' title='bbox 741 2162 915 2291'>¢f,/‘;/.</span>
For some reason it doesn't like '<' and '>'. Any ideas how to get around this problem, HAP seems to be doing the job for me so far. I don't think I should have to code in a manual step to ensure that angle brackets are properly substituted (& l t ;). Any ideas on how I might solve this problem? I would have thought HAP would have me covered here but it seems like it doesn't.