Html Tidy option in Html Agility Pack

Topics: Developer Forum, Project Management Forum, User Forum
Mar 22, 2010 at 6:55 AM

I want to parse html content using Html Agility Pack. I want to extract the tabular information from html file. Now in some cases there are missing ending tags in some html files, so where there is missing ending tag the information is not parsed properly. So does the html agility pack insert the ending tags where necessary , is there any option like it  or does html agility pack perform html tidy before parsing the html content ?

Mar 24, 2010 at 11:20 AM

It fixes the end tag, while parsing the html content

Mar 24, 2010 at 11:27 AM
Edited Mar 24, 2010 at 11:28 AM

No it does not because the result with parsing of the table

<table>

<tr><td>Id</td><td>Name</td>

<tr><td>00001</td><td>Mr.Aleson</td>

<tr><td>00002</td><td>Mr.Bill</td>

</table>

gives me wrong output instead of

Id Name
00001 Mr.Aleson
00002 Mr.Bill