Processing .MHTML file

Topics: Developer Forum, Project Management Forum, User Forum
Mar 4, 2010 at 4:30 AM

Hi,

I am new to XML and MHTML Parsing,

I have a requirement to Extract data from an .mhtml file,which is of 40 MB, I don't know how to do it. I am using VB.net (Visual Studio- 2005)

My File Structure will be Something like this:

 

<html>

-----
-----
<h1> 1.1 My Header1 [Key1] </h1>

<h2> 1.1.2 My Sub Header1 [Key2] </h2>

<h3> 1.1.2.1 My Child Header1 [Key3] </h3>

----

----


</html>
I need to Extract all the Keys [Key1, Key2, Key3] Separately and it's corresponding Header tag, I need to Store them in a tree like pattern, kindly let me know how to do this using the HtmlAgilityPack.
 
Thanks in Advance