HTML consistency

Topics: Developer Forum
Feb 5, 2009 at 8:53 PM

I use the HTML Agility Pack APIs to fix relative URIs after fetching the raw HTML from a web site. But when I save the HtmlDocument, I see new nodes which really weren't there before. So it is changing the document tree, which is unexpected behaviour. It changes the layout of the page completely.

Are there any flags I need to set to get around this behaviour?

Mar 10, 2009 at 4:56 AM
You're going to need to write this relative URI resolver yourself, AFAIK.

Browsers resolve relative URLs based on the base URL (if one exists) and the URL itself.

AFAIK, resolution is consistent across browsers.
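
As a rough illustration of that rule, here is a minimal sketch in Python (picked only for brevity, since the thread has no code of its own). The function name `resolve` and the example.com URLs are made up for the example. It assumes the base is the page's `<base href>` when one is present, itself resolved against the page URL, and otherwise the page URL. In .NET, `new Uri(baseUri, relativeUri)` should give you the same kind of result.

```python
from urllib.parse import urljoin


def resolve(page_url, href, base_href=None):
    """Resolve href the way a browser would (hypothetical helper).

    page_url:  URL the HTML was fetched from
    href:      the (possibly relative) URL found in the markup
    base_href: value of <base href="...">, if the page has one
    """
    # A <base href> may itself be relative, so resolve it against the page first.
    base = urljoin(page_url, base_href) if base_href else page_url
    # Absolute hrefs pass through unchanged; relative ones are merged with base.
    return urljoin(base, href)


page = "http://example.com/a/b/page.html"

print(resolve(page, "img/logo.png"))  # http://example.com/a/b/img/logo.png
print(resolve(page, "../c.html"))     # http://example.com/a/c.html
print(resolve(page, "/root.css"))     # http://example.com/root.css
print(resolve(page, "x.html", base_href="/static/"))  # http://example.com/static/x.html
```

Applying something like this to each `href` and `src` attribute value only touches the attribute strings, so it does not by itself add or remove nodes.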

There are plenty of articles about this on the net, for example, seems pretty good

Mar 10, 2009 at 5:08 AM
Heaps of articles on Google.