I have several html documents that I need to extract data out off. Currently I am using MS Excel to exctract the tables from the html pages, my company would like to get away from this practice. I'm still realatively new to programming so bare
First Question. How do I Parse a web page from an external (not mine) website or with a saved html document to my Harddrive, the examples that I have seen are vary vague.
I have used the HAPExplorer to get to the table that I need in one of my HTML documents however the XPath looks like this: /html/body/table/tr/td/table/tr/td/placeholder/table/tr/td/table/tr/td/table/tr/td/table/tr/td/div/table/tr/td.
How in the world do I write this in code?
Any assistance would be greatly welcomed.