An agile HTML parser that builds a read/write DOM and supports XPATH or XSLT. Here is an assembly that allows you to parse “out of the web” HTML files. The parser is very tolerant with “real world” malformed HTML. The object model is very similar to what proposes System.Xml, but for HTML documents.