This class can be used to convert HTML pages into Microsoft Word documents in the XML format.
It can parse a HTML document given as a HTML data string or a page URL. Then it extracts the HTML document header and body and rewrite it with a Microsoft Word document XML header.