developerWorks: Convert from HTML to XML with HTML Tidy
Oct 01, 2003, 07:00 (0 Talkback[s])
(Other stories by Benoit Marchal)
"One the challenges that webmasters face when converting from
pure HTML to XML/XSL is the preservation of their legacy Web sites.
Because it would be too costly to dump the old site and start again
from scratch, some sort of automated procedure that brings the HTML
site to XML is required.
"Even XML converts have to deal with HTML files: Many products
have added an option for exporting HTML documents -- an option you
might want to integrate into your Web site.
"This tip discusses HTML Tidy, a powerful tool to help convert
old HTML pages to newer standards, such as XML. Tidy is distributed
as open source..."