"Hard line breaks, for example, ruin the text flow,
making it virtually impossible to read the book on a mobile device.
Another problem is that most books are stored as single files, so
locating a particular chapter or section in a lengthy book can
quickly become a serious nuisance. Then there are minor, but still
annoying formatting quirks, such as inconsistent handling of
italicized text, use of straight quotes instead of smart ones, and
so on.
"Fixing all these and other issues manually to make an etext
readable -- or even printable -- is a very daunting proposition,
especially when dealing with longer texts. Thankfully, the
GutenMark tool can take most of the burden off your shoulders. The
utility converts Project Gutenberg etexts into neatly formatted
HTML or LaTeX files. The goal of the GutenMark project is to create
a tool that produces files that don't require any additional
cleanup and tweaking. While there is still some way to go before
this goal is achieved, GutenMark does a remarkable job of turning
etexts into readable and printable files."