“This document describes how to set up Tesseract OCR on Ubuntu
7.04. OCR stands for ‘Optical Character Recognition.’ The resulting
system will be able to convert images containing text into text
files. Tesseract is licensed under the Apache License v2.0.
“This howto is meant as a practical guide; it does not cover the
theoretical background, which is treated in many other documents
on the web…”