Scripting an OCR text archiver for Trove

Trove is the National Library of Australia’s online database. It contains almost 400 000 000 digital items, including Australian newspaper articles from 1803 to 1954.
Trove’s newspaper portal is particularly useful. You can view an entire digitised newspaper page, or select (or search for) an individual article. Each article has been run through optical character recognition (OCR), and the resulting text is shown in a separate box to the left of the page viewer.