Page:Crowdsourcing and Open Access.djvu/19

 historical texts and make them freely available online would appear to be one obvious solution.

1. Distributed Proofreaders and Project Gutenberg
Project Gutenberg is one of the oldest digital library projects, having been launched in 1971 at the University of Illinois. In the decades since, founder Michael Hart and many other Project Gutenberg contributors have made tens of thousands of books available for online browsing or download in a variety of formats.

In 2000, a group of Project Gutenberg contributors launched a companion Web site, Distributed Proofreaders (“DP”), with the aim of using collaborative techniques to help expand the library of texts available through Project Gutenberg. Their efforts have paid handsome dividends; indeed, as of the time of this writing, a majority of all texts available through Project Gutenberg were contributed via Distributed Proofreaders.

Distributed Proofreaders hosts scanned images—that is to say, pictures—of the pages of new texts that are candidates to be added to Project Gutenberg. Registered users of the site may select a text of interest to them from the listing of currently active proofreading projects. The site then displays to the user a split-screen window showing both a scanned image of the selected page of the work and the corresponding text that appears on that page (generated initially via optical character recognition (OCR) software). Because the uncorrected OCR output frequently contains errors, the text in the lower portion of the split-screen display may not exactly match the