User:Rho9998/Adding Old English works to Wikisource

Choosing a text
Generally speaking, works written in English up to c.1100 AD are considered Old English works.

A list of Old English works may be found at the Category pages for prose and poetry.

At list of texts at the Toronto Dictionary of Old English, editions for each Old English text within the public domain can be found.

Uploading texts to Wikisource
Once you have chosen a text and found an edition, you can look it up on the [archive.org|Internet Archive].

It is then necessary to upload the text in .djvu format to Wikimedia Commons and then create an Index page for it on Wikisource. For instructions of how to upload texts from Internet Archive, see this |YouTube tutorial.

NOTE: Once uploaded to Wikimedia Commons, if the text's language is marked as 'ang' (Old English), it is possible that you will have to change the 'Wikisource' detail on the file description from | Wikisource  = s:ang:Index:PAGENAME to | Wikisource   = s:Index:PAGENAME. If this isn't done then the index will be created on the now-defunct Englisċ Wicipedia. If this is done, then an Index page for the edition will be created when you press on the Wikisource logo on the tab of the 'Summary' template on Wikimedia Commons.

Proofreading
Proofreading Old English works may present difficulties unencountered in other texts, owing to the presence of extra letters (e.g. þ, æ), punctuation marks (·, ⁊), and letters with diacritics such (ġ, ċ, ū, ȳ).

Use Google OCR
For best results, use Google OCR to transcribe the text. Failing that, Tesseract OCR with the language changed to 'enm' (Middle English) under 'Advanced Options' will sometimes transcribe the OE special characters.

Apply vowel accent fixes
Since Google OCR will interpret letters with macrons as various accents, User:Inductiveload has made a fix for this, which replaces various accents with a macron. Copy lines 1-35 of the code found here (User:Rho9998/common.js) and paste/edit into the page User:Your username/common.js, saving changes afterwards.

Special characters
After using Google OCR and the vowel accent fixes, there will likely still be some errors in the transcription. It may thus be necessary to input special characters yourself.

The special characters required are at the bottom of the list on the left hand side once 'Special characters' has been selected at the top of the editor toolbar.