User:Zoeannl/PG Proofreaders Guidelines

Italics and Bold
italic for inline italics (within a paragraph). Whole paragraphs need to be italicised individually with this template. italic block spans more than one paragraph.

"That James Franklin ſhould no longer print the newſpaper entitled the ."

"That James Franklin ſhould no longer print the newſpaper entitled the ."

Italicised dates etc.
Many typefaces found in older books used the same type for numbers in both regular text and italics or bold. For dates and similar phrases, format the entire phrase with one set of markup, rather than marking the words as italics (or bold) and not the numbers.

29 1876 in the scan should be proofread

Italics in poems
Inline italics

Italic verse

Italicized links
The italics markup must be outside the link markup, or the link will not work; however, internal italicization can be used in piped links.

Correct: He died with still unfinished. He died with still unfinished. Correct: The USS (SS-3) was a submarine. The USS (SS-3) was a submarine.
 * He died with still unfinished.

Italicized word spanning pages
Use hws and hwe. Italicize the parts and whole words within the template. When transcluded the split words are joined automatically.

begin- at the bottom of the first page.

-ing at the beginning of the following page.

Bold
bolder

Light
lighter

Underline
Emphasised

Emphasised

Overline
$\overline{Obscure maths}$

$\overline{Obscure maths}$

Strikethrough
strike

Gothic Type
Note:

Serif
serif

Sans-serif
sans-serif

Fantasy
fantasy

Typewritten
Teletype

Monospace

Mono monospaced

TT

Text (gesperrt)
Format    text with the sp template. Remove the extra spaces between letters in each word. This was a typesetting technique used for emphasis in some older books, especially in German.

Coloured Text
Text can be colored using the greyed and red templates. Red text was often used as a highlight in older works, especially on the title page. Greyed text can be used to indicate (important) text that has been written or typed onto the original document.

, ,

, ,

Non-Keyboard Characters
Please proofread these using the proper symbols or accented characters to match the image, where possible, including the use or non-use of accents. Non-keyboard characters can be found on your pull-down menu, see Inserting Special Characters for information on how to input these characters during proofreading. There is also a Gadget Editing tool for keyboard shortcuts under Preferences (tab at top right of this page, if you are logged in).

Characters can be cut and pasted from character lists. See Unicode/List_of_useful_symbols, w:Wikipedia:Mathematical_symbols, right menu lists punctuation, List_of_XML_and_HTML_character_entity_references, punctuation, Unicode_symbols currency symbols, Mathematical_operators_and_symbols_in_Unicode

Typographic_ligature Latin ligatures See the Basic Latin block starting at 0020 FB00 ff LATIN SMALL LIGATURE FF ≈  f  f FB01 fi LATIN SMALL LIGATURE FI ≈  f   i FB02 fl LATIN SMALL LIGATURE FL ≈  f   l FB03 ffi LATIN SMALL LIGATURE FFI ≈  f  f   i FB04 ffl LATIN SMALL LIGATURE FFL ≈  f  f   l FB05 ſt LATIN SMALL LIGATURE LONG S T ≈  ſ   t FB06 st LATIN SMALL LIGATURE ST ≈  s   t ﬀ

Templates are available for

,, SS 1 SS undefined, SS 2, SS 3, SS 4. These should have their own menu.

s, 2 s|2, 1 ss, 2 ss|2, 3 ss|3, 4 ss|4, f f ditto


 * Scriptorium/Archives/2015-05
 * Scriptorium/Help/Archives/2012
 * lang; Category:Typography templates


 * &amp;ensp; |&ensp;| en space[d]
 * &amp;emsp; |&emsp;| emsp
 * &amp;thinsp; |&thinsp;|  thinsp

P.S. style point: Never, ever, EVER use & nbsp; There are always much better options. The code & #160; can be used in a pinch, or, or cellpadding, but there is always a better option than nbsp;. --EncycloPetey (talk) 23:01, 29 March 2015 (UTC)

Bracket, Tilde, Miscellaneous_Technical, Ellipsis, Numerals in Unicode

Suggestions for Gadget Proofreading shortcut keys ^oe ^ae -- macron for Diacritical Marks

diacritical mark	sample	above	below macron (straight line)	¯	[=x]	[x=] 2 dots (dieresis, umlaut)	¨	[:x]	[x:] 1 dot	·	[.x]	[x.] grave accent	`	[`x]	[x`] acute accent (aigu)	´	['x]	[x'] circumflex	ˆ	[^x]	[x^] caron (v-shaped symbol)	∨	[vx]	[xv] breve (u-shaped symbol)	∪	[)x]	[x)] tilde	˜	[~x]	[x~] cedilla	¸	[,x]	[x,]

Special characters muck up templates: # [&amp;#35;] : |  |  &#124;

Non-Latin Characters
Some projects contain text printed in non-Latin languages; that is, characters other than the Latin A...Z—for example, Greek, Cyrillic (used in Russian, Slavic, and other languages), Hebrew, or Arabic characters. These characters are available in the pull-down menus but if you are not sure of your Greek (etc.), you can get help, using a missing template e.g. where the Greek characters should be, for an expert to fill in the gap.

lang

ארץ זבת חלב ודבש ארץ זבת חלב ודבש

funcho funcho

Superscripts
Insert superscript from the pulldown menu. This works well only for single letters. Older books often abbreviated words as contractions, and printed them as superscripts, insert them into the sup $x$ template. For example:

Original Image:

Gen$rl$ Washington defeated Lᵗ Cornwall's army.

Correctly Proofread Text: Gen $rl$ Washington defeated Lᵗ Cornwall's army.

If the superscript represents a footnote marker, then see the Footnotes section instead.

The Project Manager may specify in the Project Comments that superscripted text be marked differently.

Italics are inside the template. E.g. 107$th$ 107$th$

Subscripts
Subscripted text is often found in scientific works, but is not common in other material. Proofread these with subscript from the pulldown menu or if characters are unavailable, insert into the sub $x$ template using the subscripted text. For example:

Original Image: H₂O. H$2$O.

Correctly Proofread Text: H₂O. H$2$O.

Large, Opening Capital Letter
Large initial letters at the start of a chapter, section, or paragraph are duplicated with a template. If it is an ornate initial, note it in the Index Discussion page (with .djvu number) as someone may take the trouble to insert an image instead.

roofread a large first letter that sits below the first line using the drop initial template di.

arely, you may have a large first letter sitting on the first line. Use li  for these.

f you have an apostophe before the initial, use before the initial template

c.f.

Words in Small Capitals
Using the Small capitals template, proofread the characters in Small Caps (capital letters which are smaller than the standard capitals) as lower case letters; capitals remain as capitals. For example

Original Image:


 * B.C.
 * B.C.
 * B.C.
 * B.C.

Correctly Proofread Text:


 * B.C.
 * B.C.
 * B.C.
 * B.C.

It is considered good form to include a whole name etc. within the template rather than only the part with small caps, as in. the If the OCR has converted small caps to all capitals, you must retype the words into the edit page. If it is a long passage it may be worth using the lower case template, lc, previewed, to convert the passage. Copy and paste into the Edit page.

lowercase lowercase lc

UPPERCASE UPPERCASE uc

capitalize

asc

Upper and lower case
All caps

uc

lc

Double Quotes
Proofread “double quotes” (straight or curly) as plain ASCII (keyboard) " (shift ') double quotes. Do not change double quotes to single quotes. Leave them as the author wrote them. See Chapter Headings if a double quote is missing at the start of a chapter.

The French equivalent, guillemets «like this», are available from the pulldown menus when editing or creating pages.

For other quotation marks, use '. This applies to languages which use marks »like this«; „like this“; “this way„

The Project Manager may instruct you in the Project Comments to proofread non-English language quotation marks differently for a particular book. Please be sure not to apply those directions to other projects.

Check for matching opening and closing quotes and close up the space between the marks and the quoted text. Some books have quotes where every new line has another quotemark—these are to be removed, leaving the beginning and ending quote, as the line spacing will change in transclusion. Check the Project Discussion for alternative directions.

Single Quotes
Proofread these as the plain ASCII (keyboard) ' single quote (apostrophe). Do not change single quotes to double quotes. Leave them as the author wrote them. Convert ‘ ’ to ' '.

Single quote, double quote
' "

Quote Marks on Each Line
Proofread quotation marks at the beginning of each line of a quotation by removing all of them except for the one at the start of the quotation. If a quotation like this goes on for multiple paragraphs, leave the quote mark that appears on the first line of each paragraph.

However, in poetry keep the extra quote marks where they appear in the image, since the line breaks will not be changed.

Often there is no closing quotation mark until the very end of the quoted section of text, which may not be on the same page you are proofreading. Leave it that way—do not add closing quotation marks that are not in the page image.

There are some language-specific exceptions. In French, for example, dialog within quotations uses a combination of different punctuation to indicate various speakers. If you are not familiar with a particular language, check the Project Discussion, or leave a message for the Project Manager in the Project Discussion for clarification.

Original Image: Clearly he wasn't an academic with a preface like this one. “I do not give the name of the play, act or scene, “in head or foot lines, in my numerous quotations from “Shakspere, designedly leaving the reader to trace and “find for himself a liberal education by studying the “wisdom of the Divine Bard.

“There are many things in this volume that the ordinary “mind will not understand, yet I only contract with the “present and future generations to give rare and rich “food for thought, and cannot undertake to furnish the “reader brains with each book!”

Correctly Proofread Text:

Clearly he wasn't an academic with a preface like this one. "I do not give the name of the play, act or scene, in head or foot lines, in my numerous quotations from Shakspere, designedly leaving the reader to trace and find for himself a liberal education by studying the wisdom of the Divine Bard.

"There are many things in this volume that the ordinary mind will not understand, yet I only contract with the present and future generations to give rare and rich food for thought, and cannot undertake to furnish the reader brains with each book!"

End-of-sentence Periods
Proofread periods that end sentences with a single space after them.

Punctuation Spacing
Spaces before punctuation sometimes appear because books typeset in the 1700's & 1800's often used partial spaces before punctuation such as a semicolon or colon.

In general, a punctuation mark should have a space after it but no space before it. If the OCR'd text has no space after a punctuation mark, add one; if there is a space before punctuation, remove it. This applies even to languages such as French that normally use spaces before punctuation characters. However, punctuation marks that normally appear in pairs, such as "quotation marks", (parentheses), [brackets], and {braces} normally have a space before the opening mark, which should be retained.

Original Image:

and so it goes ; ever and ever.

Correctly Proofread Text:

and so it goes; ever and ever.

Trailing Space at End-of-line
Do not bother inserting spaces at the ends of lines of text; any such spaces will automatically be removed from the text when you save the page. When the text is transcluded, each end-of-line will be converted into a space.

Dashes, Hyphens, and Minus Signs
There are generally four such marks you will see in books:

Hyphens.-These are used to join words together, or sometimes to join prefixes or suffixes to a word. Leave these as a single hyphen, with no spaces on either side. Note that there is a common exception to this shown in the second example below.

En-dashes.-These are just a little longer, and are used for a range of numbers, or for a mathematical minus sign. Proofread these as a single hyphen, too. Spaces before or after are determined by the way it was done in the book; usually no spaces in number ranges, usually spaces around mathematical minus signs, sometimes both sides, sometimes just before.

Em-dashes & long dashes.—These serve as separators between words—sometimes for emphasis like this—or when a speaker gets a word caught in his throat——! Proofread these as an em-dash (from the pull-down menu) if the dash is as long as 2-3 letters (an em-dash) or use bar for a custom length. looks like this. Don't leave a space before or after, even if it looks like there was a space in the original book image.

E.g.

Deliberately Omitted or Censored Words or Names. If represented by a dash in the image, proofread these as an equivalent length bar. When it represents a word, we leave appropriate space around it like it's really a word. If it's only part of a word, then no spaces—join it with the rest of the word.

longdash longdash

See also the guidelines for end-of-line and end-of-page hyphens and dashes.

End-of-line Hyphenation and Dashes
Where a hyphen appears at the end of a line, join the two halves of the hyphenated word back together. Remove the hyphen when you join it, unless it is really a hyphenated word like well-meaning. See Dashes, Hyphens, and Minus Signs for examples of each kind. If possible, keep the joined word on the top line, and put a line break after it to preserve the line formatting—this makes it easier for volunteers in later rounds. If the word is followed by punctuation, then carry that punctuation onto the top line, too.

Words like to-day and to-morrow that we don't commonly hyphenate now were often hyphenated in the old books we are working on. Leave them hyphenated the way the author did. Check the Index Discussion page for any advise if you're not sure if the author hyphenated it or not. If no consensus noted, leave the hyphen when you join the word. Leave a note on the Index discussion page of the .djvu page on which this occurs so someone can determine how the author typically wrote this word.

Similarly, if an em-dash appears at the start or end of a line of your OCR'd text, join it with the other line so that there are no spaces or line breaks around it. However, if the author used an em-dash to start or end a paragraph or a line of poetry, you should leave it as it is, without joining it to the next line. See Dashes, Hyphens, and Minus Signs for examples.

End-of-page Hyphenation and Dashes
Where a hyphen appears at the end of a page, the first part is used without the hyphen or em-dash. Use hyphanated word start hws and end hwe templates: and. If it is really a hyphenated word like well-meaning, include it in the whole word i.e. and. See Dashes, Hyphens, and Minus Signs for examples of each kind.

For example:

Original Image:

something Pat had already become accus-

Correctly Proofread Text:

something Pat had already become

To continue the above example on the next page:

Original Image:

tomed to from having to do his own family

Correctly Proofread Text:

to from having to do his own family

These templates rejoin the word when the pages are combined to produce the final e-book (transcluded). Please do not join the fragments across the pages yourself.

For Footnotes that split words over pages, see Footnotes.

Where the hyphen or em-dash should be retained as punctuation, use peh

Ellipsis "…"
The guidelines are different for English and Languages Other Than English (LOTE).

Ellipses of omission should be entered as the actual character (i.e. …, in the Symbol section of the pull-down menu) without surrounding spaces. However, note that not all strings of dots within written dialogue are ellipses of omission. In some cases, an author uses a sequence of three or more dots to indicate a pause, and in such situations there should be separate consecutive dots in order to preserve the tempo of the dialogue.

ENGLISH: An ellipsis is a character on its own, consisting of three consecutive dots. Regarding the spacing, in the middle of a sentence treat the three dots as a single word (i.e., usually a space before the 3 dots and a space after). At the end of a sentence treat the ellipsis as ending punctuation, with no space before it.

Note that there will also be an ending punctuation mark at the end of a sentence, in the case of a period there will be .+an ellipsis—4 dots total. A good hint that you're at the end of a sentence is the use of a capital letter at the start of the next word, or the presence of an ending punctuation mark (e.g., a question mark or exclamation point).

LOTE: (Languages Other Than English) Use the general rule "Follow closely the style used in the printed page." In particular, insert spaces, if there are spaces before or between the periods, and use the same number of periods as appear in the image. Sometimes the printed page is unclear; in that case, save as a Problematic page.

English examples:

Braces
brace2

spans 3 lines, facing left: r for right

To include text within the bracket,

use float right or float right e.g.

OR use a table:

Contractions
In English, remove any extra space in contractions. For example, would n't should be proofread as wouldn't and 't is as 'tis.

This was a 19th century printers' convention in which the space was retained to indicate that 'would' and 'not' were originally separate words. It is also sometimes an artifact of the OCR. Remove the extra space in either case.

Some Project Managers may specify in the Project Discussion page not to remove extra spaces in contractions, particularly in the case of books that contain slang, dialect, or poetry.

Fractions
Proofread fractions as follows:
 * a diagonal fraction bar (a virgule)—¾ is written : frac, do not use the actual fraction symbols unless specifically requested in the Project Discussion, please.
 * A horizontal fraction bar (a vinculum)—use sfrac encased in fs70 as  to fit inline.

For fractions with a numerator of one, only the denominator need be inputted: ;

Leaders
Period at the end. Remove

Problem templates
These should be used if there is a problem that you cannot fix yourself. When using one of these, also set the progress to "problematic" (blue).

Abbreviations
There are exceptions to the Primary Rule with regards to abbreviations. See the Manual of Style. Specific examples are i.e and e.g., and B.C. and other abbreviations from Latin, which are always contracted to avoid splitting across lines. Abbreviations should be SIC'd in full, except when "very common"—Manual of Style

Obscure words
Foreign words should be linked to Wiktionary using word. Common phrases may also be in Wiktionary. E.g.

To add a word to Wiktionary, go to Requested entries

Longer quotes in foreign languages may have translations in Wikiquotes and should be linked with displayed name of Author. If the author’s page exists (blue link), check your quote is already there; if not, be brave and add it or leave a note in the Project Discussion. You can get it translated at ??. If your link is red, click the link and search Wikiquote—maybe they spell it differently. To add an author, click red link and follow: q:Help:Starting_a_new_page, or leave a note in the Project Discussion.