Wikisource:Scan Lab



 



Participants
Add your name to Module:Mass notification/groups/Scan Lab to be notified via. Also add your name below with details of any particular tasks you can help with.

Jane Austen Juvenilia Volume 2 and 3
The scans of the manuscripts of Austen's Juvenelia are available on here and here. They're both in the PD, but I have absolutely no clue as how to download them. The images are higher resolution than the ones on the BL website, but they're in the zoomify flash format. Languageseeker (talk) 02:58, 2 February 2022 (UTC)
 * Languageseeker: I know Volume the Second is in the public domain; it’s already been transcribed here. Are we sure that Volume the Third is in the public domain? It could easily fall into a copyright trap, so I just want to make sure. TE(æ)A,ea. (talk) 22:59, 8 February 2022 (UTC)
 * @TE(æ)A,ea. The British Library has it listed as "Public Domain in most countries other than the UK." Languageseeker (talk) 23:07, 8 February 2022 (UTC)
 * So it looks like it was definitely published in 1951 (which would imply copyright expiry in 2001 in the UK as 50 years after publication), which makes the UK copyright claim weird. If true that would postdate the URAA date ... MarkLSteadman (talk) 00:06, 9 February 2022 (UTC)
 * That is volume 3 (Evelyn and Kitty the Bower). Volume 1 was published in 1933 (so it was in the PD on the URAA date). MarkLSteadman (talk) 00:19, 9 February 2022 (UTC)

Mooresville, Indiana High School yearbooks, 1914–1930
These scans exist in the form of galleries on the Mooresville High School Alumni Association's Facebook page, and extracting them by hand is tedious enough that I'm hoping someone can do it with a bot. The procedure I have in mind is:
 * Go to the Facebook galleries, which are more conveniently linked at https://mooresvillelib.org/mooresville-high-school-digitized-yearbooks/
 * Except the 1920 yearbook, because there are two albums for it and the one linked on the library website is missing a page. For 1920, use this album: https://www.facebook.com/media/set/?set=a.2615532255167809&type=3
 * Extract the images.
 * Upload the images to Commons, numbered sequentially, to allow for for later image extraction (and scan repair if needed; it turned out their scan of the 1909 yearbook was missing some pages).
 * Combine the images into a PDF or DJVU and upload that to Commons as well.
 * Use commons:Category:Mooresville High School Yearbook, 1911 as a model for categorization, metadata, license tags and naming convention.
 * Main file: File:The Levenite, Mooresville High School, 1911.pdf; page image: File:The Levenite, Mooresville High School, 1911 01.jpg
 * The 1930 yearbook should be tagged.
 * Note that the yearbooks aren't all called "The Levenite"; they changed the title each year. This matters for filling out the book template anyway, so I will also request that the title be used in the file name.

Thanks! —CalendulaAsteraceae (talk • contribs) 01:59, 22 September 2023 (UTC)

Penny Cyclopedia volumes 1 to 27
The IA scans currently linked on the page are unusable (blank pages where there should be content), so I checked HathiTrust ([here's the search I used]). There are four complete sets of scans attached to [this record] (ignoring the supplements for now), but I'm not sure at the moment which ones would be the best to import. Arcorann (talk) 02:14, 24 December 2023 (UTC)


 * I've found pretty good scans of volumes 4 and 24 which are already on Commons, and I've added the links to the Penny Cyclopedia page. I don't have a Hathi Trust account, so I can't help you there. Ciridae (talk) 05:21, 27 December 2023 (UTC)

Journal of the Optical Society of America
Volumes 1-40 of this fairly esteemed journal are out of copyright. Vol. 30, issue 12 and Vol. 33, issue 7 are here already, but there are *a lot* that are not here: https://archive.org/details/pub_optical-society-of-america-journal If you upload them, I can tidy the pile up at commons and get them ready to go here. For copyright concerns: https://onlinebooks.library.upenn.edu/webbin/cinfo/jopticalsocamerica --RaboKarbakian (talk) 20:43, 8 February 2024 (UTC)

The Singing Bone (Freeman)
would someone with access please download the scan from HathiTrust? Thanks! —Beleg Âlt BT (talk) 17:34, 22 April 2024 (UTC)
 * Beleg Âlt: Here you go: File:The Singing Bone.pdf. TE(æ)A,ea. (talk) 22:46, 17 May 2024 (UTC)

The Coming of Cassidy
would someone with access please download the scan from HathiTrust? Thanks! —Beleg Âlt BT (talk) 16:31, 6 May 2024 (UTC)


 * -- c:File:The Coming of Cassidy and the Others - Clarence E. Mulford.pdf -- Hrishikes (talk) 14:20, 4 July 2024 (UTC)

A Dictionary of Hymnology
Per this discussion, I'd like to replace the current PDF scan with a DJVU scan. However, both IA-Upload and Any2Djvu are stalling out on me, and pdf2djvu.com gave pretty shoddy results. Could someone please upload this scan as a new version of File:Dictionary of Hymnology 1908.djvu? Thanks!
 * The IAA-Upload failure looks to be: https://phabricator.wikimedia.org/T215647 caused by the number of pages. MarkLSteadman (talk) 03:34, 4 July 2024 (UTC)
 * -- c:File:A Dictionary of Hymnology - John Julian.djvu -- Hrishikes (talk) 16:38, 5 July 2024 (UTC)
 * @Hrishikes you're the best!! —Beleg Tâl (talk) 17:59, 5 July 2024 (UTC)

The Criterion Volume 2 and 3
Would it be possible to locate Volumes 2 and 3 of The Criterion? I'm especially trying to complete The Woman Who Rode Away that began in Volume 3. Languageseeker (talk) 18:36, 23 December 2022 (UTC)
 * Languageseeker: Volume 2 is available here. I can’t find volume 3; however, it is available on microfilm. TE(æ)A,ea. (talk) 22:05, 26 December 2022 (UTC)
 * Volume 2 now at Index:The Criterion - Volume 2.djvu (and v1 replaced too, it was previously a reprint). Inductiveload— talk/contribs 15:54, 29 December 2022 (UTC)
 * @TE(æ)A,ea. @Inductiveload Thank you both for these. Hopefully, Volume 3 turns up at some point. Soon we can also look for Volume 5. Hooray for PD day! Languageseeker (talk) 17:35, 29 December 2022 (UTC)
 * Languageseeker: The Criterion, volume 3, is available here. Some extra work will need to be done: the pages are two-to-one (scan), the 102–103 spread is duplicated, the 340–341 spread is duplicated, and the first page of the index (spread) is duplicated. I still have the reel, so if you need anything from this volume or from volumes 1 or 2 of The Criterion or volume 4 of The New Criterion, I can go through the reel. TE(æ)A,ea. (talk) 16:19, 13 January 2023 (UTC)

Index:Kaempfer History of Japan 1727 vol 1 (IA historyofjapangi01kaem).pdf
This scan is missing two pages (xxvi–xxvii). Also, it would be nice if the images for this volume and the second volume could be regenerated, as they are of quite poor quality. TE(æ)A,ea. (talk) 22:21, 3 December 2023 (UTC)

Index:Alumni Oxoniensis (1715-1886) volume 2.djvu
Pages 482 and 483 of this volume were missing in the original scan; pageholders have been introduced, so all that is necessary is the replacement. That replacement can come from Index:Alumnioxonienses02univ.pdf, which exists solely for the purpose of supplying that gap. So, the missing pages from the PDF should be added in over the pageholders from the DJVU; the transclusion fixed; and the PDF deleted. TE(æ)A,ea. (talk) 23:46, 3 December 2023 (UTC)


 * Not sure I follow, pages 482 and 483 (djvu/99 and djvu/100) seem to be legit images and the 2 missing pages should be inserted between djvu/100 and djvu/101. Or ...? Mpaa (talk) 18:09, 4 December 2023 (UTC)

Index:The Atlantic Monthly Volume 135.djvu
This file claims to be Volume 135 and is residing in the list of volumes as Volume 135 but it is actually Volume 136, probably (but not verified) a duplicate of Index:The Atlantic Monthly Volume 135.djvu. Can the file be replaced with https://babel.hathitrust.org/cgi/pt?id=uc1.32106019602660 ?--RaboKarbakian (talk) 15:47, 29 March 2024 (UTC)


 * Also, while you are at it:
 * https://babel.hathitrust.org/cgi/pt?id=mdp.39015030146099 Vol. 139
 * https://babel.hathitrust.org/cgi/pt?id=mdp.39015030146081 Vol. 140
 * https://babel.hathitrust.org/cgi/pt?id=mdp.39015030145968 Vol. 141
 * https://babel.hathitrust.org/cgi/pt?id=mdp.39015030145745 Vol. 142
 * --RaboKarbakian (talk)

Index:A dictionary of the language of Mota.djvu
File was renamed at Commons, and needs re-aligning.

https://en.wikisource.org/w/index.php?search=intitle%3A%2FA+dictionary+of+the+language+of+Mota.djvu%2F&title=Special:Search&profile=advanced&fulltext=1&ns0=1&ns100=1&ns102=1&ns104=1&ns106=1&ns114=1 ShakespeareFan00 (talk) 17:40, 1 May 2024 (UTC)

Index:Sm all cc.pdf
A bit of a different one this time. This work contains several copyrighted images that need to be blanked out in the scan. The affected pages are listed here: Index talk:Sm all cc.pdf. —Beleg Tâl (talk) 15:17, 14 May 2024 (UTC)
 * Beleg Tâl: I’ve redacted all of the images marked as copyrighted in the text from the talk page and re-uploaded the file. The images should be gone once you clear the caches. TE(æ)A,ea. (talk) 17:18, 17 May 2024 (UTC)
 * @Beleg Tâl (CC: @Beleg Âlt): Is this resolved? Xover (talk) 16:38, 1 June 2024 (UTC)
 * Hmm, the mediawiki software doesn't want to load the page images for me, but I trust TE(æ) to have done a good job :) —Beleg Tâl (talk) 13:17, 3 June 2024 (UTC)
 * @TE(æ)A,ea.: It looks like the new PDF you uploaded makes MediaWiki choke. Could you try to regenerate it using different tools or options? Xover (talk) 09:18, 4 June 2024 (UTC)
 * Xover: MediaWiki (maybe just Commons?) has been causing me a lot of problems lately in terms of PDFs, so many times I’m not sure if any of the files work. TE(æ)A,ea. (talk) 00:17, 16 June 2024 (UTC)

Index:Anthology of Japanese Literature.pdf
The pages here are offset when they are loaded. The PDF is correct, the text layer is correct, and if you call OCR the right pages are referenced; however, the wrong pages show us visually. I don’t where this problem originates. TE(æ)A,ea. (talk) 00:16, 16 June 2024 (UTC)


 * @TE(æ)A,ea. it seems ok to me. Mpaa (talk) 20:15, 6 July 2024 (UTC)
 * Mpaa: Yes, this has been fixed. Feel free to close this request. TE(æ)A,ea. (talk) 18:40, 8 July 2024 (UTC)