Wikisource:Scan Lab/Archives/2021-10

Index:The Dial (Volume 75).pdf
Missing pages available at Languageseeker (talk) 05:13, 8 October 2021 (UTC)


 * ✅ (rederived from a different scan, realised that was also missing pages and generally made a mess in the process). I think it's sorted now: Index:The_Dial_(Volume_75).djvu.
 * We might as well use the DJVU since if we rip the rest of the bunch, they'll come in as DJVUs too, most likely. Inductiveload— talk/contribs 16:16, 8 October 2021 (UTC)


 * This section was archived on a request by: Inductiveload— talk/contribs 16:16, 8 October 2021 (UTC)

The fables of Aesop, as first printed by William Caxton in 1484
Greetings, everyone. This scan is missing two pages after page 26: the first one is the end of the Preface, the second one is a chart. They can be found in here and also. Unfortunately, the file already has proofreading. Can something be done? thanks in advance. — Genesis Bustamante  (talk) 17:23, 9 October 2021 (UTC)


 * @Genoskill ✅. Existing pages shifted to match the new scan. ^_^ Inductiveload— talk/contribs 19:56, 9 October 2021 (UTC)


 * Thank you, very much! — Genesis Bustamante  (talk) 20:31, 9 October 2021 (UTC)
 * : This section was archived on a request by: —  Genesis Bustamante  (talk) 20:31, 9 October 2021 (UTC)

Two requests from MarkLSteadman

 * From HathiTrust Author: Samuel Johnson (1846–1901), Country: Nigeria, Date: 1921, suggested file name: "Samuel Johnson - The History of the Yorubas (1921)" MarkLSteadman (talk) 12:42, 12 October 2021 (UTC)
 * From HathiTrust Author: John Atkinson Hobson (1858–1940), Country: UK, Date: 1902, suggested file name "John Atkinson Hobson - Imperialism - 1902" MarkLSteadman (talk) 12:42, 12 October 2021 (UTC)
 * Inductiveload— talk/contribs 16:40, 12 October 2021 (UTC)
 * @MarkLSteadman ✅ Ding! Cooked! Index:Samuel Johnson - The History of the Yorubas (1921).djvu and Index:John Atkinson Hobson - Imperialism - 1902.djvu. As always please check/correct details and the pagelists are just a best-effort conversion. Inductiveload— talk/contribs 20:53, 12 October 2021 (UTC)
 * Thank you very much! MarkLSteadman (talk) 15:59, 21 October 2021 (UTC)
 * This section was archived on a request by: --Xover (talk) 15:19, 17 October 2021 (UTC)

The Origin of Continents and Oceans (1924)

 * From HathiTrust  Author: Author:Alfred Wegener (1880-1930), Country: London, Date: 1924, suggested file name: "The Origin of Continents and Oceans (1924)"; Also, needs to localized. Languageseeker (talk) 03:01, 14 October 2021 (UTC)
 * Index:The origin of continents and oceans - Wegener, tr. Skerl - 1924.djvu. Inductiveload— talk/contribs 16:51, 17 October 2021 (UTC)


 * This section was archived on a request by: Inductiveload— talk/contribs 08:55, 22 October 2021 (UTC)

creation of djvu from scans?
There wasn't a heading for my question, so I decided "having page scans" is close.

If page scans of a book are supplied (perhaps in PNG format or TIF) can those be turned into DJVU here? And if so, what is the ideal format and if that is not available, what would be a second (and maybe third) choice (png, tiff, xcf?)

Also, I just read the instructions and am sorry that I have not closed any of my previous requests....--RaboKarbakian (talk) 19:02, 8 October 2021 (UTC)


 * @RaboKarbakian: Yes. We can build a DjVu file with an OCR text layer from scan images. Images can pretty much be any file format and we'll figure it out, but it's important that the images are as high-resolution as possible. It is far better to get whatever format your scanning process produced than to have them re-encoded to a different format after the fact (every re-encoding looses quality, and we'll have to re-encode at least once for the DjVu). If your scanning software offers you options, TIFF, JPEG, and PNG are all good options for format, just so long as the encoding / quality settings are reasonable. For example: JPEGs at "80%" quality are often plenty good enough, and higher values have rapidly diminishing returns in most cases. XCF is not a good format for this purpose.If you're scanning yourself you should also either make sure the images are cropped just inside the page border (no black edge around the page, and the paper edge should just be cropped out, but the gutter retained), or you should make sure the page is in the same position within the image for every page. We can bulk crop a series of images, but in almost all cases that requires giving a set of fixed pixel coordinates to apply to each image in the series. If using a camera (vs. a flatbed) also try to make each page as flat as possible, since OCR has trouble with text that isn't on a straight line. Xover (talk) 06:25, 9 October 2021 (UTC)
 * commons:Category:Goblin Market (Rackham) 45 files. TIFF, with lwz. 600ppi. And the text is straight even if the page edges are not.  I did not want to hurt the book and I was confused about the size of the scanner glass and the scan area so, many of the edges are not really there and some were reconstructed.  Various sizes but within a certain error which I have not calculated.  The TIFF are as large or larger than the PNG but they were made in a fraction of the time it took to make PNG, so TIFF it is.  The scans were to PNG though.  I have been dealing with some artifacts on another project and did not want to have them in this one also.  libtiff requires libjpeg to build....


 * At one time, I had a script that would download a whole category of images -- sure wish I had that now to give to you. Let me know if you would like me to rewrite that.... And be quick with the ping if you need anything else!!  I surely thank you greatly for this--RaboKarbakian (talk) 04:33, 10 October 2021 (UTC)
 * @RaboKarbakian: According to enwp, Goblin Market illustrated by Arthur Rackham was first published in 1933 in London. Rackham died in 1939, so his pma. 70 UK term of copyright didn't expire until 2010. Being in copyright in its country of origin on the URAA date (January 1, 1996 for the UK) its US copyright would have been restored to a pub. + 95 year term, which will not expire until 2029. In other words, going by these the scan cannot be hosted either here or on Commons until 2029. Xover (talk) 14:00, 18 October 2021 (UTC)
 * Thanks for getting back with this. PseudoSkull mentioned something about "being printed the in the same two weeks" and that being difficult to prove.  I had an idea (about efficiency) that this had been printed at the same time, in Edinburgh, just for efficiency and costliness of setting up the printer a second time.  My foray in the "difficult to prove part"; it is also difficult to disprove....--RaboKarbakian (talk) 14:14, 18 October 2021 (UTC)
 * @RaboKarbakian: If a work was published in the US within 30 days of being published in another country, it is ineligible for restoration under the URAA. If that is the case its US copyright status will depend on whether it met all the formal requirements for copyright protection that were in effect at the time of publication. Simplifying down to rule-of-thumb level, that means it had to have a visible copyright notice and a renewal filed with the copyright office in the 28th year after first publication.Proving publication within 30 days is hard, but you can often get a reasonable level of certainty by looking for advertisments, books received, or reviews that pinpoint the publication in the respective countries. Xover (talk) 14:52, 18 October 2021 (UTC)
 * I'm going to close this and have those tif deleted. Just this one question tho': Is this where to put other similar (with the exception of being provably legal) requests?--RaboKarbakian (talk) 16:34, 20 October 2021 (UTC)
 * @RaboKarbakian: Yes, indeed. The Scan Lab was set up specifically to ask for various kinds of help with scans. Xover (talk) 05:44, 22 October 2021 (UTC)
 * This section was archived on a request by: RaboKarbakian (talk) 13:43, 21 October 2021 (UTC)

Index:Spencer - The Shepheardes Calender, conteining twelue æglogues proportionable to the twelue monethes, 1586.djvu
This is missing two pages that I know of. Two pages that should go between page 27 and 28. While not the same book, the missing pages match perfectly from https://archive.org/details/shepheardscalend00spenc The missing pages are:
 * for page 28 and
 * for page 29.

I don't know about the rest of the scan; it is like another language and the missing pages had an image on them which was a big clue that there was a problem. If you would prefer to just upload the other text, I would be perfectly happy making whatever changes to make the already proofed text match. --RaboKarbakian (talk) 00:29, 13 October 2021 (UTC)


 * ✅ and pages shifted. No changes should be needed (other than proofreading the pages) unless there are transclusions I haven't seen. Inductiveload— talk/contribs 20:55, 19 October 2021 (UTC)
 * Thanks!! --RaboKarbakian (talk) 14:00, 20 October 2021 (UTC)
 * This section was archived on a request by: --Xover (talk) 05:45, 22 October 2021 (UTC)

File:Draft Constitution of the Republic of the United States of Indonesia.pdf
The pages in this scan are doubled up, could they be split, please? TE(æ)A,ea. (talk) 19:39, 19 October 2021 (UTC)
 * Inductiveload— talk/contribs 19:43, 19 October 2021 (UTC)
 * ✅: Split, dewarped and OCR'd: Index:Draft Constitution of the Republic of the United States of Indonesia.djvu. Inductiveload— talk/contribs 20:41, 19 October 2021 (UTC)
 * Thank you. TE(æ)A,ea. (talk) 15:43, 21 October 2021 (UTC)
 * This section was archived on a request by: --Xover (talk) 05:46, 22 October 2021 (UTC)

File:Code Revision Commission v. Public.Resource.Org, Inc. (F.Supp.3d).pdf and File:Code Revision Commission v. Public.Resource.Org, Inc. (F.3d).pdf

 * This section was archived on a request by: Inductiveload— talk/contribs 10:45, 12 November 2021 (UTC)

File:COMBAT1.tif, File:COMBAT2.tif, and File:COMBAT3.tif
Please collate these scans into one file, preferable not a multi-page TIF. In addition, please remove the last page of the last scan, as that was made to show the text on that page. (Also, rotate the actual last page of the last scan; that was rotated incorrectly, for some reason.) TE(æ)A,ea. (talk) 20:15, 21 October 2021 (UTC)


 * ✅ see File:COMBAT.djvu. Mpaa (talk) 21:01, 21 October 2021 (UTC)
 * Thank you! TE(æ)A,ea. (talk) 23:27, 21 October 2021 (UTC)
 * This section was archived on a request by: --Xover (talk) 05:47, 22 October 2021 (UTC)

File:The collected works of Henrik Ibsen (Volume 5).djvu

 * This section was archived on a request by: Inductiveload— talk/contribs 09:18, 12 November 2021 (UTC)

Scan Repair for The Elizabethan stage (Volume 1).pdf

 * This section was archived on a request by: Inductiveload— talk/contribs 10:45, 12 November 2021 (UTC)

File:On the border with Crook (IA onborderwithcroo00bourrich).pdf
After Page:On the border with Crook (IA onborderwithcroo00bourrich).pdf/227, pages 196 and 197 need to be inserted from ]. Many Thanks! Languageseeker (talk) 22:11, 25 October 2021 (UTC)


 * Durrr you did this. Sorry! Inductiveload— talk/contribs 19:13, 11 November 2021 (UTC)
 * Could you also insert pg 490 and 491 after Page:On the border with Crook - Bourke - 1892.djvu/527 from . Sorry, did not notice earlier.Languageseeker (talk) 00:27, 12 November 2021 (UTC)
 * ✅ Inductiveload— talk/contribs 07:19, 16 November 2021 (UTC)
 * I think that you might have forgotten to upload the file to Commons. :) Languageseeker (talk) 14:55, 16 November 2021 (UTC)
 * Odd, I must have closed the tab or something. Anyhoo, it's there now. Inductiveload— talk/contribs 16:41, 16 November 2021 (UTC)
 * Thank you! Languageseeker (talk) 11:26, 23 November 2021 (UTC)
 * This section was archived on a request by: Languageseeker (talk) 11:26, 23 November 2021 (UTC)
 * This section was archived on a request by: Languageseeker (talk) 11:26, 23 November 2021 (UTC)

File:Travels through the states of North America, and the provinces of Upper and Lower Canada, during the years 1795, 1796, and 1797 (IA travelsthroughst01weld).pdf
After Page:Travels through the states of North America, and the provinces of Upper and Lower Canada, during the years 1795, 1796, and 1797 (IA travelsthroughst01weld).pdf/25, pages xviii and xix need to be inserted from ]. Many Thanks! Languageseeker (talk) 22:11, 25 October 2021 (UTC)


 * ✅: Index:Travels through the states of North America - Weld - 1799 - Volume 1.djvu and Index:Travels through the states of North America - Weld - 1799 - Volume 2.djvu Inductiveload— talk/contribs 16:59, 16 November 2021 (UTC)


 * This section was archived on a request by: Inductiveload— talk/contribs 08:48, 3 December 2021 (UTC)