Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
15<br />
manuscript pages <strong>and</strong> the transcripti<strong>on</strong>s of the text are available for download <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e. 50 Scholars are<br />
work<str<strong>on</strong>g>in</str<strong>on</strong>g>g with digital images rather than the manuscript itself, <strong>and</strong> scholars from diverse discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es,<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g palaeography, the history of mathematics <strong>and</strong> science, <strong>and</strong> Byzant<str<strong>on</strong>g>in</str<strong>on</strong>g>e liturgy, have d<strong>on</strong>e<br />
extensive work with this palimpsest. Much of the image-process<str<strong>on</strong>g>in</str<strong>on</strong>g>g work with the palimpsest has<br />
focused <strong>on</strong> develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g algorithms to extract the text of Archimedes <str<strong>on</strong>g>in</str<strong>on</strong>g> particular from page images.<br />
Salerno et al. (2007) used pr<str<strong>on</strong>g>in</str<strong>on</strong>g>cipal comp<strong>on</strong>ent analysis (PCA) <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dependent comp<strong>on</strong>ent analysis<br />
(ICA) techniques to extract “clean maps of the primary Archimedes text, the overwritten text, <strong>and</strong> the<br />
mold pattern present <str<strong>on</strong>g>in</str<strong>on</strong>g> the pages” from 14 hyperspectral images of the Archimedes. Their goals were<br />
to provide better access to the text <strong>and</strong> to develop techniques that could be used <str<strong>on</strong>g>in</str<strong>on</strong>g> other palimpsestdigitizati<strong>on</strong><br />
projects. The authors also report that:<br />
A further aspect of the problem is to partly automate the read<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> transcripti<strong>on</strong> tasks. This<br />
cannot be <str<strong>on</strong>g>in</str<strong>on</strong>g>tended as a substituti<strong>on</strong> of the human experts <str<strong>on</strong>g>in</str<strong>on</strong>g> a task where they perform better<br />
than any presently c<strong>on</strong>ceivable numerical strategy, but as an accelerati<strong>on</strong> of the human work<br />
(Salerno et al. 2007).<br />
The importance of not replac<str<strong>on</strong>g>in</str<strong>on</strong>g>g expert scholars with systems but rather of develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g tools that assist<br />
them <str<strong>on</strong>g>in</str<strong>on</strong>g> their traditi<strong>on</strong>al tasks is a theme seen throughout the literature.<br />
Other significant work <str<strong>on</strong>g>in</str<strong>on</strong>g> the area of provid<str<strong>on</strong>g>in</str<strong>on</strong>g>g access to fragile manuscripts has been c<strong>on</strong>duced by the<br />
EDUCE (Enhanced Digital Unwrapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g for C<strong>on</strong>servati<strong>on</strong> <strong>and</strong> Educati<strong>on</strong>) Project. 51 Investigators <strong>on</strong><br />
this Nati<strong>on</strong>al Science Foundati<strong>on</strong>–funded project have been work<str<strong>on</strong>g>in</str<strong>on</strong>g>g to develop systems that support<br />
the “virtual unwrapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> visualizati<strong>on</strong> of ancient texts.” Accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to their website:<br />
The overall purpose is to capture <str<strong>on</strong>g>in</str<strong>on</strong>g> digital form fragile 3D texts, such as ancient papyrus <strong>and</strong><br />
scrolls of other materials us<str<strong>on</strong>g>in</str<strong>on</strong>g>g a custom built, portable, multi-power CT scann<str<strong>on</strong>g>in</str<strong>on</strong>g>g device <strong>and</strong><br />
then to virtually “unroll” the scroll us<str<strong>on</strong>g>in</str<strong>on</strong>g>g image algorithms, render<str<strong>on</strong>g>in</str<strong>on</strong>g>g a digital facsimile that<br />
exposes <strong>and</strong> makes legible <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s <strong>and</strong> other mark<str<strong>on</strong>g>in</str<strong>on</strong>g>gs <strong>on</strong> the artifact, all <str<strong>on</strong>g>in</str<strong>on</strong>g> a n<strong>on</strong>-<str<strong>on</strong>g>in</str<strong>on</strong>g>vasive<br />
process.<br />
Some of the EDUCE Project’s image-process<str<strong>on</strong>g>in</str<strong>on</strong>g>g techniques have been used by the Homer Multitext 52<br />
Project as described by Baumann <strong>and</strong> Seales (2009), who presented an applicati<strong>on</strong> of imageregistrati<strong>on</strong><br />
techniques, or the “process of mapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g a sensed image <str<strong>on</strong>g>in</str<strong>on</strong>g>to the coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ate system of a<br />
reference image,” to the Venetus A manuscript of the Iliad used <str<strong>on</strong>g>in</str<strong>on</strong>g> this project. The Homer Multitext<br />
Project <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded 3-D scann<str<strong>on</strong>g>in</str<strong>on</strong>g>g as part of its digitizati<strong>on</strong> strategy, but as the 3-D scann<str<strong>on</strong>g>in</str<strong>on</strong>g>g system<br />
acquired un-textured 3-D models a “procedure to register the 2D photography to the 3D scans was<br />
performed periodically.” Dur<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong>e photography sessi<strong>on</strong> it was discovered that technical issues had<br />
produced a number of images of poor quality. While these images were reshot, time c<strong>on</strong>stra<str<strong>on</strong>g>in</str<strong>on</strong>g>ts<br />
prevented perform<str<strong>on</strong>g>in</str<strong>on</strong>g>g the 3-D geometry capture for these pages aga<str<strong>on</strong>g>in</str<strong>on</strong>g>. The result was a number of<br />
folios that had two sets of data—a “dirty” image that had registered 3-D geometry <strong>and</strong> a “clean” image<br />
with no associated geometry—to which the project wished to apply digital flatten<str<strong>on</strong>g>in</str<strong>on</strong>g>g algorithms. The<br />
ma<str<strong>on</strong>g>in</str<strong>on</strong>g> computati<strong>on</strong>al problem was thus to determ<str<strong>on</strong>g>in</str<strong>on</strong>g>e a means of obta<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g a “high-quality deformati<strong>on</strong><br />
of the ‘clean image’ such that the text was <str<strong>on</strong>g>in</str<strong>on</strong>g> the same positi<strong>on</strong> as the ‘dirty image’” that would then<br />
allow them to “apply digital flatten<str<strong>on</strong>g>in</str<strong>on</strong>g>g us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the acquired corresp<strong>on</strong>d<str<strong>on</strong>g>in</str<strong>on</strong>g>g 3D geometry.”<br />
50 http://archimedespalimpsest.net/<br />
51 http://www.stoa.org/educe/<br />
52 http://chs.harvard.edu/wa/pageRtn=ArticleWrapper&bdc=12&mn=1169