Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
100<br />
text <strong>and</strong> to previous annotati<strong>on</strong>s, they can make reflecti<strong>on</strong>s <strong>on</strong> earlier work or create a new<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>terpretati<strong>on</strong>. We have designed a schema to h<strong>and</strong>le these references <strong>and</strong> to support the<br />
distributed <strong>and</strong> collaborative work with us<str<strong>on</strong>g>in</str<strong>on</strong>g>g more Annotati<strong>on</strong> Layers <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>e editi<strong>on</strong> (Bauer et<br />
al. 2008).<br />
At the same time, the authors noted that to make exact references to any po<str<strong>on</strong>g>in</str<strong>on</strong>g>t <str<strong>on</strong>g>in</str<strong>on</strong>g> the text, they needed<br />
to be able to describe the physical structure of the text. The base text layer was stored as a basic TEI<br />
document, <strong>and</strong> as the palimpsest provided an exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g physical structure with “codices, quires, leaves,<br />
sides, columns <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>es,” these were used as the primary structure for their reference system <str<strong>on</strong>g>in</str<strong>on</strong>g> order<br />
to def<str<strong>on</strong>g>in</str<strong>on</strong>g>e exact references to specific document parts. Such references were needed to support<br />
“philological process<str<strong>on</strong>g>in</str<strong>on</strong>g>g,” text annotati<strong>on</strong>, <strong>and</strong> mapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g between images <strong>and</strong> transcripti<strong>on</strong>. The authors<br />
did not discuss whether they c<strong>on</strong>sidered us<str<strong>on</strong>g>in</str<strong>on</strong>g>g Can<strong>on</strong>ical Text Service (CTS) references.<br />
Bauer et al. argued that TEI P4 <strong>and</strong> EpiDoc were less useful than the Hypereidoc model <str<strong>on</strong>g>in</str<strong>on</strong>g> the creati<strong>on</strong><br />
of philological annotati<strong>on</strong>s because they required that such annotati<strong>on</strong>s be stored <str<strong>on</strong>g>in</str<strong>on</strong>g> the form of XML<br />
tags <str<strong>on</strong>g>in</str<strong>on</strong>g>serted <str<strong>on</strong>g>in</str<strong>on</strong>g>to a document, mean<str<strong>on</strong>g>in</str<strong>on</strong>g>g that annotati<strong>on</strong>s could be embedded by philologists <strong>on</strong>ly if the<br />
tags were balanced (because of the need for a well-formed XML document). Their proposed soluti<strong>on</strong><br />
was to develop a reference system based <strong>on</strong> the physical structure of the document. “This enables the<br />
h<strong>and</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>g of any overlapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g annotati<strong>on</strong>,” Bauer et al. stated. “With this reference system miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g word<br />
<strong>and</strong> sentence boundaries can easily be described, even if <str<strong>on</strong>g>in</str<strong>on</strong>g>terpreted differently by various philologists.<br />
Punctuati<strong>on</strong>s miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g from the document can also easily be coded,” they wrote.<br />
The unique nature of the palimpsest, however, with sec<strong>on</strong>dary text written over other orig<str<strong>on</strong>g>in</str<strong>on</strong>g>al texts, led<br />
them to def<str<strong>on</strong>g>in</str<strong>on</strong>g>e the text of Archimedes <strong>and</strong> Hypereides as the “undertext” <strong>and</strong> the new texts that were<br />
written over them as the “overtext.” The page number<str<strong>on</strong>g>in</str<strong>on</strong>g>g of the “overtext” was used as the base for<br />
their reference system, <strong>and</strong> they def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed their “order<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g layer” <str<strong>on</strong>g>in</str<strong>on</strong>g>dependently from the<br />
“base text layer” <strong>and</strong> stored these data <str<strong>on</strong>g>in</str<strong>on</strong>g> an external XML file, not<str<strong>on</strong>g>in</str<strong>on</strong>g>g that philologists would not<br />
necessarily agree <strong>on</strong> a page order <strong>and</strong> might want to use their own “order<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g layer.”<br />
While the “base text layer’s” physical structure was based <strong>on</strong> the overtext, <strong>on</strong>ly pages were identified<br />
with overtext leaf <strong>and</strong> side while columns <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>es were marked us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the undertext so that the l<str<strong>on</strong>g>in</str<strong>on</strong>g>es<br />
of Hypereides text could be specifically identified.<br />
The Hypereidoc reference system supported three types of references: absolute references that po<str<strong>on</strong>g>in</str<strong>on</strong>g>t at<br />
a character positi<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the base text; <str<strong>on</strong>g>in</str<strong>on</strong>g>ternal relative references that po<str<strong>on</strong>g>in</str<strong>on</strong>g>t to a character positi<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />
text “<str<strong>on</strong>g>in</str<strong>on</strong>g>serted by a previous annotati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the same annotati<strong>on</strong> layer”; <strong>and</strong> external relative references<br />
that po<str<strong>on</strong>g>in</str<strong>on</strong>g>t to a character positi<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the text “<str<strong>on</strong>g>in</str<strong>on</strong>g>serted by an annotati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> a previous Annotati<strong>on</strong> Layer.”<br />
Several types of annotati<strong>on</strong> are supported, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g embedded <strong>and</strong> overlapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g annotati<strong>on</strong>s. They then<br />
developed a customized system that made use of the XML Po<str<strong>on</strong>g>in</str<strong>on</strong>g>ter Language (XPo<str<strong>on</strong>g>in</str<strong>on</strong>g>ter), 321 which<br />
allows <strong>on</strong>e “to po<str<strong>on</strong>g>in</str<strong>on</strong>g>t to an arbitrary po<str<strong>on</strong>g>in</str<strong>on</strong>g>t <str<strong>on</strong>g>in</str<strong>on</strong>g> an XML document.” While TEI P5 has developed<br />
specificati<strong>on</strong>s for the use of XPo<str<strong>on</strong>g>in</str<strong>on</strong>g>ter, 322 Bauer et al. criticized these guidel<str<strong>on</strong>g>in</str<strong>on</strong>g>es for th<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g about<br />
XPo<str<strong>on</strong>g>in</str<strong>on</strong>g>ter <strong>on</strong>ly as a po<str<strong>on</strong>g>in</str<strong>on</strong>g>ter to an arbitrary tag rather than as an arbitrary positi<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> a text, a feature<br />
that did not support the type of overlapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g annotati<strong>on</strong> that they needed. N<strong>on</strong>etheless, want<str<strong>on</strong>g>in</str<strong>on</strong>g>g to<br />
ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g> maximum compatibility with TEI P5, they made use of the <strong>and</strong> tags <strong>and</strong><br />
publish an additi<strong>on</strong>al “flat file format” of their publicati<strong>on</strong>s that does not make use of XPo<str<strong>on</strong>g>in</str<strong>on</strong>g>ter.<br />
321 http://www.w3.org/TR/xptr-framework/<br />
322 http://www.tei-c.org/release/doc/tei-p5-doc/en/html/SA.html#SATS