Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
63<br />
scholars, will <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate archival materials with computati<strong>on</strong>al l<str<strong>on</strong>g>in</str<strong>on</strong>g>guistics <strong>and</strong> other text-m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
capabilities (Harley et al. 2010, 118).<br />
These themes of “next-generati<strong>on</strong>” digital archives <strong>and</strong> the need to comb<str<strong>on</strong>g>in</str<strong>on</strong>g>e them with sophisticated<br />
language technologies are revisited <str<strong>on</strong>g>in</str<strong>on</strong>g> later secti<strong>on</strong>s of this report.<br />
To c<strong>on</strong>clude this secti<strong>on</strong> we describe <strong>on</strong>e major technology project that seeks to support the encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
of historical <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> wherever it is found, namely, the HEML, Historical Event <strong>and</strong> Markup<br />
L<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g Project (Roberts<strong>on</strong> 2009), which seeks to provide markup st<strong>and</strong>ards <strong>and</strong> tools with which to<br />
encode historical <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>on</strong> the web. In a state of evoluti<strong>on</strong> s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce 2001, HEML now has an RDF<br />
data model that allows it to represent nested events <strong>and</strong> relati<strong>on</strong>s of causality between events. The<br />
HEML data format supports collecti<strong>on</strong>s of events where each is tagged with heml:Event, which at their<br />
simplest are bound to mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e-readable spans of time <strong>and</strong> references to evidence. Other important<br />
features of the model <str<strong>on</strong>g>in</str<strong>on</strong>g>clude the assignment of a URI 185 to each entity <strong>and</strong> the utilizati<strong>on</strong> of an<br />
evidence element:<br />
Pers<strong>on</strong>s, roles, locati<strong>on</strong>s, <strong>and</strong> keywords are assigned m<strong>and</strong>atory URIs so that they may be<br />
referred to <str<strong>on</strong>g>in</str<strong>on</strong>g> multiple events. F<str<strong>on</strong>g>in</str<strong>on</strong>g>ally, <strong>on</strong>e or more heml:Evidence elements must be attributed<br />
to each event, <strong>and</strong> with<str<strong>on</strong>g>in</str<strong>on</strong>g> these there is a means by which different editi<strong>on</strong>s <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>guistic<br />
representati<strong>on</strong>s of the same text may be grouped together for the researcher's benefit<br />
(Roberts<strong>on</strong> 2009).<br />
These encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g choices illustrate the importance of us<str<strong>on</strong>g>in</str<strong>on</strong>g>g unique URIs to identify specific entities so<br />
that they can not <strong>on</strong>ly be referred to <str<strong>on</strong>g>in</str<strong>on</strong>g> multiple encoded historical events but also be reused as “l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked<br />
data” 186 by other applicati<strong>on</strong>s. The ability to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k encoded events to attestati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> primary texts is also<br />
of critical importance. One criticism often made of HEML, Roberts<strong>on</strong> stated, is that “it is not possible<br />
to encode the variati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> op<str<strong>on</strong>g>in</str<strong>on</strong>g>i<strong>on</strong>s regard<str<strong>on</strong>g>in</str<strong>on</strong>g>g historical events.” The ability to encode multiple<br />
scholarly op<str<strong>on</strong>g>in</str<strong>on</strong>g>i<strong>on</strong>s <strong>and</strong> to <str<strong>on</strong>g>in</str<strong>on</strong>g>dicate the uncerta<str<strong>on</strong>g>in</str<strong>on</strong>g>ty of knowledge regard<str<strong>on</strong>g>in</str<strong>on</strong>g>g dates or other <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong><br />
are important features of any markup for historical texts. Roberts<strong>on</strong> does argue, however, that URIs<br />
could eventually be created for scholars, <strong>and</strong> specific encoded arguments could be l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked to those<br />
URIs.<br />
Classical Archaeology<br />
Overview<br />
Archaeology is the study of the material rema<str<strong>on</strong>g>in</str<strong>on</strong>g>s <strong>and</strong> envir<strong>on</strong>mental effects of human behavior<br />
throughout prehistory to the modern era. Scholarship <str<strong>on</strong>g>in</str<strong>on</strong>g> archaeology is divided <str<strong>on</strong>g>in</str<strong>on</strong>g>to a large number of<br />
subdiscipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es, many def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed geographically (e.g., North America, Egypt, Near East, Oceania) <strong>and</strong>/or<br />
by time period (e.g., Paleolithic, Neolithic, Classical). A moderately sized field, archaeology overlaps<br />
with a range of other scholarly discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g biological anthropology, ethnobotany,<br />
paleozoology, geology, <strong>and</strong> classics (<str<strong>on</strong>g>in</str<strong>on</strong>g> particular, palaeography, philology, papyrology, epigraphy,<br />
numismatics, history of the ancient world, Hellenic literature, <strong>and</strong> art <strong>and</strong> architectural history)<br />
(Harley et al. 2010, 30).<br />
185 A URI has been def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as a “compact sequence of characters that identifies an abstract or physical resource.” For more <strong>on</strong> the syntax <strong>and</strong> architecture<br />
of URIs, see http://labs.apache.org/webarch/uri/rfc/rfc3986.html - overview<br />
186 Tim Berners-Lee has described l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked data as essential to the creati<strong>on</strong> of the Semantic Web, <strong>and</strong> the creati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked data must follow four essential<br />
rules: 1. “Use URIs as names for th<str<strong>on</strong>g>in</str<strong>on</strong>g>gs,” 2.“Use HTTP URIs so that people can look up those names,” 3. Use st<strong>and</strong>ards such as RDF <strong>and</strong> SPARQL so that<br />
when some<strong>on</strong>e looks up a URI it provides useful <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>, <strong>and</strong> 4. Include l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks to other URIs to support further discovery.<br />
http://www.w3.org/DesignIssues/L<str<strong>on</strong>g>in</str<strong>on</strong>g>kedData.html