Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
112<br />
used to po<str<strong>on</strong>g>in</str<strong>on</strong>g>t between alternate descripti<strong>on</strong>s of an <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> different databases. Their use of an<br />
<strong>on</strong>tology <strong>and</strong> a l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked-data approach thus could be used to support the cross-l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g of exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
databases.<br />
The challenges of l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g vary<str<strong>on</strong>g>in</str<strong>on</strong>g>g epigraphy databases with other types of digital classics resources<br />
such as papyrological databases is also a grow<str<strong>on</strong>g>in</str<strong>on</strong>g>g challenge for which various soluti<strong>on</strong>s have been<br />
explored by projects such as the recently completed LaQuAT (L<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> Query<str<strong>on</strong>g>in</str<strong>on</strong>g>g of Ancient<br />
Texts). 366 LaQuAT was a collaborati<strong>on</strong> between the Centre for e-Research, K<str<strong>on</strong>g>in</str<strong>on</strong>g>g’s College,<br />
L<strong>on</strong>d<strong>on</strong>, 367 <strong>and</strong> EPCC 368 at the University of Ed<str<strong>on</strong>g>in</str<strong>on</strong>g>burgh. Two recent articles by Bodard et al. (2009)<br />
<strong>and</strong> Jacks<strong>on</strong> et al. (2009) have described the goals of <strong>and</strong> technological challenges faced by this<br />
project. The LaQuAT project used OGSA-DAI, 369 an open-source distributed data-management<br />
software, to create a dem<strong>on</strong>strator that provided uniform access to different epigraphic <strong>and</strong><br />
papyrological resources. Basically the LaQuAT project sought to build a proof of c<strong>on</strong>cept that<br />
explored the possibilities of “creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g virtual data centres for the coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ated shar<str<strong>on</strong>g>in</str<strong>on</strong>g>g of such<br />
resources” <strong>and</strong> exam<str<strong>on</strong>g>in</str<strong>on</strong>g>ed how “distributed data resources can be mean<str<strong>on</strong>g>in</str<strong>on</strong>g>gfully federated <strong>and</strong> queried.”<br />
From a prelim<str<strong>on</strong>g>in</str<strong>on</strong>g>ary analysis of digital classics resources, Jacks<strong>on</strong> et al. reas<strong>on</strong>ed that a data-<str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong><br />
project would need to deal with the various complexities of annotated corpora, material <str<strong>on</strong>g>in</str<strong>on</strong>g> relati<strong>on</strong>al<br />
databases, <strong>and</strong> large numbers of XML files. Such research is of grow<str<strong>on</strong>g>in</str<strong>on</strong>g>g importance because of the<br />
large number of <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual <strong>and</strong> isolated digital resources that have been created. “In the fields of<br />
archaeology <strong>and</strong> classics al<strong>on</strong>e,” Bodard et al. (2009) expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed, “there are numerous datasets, often<br />
small <strong>and</strong> isolated, that would be of great utility if the <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> they c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed could be <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated.”<br />
The researchers found that four major issues needed to be addressed <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of potential <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong><br />
(1) the formats of resources were very diverse; (2) resources were often not very accessible (e.g.,<br />
stored <strong>on</strong> an <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual department’s or a scholar’s computers), <strong>and</strong> even data published <strong>on</strong> websites<br />
were typically not available for reuse; (3) resources were available to be used <strong>on</strong>ly <str<strong>on</strong>g>in</str<strong>on</strong>g> isolati<strong>on</strong> (e.g.,<br />
s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> databases); <strong>and</strong> (4) resources were owned by different <str<strong>on</strong>g>in</str<strong>on</strong>g>dividuals <strong>and</strong> communities<br />
with vary<str<strong>on</strong>g>in</str<strong>on</strong>g>g rights schemes. The LaQuAT project thus wanted to explore whether bridges could be<br />
built between data silos <str<strong>on</strong>g>in</str<strong>on</strong>g> order to support federated search<str<strong>on</strong>g>in</str<strong>on</strong>g>g at the least <strong>and</strong> they thus brought<br />
together experts <str<strong>on</strong>g>in</str<strong>on</strong>g> distributed data management <strong>and</strong> digital humanities.<br />
The orig<str<strong>on</strong>g>in</str<strong>on</strong>g>al plan of LaQuAT was to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k three projects: Project Volterra 370 (an Access database of<br />
Roman legal texts <strong>and</strong> metadata); Heidelberger Gesamtverzeichnis der griechischen Papyrusurkunden<br />
Ägyptens (HGV) 371 (a “database of papyrological metadata <str<strong>on</strong>g>in</str<strong>on</strong>g> relati<strong>on</strong>al <strong>and</strong> TEI-XML format” that<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>cludes <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>on</strong> 55,000 papyri <strong>and</strong> is stored <str<strong>on</strong>g>in</str<strong>on</strong>g> FileMaker Pro); <strong>and</strong> the Inscripti<strong>on</strong>s of<br />
Aphrodisias (IAph). These collecti<strong>on</strong>s span about 500 years of the Roman Empire <strong>and</strong> overlap <str<strong>on</strong>g>in</str<strong>on</strong>g> terms<br />
of places <strong>and</strong> people. While all the data sets are freely available <strong>and</strong> both the IAph <strong>and</strong> HGV<br />
collecti<strong>on</strong>s have been published as EpiDoc XML that can be downloaded under a CC Attributi<strong>on</strong><br />
License, it was the master databases of both the HGV <strong>and</strong> Volterra that were needed for this project<br />
<strong>and</strong> they had to be specifically requested (Bodard et al. 2009). Despite their <str<strong>on</strong>g>in</str<strong>on</strong>g>itial desire to support<br />
cross-database search<str<strong>on</strong>g>in</str<strong>on</strong>g>g of all three projects, they found that the challenges of <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrat<str<strong>on</strong>g>in</str<strong>on</strong>g>g the<br />
relati<strong>on</strong>al databases were so complicated that they focused <strong>on</strong> simply Volterra <strong>and</strong> HGV <str<strong>on</strong>g>in</str<strong>on</strong>g> this project.<br />
One questi<strong>on</strong> they still wished to explore was whether <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> HGV could be used to reduce<br />
366 http://www.kcl.ac.uk/iss/cerch/projects/completed/laquat.html<br />
367 http://www.kcl.ac.uk/iss/cerch<br />
368 http://www.epcc.ed.ac.uk/<br />
369 http://www.ogsadai.org.uk/<br />
370 http://www.ucl.ac.uk/history2/volterra/<br />
371 HGV is also federated as part of Trismegistos, http://aquila.papy.uni-heidelberg.de/gvzFM.html