26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

216<br />

the related MILARQ project, 649 which had the end goal of support<str<strong>on</strong>g>in</str<strong>on</strong>g>g more efficient <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong><br />

retrieval from the <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated database. This was accomplished by enhanc<str<strong>on</strong>g>in</str<strong>on</strong>g>g Jena, “an exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g, widely<br />

used, open source Semantic Web data management platform” <strong>and</strong> through the creati<strong>on</strong> of “multiple<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>dexes over the underly<str<strong>on</strong>g>in</str<strong>on</strong>g>g RDF triple store, Jena TDB, <strong>and</strong> other optimizati<strong>on</strong>s relat<str<strong>on</strong>g>in</str<strong>on</strong>g>g to filter<br />

performance.” The newly released project website also <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes a detailed step-by-step technical<br />

overview as to how the CLAROS database was created 650 <strong>and</strong> the guid<str<strong>on</strong>g>in</str<strong>on</strong>g>g pr<str<strong>on</strong>g>in</str<strong>on</strong>g>ciples used <str<strong>on</strong>g>in</str<strong>on</strong>g> its design.<br />

C<strong>on</strong>cordia<br />

The C<strong>on</strong>cordia <str<strong>on</strong>g>in</str<strong>on</strong>g>itiative 651 was established by the Center for Comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> the Humanities at K<str<strong>on</strong>g>in</str<strong>on</strong>g>g's<br />

College, L<strong>on</strong>d<strong>on</strong>, <strong>and</strong> the ISAW at New York University. It is a “a transatlantic collaborati<strong>on</strong>” that will<br />

support “dissem<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong> of key epigraphical, papyrological <strong>and</strong> geographic resources for Greek <strong>and</strong><br />

Roman culture <str<strong>on</strong>g>in</str<strong>on</strong>g> North Africa, <strong>and</strong> pilot<str<strong>on</strong>g>in</str<strong>on</strong>g>g of reusable, st<strong>and</strong>ard techniques for web-based<br />

cyber<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure.” 652 A number of major projects are participat<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> this effort, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g the Duke<br />

Data Bank of Documentary Papyri, Epigraphische Datenbank Heidelberg (EDH), Inscripti<strong>on</strong>s of<br />

Aphrodisias (2007), Inscripti<strong>on</strong>s of Roman Cyrenaica, Inscripti<strong>on</strong>s of Roman Tripolitania, <strong>and</strong><br />

Pleiades. Designed as a dem<strong>on</strong>strati<strong>on</strong> project, C<strong>on</strong>cordia will unite these digital collecti<strong>on</strong>s of<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s <strong>and</strong> papyri (that <str<strong>on</strong>g>in</str<strong>on</strong>g>clude 50,000 papyrological <strong>and</strong> 3,000 epigraphic texts) with the<br />

geographic data set of Pleiades. Some newly digitized c<strong>on</strong>tent will also be <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded, such as 950<br />

epigraphic texts. C<strong>on</strong>cordia will use basic web architecture <strong>and</strong> st<strong>and</strong>ard formats (XHTML,<br />

EpiDoc/TEI XML, <strong>and</strong> Atom+GeoRSS). Its ma<str<strong>on</strong>g>in</str<strong>on</strong>g> goal is to provide users with <strong>on</strong>e textual search<br />

across these collecti<strong>on</strong>s as well as “dynamic mapp<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> geographical correlati<strong>on</strong> for arbitrary<br />

collecti<strong>on</strong>s of humanities c<strong>on</strong>tent, hosted anywhere <strong>on</strong> the web.”<br />

This project is set to c<strong>on</strong>clude <str<strong>on</strong>g>in</str<strong>on</strong>g> 2010 <strong>and</strong> has created a project wiki that tracks deliverables, workshop<br />

“results, <strong>and</strong> other general <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>.” 653 A number of software tools have been created, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

epidoc2atom (a set XSLT sheets for “creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g web feeds from EpiDoc c<strong>on</strong>formant XML documents”),<br />

the C<strong>on</strong>cordia Matchtool, a “framework for def<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> execut<str<strong>on</strong>g>in</str<strong>on</strong>g>g rulesets to effect match<str<strong>on</strong>g>in</str<strong>on</strong>g>g of<br />

records <str<strong>on</strong>g>in</str<strong>on</strong>g> two datasets,” <strong>and</strong> C<strong>on</strong>cordia Harvester, “software for crawl<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

Atom+GeoRSS feeds.” Important deliverables that the C<strong>on</strong>cordia project also plans to create <str<strong>on</strong>g>in</str<strong>on</strong>g>clude<br />

Atom + GeoRSS web feeds for all papyri <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> collecti<strong>on</strong>s <strong>and</strong> the C<strong>on</strong>cordiaThesaurus, “a<br />

c<strong>on</strong>trolled vocabulary for express<str<strong>on</strong>g>in</str<strong>on</strong>g>g classes of relati<strong>on</strong>ships (or even asserti<strong>on</strong>s) between web-based<br />

resources <str<strong>on</strong>g>in</str<strong>on</strong>g> the c<strong>on</strong>text of Atom+GeoRSS feeds.”<br />

Digital Antiquity<br />

This project has been described <str<strong>on</strong>g>in</str<strong>on</strong>g> greater detail <str<strong>on</strong>g>in</str<strong>on</strong>g> the Archaeology subsecti<strong>on</strong>.<br />

Digital Classicist<br />

This project has been discussed <str<strong>on</strong>g>in</str<strong>on</strong>g> greater detail <str<strong>on</strong>g>in</str<strong>on</strong>g> the secti<strong>on</strong> <strong>on</strong> Open Access.<br />

eAQUA<br />

eAQUA 654 is a major German project that seeks to use NLP techniques such as text m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g to generate<br />

“structured knowledge” from ancient texts <strong>and</strong> to provide this knowledge to classicists through a<br />

649 http://code.google.com/p/vreri/wiki/MILARQ<br />

650 http://explore.clarosnet.org/XDB/ASP/clarosHome/technicalIntro.html<br />

651 http://c<strong>on</strong>cordia.atlantides.org/<br />

652 http://www.atlantides.org/trac/c<strong>on</strong>cordia/wiki/ProjectOverview<br />

653 http://www.atlantides.org/trac/c<strong>on</strong>cordia/wiki<br />

654 http://www.eaqua.net/en/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex.php

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!