26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

147<br />

comp<strong>on</strong>ent. While C<strong>on</strong>cordia <strong>and</strong> LaQuAT seek to <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrate papyri collecti<strong>on</strong>s with other digital<br />

classical resources, such as epigraphical databases, <str<strong>on</strong>g>in</str<strong>on</strong>g>to larger “virtual” collecti<strong>on</strong>s that can be<br />

simultaneously searched, eAQUA <strong>and</strong> eSAD are develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g technologies to assist papyrologists <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>terpretati<strong>on</strong> of their ancient texts.<br />

Focused exclusively <strong>on</strong> papyri collecti<strong>on</strong>s, the IDP project (Sos<str<strong>on</strong>g>in</str<strong>on</strong>g> et al. 2007, Sos<str<strong>on</strong>g>in</str<strong>on</strong>g> et al. 2008), 497<br />

which is a jo<str<strong>on</strong>g>in</str<strong>on</strong>g>t effort of the oldest digital resource <str<strong>on</strong>g>in</str<strong>on</strong>g> papyrology the DDbDP, the HGV, <strong>and</strong> the APIS,<br />

is work<str<strong>on</strong>g>in</str<strong>on</strong>g>g to create a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle <str<strong>on</strong>g>in</str<strong>on</strong>g>terface to these three collecti<strong>on</strong>s, a project that has largely been realized<br />

through the creati<strong>on</strong> of the Papyrological Navigator (PN). 498 Active research <strong>on</strong> improv<str<strong>on</strong>g>in</str<strong>on</strong>g>g the PN is<br />

<strong>on</strong>go<str<strong>on</strong>g>in</str<strong>on</strong>g>g, as illustrated by a recent blog post by Hugh Cayless (Cayless 2010c). One particular<br />

comp<strong>on</strong>ent of the PN that he has recently improved is a service that provides “lookup of identifiers” of<br />

papyri <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>e collecti<strong>on</strong> <strong>and</strong> “correlates them with related records <str<strong>on</strong>g>in</str<strong>on</strong>g> other collecti<strong>on</strong>s.” While this<br />

service was orig<str<strong>on</strong>g>in</str<strong>on</strong>g>ally based <strong>on</strong> a Lucene-based numbers server, Cayless is work<str<strong>on</strong>g>in</str<strong>on</strong>g>g to replace it with a<br />

RDF triplestore. One particular challenge is that of data <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrati<strong>on</strong> <strong>and</strong> the difficulties of model<str<strong>on</strong>g>in</str<strong>on</strong>g>g the<br />

relati<strong>on</strong>ships between the same items <str<strong>on</strong>g>in</str<strong>on</strong>g> different databases. The complicated nature of these<br />

relati<strong>on</strong>ships <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes several dimensi<strong>on</strong>s, such as different levels of hierarchy <str<strong>on</strong>g>in</str<strong>on</strong>g> database structures<br />

<strong>and</strong> various FRBR type relati<strong>on</strong>ships (e.g., the ancient document is the work but then it has various<br />

expressi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> different pr<str<strong>on</strong>g>in</str<strong>on</strong>g>ted editi<strong>on</strong>s (<str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g translati<strong>on</strong>s), <strong>and</strong> each of those editi<strong>on</strong>s has various<br />

manifestati<strong>on</strong>s (HTML, EpiDoc transcripti<strong>on</strong>s, etc.). In additi<strong>on</strong>, while the papyrological items <strong>and</strong><br />

their metadata <str<strong>on</strong>g>in</str<strong>on</strong>g> different databases can sometimes have a 1:1 relati<strong>on</strong>ship (such as is usually the case<br />

between the DDbDP <strong>and</strong> the HGV) there can also be overlap (such as between the APIS <strong>and</strong> the other<br />

two databases). Each database also has complicated <str<strong>on</strong>g>in</str<strong>on</strong>g>ternal relati<strong>on</strong>ships; for example, although the<br />

HGV utilizes the idea of a “pr<str<strong>on</strong>g>in</str<strong>on</strong>g>cipal editi<strong>on</strong>” <strong>and</strong> chooses a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle can<strong>on</strong>ical publicati<strong>on</strong> of a papyrus,<br />

it also <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes other earlier publicati<strong>on</strong>s of the same papyrus <str<strong>on</strong>g>in</str<strong>on</strong>g> its metadata. The DDbDP follows the<br />

same basic idea but creates a new record that l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks to stub records for the older editi<strong>on</strong>s of each<br />

papyrus.<br />

To better represent the complexity of these relati<strong>on</strong>ships, Cayless graphed them <str<strong>on</strong>g>in</str<strong>on</strong>g> Mulgara 499 (a<br />

scalable RDF database that is based <strong>on</strong> Java), so that he could use SPARQL queries to fetch data <strong>and</strong><br />

then map these to easily retrievable <strong>and</strong> citable URLs that follow a st<strong>and</strong>ard pattern. Results from<br />

SPARQL queries will also be made available as Notati<strong>on</strong>3 500 <strong>and</strong> JSON formats to create both humanreadable<br />

<strong>and</strong> -usable mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces to the data available through the PN. Cayless also reported<br />

that he was look<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>to us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the DC TERMS vocabulary as well as other relevant <strong>on</strong>tologies such as<br />

the FRBR vocabulary. 501 Ultimately, Cayless hoped to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k the bibliography <str<strong>on</strong>g>in</str<strong>on</strong>g> <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual papyrus<br />

records to Zotero 502 <strong>and</strong> to ancient places names <str<strong>on</strong>g>in</str<strong>on</strong>g> Pleiades. “It all works well with my design<br />

philosophy for papyri.<str<strong>on</strong>g>in</str<strong>on</strong>g>fo,” Cayless c<strong>on</strong>cluded, “which is that it should c<strong>on</strong>sist of data (<str<strong>on</strong>g>in</str<strong>on</strong>g> the form of<br />

EpiDoc source files <strong>and</strong> representati<strong>on</strong>s of those files), retrievable via sensible URLs, with modular<br />

services surround<str<strong>on</strong>g>in</str<strong>on</strong>g>g the data to make it discoverable <strong>and</strong> usable.”<br />

A recent article by Roger Bagnall has offered an <str<strong>on</strong>g>in</str<strong>on</strong>g>-depth discussi<strong>on</strong> of the IDP project. As he<br />

expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed, the goals of the IDP have changed s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce it was first c<strong>on</strong>ceptualized <str<strong>on</strong>g>in</str<strong>on</strong>g> 1992 <str<strong>on</strong>g>in</str<strong>on</strong>g> two specific<br />

ways:<br />

497 http://idp.atlantides.org/trac/idp/wiki/<br />

498 http://www.papyri.<str<strong>on</strong>g>in</str<strong>on</strong>g>fo<br />

499 http://www.mulgara.org/<br />

500 Notati<strong>on</strong>3 or N3 is a “shorth<strong>and</strong> n<strong>on</strong>-XML serializati<strong>on</strong> of Resource Descripti<strong>on</strong> Framework models, designed with human-readability <str<strong>on</strong>g>in</str<strong>on</strong>g> m<str<strong>on</strong>g>in</str<strong>on</strong>g>d.”<br />

http://en.wikipedia.org/wiki/Notati<strong>on</strong>3<br />

501 http://vocab.org/frbr/core.html<br />

502 http://www.zotero.org/

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!