26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

57<br />

While the FRBR hierarchy <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes Works, Expressi<strong>on</strong>s, Manifestati<strong>on</strong>s, <strong>and</strong> Items, the CTS protocol<br />

uses the terms Work, Editi<strong>on</strong>, Translati<strong>on</strong>, <strong>and</strong> Exemplar. As communicated by Porter et al. (2006),<br />

CTS extends the FRBR hierarchy upward by “group<str<strong>on</strong>g>in</str<strong>on</strong>g>g Works under a noti<strong>on</strong>al entity called<br />

“TextGroup,’” an entity that can refer to authors for literary texts or to corpora such as <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s<br />

(e.g., “Berl<str<strong>on</strong>g>in</str<strong>on</strong>g>” for a published corpus of papyri). It also extends the FRBR hierarchy downward to<br />

support the “identificati<strong>on</strong> <strong>and</strong> abstracti<strong>on</strong> of citable chunks of text (Homer, Iliad Book 1, L<str<strong>on</strong>g>in</str<strong>on</strong>g>e 123), or<br />

ranges of citable chunks (Hom.~ Il. 1.123-2.22).” This both downward <strong>and</strong> upward extensi<strong>on</strong> of FRBR<br />

is important, for as Romanello (2008) underscores, “it allows <strong>on</strong>e to reach a higher granularity when<br />

access<str<strong>on</strong>g>in</str<strong>on</strong>g>g documents hierarchically <strong>and</strong> supports the use of a citati<strong>on</strong> scheme referr<str<strong>on</strong>g>in</str<strong>on</strong>g>g to each level of<br />

the entire document hierarchical structure.” Romanello notes that another important feature of CTS is<br />

that it enables the differentiati<strong>on</strong> of “different exemplars” of the same text.<br />

As noted above, citati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> CTS are expressed as CTS URNs. CTS URNs “provide the permanent<br />

can<strong>on</strong>ical references that Can<strong>on</strong>ical Text Services (CTS) rely <strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> order to identify or retrieve<br />

passages of text. These references are a k<str<strong>on</strong>g>in</str<strong>on</strong>g>d of Uniform Resource Name (URN).” 169 URNs, accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

to RFC2141 170 , “are <str<strong>on</strong>g>in</str<strong>on</strong>g>tended to serve as persistent, locati<strong>on</strong>-<str<strong>on</strong>g>in</str<strong>on</strong>g>dependent, resource identifiers.” Smith<br />

(2009) provides extensive <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <strong>on</strong> the syntax of CTS-URNs <strong>and</strong> their role <str<strong>on</strong>g>in</str<strong>on</strong>g> the CTS.<br />

This same article also addressed the importance of st<strong>and</strong>ards such as CTS (Smith 2009). “Source<br />

citati<strong>on</strong> is just <strong>on</strong>e part of scholarly publicati<strong>on</strong>, <strong>and</strong> c<strong>on</strong>venti<strong>on</strong>s for cit<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources digitally must be<br />

viewed as part of a larger architectural design,” Smith expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed, “when the digital library is the global<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>ternet, the natural architecture for scholarly publicati<strong>on</strong>s is a hierarchy of service.” In additi<strong>on</strong> to the<br />

need for st<strong>and</strong>ards <strong>and</strong> c<strong>on</strong>venti<strong>on</strong>s for cit<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources or parts of resources, the importance of<br />

st<strong>and</strong>ard c<strong>on</strong>venti<strong>on</strong>s or <strong>on</strong>tologies for add<str<strong>on</strong>g>in</str<strong>on</strong>g>g semantic encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g for named entities (such as citati<strong>on</strong>s)<br />

to sec<strong>on</strong>dary literature <strong>and</strong> also to web pages <strong>and</strong> then l<str<strong>on</strong>g>in</str<strong>on</strong>g>k them to <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e representati<strong>on</strong>s of primary<br />

<strong>and</strong> other sources has been the subject of a series of recent blog posts by Sebastian Heath of the<br />

American Numismatics Society (ANS). 171<br />

In a 2010 post entitled “RDFa Patterns for Ancient World References,” 172 Heath described his efforts<br />

to encode the year, different named entities such as Polem<strong>on</strong>, an imperial cult, <strong>and</strong> two text citati<strong>on</strong>s<br />

with<str<strong>on</strong>g>in</str<strong>on</strong>g> a chosen text. He wanted to embed this <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>to a sample web page us<str<strong>on</strong>g>in</str<strong>on</strong>g>g st<strong>and</strong>ards<br />

such as RDFa 173 to make the data “automatically recognizable by third-parties.” To encode this<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> he utilized RDFa <strong>and</strong> a number of other <strong>on</strong>tologies with resolvable namespaces (e.g.,<br />

Dbpedia, CITO, FOAF, FRBR, OWL, RDFS, SKOS). Heath listed two references with<str<strong>on</strong>g>in</str<strong>on</strong>g> his sample<br />

text, the first to a published <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> <strong>and</strong> the sec<strong>on</strong>d to a recently published book. Interest<str<strong>on</strong>g>in</str<strong>on</strong>g>gly,<br />

while Heath was able to l<str<strong>on</strong>g>in</str<strong>on</strong>g>k to a bibliographic descripti<strong>on</strong> of the published book with<str<strong>on</strong>g>in</str<strong>on</strong>g> WorldCat,<br />

encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g the reference to the published <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> with CITO (Citati<strong>on</strong> Typ<str<strong>on</strong>g>in</str<strong>on</strong>g>g Otology) was<br />

problematic because there was no value for “cites as a primary source” available with<str<strong>on</strong>g>in</str<strong>on</strong>g> this <strong>on</strong>tology.<br />

An additi<strong>on</strong>al complicati<strong>on</strong> was that Heath simply wanted to cite the <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>, <strong>and</strong> there<br />

was no way to cite just <strong>on</strong>e <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong> or “work” with<str<strong>on</strong>g>in</str<strong>on</strong>g> the larger published volume of <str<strong>on</strong>g>in</str<strong>on</strong>g>scripti<strong>on</strong>s.<br />

“The c<strong>on</strong>cept of “Primary Source” <strong>and</strong> references thereto is important for the Humanities <strong>and</strong> we need<br />

a way of <str<strong>on</strong>g>in</str<strong>on</strong>g>dicat<str<strong>on</strong>g>in</str<strong>on</strong>g>g its usage,” Heath noted. “It's also important that I'm referr<str<strong>on</strong>g>in</str<strong>on</strong>g>g to the publicati<strong>on</strong> of<br />

169 http://chs75.chs.harvard.edu/projects/dig<str<strong>on</strong>g>in</str<strong>on</strong>g>c/techpub/cts-urn-overview<br />

170 RFC2141 is a “memo” released by the Internet Eng<str<strong>on</strong>g>in</str<strong>on</strong>g>eer<str<strong>on</strong>g>in</str<strong>on</strong>g>g Task Force (IETF) that specified the “can<strong>on</strong>ical syntax” for URNs<br />

(http://tools.ietf.org/html/rfc2141)<br />

171 http://numismatics.org<br />

172 http://mediterraneanceramics.blogspot.com/2010/01/rdfa-patterns-for-ancient-world.html<br />

173 “RDFa is a specificati<strong>on</strong> for attributes to express structured data <str<strong>on</strong>g>in</str<strong>on</strong>g> any markup language” (http://www.w3.org/TR/rdfa-syntax/). See also<br />

http://www.w3.org/TR/xhtml-rdfa-primer/

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!