26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

43<br />

“develop a new web-based, modular, collaborative image markup tool for both manual <strong>and</strong> semiautomated<br />

l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g between encoded text <strong>and</strong> image of text, <strong>and</strong> image annotati<strong>on</strong>” <strong>and</strong> has just<br />

announced the release of TILE 0.9. 136 Doug Reside of this project recently outl<str<strong>on</strong>g>in</str<strong>on</strong>g>ed <strong>on</strong> the TILE blog a<br />

“four-layer model for image-based editi<strong>on</strong>s” that was designed to address l<strong>on</strong>g-term preservati<strong>on</strong><br />

issues <strong>and</strong> clearly outl<str<strong>on</strong>g>in</str<strong>on</strong>g>e resp<strong>on</strong>sibilities for digital librarians <strong>and</strong> scholars (Reside 2010).<br />

The first level <str<strong>on</strong>g>in</str<strong>on</strong>g>volves the digitizati<strong>on</strong> of source materials, particularly their l<strong>on</strong>g-term curati<strong>on</strong> <strong>and</strong><br />

distributi<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> open formats with the use of regular <strong>and</strong> progressive nam<str<strong>on</strong>g>in</str<strong>on</strong>g>g systems. Reside made the<br />

useful suggesti<strong>on</strong> that grant<str<strong>on</strong>g>in</str<strong>on</strong>g>g agencies should c<strong>on</strong>sider requir<str<strong>on</strong>g>in</str<strong>on</strong>g>g c<strong>on</strong>tent providers to ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g> stable<br />

uniform resource identifiers (URIs) for at least 10 to 15 years for all digital objects. The sec<strong>on</strong>d level<br />

for image based-editi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g>volves metadata creati<strong>on</strong>, <strong>and</strong> Reside argued that all metadata external to<br />

the file itself (e.g., descriptive rather than technical metadata) bel<strong>on</strong>g at this level. He also proposed<br />

that <str<strong>on</strong>g>in</str<strong>on</strong>g>stituti<strong>on</strong>s or <str<strong>on</strong>g>in</str<strong>on</strong>g>dividuals that did not create the digital files should probably create such metadata:<br />

While the impulse towards quality assurance <strong>and</strong> thorough work is laudable, a perfecti<strong>on</strong>ist<br />

policy that delays publicati<strong>on</strong> of prelim<str<strong>on</strong>g>in</str<strong>on</strong>g>ary work is better suited for immutable pr<str<strong>on</strong>g>in</str<strong>on</strong>g>t media<br />

than an extensible digital archive. In our model, c<strong>on</strong>tent providers need not wait to provide<br />

c<strong>on</strong>tent until it has been processed <strong>and</strong> catalogued (Reside 2010).<br />

By open<str<strong>on</strong>g>in</str<strong>on</strong>g>g the task of catalog<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> resource descripti<strong>on</strong> to a larger audience, Reside hypothesized<br />

that far more c<strong>on</strong>tent could get <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e quickly <strong>and</strong> be available for reuse. Separat<str<strong>on</strong>g>in</str<strong>on</strong>g>g metadata <strong>and</strong><br />

c<strong>on</strong>tent would also allow multiple transcripti<strong>on</strong>s or metadata to po<str<strong>on</strong>g>in</str<strong>on</strong>g>t to the same item’s URI.<br />

The third level of the TILE model <str<strong>on</strong>g>in</str<strong>on</strong>g>volves the <str<strong>on</strong>g>in</str<strong>on</strong>g>terface layer, an often-ignored feature <str<strong>on</strong>g>in</str<strong>on</strong>g> the move to<br />

get open c<strong>on</strong>tent available <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e. While Reside granted that more transcripti<strong>on</strong>s <strong>and</strong> files <str<strong>on</strong>g>in</str<strong>on</strong>g> open<br />

repositories is a useful first step, many humanities scholars still need <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces that do more than<br />

access <strong>on</strong> file at a time. He also recognized that while Software Envir<strong>on</strong>ment for the Advancement of<br />

Scholarly Research (SEASR) is try<str<strong>on</strong>g>in</str<strong>on</strong>g>g to create a susta<str<strong>on</strong>g>in</str<strong>on</strong>g>able model for <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperable digital<br />

humanities tools, their work has not yet met with wide-scale adopti<strong>on</strong>. At this most critical layer,<br />

Reside outl<str<strong>on</strong>g>in</str<strong>on</strong>g>ed the TILE approach:<br />

We propose a code framework for web-based editi<strong>on</strong>s, first implemented <str<strong>on</strong>g>in</str<strong>on</strong>g> JavaScript us<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

the popular jQuery library, but adaptable to other languages when the prevalent w<str<strong>on</strong>g>in</str<strong>on</strong>g>ds of web<br />

development change. An <str<strong>on</strong>g>in</str<strong>on</strong>g>stance of this framework is composed of a manifest file (probably <str<strong>on</strong>g>in</str<strong>on</strong>g><br />

XML or JSON 137 format) that identifies the locati<strong>on</strong>s of the relevant c<strong>on</strong>tent <strong>and</strong> any associated<br />

metadata <strong>and</strong> a core file (similar to, but c<strong>on</strong>siderably leaner than, the core jQuery.js file at the<br />

heart of the popular JavaScript library) with a system of “hooks” <strong>on</strong>to which developers might<br />

hang widgets they develop for their own editi<strong>on</strong>s. A widget, <str<strong>on</strong>g>in</str<strong>on</strong>g> this c<strong>on</strong>text, is a program with<br />

limited functi<strong>on</strong>ality that provides well-def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed resp<strong>on</strong>ses to specific <str<strong>on</strong>g>in</str<strong>on</strong>g>put (Reside 2009).<br />

This model thus <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes a manifest file that c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s all c<strong>on</strong>tent locati<strong>on</strong>s <strong>and</strong> associated metadata,<br />

<strong>and</strong> a core file, or base text, that can be used by different developers to create their own digital editi<strong>on</strong>s<br />

utiliz<str<strong>on</strong>g>in</str<strong>on</strong>g>g their own tools or “widgets.” Widgets should depend <strong>on</strong>ly <strong>on</strong> the core files, Reside argued,<br />

not <strong>on</strong> each other, <strong>and</strong> ideally they could be shared between scholars. Reside admitted that basically<br />

they are propos<str<strong>on</strong>g>in</str<strong>on</strong>g>g the development of a “c<strong>on</strong>tent management system” (CMS) for manag<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

136 This software <strong>and</strong> its functi<strong>on</strong>ality are discussed later <str<strong>on</strong>g>in</str<strong>on</strong>g> this paper.<br />

137 JSON, or JavaScript Object Notati<strong>on</strong> is a “lightweight data-<str<strong>on</strong>g>in</str<strong>on</strong>g>terchange format” that is based <strong>on</strong> the JavaScript programm<str<strong>on</strong>g>in</str<strong>on</strong>g>g language but is also a<br />

“text format that is completely language <str<strong>on</strong>g>in</str<strong>on</strong>g>dependent.” (http://www.js<strong>on</strong>.org/)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!