26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

223<br />

S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce HiTHeR also aimed to provide a comprehensive research platform, they chose to offer several<br />

text-m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g algorithms for their users <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g this “cha<str<strong>on</strong>g>in</str<strong>on</strong>g> of read<str<strong>on</strong>g>in</str<strong>on</strong>g>gs.” In additi<strong>on</strong>, users<br />

could upload their own documents, not just use this tool with NCSE collecti<strong>on</strong>s.<br />

The HiTHer project quickly discovered, however, that st<strong>and</strong>ard comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g envir<strong>on</strong>ments did not<br />

provide the level of process<str<strong>on</strong>g>in</str<strong>on</strong>g>g power necessary to run these algorithms. To resolve this problem, they<br />

built an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure based <strong>on</strong> high-throughput comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g (HTC) that uses many computati<strong>on</strong>al<br />

resources to accomplish a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle computati<strong>on</strong>al task. They made use of the C<strong>on</strong>dor toolkit that let them<br />

rely <strong>on</strong> two types of computers at K<str<strong>on</strong>g>in</str<strong>on</strong>g>g’s College L<strong>on</strong>d<strong>on</strong>, underutilized desktop computers <strong>and</strong><br />

dedicated servers. The authors thus assert that HiTHeR “illustrates how e-Humanities centres can be<br />

served by implement<str<strong>on</strong>g>in</str<strong>on</strong>g>g their own local research <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, which they can relatively easily build<br />

us<str<strong>on</strong>g>in</str<strong>on</strong>g>g exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources like st<strong>and</strong>ard desktop networks” (Blanke, Hedges, <strong>and</strong> Palmer 2009).<br />

Another <str<strong>on</strong>g>in</str<strong>on</strong>g>sight offered by the HiTHeR research group was that for most applicati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />

humanities, “large comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g power will <strong>on</strong>ly be needed to prepare data sets for human analysis”<br />

(Blanke, Hedges, <strong>and</strong> Palmer 2009). They suggested that for much humanities research, a user would<br />

simply need to call <strong>on</strong> heavy process<str<strong>on</strong>g>in</str<strong>on</strong>g>g power to analyze a data set <strong>on</strong>ce, <strong>and</strong> would want to spend the<br />

rest of his or her time access<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> analyz<str<strong>on</strong>g>in</str<strong>on</strong>g>g the results; <str<strong>on</strong>g>in</str<strong>on</strong>g> other words, most humanists would need<br />

a “create <strong>on</strong>ce-read many resources” applicati<strong>on</strong> envir<strong>on</strong>ment. This led them to ultimately deploy<br />

HiTHer as a restful web service where humanities scholars could call up<strong>on</strong> a variety of text-m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

algorithms <strong>and</strong> then receive the results <str<strong>on</strong>g>in</str<strong>on</strong>g> a variety of formats (XHTML, Atom, etc.)<br />

The importance of services, or digital tools more specifically, as <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure has also been discussed<br />

by Geoffrey Rockwell, who provided an overview of the development of <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for textual<br />

analysis that <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded the creati<strong>on</strong> of a portal for textual research called TAPoR <strong>and</strong> the development<br />

of a set of reference tools TAPoRware. 677 The <str<strong>on</strong>g>in</str<strong>on</strong>g>tent was that this portal could be used to discover <strong>and</strong><br />

use tools that had been registered by their creators as web services that were runn<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> various<br />

locati<strong>on</strong>s. The portal was to provide scholars easy access to already-exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g tools <strong>and</strong> to support the<br />

registrati<strong>on</strong>, creati<strong>on</strong>, <strong>and</strong> publish<str<strong>on</strong>g>in</str<strong>on</strong>g>g of new services. Currently the portal is be<str<strong>on</strong>g>in</str<strong>on</strong>g>g re<str<strong>on</strong>g>in</str<strong>on</strong>g>vented,<br />

Rockwell reported, s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce many scholars did not f<str<strong>on</strong>g>in</str<strong>on</strong>g>d it easy to use. He also suggested that web services<br />

are often not as reliable as they should be, <strong>and</strong> that most users require both simplicity <strong>and</strong> reliability.<br />

“My po<str<strong>on</strong>g>in</str<strong>on</strong>g>t here is that the model was to keep tool development as research but make the research tools<br />

easy to discover <strong>and</strong> use through portal-like <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure,” Rockwell expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed; add<str<strong>on</strong>g>in</str<strong>on</strong>g>g that “a further<br />

paradigm was that tools could be embedded <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e texts as small viral badges, thereby hid<str<strong>on</strong>g>in</str<strong>on</strong>g>g the<br />

portal <strong>and</strong> foreground<str<strong>on</strong>g>in</str<strong>on</strong>g>g the visible text, an experiment we are just embark<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong>” (Rockwell 2010).<br />

While Rockwell accentuated that digital tools were an important part of the portal <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure that at<br />

times needed to be “<str<strong>on</strong>g>in</str<strong>on</strong>g>visible” to make the c<strong>on</strong>tent primary, he also argued that tool development is an<br />

important part of the humanities research process <str<strong>on</strong>g>in</str<strong>on</strong>g> itself.<br />

Research libraries <strong>and</strong> digital repositories, as potential key comp<strong>on</strong>ents of cyber<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for the<br />

humanities, will also need to address the complexities of provid<str<strong>on</strong>g>in</str<strong>on</strong>g>g access to both c<strong>on</strong>tent <strong>and</strong> services<br />

as part of a larger networked <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to a recent Associati<strong>on</strong> of Research Libraries<br />

(ARL) report <strong>on</strong> digital repository services for research libraries:<br />

… manag<str<strong>on</strong>g>in</str<strong>on</strong>g>g unique c<strong>on</strong>tent, not just traditi<strong>on</strong>al special collecti<strong>on</strong>s but entirely new k<str<strong>on</strong>g>in</str<strong>on</strong>g>ds of<br />

works <strong>and</strong> locally-created c<strong>on</strong>tent, will be an important emphasis for collecti<strong>on</strong> <strong>and</strong><br />

management. As users exercise new capabilities <strong>and</strong> require new services, library services will<br />

677 http://portal.tapor.ca <strong>and</strong> http://taporware.cmaster.ca

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!