Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
223<br />
S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce HiTHeR also aimed to provide a comprehensive research platform, they chose to offer several<br />
text-m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g algorithms for their users <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g this “cha<str<strong>on</strong>g>in</str<strong>on</strong>g> of read<str<strong>on</strong>g>in</str<strong>on</strong>g>gs.” In additi<strong>on</strong>, users<br />
could upload their own documents, not just use this tool with NCSE collecti<strong>on</strong>s.<br />
The HiTHer project quickly discovered, however, that st<strong>and</strong>ard comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g envir<strong>on</strong>ments did not<br />
provide the level of process<str<strong>on</strong>g>in</str<strong>on</strong>g>g power necessary to run these algorithms. To resolve this problem, they<br />
built an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure based <strong>on</strong> high-throughput comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g (HTC) that uses many computati<strong>on</strong>al<br />
resources to accomplish a s<str<strong>on</strong>g>in</str<strong>on</strong>g>gle computati<strong>on</strong>al task. They made use of the C<strong>on</strong>dor toolkit that let them<br />
rely <strong>on</strong> two types of computers at K<str<strong>on</strong>g>in</str<strong>on</strong>g>g’s College L<strong>on</strong>d<strong>on</strong>, underutilized desktop computers <strong>and</strong><br />
dedicated servers. The authors thus assert that HiTHeR “illustrates how e-Humanities centres can be<br />
served by implement<str<strong>on</strong>g>in</str<strong>on</strong>g>g their own local research <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, which they can relatively easily build<br />
us<str<strong>on</strong>g>in</str<strong>on</strong>g>g exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g resources like st<strong>and</strong>ard desktop networks” (Blanke, Hedges, <strong>and</strong> Palmer 2009).<br />
Another <str<strong>on</strong>g>in</str<strong>on</strong>g>sight offered by the HiTHeR research group was that for most applicati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />
humanities, “large comput<str<strong>on</strong>g>in</str<strong>on</strong>g>g power will <strong>on</strong>ly be needed to prepare data sets for human analysis”<br />
(Blanke, Hedges, <strong>and</strong> Palmer 2009). They suggested that for much humanities research, a user would<br />
simply need to call <strong>on</strong> heavy process<str<strong>on</strong>g>in</str<strong>on</strong>g>g power to analyze a data set <strong>on</strong>ce, <strong>and</strong> would want to spend the<br />
rest of his or her time access<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> analyz<str<strong>on</strong>g>in</str<strong>on</strong>g>g the results; <str<strong>on</strong>g>in</str<strong>on</strong>g> other words, most humanists would need<br />
a “create <strong>on</strong>ce-read many resources” applicati<strong>on</strong> envir<strong>on</strong>ment. This led them to ultimately deploy<br />
HiTHer as a restful web service where humanities scholars could call up<strong>on</strong> a variety of text-m<str<strong>on</strong>g>in</str<strong>on</strong>g><str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
algorithms <strong>and</strong> then receive the results <str<strong>on</strong>g>in</str<strong>on</strong>g> a variety of formats (XHTML, Atom, etc.)<br />
The importance of services, or digital tools more specifically, as <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure has also been discussed<br />
by Geoffrey Rockwell, who provided an overview of the development of <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for textual<br />
analysis that <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded the creati<strong>on</strong> of a portal for textual research called TAPoR <strong>and</strong> the development<br />
of a set of reference tools TAPoRware. 677 The <str<strong>on</strong>g>in</str<strong>on</strong>g>tent was that this portal could be used to discover <strong>and</strong><br />
use tools that had been registered by their creators as web services that were runn<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> various<br />
locati<strong>on</strong>s. The portal was to provide scholars easy access to already-exist<str<strong>on</strong>g>in</str<strong>on</strong>g>g tools <strong>and</strong> to support the<br />
registrati<strong>on</strong>, creati<strong>on</strong>, <strong>and</strong> publish<str<strong>on</strong>g>in</str<strong>on</strong>g>g of new services. Currently the portal is be<str<strong>on</strong>g>in</str<strong>on</strong>g>g re<str<strong>on</strong>g>in</str<strong>on</strong>g>vented,<br />
Rockwell reported, s<str<strong>on</strong>g>in</str<strong>on</strong>g>ce many scholars did not f<str<strong>on</strong>g>in</str<strong>on</strong>g>d it easy to use. He also suggested that web services<br />
are often not as reliable as they should be, <strong>and</strong> that most users require both simplicity <strong>and</strong> reliability.<br />
“My po<str<strong>on</strong>g>in</str<strong>on</strong>g>t here is that the model was to keep tool development as research but make the research tools<br />
easy to discover <strong>and</strong> use through portal-like <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure,” Rockwell expla<str<strong>on</strong>g>in</str<strong>on</strong>g>ed; add<str<strong>on</strong>g>in</str<strong>on</strong>g>g that “a further<br />
paradigm was that tools could be embedded <str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e texts as small viral badges, thereby hid<str<strong>on</strong>g>in</str<strong>on</strong>g>g the<br />
portal <strong>and</strong> foreground<str<strong>on</strong>g>in</str<strong>on</strong>g>g the visible text, an experiment we are just embark<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>on</strong>” (Rockwell 2010).<br />
While Rockwell accentuated that digital tools were an important part of the portal <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure that at<br />
times needed to be “<str<strong>on</strong>g>in</str<strong>on</strong>g>visible” to make the c<strong>on</strong>tent primary, he also argued that tool development is an<br />
important part of the humanities research process <str<strong>on</strong>g>in</str<strong>on</strong>g> itself.<br />
Research libraries <strong>and</strong> digital repositories, as potential key comp<strong>on</strong>ents of cyber<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for the<br />
humanities, will also need to address the complexities of provid<str<strong>on</strong>g>in</str<strong>on</strong>g>g access to both c<strong>on</strong>tent <strong>and</strong> services<br />
as part of a larger networked <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to a recent Associati<strong>on</strong> of Research Libraries<br />
(ARL) report <strong>on</strong> digital repository services for research libraries:<br />
… manag<str<strong>on</strong>g>in</str<strong>on</strong>g>g unique c<strong>on</strong>tent, not just traditi<strong>on</strong>al special collecti<strong>on</strong>s but entirely new k<str<strong>on</strong>g>in</str<strong>on</strong>g>ds of<br />
works <strong>and</strong> locally-created c<strong>on</strong>tent, will be an important emphasis for collecti<strong>on</strong> <strong>and</strong><br />
management. As users exercise new capabilities <strong>and</strong> require new services, library services will<br />
677 http://portal.tapor.ca <strong>and</strong> http://taporware.cmaster.ca