Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
251<br />
required for efficient scientific work. Even if we have focused here <strong>on</strong> the issue of publicati<strong>on</strong><br />
repositories, which, for many reas<strong>on</strong>s, lie currently at the centre of most debates, it is important<br />
to c<strong>on</strong>sider that this perspective is just <strong>on</strong>e element with<str<strong>on</strong>g>in</str<strong>on</strong>g> a larger set of digital scholarly<br />
services that have to be managed <str<strong>on</strong>g>in</str<strong>on</strong>g> a coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ated way (Romary <strong>and</strong> Armbruster 2009).<br />
Although Romary <strong>and</strong> Armbruster have suggested that the creati<strong>on</strong> of large-scale centralized digital<br />
repositories may be the best soluti<strong>on</strong>, the DRIVER 706 project has <str<strong>on</strong>g>in</str<strong>on</strong>g>stead developed an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure<br />
that supports federated access to over 249 <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual digital repositories across Europe. Their <str<strong>on</strong>g>in</str<strong>on</strong>g>itial<br />
research (Ween<str<strong>on</strong>g>in</str<strong>on</strong>g>k et al. 2008, Feijen et al. 2007) identified a number of issues that needed to be<br />
addressed to create such an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>tellectual property rights, data curati<strong>on</strong>, <strong>and</strong><br />
l<strong>on</strong>g-term preservati<strong>on</strong>. The DRIVER project guidel<str<strong>on</strong>g>in</str<strong>on</strong>g>es m<strong>and</strong>ated a st<strong>and</strong>ard way for repository data<br />
to be exposed but also provided technology to harvest “c<strong>on</strong>tent from multiple repositories <strong>and</strong> manage<br />
its transformati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>to a comm<strong>on</strong> <strong>and</strong> uniform 'shared <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space’” (Feijen et al. 2007). This<br />
shared <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space provides a variety of services, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g (1) “services needed to ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g> it”<br />
so data stores, <str<strong>on</strong>g>in</str<strong>on</strong>g>dexes, <strong>and</strong> aggregators are distributed <strong>on</strong> computers owned by various organizati<strong>on</strong>s;<br />
(2) the ability to add services as necessary; (3) a clean<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> enhancement service that st<strong>and</strong>ardizes<br />
c<strong>on</strong>tent that is harvested <str<strong>on</strong>g>in</str<strong>on</strong>g>to DRIVER records; <strong>and</strong> (4) a search (SRW/CQL 707 ) <strong>and</strong> OAI-Publisher<br />
service that allows all DRIVER records to be used by external applicati<strong>on</strong>s. C<strong>on</strong>sequently, any<br />
repository that wishes to participate can register with<str<strong>on</strong>g>in</str<strong>on</strong>g> the DRIVER <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure <strong>and</strong> have their<br />
c<strong>on</strong>tent “extracted, 'cleaned', <strong>and</strong> aggregated with<str<strong>on</strong>g>in</str<strong>on</strong>g> an <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space for <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated use” (Feijen et<br />
al. 2007). Ultimately, the DRIVER project focused <strong>on</strong> a centralized <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure with an extendable<br />
service model:<br />
S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce the focus of DRIVER has been <strong>on</strong> develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, it has not aimed to provide a<br />
pre-def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed set of services. The <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes open, def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces which allow<br />
any service providers work<str<strong>on</strong>g>in</str<strong>on</strong>g>g at a local, nati<strong>on</strong>al or subject-based level, to build services <strong>on</strong><br />
top. They will be able to reuse the data <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure (the Informati<strong>on</strong> Space) <strong>and</strong> the software<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure to build or enhance their systems. Services can therefore be developed accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />
to the needs of users (Feiijen et al. 2007).<br />
The DRIVER project illustrates the need for <strong>and</strong> viability of a comm<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for digital<br />
preservati<strong>on</strong> <strong>and</strong> data storage, while also support<str<strong>on</strong>g>in</str<strong>on</strong>g>g the ability to develop <str<strong>on</strong>g>in</str<strong>on</strong>g>novative services by<br />
different projects.<br />
One <str<strong>on</strong>g>in</str<strong>on</strong>g>novative approach to support<str<strong>on</strong>g>in</str<strong>on</strong>g>g even more sophisticated levels of repository <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />
has been <str<strong>on</strong>g>in</str<strong>on</strong>g>troduced by Tarrant et al. (2009). Their work used the Object Reuse <strong>and</strong> Exchange (ORE)<br />
framework 708 that was developed by the OAI to support the “descripti<strong>on</strong> <strong>and</strong> exchange of aggregati<strong>on</strong>s<br />
of Web resources” <strong>and</strong> was c<strong>on</strong>ducted as part of the JISC-funded Preserv 2 project, 709 which sought to<br />
f<str<strong>on</strong>g>in</str<strong>on</strong>g>d a way to replicate entire IRs across any repository platforms. As the OAI-ORE specificati<strong>on</strong><br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>cludes approaches for both describ<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital objects <strong>and</strong> “facilitates access <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>gest of these<br />
706 http://www.driver-repository.eu/<br />
707 SRW st<strong>and</strong>s for “Search & Retrieve Web Service” <strong>and</strong> accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to OCLC the development of the SRW st<strong>and</strong>ard is part of a larger <str<strong>on</strong>g>in</str<strong>on</strong>g>ternati<strong>on</strong>al effort<br />
to “develop a st<strong>and</strong>ard web-based text-search<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>terface” (http://www.oclc.org/research/activities/srw/default.htm), it has been built us<str<strong>on</strong>g>in</str<strong>on</strong>g>g “comm<strong>on</strong> web<br />
development tools” <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g WSDL, SOAP, HTTP <strong>and</strong> XML. A related st<strong>and</strong>ard is SRU, which st<strong>and</strong>s for “Search & Retrieve Web Service”, a “URLbased<br />
alternative to SRW” where “messages are sent via HTTP us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the GET method” <strong>and</strong> SRW-SOAP comp<strong>on</strong>ents are mapped to HTTP parameters.<br />
The <strong>Library</strong> of C<strong>on</strong>gress actively ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s the SRW/SRU st<strong>and</strong>ard (http://www.loc.gov/st<strong>and</strong>ards/sru/). CQL st<strong>and</strong>s for “c<strong>on</strong>textual query language”<br />
(http://www.loc.gov/st<strong>and</strong>ards/sru/specs/cql.html) <strong>and</strong> it has been developed as a both human-writable <strong>and</strong> mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e readable “formal language for<br />
represent<str<strong>on</strong>g>in</str<strong>on</strong>g>g queries to <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> retrieval systems such as web <str<strong>on</strong>g>in</str<strong>on</strong>g>dexes, bibliographic catalogs <strong>and</strong> museum collecti<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>.” It is used by SRU<br />
as its st<strong>and</strong>ard query syntax.<br />
708 http://www.openarchives.org/ore/ <strong>and</strong> for more <strong>on</strong> the development of OAI-ORE, see Van de Sompel <strong>and</strong> Lagoze (2007).<br />
709 http://www.preserv.org.uk/