26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

251<br />

required for efficient scientific work. Even if we have focused here <strong>on</strong> the issue of publicati<strong>on</strong><br />

repositories, which, for many reas<strong>on</strong>s, lie currently at the centre of most debates, it is important<br />

to c<strong>on</strong>sider that this perspective is just <strong>on</strong>e element with<str<strong>on</strong>g>in</str<strong>on</strong>g> a larger set of digital scholarly<br />

services that have to be managed <str<strong>on</strong>g>in</str<strong>on</strong>g> a coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ated way (Romary <strong>and</strong> Armbruster 2009).<br />

Although Romary <strong>and</strong> Armbruster have suggested that the creati<strong>on</strong> of large-scale centralized digital<br />

repositories may be the best soluti<strong>on</strong>, the DRIVER 706 project has <str<strong>on</strong>g>in</str<strong>on</strong>g>stead developed an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure<br />

that supports federated access to over 249 <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual digital repositories across Europe. Their <str<strong>on</strong>g>in</str<strong>on</strong>g>itial<br />

research (Ween<str<strong>on</strong>g>in</str<strong>on</strong>g>k et al. 2008, Feijen et al. 2007) identified a number of issues that needed to be<br />

addressed to create such an <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>tellectual property rights, data curati<strong>on</strong>, <strong>and</strong><br />

l<strong>on</strong>g-term preservati<strong>on</strong>. The DRIVER project guidel<str<strong>on</strong>g>in</str<strong>on</strong>g>es m<strong>and</strong>ated a st<strong>and</strong>ard way for repository data<br />

to be exposed but also provided technology to harvest “c<strong>on</strong>tent from multiple repositories <strong>and</strong> manage<br />

its transformati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>to a comm<strong>on</strong> <strong>and</strong> uniform 'shared <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space’” (Feijen et al. 2007). This<br />

shared <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space provides a variety of services, <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g (1) “services needed to ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g> it”<br />

so data stores, <str<strong>on</strong>g>in</str<strong>on</strong>g>dexes, <strong>and</strong> aggregators are distributed <strong>on</strong> computers owned by various organizati<strong>on</strong>s;<br />

(2) the ability to add services as necessary; (3) a clean<str<strong>on</strong>g>in</str<strong>on</strong>g>g <strong>and</strong> enhancement service that st<strong>and</strong>ardizes<br />

c<strong>on</strong>tent that is harvested <str<strong>on</strong>g>in</str<strong>on</strong>g>to DRIVER records; <strong>and</strong> (4) a search (SRW/CQL 707 ) <strong>and</strong> OAI-Publisher<br />

service that allows all DRIVER records to be used by external applicati<strong>on</strong>s. C<strong>on</strong>sequently, any<br />

repository that wishes to participate can register with<str<strong>on</strong>g>in</str<strong>on</strong>g> the DRIVER <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure <strong>and</strong> have their<br />

c<strong>on</strong>tent “extracted, 'cleaned', <strong>and</strong> aggregated with<str<strong>on</strong>g>in</str<strong>on</strong>g> an <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> space for <str<strong>on</strong>g>in</str<strong>on</strong>g>tegrated use” (Feijen et<br />

al. 2007). Ultimately, the DRIVER project focused <strong>on</strong> a centralized <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure with an extendable<br />

service model:<br />

S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce the focus of DRIVER has been <strong>on</strong> develop<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure, it has not aimed to provide a<br />

pre-def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed set of services. The <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure <str<strong>on</strong>g>in</str<strong>on</strong>g>cludes open, def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed <str<strong>on</strong>g>in</str<strong>on</strong>g>terfaces which allow<br />

any service providers work<str<strong>on</strong>g>in</str<strong>on</strong>g>g at a local, nati<strong>on</strong>al or subject-based level, to build services <strong>on</strong><br />

top. They will be able to reuse the data <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure (the Informati<strong>on</strong> Space) <strong>and</strong> the software<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure to build or enhance their systems. Services can therefore be developed accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

to the needs of users (Feiijen et al. 2007).<br />

The DRIVER project illustrates the need for <strong>and</strong> viability of a comm<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>frastructure for digital<br />

preservati<strong>on</strong> <strong>and</strong> data storage, while also support<str<strong>on</strong>g>in</str<strong>on</strong>g>g the ability to develop <str<strong>on</strong>g>in</str<strong>on</strong>g>novative services by<br />

different projects.<br />

One <str<strong>on</strong>g>in</str<strong>on</strong>g>novative approach to support<str<strong>on</strong>g>in</str<strong>on</strong>g>g even more sophisticated levels of repository <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperability<br />

has been <str<strong>on</strong>g>in</str<strong>on</strong>g>troduced by Tarrant et al. (2009). Their work used the Object Reuse <strong>and</strong> Exchange (ORE)<br />

framework 708 that was developed by the OAI to support the “descripti<strong>on</strong> <strong>and</strong> exchange of aggregati<strong>on</strong>s<br />

of Web resources” <strong>and</strong> was c<strong>on</strong>ducted as part of the JISC-funded Preserv 2 project, 709 which sought to<br />

f<str<strong>on</strong>g>in</str<strong>on</strong>g>d a way to replicate entire IRs across any repository platforms. As the OAI-ORE specificati<strong>on</strong><br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>cludes approaches for both describ<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital objects <strong>and</strong> “facilitates access <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>gest of these<br />

706 http://www.driver-repository.eu/<br />

707 SRW st<strong>and</strong>s for “Search & Retrieve Web Service” <strong>and</strong> accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to OCLC the development of the SRW st<strong>and</strong>ard is part of a larger <str<strong>on</strong>g>in</str<strong>on</strong>g>ternati<strong>on</strong>al effort<br />

to “develop a st<strong>and</strong>ard web-based text-search<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g>terface” (http://www.oclc.org/research/activities/srw/default.htm), it has been built us<str<strong>on</strong>g>in</str<strong>on</strong>g>g “comm<strong>on</strong> web<br />

development tools” <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g WSDL, SOAP, HTTP <strong>and</strong> XML. A related st<strong>and</strong>ard is SRU, which st<strong>and</strong>s for “Search & Retrieve Web Service”, a “URLbased<br />

alternative to SRW” where “messages are sent via HTTP us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the GET method” <strong>and</strong> SRW-SOAP comp<strong>on</strong>ents are mapped to HTTP parameters.<br />

The <strong>Library</strong> of C<strong>on</strong>gress actively ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s the SRW/SRU st<strong>and</strong>ard (http://www.loc.gov/st<strong>and</strong>ards/sru/). CQL st<strong>and</strong>s for “c<strong>on</strong>textual query language”<br />

(http://www.loc.gov/st<strong>and</strong>ards/sru/specs/cql.html) <strong>and</strong> it has been developed as a both human-writable <strong>and</strong> mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e readable “formal language for<br />

represent<str<strong>on</strong>g>in</str<strong>on</strong>g>g queries to <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong> retrieval systems such as web <str<strong>on</strong>g>in</str<strong>on</strong>g>dexes, bibliographic catalogs <strong>and</strong> museum collecti<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>.” It is used by SRU<br />

as its st<strong>and</strong>ard query syntax.<br />

708 http://www.openarchives.org/ore/ <strong>and</strong> for more <strong>on</strong> the development of OAI-ORE, see Van de Sompel <strong>and</strong> Lagoze (2007).<br />

709 http://www.preserv.org.uk/

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!