26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

159<br />

was aimed at two types of users: general users of libraries who wished to exam<str<strong>on</strong>g>in</str<strong>on</strong>g>e manuscripts, <strong>and</strong><br />

“professi<strong>on</strong>al students of texts” or philologists, whom they def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as “critical editors of classical or<br />

medieval works that are h<strong>and</strong>-written <strong>on</strong> material supports of various types (paper, papyrus, st<strong>on</strong>e)”<br />

(Bozzi <strong>and</strong> Calabretto 1997). The authors thus developed a “philological workstati<strong>on</strong>” that <str<strong>on</strong>g>in</str<strong>on</strong>g>cluded<br />

four major features: (1) the ability to look up digital images <str<strong>on</strong>g>in</str<strong>on</strong>g> an archive; (2) the transcripti<strong>on</strong>,<br />

annotati<strong>on</strong>, <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of images; (3) the view<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcribed versi<strong>on</strong>s of texts <strong>and</strong> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g an<br />

“Index Locorum”; <strong>and</strong> (4) the automatic match<str<strong>on</strong>g>in</str<strong>on</strong>g>g of words found <str<strong>on</strong>g>in</str<strong>on</strong>g> transcripti<strong>on</strong>s, the “Index<br />

Locorum,” <strong>and</strong> annotati<strong>on</strong>s with the relevant porti<strong>on</strong> of the source-document image that c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>s the<br />

word. This last feature, while desired by many other digital editi<strong>on</strong> <strong>and</strong> manuscript projects, is still an<br />

area of unresolved <strong>and</strong> active research (Cayless 2008, Cayless 2009, Porter et al. 2009).<br />

In an overview of their philological workstati<strong>on</strong>, Bozzi <strong>and</strong> Calabretto listed the functi<strong>on</strong>s that it<br />

supported. To beg<str<strong>on</strong>g>in</str<strong>on</strong>g> with, the workstati<strong>on</strong> allowed users to search manuscript collecti<strong>on</strong>s <strong>and</strong> to create<br />

transcripti<strong>on</strong>s of digital images of manuscripts <strong>and</strong> export them as RTF or SGML. One important<br />

feature was the <str<strong>on</strong>g>in</str<strong>on</strong>g>dex<str<strong>on</strong>g>in</str<strong>on</strong>g>g of transcripti<strong>on</strong>s that could be used by philologists to generate an “Index<br />

Verborum” <strong>and</strong> an “Index Locorum” for each script <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript (e.g., Greek <strong>and</strong> Lat<str<strong>on</strong>g>in</str<strong>on</strong>g>). The<br />

“Index Verborum” c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed all the words appear<str<strong>on</strong>g>in</str<strong>on</strong>g>g <str<strong>on</strong>g>in</str<strong>on</strong>g> the transcripti<strong>on</strong> <strong>and</strong> the words that were<br />

corrected by the user (us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the text variant functi<strong>on</strong>), while the “Index Locorum” displayed “the<br />

positi<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> which each word occurs <str<strong>on</strong>g>in</str<strong>on</strong>g> the manuscript.” In additi<strong>on</strong>, annotati<strong>on</strong>s could be created <strong>on</strong><br />

manuscript transcripti<strong>on</strong>s, <strong>and</strong> all annotati<strong>on</strong>s c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed two dist<str<strong>on</strong>g>in</str<strong>on</strong>g>ct fields, <strong>on</strong>e for free comments <strong>and</strong><br />

the critical apparatus, <strong>and</strong> <strong>on</strong>e for variants, syn<strong>on</strong>yms, <strong>and</strong> the correcti<strong>on</strong> of syntax. The BAMBI<br />

workstati<strong>on</strong> also supported automatic column <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>e recogniti<strong>on</strong> <strong>and</strong>, even more important, the<br />

automatic creati<strong>on</strong> of a word-image c<strong>on</strong>cordance (if a transcripti<strong>on</strong> for a manuscript was available) that<br />

matches each word of the text with the appropriate porti<strong>on</strong> of the image. The c<strong>on</strong>cordance was built<br />

automatically, <strong>and</strong> this module provided a simultaneous view of the transcripti<strong>on</strong> <strong>and</strong> the image so the<br />

user could check its accuracy. It also allowed the user to query the manuscript collecti<strong>on</strong> by select<str<strong>on</strong>g>in</str<strong>on</strong>g>g a<br />

word <str<strong>on</strong>g>in</str<strong>on</strong>g> either the transcripti<strong>on</strong> or <strong>on</strong> the image. The BAMBI prototype made use of HyTime (an<br />

extensi<strong>on</strong> of SGML) to model works <strong>on</strong> ancient manuscripts, <str<strong>on</strong>g>in</str<strong>on</strong>g> particular because it allowed<br />

“specificati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks between text <strong>and</strong> part of image (part of an object).”<br />

While the fuller technical details of this workstati<strong>on</strong> are somewhat outdated as of this writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the<br />

unanswered issues identified by the BAMBI project are still largely relevant for digital philology.<br />

Bozzi <strong>and</strong> Calabretto noted that the follow<str<strong>on</strong>g>in</str<strong>on</strong>g>g requirements needed to be met: better st<strong>and</strong>ards-based<br />

tools for the descripti<strong>on</strong> of manuscripts; more sophisticated image-process<str<strong>on</strong>g>in</str<strong>on</strong>g>g rout<str<strong>on</strong>g>in</str<strong>on</strong>g>es (although they<br />

called for the enhancement of microfilm images rather than the images of manuscripts themselves); “a<br />

comprehensive soluti<strong>on</strong> for the management of text variants”; “tools based <strong>on</strong> image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g<br />

facilities <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>guistic (statistical) facilities for the electr<strong>on</strong>ic restorati<strong>on</strong> of miss<str<strong>on</strong>g>in</str<strong>on</strong>g>g text elements”;<br />

new models for collaborative work (though work today has moved bey<strong>on</strong>d client-server models based<br />

<strong>on</strong> the web); <strong>and</strong> a survey of the technical <strong>and</strong> legal issues <str<strong>on</strong>g>in</str<strong>on</strong>g>volved <str<strong>on</strong>g>in</str<strong>on</strong>g> creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g “widespread, multisource<br />

services offer<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital versi<strong>on</strong>s of library materials <strong>and</strong> the tools for their use” (Bozzi <strong>and</strong><br />

Calabretto 1997). As has been seen <str<strong>on</strong>g>in</str<strong>on</strong>g> this review, the challenges of manuscript descripti<strong>on</strong>, advanced<br />

image process<str<strong>on</strong>g>in</str<strong>on</strong>g>g, the management of text variants, the creati<strong>on</strong> of sophisticated digital tools,<br />

collaborative workspaces, <strong>and</strong> comprehensive open-source digital libraries rema<str<strong>on</strong>g>in</str<strong>on</strong>g> topics of c<strong>on</strong>cern.<br />

Other research <str<strong>on</strong>g>in</str<strong>on</strong>g> digital philology has been c<strong>on</strong>ducted by the Aristarchus project, 519 <strong>and</strong> an article by<br />

Franco M<strong>on</strong>tanari (M<strong>on</strong>tanari 2004) has provided an overview of the electr<strong>on</strong>ic tools for classical<br />

519 http://www.aristarchus.unige.it/<str<strong>on</strong>g>in</str<strong>on</strong>g>dex_<str<strong>on</strong>g>in</str<strong>on</strong>g>glese.php

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!