26.12.2014 Views

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

Rome Wasn't Digitized in a Day - Council on Library and Information ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

131<br />

IMT <strong>and</strong> be “capable of produc<str<strong>on</strong>g>in</str<strong>on</strong>g>g TEI-compliant XML for l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g image to text.” Similar to Cayless,<br />

the TILE project wants to support l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g bey<strong>on</strong>d the page level, such as the ability, for example, “to<br />

l<str<strong>on</strong>g>in</str<strong>on</strong>g>k from a word <str<strong>on</strong>g>in</str<strong>on</strong>g> the edited text to its locati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> the image” or to “click an <str<strong>on</strong>g>in</str<strong>on</strong>g>terest<str<strong>on</strong>g>in</str<strong>on</strong>g>g area <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />

image to read an annotati<strong>on</strong>” (Porter et al. 2009). While Porter et al. recognized that a number of<br />

tools 421 allowed users to edit or display images with<str<strong>on</strong>g>in</str<strong>on</strong>g> the larger c<strong>on</strong>text of creat<str<strong>on</strong>g>in</str<strong>on</strong>g>g digital editi<strong>on</strong>s,<br />

they found that n<strong>on</strong>e of these tools c<strong>on</strong>ta<str<strong>on</strong>g>in</str<strong>on</strong>g>ed all the functi<strong>on</strong>ality they desired.<br />

Of all of the tools they menti<strong>on</strong>, Porter et al. stated that <strong>on</strong>ly the IMT outputs complete <strong>and</strong> valid TEI<br />

P5 XML, but it runs <strong>on</strong>ly <strong>on</strong> W<str<strong>on</strong>g>in</str<strong>on</strong>g>dows mach<str<strong>on</strong>g>in</str<strong>on</strong>g>es. While TILE will <str<strong>on</strong>g>in</str<strong>on</strong>g>teroperate with the “c<strong>on</strong>stra<str<strong>on</strong>g>in</str<strong>on</strong>g>ed<br />

IMT TEI format,” it will also provide output <str<strong>on</strong>g>in</str<strong>on</strong>g> a variety of formats. A recent blog entry by Dorothy<br />

Porter listed these formats as <str<strong>on</strong>g>in</str<strong>on</strong>g>clud<str<strong>on</strong>g>in</str<strong>on</strong>g>g “any flavour” of TEI, METS 422 files, <strong>and</strong> output that is not <str<strong>on</strong>g>in</str<strong>on</strong>g><br />

XML. “One result of this flexibility is that, aga<str<strong>on</strong>g>in</str<strong>on</strong>g> unlike the IMT, TILE will not be “plug <strong>and</strong> play,”<br />

<strong>and</strong> process<str<strong>on</strong>g>in</str<strong>on</strong>g>g of the output will be the resp<strong>on</strong>sibility of projects us<str<strong>on</strong>g>in</str<strong>on</strong>g>g the software,” Porter<br />

acknowledged, “This will require a bit of work <strong>on</strong> the part of users. On the other h<strong>and</strong>, as a modular set<br />

of tools, TILE will be able to be <str<strong>on</strong>g>in</str<strong>on</strong>g>corporated <str<strong>on</strong>g>in</str<strong>on</strong>g>to other digital edit<str<strong>on</strong>g>in</str<strong>on</strong>g>g software suites that would<br />

otherwise have to design their own text-image l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g functi<strong>on</strong>ality or go without” (Porter 2010).<br />

AXE enabled the collaborative tagg<str<strong>on</strong>g>in</str<strong>on</strong>g>g of TEI texts, the associati<strong>on</strong> of XML with “time stamps <str<strong>on</strong>g>in</str<strong>on</strong>g><br />

video or audio files,” <strong>and</strong> the mark<str<strong>on</strong>g>in</str<strong>on</strong>g>g of image regi<strong>on</strong>s that could then be l<str<strong>on</strong>g>in</str<strong>on</strong>g>ked to external metadata,<br />

<strong>and</strong> TILE will extend these functi<strong>on</strong>alities. One significant issue with AXE was that while it did allow<br />

users to annotate image regi<strong>on</strong>s <strong>and</strong> store those coord<str<strong>on</strong>g>in</str<strong>on</strong>g>ates <str<strong>on</strong>g>in</str<strong>on</strong>g> a database, it did not provide any data<br />

analysis tools for this <str<strong>on</strong>g>in</str<strong>on</strong>g>formati<strong>on</strong>. The most significant way <str<strong>on</strong>g>in</str<strong>on</strong>g> which TILE will extend AXE then is<br />

that it will support:<br />

Semi-automated creati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks between transcripti<strong>on</strong>s <strong>and</strong> images of the materials from<br />

which the transcripti<strong>on</strong>s were made. Us<str<strong>on</strong>g>in</str<strong>on</strong>g>g a form of optical character recogniti<strong>on</strong>, our software<br />

will recognize words <str<strong>on</strong>g>in</str<strong>on</strong>g> a page image <strong>and</strong> l<str<strong>on</strong>g>in</str<strong>on</strong>g>k them to a preexist<str<strong>on</strong>g>in</str<strong>on</strong>g>g textual transcripti<strong>on</strong><br />

(Porter et al. 2009).<br />

As with the research of Cayless, the pr<str<strong>on</strong>g>in</str<strong>on</strong>g>cipal goal of this work is to be able to support the l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g of<br />

manuscript transcripti<strong>on</strong>s <strong>and</strong> images at the <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual word level. Some other <str<strong>on</strong>g>in</str<strong>on</strong>g>tended functi<strong>on</strong>alities<br />

<str<strong>on</strong>g>in</str<strong>on</strong>g>clude image annotati<strong>on</strong> with c<strong>on</strong>trolled vocabularies, the creati<strong>on</strong> of editorial annotati<strong>on</strong>s, 423 <strong>and</strong> the<br />

creati<strong>on</strong> of l<str<strong>on</strong>g>in</str<strong>on</strong>g>ks between “different, n<strong>on</strong>-c<strong>on</strong>tiguous areas of primary source images” such as capti<strong>on</strong>s<br />

<strong>and</strong> illustrati<strong>on</strong>s or “analogous texts across different manuscripts.”<br />

Numismatics<br />

Numismatics has been def<str<strong>on</strong>g>in</str<strong>on</strong>g>ed as “the collecti<strong>on</strong> <strong>and</strong> study of m<strong>on</strong>ey (<strong>and</strong> co<str<strong>on</strong>g>in</str<strong>on</strong>g>s <str<strong>on</strong>g>in</str<strong>on</strong>g> particular).” 424 It is<br />

<strong>on</strong>e of the most popular classics topics <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of academic, commercial, <strong>and</strong> enthusiast sites<br />

<strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e. 425 In fact, accord<str<strong>on</strong>g>in</str<strong>on</strong>g>g to Sebastian Heath (Heath 2010), any discussi<strong>on</strong> of numismatics <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e<br />

421 Am<strong>on</strong>g this list were Juxta (http://www.n<str<strong>on</strong>g>in</str<strong>on</strong>g>es.org/tools/juxta.html), developed by the NINES project, which is typically used to compare two<br />

documents but also <strong>on</strong>ly c<strong>on</strong>nects images <strong>and</strong> text <strong>on</strong>ly at the page level, <strong>and</strong> the Versi<strong>on</strong><str<strong>on</strong>g>in</str<strong>on</strong>g>g Mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e (http://v-mach<str<strong>on</strong>g>in</str<strong>on</strong>g>e.org/), a tool with some of the same<br />

basic functi<strong>on</strong>ality as Juxta, but aga<str<strong>on</strong>g>in</str<strong>on</strong>g> <strong>on</strong>e that <strong>on</strong>ly supports the l<str<strong>on</strong>g>in</str<strong>on</strong>g>k<str<strong>on</strong>g>in</str<strong>on</strong>g>g of texts <strong>and</strong> images <strong>on</strong>ly at the page level.<br />

422 METS st<strong>and</strong>s for “Metadata Encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g & Transmissi<strong>on</strong> St<strong>and</strong>ard.” The st<strong>and</strong>ard has been created by the <strong>Library</strong> of C<strong>on</strong>gress “for encod<str<strong>on</strong>g>in</str<strong>on</strong>g>g descriptive,<br />

adm<str<strong>on</strong>g>in</str<strong>on</strong>g>istrative, <strong>and</strong> structural metadata regard<str<strong>on</strong>g>in</str<strong>on</strong>g>g objects with<str<strong>on</strong>g>in</str<strong>on</strong>g> a digital library” (http://www.loc.gov/st<strong>and</strong>ards/mets/). METS is expressed us<str<strong>on</strong>g>in</str<strong>on</strong>g>g XML<br />

<strong>and</strong> has been used by many digital library projects.<br />

423 Other research has also explored the creati<strong>on</strong> of annotati<strong>on</strong> technologies for digital manuscript collecti<strong>on</strong>s <strong>and</strong> the ability to share them; see, for<br />

example, Doumat et al. (2008), who exam<str<strong>on</strong>g>in</str<strong>on</strong>g>ed stor<str<strong>on</strong>g>in</str<strong>on</strong>g>g user annotati<strong>on</strong>s <str<strong>on</strong>g>in</str<strong>on</strong>g> a collaborative workspace so that they could be used <str<strong>on</strong>g>in</str<strong>on</strong>g> a recommender system<br />

for other manuscript users.<br />

424 http://wordnetweb.pr<str<strong>on</strong>g>in</str<strong>on</strong>g>cet<strong>on</strong>.edu/perl/webwns=numismatics<br />

425 For an example of an excellent website created by an enthusiast, see http://www.snible.org/co<str<strong>on</strong>g>in</str<strong>on</strong>g>s/, <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> particular the “Digital Historia Numorum: A<br />

Manual of Greek Numismatics” (http://www.snible.org/co<str<strong>on</strong>g>in</str<strong>on</strong>g>s/hn/), a typed <str<strong>on</strong>g>in</str<strong>on</strong>g> versi<strong>on</strong> of the 1911 editi<strong>on</strong> of the “Historia Numorum” by Barclay Head. Ed

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!