Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
Rome Wasn't Digitized in a Day - Council on Library and Information ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
138<br />
present an extensi<strong>on</strong> <strong>and</strong> comb<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong> of local image descriptors relevant for ancient co<str<strong>on</strong>g>in</str<strong>on</strong>g><br />
recogniti<strong>on</strong> (Kampel <strong>and</strong> Zaharieva 2008)<br />
They identified two major processes that must first be differentiated: co<str<strong>on</strong>g>in</str<strong>on</strong>g> identificati<strong>on</strong>, where a<br />
unique identifier is assigned to a specific co<str<strong>on</strong>g>in</str<strong>on</strong>g>; <strong>and</strong> co<str<strong>on</strong>g>in</str<strong>on</strong>g> classificati<strong>on</strong>, where a co<str<strong>on</strong>g>in</str<strong>on</strong>g> is assigned to a<br />
predef<str<strong>on</strong>g>in</str<strong>on</strong>g>ed type. The authors argued that automatic co<str<strong>on</strong>g>in</str<strong>on</strong>g> identificati<strong>on</strong> was an easier task because of<br />
way <str<strong>on</strong>g>in</str<strong>on</strong>g> which ancient co<str<strong>on</strong>g>in</str<strong>on</strong>g>s were created. For example, the manufactur<str<strong>on</strong>g>in</str<strong>on</strong>g>g process tended to give co<str<strong>on</strong>g>in</str<strong>on</strong>g>s<br />
unique shapes (hammer<str<strong>on</strong>g>in</str<strong>on</strong>g>g procedures, co<str<strong>on</strong>g>in</str<strong>on</strong>g> breakages, etc.) The same features that assist <str<strong>on</strong>g>in</str<strong>on</strong>g> <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual<br />
co<str<strong>on</strong>g>in</str<strong>on</strong>g> identificati<strong>on</strong> complicate automatic classificati<strong>on</strong>, however, for as they noted, “the almost<br />
arbitrary shape of an ancient co<str<strong>on</strong>g>in</str<strong>on</strong>g> narrows the amount of appropriate segmentati<strong>on</strong> algorithms”<br />
(Kampel <strong>and</strong> Zaharieva 2008). Additi<strong>on</strong>ally, algorithms that performed well <strong>on</strong> image collecti<strong>on</strong>s of<br />
modern co<str<strong>on</strong>g>in</str<strong>on</strong>g>s did not fare well <strong>on</strong> medieval <strong>on</strong>es.<br />
Ultimately, Kampel <strong>and</strong> Zaharieva (2008) decided to use “texture sensitive po<str<strong>on</strong>g>in</str<strong>on</strong>g>t detectors” <strong>and</strong><br />
c<strong>on</strong>ducted <str<strong>on</strong>g>in</str<strong>on</strong>g>itial experiments to determ<str<strong>on</strong>g>in</str<strong>on</strong>g>e what local feature descriptors would work best for<br />
identify<str<strong>on</strong>g>in</str<strong>on</strong>g>g a given set of <str<strong>on</strong>g>in</str<strong>on</strong>g>terest po<str<strong>on</strong>g>in</str<strong>on</strong>g>ts <str<strong>on</strong>g>in</str<strong>on</strong>g> ancient co<str<strong>on</strong>g>in</str<strong>on</strong>g>s. After acquir<str<strong>on</strong>g>in</str<strong>on</strong>g>g a set of images of 350 co<str<strong>on</strong>g>in</str<strong>on</strong>g><br />
types from the Fitzwilliam Museum <str<strong>on</strong>g>in</str<strong>on</strong>g> Cambridge, they built a co<str<strong>on</strong>g>in</str<strong>on</strong>g>-recogniti<strong>on</strong> workflow. S<str<strong>on</strong>g>in</str<strong>on</strong>g>ce they<br />
were us<str<strong>on</strong>g>in</str<strong>on</strong>g>g images of <str<strong>on</strong>g>in</str<strong>on</strong>g>dividual co<str<strong>on</strong>g>in</str<strong>on</strong>g>s, they did not need to automatically detect <strong>and</strong> segment co<str<strong>on</strong>g>in</str<strong>on</strong>g>s<br />
found <str<strong>on</strong>g>in</str<strong>on</strong>g> the images <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g>stead focused <strong>on</strong> the feature-extracti<strong>on</strong> process. That process <str<strong>on</strong>g>in</str<strong>on</strong>g>volved two<br />
steps: the use of local feature algorithms to extract local image descriptors that could be used <str<strong>on</strong>g>in</str<strong>on</strong>g><br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>dividual co<str<strong>on</strong>g>in</str<strong>on</strong>g> identificati<strong>on</strong>; <strong>and</strong> the extracti<strong>on</strong> of features that could be used to “to reduce the<br />
number of required feature comparis<strong>on</strong>s” by reduc<str<strong>on</strong>g>in</str<strong>on</strong>g>g the co<str<strong>on</strong>g>in</str<strong>on</strong>g>s that needed to be extracted from the<br />
database. After an <str<strong>on</strong>g>in</str<strong>on</strong>g>itial preselecti<strong>on</strong> step, they performed descriptor match<str<strong>on</strong>g>in</str<strong>on</strong>g>g by “identify<str<strong>on</strong>g>in</str<strong>on</strong>g>g the first<br />
two nearest neighbors <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of Euclidean distances.” The f<str<strong>on</strong>g>in</str<strong>on</strong>g>al step was verificati<strong>on</strong>. An algorithm<br />
called SIFT (Scale Invariant Feature transform) provided the best results <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of discrim<str<strong>on</strong>g>in</str<strong>on</strong>g>ant<br />
feature identificati<strong>on</strong>, but its biggest drawback was computati<strong>on</strong>al time. For future experiments, they<br />
plan to exp<strong>and</strong> their evaluati<strong>on</strong> to a larger set of co<str<strong>on</strong>g>in</str<strong>on</strong>g> images. As more collecti<strong>on</strong>s of co<str<strong>on</strong>g>in</str<strong>on</strong>g>s become<br />
available for <strong>on</strong>l<str<strong>on</strong>g>in</str<strong>on</strong>g>e experimentati<strong>on</strong>, the accuracy <strong>and</strong> viability of such approaches will likely <str<strong>on</strong>g>in</str<strong>on</strong>g>crease.<br />
Palaeography<br />
The discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>e of palaeography has received some brief exam<str<strong>on</strong>g>in</str<strong>on</strong>g>ati<strong>on</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> other secti<strong>on</strong>s, such as <str<strong>on</strong>g>in</str<strong>on</strong>g> the<br />
creati<strong>on</strong> of a palaeographic knowledge base for Cuneiform <strong>and</strong> <str<strong>on</strong>g>in</str<strong>on</strong>g> terms of automatic document<br />
recogniti<strong>on</strong> for Lat<str<strong>on</strong>g>in</str<strong>on</strong>g> manuscripts. 457 A fairly comprehensive def<str<strong>on</strong>g>in</str<strong>on</strong>g>iti<strong>on</strong> has been offered by Moalla et<br />
al. (2006):<br />
The paleography is a complementary discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>e of the philology. … The paleography studies<br />
the layout of old manuscripts <strong>and</strong> their evoluti<strong>on</strong>s whereas the classic philology studies the<br />
c<strong>on</strong>tent of the texts, the languages <strong>and</strong> their evoluti<strong>on</strong>s. The goals of the palaeographic science<br />
are ma<str<strong>on</strong>g>in</str<strong>on</strong>g>ly the study of the correct decod<str<strong>on</strong>g>in</str<strong>on</strong>g>g of the old writ<str<strong>on</strong>g>in</str<strong>on</strong>g>gs <strong>and</strong> the study of the history of<br />
the transmissi<strong>on</strong> of the ancient texts. The palaeography is also the study of the writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g style,<br />
<str<strong>on</strong>g>in</str<strong>on</strong>g>dependently from the author pers<strong>on</strong>al writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g style, which can help to date <strong>and</strong>/or to<br />
transcribe ancient manuscripts (Moalla et al. 2006).<br />
The study of palaeography is thus closely tied to work with many other discipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es, <strong>and</strong> as Ciula (2009)<br />
expla<str<strong>on</strong>g>in</str<strong>on</strong>g>s, “Palaeography cannot proceed without shar<str<strong>on</strong>g>in</str<strong>on</strong>g>g methods, tools <strong>and</strong> outcomes with codiscipl<str<strong>on</strong>g>in</str<strong>on</strong>g>es<br />
such as epigraphy, codicology, philology, textual criticism—to name but a few.” As this<br />
457 Some other computer science research <str<strong>on</strong>g>in</str<strong>on</strong>g> palaeography has focused <strong>on</strong> the development of automatic h<strong>and</strong>writ<str<strong>on</strong>g>in</str<strong>on</strong>g>g recogniti<strong>on</strong> for medieval English<br />
documents (Bulacu <strong>and</strong> Schomaker 2007) <strong>and</strong> for eighteenth- <strong>and</strong> n<str<strong>on</strong>g>in</str<strong>on</strong>g>eteenth-century French manuscripts (Egl<str<strong>on</strong>g>in</str<strong>on</strong>g> et al. 2006).