Unni Cathrine Eiken February 2005
Unni Cathrine Eiken February 2005
Unni Cathrine Eiken February 2005
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
. ankomme,etterforsker,?,?<br />
ankomme,etterforsker,?,?<br />
ankomme,etterforsker,åsted,åsted<br />
antyde,politi,?,?<br />
avhøre,?,person,person<br />
avhøre,?,vedkommende,vedkommende<br />
avhøre,politi,vitne,vitne<br />
The output file that is created when TiMBL has classified the input data and run a test with the<br />
test data consists of the input given in the test set with the category predicted by TiMBL added<br />
at the end of each line. Further, the output supplied by TiMBL upon a successful training and<br />
testing round gives information about the actions in the various stages of analysis. TiMBL’s<br />
actions can be divided into three separate phases; in phase 1 the training data is analysed, in<br />
phase 2 the items in the training data are stored for efficient use during testing and in phase 3 the<br />
trained classifier is applied to the test set. For the purposes of the EPAS analysis, the default<br />
algorithm was used in the test phase. This algorithm computes the similarity between a test and<br />
a training item in terms of weighted overlap; the total difference between two patterns is the sum<br />
of relevance weights of those features which are not equal (Daelemans et al. 2003, p. 13).<br />
The classification of the EPAS and the subsequent testing was carried out in two distinct steps;<br />
classification and testing of argument 1 and argument 2 was done separately. The results of the<br />
classification and testing is described in the following sections.<br />
4.1.2.1 Classifying argument 1<br />
Several experiments were run through TiMBL with the aim of classifying occurrences of<br />
argument 1 according to the environment they occur in. The classifier was trained on all EPAS<br />
not containing pronouns and then tested. For the purpose of classifying occurrences of argument<br />
1, an EPAS list with the relevant argument 1 as category label was used. In the following<br />
descriptions of the performed tests, this list will be referred to as EPAS_arg1.<br />
Test 1<br />
Training set: EPAS_arg1 with no pronouns, argument 1 ignored.<br />
Test set: EPAS with pronouns in argument 1 position.<br />
Result: 57,69% (15/26) correct classifications<br />
63