10.04.2013 Views

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

. ankomme,etterforsker,?,?<br />

ankomme,etterforsker,?,?<br />

ankomme,etterforsker,åsted,åsted<br />

antyde,politi,?,?<br />

avhøre,?,person,person<br />

avhøre,?,vedkommende,vedkommende<br />

avhøre,politi,vitne,vitne<br />

The output file that is created when TiMBL has classified the input data and run a test with the<br />

test data consists of the input given in the test set with the category predicted by TiMBL added<br />

at the end of each line. Further, the output supplied by TiMBL upon a successful training and<br />

testing round gives information about the actions in the various stages of analysis. TiMBL’s<br />

actions can be divided into three separate phases; in phase 1 the training data is analysed, in<br />

phase 2 the items in the training data are stored for efficient use during testing and in phase 3 the<br />

trained classifier is applied to the test set. For the purposes of the EPAS analysis, the default<br />

algorithm was used in the test phase. This algorithm computes the similarity between a test and<br />

a training item in terms of weighted overlap; the total difference between two patterns is the sum<br />

of relevance weights of those features which are not equal (Daelemans et al. 2003, p. 13).<br />

The classification of the EPAS and the subsequent testing was carried out in two distinct steps;<br />

classification and testing of argument 1 and argument 2 was done separately. The results of the<br />

classification and testing is described in the following sections.<br />

4.1.2.1 Classifying argument 1<br />

Several experiments were run through TiMBL with the aim of classifying occurrences of<br />

argument 1 according to the environment they occur in. The classifier was trained on all EPAS<br />

not containing pronouns and then tested. For the purpose of classifying occurrences of argument<br />

1, an EPAS list with the relevant argument 1 as category label was used. In the following<br />

descriptions of the performed tests, this list will be referred to as EPAS_arg1.<br />

Test 1<br />

Training set: EPAS_arg1 with no pronouns, argument 1 ignored.<br />

Test set: EPAS with pronouns in argument 1 position.<br />

Result: 57,69% (15/26) correct classifications<br />

63

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!