Unni Cathrine Eiken February 2005


Intuitively, this is quite a good association of concepts, since all the entities in the grouping belong to the domain of law enforcement. If a person were to group nominals from the text collection into semantically similar concept classes, the grouping in Figure 6 would not be an unlikely result. The grouping shown in Figure 6, however, is the result of an association based on context information from the text itself.

4.2.1 Classify

Manually performing the association method described above on all the EPAS in the data set proved close to impossible, mainly because it required consulting the data set multiple times, each time looking for different values while keeping track of the partial goals in the process. Based on the method described above, the Perl script Classify was written.⁵ In the following, the algorithm implemented in Classify is outlined in brief.

For each predicate:

1. Level 0:
   What are ARG1 and ARG2 in the corpus/EPAS list?

2. Level 1:
   For each ARG1 = x that was found in 1:
      In connection with which other predicates is ARG1 also = x?
      For each of these predicates:
         Which other words occur as ARG1?
   Produces a list of words which occur in the same contexts as x.

3. Level 2:
   For each word = y in the list from level 1:
      Which other predicates does this word also co-occur with?
      For each of these predicates:
         Which other words occur as ARG1?
   Produces a list of words which occur in the same contexts as y.

The same procedure is repeated for ARG2.
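The three levels can be illustrated with a minimal Python sketch. This is not the original Perl script, whose source is not reproduced here. It assumes that each EPAS can be represented as a (predicate, ARG1, ARG2) triple; the function names `build_indexes`, `shared_context_words` and `classify_predicate`, as well as the toy data, are hypothetical and chosen only for illustration.

```python
from collections import defaultdict

def build_indexes(epas, slot):
    """Index the triples by predicate and by the word in the chosen slot.

    slot = 1 selects ARG1, slot = 2 selects ARG2 (assumed triple layout).
    """
    words_by_pred = defaultdict(set)
    preds_by_word = defaultdict(set)
    for triple in epas:
        pred, word = triple[0], triple[slot]
        if word is None:  # the argument slot may be empty
            continue
        words_by_pred[pred].add(word)
        preds_by_word[word].add(pred)
    return words_by_pred, preds_by_word

def shared_context_words(word, words_by_pred, preds_by_word):
    """Words that fill the same slot with any predicate that `word` occurs with."""
    related = set()
    for pred in preds_by_word.get(word, set()):
        related |= words_by_pred[pred]
    related.discard(word)  # only the *other* words are of interest
    return related

def classify_predicate(epas, predicate, slot=1):
    """Return the level 0, level 1 and level 2 results for one predicate."""
    words_by_pred, preds_by_word = build_indexes(epas, slot)
    # Level 0: the words found in the slot for this predicate.
    level0 = set(words_by_pred.get(predicate, set()))
    # Level 1: for each such word x, the words sharing a context with x.
    level1 = {x: shared_context_words(x, words_by_pred, preds_by_word)
              for x in level0}
    # Level 2: the same step for each word y in the level 1 lists.
    level1_words = set().union(*level1.values())
    level2 = {y: shared_context_words(y, words_by_pred, preds_by_word)
              for y in level1_words}
    return level0, level1, level2

# Toy data, invented for illustration only (not taken from the data set).
toy_epas = [
    ("arrest", "police", "suspect"),
    ("arrest", "officer", "man"),
    ("question", "police", "witness"),
    ("question", "detective", "suspect"),
    ("investigate", "detective", "case"),
]

level0, level1, level2 = classify_predicate(toy_epas, "arrest", slot=1)
```

Calling `classify_predicate` with `slot=2` repeats the procedure for ARG2. Building the two indexes once avoids the repeated passes over the data set that made the manual procedure impractical.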

⁵ The algorithm was implemented in Perl by Martin Rasmussen Lie, informatics student at the University of Bergen.
