Unni Cathrine Eiken February 2005
3.6 Evaluation of the data set
In its raw form, the data set created by the extraction process consisted of 195 elementary predicate-argument
structures. The original EPAS list was not directly usable in the next parts of the project, because
not all of the extracted structures were suitable for further analysis. Some of the EPAS were not
given an optimal analysis (for my purposes) by the grammar, some were irrelevant for the later
analysis, and some were not extracted correctly from the MRS by the Perl script. The data set was
therefore post-edited to obtain a set of EPAS free of erroneously extracted or undesired
structures. With a collection of structures as small as the one in this project, including even a
few incorrect structures would be likely to skew the subsequent analysis and possibly produce
false results.
In the following, I will briefly outline some of the reasons why the EPAS list included incorrect<br />
structures and describe how the list was revised.<br />
3.6.1 Errors from the grammar<br />
Some of the undesired structures in the original EPAS list were caused directly by
characteristics of the NorGram grammar. The original EPAS list contained, for instance,
several structures of the type exemplified by (3-18):
(3-18)
a. verbal predicate, nominal argument<br />
b. preposition, verbal predicate, nominal argument<br />
These structures should preferably have been combined into one EPAS. The example in (3-19)<br />
below shows a concrete instance from the EPAS list and is analogous to several other instances:<br />
(3-19)
a. bo, Anne                    live, Anne
b. i, bo, studentkollektiv     in, live, student housing
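To make the intended combination concrete, the following is a minimal sketch in Python of how a pair of structures such as (3-19a) and (3-19b) could be merged. It is an illustration under stated assumptions and not the procedure used in this project: the function name combine_epas, the representation of an EPAS as a tuple of strings, the explicit set of prepositions, and the shape of the combined structure (verbal predicate, nominal argument, preposition, nominal argument) are all hypothetical. The sketch also assumes that the input holds the EPAS of a single sentence, so that a shared verbal predicate identifies structures that belong together.

```python
def combine_epas(structures, prepositions):
    """Merge each prepositional structure (preposition, verb, noun) into the
    structure headed by the same verbal predicate.

    Assumptions (not from the source): an EPAS is a non-empty tuple of strings,
    all structures come from one sentence, and the combined form simply appends
    the preposition and its nominal argument to the verbal structure.
    """
    merged = []          # structures headed by a predicate, as mutable lists
    index_by_verb = {}   # predicate -> position of its first structure in merged
    prepositional = []   # structures of the type (preposition, verb, noun)

    for s in structures:
        if len(s) == 3 and s[0] in prepositions:
            prepositional.append(s)
        else:
            index_by_verb.setdefault(s[0], len(merged))
            merged.append(list(s))

    for prep, verb, noun in prepositional:
        if verb in index_by_verb:
            # Attach the preposition and its argument to the verbal structure.
            merged[index_by_verb[verb]].extend([prep, noun])
        else:
            # No matching verbal structure: keep the structure unchanged.
            merged.append([prep, verb, noun])

    return [tuple(s) for s in merged]


example = [("bo", "Anne"), ("i", "bo", "studentkollektiv")]
combined = combine_epas(example, prepositions={"i"})
print(combined)
```

Applied to the two structures in (3-19), the sketch yields a single four-element structure in which the verbal predicate bo carries both its nominal argument and the prepositional phrase. A prepositional structure with no matching verbal structure is left unchanged, so that nothing is silently discarded.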
The structure is extracted from the following sentence from the text material:<br />