10.04.2013 Views

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.6 Evaluation of the data set<br />

The data set created by the extraction process consisted of 195 elementary predicate-argument<br />

structures in its raw form. The original EPAS list was not directly applicable for the next parts of<br />

the project. Not all of the extracted structures on the list were suitable for further analysis. Some<br />

of the EPAS were not given an optimal analysis (for my purposes) by the grammar, some were<br />

irrelevant for the later analysis and some were not extracted correctly from the MRS by the Perl<br />

script. The dataset was post-edited to achieve a set of EPAS that did not include erroneously<br />

extracted or undesired structures. With such a small collection of structures as is the case in this<br />

project, the inclusion of only a few incorrect structures would be likely to skew the subsequent<br />

analysis and possibly produce false results.<br />

In the following, I will briefly outline some of the reasons why the EPAS list included incorrect<br />

structures and describe how the list was revised.<br />

3.6.1 Errors from the grammar<br />

Some of the undesired structures in the original EPAS list were directly caused by<br />

characteristics in the NorGram grammar. In the original EPAS list, there were for instance<br />

several structures of the type exemplified by (3-18):<br />

(3- 18)<br />

a. verbal predicate, nominal argument<br />

b. preposition, verbal predicate, nominal argument<br />

These structures should preferably have been combined into one EPAS. The example in (3-19)<br />

below shows a concrete instance from the EPAS list and is analogous to several other instances:<br />

(3- 19)<br />

a. bo, Anne live, Anne<br />

b. i, bo, studentkollektiv in, live, student housing<br />

The structure is extracted from the following sentence from the text material:<br />

52

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!