10.04.2013 Views

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

Unni Cathrine Eiken February 2005

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

value and highly desirable. As such, it was a logical next step following the removal of<br />

unwanted structures, to make sure that all desirable structures had been collected from the texts.<br />

Several structures were added; many of which had been subjected to only partial extraction in<br />

the extraction process. This may in part be due to the syntactic analysis and in part to the<br />

matching by the Perl script. Further, EPAS were manually extracted from one additional text<br />

that had not been parsed, and therefore not been part of the initial extraction process. In total,<br />

this yielded 74 additional EPAS. The example in (3-24) below provides an example of a<br />

manually edited EPAS. (3-24a) shows the EPAS as it was after the automatic extraction process.<br />

While going through the texts, it became clear that this EPAS had not been extracted in a way<br />

that represented the meaning in the sentence it originated from, and therefore did not have an<br />

optimal structure. The EPAS was therefore manually modified to the form shown in (3-24b).<br />

(3- 24)<br />

a. Original EPAS:<br />

ta, syklist, kontakt<br />

make, biker, contact<br />

b. Manually corrected EPAS:<br />

ta-kontakt-med, syklist, politi<br />

make-contact-with, biker, police<br />

Appendix C contains the EPAS list, while Appendix D shows the alignment between sentences<br />

in the text and the extracted EPAS.<br />

3.6.4 Comments about the EPAS list<br />

The revised EPAS list consists of 223 elementary predicate-argument structures. 24 structures<br />

have been modified as described above, and 74 have been added. The list contains most EPAS<br />

present in the text collection and represents a list of verb-subject-object relations found within a<br />

limited thematic domain. While it is clear that the list could have been expanded by adding<br />

further texts to the collection, it was not possible to extend the list further within the frameworks<br />

of this project. The EPAS list is large and varied enough to show a tendency. Certainly, the<br />

instances of individual EPAS would have been higher and the list would also have been<br />

55

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!