Unni Cathrine Eiken February 2005
Unni Cathrine Eiken February 2005
Unni Cathrine Eiken February 2005
Create successful ePaper yourself
Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.
consist of, or to make use of a component that predicts the part of speech of an unknown word.<br />
One solution that would have been suitable for the purposes of the present work would be to<br />
assume that all unknown words were nouns. However, due to the small size of the corpus used<br />
for this project, none of these strategies were implemented, and unknown words were added to<br />
the lexicon by hand.<br />
When parsing texts with NorGram and XLE, the user has several choices with regards to the<br />
format of the final syntactic analysis. For example, it is possible to receive partial parses, or to<br />
let the system return all the potential analyses of the input sentence. For the purposes of this<br />
project, I received full parses of each sentence in the text material and chose to manually check<br />
each instance where the system returned multiple valid parses and actively decided on the<br />
correct one that I wished to extract the EPAS from.<br />
3.3.2 Extracting EPAS from NorGram<br />
The output provided by XLE upon a successful parse using the NorGram grammar is<br />
particularly useful for a subsequent extraction of EPAS. NorGram is based on the LFG grammar<br />
formalism and produces constituent structures (c-structures), functional structures (f-structures)<br />
and minimal recursion semantics structures (MRS-structures) upon parsing a sentence. Each of<br />
these outputs can be useful for a subsequent extraction of predicates and their arguments.<br />
The c-structure in LFG is an external structure which displays an ordered representation of the<br />
words in a sentence or phrase (Bresnan 2001, p. 44). In XLE, the c-structure is represented by a<br />
phrase structure tree, where the terminal nodes are fully inflected word forms. F-structures<br />
represent the internal structure of a sentence. On this level, the “syntactic functions are<br />
associated with semantic predicate argument relations” (Bresnan 2001, p. 45). C-structures and<br />
f-structures are different structures, but display parallel information. Figure 3 below shows the<br />
graphical representation of the c- and f-structures for the sentence Politiet leter etter morderen<br />
(The police is looking for the murderer) generated by NorGram.<br />
44