Unni Cathrine Eiken February 2005


consist of, or to make use of a component that predicts the part of speech of an unknown word. One solution that would have been suitable for the purposes of the present work would be to assume that all unknown words were nouns. However, due to the small size of the corpus used for this project, none of these strategies were implemented, and unknown words were added to the lexicon by hand.
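The noun-default strategy mentioned above was not implemented in this project, but it can be illustrated with a minimal sketch. The lexicon entries, the tag names and both function names below are hypothetical and serve only as an illustration; they are not taken from NorGram or XLE.

```python
# Hypothetical toy lexicon; entries and tag names are illustrative only
# and do not reflect the NorGram lexicon.
LEXICON = {
    "politiet": "N",
    "leter": "V",
    "etter": "P",
}


def part_of_speech(word, lexicon=LEXICON, default_tag="N"):
    """Return the lexicon tag for a word, or the default tag if it is unknown."""
    return lexicon.get(word.lower(), default_tag)


def unknown_words(tokens, lexicon=LEXICON):
    """List the tokens that are missing from the lexicon (e.g. for manual addition)."""
    return [token for token in tokens if token.lower() not in lexicon]


tokens = ["Politiet", "leter", "etter", "morderen"]
print([part_of_speech(token) for token in tokens])
print(unknown_words(tokens))
```

The second function corresponds more closely to what was actually done here: unknown words are only listed, so that they can be added to the lexicon by hand.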

When parsing texts with NorGram and XLE, the user has several choices with regard to the format of the final syntactic analysis. For example, it is possible to receive partial parses, or to let the system return all the potential analyses of the input sentence. For the purposes of this project, I obtained full parses of each sentence in the text material. Wherever the system returned multiple valid parses, I checked them manually and selected the correct one from which to extract the EPAS.
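The manual disambiguation step can be supported by first separating sentences with a single parse from those with several. The sketch below assumes a hypothetical input format, a dictionary mapping each sentence to a list of its parses; it does not use or represent any XLE interface, and the parse labels are placeholders.

```python
def split_by_ambiguity(parses_by_sentence):
    """Separate sentences with exactly one parse from those needing manual review."""
    unambiguous = {}
    needs_review = {}
    for sentence, parses in parses_by_sentence.items():
        if len(parses) == 1:
            unambiguous[sentence] = parses[0]
        elif len(parses) > 1:
            needs_review[sentence] = parses
        # Sentences without any parse are left out of both groups.
    return unambiguous, needs_review


# Placeholder parse labels; real parses would be structured objects.
example = {
    "sentence A": ["parse 1"],
    "sentence B": ["parse 1", "parse 2"],
}
unambiguous, needs_review = split_by_ambiguity(example)
print(sorted(needs_review))
```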

3.3.2 Extracting EPAS from NorGram<br />

The output provided by XLE upon a successful parse using the NorGram grammar is particularly useful for a subsequent extraction of EPAS. NorGram is based on the LFG formalism and, upon parsing a sentence, produces constituent structures (c-structures), functional structures (f-structures) and minimal recursion semantics structures (MRS-structures). Each of these outputs can be useful for a subsequent extraction of predicates and their arguments.

The c-structure in LFG is an external structure which displays an ordered representation of the words in a sentence or phrase (Bresnan 2001, p. 44). In XLE, the c-structure is represented by a phrase structure tree, where the terminal nodes are fully inflected word forms. F-structures represent the internal structure of a sentence. On this level, the “syntactic functions are associated with semantic predicate argument relations” (Bresnan 2001, p. 45). C-structures and f-structures are different structures, but display parallel information. Figure 3 below shows the graphical representation of the c- and f-structures for the sentence Politiet leter etter morderen (The police are looking for the murderer) generated by NorGram.
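To illustrate why f-structures lend themselves to the extraction of predicates and their arguments, an f-structure can be thought of as a nested attribute-value structure. The sketch below uses a hypothetical, heavily simplified f-structure for the example sentence. It is not the actual NorGram output, and the attribute names, the predicate forms and the set of grammatical functions treated as arguments are all assumptions made for illustration.

```python
# Hypothetical, simplified f-structure for "Politiet leter etter morderen".
# This is NOT actual NorGram output; attributes and values are assumed.
F_STRUCTURE = {
    "PRED": "lete",
    "TENSE": "pres",
    "SUBJ": {"PRED": "politi", "DEF": "+"},
    "OBL": {"PRED": "morder", "DEF": "+", "PFORM": "etter"},
}

# Grammatical functions treated as arguments in this sketch (an assumption).
ARGUMENT_FUNCTIONS = ("SUBJ", "OBJ", "OBJ2", "OBL", "COMP", "XCOMP")


def extract_predicate_arguments(f_structure):
    """Return the main predicate and a list of (function, argument predicate) pairs."""
    predicate = f_structure["PRED"]
    arguments = []
    for function in ARGUMENT_FUNCTIONS:
        value = f_structure.get(function)
        # Only embedded f-structures with their own PRED count as arguments.
        if isinstance(value, dict) and "PRED" in value:
            arguments.append((function, value["PRED"]))
    return predicate, arguments


print(extract_predicate_arguments(F_STRUCTURE))
```

Because the grammatical functions are explicit attributes, the arguments can be read off without reference to word order, which is the property that makes this level of representation convenient for extraction.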
