
Unni Cathrine Eiken February 2005



Preface

The project presented in this paper is a Cand. Philol. thesis in Computational Linguistics and Language Technology, submitted at the University of Bergen in February 2005. The thesis was written in loose cooperation with the research project KunDoc (KunDoc 2004). KunDoc (Kunnskapsbasert dokumentanalyse / Knowledge-based document analysis), which was started in October 2003 and is funded by the Norwegian Research Council (NFR), has served as an inspiration for formulating the approach taken in the thesis. The research within KunDoc is carried out in cooperation between the firm CognIT AS (CognIT 2004) and the University of Bergen. KunDoc aims to develop a method for the automatic recognition of discourse structures in written Norwegian texts. The project examines whether automated identification of coreference in a text can be used to create an unambiguous discourse structure of the text, identifying both its thematic and contextual structure. A further goal is to examine whether these techniques are useful within a closed thematic domain for creating unambiguous automated summaries. Within KunDoc, it is of interest to generate ontologies that represent real-world knowledge.

While working on my thesis I have also cooperated with the research project NorGram (NorGram 2004) at the University of Bergen. This project develops a computational grammar for Norwegian Bokmål and is part of the ParGram project at the Palo Alto Research Center. The pre-processing of the text collection used in my project has been carried out using NorGram’s grammar on the XLE platform.

