21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

2<br />

The lexicomorphological level:<br />

Structuring words<br />

2.1 A lexical analyser for Portuguese: PALMORF<br />

PALMORF is a so-called morphological or lexical analyser, a computer program that<br />

takes running text as input and yields an analysed file as output where word and<br />

sentence boundaries have been established, and where each word form or "wordlike"<br />

polylexical unit is tagged for word class (PoS), inflexion and<br />

derivation/composition, with morphologically ambiguous words receiving multiple<br />

tag lines. The notational conventions used by PALMORF match the input<br />

conventions for a CG disambiguation grammar. With a CG-term, an ambiguous list<br />

of morphological readings, as in (1), is called a cohort.<br />

(1)<br />

WORD<br />

FORM BASE FORM SECONDARY TAGS PRIMARY TAGS<br />

revista<br />

"revista" N F S<br />

‘magazine’,‘inspection’<br />

"revestir" V PR 1/3S SUBJ VFIN ‘to cover’<br />

"revistar" V IMP 2S VFIN ‘to review’<br />

"revistar" V PR 3S IND VFIN<br />

"rever" V PCP F S ‘to see again’,‘to leak’<br />

In example (1), the word form 'revista' has been assigned one noun-reading (female<br />

singular) and four verb-readings, the latter covering three different base forms,<br />

subjunctive, imperative, indicative present tense and participle readings. By<br />

convention, PoS and morphological features are regarded as primary tags and coded<br />

by capital letters. In addition there can be secondary lexical information about<br />

valency and semantic class, marked by bracketing, like for intransitive verbs<br />

(“rever” - ‘leak through’) , for monotransitive verbs (“rever” - ‘see again’),<br />

for pre-name distribution (“revista VEJA” - ‘VEJA magazine”), for<br />

'readable object' or for +CONTROL and +PERFECTIVE ASCPECT<br />

(“revista” - ‘review’).<br />

(2)<br />

WORD FORM BASE FORM SECONDARY TAGS PRIMARY TAGS<br />

(i) telehipnotizar<br />

"hipnotizar" V INF 0/1/3S<br />

"hipnotizar" V FUT 1/3S SUBJ VFIN<br />

(ii) corruptograma ALT xxxograma<br />

"corrupt" N M S<br />

- 15 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!