21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

oot search<br />

lexicon search<br />

takes a word trunk as input, searches for it in the lexicon, and - if positive -<br />

checks the root for "outward compatibility" (word class and combination<br />

rules) with any inflexion or suffixation element to its right. If compatible, the<br />

root is stored together with its derivational path for the output module.<br />

If no root is found, or if is not compatible, the program tries to cut a (further)<br />

suffix of the trunk.<br />

suffix analysis -> root search (recursive)<br />

cuts a suffix off an input string, checks this suffix for "outward word class<br />

compatibility" and "inward phonologic compatibility" and - if both are positive<br />

- sends the remaining trunk in standardised or phonologically adapted form to<br />

root search, thus allowing for recursion and increasing "depth" in relation to<br />

the number of successive derivational elements involved.<br />

If no suffix is found, or if it is not compatible, the program progresses to<br />

prefix analysis.<br />

prefix -> inflexion analysis<br />

cuts possible prefixes off the word stem, and - if phonologically compatible -<br />

sends the remaining word trunk to normal analysis, both inflexion and<br />

suffixation. Performed when none or only suffixed readings are found for a<br />

given word.<br />

makelistsuffix<br />

establishes pointer tree for suffix-searching<br />

makelistprefix<br />

establishes pointer tree for prefix-searching<br />

preprocessor<br />

polylexical structures whole word analysis ( findword)<br />

looks up all word sequences of up to 4 elements length in the lexicon; if<br />

found, they are marked by '=' ligation between words. In the case of<br />

polylexicals only listed as incorporables - i.e. without another,<br />

autonomous reading - ligation is only performed if a form of the<br />

incorporating verb in question is found in the left hand context.<br />

capitalisation<br />

word initial capital letters are substituted by '*' + lower case letter<br />

numbers<br />

are marked '$'. If a string starts with a number, all of it becomes a<br />

numerical $-expression.<br />

punctuation characters<br />

are isolated as single characters and prefixed with '$'<br />

abbreviations<br />

- 470 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!