21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

variable word forms receive '_'- linking, with '%' after element words that can be<br />

inflected. In this last group (with variable non-hyphenated elements) one can find<br />

idiomatic expressions and even proverbs. Since these are mostly of semantic interest,<br />

the program - for now - ignores them, checking only for non-varying expressions,<br />

with the important exception of incorporating verb structures and plurals of complex<br />

nouns or adjectives. Also, with respect to proverbs and clause-level idioms, it seems<br />

to be more interesting for a parser to assign syntactic structure in an analytical way<br />

than to provide a summary treatment in a synthetic way. 17<br />

It is the preprocessor which has to identify and '=' - mark polylexical strings.<br />

Technically this is done by adding up running words to a potential polylexical string,<br />

until a maximum (at present: 4) is reached, or punctuation gets in the way, whichever<br />

happens sooner. This is more difficult than it sounds, - a '.."WORD..' -structure, e.g.,<br />

breaks a running string, but is allowed string-initially, whereas '..WORD,..' becomes<br />

part of the string, but breaks it nevertheless, losing its ',' . When a string reaches<br />

maximum, the following happens:<br />

a) polylexical search with negative result:<br />

A group of 4 words is checked (in a left bounded fashion) first for long, then shorter<br />

polylexicals. If none is found, the 4-word window moves one word to the right, and<br />

the search process is repeated.<br />

17 Another matter is, of course, machine translation, where "synthetical treatment" is preferable and necessary for<br />

assigning an idiomatic translation.<br />

- 37 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!