21.11.2013 Views

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

YEARS OF EUROPEAN ONLINE ANNÉES DE EN LIGNE ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

example by using cos (α) instead of the angle α. An inverted value is often used<br />

to express a low distance; in that case a value of 1 signals the complete identicalness<br />

of the vectors (Dörre, Gerstl and Seiffert, 2004, pp. 491–2).<br />

Indexing enforced by computer linguistic technologies<br />

Simple lists of word forms turned out to be rather ineffective because<br />

terms are only recognised if they correspond exactly to the registered occurrence.<br />

Computer linguistics have developed algorithms which allow text forms<br />

to be reduced to basic forms, normally the nominative singular for nouns and<br />

the ininitive for verbs. this process is generally known as lemmatisation and<br />

groups together all word forms found in a text document under the corresponding<br />

forms which introduce articles in a dictionary.<br />

Although lemmatisation is already an important step in various kinds of<br />

analyses, it is not always satisfying when trying to extract information.<br />

words — or even lemmata — are supposed to represent the same concept if<br />

their roots are the same. the following table gives an example, and distinguishes<br />

between formal roots for which the ending morpheme is suppressed,<br />

text forms, lexical roots or lemmata, and the root of the term.<br />

Text form Lexical root (lemma) Formal root Root<br />

absorb<br />

absorbed<br />

absorbing<br />

absorb<br />

absorbs<br />

absorber<br />

absorbers<br />

absorber<br />

absorbable<br />

absorbably<br />

absorbable<br />

absorbance<br />

absorbances<br />

absorbance<br />

absorbancy<br />

absorbancies<br />

absorbancy<br />

absorbent<br />

absorbents absorbent<br />

absorbently<br />

absorption<br />

absorptions<br />

absorption<br />

absorptive<br />

absorptively<br />

absorptive<br />

Source: ferber, 2003, p. 43.<br />

absorb<br />

absorbab<br />

absorbanc<br />

absorbent<br />

absorption<br />

absorptiv<br />

absorb<br />

01_2007_5222_txt_ML.indd 160 6-12-2007 15:14:06

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!