
Unni Cathrine Eiken February 2005



c. PERP:
gjerningsmann, drapsmann
perpetrator, killer

d. PERSON:
person, bilfører, syklist, vedkommende
person, car driver, biker, generic-nom

e. OBSERV:
teori, observasjon
theory, observation

f. PLACE:
studentkollektiv, Førde
student housing, Førde

The classes of words shown in (4-11) form groups of concepts that occur in the same contextual environments within the thematic domain from which the EPAS are extracted. The groupings seem to reflect real semantic clusters, in the sense that one can easily find a label to describe each group. For the text collection in the present work, these six concept groups represent six distinct semantic groupings whose members share many features with respect to pattern distribution in the data set. With a larger data set to run the concept association on, more concept groups, and more members within each group, would likely have emerged. The results of the concept association on the small data set in this project do, however, suggest that the method is feasible, and show that frequent patterns in smaller text collections also help capture interesting concept groupings.

4.3 Step III: Using concept groups in TiMBL

The concept groups that emerged from the association performed in section 4.2 above represent clusters of words that occur in similar constellations in the data material. The emergence of concept groups whose members intuitively seem to bear some semantic resemblance to each other confirms that the context a word fits into does indeed say something about what the word means, as per the distributional hypothesis.
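The idea that words sharing contexts end up in the same group can be illustrated with a minimal Python sketch. This is not the association procedure used in this work: the context pairs, the Jaccard overlap measure, the 0.5 threshold and the names `contexts`, `jaccard` and `group_by_context` are all assumptions made for illustration, and the toy contexts are invented rather than taken from the data set.

```python
from itertools import combinations

# Hypothetical toy data: each word maps to the set of contexts it was seen in.
# A context is a (predicate, argument slot) pair. These pairs are invented
# for illustration and are not taken from the thesis data.
contexts = {
    "gjerningsmann": {("drepe", "subj"), ("arrestere", "obj"), ("etterlyse", "obj")},
    "drapsmann": {("drepe", "subj"), ("arrestere", "obj")},
    "teori": {("bekrefte", "obj"), ("avvise", "obj")},
    "observasjon": {("bekrefte", "obj"), ("avvise", "obj"), ("melde", "obj")},
}


def jaccard(a, b):
    """Share of contexts two words have in common (0.0 for two empty sets)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def group_by_context(contexts, threshold=0.5):
    """Single-link grouping: words whose context overlap reaches the
    threshold are merged into the same group (assumed threshold)."""
    parent = {word: word for word in contexts}

    def find(word):
        # Follow parent links to the group representative, compressing the path.
        while parent[word] != word:
            parent[word] = parent[parent[word]]
            word = parent[word]
        return word

    for a, b in combinations(contexts, 2):
        if jaccard(contexts[a], contexts[b]) >= threshold:
            parent[find(a)] = find(b)

    groups = {}
    for word in contexts:
        groups.setdefault(find(word), set()).add(word)
    # Sort members and groups so the result is deterministic.
    return sorted(sorted(group) for group in groups.values())


groups = group_by_context(contexts)
print(groups)
```

With these invented contexts, gjerningsmann and drapsmann fall into one group and teori and observasjon into another, mirroring the PERP and OBSERV classes in (4-11). A real data set would need a more careful choice of similarity measure and threshold.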
