21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

ambiguity, Constraint Grammar can also be used for syntactic parsing, as efficiently<br />

shown, for instance, in the Bank-of-English-project (200 million words, Järvinen,<br />

1994).<br />

(3) Input to the syntactic CG-rules (after mapping)<br />

""<br />

"nunca" ADV @ADVL<br />

""<br />

"como" V PR 1S VFIN @FMV<br />

""<br />

"peixe" N M S @SUBJ @ACC @SC @OC<br />

[@ADVL=adverbial, @FMV=finite main verb, @SUBJ=subject, @ACC=direct object, @SC=subject complement,<br />

@OC=object complement]<br />

In (3), adding all possible syntactic tags (@) has resulted in fourfold syntactic<br />

ambiguity for peixe. The direct object reading (@ACC) can be selected in a positive<br />

way by means of a 'SELECT'- rule exploiting the transitivity of the verb, but it could<br />

just as well be identified indirectly, - by being the only surviving reading, after CGrules<br />

have discarded all others:<br />

(4) REMOVE (@SUBJ) IF (0 N) (NOT *-1 V3) (NOT *1 V3)<br />

[discard the subject reading, if the target is a noun (N) and there is no verb in the 3.person]<br />

REMOVE (@SC) IF (NOT *-1 ) (NOT *1 )<br />

[discard the subject complement reading (@SC) if there is no copula verb () in the sentence]<br />

REMOVE (@OC) IF (NOT *-1 @ACC) (NOT *1 @ACC)<br />

[discard the object complement reading (@OC) if there is no direct object reading (@ACC) in the sentence] 104<br />

It is this indirect disambiguation, that is most characteristic of Constraint Grammar, and<br />

it is the prime reason for the robustness of this method: even rare or incomplete<br />

constructions will receive at least one reading - the one that survives the most<br />

constraints. The incremental use of the rules, with safe contexts and safe rules before<br />

ambiguous contexts and heuristic rules, furthermore ensures that the parser will prefer a<br />

reading that is "almost correct" to one that is "quite wrong".<br />

CG-grammars have first of all been described for English (e.g. Karlsson et.al.,<br />

1991), but there are - on the morphological level, at least - projects involving several<br />

103 In the mapping modul, constraint grammar rules are used, too, and the list of possible syntactic functions for a given word<br />

form can thus be made context dependent (and, of course, shorter).<br />

104 Note that all 3 rules make use of "unbound" contexts conditions:<br />

*-1 = the context condition is to be true anywhere to the left (1 or more positions to the left)<br />

*1 = the context condition is to be true anywhere to the right (1 or more positions to the right)<br />

Of course,"bound" context conditions can be used, e.g. -2 = second word to the left, 3 = third word to the right. Bound<br />

context conditions can in principle be translated into n-gram rules (as used in probabilistic HMM parsers), while "unbound"<br />

(*-context) conditions are characteristic of Constraint Grammar and not easily translatable into probabilistic systems (cp.<br />

also chapter 3.7.3).<br />

- 148 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!