21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

3.7.3 Local vs. global rules: Constraint typology<br />

In this section I shall as far as possible detach myself the CG grammar critic from<br />

myself the CG rule writer, inspecting and quantifying the types of rule architecture used<br />

in the system, and trying to map and interpret possible system immanent structural<br />

regularities or tendencies. The point of this exercise is:<br />

(a) to provide other CG-grammar writers with some standard for comparison and<br />

CG novices with some guidelines as to how a CG may be expected to develop, what<br />

grammar size and complexity to expect, which pitfalls to avoid etc., and<br />

(b) to facilitate cross-system comparison, like when the author of a probabilistic<br />

HMM tagger/parser wants to decide on the possibility to match or "emulate" a CG rule<br />

set (a problem the relevance of which I have personally been confronted with when<br />

discussing with NLP-researchers outside the CG camp).<br />

What a CG grammar architecture looks like, may, of course, depends not only on<br />

general linguistic and analytic factors, but also on the individual grammarian’s approach<br />

to grammatical problem solving in general, and the technical limitations imposed by the<br />

few presently available CG rule compilers in particular, - and with very few Constraint<br />

Grammars around (and even fewer published), real proof of any structural universality<br />

claim must therefore await future research. Still, even regularities found within one<br />

system (and with one type of compiler), may help other researchers understand why CG<br />

rules look the way they do, and how best to learn from their not so bad performance.<br />

One of the ways to assess a given Constraint Grammar in a typological way is to<br />

quantify rule types with regard to their contextual scope and complexity, as suggested<br />

by Anttila in his discussion of the Helsinki group's English CG (Karlsson et. al., 1995,<br />

p.352). Contextual scope is what ordinarily distinguishes probabilistic grammars<br />

(narrow scope) from generative grammars (wide scope). Within Constraint Grammar,<br />

bounded context conditions, especially of low order (i.e. close to the target), are natural<br />

narrow scope tools, whereas unbounded context conditions are characteristic of a wide<br />

scope approach. Thus, a CG rule set can be typologically located between probabilistic<br />

and generative grammars, mimicking the first for part-of-speech discrimination, and the<br />

second for syntactic parsing.<br />

In table (1), a rule count is given for rules with unbounded context conditions<br />

(henceforth "global" rules) or without ("local" rules 119 ), for all three operations<br />

supported by the cg2-compiler. The columns containing numbers for non-heuristic rules<br />

119 The concept of "local" rules is not to be confused with that of "local disambiguation" - the first term is used to describe<br />

rules without unbounded context conditions (i.e. rules where all contexts conditions are bounded), while the second concerns<br />

word-internal disambiguation (the minimal derivational complexity rule, or "Karlsson's law")<br />

- 167 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!