21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Though Finite State Machines (FSM) are fast, finite and efficient, they have a number<br />

of serious shortcomings, due to the low power of the grammar types they represent:<br />

* An FSM’s memory is very short - once a transition is made, the network only<br />

looks at paths departing from that node, and its choice will not be conditioned by how<br />

the algorithm got there. The NP-section of the FSM in the above example can thus not<br />

be used for an NP-subject (by adding a direct path from S to NP, and a ‘v’-path from NP<br />

to VP), because the FSM would confuse subject-NP and object-NP, trying, for example,<br />

a verb-path also after having used the NP-section for object. Therefore two separate NPsections<br />

have to be incorporated into the FSM, for subject and object, linked to S and<br />

VP, respectively. For a similar reason, the co-ordinating conjunction path in the<br />

example is problematic, since it doesn’t distinguish between adding am NP as coordinated<br />

object or as subject for a co-ordinated sentence. To make the distinction,<br />

different “conjunct networks” would have to be inserted into the network right after S<br />

and, and before the NP node, containing conjuncted copies of the relevant network<br />

sections. Thus, an FSM’s complexity can grow enormously for long sentences with<br />

heavy subordination and co-ordination.<br />

* Regular grammars cannot express inflexional agreement as such, - they’d have to<br />

run the whole network or large sections in many parallel versions, one for every<br />

instantiation of every category. This is why unification grammars have to be level 2<br />

grammars (context free grammars), where no restrictions apply to the right side of a<br />

production. Number- and gender-arguments, for example, can be thought of as<br />

“affixes” 95 , attached as additional affix-symbols to the “normal” symbols, both allowing<br />

for either terminal or non-terminal symbols. Number-agreement can then be added to an<br />

ordinary PSG rule by inserting an affix-variable for number in the rewriting chain of<br />

symbols:<br />

regular grammar: S -> pron VP<br />

context free grammar: S -> pron number VP number<br />

Since the ‘number’-variable has to be instantiated with the same value in both places,<br />

the production cannot be produced by simply working step-by-step from left to right.<br />

By comparison, the difference between context free and context sensitive grammars is<br />

more subtle - at least when applied to natural languages. Context sensitive rules can<br />

usually be rewritten as one or more context free rules. A routinely quoted counter<br />

95 For Portuguese, I have worked with the AGFL formalism (Affix Grammars over a Finite Lattice), as described in (Koster,<br />

1991).<br />

- 143 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!