21.04.2013 Views

Eckhard Bick - VISL

Eckhard Bick - VISL

Eckhard Bick - VISL

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

1<br />

Introduction<br />

1.1 The ‘what’s, ‘why’s and ‘who’s<br />

This dissertation is about whether and to what degree a computer program can be<br />

made to handle the grammatical analysis of natural language, in the form of ordinary,<br />

“running” text or linearly transcribed speech. The target language chosen is<br />

Portuguese, and the basic method applied in the parser to be described here is<br />

Constraint Grammar (first introduced in Karlsson, 1990), used in a context of<br />

Progressive Level Parsing 1 . Along the way, I will be concerned with the interaction<br />

between grammar system, parsing technique and corpus data, evaluating the trinity’s<br />

mutual influence, and the performance of the system as a whole. In other words, in<br />

computer linguistics, what can computers offer a linguist, and can linguistics inspire<br />

computing?<br />

Yet before trying to answer these questions with a 400-page bore of<br />

technicalities and a load of secondary questions, it would seem relevant to balance<br />

the introduction by asking quite another type of question: Why would any of this<br />

inspire a person? Why would anybody want to court a computer for half a decade or<br />

more? Well, personally - and may the esteemed reader please feel free to skip the<br />

next half page or so -, I find that the most intriguing fact about computers is not their<br />

data-crunching efficiency, nor their much-appraised multimedia capability, but the<br />

plain fact that they react to stimuli in much the same half-predictable-halfunpredictable<br />

way biological entities do. Computers communicate, and many a nerd<br />

has found or created a social surrogate in his computer.<br />

When I had my first naive date with a computer in 1973, the glorious glittering<br />

consumer items of today weren’t called PC’s – or even Mac’s – but went by the<br />

humble name of Wang. They had no hard disc, no floppies or CD-ROM’s, and 4 kB<br />

of RAM rather than 40 MB. Yet, in a subtle way, human-computer relations were<br />

superior to the uses most computers are put to today. Nowadays, most people treat<br />

computers as tools: Gaming devices, mail boxes, type-writers, - all of which, in<br />

different shapes, did exist before the advent of the computer. Then, children could<br />

not shoot their way trough a boring day by handling fire-buttons, joy-sticks and<br />

mouse-ears. They had to program their computer if they wanted it to play a game.<br />

And the computer would respond, as a student surpassing her teacher, by route, at<br />

1 Progressive Level Parsing is mirrored by the order of chapters in this book, which progresses from morphological<br />

analysis and the lexicon to morphological disambiguation, syntax, semantics and applicational considerations. This is<br />

why a discussion of the Constraint Grammar disambiguation formalism as such is “postponed” until chapters 3.5 and<br />

3.6. Though I have tried to avoid literal CG rule quotes in the first chapters, there may be a few passages (notably 2.2.4<br />

and 3.2-3) where readers not familiar with the basic notational conventions of CG might want to use later chapters for<br />

reference.<br />

- 8 -

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!