- Page 1 and 2:
Eckhard Bick ♠ THE PARSING SYSTEM
- Page 3 and 4:
Abstract The dissertation describes
- Page 5 and 6:
3.7.2. Making the most of the lexic
- Page 7 and 8:
Inflexion tags 451 Syntactic tags 4
- Page 9 and 10:
first, - but soon, it would move th
- Page 11 and 12:
disambiguated), same level tags (to
- Page 13 and 14:
chain, but rather the linguistic co
- Page 15 and 16:
2 The lexicomorphological level: St
- Page 17 and 18:
The core of PALMORF is written in C
- Page 19 and 20:
(d) as part of words. For instance,
- Page 21 and 22:
(1) binary search technique: a . .
- Page 23 and 24:
acapitã#=#####B(orn)###413 acara#a
- Page 25 and 26:
Words with graphical accents often
- Page 27 and 28:
2.2.3.2 The inflexional endings lex
- Page 29 and 30:
2.2.3.3 The suffix lexicon (1) 1
- Page 31 and 32:
Suffix combination rules is also us
- Page 33 and 34:
an V a DERP a- [ANT] brad Vi as HV
- Page 35 and 36:
"inimigo" ADJ M P "inimigo" N M P
- Page 37 and 38:
variable word forms receive '_'- li
- Page 39 and 40:
The remaining 2 words of the 4-word
- Page 41 and 42:
appear truncated or not, depending
- Page 43 and 44:
comparison to the substring consist
- Page 45 and 46:
My present linguistic solution 28 i
- Page 47 and 48:
In some non-personal proper nouns,
- Page 49 and 50:
(i.e., not too complex) derivationa
- Page 51 and 52:
2.2.4.5 Abbreviations and sentence
- Page 53 and 54:
Another case, where meaning bearing
- Page 55 and 56:
2.2.4.6 The human factor: variation
- Page 57 and 58:
circumflex-) accented words without
- Page 59 and 60:
the tagger tries to identify a word
- Page 61 and 62:
xxxar-#1##AaiD######54578 endings-s
- Page 63 and 64:
"sombrancelha" N F P '=sobrancelha
- Page 65 and 66:
(6) Word class distribution and par
- Page 67 and 68:
VFIN 24.96 16 7.77 9 2.46 - - 25 3.
- Page 69 and 70:
the present participle by derivatio
- Page 71 and 72:
Inflexion tags combine with word cl
- Page 73 and 74:
2.2.5.2 The individual word classes
- Page 75 and 76:
SPEC "specifiers": independent pron
- Page 77 and 78:
WORD FORM and LEXEME CATEGORIES gen
- Page 79 and 80:
WORD FORM and LEXEME CATEGORIES dea
- Page 81 and 82:
mood VFIN finite IND indicative SUB
- Page 83 and 84:
prefixal derivation in verbs comple
- Page 85 and 86:
graph. def. characteristics: hyphen
- Page 87 and 88:
PP prepositional group de=aluguel
- Page 89 and 90:
2.2.5.3 Portuguese particles By ‘
- Page 91 and 92:
Deictic adverbs refer to discourse
- Page 93 and 94:
algo, algum=tanto, nada, nadinha?,
- Page 95 and 96:
quando når, da hvornår quanto [QU
- Page 97 and 98:
(1) Language distribution and error
- Page 99 and 100:
3 Morphosyntactic disambiguation: T
- Page 101 and 102:
(4) "quando" conjunction or adverb
- Page 103 and 104:
both for the simple ambiguities intr
- Page 105 and 106:
case of a 3.person possessor, the u
- Page 107 and 108:
The fact that these semantic distin
- Page 109:
In (Karlsson et al., 1995:19ff) an
- Page 113 and 114:
An important aspect of the local-gl
- Page 115 and 116:
3.2 Morphological ambiguity in Port
- Page 117 and 118:
100 90 80 70 60 50 40 30 20 10 0 0
- Page 119 and 120:
VFIN 9185 3748 19 375 1079 540 2827
- Page 121 and 122:
VFIN nominal group 3a) '-o' 1S M S
- Page 123 and 124:
N 107 - - - - - - - 107 99.7 ADJ 2
- Page 125 and 126:
Consider the following "classical l
- Page 127 and 128:
Given that a sentence like (5) is m
- Page 129 and 130:
3.4 Word internal (local) disambigu
- Page 131 and 132:
50 45 40 35 30 25 20 15 10 5 0 0 1
- Page 133 and 134:
3.5 Tools for disambiguation In cor
- Page 135 and 136:
account, and transition probabiliti
- Page 137 and 138:
Even trigrams, however, are far fro
- Page 139 and 140:
no lexicon. By combining supervised
- Page 141 and 142:
Here, the variable ‘number’ can
- Page 143 and 144:
Though Finite State Machines (FSM)
- Page 145 and 146:
co-ordination, are syntactically ir
- Page 147 and 148:
compiled into a computer program th
- Page 149 and 150:
other languages from both the Germa
- Page 151 and 152:
3.6 The rule formalism In principle
- Page 153 and 154:
(1) a tag, word form or base form,
- Page 155 and 156:
4. A blocking context, where the wo
- Page 157 and 158:
3.7 Contextual information in const
- Page 159 and 160:
REMOVE (@ACC>) IF (*1 @MV BARRIER C
- Page 161 and 162:
, , ... N PP contraste com, respei
- Page 163 and 164:
adjective with obligatorily animal
- Page 165 and 166:
PROP (proper nouns) top hum (places
- Page 167 and 168:
3.7.3 Local vs. global rules: Const
- Page 169 and 170:
(1) Rule scope morf morf morf morf
- Page 171 and 172:
number of rules Rn, but the number
- Page 173 and 174:
(4) Rule complexity number of conte
- Page 175 and 176:
all +2 158 69 49 316 C-percent 79.7
- Page 177 and 178:
particular, mapping rules may - at
- Page 179 and 180:
(5b) context position, polarity (±
- Page 181 and 182:
Portuguese rules use more (distant)
- Page 183 and 184:
NP-functions like @ACC or @SUBJ (
- Page 185 and 186:
@FAUX =finite auxiliary, @FMV =fini
- Page 187 and 188:
3.9 Performance: Measuring correctn
- Page 189 and 190:
2412 words 1837 words 4249 words Er
- Page 191 and 192:
3.10 Speech data tagging: Probing t
- Page 193 and 194:
If a dishesion marker is preceded b
- Page 195 and 196:
N SUBJ> N< P< N< P< FMV principal m
- Page 197 and 198:
Where all goes well, the system tol
- Page 199 and 200:
iterations, or cases, where one spe
- Page 201 and 202:
castelos [castelho] N M P @A velho
- Page 203 and 204:
subclause (functioning as postnomin
- Page 205 and 206:
a > família:NP > de:PP > despesas:
- Page 207 and 208:
adjuncts. By combination of (a) and
- Page 209 and 210:
complemented by the @P< mark on our
- Page 211 and 212:
4.1.3 The clause: Arguments and adj
- Page 213 and 214:
complements, direct objects (the la
- Page 215 and 216:
other adjunct adverbial subcategori
- Page 217 and 218:
4.2 Group types and group level fun
- Page 219 and 220:
NP N noun PROP proper noun (PROP ca
- Page 221 and 222:
modifiers can be placed left of a p
- Page 223 and 224:
’algumas de’ would be that of D
- Page 225 and 226:
Because of these problems, I would
- Page 227 and 228:
Ignoring semantic incompatibilities
- Page 229 and 230:
4.2.3 The prepositional group (PP)
- Page 231 and 232:
To make things even more complicate
- Page 233 and 234:
While this is an intuitive way to h
- Page 235 and 236:
One might be tempted to conclude fr
- Page 237 and 238:
continuar + GER/a and so forth. Als
- Page 239 and 240:
removing e.g. querer from the modal
- Page 241 and 242:
(2c) Parou de chover. ‘(it) stopp
- Page 243 and 244:
acostumar alg. a, estimular alg. a,
- Page 245 and 246:
List of concatenating verb currentl
- Page 247 and 248:
convidar a indbyde til at costumar
- Page 249 and 250:
pretender # foregive at pretextar #
- Page 251 and 252:
with absolute relative pronoun or a
- Page 253 and 254:
o manda sozinho (ADJ @
- Page 255 and 256:
(4c) Há sempre um garçon discutin
- Page 257 and 258:
(1c) [Chegou]. ('He/she/it arrived.
- Page 259 and 260:
(3a) O filho é mais alto que o pai
- Page 261 and 262:
(4c) Como [como] ADV @COM @#AS-AD
- Page 263 and 264:
(7b) Ele trabalhava como @PRD escra
- Page 265 and 266:
(10e) é mais fácil que [que] KS
- Page 267 and 268:
Another argument in favour of the @
- Page 269 and 270:
passive" (4d, with ser), the latter
- Page 271 and 272:
seem to be ideal candidates for foc
- Page 273 and 274:
focus topic predicative subject 3.
- Page 275 and 276:
O que é isso ? In my parser, which
- Page 277 and 278:
4.5.2 Comparison structures @#....-
- Page 279 and 280:
In my parser, comparative hooks are
- Page 281 and 282:
(AS) (FS) mais/menos..do=que 'th
- Page 283 and 284:
AP S ADV ADJ COMP S mais bonito do=
- Page 285 and 286:
mais, which by itself does not deno
- Page 287 and 288:
o [o] DET M S @>N ‘the’ proce
- Page 289 and 290:
the function of the subclause is co
- Page 291 and 292:
devesse [dever] V IMPF 1/3S SUBJ V
- Page 293 and 294:
temendo [temer] V GER @IMV @#ICL-
- Page 295 and 296:
As can be seen, plural forms are in
- Page 297 and 298:
(8e) Estamos todos @ estamos mais o
- Page 299 and 300:
tornar a 'return' voltar a 'return'
- Page 301 and 302:
(3a) Onde [onde] ADV @ADV> 'where
- Page 303 and 304:
a [a] PRP @N ‘the’ ramos [ram
- Page 305 and 306:
4.5.4.2 Adjunct adverbials In my sy
- Page 307 and 308:
their clause internal function ("ex
- Page 309 and 310:
(1f) @ Com seus lagoas, praias e du
- Page 311 and 312:
unbound subclass, mirroring the cor
- Page 313 and 314:
de 77 % 23 % - em 33 % 12 % 55 % pa
- Page 315 and 316:
6d4) *(algo, meio, nada, um=tanto )
- Page 317 and 318:
An important distinction with regar
- Page 319 and 320:
nascera [nascer] V MQP 1/3S IND VF
- Page 321 and 322:
3a. interrogative complementiser in
- Page 323 and 324:
The adverbs in this group are the o
- Page 325 and 326:
chegar [chegar] V INF 0/1/3S @IM
- Page 327 and 328:
(1c) emprestou- [emprestar] V PS
- Page 329 and 330:
4.5.5 Violating the uniqueness prin
- Page 331 and 332:
for (2): se "se" PERS M/F 3S/P ACC/
- Page 333 and 334:
Portuguese pronoun equivalent to Fr
- Page 335 and 336:
a [a] DET F S @>N ‘-’ violên
- Page 337 and 338:
• lack of a slot for direct objec
- Page 339 and 340:
A typical case is arrancar ('to pul
- Page 341 and 342:
4.6 The transformational potential
- Page 343 and 344:
information is (a) of the same nota
- Page 345 and 346:
SELECT (NUM) (-1C PRP-DE) (-2 MAIS)
- Page 347 and 348:
• 2. Co-ordinators are regarded a
- Page 349 and 350:
(8a) Analysed text, in flat, word b
- Page 351 and 352:
[word classes: DET=determiner, N=no
- Page 353 and 354:
(2c) homens @NPHR e @CO mulheres @N
- Page 355 and 356:
obligatorily. In the intransitive,
- Page 357 and 358:
modify a lexicon entry. Quantitativ
- Page 359 and 360:
Here, the valency tag allows the i
- Page 361 and 362:
5.3 Disambiguating valency tags Nat
- Page 363 and 364:
6 The semantic perspective: Increme
- Page 365 and 366:
Therefore, there is a case for intr
- Page 367 and 368:
order to choose the right translati
- Page 369 and 370:
Illustration: Disambiguation of sem
- Page 371 and 372:
fact + ABSTRACT ÷ LIFE + LIFE dinn
- Page 373 and 374:
± HUMAN EXPRESSION 220 (i.e. quali
- Page 375 and 376:
Atomic semantic features for differ
- Page 377 and 378:
Ee = entities (±CONCRETE) Cc = ±C
- Page 379 and 380:
In the real rules, FEATURE is a pos
- Page 381 and 382:
REMOVE (@=i) (0 @SUBJ> AND @=I) (*1
- Page 383 and 384:
used, or even mapped, in the Constr
- Page 385 and 386:
for every preposition. Though some
- Page 387 and 388:
depending on the semantic class of
- Page 389 and 390:
Another, rarer, example is the verb
- Page 391 and 392:
por=exemplo [por=exemplo] PP @N '
- Page 393 and 394:
... if the following clause boundar
- Page 395 and 396:
N "inspection" PROP ([he]
- Page 397 and 398:
Moving beyond the translational lev
- Page 399 and 400:
that make one projection direction
- Page 401 and 402:
7 The applicational level: Teaching
- Page 403 and 404:
(1) The Portuguese grammar page The
- Page 405 and 406:
Here, the running word forms in a s
- Page 407 and 408:
"verbal" and "adjectival" function,
- Page 409 and 410:
tree structures). According to the
- Page 411 and 412:
that the Chinese origin in The man
- Page 413 and 414:
(1) Distributed grammar teaching en
- Page 415 and 416:
(2) flow chart of student - server
- Page 417 and 418:
7.2.5. Syntactic tree structures Wh
- Page 419 and 420:
Finally, students can opt for a sim
- Page 421 and 422:
sent back and forth through the CGI
- Page 423 and 424:
(7) Tutoring in the case of a "clos
- Page 425 and 426:
Here, three results of this quite s
- Page 427 and 428:
(2b) Interfering material in prepos
- Page 429 and 430:
(2c) Interfering adverbs in preposi
- Page 431 and 432:
- Page 433 and 434:
I have shown, in chapter 6, how add
- Page 435 and 436:
7.5 The applicational potential of
- Page 437 and 438:
Applicational add-on program module
- Page 439 and 440:
annotation system (probably inherit
- Page 441 and 442:
(2) CG syntactic tag sets across la
- Page 443 and 444:
(3) STA:fcl =SUBJ:np ==>N:art( M S)
- Page 445 and 446:
is due to the fact that, in my CG-n
- Page 447 and 448:
alternative readings cannot be remo
- Page 449 and 450:
2.2.5.1), without ultimately losing
- Page 451 and 452:
Appendix: The tag set WORD CLASS TA
- Page 453 and 454:
SYNTACTIC TAGS @SUBJ> @ @ @ @ @ @ @
- Page 455 and 456:
VALENCY TAGS and FUNCTIONAL SUBCLAS
- Page 457 and 458:
Syntactic and semantic subclasses o
- Page 459 and 460:
ich 222 fish perca ‘perch’, lob
- Page 461 and 462:
sit 490 situation, state of affairs
- Page 463 and 464:
cc 412 concrete objects (+CONCRETE,
- Page 465 and 466:
, ) unit - unit (always with -> num
- Page 467 and 468:
CPS from (possibly, ) not yet -PL
- Page 469 and 470:
Appendix: PALMORF program architect
- Page 471 and 472:
strings ending in or containing '.'
- Page 473 and 474:
Appendix: CG-rules for proper nouns
- Page 475 and 476:
if the PROP reading is heuristic, a
- Page 477 and 478:
é [ser] V PR 3S IND VFIN @FMV a [a
- Page 479 and 480:
$, domar [domar] V INF 0/1/3S @IMV
- Page 481 and 482:
de [de] PRP @ não [não] ADV @ADVL
- Page 483 and 484:
ex-patrão [patrão] N M S @P< Alci
- Page 485 and 486:
novo [novo] ADJ M S @N< $. Atençã
- Page 487 and 488:
Outros [outro] DET M P @SUBJ> cae
- Page 489 and 490:
, embargada [embargar] V PCP F
- Page 491 and 492:
Cutting, Doug & Kupiec, Julian & Pe
- Page 493 and 494:
Wauschkuhn, Oliver, "Ein Werkzeug z
- Page 495 and 496:
operator adverbs;89;320 post-adverb
- Page 497 and 498:
unbounded vs. absolute;178 contextu
- Page 499 and 500:
word class;23 word root;22 list def
- Page 501 and 502:
as relative adverb;273 real time pa
- Page 503:
uniqueness principle;206 breaches i