12.07.2015 Views

Large-Scale Semi-Supervised Learning for Natural Language ...

Large-Scale Semi-Supervised Learning for Natural Language ...

Large-Scale Semi-Supervised Learning for Natural Language ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

Accuracy (%)1009590858075706560SUPERLMSUMLMRATIOLMTRIGRAM100 1000Number of training examplesFigure 3.4: Non-referential detection learning curveother pairs, around the 0.80 considered to be good reliability. These are, perhaps surprisingly,the only it-annotation agreement statistics available <strong>for</strong> written text. They contrastfavourably with the low agreement <strong>for</strong> categorizing it in spoken dialog [Müller, 2006].3.7.4 Non-referential Pronoun Detection ResultsMain ResultsFor non-referential pronoun detection, BASE (always choosing referential) achieves 59.4%,while SUPERLM reaches 82.4%. RATIOLM, with no tuned thresholds, per<strong>for</strong>ms worst(67.4%), while TRIGRAM (74.3%) and SUMLM (79.8%) achieve reasonable per<strong>for</strong>manceby comparing scores <strong>for</strong> it and they. All differences are statistically significant (McNemar’stest, p

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!