12.07.2015 Views

Large-Scale Semi-Supervised Learning for Natural Language ...

Large-Scale Semi-Supervised Learning for Natural Language ...

Large-Scale Semi-Supervised Learning for Natural Language ...

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

Accuracy (%)10095908580757065601001e3N-GM+LEXN-GMLEX1e4Number of training examples1e5Figure 5.1: In-domain learning curve of adjective ordering classifiers on BNC.Accuracy (%)1009590858075706560N-GM+LEXN-GMLEX1001e31e4Number of training examples1e5Figure 5.2: Out-of-domain learning curve of adjective ordering classifiers on Gutenberg.and test pairs helps explain. While 59% of the BNC test pairs were seen in the trainingcorpus, only 25% of Gutenberg and 18% of Medline pairs were seen in training.While other ordering models have also achieved “very poor results” out-of-domain[Mitchell, 2009], we expected our expanded set of LEX features to provide good generalizationon new data. Instead, LEX is very unreliable on new domains.N-GM features do not rely on specific pairs in training data, and thus remain fairly robustcross-domain. Across the three test sets, 84-89% of examples had the correct orderingappear at least once on the web. On new domains, the learned N-GM system maintains anadvantage over the unsupervised c(a 1 ,a 2 ) vs. c(a 2 ,a 1 ), but the difference is reduced. Notethat training with 10-fold cross validation, the N-GM system can achieve up to 87.5% onGutenberg (90.0% <strong>for</strong> N-GM + LEX).The learning curves showing per<strong>for</strong>mance on Gutenberg and Medline (but still trainingon BNC) is particularly instructive (Figures 5.2 and 5.3). The LEX system per<strong>for</strong>ms muchworse than the web-based models across all training sizes. For our top in-domain system,N-GM + LEX, as you add more labeled examples, per<strong>for</strong>mance begins decreasing out-ofdomain.The system disregards the robust N-gram counts as it is more and more confidentin the LEX features, and it suffers the consequences.74

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!