01.11.2014 Views

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

MACHINE LEARNING TECHNIQUES - LASA

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

61<br />

Figure 3-18: Linear combination of very simple classifiers (learners) trained by boosting. Each<br />

classifier on its own would not be able to separate the data. Their linear combination is able to<br />

separate complex data.[DEMOS\CLASSIFICATION\BOOSTING.ML]<br />

Note:<br />

Boosting and Bagging are methods based on a supervised process, where the “real” class of the<br />

data is known a priori. In contrast, the clustering techniques seen in the first part of this chapter<br />

work in an unsupervised manner, where the true labeling of the data is unknown.<br />

3.3 Bayes Classifier<br />

The simplest means to perform binary classification in probabilistic models is to use the so-called<br />

i<br />

y ∈− 1, + 1 of a set of i = 1.... M datapoints x i .<br />

Bayes Classifier. Assume a binary labeling [ ]<br />

Assume that you have built two probabilistic models p ( y| x) and p ( y|<br />

x)<br />

+ −<br />

that predict the<br />

probability that the data point x had associated label +1 and -1 respectively. A Bayes classifier will<br />

decide on the correct labeling simply by comparing the relative probabilities of each model, i.e.:<br />

( ) ≥ ( )<br />

If p y| x p y| x , then y=+1.<br />

+ −<br />

Otherwise y=-1.<br />

(3.32)<br />

Clearly such a simplistic model is bound to be very erroneous as it does not take into account the<br />

absolute value of the likelihood associated with each model. If both classifiers are predicting the<br />

labeling of x with a very very low likelihood (which happens when the data point x is very far from<br />

the training points or when the two classes overlap heavily in that region) then, deciding on one<br />

class label over the other is usually no better than random. Besides, when p+ and p are two<br />

−<br />

arbitrary densities, comparing these may be dangerous if one did not make sure that the<br />

© A.G.Billard 2004 – Last Update March 2011

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!