MACHINE LEARNING TECHNIQUES - LASA
MACHINE LEARNING TECHNIQUES - LASA
MACHINE LEARNING TECHNIQUES - LASA
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
61<br />
Figure 3-18: Linear combination of very simple classifiers (learners) trained by boosting. Each<br />
classifier on its own would not be able to separate the data. Their linear combination is able to<br />
separate complex data.[DEMOS\CLASSIFICATION\BOOSTING.ML]<br />
Note:<br />
Boosting and Bagging are methods based on a supervised process, where the “real” class of the<br />
data is known a priori. In contrast, the clustering techniques seen in the first part of this chapter<br />
work in an unsupervised manner, where the true labeling of the data is unknown.<br />
3.3 Bayes Classifier<br />
The simplest means to perform binary classification in probabilistic models is to use the so-called<br />
i<br />
y ∈− 1, + 1 of a set of i = 1.... M datapoints x i .<br />
Bayes Classifier. Assume a binary labeling [ ]<br />
Assume that you have built two probabilistic models p ( y| x) and p ( y|<br />
x)<br />
+ −<br />
that predict the<br />
probability that the data point x had associated label +1 and -1 respectively. A Bayes classifier will<br />
decide on the correct labeling simply by comparing the relative probabilities of each model, i.e.:<br />
( ) ≥ ( )<br />
If p y| x p y| x , then y=+1.<br />
+ −<br />
Otherwise y=-1.<br />
(3.32)<br />
Clearly such a simplistic model is bound to be very erroneous as it does not take into account the<br />
absolute value of the likelihood associated with each model. If both classifiers are predicting the<br />
labeling of x with a very very low likelihood (which happens when the data point x is very far from<br />
the training points or when the two classes overlap heavily in that region) then, deciding on one<br />
class label over the other is usually no better than random. Besides, when p+ and p are two<br />
−<br />
arbitrary densities, comparing these may be dangerous if one did not make sure that the<br />
© A.G.Billard 2004 – Last Update March 2011