Object Detection Using Nested Cascades of Boosted Classifiers


5.2.3 Multiclass Weak Classifiers

The multiclass weak classifiers can have many different structures, and these structures need not be tied to the boosting algorithm being used. This means that the boosting algorithm and the multiclass weak classifier structure do not need to be connected (as is done in [Torralba et al., 2007] and [Huang et al., 2007]), and that other boosting approaches, such as GentleBoost [Lienhart et al., 2003] or LogitBoost [Friedman et al., 1998], could be used with the multiclass weak classifiers instead of the proposed generalized version of Adaboost. Taking this into consideration, we identify three different ways of selecting the functions $h_t(f_t(x),m)$.

In the first one, which we call independent components, the components of $\vec{h}_t(f_t(x))$, $\{h_t(f_t(x),m)\}_{m=1,\dots,M}$, are chosen independently of each other (as in [Huang et al., 2007]). In the second case, joint components, the components are chosen jointly (as in [Torralba et al., 2007]): the same function $\bar{h}_t(\cdot)$ is used for several components/classes, and the remaining components output a zero value:

$$h_t(f_t(x),m) = \beta^m_t \, \bar{h}_t(f_t(x)), \quad \beta^m_t \in \{0,1\}. \qquad (5.3)$$

In the third case, we introduce the concept of coupled components, where the components share a common function, but for each component this function is multiplied by a different scalar value $\gamma^m_t$:

$$h_t(f_t(x),m) = \gamma^m_t \, \bar{h}_t(f_t(x)), \quad \gamma^m_t \in \mathbb{R}. \qquad (5.4)$$

This last case, coupled components (Equation 5.4), resembles the case of joint components (Equation 5.3), but it presents several advantages: its training time is much shorter (unlike [Torralba et al., 2007] it does not need a heuristic search), it scales with the number of classes, and, as we will see later, it is much more accurate. In some cases coupled components also present advantages over independent components, e.g. fewer parameters need to be estimated, which can help to avoid over-fitting, in particular when small training sets are used (this is shown in [Torralba et al., 2007] for joint components). The use of fewer parameters can also be useful when the memory footprint is important, for example when the detector is to be used on mobile devices or robots with low memory capacity. Note, however, that there is a trade-off, not only between the complexity of the weak components and the size of the training set, but also with the size of the boosted classifier.

Optimization Problem

As in Chapter 4, the weak classifiers are designed following the domain-partitioning weak hypotheses paradigm [Schapire and Singer, 1999]. Each feature domain $\mathcal{F}$ is partitioned into disjoint blocks $F_1,\dots,F_J$, and a weak classifier $h(f(x),m)$ has one output per partition block of its associated feature. During training, the use of coupled, independent or joint components, together with domain-partitioning classifiers, leads to different optimization problems, which are outlined in the following. As part of the optimization problem, in all three cases the values $W^{j,m}_{l}$, with $j \in \{1,\dots,J\}$, $m \in \{1,\dots,M\}$, and $l \in \{-1,+1\}$, need to be evaluated, where $W^{j,m}_{l}$ represents bin $j$ of a weighted histogram (of the feature values) for component $m$ over the "positive" ($l=+1$) or "negative" ($l=-1$) examples:

$$W^{j,m}_{l} = \sum_{i:\, f(x_i) \in F_j \,\wedge\, (\vec{a}_i)_m = l} w_{i,m}, \qquad (5.5)$$

with $(\vec{a}_i)_m$ representing the value of component $m$ of the vector $\vec{a}_i$.
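To make these definitions concrete, the following minimal sketch shows how the three output structures (Equations 5.3 and 5.4) and the weighted histograms of Equation (5.5) could be computed. It assumes per-example, per-component weights $w_{i,m}$ and vector labels in $\{-1,+1\}$; the function and variable names (`weighted_histograms`, `bin_edges`, etc.) are illustrative and do not come from the original text.

```python
import numpy as np

# Output structures of the multiclass weak classifiers.
# h_bar_value is the shared scalar value h_bar_t(f_t(x)) of Eqs. (5.3)/(5.4).

def joint_output(h_bar_value, beta):
    """Joint components (Eq. 5.3): h_t(f_t(x), m) = beta_m * h_bar, beta_m in {0, 1}."""
    return beta * h_bar_value            # beta: (M,) binary vector

def coupled_output(h_bar_value, gamma):
    """Coupled components (Eq. 5.4): h_t(f_t(x), m) = gamma_m * h_bar, gamma_m real."""
    return gamma * h_bar_value           # gamma: (M,) real vector

def weighted_histograms(f_values, bin_edges, a, w):
    """Weighted histograms W_l^{j,m} of Eq. (5.5).

    f_values  : (N,)   feature values f(x_i)
    bin_edges : (J+1,) edges defining the partition blocks F_1, ..., F_J
    a         : (N, M) vector labels, with (a_i)_m in {-1, +1}
    w         : (N, M) boosting weights w_{i,m} (assumed per example and component)
    Returns W of shape (J, M, 2), where W[j, m, 0] accumulates the l = -1 mass
    and W[j, m, 1] the l = +1 mass.
    """
    N, M = a.shape
    J = len(bin_edges) - 1
    # Partition block of every training example, found in a single pass.
    block = np.clip(np.digitize(f_values, bin_edges) - 1, 0, J - 1)
    W = np.zeros((J, M, 2))
    l_idx = (a > 0).astype(int)                 # 0 -> l = -1, 1 -> l = +1
    rows = np.repeat(block, M)                  # block index of each (i, m) pair
    cols = np.tile(np.arange(M), N)             # component index of each pair
    np.add.at(W, (rows, cols, l_idx.ravel()), w.ravel())
    return W
```

For independent components no shared function $\bar{h}_t$ is needed: each component simply receives its own per-block value, as made explicit in the optimization below.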

It is important to note that the evaluation of $\{W^{j,m}_{l}\}_{m,j,l}$ takes order $O(N)$, with $N$ the number of training examples, and does not depend on $J$ (the number of partitions) nor on $M$ (the number of classes/components). This linear dependency on $N$ only is very important for keeping the computational complexity of the training algorithm low.

In contrast to the one-class case, the output of the domain-partitioning weak classifier also depends on the component of the multiclass classifier; therefore the output of the classifier, for a feature $f$, is $h(f(x),m) = c_{j,m} \in \mathbb{R}$ such that $f(x) \in F_j$. For each weak classifier, the value associated to each partition block ($c_{j,m}$), i.e. its output, is selected to minimize $Z_t$:

$$\min_{c_{j,m} \in \mathbb{R}} Z_t = \min_{c_{j,m} \in \mathbb{R}} \sum_{j,m} \left( W^{j,m}_{+1} e^{-c_{j,m}} + W^{j,m}_{-1} e^{c_{j,m}} \right), \qquad (5.6)$$

and the value of $c_{j,m}$ depends on the kind of weak classifier being used.

In the case of independent components, this minimization problem has an analytic solution (using standard calculus):

$$c_{j,m} = \frac{1}{2} \ln \left( \frac{W^{j,m}_{+1} + \varepsilon_m}{W^{j,m}_{-1} + \varepsilon_m} \right), \qquad (5.7)$$

with $\varepsilon_m$ a regularization parameter. Note that the output components are independent, i.e., the values of $c_{j,m_1}$ and $c_{j,m_2}$, for $m_1 \neq m_2$, are calculated independently and do not depend on each other.

It is important to note how this regularization value, $\varepsilon_m$, is selected. In [Schapire and Singer, 1999] it is shown, for a two-class classification problem, that it is appropriate to choose $\varepsilon \ll \frac{1}{2J}$, with $J$ the number of partitions, and the authors recommend using $\varepsilon$ on the order of $1/n$, with $n$ the number of training examples. If the same analysis is carried out in the multiclass setting used here, the value of $\varepsilon_m$ should be on the order of $1/n_m$, with $n_m$ the number of training examples for class $m$ (this follows directly from Equation (11) in [Schapire and Singer, 1999]). This corresponds to smoothing the weighted histograms taking into account the number of training samples used to evaluate them.

In the case of weak classifiers with joint components, the optimization problem also has an analytical solution, but it requires solving a combinatorial optimization problem. More formally, in this case $c_{j,m} = \beta_m c_j$, with $\beta_m \in \{0,1\}$. The optimization problem to be solved is:

$$\min_{c_j \in \mathbb{R},\, \beta_m \in \{0,1\}} Z_t = \min_{\beta_m \in \{0,1\}} \min_{c_j \in \mathbb{R}} \sum_{j,m} \left( W^{j,m}_{+1} e^{-c_j \beta_m} + W^{j,m}_{-1} e^{c_j \beta_m} \right). \qquad (5.8)$$

In order to obtain the optimal value of this problem, $2^M$ problems of the form

$$\min_{\hat{c}_j \in \mathbb{R}} \sum_j \left( \hat{W}^{j}_{+1} e^{-\hat{c}_j} + \hat{W}^{j}_{-1} e^{\hat{c}_j} \right) \qquad (5.9)$$

must be solved, with each of these problems having a solution of the form

$$\hat{c}_j = \frac{1}{2} \ln \left( \frac{\hat{W}^{j}_{+1} + \varepsilon}{\hat{W}^{j}_{-1} + \varepsilon} \right), \quad \text{with} \quad \hat{W}^{j}_{l} = \sum_{m:\, \beta_m = 1} W^{j,m}_{l}, \ l \in \{-1,+1\}. \qquad (5.10)$$
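As a rough illustration of how these closed-form solutions could be evaluated in practice, the sketch below computes the per-block outputs $c_{j,m}$ for independent components (Equation 5.7, with the smoothing $\varepsilon_m$ taken on the order of $1/n_m$) and performs the exhaustive search over the $2^M$ subsets required by joint components (Equations 5.8 to 5.10). It reuses the `W` array from the previous sketch; function names such as `independent_outputs` and `best_joint_subset` are hypothetical, and the coupled-components solution discussed later in the text is not covered here.

```python
import numpy as np
from itertools import product

def independent_outputs(W, n_per_class):
    """Per-block outputs c_{j,m} for independent components (Eq. 5.7).

    W           : (J, M, 2) histograms; W[..., 0] = W_{-1}^{j,m}, W[..., 1] = W_{+1}^{j,m}
    n_per_class : (M,) number of training examples per class; the smoothing
                  term is taken as eps_m ~ 1/n_m, as suggested in the text.
    """
    eps = 1.0 / np.asarray(n_per_class, dtype=float)        # eps_m, shape (M,)
    return 0.5 * np.log((W[:, :, 1] + eps) / (W[:, :, 0] + eps))

def best_joint_subset(W, eps=1e-8):
    """Exhaustive search over the 2^M binary vectors beta (joint components).

    For every beta, the histograms of the selected components are pooled
    (Eq. 5.10), the pooled problem (Eq. 5.9) is solved in closed form, and the
    loss Z_t of Eq. (5.8) is evaluated; the best configuration is returned.
    """
    J, M, _ = W.shape
    best_beta, best_c, best_Z = None, None, np.inf
    for bits in product((0, 1), repeat=M):
        beta = np.array(bits)
        W_hat = (W * beta[None, :, None]).sum(axis=1)        # pooled histograms, (J, 2)
        c_hat = 0.5 * np.log((W_hat[:, 1] + eps) / (W_hat[:, 0] + eps))  # Eq. (5.10)
        c_full = c_hat[:, None] * beta[None, :]              # c_{j,m} = beta_m * c_j
        Z = np.sum(W[:, :, 1] * np.exp(-c_full) + W[:, :, 0] * np.exp(c_full))
        if Z < best_Z:
            best_beta, best_c, best_Z = beta, c_hat, Z
    return best_beta, best_c, best_Z
```

Note that the exhaustive search makes joint-components training exponential in the number of classes $M$, which is precisely the scalability issue that the coupled-components formulation is intended to avoid.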

