21.04.2014 Views

1h6QSyH

1h6QSyH

1h6QSyH

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

220 6. Linear Model Selection and Regularization<br />

Standardized Coefficients<br />

−200 0 100 200 300 400<br />

20 50 100 200 500 2000 5000<br />

Standardized Coefficients<br />

−300 −100 0 100 200 300 400<br />

Income<br />

Limit<br />

Rating<br />

Student<br />

0.0 0.2 0.4 0.6 0.8 1.0<br />

λ ˆβL λ 1/ ˆβ 1<br />

FIGURE 6.6. The standardized lasso coefficients on the Credit data set are<br />

shown as a function of λ and ‖ ˆβ L λ ‖ 1/‖ ˆβ‖ 1.<br />

As an example, consider the coefficient plots in Figure 6.6, which are generated<br />

from applying the lasso to the Credit data set. When λ =0,then<br />

the lasso simply gives the least squares fit, and when λ becomes sufficiently<br />

large, the lasso gives the null model in which all coefficient estimates equal<br />

zero. However, in between these two extremes, the ridge regression and<br />

lasso models are quite different from each other. Moving from left to right<br />

in the right-hand panel of Figure 6.6, we observe that at first the lasso results<br />

in a model that contains only the rating predictor. Then student and<br />

limit enter the model almost simultaneously, shortly followed by income.<br />

Eventually, the remaining variables enter the model. Hence, depending on<br />

the value of λ, the lasso can produce a model involving any number of variables.<br />

In contrast, ridge regression will always include all of the variables in<br />

the model, although the magnitude of the coefficient estimates will depend<br />

on λ.<br />

Another Formulation for Ridge Regression and the Lasso<br />

One can show that the lasso and ridge regression coefficient estimates solve<br />

the problems<br />

⎧ ⎛<br />

⎞<br />

⎪⎨<br />

⎫ n∑<br />

p∑<br />

⎪ ⎬ p∑<br />

minimize ⎝y i − β 0 − β j x ij<br />

⎠2<br />

subject to |β j |≤s<br />

β ⎪⎩<br />

⎪<br />

i=1<br />

j=1 ⎭ j=1<br />

(6.8)<br />

and<br />

⎧ ⎛<br />

⎞<br />

⎪⎨<br />

⎫ ⎪ n∑<br />

p∑ ⎬ p∑<br />

minimize ⎝y i − β 0 − β j x ⎠2<br />

ij subject to βj 2 ≤ s,<br />

β ⎪⎩<br />

⎪<br />

i=1<br />

j=1 ⎭ j=1<br />

(6.9)

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!