Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...
Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...
Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...
You also want an ePaper? Increase the reach of your titles
YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.
tel-00703797, version 2 - 7 Jun 2012<br />
1.6.1 Mo<strong>de</strong>l presentation<br />
1.6. Other regression mo<strong>de</strong>ls<br />
The Generalized Additive Mo<strong>de</strong>ls (GAM) were introduced by Hastie and Tibshirani (1990)<br />
by unifying generalized linear mo<strong>de</strong>ls and additive mo<strong>de</strong>ls. So, GAMs combine two flexible<br />
and powerful methods: (i) the exponential family which can <strong>de</strong>al with many distribution for<br />
the response variable and (ii) additive mo<strong>de</strong>ls which relax the linearity assumption of the<br />
predictor.<br />
Theoretical presentation<br />
In this subsection, we present Generalized Additive Mo<strong>de</strong>ls in two steps: from linear to<br />
additive mo<strong>de</strong>ls and then from additive to generalized additive mo<strong>de</strong>ls. Fitting algorithms are<br />
then briefly presented, whereas smoothing techniques are <strong>de</strong>tailed in Appendix 1.8.1. Finally,<br />
we apply GAMs on the large dataset of Subsection 1.3.2.<br />
Assuming observations Xi and response variables Yi are i<strong>de</strong>ntically and in<strong>de</strong>pen<strong>de</strong>ntly<br />
distributed random variables having the same distribution of generic random variables X and<br />
Y , respectively. In a linear mo<strong>de</strong>l, the mo<strong>de</strong>l equation is<br />
Y = XΘ + E<br />
where Y as always stands for the response variable, X the <strong><strong>de</strong>s</strong>ign matrix and E the random<br />
noise. Linear mo<strong>de</strong>ls assume by <strong>de</strong>finition a linear relationship motivated by mathematical<br />
tractability rather than empirical evi<strong>de</strong>nce. One candidate to extend the linear mo<strong>de</strong>l is the<br />
additive mo<strong>de</strong>l <strong>de</strong>fined by<br />
p<br />
Y = α + fj(Xj) + E,<br />
j=1<br />
with fj smooth function of the jth explanatory variable Xj and E is assumed to be a centered<br />
random variable with variance σ 2 .<br />
A GAM is characterized by three components:<br />
1. a random component: Yi follows a distribution of the exponential family Fexp(θi, φi, a, b, c),<br />
2. a systematic component: the covariate vector Xi provi<strong><strong>de</strong>s</strong> a smooth predictor ηi =<br />
α + p<br />
j=1 fj(Xij),<br />
3. a link function g : R ↦→ S which is monotone, differentiable and invertible, such that<br />
E(Yi) = g −1 (ηi),<br />
for i ∈ {1, . . . , n}, where θi is the shape parameter, φi the dispersion parameter, a, b, c three<br />
functions (characterizing the distribution), fj’s smooth functions and S a set of possible values<br />
of the expectation E(Yi). Note that linear mo<strong>de</strong>ls (and GLMs) are special cases of additive<br />
mo<strong>de</strong>ls (and GAMs) with fj(x) = βjx.<br />
We present here only the main i<strong>de</strong>a of fitting algorithms and we do not go into <strong>de</strong>tails, see<br />
Appendix 1.8.1 for a list of smoothing procedures. All smoothers have a smoothing parameter<br />
λ, (the polynom <strong>de</strong>gree, the bandwidth or the span). A first concern is how to choose a<br />
criterion on which to optimize λ (hence to have an automatic selection). Then, a second<br />
concern is to find a reliable estimate of the parameters α and smooths coefficients given a<br />
smoothing value λ.<br />
We present the procedure in the reverse way. Assuming a value of λ, we present an algorithm<br />
to fit the mo<strong>de</strong>l. Hastie and Tibshirani (1990) propose a local averaging generalized<br />
67