Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...
tel-00703797, version 2 - 7 Jun 2012
1.2. GLMs, a brief introduction

Model adequacy
The deviance, which is one way to measure the model's adequacy to the data and generalizes the R² measure of linear models, is defined by

D(y, ˆπ) = 2(ln(L(y1, . . . , yn, y1, . . . , yn)) − ln(L(ˆπ1, . . . , ˆπn, y1, . . . , yn))),

where ˆπ is the vector of fitted means obtained from the estimated coefficient vector ˆβ. The "best" model is the one having the lowest deviance. However, if all responses are binary data, the first term — the log-likelihood of the saturated model — is zero, since each term yi ln yi + (1 − yi) ln(1 − yi) vanishes for yi ∈ {0, 1}. So in practice, we consider the deviance simply as

D(y, ˆπ) = −2 ln(L(ˆπ1, . . . , ˆπn, y1, . . . , yn)).
Furthermore, the deviance is used as a relative measure to compare two models. Most software, in particular R, reports two deviances from the GLM fitting function: the null deviance and the (residual) deviance. The null deviance is the deviance of the model containing only an intercept (or only an offset, if one is specified), i.e. when p = 1 and X is a single column of ones∗. The second deviance is the deviance D(y, ˆπ) of the model with the p explanatory variables. Note that if there are as many parameters as there are observations, the deviance is the best possible, but the model does not explain anything.
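To make the two deviances concrete, here is a minimal sketch in Python (the text works with R, but the computation is identical); the binary responses and fitted probabilities below are made up for illustration:

```python
import math

def binary_deviance(y, pi_hat):
    """D(y, pi_hat) = -2 ln L for binary responses y in {0, 1}."""
    log_lik = sum(yi * math.log(pi) + (1 - yi) * math.log(1 - pi)
                  for yi, pi in zip(y, pi_hat))
    return -2 * log_lik

# Illustrative binary responses and hypothetical fitted probabilities
y = [1, 1, 0, 1, 0]
pi_hat = [0.8, 0.7, 0.3, 0.9, 0.2]

# Null deviance: the intercept-only model fits every observation
# with the empirical mean of y
pi_null = sum(y) / len(y)
null_deviance = binary_deviance(y, [pi_null] * len(y))

# Residual deviance of the model with explanatory variables
model_deviance = binary_deviance(y, pi_hat)

# The model with covariates achieves a lower (better) deviance
print(round(null_deviance, 3), round(model_deviance, 3))  # 6.73 2.53
```

The gap between the null deviance and the residual deviance measures how much of the heterogeneity the explanatory variables capture.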
Another criterion, introduced by Akaike in the 1970s, is the Akaike Information Criterion (AIC), which is also an adequacy measure for statistical models. Unlike the deviance, the AIC penalizes overfitted models, i.e. models with too many parameters compared to the size of the dataset. The AIC is defined by

AIC(y, ˆπ) = 2k − 2 ln(L(ˆπ1, . . . , ˆπn, y1, . . . , yn)),

where k is the number of parameters, i.e. the length of β. This criterion is a trade-off between the improvement in log-likelihood brought by additional variables and the cost of including them in the model. To compare two models with different numbers of parameters, we look for the one with the lowest AIC.
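The trade-off can be illustrated numerically; the log-likelihoods and parameter counts below are hypothetical:

```python
def aic(log_likelihood, k):
    """AIC = 2k - 2 ln L; lower is better."""
    return 2 * k - 2 * log_likelihood

# Hypothetical fitted log-likelihoods for two nested models
ll_small, k_small = -240.0, 3   # 3 parameters
ll_large, k_large = -238.5, 6   # 3 extra variables, slightly better fit

print(aic(ll_small, k_small))   # 486.0
print(aic(ll_large, k_large))   # 489.0
```

Here the gain of 1.5 in log-likelihood does not justify the 3 extra parameters: the smaller model has the lower AIC and is preferred.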
In a linear model, the analysis of residuals (which are assumed to be independent and identically distributed Gaussian variables) may reveal that the model is inappropriate. Typically, we can plot the residuals against the fitted values. For GLMs, the analysis of residuals is much more complex, because we lose the normality assumption. Furthermore, for binary (as opposed to binomial) data, the plot of residuals exhibits straight lines, which are hard to interpret, see Appendix 1.8.2. We believe that residual analysis is not appropriate for binary regressions.
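The straight lines can be seen directly from the raw residuals: for a binary response, the residual yi − ˆπi can only take one of two values at any fitted probability, namely −ˆπi (when yi = 0) or 1 − ˆπi (when yi = 1). A minimal sketch with an illustrative grid of fitted probabilities:

```python
# For binary y, the raw residual y - pi_hat takes only two possible values
# at each fitted probability: -pi_hat (y = 0) or 1 - pi_hat (y = 1).
# Plotted against the fitted values, the points therefore fall on two
# parallel lines of slope -1, which is what makes the plot hard to read.
pi_grid = [0.1, 0.3, 0.5, 0.7, 0.9]
residuals = {(pi, y): y - pi for pi in pi_grid for y in (0, 1)}

for pi in pi_grid:
    assert residuals[(pi, 0)] == -pi       # lower line
    assert residuals[(pi, 1)] == 1 - pi    # upper line
```

With binomial (grouped) data, each fitted value aggregates several responses, so the residuals spread out instead of collapsing onto two lines.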
Variable selection
From the asymptotic normal distribution of the maximum likelihood estimator, we can derive confidence intervals as well as hypothesis tests for the coefficients. Therefore, a p-value is available for each coefficient of the regression, which helps us keep only the most significant variables. However, since removing one variable impacts the significance of the others, it can be hard to find the optimal set of explanatory variables.

There are two approaches: forward selection, i.e. starting from the null model, we add the most significant variable at each step; or backward elimination, i.e. starting from the full model, we remove the least significant variable at each step.
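Forward selection can be sketched as a greedy loop over a selection criterion such as the AIC. In the sketch below, a hypothetical table of AIC values stands in for the GLM refit that each step would require in practice (e.g. R's glm() followed by AIC()); the variable names and all numbers are made up:

```python
# Hypothetical AIC for each candidate subset of explanatory variables,
# standing in for an actual model refit at each step.
AIC_TABLE = {
    frozenset(): 120.0,                           # null model
    frozenset({"age"}): 100.0,
    frozenset({"power"}): 110.0,
    frozenset({"region"}): 118.0,
    frozenset({"age", "power"}): 95.0,
    frozenset({"age", "region"}): 101.0,
    frozenset({"power", "region"}): 108.0,
    frozenset({"age", "power", "region"}): 96.0,  # full model
}

def forward_selection(candidates, aic_of):
    """Start from the null model; at each step add the variable that lowers
    the AIC the most, and stop when no addition improves it."""
    selected = frozenset()
    best_aic = aic_of(selected)
    while candidates - selected:
        trials = [(aic_of(selected | {v}), selected | {v})
                  for v in candidates - selected]
        trial_aic, trial_set = min(trials, key=lambda t: t[0])
        if trial_aic >= best_aic:
            break  # no further improvement: stop
        best_aic, selected = trial_aic, trial_set
    return selected, best_aic

selected, best = forward_selection({"age", "power", "region"}, AIC_TABLE.get)
print(sorted(selected), best)  # ['age', 'power'] 95.0
```

Backward elimination follows the same pattern in reverse, starting from the full model and removing the variable whose deletion lowers the AIC the most. Neither greedy scheme is guaranteed to find the globally optimal subset, which is why the two approaches can disagree.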
∗. It means that all the heterogeneity of the data comes from the random component.