28.05.2013 Views

Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...

Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...

Etude des marchés d'assurance non-vie à l'aide d'équilibres de ...

SHOW MORE
SHOW LESS

Create successful ePaper yourself

Turn your PDF publications into a flip-book with our unique Google optimized e-Paper software.

tel-00703797, version 2 - 7 Jun 2012<br />

Chapitre 1. Sur la nécessité d’un modèle <strong>de</strong> marché<br />

The parameters θ, called thresholds, have a special interpretation since they link the<br />

response variable Z with a latent variable U by the equation Z = dk ⇔ θk−1 < U ≤ θk.<br />

Hence, the trick to go from a Bernoulli mo<strong>de</strong>l to a polytomous mo<strong>de</strong>l is to have different<br />

or<strong>de</strong>red intercept coefficients θk’s for the different categorical values.<br />

As in Dionne et al. (2001), our choice goes to the or<strong>de</strong>red logit mo<strong>de</strong>l for its simplicity.<br />

So, Z is mo<strong>de</strong>lled by the following equation<br />

<br />

−1<br />

P (Zi ≤ j/Xi, Yi) = g θj + X T i β + Yiγ + <br />

E(Y |Xi)δ ,<br />

for individual i and <strong>de</strong>ductible j, with g −1 the logistic distribution function ∗ and Xi exogeneous<br />

explanatory variables as opposed to en<strong>de</strong>geneous variables Yi. The parameters of this<br />

mo<strong>de</strong>l equation are the regression coefficients β and γ and the threshold parameter θk’s.<br />

1.5.3 Application on the large dataset of Subsection 1.3.2<br />

We want to test for evi<strong>de</strong>nce of adverse selection on the full comprehensive (FC) coverage<br />

product. So, we study in this subsection only the three datasets relative to that coverage.<br />

First, we mo<strong>de</strong>l the claim number, and then we test for the asymmetry of information.<br />

Mo<strong>de</strong>lling the claim number<br />

Mo<strong>de</strong>lling count data in the generalized linear mo<strong>de</strong>l framework can be done by choosing<br />

an appropriate distribution: the Poisson and overdispersed Poisson distribution, cf. Table<br />

1.1, where the ca<strong>non</strong>ical link function is the logarithm. Since for a Poisson distribution P(λ),<br />

P (Y = 0) = e −λ , the GLM Poisson consists in assuming<br />

E(Y/ xi) = e xT i β ⇔ − log P (Y = 0/ xi) = x T i β.<br />

where xi <strong>de</strong>notes the covariates. In practice, this mo<strong>de</strong>ls suffers a subparametrization of the<br />

Poisson distribution, one single parameter.<br />

One could think that the Negative binomial in an exten<strong>de</strong>d GLM † framework will tackle<br />

this issue, but in practice the mass in zero is so high, that both Poisson and negative binomial<br />

distributions are inappropriate. As presented in Table 1.14, the high number of zero-claim<br />

will compromise the good fit of regular discrete distributions.<br />

Claim number 0 1 2 3 4 5 5 <<br />

Frequency 43687 5308 667 94 17 2 38<br />

Table 1.14: Claim number for Full Comp. agent subset<br />

As presented in Zeileis et al. (2008) and the references therein, the issue is solved by using<br />

a zero-inflated distribution, e.g., a zero-inflated Poisson distribution. The mass probability<br />

∗. Note that in this form, it is easy to see that g −1 can be any distribution functions (e.g. normal or<br />

extreme value distributions).<br />

†. The negative binomial distribution does not belong to the exponential family, except if the shape parameter<br />

is known. So, the trick is to use a maximum likelihood procedure for that shape parameter at outer<br />

iteration whereas each inner iteration use a GLM fit given the current value of the shape parameter.<br />

64

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!