Estimation, Evaluation, and Selection of Actuarial Models

Estimation, Evaluation, and Selection of Actuarial Models Estimation, Evaluation, and Selection of Actuarial Models

01.08.2014 Views

106 APPENDIX A. SOLUTIONS TO EXERCISES negative first moment rather than the positive first moment. That is, equate the average reciprocal to the expected reciprocal: 1 nX 1 = E(X −1 )=θ −1 . n x j j=1 This special method of moments estimate is identical to the maximum likelihood estimate. For the inverse gamma distribution with α =2, f(x|θ) = θ2 e −θ/x x 3 Γ(2) , ln f(x|θ) =2lnθ − θx−1 − 3lnx nX nX l(θ) = (2 ln θ − θx −1 j − 3lnx j )=2n ln θ − nyθ − 3 ln x j j=1 l 0 (θ) = 2nθ −1 − ny, ˆθ =2/y. For Data Set B, ˆθ = 395.44 and the value of the loglikelihood function is −169.07. Themethodof moments estimate solves the equation 1, 424.4 = θ α − 1 = θ 2 − 1 , ˆθ =1, 424.4 which differs from the maximum likelihood estimate. For the inverse gamma distribution with both parameters unknown, f(x|θ) = j=1 θα e −θ/x x α+1 Γ(α) , ln f(x|θ) =α ln θ − θx−1 − (α +1)lnx − ln Γ(α). The likelihood function must be maximized numerically. The answer is ˆα =0.70888 and ˆθ = 140.16 and the loglikelihood value is −158.88. The methods of moments estimate is the solution to the two equations 1, 424.4 = 13, 238, 441.9 = θ α − 1 θ 2 (α − 1)(α − 2) . Squaring the first equation and dividing it into the second equation gives 6.52489 = α − 1 α − 2 which leads to ˆα =2.181 and then ˆθ =1, 682.2. This does not match the maximum likelihood estimate (which had to be the case because the mle produces a model that does not have a mean). Exercise 28 For the inverse exponential distribution, the cdf is F (x) =e −θ/x . Numerical maximization yields ˆθ =6, 662.39 and the value of the loglikelihood function is −365.40. For the gamma distribution, the cdf requires numerical evaluation. In Excel the function GAMMADIST(x, α, θ,true) can be used. The estimates are ˆα = 0.37139 and ˆθ = 83, 020. The value of the loglikelihood function is −360.50. For the inverse gamma distribution, the cdf is available in Excel as

107 1− GAMMADIST(1/x, α, 1/θ,true). The estimates are ˆα =0.83556 and ˆθ =5, 113. The value of the loglikelihood function is −363.92. Exercise 29 In each case the likelihood function is f(27)f(82) ···f(243)[1 − F (250)] 13 . The following table provides the estimates for both the original and censored data sets. Model Original Censored exponential ˆθ =1, 424.4 ˆθ =594.14 gamma ˆα =0.55616, ˆθ =2, 561.1 ˆα =1.5183, ˆθ = 295.69 inv. exponential ˆθ =197.72 ˆθ =189.78 inv. gamma ˆα =0.70888, ˆθ = 140.16 ˆα =0.41612, ˆθ =86.290 The censoring tended to disguise the true nature of these numbers and in general had a large impact on the estimates. Exercise 30 The calculations are done as in the example, but with θ unknown. The likelihood must be numerically maximized. For the shifted data the estimates are ˆα =1.4521 and ˆθ =907.98. The two expected costs are 907.98/0.4521 = 2, 008 and 1, 107.98/0.4521 = 2, 451 for the 200 and 400 deductibles respectively. For the unshifted data the estimates are ˆα =1.4521 and ˆθ = 707.98. The three expected costs are 707.98/0.4521 = 1, 566, 2,008, and 2, 451 for the 0, 200, and 400 deductibles respectively. While it is always the case that for the Pareto distribution the two approaches produce identical answers, that will not be true in general. Exercise 31 Thesametablecanbeused. Theonlydifference is that observations that were surrenders are now treated as x-values and deaths are treated as y-values. Observations that ended at 5.0 continue to be treated as y-values. Once again there is no estimate for a Pareto model. The gamma parameter estimates are ˆα =1.229 and ˆθ =6.452. Exercise 32 The contribution to the likelihood for the first five values (number of drivers having zero through four accidents) is unchanged. However, for the last seven drivers, the contribution is [Pr(X ≥ 5)] 7 =[1− p(0) − p(1) − p(2) − p(3) − p(4)] 7 and the maximum must be obtained numerically. The estimated values are ˆλ = 0.16313 and ˆq =0.02039. These answers are similar to those for Example 2.36 because the probability of six or more accidents is so small. Exercise 33 There are four cases, with the likelihood function being the product of the probabilities for those cases raised to a power equal to the number of times each occurred. The following table provides the probabilities. Event Probability F (1)−F (0.4) 1−F (0.4) Observed at age 35.4 and died Observed at age 35.4 and survived 1 − 0.6w Observed at age 35 and died F (1) = w Observed at age 35 and survived 1 − w The likelihood function is µ 0.6w L = 1 − 0.4w and its logarithm is = w−0.4w 1−0.4w = 1−0.4w 0.6w 1−0.4w = 1−0.4w 1−w 6 µ 1 − w 4 w 8 (1 − w) 12 ∝ w14 (1 − w) 16 1 − 0.4w (1 − 0.4w) 10 l =14lnw +16ln(1− w) − 10 ln(1 − 0.4w).

106 APPENDIX A. SOLUTIONS TO EXERCISES<br />

negative first moment rather than the positive first moment. That is, equate the average reciprocal<br />

to the expected reciprocal:<br />

1<br />

nX 1<br />

= E(X −1 )=θ −1 .<br />

n x j<br />

j=1<br />

This special method <strong>of</strong> moments estimate is identical to the maximum likelihood estimate.<br />

For the inverse gamma distribution with α =2,<br />

f(x|θ) = θ2 e −θ/x<br />

x 3 Γ(2) , ln f(x|θ) =2lnθ − θx−1 − 3lnx<br />

nX<br />

nX<br />

l(θ) = (2 ln θ − θx −1<br />

j<br />

− 3lnx j )=2n ln θ − nyθ − 3 ln x j<br />

j=1<br />

l 0 (θ) = 2nθ −1 − ny, ˆθ =2/y.<br />

For Data Set B, ˆθ = 395.44 <strong>and</strong> the value <strong>of</strong> the loglikelihood function is −169.07. Themethod<strong>of</strong><br />

moments estimate solves the equation<br />

1, 424.4 = θ<br />

α − 1 =<br />

θ<br />

2 − 1 , ˆθ =1, 424.4<br />

which differs from the maximum likelihood estimate.<br />

For the inverse gamma distribution with both parameters unknown,<br />

f(x|θ) =<br />

j=1<br />

θα e −θ/x<br />

x α+1 Γ(α) , ln f(x|θ) =α ln θ − θx−1 − (α +1)lnx − ln Γ(α).<br />

The likelihood function must be maximized numerically. The answer is ˆα =0.70888 <strong>and</strong> ˆθ = 140.16<br />

<strong>and</strong> the loglikelihood value is −158.88. The methods <strong>of</strong> moments estimate is the solution to the<br />

two equations<br />

1, 424.4 =<br />

13, 238, 441.9 =<br />

θ<br />

α − 1<br />

θ 2<br />

(α − 1)(α − 2) .<br />

Squaring the first equation <strong>and</strong> dividing it into the second equation gives<br />

6.52489 = α − 1<br />

α − 2<br />

which leads to ˆα =2.181 <strong>and</strong> then ˆθ =1, 682.2. This does not match the maximum likelihood<br />

estimate (which had to be the case because the mle produces a model that does not have a mean).<br />

Exercise 28 For the inverse exponential distribution, the cdf is F (x) =e −θ/x . Numerical maximization<br />

yields ˆθ =6, 662.39 <strong>and</strong> the value <strong>of</strong> the loglikelihood function is −365.40. For the gamma<br />

distribution, the cdf requires numerical evaluation. In Excel the function GAMMADIST(x, α, θ,true)<br />

can be used. The estimates are ˆα = 0.37139 <strong>and</strong> ˆθ = 83, 020. The value <strong>of</strong> the loglikelihood<br />

function is −360.50. For the inverse gamma distribution, the cdf is available in Excel as

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!