01.08.2014 Views

Estimation, Evaluation, and Selection of Actuarial Models

Estimation, Evaluation, and Selection of Actuarial Models

Estimation, Evaluation, and Selection of Actuarial Models

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

4.4. GRAPHICAL COMPARISON OF THE DENSITY AND DISTRIBUTION FUNCTIONS67<br />

For the remainder <strong>of</strong> this Note, when a distribution function or density function is indicated,<br />

a subscript equal to the sample size indicates that it is the empirical model (from Kaplan-Meier,<br />

Nelson-Åalen, the ogive, etc.) while no adornment, or the use <strong>of</strong> an asterisk (*), indicates the<br />

estimated parametric model. There is no notation for the true, underlying distribution, because it<br />

is unknown <strong>and</strong> unknowable.<br />

4.4 Graphical comparison <strong>of</strong> the density <strong>and</strong> distribution functions<br />

The most direct way to see how well the model <strong>and</strong> data match up is to plot the respective density<br />

<strong>and</strong> distribution functions.<br />

Example 4.11 Consider Data Sets B <strong>and</strong> C. However, for this example <strong>and</strong> all that follow in<br />

this Section, replace the value at 15,743 with 3,476 (this is to allow the graphs to fit comfortably<br />

on a page). Truncate Data Set B at 50 <strong>and</strong> Data Set C at 7,500. Estimate the parameter <strong>of</strong> an<br />

exponential model for each data set. Plot the appropriate functions <strong>and</strong> comment on the quality <strong>of</strong><br />

the fit <strong>of</strong> the model. Repeat this for Data Set B censored at 1,000 (without any truncation).<br />

For Data Set B, there are 19 observations (the first observation is removed due to truncation).<br />

A typical contribution to the likelihood function is f(82)/[1 − F (50)]. The maximum likelihood<br />

estimate <strong>of</strong> the exponential parameter is ˆθ =802.32. The empirical distribution function starts at<br />

50 <strong>and</strong> jumps 1/19 at each data point. The distribution function, using a truncation point <strong>of</strong> 50 is<br />

F ∗ (x) = 1 − e−x/802.32 − (1 − e −50/802.32 )<br />

1 − (1 − e −50/802.32 )<br />

=1− e −(x−50)/802.32 .<br />

The following figure presents a plot <strong>of</strong> these two functions.<br />

Exponential Fit<br />

F(x)<br />

1<br />

0.9<br />

0.8<br />

0.7<br />

0.6<br />

0.5<br />

0.4<br />

0.3<br />

0.2<br />

0.1<br />

0<br />

0 700 1400 2100 2800 3500<br />

x<br />

Model<br />

Empirical<br />

Model vs. data cdf plot for Data Set B truncated at 50

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!