01.08.2014 Views

Estimation, Evaluation, and Selection of Actuarial Models

Estimation, Evaluation, and Selection of Actuarial Models

Estimation, Evaluation, and Selection of Actuarial Models

SHOW MORE
SHOW LESS

You also want an ePaper? Increase the reach of your titles

YUMPU automatically turns print PDFs into web optimized ePapers that Google loves.

62 CHAPTER 4. MODEL EVALUATION AND SELECTION<br />

This Chapter begins with a review <strong>of</strong> hypothesis testing, mostly because it is no longer directly<br />

covered on prior actuarial examinations. The remaining sections cover a variety <strong>of</strong> evaluation <strong>and</strong><br />

selection tools. Each tool has its own strengths <strong>and</strong> weaknesses <strong>and</strong> it is possible for different tools<br />

to lead to different models. This makes modeling as much art as science. At times, in real-world<br />

applications, the model’s purpose may lead the analyst to favor one tool over another.<br />

4.2 A review <strong>of</strong> hypothesis tests<br />

The Neyman-Pearson approach to hypothesis testing is covered in detail in most mathematical<br />

statistics texts. This review will be fairly straightforward <strong>and</strong> will not address philosophical issues<br />

or consider alternative approaches. A hypothesis test begins with two hypotheses, one called the<br />

null <strong>and</strong> one called the alternative. The traditional notation is H 0 for the null hypothesis <strong>and</strong> H 1<br />

for the alternative hypothesis. The two hypotheses are not treated symmetrically. Reversing them<br />

may alter the results. To illustrate this process, a simple example will be used.<br />

Example 4.1 Your company has been basing its premiums on an assumption that the average<br />

claim is 1,200. You want to raise the premium <strong>and</strong> a regulator has dem<strong>and</strong>ed that you provide<br />

evidence that the average now exceeds 1,200. To provide such evidence, the numbers in Data Set B<br />

have been obtained. What are the hypotheses for this problem?<br />

Let µ be the population mean. One hypothesis (the one you claim is true) is that µ>1, 200.<br />

Because hypothesis tests must present an either/or situation, the other hypothesis must be µ ≤<br />

1, 200. The only remaining task is to decide which <strong>of</strong> them is the null hypothesis. Whenever the<br />

universe <strong>of</strong> continuous possibilities is divided in two there is likely to be a boundary that needs to<br />

be assigned to one hypothesis or the other. The hypothesis that includes the boundary must be<br />

the null hypothesis. Therefore, the problem can be succinctly stated as:<br />

H 0 : µ ≤ 1, 200<br />

H 1 : µ>1, 200.<br />

¤<br />

The decision is made by calculating a quantity called a test statistic. It is a function <strong>of</strong> the<br />

observations <strong>and</strong> is treated as a r<strong>and</strong>om variable. That is, in designing the test procedure we are<br />

concerned with the samples that might have been obtained <strong>and</strong> not with the particular sample<br />

that was obtained. The test specification is completed by constructing a rejection region. Itisa<br />

subset <strong>of</strong> the possible values <strong>of</strong> the test statistic. If the value <strong>of</strong> the test statistic for the observed<br />

sample is in the rejection region, the null hypothesis is rejected <strong>and</strong> the alternative hypothesis<br />

is announced as the result that is supported by the data. Otherwise, the null hypothesis is not<br />

rejected (more on this later). The boundaries <strong>of</strong> the rejection region (other than plus or minus<br />

infinity) are called the critical values.<br />

Example 4.2 For the continuing example, complete the test using the test statistic <strong>and</strong> rejection<br />

region that is promoted in most statistics books. Assume that the population that produced Data Set<br />

B has a normal distribution with st<strong>and</strong>ard deviation 3,435.

Hooray! Your file is uploaded and ready to be published.

Saved successfully!

Ooh no, something went wrong!