Estimation, Evaluation, and Selection of Actuarial Models

CHAPTER 2. MODEL ESTIMATION

2.2 Estimation using data-dependent distributions

2.2.1 Introduction

When observations are collected from a probability distribution, the ideal situation is to have the (essentially) exact¹ value of each observation. This case is referred to as “complete, individual data.” This is the case in Data Sets B and D1. There are two reasons why exact data may not be available. One is grouping, in which all that is recorded is the range of values in which the observation belongs. This is the case for Data Set C and for Data Set A for those with five or more accidents.

A second reason that exact values may not be available is the presence of censoring or truncation. When data are censored from below, observations below a given value are known to be below that value, but the exact value is unknown. When data are censored from above, observations above a given value are known to be above that value, but the exact value is unknown. Note that censoring effectively creates grouped data. When the data are grouped in the first place, censoring has no effect. For example, the data in Data Set C may have been censored from above at 300,000, but we cannot know for sure from the data set and that knowledge has no effect on how we treat the data. On the other hand, were Data Set B to be censored at 1,000, we would have fifteen individual observations and then five grouped observations in the interval from 1,000 to infinity.

In insurance settings, censoring from above is fairly common. For example, if a policy pays no more than 100,000 for an accident, any time the loss is above 100,000 the actual amount will be unknown, but we will know that it happened. In Data Set D2 we have random censoring. Consider the fifth policy in the table. When the “other information” is not available, all that is known about the time of death is that it will be after 1.8 years. All of the policies are censored at 5 years by the nature of the policy itself.
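The effect of censoring from above can be sketched in a few lines of Python. This is only an illustration: the loss amounts below are hypothetical (they are not taken from any of the Data Sets), and the function name `censor_from_above` is an assumption introduced here. The sketch shows how censoring leaves some individual observations and turns the rest into a single grouped count for the interval from the limit to infinity.

```python
from typing import List, Tuple


def censor_from_above(losses: List[float], limit: float) -> Tuple[List[float], int]:
    """Split losses into exactly observed values and a count of censored ones.

    Losses at or below `limit` keep their exact value. Losses above `limit`
    are only known to exceed it, so all that survives is how many there were.
    """
    exact = [x for x in losses if x <= limit]
    n_censored = sum(1 for x in losses if x > limit)
    return exact, n_censored


# Hypothetical loss amounts, for illustration only.
hypothetical_losses = [12_000, 250_000, 48_500, 100_000, 730_000, 5_200]

# A policy that pays no more than 100,000 per accident.
exact, n_censored = censor_from_above(hypothetical_losses, 100_000)

print(exact)       # individual observations: [12000, 48500, 100000, 5200]
print(n_censored)  # grouped observations in (100,000, infinity): 2
```

The two losses above the limit are still counted, which is what separates censoring from simply discarding observations.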
Also, note that Data Set A has been censored from above at 5. This is more common language than to say that Data Set A has some individual data and some grouped data.

When data are truncated from below, observations below a given value are not recorded. Truncation from above implies that observations above a given value are not recorded. In insurance settings, truncation from below is fairly common. If an automobile physical damage policy has a per claim deductible of 250, any losses below 250 will not come to the attention of the insurance company and so will not appear in any data sets. Data Set D2 has observations 31-40 truncated from below at varying values. The other data sets may have truncation forced on them. For example, if Data Set B were to be truncated from below at 250, the first seven observations would disappear and the remaining thirteen would be unchanged.

¹ Some measurements are never exact. Ages may be rounded to the nearest whole number, monetary amounts to the nearest dollar, car mileage to the nearest tenth of a mile, and so on. This Note is not concerned with such rounding errors. Rounded values will be treated as if they are exact.

2.2.2 The empirical distribution for complete, individual data

As noted in Definition 2.3, the empirical distribution assigns probability 1/n to each data point. That works well when the value of each data point is recorded. An alternative definition is

Definition 2.5 The empirical distribution function is

\[
F_n(x) = \frac{\text{number of observations} \le x}{n}
\]

where n is the total number of observations.

Example 2.6 Provide the empirical probability functions for the data in Data Sets A and B. For Data Set A also provide the empirical distribution function. For Data Set A assume all 7 drivers who had 5 or more accidents had exactly 5 accidents.

For notation in this note, a subscript of the sample size (or of n if the sample size is not known) will be used to indicate an empirical function. Without the subscript, the function represents the true function for the underlying random variable.

For Data Set A, the estimated probability function is

\[
p_{94,935}(x) =
\begin{cases}
81{,}714/94{,}935 = 0.860736, & x = 0 \\
11{,}306/94{,}935 = 0.119092, & x = 1 \\
1{,}618/94{,}935 = 0.017043, & x = 2 \\
250/94{,}935 = 0.002633, & x = 3 \\
40/94{,}935 = 0.000421, & x = 4 \\
7/94{,}935 = 0.000074, & x = 5
\end{cases}
\]

where the values add to 0.999999 due to rounding. The distribution function is a step function with jumps at each data point.

\[
F_{94,935}(x) =
\begin{cases}
0/94{,}935 = 0.000000, & x < 0 \\
81{,}714/94{,}935 = 0.860736, & 0 \le x \ldots
\end{cases}
\]
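The empirical functions for Data Set A can be reproduced directly from the counts quoted in Example 2.6. The following is a minimal Python sketch, assuming (as the example does) that all 7 drivers with 5 or more accidents had exactly 5. The names `empirical_pf` and `empirical_cdf` are illustrative choices made here, not notation from the text.

```python
from bisect import bisect_right
from itertools import accumulate

# Number of drivers by accident count, as quoted in Example 2.6 for Data Set A.
# Assumption from the example: the 7 drivers with 5 or more accidents had exactly 5.
counts = {0: 81_714, 1: 11_306, 2: 1_618, 3: 250, 4: 40, 5: 7}

n = sum(counts.values())  # total number of observations

values = sorted(counts)                                    # distinct data points
cumulative = list(accumulate(counts[v] for v in values))   # running totals


def empirical_pf(x: int) -> float:
    """Empirical probability function: share of observations equal to x."""
    return counts.get(x, 0) / n


def empirical_cdf(x: float) -> float:
    """Empirical distribution function: (number of observations <= x) / n."""
    k = bisect_right(values, x)  # how many distinct data points are <= x
    return cumulative[k - 1] / n if k else 0.0


print(n)                           # 94935
print(round(empirical_pf(0), 6))   # 0.860736
print(round(empirical_cdf(0), 6))  # 0.860736
print(empirical_cdf(-1))           # 0.0
print(empirical_cdf(5))            # 1.0
```

Because `empirical_cdf` is built from running totals, it is constant between data points and jumps at each one, which is the step-function behavior described above.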