Loading [MathJax]/extensions/TeX/mathchoice.js
Henry III, John B., and Ping-Hung Hsieh. 2009. “Extreme Value Analysis for Partitioned Insurance Losses.” Variance 3 (2): 214–38.
Download all (10)
  • Figure 1. Performance of Hill (top) and Gk (bottom) estimators for underlying Pareto model with true tail index α=1.5(D=1,α=1.5). Hill estimates use all order statistics above F1(p) where F is the distribution function of the underlying distribution. Tail index estimates using grouped data are found using Eq. (3.2) for the given number of upper interval counts k. Sample size = number of replications = 1000.
  • Figure 2. Performance of Hill (top) and Gk (bottom) estimators for underlying generalized Pareto model with true tail index α=1.5(γ=1/1.5,σ=1). Hill estimates use all order statistics above F1(p) where F is the distribution function of the underlying distribution. Tail index estimates using grouped data are found using Eq. (3.2) for the given number of upper interval counts k. Sample size = number of replications =1000.
  • Figure 3. Performance of Hill (top) and Gk (bottom) estimators for underlying Burr model with true tail index α=1.5(λ=1.2,θ=4/2,τ=3/4). Hill estimates use all order statistics above F1(p) where F is the distribution function of the underlying distribution. Tail index estimates using grouped data are found using Eq. (3.2) for the given number of upper interval counts k. Sample size = number of replications =1000.
  • Figure 4. Performance of Hill (top) and Gk (bottom) estimators for underlying half T model with true tail index α=1.5(ϕ=1.5). Hill estimates use all order statistics above F1(p) where F is the distribution function of the underlying distribution. Tail index estimates using grouped data are found using Eq. (3.2) for the given number of upper interval counts k. Sample size = number of replications =1000.
  • Figure 5. Estimation of mean excess value e(q.95). ML estimates are calculated under the assumption of the specified distributions. The true distribution F is Pareto with tail index α=1.5. The top plot uses all data, and the bottom plot uses grouped data. The Hill q.90 and Hill q.80 use all order statistics larger than q.90=F1(.90) and q.80=F1(.80). The G6 and G7 use the counts from top 6 and 7 intervals. Sample size = number of replications = 1000.
  • Figure 6. Estimation of mean excess value e(q.95). ML estimates are calculated under the assumption of the specified distributions. The true distribution F is generalized Pareto with tail index α=1.5(γ=1/1.5,σ=1). The top plot uses all data, and the bottom plot uses grouped data. The Hill_ q.90 and Hill_ q.80 use all order statistics larger than q.90=F1(.90) and q.80=F1(.80). The G6 and G7 use the counts from top 6 and 7 intervals. Sample size = number of replications =1000.
  • Figure 7. Estimation of mean excess value e(q.95). ML estimates are calculated under the assumption of the specified distributions. The true distribution F is Burr with tail index α=1.5(λ=1.2,θ=4/2,τ=3/4). The top plot uses all data, and the bottom plot uses grouped data. The Hill 9.90 and Hill q.80 use all order statistics larger than q.90=F1(.9) and q.80=F1(.8). The G6 and G7 use the counts from top 6 and 7 intervals. Sample size = number of replications = 1000 .
  • Figure 8. Estimation of mean excess value e(q.95). ML estimates are calculated under the assumption of the specified distributions. The true distribution F is half T with tail index α=1.5(ϕ=1.5). The top plot uses all data, and the bottom plot uses grouped data. The Hill_ q.90 and Hill_ q.80 use all order statistics larger than q.90=F1(.9) and q.80=F1(.8). The G6 and G7 use the counts from top 6 and 7 intervals. Sample size = number of replications =1000.
  • Figure 9. Tail index estimation for fire loss data. The estimates for α using Eq. (3.2) are stable in the range 5k8. This suggests to choose the cutoff a8=500 as the threshold and to use the observed counts in top 8 intervals in Eq. (3.2).
  • Figure 10. Comparison of empirical and fitted tail probabilities for fire loss data. ˉFn(x) is given by open circles and ˆˉF(x) by the dashed line where ˆα=0.7905 and ak=500. Note that the x axis is on log scale.

Abstract

The heavy-tailed nature of insurance claims requires that special attention be put into the analysis of the tail behavior of a loss distribution. It has been demonstrated that the distribution of large claims of several lines of insurance have Pareto-type tails. As a result, estimating the tail index, which is a measure of the heavy-tailedness of a distribution, has received a great deal of attention. Although numerous tail index estimators have been proposed in the literature, many of them require detailed knowledge of individual losses and are thus inappropriate for insurance data in partitioned form. In this study we bridge this gap by developing a tail index estimator suitable for partitioned loss data. This estimator is robust in the sense that no particular global density is assumed for the loss distribution. Instead we focus only on fitting the model in the tail of the distribution where it is believed that the Pareto-type form holds. Strengths and weaknesses of the proposed estimator are explored through simulation and an application of the estimator to real world partitioned insurance data is given.