1. Introduction
The American workers’ compensation system is one of the most successful social programs in the country. Many U.S. states are currently using the experience rating plan of the National Council on Compensation Insurance (NCCI). In the current NCCI experience rating plan, a cap, called the state limit, is first applied to each individual claim. Next, a cutoff point, called the split point, is applied to the capped claim; that is, the capped claim is split into two components—(1) the primary loss, which reflects frequency, and (2) the excess loss, which reflects severity—and then two different credibilities are assigned to these two components. In the literature, such a practice is referred to as split credibility. Split credibility has been used in practice for a few decades and studied by several authors, such as Gillam (1992), Mahler (1987), and Venter (1987). Recently, Robbin (2013) initiated a rigorous study of split credibility and derived several interesting results. The purpose of this paper is to investigate the properties of the primary loss and the excess loss in a split rating plan.
Robbin (2013) showed that the coefficient of variation (CV) of the total loss is no greater than that of the excess loss in the collective risk model[1] when the frequency distribution is a Poisson distribution. We show that this conclusion holds for an arbitrary frequency distribution as well. In addition, we show that the CV of the primary loss is no greater than that of the total loss. These s.png establish the ordering of the CVs of the primary loss, the excess loss, and the total loss.
Next, we point out that the usual credibility theory framework implies a model that is more general than the one in the collective risk model. We call such a model the generalized collective risk model. Therefore, it is natural to ask whether the established ordering of the CVs of the primary loss, the excess loss, and the total loss in the collective risk model still holds in this generalized collective risk model. We show that the answer is affirmative.
Finally, we study the covariance and correlation coefficient between the primary loss and the excess loss, establishing several interesting results. For example, (1) neither the covariance nor the correlation coefficient between the primary loss and the excess loss of a claim is a monotonic function of the split point; (2) the covariance between the primary loss of a claim and the excess loss of a different claim is a nondecreasing function of the state limit; (3) the covariance between the primary loss and the excess loss of a claim is a nondecreasing function of the state limit, but the correlation coefficient between the primary loss and the excess loss of a claim is a nonincreasing function of the state limit; (4) the primary loss and the excess loss of a single claim are nonnegatively correlated, as are the primary loss of one claim and the excess loss of another. Table 4.6 summarizes the key results along this line.
The paper concludes with two conjectures: (1) the correlation coefficient between the primary loss of one claim and the excess loss of another is a nondecreasing function of both the state limit and the split point; (2) the correlation coefficient between the primary loss and the excess loss of a claim is a nondecreasing function of the state limit.
The remainder of the paper is organized as follows. Section 2 establishes that the primary loss is less volatile than the total loss, which in turn is less volatile than the excess loss, in the collective risk model. Section 3 establishes the same ordering of the primary loss, the excess loss, and the total loss in the generalized collective risk model. Section 4 investigates the covariance and correlation between the primary loss and the excess loss. Section 5 concludes the paper with a summary and two conjectures.
2. Coefficients of variation of the primary loss, the excess loss, and the total loss in the collective risk model
Following Robbin (2013), we will use the CV to measure the volatility of a random variable. Robbin (2013) noticed an intuitive justification for split credibility: if the total loss is split into the primary loss and the excess loss, both are less volatile than the total loss, making them more credible than the total loss. Assuming that the (claim) frequency distribution is a Poisson distribution, Robbin proved that the CV of the total loss is no greater than the CV of the excess loss in the collective risk model, thus demonstrating that the intuitive justification is incorrect. But it is still desirable to know whether this intuitive justification may be valid for some other frequency distributions. Since Robbin’s result concerns only the excess loss and the total loss, it is also natural to ask whether the total loss is less volatile than the primary loss. The purpose of this section is to investigate these two problems. We establish that the CV of the total loss is never greater than the CV of the excess loss for an arbitrary frequency distribution. Then we show that the CV of the primary loss is no greater than that of the total loss.
To start, we consider the collective risk model. Let N be the number of claims and Xi be the loss amount of the ith claim (i = 1, 2, . . . , N). We specify that A = X1 + X2 + . . . + XN; that is, A denotes the aggregate loss. We assume that all Xis are independent and identically distributed (iid) with an absolutely continuous cumulative distribution function (CDF) of F(x) and a survival function of S(x). We also assume that each Xi is independent of N. Let C denote the state limit and K denote the split point under a split rating plan. Without loss of generality, we assume that 0 ≤ K ≤ C. Then for the ith claim, the primary loss and the excess loss are given by
Xip={Xi, if Xi≤K;K, if Xi>K;
and
Xie={0, if Xi≤KXi−K, if K<Xi≤C;C−K, if Xi>C
respectively. For aggregate loss A, the primary loss, Ap, and the excess loss, Ae, are defined by
Ap=X1p+…+XNp
and
Ae=X1e+…+XNe
respectively. (To avoid complicated notation, we will often suppress the index
when we refer to an individual claim.) The mean and variance of will be denoted by and respectively; and should be interpreted similarly. For aggregate loss the of the primary loss, the excess loss, and the total loss will be denoted by and respectively; for an individual loss the CVs of the primary loss, the excess loss, and the total loss will be denoted by and respectively.Theorem 2.1. For aggregate loss A in the collective risk model, the CV of the total loss is no greater than the CV of the excess loss; that is,
CVA≤CVAe
In particular, for a single claim, X, the CV of the total loss is no greater than the CV of the excess loss; that is,
CVX≤CVXe
Proof:
The square of the CV of aggregate loss A is given by
(CVA)2=μ2Xσ2N+μNσ2X(μXμN)2=σ2Nμ2N+1μNσ2Xμ2X.
The square of the CV of the excess loss equals
(CVAe)2=μ2Xeσ2N+μNσ2Xe(μXeμN)2=σ2Nμ2N+1μNσ2Xeμ2Xe.
Therefore, it suffices to show that
σ2Xμ2X≤σ2Xeμ2Xe.
To this end, we write Equation (2.1) as
E[X2]−μ2Xμ2X≤E[X2e]−μ2Xeμ2Xe,
which is equivalent to
E[X2]μ2X≤E[X2e]μ2Xe.
We will establish Equation (2.2) by showing that the function
is nondecreasing on the positive real line The derivatives of and are∂∂KE[X2e]=∂∂K[∫CK(x−K)2dF(x)+(C−K)2S(C)]=−2μXe,
and
∂∂KE(μ2Xe)=∂∂K[∫CK(x−K)dF(x)+(C−K)S(C)]2=2[∫CK(x−K)dF(x)+(C−K)S(C)][∫CK−1dF(x)−S(C)]=−2μXeS(K),
respectively. It follows that
G′(K)=2μ3Xe(S(K)E[X2e]−μ2Xe).
Therefore, the whole matter boils down to showing that
μ2Xe≤S(K)E[X2e].
To establish Equation (2.3), we write μ2X as
μXe=∫∞K1⋅g(x)dF(x),
where
g(x)={0, if x≤Kx−K, if K<x≤CC−K, if x>C
and apply the Cauchy-Schwarz inequality.
Theorem 2.2. For aggregate loss A in the collective risk model, the CV of the primary loss is no greater than the CV of the total loss; that is,
CVAp≤CVA
In particular, for a single loss, X, the CV of the primary loss is no greater than the CV of the total loss; that is,
CVXp≤CVX
Proof:
Similar to the proof of Theorem 2.1, we have
(CVA)2=σ2Nμ2N+1μNσ2Xμ2X,
and
(CVAp)2=σ2Nμ2N+1μNσ2Xpμ2Xp.
Therefore, it suffices to show that
σ2Xpμ2Xp≤σ2Xμ2X
which is equivalent to
E[X2p]μ2Xp≤E[X2]μ2X.
To this end, we specify
G(K)=E[X2p]μ2Xp
where
is the split point. Then Thus, we need only to show that is nondecreasing on We have\begin{aligned} \frac{\partial}{\partial K} E\left[X_{p}^{2}\right] & =\frac{\partial}{\partial K}\left[\int_{0}^{K} x^{2} d F(x)+K^{2} S(K)\right] \\ & =2 K S(K), \end{aligned}
and
\begin{aligned} \frac{\partial}{\partial K}\left(\mu_{X_{p}}^{2}\right) & =2 \mu_{X_{p}} \frac{\partial}{\partial K}\left[\int_{0}^{K} x d F(x)+K S(K)\right] \\ & =2 \mu_{X_{p}} S(K) . \end{aligned}
It follows that
G^{\prime}(K)=\frac{2 S(K)}{\mu_{X_{p}}^{3}}\left(K \mu_{X_{p}}-E\left[X_{p}^{2}\right]\right) \geq 0
where the last inequality follows from the fact that
\begin{aligned} K \mu_{X_{p}} & =K\left(\int_{0}^{K} x d F(x)+K S(K)\right) \\ & =K \int_{0}^{K} x d F(x)+K^{2} S(K) \\ & \geq \int_{0}^{K} x^{2} d F(x)+K^{2} S(K)=E\left[X_{p}^{2}\right] . \end{aligned}
Summarizing Theorem 2.1 and Theorem 2.2, we have the following conclusion:
Theorem 2.3. For aggregate loss A in the collective risk model, the CVs of the primary loss, the excess loss, and the total loss satisfy the following relationship:
C V_{A_{p}} \leq C V_{A} \leq C V_{A_{e}} .
In particular, for a single claim X, the CVs of the primary loss, the excess loss, and the total loss satisfy the following inequality:
C V_{X_{p}} \leq C V_{X} \leq C V_{X_{e}} .
Theorem 2.3 establishes the ordering of the CVs of the primary loss, the excess loss, and the total loss. It shows that the primary loss is always less volatile than the total loss. However, this does not mean the primary loss is more credible than the total because, as Robbin (2013) pointed out, volatility alone does not determine credibility. Also, Theorem 2.3 implies neither that the total loss is riskier than the primary loss nor that the excess loss is riskier than the total loss. The reason is that “risk” and “volatility” are not synonymous; see, for example, Brockett and Garven (1998). What Theorem 2.3 does tell us is that the current NCCI experience rating plan splits each aggregate loss into two components, with one being less volatile and the other more volatile. Therefore, the intuitive justification discussed in Section 2 is incorrect regardless of the distribution of the claim frequency. For the correct justification, see Robbin (2013).
In the collective risk model, all random claim-severity variables are assumed to be iid. This means that all policyholders in the pool are from one homogeneous risk class. However, this assumption is rarely met in reality; see, for example, Bühlmann and Gisler (2005) for more discussion along this line. Therefore, it makes sense to relax this iid assumption so that the uncertainty of the policyholder’s risk class is taken into consideration. For this reason, one might want to investigate whether the key results in this section will still hold if the random claim-severity variables are conditionally iid. The next section is devoted to this task.
3. Coefficients of variation of the primary loss, the excess loss, and the total loss in the generalized collective risk model
In the usual setup of credibility theory, each of the observed claims, X1, . . . , Xn, . . . , is assumed to belong to a risk class indicated by a risk parameter, θ, and all the risk classes are described by a risk family, Θ, called the parameter space. Given a risk parameter θ Θ, X1, . . . , Xn, . . . are assumed to be iid. Though X1, . . . , Xn, . . . are assumed to be conditionally iid, they need not be independent. Indeed, the conditional covariance formula implies that the covariance between Xi and Xj equals
\operatorname{Var}_{\theta}\left[E\left(X_{1} \mid \theta\right)\right]
in such a model; see, for example, Bühlmann and Gisler (2005).[2] Therefore, we consider the following model: claim severity random variables are conditionally ilid, given that the risk parameter, and the random frequency variable, are independent of all the s. To distinguish this model from the collective risk model in Section 2, we call this model the generalized collective risk model.[3] Thus it is natural to ask whether the conclusion of Theorem 2.3 still holds in the generalized collective risk model. The rest of this section is devoted to showing that the answer is affirmative. To lighten the notation, we will use and to denote and respectively. and are to be interpreted in a similar manner.
Theorem 3.1. For aggregate loss A in the generalized collective risk model, the CV of the total loss is no greater than the CV of the excess loss; that is,
C V_{A} \leq C V_{A_{e}}
Proof:
Let A = X1 + X2 + . . . + XN. Then the mean of A may be calculated using the iterated expectation formula as follows:
\begin{aligned} \mu_{A} & =E_{N}[E[A \mid N]]=E[N E[X]] \\ & =E[N] E[X]=\mu_{N} \mu_{X} . \end{aligned}
Also, the mean of A, given that N = n, equals
E[A \mid N=n]=n E[X]=n \mu_{X},
and the variance of A, given that N = n, equals
\begin{aligned} \operatorname{Var}[A \mid N=n]= & \operatorname{Var}\left[X_{1}+\ldots+X_{n}\right] \\ = & E_{\theta}\left[\operatorname{Var}\left[X_{1}+\ldots X_{n} \mid \theta\right]\right] \\ & +\operatorname{Var}_{\theta}\left[E\left[X_{1}+\ldots X_{n} \mid \theta\right]\right] \\ = & n E_{\theta}\left[\sigma_{X}^{\theta}\right]+n^{2} \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right] \end{aligned}
It follows from the previous two displays that
\begin{aligned} \sigma_{A}^{2} & =E_{N}[\operatorname{Var}(A \mid N)]+\operatorname{Var}_{N}[E(A \mid N)] \\ & =\sigma_{N}^{2} \mu_{X}^{2}+\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right] . \end{aligned}
Therefore,
\begin{aligned} \left(C V_{A}\right)^{2} & =\frac{\sigma_{N}^{2} \mu_{X}^{2}+\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right]}{\mu_{N}^{2} \mu_{X}^{2}} \\ & =\left(\frac{\sigma_{N}}{\mu_{N}}\right)^{2}+\frac{\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right]}{\mu_{N}^{2} \mu_{X}^{2}} . \end{aligned}
Likewise, for the excess loss we have
\begin{array}{l} \mu_{A_{e}}=\mu_{N} \mu_{X_{e}} \\ \sigma_{A_{e}}^{2}=\sigma_{N}^{2} \mu_{X_{e}}^{2}+\mu_{N} E_{\theta}\left[\sigma_{X_{e}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{e}}^{\theta}\right] \end{array}
\left(C V_{A_{e}}\right)^{2}=\left(\frac{\sigma_{N}}{\mu_{N}}\right)^{2}+\frac{\mu_{N} E_{\theta}\left[\sigma_{X_{e}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{e}}^{\theta}\right]}{\mu_{N}^{2} \mu_{X_{e}}^{2}} .
To show that
we need only to establish that\begin{array}{c} \frac{\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right]}{\mu_{X}^{2}} \\ \leq \frac{\mu_{N} E_{\theta}\left[\sigma_{X_{e}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{e}}^{\theta}\right]}{\mu_{X_{e}}^{2}} . \end{array} \tag{3.1}
To this end, we write
G(K)=\frac{\mu_{N} E_{\theta}\left[\sigma_{X_{e}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{e}}^{\theta}\right]}{\mu_{X_{e}}^{2}} .
Since G(0) = (CVA)2, it remains to show that G′(K) ≥ 0. Details of this argument are given in the appendix.
Theorem 3.2. For aggregate loss A in the generalized collective risk model, the CV of the primary loss is no greater than the CV of the total loss; that is,
C V_{A_{p}} \leq C V_{A}
Proof:
As in the proof of Theorem 3.1, we have
\left(C V_{A}\right)^{2}=\left(\frac{\sigma_{N}}{\mu_{N}}\right)^{2}+\frac{\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right]}{\mu_{N}^{2} \mu_{X}^{2}},
and
\left(C V_{A_{p}}\right)^{2}=\left(\frac{\sigma_{N}}{\mu_{N}}\right)^{2}+\frac{\mu_{N} E_{\theta}\left[\sigma_{X_{p}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{p}}^{\theta}\right]}{\mu_{N}^{2} \mu_{X_{p}}^{2}} .
Therefore, it suffices to show that
\begin{array}{l} \frac{\mu_{N} E_{\theta}\left[\sigma_{X_{p}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{p}}^{\theta}\right]}{\mu_{X_{p}}^{2}} \\ \leq \frac{\mu_{N} E_{\theta}\left[\sigma_{X}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X}^{\theta}\right]}{\mu_{X}^{2}} . \end{array}
To this end, we define a function
G(K)=\frac{\mu_{N} E_{\theta}\left[\sigma_{X_{p}}^{\theta}\right]+E\left[N^{2}\right] \operatorname{Var}_{\theta}\left[\mu_{X_{p}}^{\theta}\right]}{\mu_{X_{p}}^{2}} .
In the appendix, we show that G(K) is nondecreasing on R+. Therefore, G′(K) ≥ 0 on R+, and the theorem is established.
In view of Theorem 3.1 and Theorem 3.2, we have the following conclusion.
Theorem 3.3. For aggregate loss A in the generalized collective risk model, the CVs of the primary loss, the excess loss, and the total loss satisfy the following relationship:
C V_{A_{p}} \leq C V_{A} \leq C V_{A_{e}} .
Theorem 3.3 generalizes Theorem 2.3. If the parameter space Θ is a singleton, that is, Θ contains only one element, then Theorem 3.1, Theorem 3.2, and Theorem 3.3 specialize to Theorem 2.1, Theorem 2.2, and Theorem 2.3, respectively. Therefore, the remarks at the end of Section 2 apply here too. This also means that we have given two different sets of proofs for Theorem 2.1, Theorem 2.2, and Theorem 2.3. However, for the collective risk model, the proofs in Section 2 are more straightforward than those in Section 3.
In addition, Theorem 3.3 has some potential applications in capital allocation. Suppose that an actuary needs to allocate capital to an employer’s workers’ compensation plan. The actuary decides to use the method of allocating capital by percentile layer, discussed extensively in Bodoff (2009) and Hong (2013),[4] but at the same time is concerned about the volatility of each layer of the future loss. Theorem 3.3 says that the layers of the excess loss are more volatile than the layers of the primary loss. Therefore, they might be more challenging to estimate accurately, and so the actuary might want to assign more capital to the layers of the excess loss.
Moreover, Theorem 3.3 might provide some new information for state regulators too. In view of Theorem 3.3, the excess loss is more volatile than the primary loss. Maybe more care needs to be exercised when regulators audit any calculation involving the excess loss.
4. Covariance and correlation coefficient between the primary loss and the excess loss
It is clear that the primary loss and the excess loss are related. Therefore, one would naturally look at the covariance and the correlation coefficient between them. The following theorem gives the covariance between the primary loss and the excess loss of a single claim.
Theorem 4.1. Let X be a single claim, C be the state limit, and K be the split point. Then the primary loss and the excess loss are nonnegatively correlated, and the covariance, Cov(Xp, Xe), between them equals
\operatorname{Cov}\left(X_{p}, X_{e}\right)=\left(\int_{0}^{K} F(x) d x\right)\left(\int_{K}^{C} S(x) d x\right),
where
and are the cumulative distribution function and survival function of respectively. Moreover, is bounded above by and the equality holds if and only ifProof:
Recall that
X_{p}=\left\{\begin{array}{ll} X, & \text { if } X \leq K ; \\ K, & \text { if } X>K ; \end{array}\right.
and that
X_{e}=\left\{\begin{array}{ll} 0, & \text { if } X \leq K \\ X-K, & \text { if } K<X \leq C \\ C-K, & \text { if } X>C \end{array}\right.
Therefore,
\begin{aligned} E\left[X_{e}\right] & =\int_{K}^{C}(x-K) d F(x)+(C-K) S(C) \\ & =\int_{K}^{C} S(x) d x \end{aligned}
E\left[X_{p}\right]=\int_{0}^{K} x d F(x)+K S(K)=\int_{0}^{K} S(x) d x
and
\begin{aligned} E\left[X_{e} X_{p}\right]= & \int_{0}^{K} 0 \odot x d F(x)+\int_{k}^{C}(x-K) \circ K d F(x) \\ & +\int_{C}^{\infty}(C-K) K d F(x) \\ = & K\left[\int_{K}^{C}(x-K) d F(x)+(C-K) S(C)\right] \\ = & K E\left[X_{e}\right] . \end{aligned}
It follows that
\begin{aligned} \operatorname{Cov}\left(X_{p}, X_{e}\right) & =E\left[X_{p} X_{e}\right]-E\left[X_{p}\right] E\left[X_{e}\right] \\ & =K E\left[X_{e}\right]-\left(\int_{0}^{K} S(x) d x\right) E\left[X_{e}\right] \\ & =\left(K-\int_{0}^{K} S(x) d x\right) E\left[X_{e}\right] \\ & =\left(\int_{0}^{K} F(x) d x\right)\left(\int_{K}^{C} S(x) d x\right) \geq 0 . \end{aligned}
The second statement follows from the fact that for two nonnegative real numbers, a and b, a + b ≥
and the equality holds if and only if a = b.Theorem 4.1 says that Xp and Xe are positively correlated. That is, if the primary loss turns out to be larger than average, then it is likely that the excess loss will also be larger than average. This confirms the intuition that a larger-than-average Xe implies a larger-than-average X, which in turn implies a larger-than-average Xe. Moreover, Theorem 4.1 gives an explicit formula for Cov(Xp, Xe), allowing actuaries to calculate the correlation coefficient between Xp and Xe as in Example 4.3. In other words, Theorem 4.1 makes it possible to measure the positive correlation between Xp and Xe on a numerical scale.
Since the primary loss and the excess loss clearly depend on K and C, it is interesting to see how K and C affect their CVs. A scrutiny of the proofs of Theorem 3.1 and Theorem 3.2 reveals that the following result holds.
Theorem 4.2. For aggregate loss A in the generalized collective risk model, the CVs of the primary loss and the excess loss are both nondecreasing functions of K.
In light of the previous discussion, it is natural to ask whether the covariance, Cov(Ap, Ae), between the primary loss and the excess loss for aggregate loss A is also a monotonic function of K. The following example shows that the answer is negative.
Example 4.1. Suppose N is a degenerate random variable at the point 1; that is, we consider an individual claim X. We assume that X follows the exponential distribution with a hazard rate of λ > 0. Then the CDF, F(x), and the survival function, S(x), of X are given by
F(x)=\left\{\begin{array}{ll} 1-e^{-\lambda x}, & x>0 ; \\ 0, & x \leq 0 ; \end{array}\right.
and
S(x)=\left\{\begin{array}{ll} e^{-\lambda x}, & x>0 ; \\ 1, & x \leq 0, \end{array}\right.
respectively. Assume G(K) = Cov(Ap, Ae). Since N is degenerate at 1, G(K) = Cov(Xp, Xe). It follows from Theorem 4.1 that
G(K)=\frac{1}{\lambda}\left(e^{-\lambda K}-e^{-\lambda C}\right)\left[K-\frac{1}{\lambda}\left(1-e^{-\lambda K}\right)\right] .
Take λ = 1. Then
G(K)=\left(e^{-K}-e^{-C}\right)\left(K-1+e^{-K}\right)
Table 4.1 gives some numerical values of G(K) when C = 10, showing that G(K) is not a monotonic function of K.
Remark. In particular, this example shows that the covariance, Cov(Xip, Xje), between the primary loss and the excess loss of a single claim is not a monotonic function of K.
The fact that we used the dimensionless quantity CV in Sections 3 and 4 is important; otherwise, the conclusions in these sections may not hold. To illustrate this point, consider the following example.
Example 4.2. Suppose an individual claim, X, follows the exponential distribution with a hazard rate of λ > 0. Then Var[X] = 1/λ2, and the following are true:
E\left[X_{p}\right]=\int_{0}^{K} x \lambda e^{-\lambda x} d x+K S(K)=\frac{1}{\lambda}\left(1-e^{-\lambda K}\right),
\begin{aligned} E\left[X_{p}^{2}\right] & =\int_{0}^{K} x^{2} \lambda e^{-\lambda x} d x+K^{2} S(K) \\ & =\frac{2}{\lambda^{2}}\left(1-e^{-\lambda K}\right)-\frac{2}{\lambda} K e^{-\lambda K}, \end{aligned}
\begin{aligned} E\left[X_{e}\right] & =\int_{K}^{C}(x-K) \lambda e^{-\lambda x} d x+(C-K) S(C) \\ & =\frac{1}{\lambda}\left(e^{-\lambda K}-e^{-\lambda C}\right), \end{aligned}
and
\begin{aligned} E\left[X_{e}^{2}\right] & =\int_{K}^{C}(x-K)^{2} \lambda e^{-\lambda x} d x+(C-K)^{2} S(C) \\ & =\frac{2}{\lambda}\left[\frac{1}{\lambda}\left(e^{-\lambda K}-e^{-\lambda C}\right)-(C-K) e^{-\lambda C}\right] \end{aligned}
It follows that
\operatorname{Var}\left[X_{p}\right]=\frac{1}{\lambda}\left[\frac{1}{\lambda}\left(1-e^{-2 \lambda K}\right)-2 K e^{-\lambda K}\right]
\begin{aligned} \operatorname{Var}\left[X_{e}\right] & =\frac{1}{\lambda^{2}}\left[\begin{array}{c} e^{-\lambda K}\left(2-e^{-\lambda K}\right) \\ -e^{-\lambda C}\left(2+e^{-\lambda C}\right)+2 e^{-\lambda(C+K)} \end{array}\right] \\ &- \frac{2(C-K)}{\lambda} e^{-\lambda C} . \end{aligned}
Therefore,
\operatorname{Var}[X]-\operatorname{Var}\left[X_{p}\right]=\frac{e^{-\lambda K}}{\lambda}\left(2 K+\frac{1}{\lambda} e^{-\lambda K}\right)>0
\begin{aligned} \operatorname{Var} & {\left[X_{e}\right]-\operatorname{Var}[X] } \\ = & \frac{1}{\lambda^{2}}\left[\begin{array}{c} e^{-\lambda K}\left(2-e^{-\lambda K}\right)-e^{-\lambda C}\left(2+e^{-\lambda C}\right) \\ +2 e^{-\lambda(C+K)}-1 \end{array}\right] \\ & -\frac{2(C-K)}{\lambda} e^{-\lambda C} . \end{aligned}
If λ = 1, K = 1, and C = 4, then
\operatorname{Var}\left[X_{e}\right]-\operatorname{Var}[X]=-0.533<0 .
Indeed, if we look at the variance instead of the CV, then Theorem 4.1 implies that
\begin{aligned} \operatorname{Var}[X] & =\operatorname{Var}\left[X_{p}+X_{e}\right] \\ & =\operatorname{Var}\left[X_{p}\right]+\operatorname{Var}\left[X_{e}\right]+2 \operatorname{Cov}\left(X_{p}, X_{e}\right) \\ & \geq \operatorname{Var}\left[X_{p}\right] \text { or } \operatorname{Var}\left[X_{e}\right] . \end{aligned}
In view of the above discussion, we would like to see whether
is a monotonic function of Since we have shown that and are both nondecreasing functions of one might expect that is also a nondecreasing function of However, the next example shows that the answer is again negative.Example 4.3. As in Example 4.1, we still consider the case in which N is degenerate at 1 and there is an individual loss, X, following the exponential distribution with a hazard rate of λ = 1. Recall that G(K) = Corr(Ap, Ae) = Corr(Xp, Xe). It follows from Examples 4.1 and 4.2 that
G(K)=\frac{\left(e^{-K}-e^{-C}\right)\left(K-1+e^{-K}\right)}{\left(1-e^{-2 K}-2 K e^{-K}\right)} .
Table 4.2 gives some numerical values of G(K) when C = 10; it shows that G(K) is not a monotonic function of K.
Under the same assumption, we consider Corr(Ap, Ae) as a function of the state limit, C; that is, H(C) = Corr(Ap, Ae). Table 4.3 gives some numerical values of H(C) when K = 0.2; it suggests that H(C) might be a nonincreasing function of C.
Since Ap is independent of C, intuitively one would expect H(C) to be a monotonic function of C. The next theorem confirms this intuition.
Theorem 4.3. For a single claim, X, the correlation coefficient, Corr(Xp, Xe), between the primary loss and the excess loss is a nonincreasing function of C.
Proof:
By Theorem 4.1, we have
\operatorname{Corr}\left(X_{p}, X_{e}\right)=\frac{\left(\int_{0}^{K} F(x) d x\right)\left(\int_{K}^{C} S(x) d x\right)}{\sigma_{X_{p}} \sigma_{X_{e}}} .
Since the terms
and are both independent of and and are nonnegatively correlated, we obtainH(C)=\frac{\left(\int_{K}^{C} S(x) d x\right)^{2}}{\sigma_{X_{e}}^{2}}=\frac{\mu_{X_{e}}^{2}}{\sigma_{X_{e}}^{2}},
showing that H(C) is a nondecreasing function on R+. We also have
\frac{\partial}{\partial C} \mu_{X_{e}}^{2}=2 S(C) \mu_{X_{c}} ;
and
\frac{\partial}{\partial C} E\left[X_{e}^{2}\right]=2(C-K) S(C)
It follows that
\frac{\partial}{\partial C} \operatorname{Var}\left[X_{e}\right]=2 S(C)\left[(C-K)-\mu_{X_{e}}\right]
Therefore,
\begin{aligned} H^{\prime}(C) & =\frac{2 S(C) \mu_{X_{e}} \sigma_{X_{e}}^{2}-2 S(C)\left[(C-K)-\mu_{X_{e}}\right] \mu_{X_{e}}^{2}}{\sigma_{X_{e}}^{4}} \\ & =\frac{2 S(C) \mu_{X_{e}}\left[\sigma_{X_{e}}^{2}-(C-K) \mu_{X_{e}}+\mu_{X_{e}}^{2}\right]}{\sigma_{X_{e}}^{4}} \end{aligned}
\begin{array}{l} =\frac{2 S(C) \mu_{X_{e}}\left[E\left(X_{e}^{2}\right)-(C-K) \mu_{X_{e}}\right]}{\sigma_{X_{e}}^{4}} \\ =\frac{2 S(C) \mu_{X_{e}}\left[\begin{array}{c} \int_{K}^{C}(x-K)^{2} d F(x) \\ -(C-K) \int_{K}^{C}(x-K) d F(x) \end{array}\right]}{\sigma_{X_{e}}^{4}} \\ =\frac{2 S(C) \mu_{X_{e}} \int_{K}^{C}(x-K)(x-C) d F(x)}{\sigma_{X_{e}}^{4}} \leq 0 . \end{array}
Theorem 4.3 says that the degree of correlation between Xp and Xe becomes smaller as C is getting larger. For a possible actuarial application, we consider NCCI’s experience rating plan. Suppose the data for the excess loss is lost due to a recent cyber attack, but the data for the primary loss is still available. Now if the primary loss component is larger than average and the state limit is high, then actuaries know that there is a good chance the excess loss component will be larger than average. On the other hand, a quite low state limit gives more support to the possibility that the excess loss component will be larger than average.
As we pointed out earlier, two different claims, Xi and Xj, need not be independent in the generalized collective risk model; it is interesting to see whether Xip and Xje are also nonnegatively correlated, that is, whether Cov(Xip, Xje) ≥ 0 holds. The next result shows that the answer is affirmative.
Theorem 4.4. Let Xi and Xj be two distinct claims in the generalized collective risk model. Then the covariance between the primary loss of Xi and the excess loss of Xj is given by
\operatorname{Cov}\left(X_{i p}, X_{j e}\right)=\operatorname{Cov}_{\theta}\left[E\left(X_{i p} \mid \theta\right), E\left(X_{j e} \mid \theta\right)\right] . \tag{4.1}
Moreover, Cov(Xip, Xje) ≥ 0; that is, Xip and Xje are nonnegatively correlated.
Proof:
To establish Equation (4.1), we apply the conditional covariance formula to and use the fact that the are conditionally iid given to get
\begin{aligned} \operatorname{Cov}\left(X_{i p}, X_{j e}\right)= & E_{\theta}\left[\operatorname{Cov}\left(X_{i p}, X_{j e} \mid \theta\right)\right] \\ & +\operatorname{Cov}_{\theta}\left[E\left(X_{i p} \mid \theta\right), E\left(X_{j e} \mid \theta\right)\right] \\ = & \operatorname{Cov}_{\theta}\left[E\left(X_{i p} \mid \theta\right), E\left(X_{j e} \mid \theta\right)\right] . \end{aligned}
To see the validity of the second statement, we notice that
\begin{array}{l} \operatorname{Cov}_{\theta}\left[E\left(X_{i p} \mid \theta\right), E\left(X_{j e} \mid \theta\right)\right] \\ \quad=E_{\theta}\left[E\left(X_{i p} \mid \theta\right) E\left(X_{j e} \mid \theta\right)\right]-E\left[X_{i p}\right] E\left[X_{j e}\right] \\ \quad=E_{\theta}\left[\left(E\left(X_{i p} \mid \theta\right)-E\left[X_{i p}\right]\right) E\left(X_{j e} \mid \theta\right)\right] \\ \quad \geq 0 . \end{array}
Theorem 4.4 also has some potential applications in actuarial science. Let us continue to consider the example discussed following Theorem 4.3—that is, the data for all past excess losses is lost due to a recent cyber attack, but the data for past primary losses is still available. Suppose the data shows that past primary losses were very large relative to their average. Then without obtaining any new data, an actuary may infer that the excess component of the next loss is likely to be large.
Example 4.1 shows that Cov(Ap, Ae) is not a monotonic function of K. It is interesting to see from Theorem 4.1 that Cov(Xp, Xe) is a nondecreasing function of C. Also note that Cov(Xip, Xje) is not a monotonic function of K, as the next example shows.
Example 4.4. Suppose that, given the risk parameter θ, all claims X1, . . . , Xn, . . . are conditionally iid with a uniform distribution on (0, θ), and that θ follows a uniform distribution on (1, a), where 0 ≤ K ≤ C ≤ θ and a > 1. Then we have
\begin{aligned} \left.E\left[X_{i p} \mid \theta\right]\right) & =\int_{0}^{K} x \frac{1}{\theta} d x+K\left(\frac{\theta-K}{\theta}\right) \\ & =\frac{K(2 \theta-K)}{2 \theta}, \end{aligned}
and
\begin{aligned} \left.E\left[X_{j e} \mid \theta\right]\right) & =\int_{K}^{C}(x-K) \frac{1}{\theta} d x+(C-K)\left(\frac{\theta-C}{\theta}\right) \\ & =\frac{(C-K)(2 \theta-K-C)}{2 \theta} . \end{aligned}
Therefore,
\begin{aligned} E\left[X_{i p}\right] & =\int_{\Theta} \frac{K(2 \theta-K)}{2 \theta} d F(\theta) \\ & =\frac{K}{2(a-1)} \int_{1}^{a} \frac{(2 \theta-K)}{\theta} d \theta \\ & =\frac{K}{2(a-1)}[2(a-1)-K \ln a] \\ & =K-\frac{K^{2} \ln a}{2(a-1)}, \end{aligned}
\begin{aligned} E\left[X_{j e}\right] & =\int_{\Theta} \frac{(C-K)(2 \theta-K-C)}{2 \theta} d F(\theta) \\ & =\frac{(C-K)}{2(a-1)} \int_{1}^{a} \frac{(2 \theta-K-C)}{\theta} d \theta \\ & =\frac{(C-K)}{2(a-1)}[2(a-1)-(K+C) \ln a] \\ & =(C-K)-\frac{\left(C^{2}-K^{2}\right) \ln a}{2(a-1)}, \end{aligned}
and
\begin{aligned} E_{\theta} & {\left[E\left(X_{i p} \mid \theta\right) E\left(X_{j e} \mid \theta\right)\right] } \\ & =\int_{\Theta} \frac{K(C-K)(2 \theta-K-C)(2 \theta-K)}{4 \theta^{2}} d F(\theta) \\ & =\frac{K(C-K)}{4(a-1)} \int_{1}^{a} \frac{(2 \theta-K-C)(2 \theta-K)}{\theta^{2}} d \theta \\ & =\frac{K(C-K)}{4(a-1)}\left[\begin{array}{c} 4(a-1)-(4 K+2 C) \ln a \\ +K(K+C)(1-1 / a) \end{array}\right] . \end{aligned}
If a = 5, C = 1, and G(K) = Cov(Xip,Xje), then, as Table 4.4 shows, G(K) is not a monotonic function of K.
Table 4.5 gives some values of Cov(Xip, Xje) for different values of C when a = 5 and K = 0.25.
Table 4.5 suggests that H(C) = Cov(Xip, Xje) might be a nondecreasing function of C. The next result proves this fact.
Theorem 4.5. Let Xi and Xj be two distinct claims in the generalized collective risk model. Then the covariance, Cov(Xip, Xje), between the primary loss of Xi and the excess loss of Xj is a nonnegative, nondecreasing function of C.
Proof:
If then in view of Theorem 4.1, we need only to show that is nondecreasing. We have
\begin{aligned} H(C)= & E_{\theta}\left[E\left(X_{i p} \mid \theta\right) E\left(X_{j e} \mid \theta\right)\right] \\ & -E_{\theta}\left[E\left(X_{i p} \mid \theta\right)\right] E_{\theta}\left[E\left(X_{j e} \mid \theta\right)\right] \\ = & \int_{\Theta} E\left(X_{i p} \mid \theta\right)\left(\int_{K}^{c} S_{x \mid \theta}(x) d x\right) d F(\theta) \\ & -\left(\int_{\Theta} E\left(X_{i p} \mid \theta\right) d F(\theta)\right) \\ & \left(\int_{\Theta}\left(\int_{K}^{c} S_{x \mid \theta}(x) d x\right) d F(\theta)\right) . \end{aligned}
Since E(Xip|θ) is independent of C,
\begin{aligned} H^{\prime}(C)= & \int_{\Theta} E\left(X_{i p} \mid \theta\right) S_{x \mid \theta}(C) d F(\theta) \\ & -\left(\int_{\Theta} E\left(X_{i p} \mid \theta\right) d F(\theta)\right)\left(\int_{\Theta} S_{x \mid \theta}(C) d F(\theta)\right) \\ = & \int_{\Theta} E\left(X_{i p} \mid \theta\right)\binom{S_{x \mid \theta}(C)}{-\int_{\Theta} S_{x \mid \theta}(C) d F(\theta)} d F(\theta) \geq 0 . \end{aligned}
Therefore, H′(C) is nondecreasing function of C.
Examples 4.1 and 4.3 imply that neither Cov(Ap, Ae) nor Corr(Ap, Ae) is a monotonic function of K. However, the following theorem shows that H(C) = Cov(Ap,Ae) is a nondecreasing function of C.
Theorem 4.6. For aggregate loss A in the generalized collective risk model, the covariance, Cov(Ap, Ae), between the primary loss and the excess loss is a nondecreasing function of C.
Proof:
We have
\operatorname{Cov}\left(A_{p}, A_{e}\right)=\operatorname{Cov}\left(\sum_{i=1}^{N} X_{i p}, \sum_{j=1}^{N} X_{j e}\right)
\begin{aligned} = & \sum_{j=1}^{N} \sum_{i=1}^{N} \operatorname{Cov}\left(X_{i p}, X_{j e}\right) \\ = & \sum_{i=1}^{N} \operatorname{Cov}\left(X_{i p}, X_{i e}\right) \\ & +2 \sum_{i=1}^{N-1} \sum_{j=i+1}^{N} \operatorname{Cov}\left(X_{i p}, X_{j e}\right) . \end{aligned}
Now the conclusion follows from Theorem 4.1 and Theorem 4.5.
Table 4.6 concludes this section by summarizing its key results.
5. Discussion
This paper has studied the properties of the primary loss and the excess loss in NCCI’s split experience rating plan. It has shown that Table 4.6.
in the collective risk model, generalizing a result obtained in the prior literature. Then it showed that the same conclusion holds in the generalized collective risk model. The last part of the paper investigated the covariance and correlation coefficient between the primary loss and the excess loss in the generalized collective risk model. The key results are summarized inIn view of Theorem 4.5 and Theorem 4.6, we would like to know whether Corr(Xip, Xje) is a nondecreasing function of both K and C. Our numerical analysis suggests that the answer might be affirmative. But a proof is elusive to us. Hence, we pose this open problem as a conjecture.
Conjecture 5.1. Let Xi and Xj be two distinct claims in the generalized collective risk model. Then the correlation coefficient, Corr(Xip, Xje), between the primary loss of Xi and the excess loss of Xj is a nonnegative, nondecreasing function of K. It is also a nonnegative, nondecreasing function of C.
We also conjecture that H(C) = Corr(Ap, Ae) is a nondecreasing function of C.
Conjecture 5.2. For aggregate loss A in the generalized collective risk model, the correlation coefficient, Corr(Ap, Ae), between the primary loss and the excess loss is a nondecreasing function of C.
This paper has focused on developing some further results in the foundational theory of split credibility, though it has also discussed some potential applications of our results. Since we have limited knowledge, many potential applications may remain to be discovered. It is our hope that the results in this paper can stimulate some further work along this line.
Acknowledgments
Thanks are due to three anonymous reviewers for their thoughtful comments and suggestions, which led to several substantial improvements in this article. Thanks also go to Ryan Martin, who made the author realize that the usual setup of credibility theory entails an exchangeable model.