1. Introduction
The common shock model, which was introduced by Heckman and Heckman (1983), Meyers (2005, 2007), and Mildenhall (2006) and is also called contagion modeling by White (2012), is a recent approach to modeling the systematic effect of changing insurance industry and general economic environments on losses. These effects include factors specific to the insurance industry, such as the so-called underwriting cycle, as well as cycles in the overall business environment, such as economic recessions and legal changes. Often the effect of such an environmental change is to cause losses to exhibit similar behavior both within, as well as across, companies or lines of business. This behavior often takes the form of a correlation of losses within or across companies or lines of businesses. The common shock model or contagion model method attempts to capture and account for the effects of such an influential environment, which is referred to as a contagious environment. These models accomplish this by determining common factors which can be incorporated separately into both the frequency and severity distributions of effected losses. For the sake of simplicity, we use the single name contagion model to refer to the common shock model.
This paper is structured as follows: section 2 will first briefly review contagion, as applied within frequency distributions. We use the context of the negative binomial distribution to illustrate the application of frequency contagion. In section 2, we further present a new method for incorporating contagion within the binomial distribution. We remark that an incorporation of contagion within the binomial distribution, resulting in positive correlation, has not appeared in the literature, and previous authors have commented on the difficulty of achieving this. Section 3 describes how the effect of a contagious environment can be modeled within the severity component of the collective risk model. Next, section 4 describes the aggregate loss distribution under contagion modeling. Section 5 presents an elementary method that can be used to easily calibrate contagion models to empirical data. Last, section 6 provides a case study on the application of contagion modeling to general insurance losses. In particular, section 6 illustrates the application of the elementary calibration scheme from section 5 to real-world insurance data and makes clear the potential deficiency of traditional collective risk modeling as applied to aggregate annual layered losses.
2. Frequency contagion
In what follows, Ni, represents a claim count random variable (RV), where the subscript i, when present, can be interpreted as signifying the ith line-of-business. C represents the random variable which interacts with the distribution of Ni so as to induce a contagious effect. In what follows, it will be illustrated how C often acts in a multiplicative manner on the parameters of the distribution of Ni. In this context, the contagion random variable can be viewed as representing the randomness of the parameter value, as in Bayesian analysis. In what follows, we refer to C as the contagion RV. The distribution of C can take any form, which we denote by C ∼ f(E[C], Var[C]). The only restrictions on C are:
-
C has positive support, or that all values of C are greater than 0.
-
E(C) = 1 (the expected value of the RV C is 1)
-
Var(C) = c < ∞
The quantity c also plays an important role. As c represents the variance of the contagion random variable, we also require that c ≥ 0, and refer to c as the contagion parameter.
The canonical distribution that is often used to introduce frequency contagion is the Poisson distribution. For a description of the incorporation of contagion within the Poisson distribution, the reader is referred to Meyers (2007). The literature also contains a procedure which results in a model that is equivalent to introducing contagion within the negative binomial. However, it is the belief of the authors that the traditional parameterization of the negative binomial facilitates a greater understanding of the nature of contagion modeling and also makes clear the ease of implementation. Moreover, as some statistical software is not amenable to the mean/variance parameterization of the negative binomial, some practitioners may benefit from a more explicit algorithm in this situation.
In order to explicitly illustrate how contagion can be directly introduced within a negative binomial distribution the Klugman (2012), or r, β parameterization is more useful:
Pr(N=n)=(r+n−1n)(11+β)r(β1+β)n for n≥0 and β>0,r∈R+
Since the negative binomial distribution has two parameters, the contagion RV could potentially operate on the parameters r, β of the negative binomial in a multitude of ways. However, for the sake of brevity, we straightforwardly present the particular operation which is consistent with the presentation of negative binomial contagion in the literature. In a quite analogous manner to Poisson contagion, the contagion RV, C, can simply be multiplied with the parameter β, and leave the r parameter unaffected. Hence, we can describe the negative binomial distribution, under contagion, by:
N∣C∼Negbin(r,Cβ) with C∼f(E[C]=1,Var[C]=c) where C∈R+
The r and β parameterization of the negative binomial allows the incorporation of the contagion RV to be clearly illustrated. However, the characteristics of the resultant model become more clear when the negative binomial is parameterized using the mean of λ, and a dispersion parameter, γ, which is related to variance-to-mean ratio: 1 + γλ, i.e., N ∼ NegBin(λ, γ). For completeness, we note that this parameterization is related to the r and β, parameterization via N ∼ NegBin(r = 1/γ, β = γλ). Under this λ, γ parameterization, the mean λ is multiplied by the contagion RV, i.e., N|C ∼ NegBin(Cλ, γ). Notice that if C takes the constant value of 1, with probability 1, then N will be an ordinary negative binomial distribution with mean λ and variance λ(1 + γλ), or rβ and rβ(1 + β). Further, it can be seen that E(N) = EC[EN(N|C)] = EC[C · λ] = EC [C] · λ. But, since C is required to have an expected value of 1, or E[C] = 1, we have
E(N)=λ.
Further, since we denote Var[C] by c, it can easily be seen that
Var(N)=λ(1+λ(c+cγ+γ)).
Equation (2.4) shows that if the contagion parameter is 0, then the distribution of N reverts to the original negative binomial, whereas as c grows, the variance increases.
The introduction of contagion, across
lines-ofbusiness, for which the claim count of each is modeled with a negative binomial frequency distribution, can be denoted by: for and: Next, assume that parameters of the best-fitting negative binomial have been fit to the data for each line of business. It is important to note that each can have different parameters, and or and after fitting to the respective data. In this case, it can be seen that the resulting correlation between and for and takes the following form:ρNi,Nj=√cλi1+λi(c+cγi+γi)√cλj1+λj(c+cγj+γj)
Again, it is important to note that the correlation between Ni and Nj not only depends on the same contagion parameter, c, but also on the parameter values of each distribution; γi and γj. Hence, a unique correlation can be induced between each Ni and Nj, even though the same contagion parameter, c, is used. Equation (2.5) also shows that the level of the contagion parameter, c, impacts the correlation between Ni & Nj in a way that makes intuitive sense. Namely:
-
As
which represents a weak, or absent, contagious environment, -
As
which represents a strong contagious environment,
Figure 1 depicts the pair-wise correlation between and as a function of and also for different combination of the parameters, and It can be seen from Figure 1 that when the dispersion parameters are high, there is less correlation. There is also a trade-off between the dispersion parameters and the means, Even when the dispersion parameters are relatively large, sufficiently large values of the means, will offset the deleterious effect on the correlation caused by large dispersion parameters. Last, note that the Poisson contagion model can be achieved by setting the dispersion parameter equal to 0, and the formulas for the mean, variance, and correlation can be achieved similarly.
2.1. Binomial-beta contagion
Within actuarial science, claim count RV’s are often assumed to be a member of the (a, b, 0) class, which includes the Poisson, negative binomial, and binomial distributions. For information on the (a, b, 0) class, the reader is referred to Klugman (2012). Further, it has been the author’s experience that the binomial distribution is used in practice when the variance of event count data is significantly less than the corresponding mean. Moreover, we note that the existing literature on contagion modeling, including the work of Meyers (2005, 2007), considers the binomial distribution. For these reasons, we feel that the binomial distribution should not be ignored. To the authors’ knowledge, a viable method for inducing positive correlation, via contagion, between binomially distributed random variables is absent from the literature. The results of this section fill this void by presenting an original formulation for the introduction of contagion among binomially distributed random variables that results in positive correlation.
Before proceeding, the authors wish to impress upon the reader that not every association (operation) of the contagion RV with (on) the parameters of the claim count distribution will lead to a valid contagion model. In particular, in order for an association (operation) of the contagion RV to be valid, it must result in a claim count distribution with the following characteristics:
i) The mean of
ii) As the variance of the contagion increases:
a. The variance of should increase.
b. should increase.
iii) As the variance of the contagion goes to 0 :
a. the variance of should equal its value in the absence of contagion.
b. should go to 0 .
In addition to the preceding requirements, the following condition is often desired:
i. For a given level of the contagion parameter, the correlation between claim count RVs with large means should be stronger than the correlation between claim count RVs with small means, given that the volatilities are held constant.
The probability density for a binomial distribution with parameters ni and pi can be written as
N∼Binomial(n,p)=(nN)pN(1−p)n−N where 0≤N≤n and n∈R+,0<p≤1
First, consider a single binomial distribution. Next, assume that the best-fitting binomial distribution has been fit to the data, and results in the parameter estimates, p̂ and n̂. Now, we assume that N ∼ Binomial (n̂, p), where p is the only parameter that is considered to be random, i.e., the parameter n is evaluated at n̂. Next, consider the following prior distribution for p, where c is the contagion parameter:
p∼Beta[α=1c,β=(1c)1−ˆpˆp]
Note that both p̂ and c are constants in this beta distribution. Using this distribution for p we can check that the resulting posterior predictive distribution for N has the desired properties. In particular, it can be seen that the mean is unaffected, while at the same time, larger values of c result in a larger variance. It can be easily seen that: E(N) = Ep [EN (N|p)] = n̂ · p̂. Thus the mean of the beta-binomial mixture equals the mean of the original, best-fitting, binomial, as required. It can also be shown that the variance of the beta-binomial mixture is Var(N) = n̂p̂(1 − p̂) × (1 + n̂p̂c) × (1/(1 + p̂c)). Notice that when c → 0, Var(N) → n̂p̂ (1 − p̂), and as c → ∞ ⇒ Var(N) → n̂[n̂p̂ (1 − p̂)] > n̂p̂ (1 − p̂).
Now, let Ni be the number of claims for the ith line of business, out of K lines of business. Again, assume that the best-fitting binomial distribution has been determined for each line. Denote the parameter estimates which specify these best-fitting binomial distributions by {n̂i, p̂i|1 ≤ i ≤ K}. We propose the following procedure for inducing contagion between binomial claim count RVs:
Ni|p∼Bin(ˆni,(ˆpiˆp∗)p) for 1≤i≤K and p∼Beta(α=1c,β=1c(1−ˆp∗ˆp∗))
where p̂* = max(p̂1, p̂2, . . . , p̂n), and p̂i is the best-fitting parameter estimate, for line i. We now illustrate that the preceding combination of claim count distributions, and the prior distribution, have the desired characteristics. First, it can easily be shown that for 1 ≤ i ≤ K, E(Ni) = Ep[ENi(Ni|p)] = n̂i · p̂i Hence, the proposed model preserves the mean of each of the marginal distributions, as required. Next, we verify that the variance of each marginal has the desired characteristics. Further, it can be shown that, for 1 ≤ i ≤ K, the variance for a single binomial RV is
Var(Ni)=ˆniˆpi(1−ˆpi)1+cˆp∗+cˆp∗[ˆniˆpi(1−ˆpiˆp∗)]1+cˆp∗+cˆp∗[(ˆniˆpi)2(1ˆp∗−1)]1+cˆp∗
Though it may appear complicated, several important observations can be made from this expression. First, it can be seen that:
-
as c → 0, we have that: Var (Ni) → n̂ip̂i (1 − p̂i), which is the variance of the best-fitting binomial distribution to the data from line i, in the absence of contagion.
-
Var (Ni) is an increasing function of c.
-
as c → ∞, we have
Var(Ni)→ˆniˆpi(1−ˆpiˆp∗)+(ˆni)2(ˆpi)2(1ˆp∗−1).
A little effort reveals that observation 3 illustrates that the variance behaves as required.
Next we investigate the correlation between Ni and Nj under the proposed binomial contagion model. It can be shown that the formula for the correlation between the number of claims in the ith and jth lines of business, where 1 ≤ i, j ≤ K, and i ≠ j, is
ρN1,N2=ˆniˆpiˆnjˆpj(c(1−ˆp∗)1+cˆp∗)σiσj
where σi2 is as in equation (2.9).
From Equation (2.10), several observations regarding
as a function of can be made. For and we have:- as we have
- in an increasing function of
- as it can be shown that:
ρNi,Nj→1√(1+(ˆp∗−ˆpi)ˆniˆpi(1−ˆp∗))(1+(ˆp∗−ˆpj)ˆnjˆpj(1−ˆp∗))
Figure 2 illustrates the pair-wise correlation, under binomial contagion, for various levels of p̂*, p̂1, and p̂2. n̂ = 5 is used throughout.
Note that, as for the Poisson and negative binomial models, different levels of correlation between different pairs of RV’s can be achieved, despite the use of a single contagion parameter. This is so, since the pair-wise correlations depend on the parameter values of the best fitting binomial distributions n̂i, p̂i, and n̂j, p̂j, for each line, as well as on c. From Figure 2, it can also be seen that the correlation is higher between lines for which both p̂i and p̂j are relatively large, and hence have a relatively large number of counts, on average (n̂i constant). Also note, from the pairs of black, red, and blue lines, that for the particular levels of p̂j in this example, a higher value of p̂* results in lower correlations.
3. Severity contagion
In traditional severity modeling, the assumption that all severities are independent is made, largely, to make both the mathematics and implementation easier. This is the case, despite the fact that patterns in loss severities, over time, have been documented by practitioners. One such example is the so-called property and casualty insurance underwriting cycle. Further, some believe that loss trends are a significant driver of premium rates. Moreover, it is reasonable to assume that loss trends are, in turn, driven by a set of endogenous factors, such as changes in underwriting standards and policy terms, and exogenous factors, which include the legal and regulatory environment. In the traditional contagion modeling framework, the goal is to partition a company’s portfolio into a set of mutually exclusive and exhaustive subsets, where the combined effect of these drivers of loss trend is relatively constant within each subset and varies more markedly across subsets. Moreover, traditional contagion modeling attempts to capture the combined effect of all drivers, within any such subset, with only a small set of contagion parameters. Hence, the distinct levels of the contagion parameters represent what’s referred to as a contagious environment. We remark that some may consider the goal of achieving such a parsimonious model to be quixotic; however, one should keep in mind the alternative. It is well known that calibrating correlation matrices is notoriously difficult, especially for a large number of variables.
Another well-known characteristic of traditional contagion modeling is its ability to represent parameter uncertainty within simulation models. A byproduct of this inclusion of parameter uncertainty is an inflation in the variance of the marginal loss distributions. In this section we propose an alternative to traditional contagion modeling that can be viewed as an option for more conservative (re)insurers, who may be uneasy with the increase in variance associated with the traditional model, or (re)insurers who place a large amount of significance in their data. As with traditional contagion modeling, the proposed form of severity contagion induces correlation; however, the proposed method has the benefit of minimizing the distortion of the variance of the marginal distributions. We represent the observed loss severity of the
line of business by and denote that follows a distribution with mean and, variance by The proposed method accomplishes this by supposing that the size of each loss can be decomposed into two components. The first component is governed solely by a theoretical, unadulterated, underlying, loss distribution, and the second component represents the impact of the contagious environment on the size of loss. The former can be loosely thought of as an idiosyncratic component and the latter a systematic component. Consider the losses from the line of business. We denote the idiosyncratic component by the and the systematic component by the Specifically, we assume that can be decomposed into the product of two where is the severity contagion and the underlying loss Analogous to frequency contagion, we use to represent the contagion parameter. The motivation for this decomposition of arises mainly from the desire to separate the portion of the variation of that is solely due to the random claims generation process from the portion of the variance that is due to the contagious environment. Further, it should be noted that the proposed decomposition of the observed losses requires the elucidation of information from the data, over and above that required by existing severity contagion models. As is usually the case, there is a cost associated with this additional precision. In this case, the additional cost comes in the form of increased complexity in calibrating the model. Specifically, instead of the need to only determine the best-fitting contagion parameter, b (to be defined shortly), it is also necessary to determine the RV, Zk, which best describes the pure loss process. Before proceeding, the authors wish to point out that they are not implying that the variance inflation that is associated with traditional contagion modeling is a shortcoming of the traditional severity contagion model.As with the traditional version of severity contagion, the only requirements on the RV β are that its distribution has positive support and that its mean be 1. Aside from this, β can have any distributional form, which we denote by β ∼ f(E(β), Var(β)). Also, since the observed losses are modeled by the product of RV’s, i.e., β · Zk, or using slightly simpler notation; βZk, additional requirements must be imposed, namely that:
-
β be independent of Zk, and
-
E(Zk) = E(Xk).
In summary, the proposed severity contagion method models the size of observed losses by
where and with and, further, such that and Further, since the variance of represents only a small fraction of the overall variance of the assumption that is a member of the same family of distributions as should be, at most, venial.We note that the proceeding model formulation has the desired characteristics, namely, that the mean of
is preserved by the proposed model, since and that the variance has the following form:Var[βZk]=σ2zk+b(μ2k+σ2zk).
We now consider the reasonableness of the proposed severity contagion method. If the overall severity model accurately represents the impact of a contagious environment on the pure loss generation process, and a calibration scheme is able to effectively isolate the contribution of
to the variation of the data, then, at least theoretically, it should be possible to determine parameters, and such that: This makes sense, since:-
In the absence of a contagious environment, it should be inferred that
and hence which implies that or that is small. -
Alternatively, in the presence of a strong contagious environment, it should be inferred that
which, by the same argument, implies that or in other words that
Conversely, under the same assumption that
we have that:-
implies that which implies a weak contagious environment, and
-
implies that which implies a strong contagious environment.
To illustrate the difference between the traditional and proposed severity contagion models, by fitting
to the data, and then multiplying by as in the traditional contagion model, we have that data As mentioned previously, this distortion of the variance would likely not be of concern to (re)insurers who have limited data, or ascribe less credibility to their data, or who may even believe that the observed data does not accurately represent the true, theoretical, variance.We now investigate the proposed model’s ability to induce correlation between two or more loss
's. Let Dist denote the best-fitting distribution to the data from the line of business, and Dist denotes the best-fitting distribution to the data from the line of business. Further, assume that and can be, sufficiently, modeled as the product of which we denote by and where is independent of and and is independent of Further, assume that the parameters of and are and Then, it can be seen that the covariance between and is:Next, this result can be used to show that the correlation between
and is:ρβZk,βZj=b√[b+(1+b)CV2Zk][b+(1+b)CV2Zj]
Figure 3 illustrates the correlation induced between two lines of business as a function of the contagion parameter, b, and the CV’s of Zk and Zj: CV2zk = σ2zk/μ2k, and CV2zj = σ2zj/μ2j). Further, it can be seen that when the variation in loss size is small, the correlation which can be achieved is higher, which is consistent with intuition.
4. Aggregate loss contagion model
In this section, we describe how both frequency and severity contagion can be combined within the collective risk model (CRM). Let
represent frequency distribution for line under contagion, i.e., Poisson with Further, assume that the best-fitting severity distribution to line is where and represent the mean and variance of the distribution of and that it is possible to determine parameters and such that where is independent of as in Section 3. Then, we can simply replace with and with within the traditional CRM, to arrive at the aggregate loss distribution under both frequency and severity contagion, which we denote by :S∗i=N∗i∑k=1βZik
Despite the addition of contagion, it can be shown that the mean of the CRM remains unchanged, i.e., E[S*] = λ · μ. Also, it can be shown that the variance of the aggregate loss, under both frequency and severity contagion, is
Var[S∗]=Var[Zi]E(N∗)+(E[Zi])2Var(N∗)+b⋅{Var[Zi]E(N∗)+(E[Zi])2E(N∗2)}.
We remark that this formula works for any frequency and severity distributions. One can simply substitute in the formulas for the mean and variance of the frequency, under contagion, which we denote by E(N*) and Var(N*). E(N*) and Var(N*) will depend on the type of frequency distribution used but will not depend on the distributional form of the contagion RVs; C, or β. Hence, Equation (4.2) is a fully general formula.
We now investigate the correlation between the aggregate losses from two lines, S*k and S*j, where k ≠ j. It can be shown that the correlation between S*k and S*j, for k ≠ j, is
ρs∗ks∗j=λkμkλjμj√(∑k+b⋅[∑k+(μkλk)2])×(cb+b+c)√(∑j+b⋅[∑j+(μjλj)2])
Note that ∑i = Var[Zi]E(N*) + (E[Zi])2 Var(N*) is the aggregate loss variance for contaged frequency and un-contaged severity.
Figure 4 illustrates the correlation induced between the aggregate losses of two lines of business as a function of the contagion parameter, b. Each of the nine graphs uses 5 and 10 for the means of the two frequency distributions, i.e., λ1 = 5, and λ2 = 10, and corresponds to one of three levels of the frequency contagion parameter, c = 0.1, 0.5, and 0.8. It can be seen that higher levels of the frequency and severity contagion parameters c and b result in higher correlation between the lines. Also, higher levels of μj result in higher correlation. Conversely, higher levels of σj result in lower correlation. Or, stated another way, the greater the uncertainty in the loss sizes of each line, the more tenuous the relationship between the loss sizes, and hence the lower the correlation.
Moreover, in the special case where the claim frequency for both line j and k, are Poisson, it can be shown that the correlation between S*k and S*j, for k ≠ j, is
ρ∗S∗k,s∗j=1√[(1+b)CV2Sk+[bc+b+c]]×(cb+b+c)√[(1+b)CV2Sj+[bc+b+c]]
5. Implementation and calibration
In this section we illustrate one out of many potential methods for calibrating contagion models to empirical data. The motivation for illustrating this particular calibration procedure is three-fold. First, it is a very simple, and practical, way to calibrate contagion models to per-claim empirical data. Further, there appears to be a dearth of literature that describes, in detail, practical approaches to calibrate contagion models to such per-claim empirical data. In fact, to the authors’ knowledge, this paper represents the first attempt to do so. Second, the authors wish to illustrate that the proposed severity contagion model, specifically, can be calibrated with this simple technique. Third, while the authors recognize that advanced statistical techniques, such as the EM algorithm, GLMs or GLMMs, should ideally be used to calibrate the proposed severity contagion model, where Var[data] ≈ Var[βXk], it is very unlikely that the practicing actuary will have sufficient data at their disposal to attain statistically significant results from these methods.
We first investigate the calibration for a single line of business. The procedure employed will, no doubt, depend on the purpose of the analysis as well as the expert opinion of the practitioner. There are three main steps for calibration of contagion models for aggregate losses:
-
Model fitting—determination of the best-fitting distributions for the frequency and severity, based on the empirical data.
-
Calibration of the frequency contagion model, based on the empirical claim count data.
-
Calibration of both the pure, underlying, loss process, Zi, and also the contagion RV β.
Step 2 is accomplished by exploiting the corresponding formula for the variance of the claim count RV under contagion. For example, when claim counts are modeled with a Poisson distribution, with mean λ, we have that Var(N) = λ(1 + c · λ). As Var(N) can be estimated from the observed annual claim count data, the only unknown quantity is the frequency contagion parameter c. Hence, the above equation can be rewritten to solve for c:
c=Var(N)λ2−1λ
Step 3 calibrates the proposed severity calibration method. As mentioned in section 3, given the inevitable uncertainty surrounding the true underlying loss process, as well as the presumed relatively small variance of β, in most circumstances, there is no harm in assuming that the RV Zk follows the same distributional form as the best-fitting distribution to Xk. Hence, the determination of the distribution of Zk only requires the determination of its variance, σ2z. Again, if sufficient data were available, it would be preferred to use advanced statistical procedures, such as the EM algorithm, GLMs, or GLMMs. However, as this is usually not the case, we describe a simple, straightforward severity calibration method. To accomplish step 3 we perform the following steps:
-
Determine b.
-
Use this b to determine σ2z.
-
Determine the parameters of the severity distribution (Pareto, in this case), which correspond to this σ2z, and use this severity distribution within the simulation.
Using Equations (3.1) and (4.2) if claim frequency is assumed to follow a Poisson distribution with parameter λ, then it can be shown that
b=Var[S∗]−λ⋅Var[βZ]−λμ2z−λ2μ2zcλ2μ2z(1+c)
In Equation (5.2),
is the variance of the empirical aggregate losses, and and can be estimated using the mean and the variation, respectively, of the per-claim loss data. Recalling that has already been estimated, at this stage, all remaining quantities to the right of the equals sign in Equation (5.2) can be estimated from the empirical data. Note that Equation (5.2) only applies to the specific case of a Poisson frequency. However, when contagion is introduced within a negative binomial frequency the corresponding formula can be shown to beb=Var[S∗]−λ⋅Var[βZ]λ2μ2z(1+c+γ+cγ)−λμ2z+λ2μ2zc+λ2μ2zγ(1+c)λ2μ2z(1+c+γ+cγ)
Once b has been determined, the variance of the pure underlying loss RV, Z, can be solved for using Equation (3.1):
σ2z=Var[βZ]−bμ2z1+b≈Var[X]−bμ2z1+b
Finally, the aggregate annual losses, under both frequency and severity contagion, are
S∗=N∗∑i=1βZi
where
N∗∣C∼DistN(C⋅λ) with C∼f(E[C]=1,Var[C]=c)
and
{Zi∼DistX(E[Zi]=μz=μx,Var[Zi]=σ2z),Zi and Zj are independent of each other for i≠jβ∼f(E(β)=1,Var(β)=b)
As indicated above, this is just one method for calibrating contagion models. This particular method is informed by the desire to produce aggregate annual loss simulations with variation close to the variation of the empirical annual aggregate losses. Armed with an understanding of the proposed calibration method for a single line of business, various calibration procedures can be employed to extend the calibration to two or more lines of business. We leave the choice of the specific calibration procedure to the practitioner.
6. Case study: Natural peril data
In this case study, we consider only a single line-of-business. Further, we compare the ability of two models to reproduce key characteristics of a sample of historical data. The key characteristic of the simulated values that we focus our attention on is the coefficient of variation (CV). The two models under consideration are:
-
The traditional CRM, and
-
The CRM under contagion.
Both models 1 and 2 will be calibrated to the actual loss data. In particular, the data will be used to determine both the distributional choices (frequency and severity), as well as parameter values of each. In other words, we allow the data to dictate the calibration both models.
Next we turn our attention to the data used in this case study. The data is actual loss data consisting of 670 property claims from severe convective storms between years 2003 and 2012, inclusive. The data is on an occurrence basis and is in units of $1,000. Due to the proprietary nature of the data, only approximate values of the corresponding aggregate annual losses are shown in Table 1. The approximate nature of these values should be kept in mind, should the reader attempt to verify the results presented in this section using values from Table 1.
We now proceed to the analysis of the data. Since the variance of the annual claim counts Var(N) (≅ 600) is significantly greater than the mean E(N) (≅ 70), we use a negative binomial distribution for the frequency distribution, both within the CRM and the CRM under contagion. Specifically, based on the annual aggregate claim data, we use
p=prob success =E[X]Var[X]≅0.1,r=E[X]2Var[X]−E[X]=8.25
We now turn our attention to calibrating the severity component of the contagion model. Using maximum likelihood (MLE) methods, the best-fitting distribution to the observed per-claim severity is a Pareto distribution. Further, the MLE-based parameters of the best-fitting Pareto are αMLE = 2.982, θMLE = 33,468.
The fit of the Pareto distribution to the observed empirical data is illustrated in Figure 5.
From Figure 5, it can readily be seen that the Pareto provides a good fit to the data. Although the MLE estimates of the parameters can be used within the simulation, we opt to use the method-of-moment estimates, (αMoM, θMoM) = (2.876, 33,470), out of a desire to eliminate any bias in the final comparison between the two models. Also, out of a desire to be transparent, Table 2 summarizes both the method of moments and MLE parameterization of the Pareto distribution.
Again, to be fair, the final calibration of the first model (traditional CRM) uses NegBin(p = 0.11, r = 8.25) and Pareto(α = 2.87, θ = 33,470). We now calibrate the severity component of the second model (CRM under contagion). First, we use Equation (5.2) to determine the severity contagion parameter.
b=Var[S∗]−λ⋅Var[βZ]−λμ2z−λ2μ2zcλ2μ2z(1+c)≈0.13.
The inputs that we used for Equation (5.2) are:
-
The mean of the Poisson frequency distribution (l = 70).
-
The modeled mean of the per-claim severity distribution (μz = 17,842).
-
The modeled variance of the per-claim severity (Var[βZ] = σ2x = 32,3292).
-
The empirical variance of the annual aggregate losses (AAL): (Var[S*] = 697,2452).
Note that Equation (5.2) is used, since the negative binomial distribution (without contagion) is equivalent to the Poisson distribution, under contagion. As illustrated in Meyers (2007), c equals the dispersion parameter of the negative binomial: c = (Var[X] − E[X])/(E[X])∧2 = 0.12.
However, severity calibration for the second model is not complete. Recall that under the proposed severity contagion model, the true, underlying, loss process variance is elicited from the observed variation. As explained in Section 5, a simple, straightforward approach is to use Equation (5.4), together with the value of b:
σ2z=Var[βZ]−bμ2Z1+b=29,6342.
If a statistical software package is used that does not allow the parameterization of the Pareto using the mean and variance, the parameter values can be solved for in terms of the mean and variance. For the Pareto distribution, the corresponding formulas are
α=2Var[Z]Var[Z]−E[Z]2,θ=E[Z](Var[Z]+E[Z]2Var[Z]−E[Z]2)
The resulting parameters are αZ = 3.137 and θZ = 38,133. At this point the calibration of the severity component of model 2 (CRM under contagion) is complete, and we can begin the simulations that are at the core of this study.
First, we define the layers of loss, which will be used for all simulations in the case study: 0 – 7.5M, 7.5M – 20M, 20M – 45M, 45M – 70M, 70M – 100M, and 100M – 200M.
The simulation study is accomplished by repeatedly simulating 10 years of aggregate annual layered losses (AALL) and then calculating the corresponding coefficient-of-variation (CV) of each set of 10-year AALLs. We do this using both AALL estimates from the traditional CRM and estimates from the CRM under contagion. This process is repeated until 100,000 10-year CV estimates are obtained under the traditional CRM and also under the CRM under contagion. Next, box-plots of both sets of CV’s are displayed, along with the empirical AALLs (red dots). For reference, the final parameterization of both models is displayed in Table 3. A summary of the simulation procedure is as follows:
-
For each iteration of the simulation, generate 10 years of Aggregate Annual Losses under both:
-
The traditional CRM
-
The CRM under contagion
-
-
For each of the predefined layers, calculate the annual aggregate layered loss.
-
For each set of 10 AALLs, calculate the CV.
-
Repeat 100,000 times.
- At this point, we have 100,000 10-year CV estimates for each layer.
-
For each layer, create a box-plot using the 100,000 10-year CVs (10th, 25th, 50th, 75th, and 90th percentiles).
Note that different severity parameters were used for the traditional CRM and the CRM under contagion. Again, this is not an accident, for recall that under the proposed severity contagion method, the parameterized severity distribution represents the true underlying loss process, Z, whereas under the traditional CRM, it is the observed severity, X, that is parameterized.
Recall that the empirical CV’s of the aggregate annual layered losses are shown in red. Also, the box-plot of the ground-up losses are shown on the far right column of each graph.
Figure 6 illustrates that model 2 (CRM under contagion) produces simulated AALLs for which the 10-year CVs are more consistent with the observed CVs of the empirical AALLs. Also, recall that both the frequency and severity distributions of both models were calibrated to the observed data. Moreover, note that incorporation of a single contagion parameter allowed for the simulation of 10-year CVs that are more consistent with the observed CVs for all layers as well as for the ground-up losses.
7. Conclusion
Within the insurance industry, the ability to accurately capture variation instead of just the mean has grown over the past several decades. In particular, risk measures such as Value-at-Risk (VaR) and Conditional Tail Expectation (CTE) or Tail-VaR (TVaR) were constructed to capture such distributional qualities. The use of contagion within the traditional CRM will allow actuaries and risk professionals to reap the benefits of these modern risk measures.
In this paper we have investigated the contagion modeling paradigm, as introduced by Meyers (2007). First, we have elucidated, analytically, the impact of this modeling paradigm, in particular the induced correlation. Further, this has been done for frequency, severity, and aggregate contagion models under all of the common frequency and severity distributions. Second, we have developed a workable version of contagion modeling under the binomial distribution that produces positive correlation. Third, we have proposed a form of severity contagion for (re)insurers, with highly credible data, who wish to avoid the variance inflation associated with traditional contagion modeling. We have also described a procedure to calibrate aggregate contagion models, under the proposed severity contagion, based on per-claim empirical data. Last, we have demonstrated the efficacy and ease of use of the proposed calibration procedures using a real-world dataset from the insurance industry. This case study illustrates the well-known limitations of traditional collective risk modeling and provides evidence that contagion modeling provides a systematic way to correct for the inherent limitations of traditional collective risk modeling. It is our hope that this work will help establish contagion modeling, both among academics and practitioners, and also motivate additional research on contagion modeling, as well as on other ways to improve traditional collective risk modeling.
Acknowledgments
The authors wish to recognize the contribution to the following parties:
Benedict M. Escoto, PhD, FCAS, Aon Benfield Chicago, Illinois, for being willing to share his knowledge on contagion modeling and supporting calibration.
Steven B. White, FCAS & Chief Actuary, Guy Carpenter, LLC, for sharing his contagious enthusiasm for contagion modeling with us.
Stuart Sadwin, FCAS, MAAA, SVP, and Chief Corporate Actuary, TMNA Services, LLC, for his invaluable support on the model calibration.