The Skewness of Cape-Cod in a Distribution-Free Model

Eric Dal Moro

1. Introduction

After the famous chain ladder method and the Bornhuetter-Ferguson method (Bornhuetter and Ferguson 1972), the Cape Cod reserving method (hereinafter “CC method”; Bühlmann and Straub 1983) is one of the methods most used by practicing actuaries for the projection of non-life paid or incurred triangles. The CC method relies on very few parameters and is hence favored by practicing actuaries.

Following the development of the prediction error estimate for the chain ladder method (Mack 1993, 1999) and for the Bornhuetter-Ferguson method (Mack 2008), Saluz (2015) proposed an estimate of the prediction error for the CC method. This estimate should be seen in the context of the many attempts to better understand non-life reserving distributions. For example, in relation to the CC method, Clark (2008) proposed a reserving system that allows the actuary to use exposure information, such as on-level premium, even if that information is only available for a limited number of years. Such a reserving system is related to the CC and Bornhuetter-Ferguson methods. Among the different attempts to characterize reserving distributions, three areas of research can be observed:

The Bayesian models: Based on external information, these models attempt to provide information on the final reserve amount and its distribution (Taylor 2015; England, Verrall, and Wüthrich 2012). These models often relate to the Bornhuetter-Ferguson model.
The stochastic models: These models rely on an underlying simulation framework in which parameters are fitted to the existing data. The most common are the bootstrapping model (England and Verrall 2002), GLM models (Merz and Wüthrich 2008b), and models that make assumptions on the chain ladder coefficients (Barnett and Zehnwirth 2000).
The claim development result: In recent years, the necessity to understand the claim development result for the first year within the Solvency 2 framework has led to major breakthroughs around this topic (Merz and Wüthrich 2008a; Siegenthaler 2017). By extension, the claim development result for the full run-off in the chain ladder framework was also developed (Merz and Wüthrich 2014).

In all three areas mentioned above, the focus has always been on the expected value of the reserve amount and its standard deviation. Very few papers have focused on the third or fourth moment of the reserve distribution. Such papers include the skewness estimates for the chain ladder method (Salzmann, Wüthrich, and Merz 2012; Dal Moro 2013) and for the Bornhuetter-Ferguson method (Dal Moro 2021).

In order to complete our knowledge of the CC method in a distribution-free framework, a missing piece is the skewness of the CC method. Based on the work of Saluz (2015), this paper will propose a first approach to estimate this skewness.

2. Notation and data structure

We denote the cumulative claims (cumulative payments or incurred losses) in accident year $i \in \left\{ 0,\ \ldots,I \right\}$ at the end of development year $j \in \left\{ 0,\ \ldots,J \right\}$ by $C_{i,j} > 0$ and we assume J ≤ I. Let $X_{i,j} = C_{i,j} - C_{i,j - 1}$ denote the incremental claims, where we set $C_{i, - 1} = 0.$ The summation over an index starting from 0 is denoted with a square bracket, for example:

$C_{\left\lfloor k \right\rfloor,j} = \sum_{i = 0}^{k}C_{i,j},\ 0 \leq k \leq I,\ 0 \leq j \leq J.$

We assume that all claims are settled after development year J and therefore the total ultimate claim of accident year i is given by $C_{i,J}.$ At time I, we have information in the upper left trapezoid/triangle

$D_{I} = \left\{ C_{i,j}:i + j \leq I,\ j \leq J \right\}$

and our goal is to predict the lower right triangle

$D_{I}^{c} = \left\{ C_{i,j}:i + j > I,\ i \leq I,j \leq J \right\}.$

The chain ladder prediction of the ultimate claim $C_{i,J}$ of accident year i > I − J is given by

${\widehat{C}}_{i,J}^{CL} = C_{i,\iota(i)}\prod_{j = \iota(i)}^{J - 1}{\widehat{f}}_{j}$

where

${\widehat{f}}_{j} = \frac{C_{\left\lfloor I - j - 1 \right\rfloor,j + 1}}{C_{\left\lfloor I - j - 1 \right\rfloor,j}} \quad \text{and} \quad \iota(i) = \min(J,I - i).$

The chain ladder development pattern is defined as

${\widehat{\beta}}_{j}^{CL} = \prod_{k = j}^{J - 1}{\widehat{f}}_{k}^{- 1},0 \leq j \leq J - 1,\ {\widehat{\beta}}_{J}^{CL} = 1\ . \tag{1}$
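
The chain ladder quantities defined above can be illustrated with a short Python sketch. The 3x3 cumulative triangle is hypothetical (it is not the triangle of this paper), the function names are illustrative, and the sketch assumes I = J so that each row of the triangle simply stops at the latest diagonal.

```python
def chain_ladder_factors(tri):
    """Volume-weighted factors f_j from a cumulative triangle (list of rows).

    Row i holds C_{i,0}, ..., C_{i,I-i}; unobserved cells are simply absent.
    """
    n = len(tri)
    factors = []
    for j in range(n - 1):
        rows = range(n - j - 1)  # accident years 0..I-j-1
        num = sum(tri[i][j + 1] for i in rows)
        den = sum(tri[i][j] for i in rows)
        factors.append(num / den)
    return factors


def development_pattern(factors):
    """beta_j = prod_{k=j}^{J-1} 1/f_k with beta_J = 1, as in equation (1)."""
    beta = [1.0]
    for f in reversed(factors):
        beta.insert(0, beta[0] / f)
    return beta


def chain_ladder_ultimates(tri, factors):
    """Project the latest observed value of each accident year to ultimate."""
    ultimates = []
    for row in tri:
        c = row[-1]
        for f in factors[len(row) - 1:]:
            c *= f
        ultimates.append(c)
    return ultimates


# Hypothetical cumulative triangle, for illustration only
triangle = [[100.0, 150.0, 165.0],
            [110.0, 165.0],
            [120.0]]
f_hat = chain_ladder_factors(triangle)
beta_cl = development_pattern(f_hat)
ultimates_cl = chain_ladder_ultimates(triangle, f_hat)
print(f_hat, beta_cl, ultimates_cl)
```

Storing each accident year as a row of decreasing length avoids any placeholder for unobserved cells; a rectangular array with missing values would work equally well.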

3. The Cape Cod method

The Cape Cod predictor (Bühlmann and Straub 1983) for the ultimate claim is given by

${\widehat{C}}_{i,J}^{CC} = C_{i,\iota(i)} + \upsilon_{i}\widehat{q}\left( 1 - {\widehat{\beta}}_{\iota(i)} \right)$

where the earned premium for accident year i is denoted by $\upsilon_{i};$ and

$\widehat{q} = \frac{\sum_{i = 0}^{I}C_{i,\iota(i)}}{\sum_{i = 0}^{I}{\upsilon_{i}{\widehat{\beta}}_{\iota(i)}}}\ \ .$

${\widehat{\beta}}_{\iota(i)}$ is an estimate of $\beta_{\iota(i)}$ and describes the percentage of claims emerging up to development year $\iota(i).$ The incremental development pattern $\gamma_{j} = \beta_{j} - \beta_{j - 1}$ is estimated by

${\widehat{\gamma}}_{0} = {\widehat{\beta}}_{0}, \qquad {\widehat{\gamma}}_{j + 1} = {\widehat{\beta}}_{j + 1} - {\widehat{\beta}}_{j},\ \ \ 0 \leq j \leq J - 1.$

Bühlmann and Straub (1983) mention that the estimation of the development pattern ${\widehat{\beta}}_{j}$ is an unsolved problem. In practice, the development pattern is often estimated by the chain ladder (CL) development pattern given in (1).

Finally, we define the outstanding loss liabilities for accident year i at time I as

$R_{i}^{CC} = C_{i,J} - C_{i,I - i}$
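
As a minimal sketch of the Cape Cod predictor, the following Python function computes the loss ratio estimate, the predicted ultimates and the corresponding reserve estimates from the latest diagonal, the earned premiums and a development pattern. All input values are hypothetical and the function name `cape_cod` is illustrative.

```python
def cape_cod(latest, premium, beta_latest):
    """Cape Cod loss ratio, ultimates and reserve estimates.

    latest[i]      : C_{i,iota(i)}, latest observed cumulative claim
    premium[i]     : earned premium v_i
    beta_latest[i] : estimated share of claims emerged up to iota(i)
    """
    q_hat = sum(latest) / sum(v * b for v, b in zip(premium, beta_latest))
    ultimates = [c + v * q_hat * (1.0 - b)
                 for c, v, b in zip(latest, premium, beta_latest)]
    reserves = [u - c for u, c in zip(ultimates, latest)]
    return q_hat, ultimates, reserves


# Hypothetical inputs, for illustration only
latest = [165.0, 165.0, 120.0]
premium = [200.0, 200.0, 250.0]
beta_latest = [1.0, 10.0 / 11.0, 20.0 / 33.0]

q_hat, ultimates_cc, reserves_cc = cape_cod(latest, premium, beta_latest)
print(q_hat, ultimates_cc, reserves_cc)
```

The fully developed accident year (pattern value 1) receives a zero reserve, while the younger years are loaded with the common loss ratio applied to the premium share still to emerge.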

4. The stochastic model underlying the Cape Cod method

Model assumptions

Incremental claims $X_{i,j}$ are independent and there exist positive parameters $q,\ t_{j}^{3},\ 0 \leq j \leq J$ and a development pattern $\gamma_{0},\ \ldots,\ \gamma_{J}$ with $\sum_{j = 0}^{J}\gamma_{j} = 1$ such that

$E\left\lbrack X_{i,j} \right\rbrack = \upsilon_{i}q\gamma_{j}, \qquad SK\left\lbrack X_{i,j} \right\rbrack = \left( \upsilon_{i}q \right)^{\frac{3}{2}}\ t_{j}^{3},$

where $SK\left\lbrack X_{i,j} \right\rbrack$ denotes the third central moment of the random variable $X_{i,j}.$

For the estimation of the skewness, we need estimates for $q^{\frac{3}{2}}{\ t}_{j}^{3}.$ Note that

$\widehat{q^{\frac{3}{2}}\ t_{j}^{3}} = \frac{1}{I - j}\sum_{i = 0}^{I - j}\frac{1}{\upsilon_{i}^{\frac{3}{2}}}\left( X_{i,j} - \upsilon_{i}\widehat{\gamma_{j}} \right)^{3},\ 0 \leq j \leq J,\ j \neq I$

is an unbiased estimator for $q^{\frac{3}{2}}{\ t}_{j}^{3}.$

Note also that the above model assumptions assume that the expected loss ratio q is the same for all accident years.

Parameter estimation

In Saluz (2015), the normalized development pattern ${\widehat{\gamma}}_{j}$ is replaced by the raw development pattern below:

${\widehat{\gamma}}_{j}^{raw} = \frac{X_{\left\lfloor I - j \right\rfloor,j}}{\upsilon_{\left\lfloor I - j \right\rfloor}} \tag{2}$

And the cumulative development pattern is estimated by

${\widehat{\beta}}_{j}^{raw} = \sum_{k = 0}^{j}{\widehat{\gamma}}_{k}^{raw}.$
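
The raw pattern (2) and the estimator of $q^{\frac{3}{2}}t_{j}^{3}$ from Section 4 can be sketched in Python on the incremental triangle and earned premiums of Appendix A. The sketch assumes that the pattern entering the third-moment estimator is the raw pattern; under that assumption, the first entries should agree with the corresponding columns of Table 1 up to rounding. Function names are illustrative.

```python
# Incremental payments X[i][j] and earned premiums v[i], copied from Appendix A
X = [
    [5946975, 3721237, 895717, 207761, 206704, 62124, 65813, 14850, 11129, 15814],
    [6346756, 3246406, 723221, 151797, 67824, 36604, 52752, 11186, 11646],
    [6269090, 2976223, 847053, 262768, 152703, 65445, 53545, 8924],
    [5863015, 2683224, 722532, 190653, 132975, 88341, 43328],
    [5778885, 2745229, 653895, 273395, 230288, 105224],
    [6184793, 2828339, 572765, 244899, 104957],
    [5600184, 2893207, 563114, 225517],
    [5288066, 2440103, 528042],
    [5290793, 2357936],
    [5675568],
]
v = [15473558, 14882436, 14456039, 14054917, 14525373,
     15025923, 14832965, 14550359, 14461781, 15210363]


def raw_pattern(X, v):
    """Equation (2): gamma_j^raw = X_[I-j],j / v_[I-j] for j = 0..I."""
    I = len(X) - 1
    gamma = []
    for j in range(I + 1):
        rows = range(I - j + 1)  # accident years 0..I-j observed in column j
        gamma.append(sum(X[i][j] for i in rows) / sum(v[i] for i in rows))
    return gamma


def cumulative_pattern(gamma):
    """beta_j^raw as the running sum gamma_0^raw + ... + gamma_j^raw."""
    beta, running = [], 0.0
    for g in gamma:
        running += g
        beta.append(running)
    return beta


def third_moment_estimates(X, v, gamma):
    """Estimator of q^(3/2) t_j^3 for j = 0..I-1 (j = I is excluded)."""
    I = len(X) - 1
    estimates = []
    for j in range(I):
        rows = range(I - j + 1)
        total = sum((X[i][j] - v[i] * gamma[j]) ** 3 / v[i] ** 1.5 for i in rows)
        estimates.append(total / (I - j))
    return estimates


gamma_raw = raw_pattern(X, v)
beta_raw = cumulative_pattern(gamma_raw)
qt3_hat = third_moment_estimates(X, v, gamma_raw)
print([round(100 * g, 2) for g in gamma_raw])
print([round(e) for e in qt3_hat])
```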

5. Skewness of the CC method per accident year

The mean skewness of the estimate of the ultimate loss ${\widehat{C}}_{i,J}^{CC}$ is defined as

$\begin{align} {SK}_{C_{i,J}|D_{I}}\left( {\widehat{C}}_{i,J}^{CC} \right) &= E\left\lbrack \left( C_{i,J} - {\widehat{C}}_{i,J}^{CC} \right)^{3}|D_{I} \right\rbrack \\ &= E\biggl\lbrack \bigl( \sum_{j = I - i + 1}^{J}X_{i,j} - \upsilon_{i}\bigl( {\widehat{\beta}}_{J}^{raw} \\ &\quad \quad- {\widehat{\beta}}_{I - i}^{raw} \bigr) \bigr)^{3} |D_{I} \biggr\rbrack \end{align}$

$\begin{align} {SK}_{C_{i,J}|D_{I}}\biggl( {\widehat{C}}_{i,J}^{CC} \biggr) &= E\biggl\lbrack \biggl( \sum_{j = I - i + 1}^{J}X_{i,j} - \upsilon_{i}q\bigl( 1 - \beta_{I - i} \bigr) \\ &\quad \quad + \upsilon_{i}q\bigl( 1 - \beta_{I - i} \bigr) - \upsilon_{i}\bigl( {\widehat{\beta}}_{J}^{raw} \\ &\quad \quad - {\widehat{\beta}}_{I - i}^{raw} \bigr) \biggr)^{3} |D_{I} \biggr\rbrack \end{align}$

As we have

$\sum_{j = I - i + 1}^{J}\gamma_{j}^{raw} = q\left( 1 - \beta_{I - i} \right)$

we get

$\begin{align} {SK}_{C_{i,J}|D_{I}}\left( {\widehat{C}}_{i,J}^{CC} \right) &= E\biggl\lbrack \bigl( \sum_{j = I - i + 1}^{J}\bigl( X_{i,j} - \upsilon_{i}\gamma_{j}^{raw} \bigr) \\ &\quad \quad - \upsilon_{i}\bigl( {\widehat{\beta}}_{J}^{raw} - {\widehat{\beta}}_{I - i}^{raw} \\ &\quad \quad - q\bigl( 1 - \beta_{I - i} \bigr) \bigr) \bigr)^{3} |D_{I} \biggr\rbrack \end{align}$

Due to the independence of the $X_{i,j},$ we have

$\begin{align} {SK}_{C_{i,J}|D_{I}}\left( {\widehat{C}}_{i,J}^{CC} \right) &= E\biggl\lbrack \biggl( \sum_{j = I - i + 1}^{J}\bigl( X_{i,j} - \upsilon_{i}\gamma_{j}^{raw} \bigr) \biggr)^{3}|D_{I} \biggr\rbrack \\ &\quad - \upsilon_{i}^{3}\ E\biggl\lbrack \bigl( {\widehat{\beta}}_{J}^{raw} - {\widehat{\beta}}_{I - i}^{raw} - q\bigl( 1 - \beta_{I - i} \bigr) \bigr)^{3}|D_{I} \biggr\rbrack \end{align}$

Hence

$\begin{align} {SK}_{C_{i,J}|D_{I}}\left( {\widehat{C}}_{i,J}^{CC} \right) &= \sum_{j = I - i + 1}^{J}{SK\left( X_{i,j} \right)} \\ &\quad - \upsilon_{i}^{3}\left( \sum_{j = I - i + 1}^{J}\left( {\widehat{\gamma}}_{j}^{raw} - q\gamma_{j} \right) \right)^{3} \end{align}$

By definition,

$SK\left( X_{i,j} \right) = \left( \upsilon_{i}q \right)^{\frac{3}{2}}t_{j}^{3}$

As for the second element of the equation, we have

$\begin{align} \biggl( \sum_{j = I - i + 1}^{J}\bigl( {\widehat{\gamma}}_{j}^{raw} - q\gamma_{j} \bigr) \biggr)^{3} &= SK\biggl( \sum_{j = I - i + 1}^{J}{\widehat{\gamma}}_{j}^{raw} \biggr) \\ &= \sum_{j = I - i + 1}^{J}{\frac{1}{\bigl( \upsilon_{\bigl\lfloor I - j \bigr\rfloor} \bigr)^{3}}\sum_{l = 0}^{I - j}{SK\bigl( X_{l,j} \bigr)}} \\ &= \sum_{j = I - i + 1}^{J}{\frac{1}{\bigl( \upsilon_{\bigl\lfloor I - j \bigr\rfloor} \bigr)^{3}}\sum_{l = 0}^{I - j}{\bigl( \upsilon_{l}q \bigr)^{\frac{3}{2}}t_{j}^{3}}} \\ &= \sum_{j = I - i + 1}^{J}\frac{q^{\frac{3}{2}}t_{j}^{3}}{\bigl( \upsilon_{\bigl\lfloor I - j \bigr\rfloor} \bigr)^{\frac{3}{2}}} \end{align}$

Finally, we get

$\begin{align} {SK}_{C_{i,J}|D_{I}}\bigl( {\widehat{C}}_{i,J}^{CC} \bigr) &= \bigl( \upsilon_{i}q \bigr)^{\frac{3}{2}}\sum_{j = I - i + 1}^{J}\\ &\quad \quad {t_{j}^{3}\biggl( 1 - \frac{\upsilon_{i}^{\frac{3}{2}}}{\bigl( \upsilon_{\bigl\lfloor I - j \bigr\rfloor} \bigr)^{\frac{3}{2}}} \biggr)} \end{align} \tag{3}$
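
Formula (3) can be evaluated directly once estimates of $q^{\frac{3}{2}}t_{j}^{3}$ are available. The Python sketch below plugs in the rounded estimates printed in Table 1 and the earned premiums of Appendix A. It assumes that the term for j = 9, for which no estimate exists, is set to zero, and it divides by the cube of the Table 1 column headed msep to obtain a skewness coefficient. The function name is illustrative.

```python
# Earned premiums v_i from Appendix A (accident years 0..9)
v = [15473558, 14882436, 14456039, 14054917, 14525373,
     15025923, 14832965, 14550359, 14461781, 15210363]

# Rounded estimates of q^(3/2) t_j^3 for j = 0..8 as printed in Table 1;
# the last entry (j = 9) is set to 0 here, which is an assumption.
qt3 = [194916, 368471, 5974, -412, 142, 9, 1, 0, 0, 0]

# Table 1 column headed msep, for accident years 9 and 8
sd = {9: 416594, 8: 162533}


def cape_cod_third_moment(i, v, qt3):
    """Third moment of the Cape Cod prediction for accident year i, formula (3)."""
    I = len(v) - 1
    J = len(qt3) - 1
    v_i_pow = v[i] ** 1.5
    total = 0.0
    for j in range(I - i + 1, J + 1):
        v_cum = sum(v[: I - j + 1])  # v_[I-j] = v_0 + ... + v_{I-j}
        total += qt3[j] * (1.0 - v_i_pow / v_cum ** 1.5)
    return v_i_pow * total


skew = {i: cape_cod_third_moment(i, v, qt3) / s ** 3 for i, s in sd.items()}
print({i: round(s, 3) for i, s in skew.items()})
```

Because the inputs are the rounded values of Table 1, small differences from the published skewness coefficients are to be expected.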

6. Skewness of the CC method over all accident years

Having estimated the skewness of the CC method by accident year, we will now aggregate these elements over all accident years. In order to do so, we will use the Fleishman polynomials (Fleishman 1978). First, we assume that the centralized and normalized copy of the risk value X_i of the i-th class, $\widehat{X_{i}} = \frac{X_{i} - E\left( X_{i} \right)}{E\left( X_{i} \right)\ {CoV}_{X_{i}}}$ (where CoV denotes the coefficient of variation), is estimated by the Fleishman polynomial structure of a standard normal random variable. In particular, we consider the following case:

${\widehat{X}}_{i} = P_{2}\left( Z_{i} \right) = a_{i}Z_{i} + b_{i}\left( Z_{i}^{2} - 1 \right),$ where $Z_{i}$ denotes a standard normal random variable. Such a case is suitable for estimating the skewness of a risk portfolio profile when the confidence level is approximated using skewness only.

The coefficients of the polynomial P₂ are calibrated using the method of moments by matching the second and third moments of P₂(Z_i) to 1 (the standard deviation of $\widehat{X_{i}}$) and λ_i (the skewness of X_i), respectively.

The coefficients of P₂ can be analytically expressed by solving the following system of equations:

$\left\{\begin{array}{c} 1=a_{i}^{2}+2 b_{i}^{2} \\ \lambda_{i}=6 a_{i}^{2} b_{i}+8 b_{i}^{3} \end{array}\right. \tag{4}$

The system (4) is reduced to:

$\left\{\begin{array}{l} a_{i}=\sqrt{1-2 b_{i}^{2}} \\ \lambda_{i}=6 b_{i}-4 b_{i}^{3} \end{array}\right. \tag{5}$

Such a system is easily solved. The roots of the cubic equation in (5) can be found using Cardano’s formula (e.g., Abramowitz and Stegun 1972). If we denote

$\varphi = \arccos\left( - \frac{\lambda_{i}}{\sqrt{8}} \right)$

then, for $|\lambda_{i}| < \sqrt{8},$ the only root of the cubic equation in (5) for which $a_{i}$ is real (that is, $2b_{i}^{2} \leq 1$) is

$b_{i} = \sqrt{2}\ \cos\left( \frac{\varphi}{3} + 4\frac{\pi}{3} \right).$
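
The closed-form solution of system (5) can be sketched in a few lines of Python. The function name is illustrative; as a check, a skewness of 0.295 (accident year 9 in Table 1) should give coefficients close to the tabulated b = 0.04925 and a = 0.9976.

```python
import math


def fleishman_p2_coefficients(skewness):
    """Return (a, b) such that a*Z + b*(Z^2 - 1) has unit variance
    and the given skewness, using the trigonometric root of system (5)."""
    if abs(skewness) > math.sqrt(8.0):
        raise ValueError("a quadratic polynomial cannot reach |skewness| > sqrt(8)")
    phi = math.acos(-skewness / math.sqrt(8.0))
    b = math.sqrt(2.0) * math.cos(phi / 3.0 + 4.0 * math.pi / 3.0)
    # max() guards against a tiny negative value caused by rounding
    a = math.sqrt(max(0.0, 1.0 - 2.0 * b * b))
    return a, b


a, b = fleishman_p2_coefficients(0.295)
print(round(a, 4), round(b, 5))
```

The other two roots of the cubic have $2b^{2} > 1$ and would make a imaginary, which is why only this branch of the cosine is used.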

Having estimated the above parameters of the Fleishman polynomial, we define the total reserve value across the portfolio of m risks as

$X_{\Sigma} = \sum_{i = 1}^{m}X_{i}$

where each i-th risk value is approximated by a Fleishman polynomial of a standard normal random variable

$X_{i} \approx {CE}_{i}\ \left( 1 + {CoV}_{i}\ P_{2}\left( Z_{i} \right) \right),$

where CE denotes the central best estimate of X.

It is clear that ${CE}_{\Sigma} = \sum_{i = 1}^{m}{CE}_{i}.$

As in Mack (2008), all the stand-alone risks interact with each other according to a Gaussian dependence structure whose linear correlations ρ_ij (coefficients of a Gaussian copula) are given by

$\rho_{ij} = \frac{{\widehat{z}}_{n + 1 - j}\left( 1 - {\widehat{z}}_{n + 1 - i} \right)}{{\widehat{z}}_{n + 1 - i}\left( 1 - {\widehat{z}}_{n + 1 - j} \right)}.$

Skewness

We compute the third central moment of X_Σ:

$E\left\lbrack \left( X_{\Sigma} - {CE}_{\Sigma} \right)^{3} \right\rbrack = E\left\lbrack \left( \sum_{i = 1}^{m}{\sigma_{i}\ P_{2}\left( Z_{i} \right)} \right)^{3} \right\rbrack$

$\begin{aligned} E\bigl[\bigl(X_{\Sigma}-C E_{\Sigma}\bigr)^{3}\bigr] &=\sum_{i=1}^{m} \sigma_{i}^{3} \lambda_{i}\\ &\quad +3 \sum_{i j} \sigma_{i}^{2} \sigma_{j} \\ &\quad \quad E\bigl[P_{2}\bigl(Z_{i}\bigr)^{2} P_{2}\bigl(Z_{j}\bigr)\bigr] \\ &\quad +6 \sum_{i j k} \sigma_{i} \sigma_{j} \sigma_{k} \\ &\quad \quad E \bigl[P_{2}\bigl(Z_{i}\bigr) P_{2}\bigl(Z_{j}\bigr) P_{2}\bigl(Z_{k}\bigr)\bigr] \end{aligned} \tag{6}$

where $E\left\lbrack {P_{2}\left( Z_{i} \right)}^{3} \right\rbrack = \lambda_{i}$ as the Fleishman polynomial coefficients are calibrated so that the polynomial has skewness λ_i for i-th stand-alone risk profile. In formula (6), the summation term with multiple 3 has $\begin{pmatrix} m \\ 2 \\ \end{pmatrix}$ different sub-terms, and the summation term with multiple 6 is relevant if $m \geq 3$ and has $\left(\begin{array}{c}m \\ 3\end{array}\right)$ different sub-terms.

The cross moments appearing in formula (6) are given by

$\begin{align} E\bigl\lbrack {P_{2}\bigl( Z_{i} \bigr)}^{2}P_{2}\bigl( Z_{j} \bigr) \bigr\rbrack &= 2\rho_{ij}\\ &\quad \bigl( 2a_{i}a_{j}b_{i} + \bigl( a_{i}^{2} + 4b_{i}^{2} \bigr)b_{j}\rho_{ij} \bigr) \end{align} \tag{7}$

$\begin{align} E\bigl\lbrack P_{2}\bigl( Z_{i} \bigr)P_{2}\bigl( Z_{j} \bigr)P_{2}\bigl( Z_{k} \bigr) \bigr\rbrack &= 2\bigl( a_{j}a_{k}b_{i}\rho_{ij}\rho_{ik} \\ &\quad \quad + a_{j}a_{i}b_{k}\rho_{jk}\rho_{ik} \\ &\quad \quad + a_{i}a_{k}b_{j}\rho_{ij}\rho_{jk} \bigr) \\ &\quad + 8b_{i}b_{k}b_{j}\rho_{ij}\rho_{ik}\rho_{jk} \end{align} \tag{8}$

The skewness λ_Σ is then calculated as follows:

$\lambda_{\Sigma} = \frac{E\left\lbrack \left( X_{\Sigma} - {CE}_{\Sigma} \right)^{3} \right\rbrack}{\left( {CE}_{\Sigma}\ {CoV}_{\Sigma} \right)^{3}}. \tag{9}$
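
Formulae (6) to (9) can be combined into a short Python sketch. It reads the sum with factor 3 as running over all ordered pairs i ≠ j and the sum with factor 6 as running over all unordered triples, which is the multinomial expansion of the cube; this reading is an assumption about the notation. The standard deviation of the total, $CE_{\Sigma}\ CoV_{\Sigma},$ is taken as an input, and all names and input values are illustrative.

```python
import math
from itertools import combinations, permutations


def p2_coefficients(lam):
    """(a, b) of P2(Z) = a*Z + b*(Z^2 - 1) with unit variance and skewness lam."""
    phi = math.acos(-lam / math.sqrt(8.0))
    b = math.sqrt(2.0) * math.cos(phi / 3.0 + 4.0 * math.pi / 3.0)
    return math.sqrt(max(0.0, 1.0 - 2.0 * b * b)), b


def cross_moment_21(i, j, a, b, rho):
    """E[P2(Z_i)^2 P2(Z_j)], formula (7)."""
    r = rho[i][j]
    return 2.0 * r * (2.0 * a[i] * a[j] * b[i]
                      + (a[i] ** 2 + 4.0 * b[i] ** 2) * b[j] * r)


def cross_moment_111(i, j, k, a, b, rho):
    """E[P2(Z_i) P2(Z_j) P2(Z_k)], formula (8)."""
    rij, rik, rjk = rho[i][j], rho[i][k], rho[j][k]
    return (2.0 * (a[j] * a[k] * b[i] * rij * rik
                   + a[j] * a[i] * b[k] * rjk * rik
                   + a[i] * a[k] * b[j] * rij * rjk)
            + 8.0 * b[i] * b[j] * b[k] * rij * rik * rjk)


def aggregate_third_moment(sigma, lam, rho):
    """Third central moment of the total, formula (6)."""
    coeffs = [p2_coefficients(l) for l in lam]
    a = [c[0] for c in coeffs]
    b = [c[1] for c in coeffs]
    m = len(sigma)
    total = sum(s ** 3 * l for s, l in zip(sigma, lam))
    for i, j in permutations(range(m), 2):      # ordered pairs, i != j
        total += 3.0 * sigma[i] ** 2 * sigma[j] * cross_moment_21(i, j, a, b, rho)
    for i, j, k in combinations(range(m), 3):   # unordered triples
        total += (6.0 * sigma[i] * sigma[j] * sigma[k]
                  * cross_moment_111(i, j, k, a, b, rho))
    return total


def aggregate_skewness(sigma, lam, rho, sd_total):
    """Skewness of the total, formula (9), given its standard deviation."""
    return aggregate_third_moment(sigma, lam, rho) / sd_total ** 3


# Sanity check with hypothetical inputs: three identical, perfectly dependent
# risks behave like one risk scaled by 3, so the skewness must be unchanged.
ones = [[1.0] * 3 for _ in range(3)]
print(aggregate_skewness([2.0, 2.0, 2.0], [0.3, 0.3, 0.3], ones, 6.0))
```

With an identity correlation matrix all cross moments vanish and the third moment reduces to the sum of the stand-alone third moments, which gives a second simple check of an implementation.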

In Table 1, the parameters a_i, b_i are provided; the correlation matrices are provided in Appendix A; and the values for σ_i correspond to the msep(R_i) estimated using the formulae of Saluz (2015). Details of the calculations related to the numerical example can be found in the Excel sheets and in the R program in the folder available at:

https://drive.google.com/drive/folders/14UNUPb1a0A_-YNe0Y4gTDdhEsPnOaIxS?usp=sharing

Table 1. Reserves, volatility and skewness resulting from the application of the Cape Cod method

				Fleishman polynomial
i	$\widehat{R}_{i}^{CC}$	$msep\left(\widehat{R}_{i}^{CC}\right)$	$\frac{SK_{C_{i,J} \mid D_{I}}\left(\widehat{C}_{i,J}^{CC}\right)}{msep\left(\widehat{R}_{i}^{CC}\right)^{3}}$	$b_{i}$	$a_{i}$	j	$\widehat{\gamma}_{j}^{raw}$	$\widehat{q^{3/2}t_{j}^{3}}$
9	4,240,563	416,594	0.295	0.04925	0.9976	0	39.49%	194,916
8	1,200,821	162,533	0.070	0.01167	0.9999	1	19.58%	368,471
7	528,056	86,260	-0.022	-0.00359	1.0000	2	4.67%	5,974
6	314,665	73,227	0.021	0.00342	1.0000	3	1.51%	-412
5	166,585	31,984	0.016	0.00262	1.0000	4	1.01%	142
4	90,234	8,475	0.071	0.01188	0.9999	5	0.49%	9
3	35,874	2,989	0.093	0.01544	0.9998	6	0.37%	1
2	25,620	841	0.007	0.00113	1.0000	7	0.08%	0
1	15,210	246	0.000	0.00000	1.0000	8	0.08%	0
Total	6,617,628	480,603	0.484

7. Numerical examples

Equations (3) and (9) were tested on one triangle and are shown in Table 1. The correlation matrices used for applying the Fleishman polynomials are also provided in Appendix A.

As expected, the skewness is positive: the right tail of the reserving risk distribution is heavier than the left tail, so there is a lower probability for the reserves to be underestimated. However, with such a distribution, when the reserves are underestimated, the shortfall will be larger than under a distribution with a lower skewness.

In order to benchmark the above skewness, it is compared against a few models used in practice. The first benchmark will consist in the use of parametric distributions where means and standard deviations are matched to the calculated means and standard deviations. The second benchmark will use the usual stochastic reserving method: bootstrapping. The third benchmark will consist in estimating the skewness on the basis of the Mack chain ladder model available in the R package ChainLadder (Gesman et al. 2022).

Benchmark 1: Parametric distributions

As is often done in practice, the distribution of reserves is approximated with a gamma or lognormal distribution matched to the mean and standard deviation. For a gamma distribution, we would have

$Skewness = 2\ \times Coefficient\ of\ variation$

and for lognormal,

$\begin{align} Skewness &= \left( 3 + \ {Coefficient\ of\ variation}^{2} \right)\ \\ &\quad \times Coefficient\ of\ variation. \end{align}$

With such an approximation, it is possible to compare the shape of the distribution resulting from the Cape Cod model with gamma or lognormal distributions. In the example above, the estimated skewness is 0.484, which is almost seven times the coefficient of variation of 7.3%, whereas the lognormal formula gives a skewness of roughly three times the coefficient of variation. This indicates a distribution much more skewed than a lognormal distribution. In the triangle, it can be seen that this higher skewness comes from accident year 0, which seems to behave differently from the other accident years, in particular in development year 1, where the payment amount of 3,721,237 (Appendix A) seems unusually high. As for any reserving work, this particular cell would need further investigation before the distribution model is finalized.
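
The comparison can be made concrete with a few lines of Python that apply the two formulae of this benchmark to the totals of Table 1. The function names are illustrative; the resulting skewness values are roughly 0.145 for the gamma distribution and 0.218 for the lognormal distribution, against 0.484 for the Cape Cod estimate.

```python
reserve_total = 6_617_628  # total reserve, Table 1
sd_total = 480_603         # total of the column headed msep, Table 1
cape_cod_skewness = 0.484  # total skewness, Table 1


def gamma_skewness(cov):
    """Skewness of a gamma distribution with coefficient of variation cov."""
    return 2.0 * cov


def lognormal_skewness(cov):
    """Skewness of a lognormal distribution with coefficient of variation cov."""
    return (3.0 + cov ** 2) * cov


cov = sd_total / reserve_total
print(f"CoV = {cov:.4f}")
print(f"gamma skewness = {gamma_skewness(cov):.3f}")
print(f"lognormal skewness = {lognormal_skewness(cov):.3f}")
print(f"Cape Cod skewness / CoV = {cape_cod_skewness / cov:.2f}")
```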

Benchmark 2: Bootstrapping

For this benchmark, the function BootChainLadder of the R package ChainLadder (Gesman et al. 2022 and the R program provided) is used on the triangle shown in Appendix A (note, however, that the triangle in Appendix A is first converted into a cumulative payment triangle). The function BootChainLadder corresponds to the implementation of the bootstrapping technique described in England and Verrall (2002). For comparison purposes, two process distributions are tested: gamma and over-dispersed Poisson. Results of the bootstrapping are shown in Table 2.

Table 2. Mean, standard deviation and skewness of the IBNR distribution resulting from the bootstrapping method

	Over-dispersed Poisson				Gamma
i	IBNR	Standard deviation of IBNRs	Coefficient of Variation	Skewness of IBNR distribution	IBNR	Standard deviation of IBNRs	Coefficient of Variation	Skewness of IBNR distribution
9	3,952,041	333,218	0.084	0.158	3,951,896	332,107	0.084	0.177
8	1,043,678	140,799	0.135	0.245	1,043,641	140,449	0.135	0.250
7	449,368	90,363	0.201	0.362	448,867	90,118	0.201	0.352
6	286,402	73,282	0.256	0.445	286,146	73,118	0.256	0.446
5	156,462	55,473	0.355	0.583	156,336	55,492	0.355	0.567
4	85,394	42,385	0.496	0.751	85,265	42,316	0.496	0.742
3	34,527	28,981	0.839	0.988	34,416	29,235	0.849	1.014
2	26,300	26,944	1.024	1.086	26,149	26,884	1.028	1.073
1	15,148	21,689	1.432	1.311	15,114	21,635	1.431	1.370
Total	6,049,320	431,277	0.071	0.123	6,047,831	430,988	0.071	0.123

The bootstrapping results indicate that the overall coefficient of variation and the overall level of IBNR seem consistent with the results of the Cape Cod method even though the level of IBNR is smaller in the bootstrapping method. However, the coefficients of variation per accident year are increasing significantly from accident year 9 to accident year 1, even leading to some counterintuitive results; for example, the standard deviation on accident year 1 is higher than the IBNR level, whereas the reserve level on older accident years is usually almost certain and the standard deviation should be small. The same increase across accident years is observed on the skewness.

Such behavior comes from the way in which bootstrapping simulations are done:

First, the following assumptions are made.

For each accident year i and development year j, we have:
$E\left( C_{ij} \right) = m_{ij} = x_{i}y_{j}$
$Var\left( C_{ij} \right) = \phi x_{i}y_{j}$ (Over-Dispersed Poisson)
Or $Var\left( C_{ij} \right) = \phi m_{ij}^{2}$ (Gamma)

Second, residuals are estimated as:

$r_{ij} = \frac{C_{ij} - m_{ij}}{\sqrt{m_{ij}}}$

Some adjustments on these residuals are done (see England and Verrall 2002 for details).

Third, after resampling the adjusted residuals (resulting in $r_{ij}^{*}),$ a new triangle is created as per the formula:

$C_{ij}^{*} = r_{ij}^{*}\sqrt{m_{ij}} + m_{ij}.$

Fourth, on the basis of this new triangle, the overall IBNRs are estimated, giving a full distribution.
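
The second and third steps can be sketched in Python as follows. This is a simplified illustration, not the implementation of BootChainLadder: the fitted means are hypothetical placeholders rather than the output of a fitted model, and the residual adjustments and the re-estimation of the IBNR in the fourth step are omitted.

```python
import math
import random


def pearson_residuals(observed, fitted):
    """r_ij = (C_ij - m_ij) / sqrt(m_ij) for every cell of the triangle."""
    return [[(c - m) / math.sqrt(m) for c, m in zip(row_c, row_m)]
            for row_c, row_m in zip(observed, fitted)]


def resample_triangle(fitted, residuals, rng):
    """Build a pseudo triangle C*_ij = r*_ij * sqrt(m_ij) + m_ij, where each
    r*_ij is drawn with replacement from the pool of all residuals."""
    pool = [r for row in residuals for r in row]
    return [[rng.choice(pool) * math.sqrt(m) + m for m in row] for row in fitted]


# Hypothetical observed values and fitted means m_ij, for illustration only
observed = [[100.0, 52.0, 14.0], [108.0, 56.0], [125.0]]
fitted = [[102.0, 50.0, 14.0], [106.0, 58.0], [125.0]]

residuals = pearson_residuals(observed, fitted)
pseudo = resample_triangle(fitted, residuals, random.Random(0))
print(pseudo)
```

Because every residual goes into one common pool, a residual observed in one cell can be applied to any other cell of the pseudo triangle, whatever its accident year.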

Considering the process applied in the bootstrapping method, the following conclusions can be drawn:

The resampling method uses the residuals of younger (and more uncertain) accident years and applies them to older accident years. This results in the counterintuitive results mentioned above for older accident years.
For the overall IBNR level and the overall standard deviation, the method could be valid, as assumptions on standard deviations and expected value are set at the beginning of the process.
However, for skewness, there is no guarantee that the bootstrapping method can provide reliable results, as no assumption is set at the beginning of the process for the third moment of the distributions. Adding such an assumption could be a way to refine the bootstrapping method.

As a conclusion, the bootstrapping benchmark may not be a reliable source for comparison.

Benchmark 3: Mack chain ladder

The final benchmark is the Mack chain ladder as implemented in the function Quantile of the R package ChainLadder (Gesman et al. 2022 and the R program provided). This method is also applied on the cumulative triangle. Results are shown in Table 3.

Table 3. Mean, standard deviation and skewness of the IBNR distribution resulting from the Mack chain ladder method

	Chain-Ladder
i	IBNR	Standard deviation of IBNRs	Coefficient of Variation	Skewness of IBNR distribution
9	3,950,815	410,817	0.104	1.042
8	1,043,242	134,336	0.129	0.074
7	449,167	85,398	0.190	0.003
6	286,121	73,467	0.257	0.091
5	156,494	33,341	0.213	0.106
4	85,302	7,628	0.089	0.214
3	34,538	3,059	0.089	0.063
2	26,257	915	0.035	0.009
1	15,126	268	0.018	0.000
Total	6,047,064	462,960	0.077	0.735

The Mack chain ladder results indicate that the overall coefficient of variation and the overall level of IBNR seem consistent with the results of the Cape Cod method, even though the level of IBNR is smaller. In addition, the skewness is higher than the Cape Cod skewness. Contrary to the bootstrapping method, the skewness per accident year decreases as the accident year becomes older, as in the Cape Cod method and as anticipated. In terms of comparison, the lower skewness and coefficient of variation of the Cape Cod method are certainly explained by the smoothing effect of the Cape Cod method compared to the chain ladder method. In fact, the Cape Cod method, like the Bornhuetter-Ferguson method, takes premium information into account so as to smooth the effect of payments being higher or lower than anticipated.

As an overall conclusion, the results provided by the Cape Cod skewness seem to fit well within the different benchmarks:

It is above the skewness coming from the parametric distributions gamma and lognormal and it is possible to identify the reason for this difference.
It is below the Mack chain ladder skewness due to the smoothing nature of the Cape Cod method.

Finally, it may be worth mentioning that the ultimate loss ratios resulting from the application of the Cape Cod method decrease from the low 70s in the older accident years to the mid 60s in the younger accident years. When applying the skewness formulae, it is important to check that the overall ultimate loss ratios are stable enough for the method to be applied. If the resulting Cape Cod loss ratios are too volatile, the skewness estimator will not provide reliable information.

8. Conclusion

This paper is a first attempt to estimate the skewness of the CC method. It does not rely on assumptions derived from external knowledge of the modeled reserving risk. As a result, the estimation is automatic and may not work on every triangle. In particular, the skewness estimation method assumes a constant loss ratio across the accident years. Therefore, triangles reflecting a high volatility of the loss ratio in the past may not be fit for this method. It must be borne in mind that, in such cases, the Cape Cod reserving method is also usually not applicable for reserve estimation.

However, the advantages of this method are:

1 – It is distribution-free: there is no need to assume any distribution of the reserving risk to get an estimation of the skewness.

2 – It is easy to implement and the proposed formulae are simple and elegant.

As a next step, we must recognize also that the CC method is usually used together with the chain ladder method and the Bornhuetter-Ferguson method. Therefore, as for the hybrid chain ladder method (Arbenz and Salzmann 2014), a skewness formula for the mixed CC / Bornhuetter-Ferguson / chain ladder method should be developed in a subsequent paper.

Appendix A

i / j	0	1	2	3	4	5	6	7	8	9	$v_{i}$
0	5,946,975	3,721,237	895,717	207,761	206,704	62,124	65,813	14,850	11,129	15,814	15,473,558
1	6,346,756	3,246,406	723,221	151,797	67,824	36,604	52,752	11,186	11,646		14,882,436
2	6,269,090	2,976,223	847,053	262,768	152,703	65,445	53,545	8,924			14,456,039
3	5,863,015	2,683,224	722,532	190,653	132,975	88,341	43,328				14,054,917
4	5,778,885	2,745,229	653,895	273,395	230,288	105,224					14,525,373
5	6,184,793	2,828,339	572,765	244,899	104,957						15,025,923
6	5,600,184	2,893,207	563,114	225,517							14,832,965
7	5,288,066	2,440,103	528,042								14,550,359
8	5,290,793	2,357,936									14,461,781
9	5,675,568										15,210,363

i/j	9	8	7	6	5	4	3	2	1
9	100%	76%	63%	40%	30%	22%	16%	10%	5%
8	76%	100%	83%	53%	40%	28%	22%	14%	6%
7	63%	83%	100%	64%	48%	34%	26%	16%	7%
6	40%	53%	64%	100%	75%	54%	40%	26%	11%
5	30%	40%	48%	75%	100%	72%	54%	34%	15%
4	22%	28%	34%	54%	72%	100%	76%	48%	21%
3	16%	22%	26%	40%	54%	76%	100%	64%	28%
2	10%	14%	16%	26%	34%	48%	64%	100%	45%
1	5%	6%	7%	11%	15%	21%	28%	45%	100%