Estimation of Tail Development Factors in the Paid-Incurred Chain Reserving Method

Michael Merz; Mario V. Wüthrich

Merz, Michael, and Mario V. Wüthrich. 2013. “Estimation of Tail Development Factors in the Paid-Incurred Chain Reserving Method.” Variance 7 (1): 29–60.

Download all (1)

Figure 1. PIC reserving model. Left panel: cumulative payments $P_{\mathrm{i}, \mathrm{j}}$ development triangle; Right panel: claims incurred $I_{i, j}$ development triangle; both leading to the same ultimate claim amount $P_{i, J+1}=I_{i, J+1}$ for accident year $i$
Download

View more stats

Abstract

In many applied claims reserving problems in P insurance, the claims settlement process goes beyond the latest development period available in the observed claims development triangle. This makes it necessary to estimate so-called tail development factors which account for the unobserved part of the insurance claims. We estimate these tail development factors in a mathematically consistent way. This paper is a modification of the paid-incurred chain (PIC) reserving model of Merz and Wüthrich (2010). This modification then allows for the prediction of the outstanding loss liabilities and the corresponding prediction uncertainty under the inclusion of tail development factors.

1. Introduction and model assumptions

Often in P claims reserving problems, the claims settlement process goes beyond the latest development period available in the observed claims development triangle. This means that there is still an unobserved part of the insurance claims for which one needs to build claims reserves. In such situations, claims reserving actuaries apply so-called tail development factors to the last column of the claims development triangle which account for the settlement that goes beyond this latest development period. Typically, one has only limited information for the estimation of such tail development factors. Therefore, various techniques are applied to estimate these tail development factors. Most of these estimation methods are ad hoc methods that do not fit into any stochastic modeling framework. Popular estimation techniques, for example, fit parametric curves to the data using the right-hand corner of the claims development triangle (Mack 1999; Boor 2006; Verrall and Wüthrich 2012). In practice, one often does a simultaneous study of claims payments and claims incurred data, i.e., incurred-paid ratios are used to determine tail development factors (see Section 3 in Boor 2006).

In this paper we review the paid-incurred chain (PIC) reserving method. The log-normal PIC reserving model introduced in Merz and Wüthrich (2010) can easily be extended so that it allows for the inclusion of tail development factors in a natural and mathematically consistent way. Similar to common practice, the tail development factor estimates will then be based on incurred-paid ratios within our PIC reserving framework.

In the following, we denote accident years by $i \in$ $\{0, \ldots, J\}$ and development years by $j \in\{0, \ldots, J$ , $J+1\}$ . Development year $J$ refers to the latest observed development year of accident year $i=0$ and the step from $J$ to $J+1$ refers to the tail development factors (see Figure 1). Cumulative payments in accident year $i$ after $j$ development years are denoted by $P_{i, j}$ and the corresponding claims incurred by $I_{i j}$ . Moreover, for the ultimate claim we assume $P_{i, J+1}=I_{i, J+1}$ with probability 1 (see Figure 1). This means that we assume that-after several development periods beyond the latest observed development year $J$ -the cumulative payments and the claims incurred lead to the same ultimate claim amount. That is, ultimately, when all claims of accident year $i$ are settled, $I_{i, J+1}$ and $P_{i, J+1}$ must coincide.

Figure 1.PIC reserving model. Left panel: cumulative payments

$P_{\mathrm{i}, \mathrm{j}}$ development triangle; Right panel: claims incurred

$I_{i, j}$ development triangle; both leading to the same ultimate claim amount

$P_{i, J+1}=I_{i, J+1}$ for accident year

$i$

Model Assumptions 1.1

Log-normal PIC reserving model, Merz and Wüthrich (2010)

Conditionally, given parameters $\Theta=\left(\Phi_0, \ldots\right.$ , $\left.\Phi_{J+1}, \Psi_0, \ldots, \Psi_J, \sigma_0, \ldots, \sigma_{J+1}, \tau_0, \ldots, \tau_J\right)$ , we have
the random vector $\left(\xi_{0,0}, \ldots, \xi_{J, J+1}, \zeta_{0,0}, \ldots, \zeta_{J, J}\right)$ has a multivariate Gaussian distribution with uncorrelated components given by

$\begin{array}{ll} \xi_{i, j} \sim N\left(\Phi_j, \sigma_j^2\right) & \text { for } i \in\{0, \ldots, J\} \text { and } \\ & j \in\{0, \ldots, J+1\}, \text { and } \\ \zeta_{i, j} \sim N\left(\Psi_j, \tau_j^2\right) \quad & \text { for } i \in\{0, \ldots, J\} \text { and } \\ & j \in\{0, \ldots, J\} ; \end{array}$

cumulative payments $P_{i, j}$ are given by the recursion, $j=0, \ldots, J+1$ , $P_{i, j}=P_{i, j-1} \exp \left\{\xi_{i, j}\right\}, \quad$ with initial value $P_{i-1}=1$ ;
claims incurred $I_{i, j}$ are given by the (backwards) recursion, $j=0, \ldots, J$ , $I_{i, j}=I_{i, j+1} \exp \left\{-\zeta_{i, j}\right\}$ , with initial value $I_{i, J+1}=P_{i, J+1}$ .

For an extended model discussion we refer to Merz and Wüthrich (2010). Basically, the PIC Model Assumptions 1.1 are a combination of Hertig’s (1985) log-normal model (applied to cumulative payments) and Gogol’s (1993) Bayesian claims reserving model (applied to claims incurred). In contrast to the PIC reserving model in Merz and Wüthrich (2010), we now add an extra development period from J to J + 1. This is exactly the crucial step that allows for the consideration of tail development factors and it leads to the study of incurred-paid ratios for the inclusion of such tail development factors.

The PIC Model Assumptions 1.1 may be criticized because of two restrictive assumptions. We briefly discuss how these can be relaxed.

Assumption $P_{i,-1}=1$ for all $i \in\{0, \ldots, J\}:$ If there are known (prior) differences between different accident years $i$ , this can easily be integrated by setting $P_{i,-1}=v_i$ with constants $v_0, \ldots, v_J>0$ describing these prior differences.
Independence between $\xi_{i, j}$ and $\zeta_{i, l}$ : This is probably the main weakness of the model. However, this assumption can easily be relaxed in the spirit of Happ and Wüthrich (2013). To keep the analysis simple, we refrain from studying this more complex model in the present paper.

2. Estimation of tail development factors

At time J one has observed data given by the set

$D_{J}=\left\{P_{i, j}, I_{i, j}: i+j \leq J, 0 \leq i \leq J, 0 \leq j \leq J\right\},$

and one needs to predict the ultimate claim amounts $P_{i, J+1}=I_{i, J+1}$ , conditional on these observations $D_J$ . On the one hand, this involves the calculation of the conditional expectations $E\left[P_{i, J+1} \mid D_J, \Theta\right]$ and, on the other hand, it involves Bayesian inference on the parameters $\Theta$ , given $D_J$ (see Theorems 2.4 and 3.4 in Merz and Wüthrich 2010). In this section we discuss how to modify the general outline of Model Assumptions 1.1 to incorporate tail development estimation.

2.1. Ultimate claims prediction conditional on parameters

We apply Model Assumptions 1.1 to the tail development factor estimation problem. Therefore, we need to specify the prior distribution of the parameter vector Θ.

Often, there is subjectivity in claims incurred data I_{i, j} because the use of different claims adjusters with different estimation methods and changing reserving guidelines. Therefore, for the present set-up we have decided to consider claims incurred data I_{i, j} only for the estimation of tail development factors, i.e., we work under the assumption of having incomplete claims incurred triangles (see also Dahms (2008) and Happ and Wüthrich (2013) for claims reserving methods on incomplete data). It is not difficult to extend the model to incorporate all claims incurred information, but in the present work this would detract from the tail development factor estimation discussion.

The prediction based on incomplete claims incurred data is done as follows. Assume there exists J* ∈ {0, . . . , J} such that with probability 1

$\Psi_{J} \equiv \tau_{J}^{2} / 2 \tag{2.1}$

and if J* J

$\begin{aligned} \tau_{J^{*}} & =\tau_{J^{*}+1}=\cdots=\tau_{J-1} \equiv \tau, \\ \Psi_{J^{*}} & =\Psi_{J^{*}+1}=\cdots=\Psi_{J-1} \equiv \tau^{2} / 2 . \end{aligned} \tag{2.2}$

Note that if $J^*=J$ we simply assume $\Psi_J \equiv \tau_J^2 / 2$ . These assumptions imply that there is no substantial claims incurred development after claims development period $J^*$ , i.e., there is no systematic drift in the claims incurred development after $J^*$ . This is seen as follows, for $j \in\left\{J^*, \ldots, J\right\}$

$\begin{aligned} E\left[\exp \left\{-\xi_{i, j}\right\}\right] & =E\left[E\left[\exp \left\{-\zeta_{i, j}\right\} \mid \Theta\right]\right] \\ & =E\left[\exp \left\{-\Psi_{j}+\tau_{j}^{2} / 2\right\}\right]=1 \end{aligned}$

This implies that on average the claims incurred prediction is correct (and we have only pure random fluctuations around this prediction), i.e., for j ∈ {J* + 1, . . . , J + 1}

$\begin{aligned} E\left[I_{i, j-1} \mid I_{i, j}\right] & =I_{i, j}, \\ \operatorname{Vco}\left(I_{i, j-1} \mid I_{i, j}\right) & =\left(\exp \left\{\tau_{j-1}^{2}\right\}-1\right)^{1 / 2} \end{aligned}$

where Vco(·) denotes the coefficient of variation. The fact that we allow τ_J to differ from τ corresponds to the difficulty that the tail development factor may cover several development years beyond the last observed column in the claims development triangle and therefore we may allow for standard deviation parameters $\tau_J>\tau$ for the development period from $J$ to $J+1$ (possibly covering more than one period).

Remark. If there is expert judgment about a drift term in the claims incurred development $I_{i, J^*}, \ldots, I_{i, J+1}$ this can easily be integrated by adjusting assumptions (2.1)–(2.2). This also allows one to consider parametric curves, as mentioned in Section 1, but in this case it is more appropriate to treat this knowledge as informative as to prior distributions specifying prior uncertainty in this expert judgment, similar to Verrall and Wüthrich (2012).

Thus, assumptions (2.1)–(2.2) imply that there is no systematic drift in {J* + 1, . . . J + 1}, and under these assumptions we consider tail factor estimation under the restricted observations given by

$\begin{aligned} D_{J}^{*} & =\left\{P_{i, j}, I_{k, l}: i+j \leq J, k+l \leq J, l \geq J^{*}\right\} \\ & =D_{J} \cap\left\{P_{i, j}, I_{k, l}: l \geq J^{*}\right\} . \end{aligned}$

In this spirit, we consider all cumulative payment observations but only claims incurred observations from development year J* on. That is, only the claims incurred I_i,j from the latest J − J* + 1 development periods J*, J* + 1, . . . , J are used to estimate tail development factors and the claims reserves. We define the following parameters

$\begin{array}{c} \eta_{j}=\sum_{m=0}^{j} \Phi_{m} \quad \text { and } \quad w_{j}^{2}=\sum_{m=0}^{j} \sigma_{m}^{2}, \\ \text { for } j=0, \ldots, J+1, \\ \mu_{l}=\eta_{J+1}-\sum_{n=l}^{J} \Psi_{n} \quad \text { and } \quad v_{l}^{2}=w_{J+1}^{2}+\sum_{n=l}^{J} \tau_{n}^{2}, \\ \text { for } l=J^{*}, \ldots, J . \end{array}$

Moreover, we define the parameters

$\beta_{j}=\left\{\begin{array}{ll} \frac{w_{J+1}^{2}-w_{j}^{2}}{v_{j}^{2}-w_{j}^{2}}>0 & \text { for } j=J^{*}, \ldots, J \\ 0 & \text { for } j=0, \ldots, J^{*}-1 . \end{array}\right.$

The following result shows that β_j can be interpreted as the credibility weight for the claims incurred observations: Theorem 2.1.Under Model Assumptions 1.1 we have, conditional on Θ and D_J *,

$\begin{aligned} E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right] & =P_{i, J-i}^{1-\beta_{J-i}} I_{i, J-i}^{\beta_{J-i}} \\ & \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1}\left(\Phi_{l}+\sigma_{l}^{2} / 2\right)+\beta_{J-i} \sum_{l=J-i}^{J} \Psi_{l}\right\} . \end{aligned}$

For the conditional variance we obtain

$\begin{array}{l} \operatorname{Var}\left(P_{i, J+1} \mid D_{J}^{*}, \Theta\right)=E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]^{2} \\ \qquad\left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) . \end{array}$

For $i>J-J^*$ there holds $\beta_{J-i}=0$ and, therefore, we obtain a purely claims payment based prediction [see also Hertig’s model (1985) presented in Section 2.1 of Merz and Wüthrich (2010)]

$P_{i, J-i} \exp \left\{\sum_{l=J-i+1}^{J+1}\left(\Phi_{l}+\sigma_{l}^{2} / 2\right)\right\}$

For $i \leq J-J^*$ there holds $\beta_{J-i}>0$ and, therefore, we obtain a correction term to the purely claims payment based prediction which is based on the claims incurred-paid ratio $I_{i, J-i} / P_{i, J-i}$ , i.e., for a large incurredpaid ratio we get a higher expected ultimate claim as can be seen from

$\begin{aligned} P_{i, J-i}^{1-\beta \beta_{J-i}} I_{i, J-i}^{\beta_{J-i}} & =\exp \left\{\left(1-\beta_{J-i}\right) \log P_{i, J-i}+\beta_{J-i} \log I_{i, J-i}\right\} \\ & =P_{i, J-i} \exp \left\{\beta_{J-i} \log \frac{I_{i, J-i}}{P_{i, J-i}}\right\}. \end{aligned}$

2.2. Parameter estimation, the general case

The likelihood function of the restricted observations D_J* is given by [see also (3.5) in Merz and Wüthrich (2010)]

$\begin{aligned} l_{D_{j}^{*}}(\Theta) \propto & \prod_{J=0}^{J} \prod_{i=0}^{J-j} \frac{1}{\sigma_{j}} \exp \left\{-\frac{1}{2 \sigma_{j}^{2}}\left(\Phi_{j}-\log \frac{P_{i, j}}{P_{i, j-1}}\right)^{2}\right\} \\ & \times \prod_{i=0}^{J-J *} \frac{1}{\sqrt{v_{J-i}^{2}-w_{J-i}^{2}}} \exp \left\{-\frac{1}{2\left(v_{J-i}^{2}-w_{J-i}^{2}\right)}\right. \\ & \left.\left(\mu_{J-i}-\eta_{J-i}-\log \frac{I_{i, J-i}}{P_{i, J-i}}\right)^{2}\right\} \times \prod_{J=J-1}^{J-1} \prod_{t=0}^{J-1} \frac{1}{\tau_{j}} \\ & \exp \left\{-\frac{1}{2 \tau_{j}^{2}}\left(\Psi_{j}+\log \frac{I_{i, j}}{I_{i, j+1}}\right)^{2}\right\}, \end{aligned}$

where ∝ means that only relevant terms dependent on Θ are considered. The first line describes the claims payment development, the last line describes the claims incurred development, and the middle line describes the gap between the diagonal claims incurred and the diagonal claims payment observations.

In order to perform a Bayesian inference analysis on the parameters we need to specify the prior distribution of Θ.

Model Assumptions 2.2. PIC tail development factor model

We assume Model Assumptions 1.1 hold true with positive constants $\sigma_0, \ldots, \sigma_{J+1}, \tau_{J^*}=\cdots=\tau_{J-1}=\tau$ , $\Psi_{J^*}=\cdots=\Psi_{J-1}=\tau^2 / 2$ and $\Psi_J=\tau_J^2 / 2$ . Moreover, it holds

$\mathbf{\Phi}_{m} \sim N\left(\phi_{m}, s_{m}^{2}\right) \quad \text { for } m \in\{0, \ldots, J+1\}$

with prior parameters _m ∈ ℝ and s_m 0. □

Under Model Assumptions 2.2 the posterior distribution $u\left(\Phi \mid D_J^*\right)$ of $\Phi=\left(\Phi_0, \ldots, \Phi_{J+1}\right)$ , given $D_J^*$ , is given by

$u\left(\Phi \mid D_{J}^{*}\right) \propto l_{D_{J}^{*}}(\Theta) \prod_{m=0}^{J+1} \exp \left\{-\frac{1}{2 s_{m}^{2}}\left(\Phi_{m}-\phi_{m}\right)^{2}\right\} \tag{2.3}$

This immediately implies the following theorem:

Theorem 2.3. Under Model Assumptions 2.2 the posterior $u\left(\Phi \mid D_J^*\right)$ of $\Phi$ is a multivariate Gaussian distribution with posterior mean $\left(\phi_0^{\text {post }}, \ldots, \phi_{+1}^{\text {post }}\right)$ and posterior covariance matrix $\Sigma\left(D_J^*\right)$ . Define the posterior standard deviation by

$s_{j}^{\text {post }}=\left(s_{j}^{-2}+(J-j+1) \sigma_{j}^{-2}\right)^{-1 / 2} \quad \text { for } j=0, \ldots, J+1$

Then, the inverse covariance matrix $\Sigma\left(D_J^*\right)^{-1}=$ $\left(a_{n, m}\right)_{0 \leq n, m \leq J+1}$ is given by

$a_{n, m}=\left(s_{n}^{\text {post }}\right)^{-2} 1_{\{n=m\}}+\left[\sum_{i=J^{*}}^{(n-1) \wedge(m-1)}\left(v_{i}^{2}-w_{i}^{2}\right)^{-1}\right] 1_{\left\{n, m \geq J^{*}+1\right\}}$

The posterior mean $\left(\phi_0^{\text {post }}, \ldots, \phi_{J+1}^{\text {post }}\right)$ is obtained by

$\left(\phi_{0}^{\text {post }}, \ldots, \phi_{J+1}^{\text {post }}\right)^{\prime}=\sum\left(D_{J}^{*}\right)\left(c_{0}, \ldots, c_{j+1}\right)^{\prime}$

with vector $\left(c_0, \ldots, c_{J+1}\right)$ given by

$\begin{aligned} c_{j}= & \frac{\phi_{j}}{s_{j}^{2}}+\frac{1}{\sigma_{j}^{2}} \sum_{i=0}^{J-j} \log \frac{P_{i, j}}{P_{i, j-1}} \\ & +\left[\sum_{i=J-j+1}^{J-J^{*}} \frac{1}{v_{J-i}^{2}-w_{J-i}^{2}}\left(\log \frac{I_{i, J-i}}{P_{i, J-i}}+\frac{i \tau^{2}+\tau_{J}^{2}}{2}\right)\right] 1_{\left\{j, J^{*}+1\right\}} . \end{aligned}$

Note that the last term in the definition of $a_{n, m}$ and in the definition of $c_j$ corresponds to the development years in $D_J^*$ where we have both claims payments and claims incurred information. Theorem 2.3 immediately implies the following corollary:

Corollary 2.4. Under Model Assumptions 2.2 the posterior $u\left(\Phi \mid D_J^*\right)$ of $\Phi$ is a multivariate Gaussian distribution with $\Phi_0, \ldots, \Phi_{J^*},\left(\Phi_{J^*+1}, \ldots, \Phi_{J+1}\right)$ being independent with

$\left.\mathbf{\Phi}_{j}\right|_{\left\{D_{j}^{*}\right\}} \sim N\left(\phi_{j}^{\text {post }}=\gamma_{j} \bar{\phi}_{j}+\left(1-\gamma_{j}\right) \phi_{j},\left(s_{j}^{\text {post }}\right)^{2}\right) \tag{2.4}$

for j ≤ J* and credibility weight and empirical mean defined by

$\begin{array}{r} \gamma_{j}=\frac{J-j+1}{J-j+1+\sigma_{j}^{2} / s_{j}^{2}} \text { and } \overline{\phi_{j}}=\frac{1}{J-j+1} \sum_{i=0}^{J-j} \log \frac{P_{i, j}}{P_{i, j-1}} \\ \quad \text { for } j=0, \ldots, J^{*} \end{array}$

Henceforth, Corollary 2.4 shows that for development years $j \leq J^*$ we obtain the well-known credibility weighted average between the prior mean $\phi_j$ and the average observation $\bar{\phi}_{j^*}$ . The case $j>J^*$ is more involved: one basically obtains a weighted average between the prior mean $\phi_j$ , the average observation $\bar{\phi}_j$ , and the incurred-paid ratios $\log I_{i, J-i} / P_{i, J-i}, i \geq J-j+1$ .

Remark. Model Assumptions 2.2 specify a Bayesian model with multivariate Gaussian distributions. This setup allows for closed-form solutions. For other distributional assumptions the problem can only be solved numerically using Markov chain Monte Carlo methods. Bayesian statistics, like the Bayesian information criterion BIC, would then allow for model testing and model selection. If one restricts to linear credibility estimators, see Bühlmann and Gisler (2005), then _j^post given in (2.4) corresponds to the linear credibility estimator in more general models.

2.3. Parameter estimation, special case J* = J

We consider the special case $J^*=J$ , that is, only the claims incurred observation $I_{0, J}$ is considered in the tail development factor analysis. This immediately provides:

Corollary 2.5. Choose $J^*=J$ . Under Model Assumptions 2.2 , the posterior distribution $u\left(\Phi \mid D_J^*\right)$ of $\Phi$ is a multivariate Gaussian distribution with $\Phi_0, \ldots, \Phi_{J+1}$ being independent. For $m \leq J^*=J$ the posterior distribution of $\Phi_m$ is given by (2.4). The posterior of $\Phi_{J+1}$ is given by

$\begin{aligned} \left.\mathbf{\Phi}_{J+1}\right|_{\left\{D_{J}^{*}\right\}} &\sim N\left(\phi_{J+1}^{\text {post }}=\gamma_{J+1}\left(\log \frac{I_{0, J}}{P_{0, J}}\right.\right. & \left.+\frac{\mathbf{\tau}_{J}^{2}}{2}\right) \\ &\quad \left.+\left(1-\gamma_{j+1}\right) \phi_{J+1}, a_{J+1, J+1}^{-1}\right), \end{aligned}$

with inverse variance given by

$a_{J+1, J+1}=s_{J+1}^{-2}+\left(\sigma_{J+1}^{2}+\tau_{J}^{2}\right)^{-1}$ and credibility weight given by $\gamma_{J+1}=\frac{1}{1+\left(\sigma_{J+1}^{2}+\tau_{J}^{2}\right) / s_{J+1}^{2}} .$

This means that in the case $J^*=J$ we obtain a credibility-weighted average between the prior tail development factor $\phi_{J+1}$ and the observation $\log \frac{I_{0, J}}{P_{0, J}}$ . Henceforth, in this case only the latest incurredpaid ratio is considered for the estimation of the tail development factor.

3. Posterior claims prediction and prediction uncertainty

3.1. General case

In view of Theorems 2.1 and 2.3 we can now predict the ultimate claim $P_{i, J+1}$ , conditional on the restricted observations $D_J^*$ , under Model Assumptions 2.2.

Proposition 3.1. Bayesian ultimate claims predictor. Under Model Assumptions 2.2 we predict the ultimate claim $P_{i, J+1}$ , given $D_J^*$ , by

$\begin{aligned} E\left[P_{i, J+1} \mid D_{J}^{*}\right]= & P_{i, J-i}^{1-\beta_{J-i}} I_{i, J-i}^{\beta_{J-i}} \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \frac{\sigma_{l}^{2}}{2}\right. \\ & \left.+\beta_{J-i} \frac{i \tau^{2}+\tau_{J}^{2}}{2}\right\} \times \exp \left\{\left(1-\beta_{J-i}\right) \sum_{j=J-i+1}^{J+1} \phi_{j}^{p o s t}\right. \\ & \left.+\left(1-\beta_{J-i}\right)^{2} \frac{e_{J-i+1}^{\prime} \sum\left(D_{J}^{*}\right) e_{J-i+1}}{2}\right\}, \end{aligned}$

where $e_j=(0, \ldots, 0,1, \ldots, 1)^{\prime} \in \mathbb{R}^{J+2}$ with the first $j$ components equal to $O$ .

Next we determine the prediction uncertainty. Model Assumptions 2.2 and Theorem 2.3 constitute a full distributional model which allows for the calculation of any risk measure (using Monte Carlo simulations) under the posterior distribution, given D_J*. Here, we use the most popular measure for the prediction uncertainty in claims reserving, the so-called conditional mean square error of prediction (MSEP). The conditional MSEP has the advantage that we can calculate it analytically. Analytical solutions have the advantage that they allow for more basic sensitivity analysis. The conditional MSEP is given by (see also Section 3.1 in Wüthrich and Merz (2008))

$\begin{array}{l} \operatorname{msep}^{\sum_{i=0}^{J} P_{i, J+1 \mid p^{*}}}\left(E\left[\sum_{i=0}^{J} P_{i, J+1} \mid D_{J}^{*}\right]\right) \\ \quad=E\left[\left(\sum_{i=0}^{J} P_{i, J+1}-E\left[\sum_{i=0}^{J} P_{i, J+1} \mid D_{J}^{*}\right]\right)^{2} \mid D_{J}^{*}\right] \\ \quad=\operatorname{Var}\left(\sum_{i=0}^{J} P_{i, J+1} \mid D_{J}^{*}\right), \end{array}$

i.e., in this Bayesian setup the conditional MSEP is equal to the posterior variance. This posterior variance allows for the usual decoupling into average processes error and average parameter estimation error; see (A.3). The conditional MSEP satisfies

$\operatorname{Var}\left(\sum_{i=0}^{J} P_{i, J+1} \mid D_{J}^{*}\right)=\sum_{i, k=0}^{J} \operatorname{Cov}\left(P_{i, J+1}, P_{k, J+1} \mid D_{J}^{*}\right) .$

We obtain the following theorem:

Theorem 3.2. Under Model Assumptions 2.2 the conditional MSEP of the Bayesian predictor $E\left[\sum_{i=0}^J\right.$ $\left.P_{i, J+1} \mid D_J^*\right]$ for the aggregate ultimate claim $\sum_{i=0}^J P_{i, J+1}$ is given by

$\begin{aligned} & \operatorname{msep}_{\sum_{i=0}^J P_{i, J+1} \mid D_J^*}\left(E\left[\sum_{i=0}^J P_{i, J+1} \mid D_J^*\right]\right) \\ & =\sum_{0 \leq i, k \leq J}\left(e^{\left(1-\beta_{J-i}\right)\left(1-\beta_{J-k}\right) e_{-i-i+1} \Sigma\left(D_J^*\right) e_{J-k+1}+1+1_{i=k \xi}\left(1-\beta_{J-i}\right) \sum_{l=J-i+1+\sigma_l^2}^{J+1}}-1\right) \\ & \times E\left[P_{i, J+1} \mid D_J^*\right] E\left[P_{k, J+1} \mid D_J^*\right] \text {. } \\ & \end{aligned}$

3.2. Special case J* = J with non-informative priors

We revisit the special case J* = J and we also assume non-informative priors meaning that s_j² → ∞. In that case we obtain that the posterior distributions of Φ₀, . . . , Φ_*J*+1 are independent Gaussian distributions with

$\begin{array}{c} \left.\Phi_{j}\right|_{\left\{D^{j}\right\}} \sim N\left(\phi_{j}^{\text {post }}=\overline{\phi_{j}}=\frac{1}{J-j+1} \sum_{i=0}^{J-j} \log \frac{P_{i, j}}{P_{i, j-1}}\right. \\ \left.\left(s_{j}^{\text {post }}\right)^{2}=\frac{\sigma_{j}^{2}}{J-j+1}\right), \end{array}$

for j ≤ J, and

$\begin{array}{r} \left.\mathbf{\Phi}_{j+1}\right|_{\left\{D_{j}^{*}\right\}} \sim N\left(\phi_{J+1}^{\text {post }}=\log \frac{I_{0, J}}{P_{0, J}}+\frac{\tau_{J}^{2}}{2},\right. \\ \left.\left(s_{J+1}^{\text {post }}\right)^{2}=a_{J+1, J+1}^{-1}=\sigma_{J+1}^{2}+\tau_{J}^{2}\right) . \end{array}$

This implies for the ultimate claim prediction for i 0

$\begin{aligned} E\left[P_{i, J+1} \mid D_{J}^{*}\right] & =P_{i, J-i} \exp \left\{\sum_{l=J-i+1}^{J+1} \phi_{l}^{\text {post }}+\frac{\sigma_{l}^{2}}{2}+\frac{\left(s_{l}^{\text {post }}\right)^{2}}{2}\right\} \\ & =P_{i, J-i} \prod_{l=-i+1}^{J} \hat{f}_{l} \hat{f}_{J+1}^{(u l t)} \end{aligned} \tag{3.1}$

with chain-ladder factors

$\hat{f}_{l}=\exp \left\{\phi_{l}^{\text {post }}+\left(1+\frac{1}{J-l+1}\right) \frac{\sigma_{l}^{2}}{2}\right\} \tag{3.2}$

$\hat{f}_{J+1}^{(u l t)}=\frac{I_{0, J}}{P_{0, J}} \exp \left\{\sigma_{J+1}^{2}+\tau_{J}^{2}\right\} \tag{3.3}$

That is, the first terms in the product on the right-hand side of (3.1) are the classical chain-ladder factors for Hertig’s log-normal model (1985); see also (5.11)–(5.12) in Wüthrich and Merz (2008). The last term in (3.1), however, describes the tail development factor (adjusted for the variance).

For i = 0 we have

$E\left[P_{0, J+1} \mid D_{J}^{*}\right]=P_{0, J} \hat{f}_{J+1}^{(u l t)}=I_{0, J} \exp \left\{\sigma_{J+1}^{2}+\tau_{J}^{2}\right\} \tag{3.4}$

4. Example

In this section we provide an example. We assume that J = 9 and that the claims payment data P_i,j and the claims incurred data I_i,j for i + j ≤ J are given by Tables 1 and 2, respectively.

Table 1.Observed claims payments data

$P_{i, j}, i+j \leq J$ .

	0	1	2	3	4	5	6	7	8	9
0	1,216,632	1,347,072	1,786,877	2,281,606	2,656,224	2,909,307	3,283,388	3,587,549	3,754,403	3,821,258
1	798,924	1,051,912	1,215,785	1,349,939	1,655,312	1,926,210	2,132,833	2,287,311	2,567,056
2	1,115,636	1,387,387	1,930,867	2,177,002	2,513,171	2,931,930	3,047,368	3,182,511
3	1,052,161	1,321,206	1,700,132	1,971,303	2,298,349	2,645,113	3,003,425
4	808,864	1,029,523	1,229,626	1,590,338	1,842,662	2,150,351
5	1,016,862	1,251,420	1,698,052	2,105,143	2,385,339
6	948,312	1,108,791	1,315,524	1,487,577
7	917,530	1,082,426	1,484,405
8	1,001,238	1,376,124
9	841,930

Table 2.Observed claims incurred data

$I_{i, j}, i+j \leq J$ .

	0	1	2	3	4	5	6	7	8	9
0	3,362,115	5,217,243	4,754,900	4,381,677	4,136,883	4,094,140	4,018,736	4,001,591	4,001,391	4,001,258
1	2,640,443	4,643,860	3,869,954	3,248,558	3,102,002	3,019,980	2,976,064	2,966,941	2,959,955
2	2,879,697	4,785,531	4,045,448	3,467,822	3,377,540	3,341,934	3,283,928	3,287,827
3	2,933,345	5,299,146	4,451,963	3,700,809	3,553,391	3,469,505	3,413,921
4	2,768,181	4,658,933	3,936,455	3,512,735	3,385,129	3,298,998
5	3,228,439	5,271,304	4,484,946	3,798,384	3,702,427
6	2,927,033	5,067,768	4,066,526	3,704,113
7	3,083,429	4,790,944	4,408,097
8	2,761,163	4,132,757
9	3,045,376

We first need to determine $J^* \leq J$ . We choose the value $J^*$ such that there is no substantial claims incurred development (no systematic drift) after development period $J^*$ . This choice is made based on actuarial judgment. We therefore look at the individual chain-ladder factors $I_{i, J+1} / I_{i, j}, j \geq 0$ and $i+j+1 \leq J$ . These are provided in Table 3. In the upper right triangle in Table 3 (with the individual chain ladder factors for years $6,7,8$ ) we see no further systematic development, so we concentrate on possible choices $J^* \in\{6, \ldots, 9\}$ .

Table 3.Individual chain ladder factors

$I_{i, j+1} / I_{i, j}$ for

$j \geq 0$ and

$i+j+1 \leq J$ .

	0	1	2	3	4	5	6	7	8	9
0	1.5518	0.9114	0.9215	0.9441	0.9897	0.9816	0.9957	1.0000	1.0000
1	1.7587	0.8333	0.8394	0.9549	0.9736	0.9855	0.9969	0.9976
2	1.6618	0.8453	0.8572	0.9740	0.9895	0.9826	1.0012
3	1.8065	0.8401	0.8313	0.9602	0.9764	0.9840
4	1.6830	0.8449	0.8924	0.9637	0.9746
5	1.6328	0.8508	0.8469	0.9747
6	1.7314	0.8024	0.9109
7	1.5538	0.9201
8	1.4967
9
average		1.6529	0.8561	0.8714	0.9619	0.9807	0.9834	0.9980	0.9988	1.0000

The standard deviation parameters $s_j, \sigma_j$ and $\tau_j$ should be determined with prior knowledge only. In our example we assume that we have noninformative priors, which means that we set $s_j=\infty$ . For $\sigma_j$ and $\tau_j$ we take an empirical Bayesian point of view and estimate them from the data. For j = 0, . . . , J − 1 we set

$\hat{\mathbf{\sigma}}_{j}^{2}=\frac{1}{J-j} \sum_{i=0}^{J-j}\left(\log \frac{P_{i, j}}{P_{i, j-1}}-\overline{\phi_{j}}\right)^{2}$

Unfortunately, $\sigma_J$ and $\sigma_{J+1}$ cannot be estimated from the data, because we do not have sufficient observations. Therefore, we make the ad hoc choice

$\hat{\mathbf{\sigma}}_{J+1}=\hat{\mathbf{\sigma}}_{J}=\min \left\{\hat{\mathbf{\sigma}}_{J-1}, \hat{\mathbf{\sigma}}_{J-2}, \hat{\mathbf{\sigma}}_{J-1}^{2} / \hat{\mathbf{\sigma}}_{J-2}\right\} .$

We estimate the parameter $\tau=\tau_J^*=\cdots=\tau_{J-1}$ with the empirical standard deviation of $\log I_{i, j+1} / I_{i, j}$ for $i+j+1 \leq J$ and $j \geq 6$ (because we assume that there is no systematic claims incurred development after development period 6; see Table 3). Finally, for $\tau_J$ we do the ad hoc (expert) choice $\tau_J^2=3 \tau^2$ . This suggests that we have (approximately) another three uncorrelated development periods beyond $J=9$ until all claims are finally settled. Of course, additional information about $\tau_J$ (if available) should be used here. These choices provide the standard deviation parameters given in Table 4. Now we are ready to calculate the claims reserves and the corresponding prediction uncertainty in our model according to Proposition 3.1 and Theorem 3.2. We do this for J* ∈ {6, . . . , 9}. The results are provided in Table 5.

Table 4.Estimated

$\hat{\sigma}_j$ for

$j=0, \ldots, J+1$ , and

$\hat{\tau}_{\mathrm{j}}$ for

$j=6, \ldots, J$ .

	0	1	2	3	4	5	6	7	8	9	10
σ̂_j	0.1393	0.0650	0.0731	0.0640	0.0264	0.0271	0.0405	0.0227	0.0494	0.0227	0.0227
τ̂_j							0.0021	0.0021	0.0021	0.0037

Table 5.Estimated claims reserves and corresponding prediction standard deviation in the PIC tail development factor model (Model Assumptions 2.2) for

$J^* \in\{6, \ldots, 9\}$ , and the estimated claims reserves according to Hertig’s model (1985) [see Section 3.1 in Merz and Wüthrich (2010)] without tail development factor

	reserves	msep^1/2	reserves	msep^1/2	reserves	msep^1/2	reserves	msep^1/2	reserves	msep^1/2
Hertig's model [6]			J = 9		J = 8		J = 7		J = 6
no tail factor			PIC tail factor		PIC tail factor		PIC tail factor		PIC tail factor
0	0	0	180,054	14,652	182,752	14,599	182,024	14,594	181,551	14,590
1	47,060	83,995	171,647	124,884	391,633	12,439	390,918	12,433	390,454	12,428
2	336,189	241,482	503,888	279,793	701,497	276,256	107,490	15,517	106,616	15,505
3	549,682	261,129	719,020	299,020	918,561	297,415	673,923	263,493	411,103	17,629
4	655,906	242,377	789,650	271,269	947,248	273,746	754,032	246,311	613,774	221,380
5	1,190,955	326,696	1,361,399	363,250	1,562,242	368,106	1,316,008	332,649	1,137,263	300,944
6	1,115,656	249,249	1,239,724	275,751	1,385,920	280,339	1,206,683	254,165	1,076,573	231,061
7	1,611,611	365,019	1,759,165	396,734	1,933,036	407,990	1,719,870	374,105	1,565,129	345,667
8	2,310,950	521,674	2,486,673	560,910	2,693,737	580,909	2,439,876	536,249	2,255,594	500,075
9	1,954,075	440,471	2,087,331	471,323	2,244,354	489,676	2,051,844	453,365	1,912,098	424,462
tot	9,772,084	1,519,464	11,298,552	1,747,672	12,960,980	1,624,873	10,842,668	1,292,329	9,650,155	1,022,505

Interpretations

The analysis shows that in the presence of tail development, Hertig’s model (1985) may substantially underestimate the outstanding loss lia-bilities compared to the PIC tail development factor models for J* = 9, 8, 7. Only the PIC tail development factor model for J* = 6 gives similar reserves. This comes from the fact that the incurred development factors still give a downward trend to incurred losses in development periods 6 and 7 (see average in Table 3), which contradicts our model assumptions (2.1)–(2.2) and suggests to choose J* = 8 or 9. Of course, as mentioned above, this expert choice is based on the rationale that there is no systematic drift after J*, and statistical methods could justify this hypothesis/choice.
Including tail development factors for J* = 8, 9 also gives a higher prediction uncertainty msep^1/2 compared to Hertig’s model (1985) without tail development factors. This finding is in line with the ones in Verrall and Wüthrich (2012) and shows that prediction uncertainty needs a careful evaluation in the presence of tail development.
Note that for J* = 9 we simultaneously consider claims payments and claims incurred information for accident year i = 0. For J* = 8 we simultaneously consider claims payments and claims incurred information for accident years i = 0, 1. This results in a much lower prediction uncertainty in these accident years (above the horizontal line in the corresponding columns of Table 5). The reason is that the claims incurred information has only little uncertainty (since we assume Ψ_j to be constant for j ≥ J*). This substantially reduces the prediction uncertainty.

We may question whether there is so much information in these last claims incurred observations. If this is not the case, we should either increase τ and τ_J or we should use less informative priors in (2.1)–(2.2). The latter would bring us back to the model of Merz and Wüthrich (2010) and Happ and Wüthrich (2013) with the additional assumption that there is no systematic drift after J*. Moreover, this latter model would also allow us to consider more information than just the restricted one given by D_J*. In the present work we have decided to work with the restricted information D_J* only because then we can fully concentrate on tail factor estimation. Otherwise tail factor estimation would be more hidden in the data and analysis.

5. Conclusion

We have modified the PIC reserving model from Merz and Wüthrich (2010) so that it allows for the incorporation of tail development factors. These tail development factors are estimated considering claims incurred-paid ratios in an appropriate way. This extends the ad hoc methods used in practice and because we perform our analysis in a mathematically consistent way we also obtain formulas for the prediction uncertainty. These are obtained analytically for the conditional MSEP and these can be obtained numerically for other uncertainty measures using Monte Carlo simulations (because we work in a Bayesian setup). The case study highlights the need to incorporate tail development factors in the presence of tail development, since otherwise both the outstanding loss liabilities and the prediction uncertainty are underestimated.

References

Boor, J. 2006. “Estimating Tail Development Factors: What to Do When the Triangle Runs Out.” Casualty Actuarial Society Forum, Winter, 345–90.

Google Scholar

Bühlmann, H., and A. Gisler. 2005. A Course in Credibility Theory and Its Applications. New York: Springer.

Google Scholar

Dahms, R. 2008. “A Loss Reserving Method for Incomplete Claim Data.” Bulletin of the Swiss Association of Actuaries, 127–48.

Google Scholar

Gogol, Daniel. 1993. “Using Expected Loss Ratios in Reserving.” Insurance: Mathematics and Economics 12 (3): 297–99. https://doi.org/10.1016/0167-6687(93)90240-p.

Google Scholar

Happ, Sebastian, and Mario V. Wüthrich. 2013. “Paid-Incurred Chain Reserving Method with Dependence Modeling.” ASTIN Bulletin 43 (1): 1–20. https://doi.org/10.1017/asb.2012.4.

Google Scholar

Hertig, Joakim. 1985. “A Statistical Approach to IBNR-Reserves in Marine Reinsurance.” ASTIN Bulletin 15 (2): 171–83. https://doi.org/10.2143/ast.15.2.2015027.

Google Scholar

Johnson, R. A., and D. W. Wichern. 1988. Applied Multivariate Statistical Analysis. 2nd ed. Englewood Cliffs, NJ: Prentice-Hall.

Google Scholar

Mack, Thomas. 1999. “The Standard Error of Chain Ladder Reserve Estimates: Recursive Calculation and Inclusion of a Tail Factor.” ASTIN Bulletin 29 (2): 361–66. https://doi.org/10.2143/ast.29.2.504622.

Google Scholar

Merz, Michael, and Mario V. Wüthrich. 2010. “Paid–Incurred Chain Claims Reserving Method.” Insurance: Mathematics and Economics 46 (3): 568–79. https://doi.org/10.1016/j.insmatheco.2010.02.004.

Google Scholar

Posthuma, B., E.A. Cator, W. Veerkamp, and E.W. van Zwet. 2008. “Combined Analysis of Paid and Incurred Losses.” Casualty Actuarial Society E-Forum, Autumn, 272–93.

Google Scholar

Verrall, Richard J., and Mario V. Wüthrich. 2012. “Reversible Jump Markov Chain Monte Carlo Method for Parameter Reduction in Claims Reserving.” North American Actuarial Journal 16 (2): 240–59. https://doi.org/10.1080/10920277.2012.10590639.

Google Scholar

Wüthrich, M.V., and M. Merz. 2008. Stochastic Claims Reserving Methods in Insurance. Hoboken, NJ: Wiley.

Google Scholar

A. Appendix: Proofs

In this appendix we prove all the statements. We start with a well-known result for multivariate Gaussian distributions, see, e.g., Appendix A in Posthuma et al. (2008) and Johnson and Wichern (1988): Lemma A.1. Assume (X₁, . . . , X_n)′ is multivariate Gaussian distributed with mean (m₁, . . . , m_n)′ and positive definite covariance matrix Σ. Then we have for the conditional distribution:

$\begin{array}{l} \left.X_{1}\right|_{\left\{X_{2}, \ldots X_{n}\right\}} \sim N\left(m_{1}+\sum_{1,2} \sum_{2,2}^{-1}\left(X^{(2)}-m^{(2)}\right),\right. \\ \left.\sum_{1,1}-\sum_{1,2} \sum_{2,2}^{-1} \sum_{2,1}\right) \\ \end{array}$

where X⁽²⁾ = (X₂, . . . , X_n)′ is multivariate Gaussian with mean m⁽²⁾ = (m₂, . . . , m_n)′ and positive definite covariance matrix Σ_2,2, Σ_1,1 is the variance of X₁ and Σ_1,2 = Σ_2,1′ is the covariance vector between X₁ and X⁽²⁾.

Proof of Theorem 2.1

We first consider the case i J − J*, that is I_i,k ∉ D_J* for k = 0, . . . , J − i, henceforth for accident years i J − J* we do not consider claims incurred information. Using the conditional independence of accident years, given the parameters Θ, we obtain

$E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]=E\left[P_{i, J+1} \mid P_{i, 0}, \ldots, P_{i, J-1}, \Theta\right]$

Furthermore, ${i}>{J}-{J^*}$ implies $\beta_{J-i}=0$ . Therefore, the claim follows from Model Assumptions 1.1, as in (2.2) in Merz and Wüthrich (2010), and because $\beta_{j}=0$ for ${j}<{J^*}$ . Similarly, we obtain for the conditional variance

$\operatorname{Var}\left(P_{i, J+1} \mid D_{J}^{*}, \Theta\right)=E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]^{2}\left(\exp \left\{\sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right)$

The case i ≤ J − J* is more involved. Using again the independence of accident years conditional on Θ, we obtain

$E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]=E\left[P_{i, J+1} \mid P_{i, 0}, \ldots, P_{i, J-i}, I_{i, J^{*}}, \ldots, I_{i, J-i}, \Theta\right]$

henceforth, we now have both claims payments and claims incurred observations for accident year i ≤ J − J*. We set j = J − i, then using Lemma A.1 we obtain completely analogous to Theorem 2.4 and Corollary 2.5 in Merz and Wüthrich (2010)

$\begin{aligned} E & {\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right] } \\ = & \exp \left\{\eta_{J+1}+\left(1-\beta_{j}\right)\left(\log P_{i, j}-\eta_{j}\right)+\beta_{j}\left(\log I_{i, j}-\mu_{j}\right)\right. \\ & \left.+\left(1-\beta_{j}\right)\left(w_{J+1}^{2}-w_{J}^{2}\right) / 2\right\} \\ & =P_{i, j}^{1-\beta_{j}} I_{i, j}^{\beta_{j}} \exp \left\{\left(1-\beta_{j}\right) \sum_{l=j+1}^{J+1}\left(\Phi_{l}+\sigma_{l}^{2} / 2\right)+\beta_{j} \sum_{l=j}^{J} \Psi_{l}\right\} . \end{aligned}$

Analogously, Theorem 2.4 from Merz and Wüthrich (2010) implies for the variance

$\begin{aligned} \operatorname{Var}\left(P_{i, J+1} \mid D_{j}^{*}, \Theta\right)= & E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]^{2} \\ & \left(\exp \left\{\left(1-\beta_{j}\right) \sum_{l=j+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) . \end{aligned}$

This proves the theorem.

Proof of Theorem 2.3 and Corollary 2.4

We first write all the relevant terms of the likelihood of Φ, given D_J*. They are given by

$\begin{array}{l} u\left(\Phi \mid D_{J}^{*}\right) \\ \propto \prod_{J=0}^{J^{*}} \exp \left\{-\frac{1}{2 s_{j}^{2}}\left(\Phi_{j}-\phi_{j}\right)^{2}-\frac{1}{2 \sigma_{j}^{2}} \sum_{i=0}^{J-j}\left(\Phi_{j}-\log \frac{P_{i, j}}{P_{i, j-1}}\right)^{2}\right\} \\ \times \prod_{j=J+1}^{J} \exp \left\{-\frac{1}{2 s_{j}^{2}}\left(\Phi_{j}-\phi_{j}\right)^{2}-\frac{1}{2 \sigma_{j}^{2}} \sum_{i=0}^{J-j}\left(\Phi_{j}-\log \frac{P_{i, j}}{P_{i, j-1}}\right)^{2}\right\} \\ \times \exp \left\{-\frac{1}{2 s_{J+1}^{2}}\left(\Phi_{J+1}-\phi_{J+1}\right)^{2}\right\} \times \prod_{i=0}^{J-J^{*}} \exp \left\{-\frac{1}{2\left(v_{J-i}^{2}-w_{J-i}^{2}\right)}\right. \\ \left.\left(\sum_{m=J-i+1}^{J+1} \Phi_{m}-\frac{i \tau^{2}+\tau_{J}^{2}}{2}-\log \frac{I_{i, J-i}}{P_{i, J-i}}\right)^{2}\right\} . \end{array} \tag{A.1}$

From this we easily see that the posterior distribution of $\Phi$ , given $D_J^*$ , is again multivariate Gaussian and there only remains to determine the posterior mean and covariance matrix. If we square out all terms in (A.1) for obtaining the $\Phi_j^2$ and the $\Phi_j \Phi_n$ terms, we find the covariance matrix $\Sigma\left(D_J^*\right)$ . First of all, we observe that the development periods with $j \leq J^*$ are all on the first line of (A.1) which proves the independence statement on $\Phi_0, \ldots, \Phi_{J^*},\left(\Phi_{J^*+1}\right. \left.\ldots, \Phi_{J+1}\right)$ . Moreover, we see for $j \leq J^*$ that the posterior variance of $\Phi_j$ , given $D_J^*$ , is given by

$s_{j}^{\text {post }}=\left(s_{j}^{-2}+(J-j+1) \sigma_{j}^{-2}\right)^{-1 / 2},$

which provides a_n,m for n, m = 0, . . . , J*. The posterior mean is given by

$\phi_{j}^{\text {post }}=\left(s_{j}^{\text {post }}\right)^{2}\left(\frac{\phi_{j}}{s_{j}^{2}}+\frac{1}{\sigma_{j}^{2}} \sum_{i=0}^{J-j} \log \frac{P_{i, j}}{P_{i, j-1}}\right)$

Next, we square out all terms for j J* to get the covariance matrix. We obtain

$\begin{array}{l} \sum_{n=J^{*}+1}^{J+1}\left(\frac{1}{s_{n}^{2}}+\frac{J-n+1}{\mathbf{\sigma}_{n}^{2}}\right) \mathbf{\Phi}_{n}^{2}+\sum_{n, m=J^{*}+1}^{J+1} \mathbf{\Phi}_{n} \mathbf{\Phi}_{m} \sum_{i=(J-n+1) \vee(J-m+1)}^{J-J^{*}}\left(v_{J-i}^{2}-w_{J-i}^{2}\right)^{-1} \\ =\sum_{n=J^{*}+1}^{J+1}\left(\frac{1}{s_{n}^{2}}+\frac{J-n+1}{\mathbf{\sigma}_{n}^{2}}\right) \mathbf{\Phi}_{n}^{2}+\sum_{n, m=J^{*}+1}^{J+1} \mathbf{\Phi}_{n} \mathbf{\Phi}_{m} \sum_{i=J^{*}}^{(n-1) \wedge(m-1)}\left(v_{i}^{2}-w_{i}^{2}\right)^{-1} . \end{array}$

This provides $a_{n, m}$ for $n, m=J^*+1, \ldots, J+1$ . The posterior mean is obtained by solving the posterior maximum likelihood functions for $\Phi_j, j \geq J^*+1$ . They are given by

$\begin{array}{l} \frac{\partial \log u\left(\Phi \mid D_{J}^{*}\right)}{\partial \Phi_{j}}=\frac{\phi_{j}}{s_{j}^{2}}+\frac{1}{\sigma_{j}^{2}} \sum_{i=0}^{J-j} \log \frac{P_{i, j}}{P_{i, j-1}} \\ \quad+\sum_{i=J-j+1}^{J-J^{*}} \frac{\frac{i \tau^{2}+\tau_{J}^{2}}{2}+\log \frac{I_{i, J-i}}{P_{i, J-i}}-\sum_{m=J+1}^{J+1} \Phi_{m} a_{j, m}^{2}-w_{J-i}^{2}}{=}=. \end{array} \tag{A.2}$

Henceforth, this implies

$\left(c_{0}, \ldots, c_{J+1}\right)^{\prime}=\sum\left(D_{J}^{*}\right)^{-1}\left(\Phi_{0}, \ldots, \Phi_{J+1}\right)^{\prime}$

from which the claim follows.

Proof of Corollary 2.5

The corollary follows from Theorem 2.3 and Corollary 2.4.

Proof of Proposition 3.1

From Theorem 2.1 we obtain

$\begin{aligned} E & {\left[P_{i, J+1} \mid D_{J}^{*}\right]=E\left[E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right] \mid D_{J}^{*}\right]=P_{i, J-i}^{1-\beta_{J-i}} I_{i, J-i}^{\beta_{J-i}} } \\ & E\left[\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1}\left(\Phi_{l}+\sigma_{l}^{2} / 2\right)+\beta_{J-i} \frac{i \tau^{2}+\tau_{J}^{2}}{2}\right\} \mid D_{J}^{*}\right] \end{aligned}$

$\begin{aligned} = & P_{i, j-i}^{1-\beta-\beta_{-i}} I_{i, J-i}^{\beta_{J-i}} \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=-i-i+1}^{J+1} \frac{\sigma_{l}^{2}}{2}+\beta_{J-i} \frac{i \tau^{2}+\tau_{J}^{2}}{2}\right\} \\ & \times E\left[\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \Phi_{l}\right\} \mid D_{J}^{*}\right] . \end{aligned}$

From Theorem 2.3 we know that, given $D_I^*, \Phi=$ $\left(\Phi_0, \ldots, \Phi_{J+1}\right)$ has a posterior multivariate Gaussian distribution with posterior mean $\left(\phi_0^{\text {post }}, \ldots, \phi_{J+1}^{\text {post }}\right)$ and posterior covariance matrix $\Sigma\left(D_J^*\right)$ . Henceforth, the posterior distribution of $\sum_{j=J-i+1}^{J+1} \Phi_j$ is Gaussian with mean $\sum_{j=J-i+1}^{J+1} \phi_j^{\text {post }}$ and variance $e_{J-i+1}^{\prime} \Sigma\left(D_J^*\right) e_{J-i+1}$ . This proves the proposition.

Proof of Theorem 3.2

We obtain with the tower property of conditional expectations

$\begin{array}{l} \operatorname{Cov}\left(P_{i, J+1}, P_{k, J+1} \mid D_{J}^{*}\right)= \\ E\left[\operatorname{Cov}\left(P_{i, J+1}, P_{k, J+1} \mid D_{J}^{*}, \Theta\right) \mid D_{J}^{*}\right] \\ \quad+\operatorname{Cov}\left(E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right], E\left[P_{k, J+1} \mid D_{J}^{*}, \Theta\right] \mid D_{J}^{*}\right) \end{array} \tag{A.3}$

This is the usual decomposition into average process (co-)variance and average parameter error. The first term in (A.3) is equal to 0 for i ≠ k, because accident years i are independent, conditionally given Θ. Henceforth there remains the case i = k. Using Theorems 2.1 and 2.3 we obtain

$\begin{aligned} E & {\left[\operatorname{Var}\left(P_{i, J+1} \mid D_{J}^{*}, \Theta\right) \mid D_{J}^{*}\right] } \\ = & \left.E\left[E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right]^{2} \mid D_{J}^{*}\right] \mid \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) \\ = & P_{i, J-i}^{2(1-\beta-i)} I_{i, J-i}^{2 \beta-i} \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}+\beta_{J-i}\left(i \tau^{2}+\tau_{J}^{2}\right)\right\} \\ & \times E\left[\exp \left\{2\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \Phi_{l}\right\} \mid D_{J}^{*}\right] \\ & \left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=-i-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) . \end{aligned}$

From Theorem 2.3 we know that, given $D_I^* \Phi=$ $\left(\Phi_0, \ldots, \Phi_{J+1}\right)$ has a posterior multivariate Gaussian distribution with posterior mean $\left(\phi_0^{\text {post }}, \ldots, \phi_{J+1}^{\text {post }}\right)$ and posterior covariance matrix $\Sigma\left(D_J^*\right)$ . Henceforth, the posterior distribution of $\sum_{j=J-i+1}^{J+1} \Phi_j$ is Gaussian with mean $\sum_{j=J-i+1}^{J+1} \phi_j^{\text {post }}$ and variance $\mathbf{e}_{J-i+1}^{\prime} \Sigma\left(D_J^*\right) \mathbf{e}_{J-i+1}$ . This implies for the first term (A.3)

$\begin{aligned} E\left[\operatorname{Var}\left(P_{i, J+1} \mid D_{J}^{*}, \Theta\right) \mid D_{J}^{*}\right] & =E\left[P_{i, J+1} \mid D_{J}^{*}\right]^{2} \\ \times & \exp \left\{\left(1-\beta_{J-i}\right)^{2} e_{J-i+1}^{\prime} \sum\left(D_{J}^{*}\right) e_{J-i+1}\right\} \\ & \left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) . \end{aligned}$

Finally, we consider the last term in (A.3). Applying Theorems 2.1 and 2.3, we obtain

$\begin{array}{l} \operatorname{Cov}\left(E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right], E\left[P_{k, J+1} \mid D_{J}^{*}, \Theta\right] \mid D_{J}^{*}\right) \\ = P_{i, J-i}^{1-\beta-\beta_{-i}} I_{i, J-i}^{\beta,-i} \exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \frac{\sigma_{l}^{2}}{2}+\beta_{J-i} \frac{i \tau^{2}+\tau_{J}^{2}}{2}\right\} \\ \times P_{k, J-k}^{1-\beta,-k} I_{k, J-k}^{\beta /-k} \exp \left\{\left(1-\beta_{J-k}\right) \sum_{l=J-k+1}^{J+1} \frac{\sigma_{l}^{2}}{2}+\beta_{J-k} \frac{k \tau^{2}+\tau_{J}^{2}}{2}\right\} \\ \times \operatorname{Cov}\left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \Phi_{l}\right\},\right. \\ \left.\exp \left\{\left(1-\beta_{J-k}\right) \sum_{l=J-k+1}^{J+1} \Phi_{l}\right\} \mid D_{J}^{*}\right) . \end{array}$

Henceforth, we need to calculate this last covariance term. Due to Theorem 2.3 the joint distribution of the exponents is a multivariate Gaussian distribution with covariance This implies
$\begin{array}{l} \operatorname{Cov}\left(E\left[P_{i, J+1} \mid D_{J}^{*}, \Theta\right], E\left[P_{k, J+1} \mid D_{J}^{*}, \Theta\right] \mid D_{J}^{*}\right) \\ \quad=E\left[P_{i, J+1} \mid D_{J}^{*}\right] E\left[P_{k, J+1} \mid D_{J}^{*}\right] \\ \quad\left(\exp \left\{\left(1-\beta_{J-i}\right)\left(1-\beta_{J-k}\right) e_{J-i+1}^{\prime} \sum\left(D_{J}^{*}\right) e_{J-k+1}\right\}-1\right), \end{array}$

which is the well-known covariance formula for log-normal distributions. Collecting the terms for i ≠ k gives the off-diagonal terms. For i = k we obtain the terms

$\begin{array}{l} E\left[P_{i, J+1} \mid D_{J}^{*}\right]^{2} \exp \left\{\left(1-\beta_{J-i}\right)^{2} e_{J-i+1}^{\prime} \sum_{J}^{*}\left(D_{J}^{*}\right) e_{J-i+1}\right\} \\ \left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_{l}^{2}\right\}-1\right) \end{array}$

$\begin{aligned} & E\left[P_{i, J+1} \mid D_J^*\right]^2 \exp \left\{\left(1-\beta_{J-i}\right)^2 e_{J-i+1}^{\prime} \sum\left(D_J^*\right) e_{J-i+1}\right\} \\ & \quad\left(\exp \left\{\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_l^2\right\}-1\right) \\ &+E\left[P_{i, J+1} \mid D_J\right]^2\left(\exp \left\{\left(1-\beta_{J-i}\right)^2 e_{J-i+1}^{\prime} \sum\left(D_J^*\right) e_{J-i+1}\right\}-1\right) \\ &= E\left[P_{i, J+1} \mid D_J^*\right]^2\left(\operatorname { e x p } \left\{\left(1-\beta_{J-i}\right)^2 e_{J-i+1}^{\prime} \sum\left(D_J^*\right) e_{J-i+1}\right.\right. \\ &\left.\left.+\left(1-\beta_{J-i}\right) \sum_{l=J-i+1}^{J+1} \sigma_l^2\right\}-1\right) . \end{aligned}$

This completes the proof.