1. Introduction
It is well established that limited fluctuation or “square root” credibility has limitations. Since it is designed to produce stable estimates rather than best estimates, it does not provide the most accurate rates. Further, since any combination of an acceptable fluctuation size and a probability of a chance violation of that fluctuation is a priori no better than any other, it is challenging[1] to show that any particular full credibility standard is better than any other. Lastly, the square root rule relies on an assumption that the statistic receiving the complement of credibility is stable. When the complement of credibility is, say, three years of 15% trend, that assumption is clearly violated. So there is a strong need[2] for best-estimate credibility.
Some time ago (1967) Hans Bühlmann developed a formula[3] for the best estimate credibility of a single risk or a single class when the complement of credibility is assigned to the large group that the risk or class is part of. His P/(P + K) formula[4] is well known and represents a truly optimal (in the sense of making the best predictions) credibility formula. But a formula is also needed for the credibility of the overall rate change for a product or line of business. It is quite common in actuarial work to develop a rate indication for such a group, realize that supplemental data is needed, and credibility weight the overall indicated change with something such as the inflationary trend since the last rate change.[5] Considering that the overall rate change affects every rate for every class and every risk, this author believes that the credibility of the overall rate indication deserves as much attention as the credibility of the class data within it.
A solid theoretical background has been laid for the credibility of this overall rate indication. Credibility is by nature a process that is designed to update an estimate of loss costs. A paper by Jones and Gerber (1975) provides formulas for the weights in updating formulas (to be discussed later) in terms of the covariances of the historical data points.[6] This formula, in fact, provides the optimum linear estimate of future costs given all the prior data, not just the data used in the current rate update.
Nevertheless, knowing the mathematical form of the credibility is not the same thing as being able to compute the credibility. As will be shown, standard credibility formulas derived from the Gerber-Jones approach use values for the Brownian motion variance in year-to-year trend, plus values for the “observation error” variances between observed data points and the true expected costs that underlie them.[7] To compute the credibility, it is necessary to estimate those variance parameters. This paper provides techniques designed to do just that.
2. The theory—Key credibility formulas for the overall rate indication
In this section the key theoretical results from the Jones and Gerber (1975) paper are presented. This should provide the practitioner a summary of the key formulas that create best-estimate credibility. Likely none of the material is new.
2.1. The general Gerber-Jones formulas
The goal is to apply the Gerber-Jones formulas to a realistic model (ultimately, geometric Brownian motion for trend, and observation error with a constant coefficient of variation) of the relationship between historical data and the unknown future loss cost. So, to facilitate the reader’s understanding, the key Gerber-Jones formulas are shown below.
The first statement that must be made is that the Gerber-Jones formula, and, unless stated otherwise, all other formulas, assume that any necessary trend and current level adjustments have already been made to the data. For example, although the prior data used in a credibility formula involves trending and current level adjustments, those adjustments are assumed to have been done[8] in the background, so all that is involved is determining the optimum credibility weights for the previous years.
With that background, a credibility formula[9] and data pattern are of the updating type[10] through the n + 1st projection (i.e., the optimum[11] estimate of future loss costs is a credibility-weighted average Pn+1 = ZnSn + (1 − Zn)Pn of the previous estimate of loss costs Pn and the new data Sn) if there is a constant μ and sequences V1, V2, . . . , Vn and W1, W2, . . . , Wn such that
E[S_i] = \mu \text{ for each of the } S_i

\mathrm{Cov}[S_i, S_j] = V_i + W_i \text{ for each case where } i = j, \text{ and}

\mathrm{Cov}[S_i, S_j] = W_i \text{ if } i < j
Further, when the credibility formula and data pattern are of that updating type, then the optimum credibilities are
Z_i = \frac{W_i - W_{i-1} + Z_{i-1}V_{i-1}}{W_i - W_{i-1} + Z_{i-1}V_{i-1} + V_i} \tag{2.4}
and
Z_1 = \frac{W_1}{W_1 + V_1}
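To make the recursion concrete, the short sketch below (not from the paper; the function name and the sample δ2 and σ2 values are illustrative) computes Z1, . . . , Zn from given V and W sequences using formula (2.4) and the starting value above.

```python
def gerber_jones_credibilities(V, W):
    """Credibilities Z_1..Z_n for an updating-type formula, given the
    observation-error variances V_i and the covariance terms W_i.
    Illustrative sketch; V and W are plain Python lists (0-indexed)."""
    assert len(V) == len(W) and len(V) >= 1
    Z = [W[0] / (W[0] + V[0])]                    # Z_1 = W_1 / (W_1 + V_1)
    for i in range(1, len(V)):
        num = W[i] - W[i - 1] + Z[-1] * V[i - 1]  # numerator of formula (2.4)
        Z.append(num / (num + V[i]))
    return Z

# Example: the linear Brownian motion model of section 2.2,
# with hypothetical values delta^2 = 0.0009 and sigma^2 = 0.0049
delta2, sigma2, n = 0.0009, 0.0049, 10
V = [sigma2] * n                            # V_i = sigma^2
W = [(i + 1) * delta2 for i in range(n)]    # W_i = i * delta^2
print(gerber_jones_credibilities(V, W))
```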
2.2. The linear updating-type formulas
As a first step towards understanding the notation, it is helpful to introduce the credibility under a standard linear Brownian motion with a drift (T), variance parameter “δ2” for the Brownian motion, and a constant error variance “σ2” between each trended data point Si = Si* + (n + 1 − i)T and the trended underlying expected cost at period i, or Li = Li* + (n + 1 − i)T. Logically, the actual deviations from the expected loss (Si − Li = Ei per this linear model) could be expected to be independent from both each other and the Li’s. Of note, this treatment is not new, but is presented so that the reader may understand the process.
Then, if we take “μ” to be the true mean expected loss[12] at time[13] n + 1, so that μ = E[Ln+1] = E[Si] for each i, the trended underlying prior expected losses follow a Brownian motion. Further, since Cov[A + αB, C + βB] = αβVar[B] when A, B, and C are mutually independent,
\mathrm{Cov}[S_i, S_j] = \mathrm{Cov}[L_i + E_i,\ L_i + (L_j - L_i) + E_j] = \mathrm{Var}[L_i] = i\delta^2 \quad (i < j)
(noting that Lj is further along in the Brownian motion than Li, the random motion between Li and Lj is independent of Li). Further,
\mathrm{Cov}[S_i, S_i] = i\delta^2 + \sigma^2
So, in the Gerber-Jones formula
V_i = \sigma^2; \text{ and}

W_i = i\delta^2
Hence, per formula (2.4),
Z_i = \frac{\delta^2 + Z_{i-1}\sigma^2}{\delta^2 + Z_{i-1}\sigma^2 + \sigma^2} \tag{2.10}
where each Zi is the optimum credibility to use when combining the new data (Si) with the prior estimate (Pi) to produce the optimum estimate of the underlying loss costs, Pi+1. Further, the resulting combination of all the prior data points that Pn+1 represents is the optimum estimate of Ln+1 given the available data. Jones and Gerber (1975) also show that the successive Zi's converge to a limit (which could conceivably be used as a proxy for the credibility Zi when i is large). In this scenario, setting Zi = Zi−1 = Z in formula (2.10) and solving for Z gives
Z = \frac{\delta^2\left(\sqrt{1 + 4\sigma^2/\delta^2} - 1\right)}{2\sigma^2}
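For completeness, the intermediate algebra is short: setting Zi = Zi−1 = Z in formula (2.10) and clearing the denominator gives a quadratic in Z whose positive root is the expression above,

Z\left(\delta^2 + Z\sigma^2 + \sigma^2\right) = \delta^2 + Z\sigma^2 \;\Longrightarrow\; \sigma^2 Z^2 + \delta^2 Z - \delta^2 = 0 \;\Longrightarrow\; Z = \frac{-\delta^2 + \sqrt{\delta^4 + 4\sigma^2\delta^2}}{2\sigma^2} = \frac{\delta^2\left(\sqrt{1 + 4\sigma^2/\delta^2} - 1\right)}{2\sigma^2}.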
2.3. The geometric Brownian motion formulas
The linear model has a key weakness—it assumes that the growth in losses is linear. In fact, it is well-established that most insurance lines of business suffer inflation that causes loss costs to grow exponentially rather than linearly. That reality requires an adjustment to the Brownian motion model. Instead of having
E[Li+1 − Li] = 0 for each i, we should expect zero growth in the ratios (i.e., E[Li+1/Li] = 1). Instead of expecting the differences Li+1 − Li to have identical and independent normal distributions, one would expect the ratios Li+1/Li to have independent, identical lognormal distributions, with the aforementioned mean of unity and some common variance of δ2. So if one begins with unadjusted data points, each denoted as Si*, the points used to estimate μ = E[Ln+1] are the trended values Si = Si* × (1 + T)^(n+1−i).

Lastly, a model for the differences between the observed Si's and the true expected costs, the Li's, must be included. In this model, the ratios Si/Li are assumed to have independent, identical lognormal distributions with a mean of unity and a constant variance of σ2. These distributions are also expected to be independent from those of the year-to-year drifts (the Li+1/Li's). The common observation variance of the trended values is consistent with roughly equal numbers of claims from year to year, with severity inflation affecting the loss sizes. It would be less appropriate for a growing book of business that encompasses more and more expected claims from year to year, with consequent reductions in the coefficient of variation of the process variance.

In any event, the covariance structure, using the identity[14] Cov[AB, CB] = E[A] × E[C] × Var[B] (for mutually independent A, B, and C), is[15]
\mathrm{Cov}[S_i, S_j] = \mathrm{Cov}\left[L_i \times E_i,\ L_i \times (L_j/L_i) \times E_j\right] = E[E_i] \times E[E_j] \times E[L_j/L_{j-1}] \times \cdots \times E[L_{i+1}/L_i] \times \mathrm{Var}[L_i] = 1 \times 1 \times \cdots \times 1 \times \mathrm{Var}[L_i] = (\delta^2 + 1)^i - 1 \quad (i < j)
Further, by the identity Var[AB] = Var[A]Var[B] + E[A]2Var[B] + E[B]2Var[A] (for independent A and B),
\mathrm{Cov}[S_i, S_i] = \sigma^2(\delta^2 + 1)^i + (\delta^2 + 1)^i - 1
So, the key values for the Gerber-Jones formula in this case are
W_i = (\delta^2 + 1)^i - 1; \qquad V_i = \sigma^2(\delta^2 + 1)^i

and so, again per formula (2.4),

Z_i = \frac{\delta^2 + Z_{i-1}\sigma^2}{\delta^2 + \delta^2\sigma^2 + Z_{i-1}\sigma^2 + \sigma^2}
A comparison to equation (2.10) shows that this is identical to the formula for the linear case, except for the additional δ2σ2 term in the denominator. But, one should consider that when at least one of the values δ2 and σ2 is very small, the combination term δ2σ2 should be a small part of the denominator. Thus, one might say that, for the case of geometric Brownian motion,
Z_i \cong \frac{\delta^2 + Z_{i-1}\sigma^2}{\delta^2 + Z_{i-1}\sigma^2 + \sigma^2}
Further, the steady-state credibility may be approximated as
Z \cong \frac{\delta^2\left(\sqrt{1 + 4\sigma^2/\delta^2} - 1\right)}{2\sigma^2}
As a relevant side note, the summands involved in equations (2.13) and (2.14) would inflate uniformly as the losses are projected ahead more than one year, to some n + Δt instead of to time n + 1, and the credibility equation would remain unchanged.[16]
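As a quick numerical illustration (a sketch, not taken from the paper; the parameter values δ2 = 0.0009 and σ2 = 0.0049 and the starting value Z0 = 0 are assumptions), one can iterate the exact geometric recursion and the approximation side by side and confirm that the δ2σ2 term has little effect.

```python
def geometric_vs_linear_credibility(delta2, sigma2, n=15):
    """Iterate the exact geometric-model recursion (with the delta^2*sigma^2
    denominator term) and the simpler approximation side by side.
    Illustrative sketch; Z_0 is taken as 0 so that the first step
    reproduces Z_1 = W_1 / (W_1 + V_1)."""
    z_exact, z_approx = 0.0, 0.0
    for _ in range(n):
        z_exact = (delta2 + z_exact * sigma2) / (
            delta2 + delta2 * sigma2 + z_exact * sigma2 + sigma2)
        z_approx = (delta2 + z_approx * sigma2) / (
            delta2 + z_approx * sigma2 + sigma2)
    return z_exact, z_approx

# Both values converge to nearly the same steady-state credibility
print(geometric_vs_linear_credibility(delta2=0.0009, sigma2=0.0049))
```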
3. Multi-year formulas and best estimate credibility for the overall rate indication
The approach outlined earlier involves updating a rate with a single new year of data. But it is very common to see rate indications that update a rate with, say, the weighted average of the data from the last five years. The role of this multi-year data in a best estimate credibility formula merits discussion.
3.1. Reasons not to reuse older years
Updating formulas that use multiple years reuse data from prior estimates, so the reuse of data should be evaluated. The first point to be made is that using multiple years is perfectly appropriate when limited fluctuation credibility is involved. Limited fluctuation credibility deals solely with the extent to which the body of data receiving credibility can be relied on not to create unwarranted increases or decreases of some specified size. It does not purport to create a best estimate of future costs. Mahler (1986) has noted, though, that this method often produces future loss estimates that are comparable to those of best estimate credibility.
To state it simply, re-using prior years is incompatible with the covariance structure that the Gerber-Jones updating formula requires. For example, assume an estimate has been continually updated over 14 years from P1 and S1 to P15 with rolling five-year averages[17] Q1, . . . , Q14 of the data points S1, . . . , S14. Logically, the next step is to produce the estimate P16 using Q15. Note, though, that the covariance between Q15 and Q14 is fairly high, since they have the points S11, S12, S13, and S14 in common. However, Q15 and Q1 have no common components.[18] Generally,[19] then, the covariance between two of the Q's depends on how many data points they share, whereas the updating-type structure of section 2.1 requires the covariance of each Qi with every later average to be the same value Wi. Therefore, the Gerber-Jones formula cannot be used when multiple years are combined,[20] and the practice of combining multiple years of data in this context is suboptimal.
That conclusion has a very relevant corollary. If the exposures most useful for limited fluctuation credibility stem from five or even ten years, but best estimate credibility is only based on the most recent year, the resulting credibilities should by nature be different. Therefore, there are circumstances where limited fluctuation credibility is not a good substitute for best estimate credibility.
3.2. Correcting the prior estimate for changes in ultimate loss estimates
There is, however, one respect in which the use of multiple years could improve the estimate. The existing rate is based on the data available earlier, when the various years' losses were less mature than they are at the time of the updated rate indication. So, it makes sense to update the existing rate for the additional development before using it in the credibility formula. Of course, the existing rate is the product of repeated credibility weightings of many years of data. Further, it is not just an average of many years of loss ratios or pure premiums; rather, it is an average of either trended loss ratios brought to the current rate level or trended pure premiums. So, some calculations must be done to include this additional loss development in the prior rate that is used as the complement of credibility. Because of the requirement to use current-level data, the correction process for loss ratio ratemaking is slightly more complex than that for pure premium ratemaking. Therefore, Table 1 shows how the calculations needed to update a loss ratio at present rates for loss development might flow.
The references to “Prior” and “Last Prior” refer to the data used in computing the loss ratio estimate that was used in the last rate change. The “First Assigned” values refer to what was used the first time the specific year of data was used. Also, note that although the loss ratios of many years are likely embedded in the prior loss ratio, only the last five were revised. That is because more mature years see fewer year-to-year revisions in ultimate losses, and contribute a diminishing portion after credibility (see column 7).
It is also worth mentioning that in this example the current level factors could be updated for the next rate review by simply multiplying column (4) by unity plus item “C”. Similar adjustments could be made for the “Credibility in Last Prior” and “Total Trend Factor in Last Prior” columns.
Of course, this example mirrors the calculations in the theoretical literature—the data is assumed to be collected at midnight of December 31, 2011, then used to make rates that are effective at 12:01 a.m. of January 1, 2012. However, the corrections needed to reflect practical realities would appear to be straightforward.
3.3. Updated ultimate losses and updating-type credibility
It could be expected that the process of updating prior year ultimate losses could distort the optimum credibility. In lines such as excess casualty reinsurance, the ultimate loss estimates Sn, Sn−1, etc., for the most recent years could have a very high observation error, while years five or so back could be much closer estimates of the true expected losses (the Li's) within their respective years. On that basis, the true optimum credibility could be expected to be larger for some of the “older” years than for the most recent year. However, that would clearly not create an “update.”

Some perspective can be provided about this situation. First, when prior year estimates are not corrected, the formulas of section 2 do provide the optimum credibility. Further, updating the prior year ultimate losses can only be expected to improve the accuracy of the resulting loss prediction. So, this approach can be expected to produce a high quality estimate of future costs, up to any distortion due to lengthy loss development.
If loss development uncertainty is expected to significantly distort the credibility, it may well be preferable to simply start from scratch each year with the ultimate loss estimates for, say, the last twenty years. One may then compute estimates of the process variance in each year, estimates of the loss development error variance in each year, and the Brownian motion-type variance parameter.[21] It is not difficult to see that, under the linear model (possibly the geometric as well), an updating formula can be derived for the assignment of weights to the various years. It should be clear that the resulting credibility weights may differ greatly between years. However, it does not involve the sort of updating of the prior rate that is part of the typical actuarial application. Rather it involves simply computing a rate from scratch.[22] Since the focus of this paper is on updating an existing rate with new data, this situation will not be analyzed further in this paper.
4. Estimating the parameters: Z, K, B, δ2 and σ2
This section will give the reader some tools for creating estimates of the key variances, and thus help create better loss cost projections. It is not intended to be a survey of the subject. Rather, it is intended to give the practitioner the tools needed to implement best estimate ratemaking. The interested reader may review some of the ideas in De Vylder (1981) and Hayne (1985) to get two other perspectives on this subject.
First, a few quick notes are in order:
- Note 1. In many situations, it is not necessary to estimate both δ2 and σ2. Key formulas can be converted to a function of K = δ2/σ2, so K is all one needs to estimate.
- Note 2. When estimating δ2 and σ2 for geometric Brownian motion, note that they are functions of δ′2 and σ′2 from the logarithmic transform to a linear Brownian motion, exp(δ′2) − 1 = δ2, and exp(σ′2) − 1 = σ2. So, once one determines how to estimate the constants of variance (or even just their ratio) in a linear Brownian motion, one may estimate the credibility for the geometric Brownian motion.
- Note 3. The observation errors (with variance σ2) consist logically of a combination of the sample variance (i.e., the limitations of the law of large numbers due to the high skew in insurance statistics and the inability of “small” claim samples to fully estimate the true expected losses each year) and the loss development uncertainty between the early data we base our projections on and the final actual claims costs in each year. Further, the sample variance and development variance are independent and so may be added to determine σ2.
- Note 4. (Subtraction of Two Estimated Quantities) If we subtract one highly uncertain “large” number from another “large” number, and the difference is “small,” the result has a “large” variance most of the time. When estimating a small number, that “large” variance typically overwhelms the true “small” value one seeks to estimate.
- Note 5. (Common Additive Error in all the Data) If all the historical data points are affected equally and simultaneously by a common error that is independent of all the other error terms (for example, all the data is biased by the addition of a single, uniform, unknown amount “ε” from some distribution with a zero mean), then the optimal solution may be estimated by disregarding this error. Logically, this may be converted algebraically to a situation where one is estimating a future value that contains ε, with ε removed from all the historical data. Since the variance of ε is independent of all aspects of variance in the historical data, the ε component of the costs being predicted is not susceptible to estimation using the historical data. Hence, it may be disregarded in optimizing the estimate of future costs. A similar result holds when ε is a constant error multiplier with a mean of one within the data, except that one must consider that the mean of the inverse of ε may not be unity.
With those concerns in mind, a few methods for estimating the key parameters follow.
4.1. Method 1: The credibility that would have worked in the past.
This approach actually involves no estimation of δ2 or σ2; rather, it estimates Z directly. Since estimating Z directly removes the barriers to implementing best estimate credibility for the overall rate indication, it merits discussion (even though it does not involve δ2 and σ2). The basic methodology involves assuming some credibility value Z, then using all the data but the last year to estimate the last year. Assume that one has, say, ten years of on-level, appropriately trended[23] loss ratios. Then, one could note that the fifth year's value could be estimated by first applying some unknown credibility factor Z to the fourth[24] year's data, Z(1 − Z) to the third year's data, Z(1 − Z)2 to the second year's data, etc., then dividing by the sum of the credibilities, 1 − (1 − Z)4, to correct for the off-balance. In effect, a single credibility value is assumed to have been proper for all four updates.
Once that equation is established, one could vary Z in order to find which Z minimizes the squared difference between the fifth year’s data and the credibility-weighted average. Most modern spreadsheet programs contain solution-generating capabilities that make it straightforward to find such a solution. Then, one may also construct similar equations to solve for a common credibility of Z that use the first five values to predict the sixth, the first six values to predict the seventh, etc. The last step involves replacing the individual solutions of Z that each minimize the squared error of a single predictive step with a solution of a single Z that minimizes the sum of all the squared errors of all the predictive steps simultaneously.
The resulting Z is arguably the best estimator of the credibility in the data, at least as long as a single credibility is appropriate for all the years.
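The search itself is simple enough to sketch in a few lines of Python. This is illustrative only: a plain grid search stands in for the spreadsheet solver, the function and parameter names are invented for this sketch, and the sample loss ratios are hypothetical.

```python
def total_squared_error(z, loss_ratios, first_target=5):
    """Sum of squared one-step prediction errors for a single trial
    credibility z: each target year is predicted from all earlier years
    with weights z, z(1-z), z(1-z)^2, ... (newest year first), normalized
    by 1 - (1-z)^k to correct the off-balance. loss_ratios are assumed
    to be on-level, trended values."""
    sse = 0.0
    for k in range(first_target - 1, len(loss_ratios)):   # 0-indexed targets
        history = loss_ratios[:k]
        weights = [z * (1 - z) ** j for j in range(len(history))]
        weighted = sum(w * x for w, x in zip(weights, reversed(history)))
        prediction = weighted / (1 - (1 - z) ** len(history))
        sse += (loss_ratios[k] - prediction) ** 2
    return sse

def best_z(loss_ratios, grid_points=999):
    """Grid search over z in (0, 1); a spreadsheet solver plays this role
    in the Table 2 example."""
    candidates = [(i + 1) / (grid_points + 1) for i in range(grid_points)]
    return min(candidates, key=lambda z: total_squared_error(z, loss_ratios))

# Hypothetical ten years of on-level, trended loss ratios
sample = [0.65, 0.68, 0.63, 0.70, 0.66, 0.72, 0.69, 0.74, 0.71, 0.73]
print(best_z(sample))
```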
Table 2 illustrates how this process would work with ten years of essentially random sample data. The shaded boxes show the inputs and outputs to the solution process (note that the “Target” box pulls up the “Target” value computed at the bottom of the spreadsheet).
This method has good utility as long as δ2 and σ2 are stable over time and the data is not prone to very rare large losses.[25] It is reasonable to expect δ2 to be stable as long as the average trend factor is stable, but often that does not occur. Further, it would be reasonable to expect σ2 to be fairly stable as long as the premium volume in the line, adjusted for trend, is stable.
What must be said. This approach has nothing to do with the formulas stated earlier. However, it does address the key question in this paper, determining the optimum credibility. Further, since Z has a formula in δ2 and σ2, it may also be used to determine a second variance constant once a first variance constant is known. Then, one might possibly revise the estimate of σ2 (derived from Z and δ2) to better account for process variance due to large losses, and consequently revise the estimate of Z.
4.2. Method 2: Fitting K and B across a large number of similar datasets
In this case, one might assume that the ratemaker is computing rates for a single line of business in 50 U.S. states, or some other situation where there is a fairly large number of segments, and all the segments have approximately the same trend and observation-error-variance-per-unit-of-exposure characteristics. One would also have to assume that the complement of credibility is still supposed to be assigned to the existing rate plus trend, not some amalgam of all the segments. One must also assume that the old premium/exposure and loss data used in pricing the last, say, twelve years of rates are available for each of the segments. And lastly, it would help if the second-to-last data point, and possibly the last data point, for each segment (s) is developed enough that it is as close an estimate of the expected costs as is reasonably possible.

Just like the estimation of Z in the previous subsection, K and B may be estimated from the data by solving for the values that would produce the best estimates of the most recent costs in the various segments. In the previous subsection, the total squared differences between the credibility-weighted averages of various sets of years and the future years they project were minimized. In this case, for each segment “s,” one must construct the credibility-weighted average Pn+1,s of the last n (= 10, or 5, or whatever is most feasible) years of data (the Si,s's) in order to estimate each Ln+1,s. In doing so, the credibilities should be computed using formula (A.7):

Z_{i,s} \cong \frac{U_{i,s} + Z_{i-1,s}(K + BU_{i,s})}{U_{i,s} + (1 + Z_{i-1,s})(K + BU_{i,s})} \tag{A.7}
K and B should then be modified via the solution routine so that the squared errors that the resulting Pn+1,s's make in estimating the Ln+1,s's are minimized. Crucially, K and B are not to vary from segment to segment. Rather, a single pair of K and B that minimizes the sum of all the squared prediction errors is to be found via the solution algorithm. So the weight assigned to the year n − i data for segment s is

M_{n-i,s} = (1 - Z_{n,s})(1 - Z_{n-1,s})\cdots(1 - Z_{n-i+1,s})\,Z_{n-i,s}
The resulting predictions[26] of the Ln+1,s's are then the various values of

P_{n+1,s} = \sum_{i=1}^{n} M_{i,s}S_{i,s} + \prod_{i=1}^{n}\left(1 - Z_{i,s}\right)S_{0,s}

(where each S0,s represents the rate or rating information in effect just before the experience period). As before, the sum across all the s's of the squared estimating errors, Σs(Pn+1,s − Ln+1,s)2, or perhaps a premium- or exposure-weighted average, ΣsWn,s(Pn+1,s − Ln+1,s)2, could be computed in the spreadsheet. The resulting value could be called the “Target,” and the solution routine or feature could be used to vary K and B until the lowest value of the “Target” is found.
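A minimal sketch of this fitting procedure follows, assuming hypothetical segment inputs (prior rate, data years, exposures, target value, and premium weight), a plain grid search in place of a spreadsheet solver, and a starting value of zero for the credibility recursion; none of these choices come from the paper.

```python
import itertools

def predict_segment(K, B, prior_rate, data, exposures):
    """Prediction for one segment: chain the credibilities from formula
    (A.7) through the data years, starting from the prior rate.
    Z_0 = 0 is an assumption of this sketch."""
    estimate, z_prev = prior_rate, 0.0
    for s_i, u_i in zip(data, exposures):
        kb = K + B * u_i
        z = (u_i + z_prev * kb) / (u_i + (1.0 + z_prev) * kb)  # formula (A.7)
        estimate = z * s_i + (1.0 - z) * estimate              # updating step
        z_prev = z
    return estimate

def fit_K_B(segments, K_grid, B_grid):
    """Find the single (K, B) pair minimizing the premium-weighted sum of
    squared errors across all segments. Each segment is a dict with the
    hypothetical keys 'prior', 'data', 'exposures', 'target', 'weight'."""
    def target_value(K, B):
        return sum(seg['weight'] *
                   (predict_segment(K, B, seg['prior'], seg['data'],
                                    seg['exposures']) - seg['target']) ** 2
                   for seg in segments)
    return min(itertools.product(K_grid, B_grid),
               key=lambda kb: target_value(*kb))

# Tiny hypothetical example with two segments and five data years each
segments = [
    {'prior': 0.65, 'data': [0.66, 0.70, 0.64, 0.71, 0.69],
     'exposures': [18, 19, 20, 21, 22], 'target': 0.70, 'weight': 22},
    {'prior': 0.60, 'data': [0.58, 0.63, 0.61, 0.66, 0.64],
     'exposures': [45, 47, 50, 52, 55], 'target': 0.65, 'weight': 55},
]
K_grid = [x / 10 for x in range(1, 51)]     # 0.1 to 5.0 (hypothetical range)
B_grid = [x / 100 for x in range(0, 51)]    # 0.00 to 0.50 (hypothetical range)
print(fit_K_B(segments, K_grid, B_grid))
```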
A sample spreadsheet illustrating this approach with 12 data segments and common trend, process, and parameter variance constants, but different samples from those constants among the segments, is shown in Table 3. The expected loss ratios for each segment were simulated using geometric Brownian motions with the variance specified in Part 1. The actual loss ratios are also affected by the parameter variance and the process variance (a common factor, divided by the premium per the Law of Large Numbers) listed there. The actual values of K and B are on the far left of Part 1. Lastly, the K and B values that minimize the premium-weighted sum of squared errors in projecting the sixth year's simulated value (using the credibility weights[27] defined by K, B, and the premium data) are highlighted in gray.
Note that the loss ratios for year 1 were deemed to have projection errors similar to those of the rate prior to the experience period, so they were used for the S0,s's.

What must be said. In testing this method, it appears that it may require a substantial number of data points to reliably estimate K and B using this process. In particular, twelve classes do not appear to be sufficient for the test case above. However, the fact that K and B are combined as K + BU in the equation means that they act together to impact the credibility. The only difference is that the “B” term reacts to exposure or premium volume, whereas “K” does not. In this case, at a premium of about 20 the estimated value of K + BU is about equal to the true underlying value.
Next, to assess the actual quality of the estimation, the errors in estimating the true (unaffected by process or parameter variance) expected loss ratios for year 6 (as shown at the top of Part 7) were computed. As one may see, the difference between the prediction error using the estimated K and B and that using the actual K and B is negligible. This suggests that, as long as the sample size (the number of “s” values) is small and the differences in premiums, exposures, etc., are small, it may be more helpful to simply replace “K + BU” with “K” in the credibility formula.
4.3. Method 3: Estimating δ2 and σ2 from the historical data
This method involves using different linear combinations of squared differences between values. As such, it is oriented towards standard, linear, Brownian motion. However, note that the logs of values from a geometric Brownian motion form a linear Brownian motion. So, one may convert geometric Brownian motion data to linear data, estimate the values of δ2 and σ2 that work in the linear context, then convert those to comparable drift variance and process/parameter variance values. For example, the geometric Brownian motion variance parameter would be exp(δ2) − 1, where δ2 is the variance in the corresponding linear Brownian motion and the mean of the geometric Brownian motion steps is specified to be unity (no change in the multiplicative context).

So, the goal is to find functions of the Si's that provide insight into the values of δ2 and σ2. For example, the squared difference between the beginning and ending values, (Sn − S1)2, reflects two samples of parameter/process error at the two endpoints and n − 1 samples from the Brownian motion variance. So, if the two types of variance are similarly sized, the squared difference between the two endpoints should be dominated by a multiple of the Brownian motion variance δ2. Similarly, if one adds the squared differences between adjacent points,[28] Σ(Si+1 − Si)2, one would expect the result to be dominated by a multiple of the process variance σ2. Further, one might expect that more precise approximations might be made by using linear combinations of those two values. So, one might begin by computing the expected values of (Sn − S1)2 and Σ(Si+1 − Si)2.
First, note that, since the mean expected change in values from the Brownian motion (after trend correction) is zero, and the expected process risk is zero,

E\left[(S_n - S_1)^2\right] = \mathrm{Var}[S_n - S_1]
However, Sn − S1 may be expressed as a sum of independent variables, each with mean zero, as (Sn − Ln) + (Ln − L1) + (L1 − S1). So, it is composed of a process error, a Brownian motion of length n − 1, and the negative of a process error. Therefore,
E\left[(S_n - S_1)^2\right] = \mathrm{Var}[S_n - L_n] + \mathrm{Var}[L_n - L_1] + \mathrm{Var}[L_1 - S_1] = \sigma^2 + (n-1)\delta^2 + \sigma^2 = (n-1)\delta^2 + 2\sigma^2 \tag{4.5}
Similarly,
E\left[\sum_{i=1}^{n-1}(S_{i+1} - S_i)^2\right] = \sum_{i=1}^{n-1}\mathrm{Var}[L_{i+1} - L_i] + 2\sum_{i=2}^{n-1}\mathrm{Var}[S_i - L_i] + \mathrm{Var}[S_n - L_n] + \mathrm{Var}[S_1 - L_1] = (n-1)\delta^2 + 2(n-2)\sigma^2 + \sigma^2 + \sigma^2 = (n-1)\delta^2 + 2(n-1)\sigma^2 \tag{4.6}
Knowing those values, it is possible to construct estimators for δ2 and σ2. One may readily see that, by the linearity of expectations,
E\left[\frac{\sum_{i=1}^{n-1}(S_{i+1} - S_i)^2 - (S_n - S_1)^2}{2(n-2)}\right] = \sigma^2 \tag{4.7}
and
E\left[\frac{(n-1)(S_n - S_1)^2 - \sum_{i=1}^{n-1}(S_{i+1} - S_i)^2}{(n-1)(n-2)}\right] = \frac{(n-1)\left\{(n-1)\delta^2 + 2\sigma^2\right\} - \left\{(n-1)\delta^2 + 2(n-1)\sigma^2\right\}}{(n-1)(n-2)} = \delta^2 \tag{4.8}
So, by creatively using the differences between the first and last point, and the differences between adjacent points, one may estimate the values of δ2 and σ2.
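The two estimators are easy to sketch in code; the 15-year series below is hypothetical (it merely stands in for trended, on-level data such as that in Table 4).

```python
def estimate_sigma2_delta2(S):
    """Estimate sigma^2 and delta^2 from a trended, on-level series S_1..S_n
    using equations (4.7) and (4.8). Illustrative sketch; requires n >= 4."""
    n = len(S)
    adjacent = sum((S[i + 1] - S[i]) ** 2 for i in range(n - 1))
    endpoints = (S[-1] - S[0]) ** 2
    sigma2 = (adjacent - endpoints) / (2 * (n - 2))                   # (4.7)
    delta2 = ((n - 1) * endpoints - adjacent) / ((n - 1) * (n - 2))   # (4.8)
    return sigma2, delta2

# Hypothetical 15-year series of trended loss ratios
series = [0.62, 0.66, 0.59, 0.64, 0.70, 0.63, 0.68, 0.66,
          0.72, 0.65, 0.69, 0.74, 0.67, 0.71, 0.70]
sigma2, delta2 = estimate_sigma2_delta2(series)
print(sigma2, delta2)   # either estimate can come out negative in small
                        # samples, a symptom of the ill-conditioning
                        # discussed later in this subsection
```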
An example of the use of equations (4.7) and (4.8) is shown in Table 4. The observable data in column 2 was generated randomly over 15 years, using the actual values δ = 3% and σ = 7%. The values of δ2 and σ2 were then estimated from the data. As one may see, the estimates are fairly close, but they nonetheless significantly overestimate the credibility.
A note about trend: The theory underlying this paper assumes that the expected loss, a priori, is the same for all years. That generally requires that historical losses have been trended (and premiums adjusted to the current rate and exposure level) before the calculations commence. Of course, if the trend is computed using the same data as the calculations, the calculated value of δ2 may be suppressed. For example, if the random movement includes a large upward jump early in the period and another jump later because the value of δ2 is high, the trend analysis may incorrectly attribute the movement to a high trend rather than to a high Brownian motion variance. Of course, if the trend is clearly much larger than δ2, this may be less of an issue.
Further, as noted later, the problem of estimating δ2 and σ2 is relatively ill-conditioned.[29] So reducing the degrees of freedom of the approximation by estimating trend simultaneously, given a small number of data points, may not be reliable. However, one might be advised to use some related data, such as calendar year reported loss frequency and calendar year closed claim severity, to estimate the trend. On the other hand, if there are a large number of data points relative[30] to the volatility in the data, then the impact of the random observation error in the initial and ending points on the trend estimate should be minimal.
A third aspect of trend deserves mention as well. Without a correction, the random lognormal steps of a geometric Brownian motion whose log-increments are centered at zero would produce a mean above one at all points after it begins. In effect, the randomness of the distribution, combined with the skew of the lognormal, tends to generate its own trend. So the transformed (logarithmic) versions of the steps, rather than having a normal-type[31] distribution with a mean of zero, must have a normal distribution with a mean of −δ′2/2 (so that the corresponding lognormal steps have a mean of one). That means that external trend must often be corrected, especially trend computed by averaging several year-to-year growth rates. To complicate matters, δ2 is then unknown, so the value needed for the correction is unknown. However, some crude initial estimate of the value of δ2 may be used when estimating trend, and then, once the trend is estimated, the δ2 estimate may be refined, etc. The process may be continued iteratively until a consistent trend and δ2 are computed. Consider that if the trend estimate is produced by loglinear regression of data with similar geometric Brownian motion variance, the correction should already be subsumed into the trend. Further, if quality surrogate data is available for trending, that option deserves serious consideration.

What must be said. There are some special considerations that should help explain why the approximations are not more precise. First, it may be difficult to distinguish, say, whether a very high last point is due to a very high uptick in the Brownian motion because δ2 is large, or to a large process error because σ2 is high. So, the basic problem of approximating δ2 and σ2 may often be ill-conditioned. Second, it is important to review Note 4 at the beginning of this section. At its core, Note 4 says that the error variance in computing the quantities above could be as much as the sum of the variances of the two items being subtracted. While the error does not quite reach the sum of the variances (due to inter-correlation of the two quantities), one should still be extremely cautious if the difference (the estimate of δ2 or σ2) is much smaller than each of the values involved in the subtraction.
Nevertheless, even though the credibility determined using this method sometimes only has moderate precision, it is moderately close to the “best estimate” credibility. Therefore, it still has the potential to create more accurate estimates than the stability-centered classical credibility.
4.4. Method 4: Estimating σ2 structurally from loss data and δ2 by subtraction
Given the formulas in equations (4.5) and (4.6), it is clear that, once one of δ2 and σ2 is reliably estimated, the other may be estimated. It should also be clear that equation (4.5) carries relatively more information about δ2 than equation (4.6). So, if one has a quality estimate of σ2, the formula
\delta^2 \cong \frac{(S_n - S_1)^2 - 2\sigma^2}{n - 1} \tag{4.9}
may be used to estimate δ2.
Some estimate of σ2 is required to use that formula, though. One method for estimating σ2 involves what may be described as a structural analysis. Such a process involves decomposing the process/parameter risk into its components and then estimating each component separately.
The process risk is in some ways better reflected in historical credibility formulas (such as P/(P + K), or U/(U + K) in the notation of this paper), so it will be analyzed first. Thankfully, as long as there are enough claims in the data to reliably estimate the upper end of the severity distribution, one may use the collective risk equation to calculate the process variance (which may be labeled “α2”). Then,
\alpha^2 = E[\#\text{ claims}] \times \mathrm{Var}[\text{severity}] + \mathrm{Var}[\#\text{ claims}] \times E[\text{severity}]^2
or in the loss ratio or pure premium context,
\alpha^2 = \frac{E[\#\text{ claims}] \times \mathrm{Var}[\text{severity}] + \mathrm{Var}[\#\text{ claims}] \times E[\text{severity}]^2}{(\text{premium or exposures})^2}
So, as long as the proper data is available,[32] the process variance is readily estimable.
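A minimal sketch of that calculation, assuming a Poisson claim count and hypothetical severity moments (all inputs are illustrative, not from the paper):

```python
def process_variance_alpha2(expected_claim_count, claim_count_variance,
                            expected_severity, severity_variance, premium):
    """Process variance of the loss ratio via the collective risk equation:
    Var[aggregate loss] = E[N] * Var[X] + Var[N] * E[X]^2,
    then divided by premium^2 to move to loss ratio space."""
    aggregate_variance = (expected_claim_count * severity_variance
                          + claim_count_variance * expected_severity ** 2)
    return aggregate_variance / premium ** 2

# Hypothetical book: 400 expected claims (Poisson, so Var[N] = 400),
# mean severity 25,000 with a coefficient of variation of 3.
alpha2 = process_variance_alpha2(400, 400, 25_000, (3 * 25_000) ** 2,
                                 20_000_000)
print(alpha2)
```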
The other portion that must be estimated is the parameter variance, which will similarly be denoted “β2”. Note that any year-to-year variations in the trend are subsumed into δ2. So, in most cases the only parameter-type variance that need be considered is the uncertainty in loss development to ultimate. That variance has two parts: uncertainty about what the correct expected loss development factor is; and variance of the ultimate loss in each year, as estimated using loss development, around the actual ultimate loss.
It is not hard to see that the uncertainty about the expected loss development factor can be essentially ignored per Note 5 at the beginning of this section. The variance in future loss emergence[33] on the various years requires some analysis, though. Estimating the remaining random β2, given appropriate volume in the triangle, can be done using some fairly well established procedures. For example, a paper by Hayne (1985) details one approach. The result of this approach would be a multiplicative distribution with a mean of unity and a variance of some β2.
Of course, it is then necessary to combine α2 and β2. First, α2 should be converted to a multiplicative distribution to use with the multiplicative loss development distribution. Such a distribution would represent the ratio of actual to expected losses, which has a mean of one and a variance of α2/(expected loss)2. The multiplicative combination of these two clearly independent distributions gives

\left(\text{Variance of the process/parameter error in geometric Brownian motion space}\right) = \beta^2 + \frac{\alpha^2}{(\text{expected loss})^2} + \frac{\alpha^2\beta^2}{(\text{expected loss})^2}
So, when that is converted to a parameter in the linear model[34], one may show that
\sigma^2 = \log\left(\beta^2 + \frac{\alpha^2}{(\text{expected loss})^2} + \frac{\alpha^2\beta^2}{(\text{expected loss})^2} + 1\right) \tag{4.13}
Then, that estimate may be combined with equation (4.9) to obtain an estimate of δ2.
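A brief sketch of the combination and the follow-on use of equation (4.9) is given below. The values for α2, β2, the expected loss level, and the series are hypothetical, and the log transform of the series follows the conversion described in subsection 4.3.

```python
import math

def combined_sigma2(alpha2, beta2, expected_loss):
    """Linear-model sigma^2 per equation (4.13): combine the process variance
    alpha^2 with the development variance beta^2 (both in units consistent
    with expected_loss, e.g., loss ratio space), then take logs to move to
    the linear (log) scale. Illustrative sketch."""
    geometric_var = (beta2 + alpha2 / expected_loss ** 2
                     + alpha2 * beta2 / expected_loss ** 2)
    return math.log(1.0 + geometric_var)

def delta2_from_endpoints(S, sigma2):
    """delta^2 per equation (4.9), using the endpoints of the log-transformed,
    trended series and an externally estimated sigma^2."""
    n = len(S)
    return ((S[-1] - S[0]) ** 2 - 2.0 * sigma2) / (n - 1)

# Hypothetical values
sigma2 = combined_sigma2(alpha2=0.002, beta2=0.002, expected_loss=0.68)
log_series = [math.log(x) for x in
              [0.55, 0.60, 0.58, 0.64, 0.66, 0.63, 0.70, 0.72, 0.69, 0.78]]
print(sigma2, delta2_from_endpoints(log_series, sigma2))
```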
4.5. Method 5: Estimating δ2 using a larger dataset and σ2 by subtraction
Just as σ2 may be estimated using alternate approaches, δ2 may often be estimated in isolation as well. If a larger proxy dataset (for example, the countrywide private passenger auto experience of a major carrier when rates are being made in a low-volume state) is available, and that dataset has very minimal process/parameter risk, then formula (4.8) from subsection 4.3 should produce a very high quality estimate of δ2. Then, using equation (4.6), σ2 may be estimated via
\frac{\sum_{i=1}^{n-1}\left(S_{i+1}-S_{i}\right)^{2}}{2(n-1)}-\frac{\delta^{2}}{2} \cong \sigma^{2} \tag{4.14}
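A small sketch of that subtraction, assuming δ2 has already been estimated from the larger proxy dataset and the series has been trended and, if geometric, log-transformed; the inputs shown are hypothetical.

```python
def sigma2_by_subtraction(S, delta2_proxy):
    """sigma^2 per equation (4.14): the mean squared adjacent difference,
    less half of a delta^2 estimated from a larger proxy dataset."""
    n = len(S)
    adjacent = sum((S[i + 1] - S[i]) ** 2 for i in range(n - 1))
    return adjacent / (2 * (n - 1)) - delta2_proxy / 2

# e.g., delta^2 from countrywide data applied to a small state's series
print(sigma2_by_subtraction([0.62, 0.66, 0.59, 0.64, 0.70, 0.63, 0.68], 0.0009))
```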
4.6. All or many of the above
Several methods were presented above. They all have different strengths and weaknesses. Whenever possible, it may be helpful to review the results of more than one method. Note that the credibility formula is not a formula in σ2 and δ2 per se; it is actually a formula in either the ratio K = δ2/σ2 or in K and B. So, when different values for σ2 and δ2 result from different approaches, but the ratio K is similar, the methods fundamentally agree. Also, note that what may look like large changes in K may have a very minor effect on the credibility when K is very large. Lastly, should the methods disagree, that creates an opportunity to evaluate the strengths and weaknesses of each one.

Summary
The “square root” or classical credibility process has been in use for many years. Nevertheless, that method has a significant flaw in that the statistical assumptions (confidence level and failure threshold) may be chosen arbitrarily. Further, it assumes that whatever data receives the complement of credibility is stable and reliable, even when that data is, say, four years of a 20% trend rate. It is hoped that the approach presented here, by providing a reliable credibility process that uses minimal assumptions, will restructure the credibility processes used by casualty actuaries. Then, the profession can be comfortable that rate indications that use the resulting credibility values are as accurate as possible.