README.md
In problemofpoints/reservetestr: Provides a Framework for Testing Loss Reserve Methods

reservetestr

The goal of reservetestr is to provide a framework for testing loss reserve methods. Specifically, an interface to test methods against the Casualty Actuarial Society (CAS) Loss Reserve Database is provided.

You can install reservetestr from GitHub with:

# install.packages("devtools")
devtools::install_github("problemofpoints/reservetestr", upgrade_dependencies = FALSE)

library(reservetestr)
suppressPackageStartupMessages(library(ChainLadder))
library(dplyr)
#> 
#> Attaching package: 'dplyr'
#> The following objects are masked from 'package:stats':
#> 
#>     filter, lag
#> The following objects are masked from 'package:base':
#> 
#>     intersect, setdiff, setequal, union
library(tidyr)
library(purrr)
library(ggplot2)

reservetestr::ggSetTheme()

From the CAS website:

“Our goal is to prepare a clean and nice data set of loss triangles that could be used for claims reserving studies. The data includes major personal and commercial lines of business from U.S. property casualty insurers. The claims data comes from Schedule P - Analysis of Losses and Loss Expenses in the National Association of Insurance Commissioners (NAIC) database.”

Below is an example paid loss triangle from the database. The first triangle is the upper triangle used to fit the model. The second triangle is the actual lower right of the triangle used for model validation / testing.

cas_loss_reserve_db %>%
  get_meyers_subset(edition = 2) %>%
  filter(line == "ppauto" & group_id == 388) %>%
  pluck("train_tri_set", 1, "paid")
#>       dev_lag
#> acc_yr     1     2     3     4     5     6     7     8     9    10
#>   1988 13440 35680 48703 56319 61018 61119 63049 63556 63744 63835
#>   1989 18757 44166 57578 66264 65600 67721 75369 76713 77007    NA
#>   1990 19834 42225 56347 63194 67112 69459 74267 79208    NA    NA
#>   1991 16230 38045 46055 53983 60638 62917 71074    NA    NA    NA
#>   1992 14629 22427 33873 43339 53168 60413    NA    NA    NA    NA
#>   1993 24597 51373 68484 80253 92192    NA    NA    NA    NA    NA
#>   1994 31723 59733 77398 94395    NA    NA    NA    NA    NA    NA
#>   1995 37397 71133 94294    NA    NA    NA    NA    NA    NA    NA
#>   1996 53670 98628    NA    NA    NA    NA    NA    NA    NA    NA
#>   1997 52837    NA    NA    NA    NA    NA    NA    NA    NA    NA
#> attr(,"exposure")
#>  [1]  83473  91800  95877  99256  96170 139038 152174 167833 180523 164717

The back-testing results use a subset of the full database, resulting in 50 triangles across four lines of business - comauto, othliab, ppauto, wkcomp. For this example, we will only use “comauto”.

cas_db_subset <- cas_loss_reserve_db %>%
  get_meyers_subset(edition = 2)

The main function is run_single_backtest which, in this example, runs the ChainLadder::MackChainLadder method on each paid loss triangle.

mack_paid_results <- run_single_backtest(cas_db_subset, 
                                         testr_MackChainLadder, 
                                         lines_to_include = "comauto",
                                         loss_type_to_backtest = "paid", 
                                         method_label = "mack_paid")

The output provided is the below.

glimpse(mack_paid_results)
#> Observations: 49
#> Variables: 11
#> $ line              <chr> "comauto", "comauto", "comauto", "comauto", "coma...
#> $ group_id          <dbl> 353, 388, 620, 671, 715, 833, 1066, 1090, 1538, 1...
#> $ company           <chr> "Celina Mut Grp", "Federal Ins Co Grp", "Employer...
#> $ method            <chr> "mack_paid", "mack_paid", "mack_paid", "mack_paid...
#> $ actual_ultimate   <dbl> 40000, 745997, 388485, 52884, 107590, 24613, 6302...
#> $ actual_unpaid     <dbl> 7399, 189270, 89855, 13739, 27777, 2959, 5269, 34...
#> $ mean_ultimate_est <dbl> 39177.438, 714600.238, 398408.978, 58625.452, 113...
#> $ mean_unpaid_est   <dbl> 6576.438, 157873.238, 99778.978, 19480.452, 33796...
#> $ stddev_est        <dbl> 1442.5094, 46454.1901, 9466.1578, 2662.4870, 3137...
#> $ cv_unpaid_est     <dbl> 0.21934510, 0.29424994, 0.09487126, 0.13667480, 0...
#> $ implied_pctl      <dbl> 0.7200268344, 0.7563488511, 0.1468789988, 0.01229...

We can create a p-p plot to visually access the accuracy of our estimates. If our method was perfect, we would expect to see the dots fall along the 45 degree line - indicating that our implied percentiles are perfectly uniform.

That is not the case in this example, so for “comauto” the Mack method applied to paid data does not back-test well.

create_pp_plot(mack_paid_results, by_line = TRUE)

Add error metrics for deterministic methods
Create an object structure using S3 class system
Add more error handling
Add testr_ functions for the other methods in ChainLadder
Add ability to test several methods at once

problemofpoints/reservetestr documentation built on Jan. 3, 2020, 9:12 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

problemofpoints/reservetestr
Provides a Framework for Testing Loss Reserve Methods

README.md
In problemofpoints/reservetestr: Provides a Framework for Testing Loss Reserve Methods

reservetestr

Installation

Example Usage

CAS Loss Reserve Database

Future Enhancements

R Package Documentation

Browse R Packages

We want your feedback!

problemofpoints/reservetestr Provides a Framework for Testing Loss Reserve Methods

README.md In problemofpoints/reservetestr: Provides a Framework for Testing Loss Reserve Methods

reservetestr

Installation

Example Usage

CAS Loss Reserve Database

Future Enhancements

R Package Documentation

Browse R Packages

We want your feedback!

problemofpoints/reservetestr
Provides a Framework for Testing Loss Reserve Methods

README.md
In problemofpoints/reservetestr: Provides a Framework for Testing Loss Reserve Methods