The data for this case study comes from the former Swedish insurance company Wasa, and concerns partial casco insurance, for motorcycles this time. It contains aggregated data on all insurance policies and claims during 1994-1998; the reason for using this rather old data set is confidentiality; more recent data for ongoing business can not be disclosed.
A data frame with 64548 observations on the following 9 variables.
The owners age, between 0 and 99, a numeric vector
The owners age, between 0 and 99, a factor with levels
Geographic zone numbered from 1 to 7, in a standard classification of all Swedish parishes, a numeric vector
MC class, a classification by the so called EV ratio, defined as (Engine power in kW x 100) / (Vehicle weight in kg + 75), rounded to the nearest lower integer. The 75 kg represent the average driver weight. The EV ratios are divided into seven classes, a numeric vector
Vehicle age, between 0 and 99, a numeric vector
Bonus class, taking values from 1 to 7. A new driver starts with bonus class 1; for each claim-free year the bonus class is increased by 1. After the first claim the bonus is decreased by 2; the driver can not return to class 7 with less than 6 consecutive claim free years, a numeric vector
the number of policy years, a numeric vector
the number of claims, a numeric vector
the claim cost, a numeric vector
The dataset "mccase.txt"
Ohlsson E., Johansson B. (2010), Non-life insurance pricing with generalized linear models, Springer