This is a simulated data set, based on the car insurance data set used throughout the text. It contains 40,000 policies, each observed over 3 years, giving 120,000 records (one per policy per year).
A data frame with 120000 observations on the following 6 variables.
number of policy, a numeric vector
driver's age category: 1 (youngest), 2, 3, 4, 5, 6, a numeric vector
vehicle value, in categories 1,...,6 (category 1 has been recoded as 9), a numeric vector
1, 2, 3, a numeric vector
number of claims, a numeric vector
a numeric vector
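The long (one record per policy per year) layout described above can be sketched as follows. This is only an illustration of the record structure, not the data set itself: the column names `policy_id` and `period` are hypothetical, and the only figures taken from the description are the 40,000 policies and the 3 years.

```python
from itertools import product

N_POLICIES = 40_000  # number of policies, from the description
N_PERIODS = 3        # years of observation, from the description

# One record per (policy, year) pair. The column names are hypothetical
# placeholders; the real data set has 6 variables per record.
records = [
    {"policy_id": policy_id, "period": period}
    for policy_id, period in product(
        range(1, N_POLICIES + 1), range(1, N_PERIODS + 1)
    )
]

n_records = len(records)                     # total number of records
rows_per_policy = n_records // N_POLICIES    # records held for each policy
```

Under these assumptions, each policy number appears on three consecutive records, which is why the policy count and the observation count differ by a factor of 3.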
The dataset "Longitudinal Claims"
De Jong, P. and Heller, G. Z. (2008). Generalized Linear Models for Insurance Data. Cambridge University Press.