Description Usage Format Author(s) Source References Examples
The motor insurance dataset was originally retrieved from the cplm package. It contains insurance claim data as well as information on the policyholder. Only a subset of the original variables is kept, and some are transformed (see the descriptions below). Missing values are imputed via the rrcovNA::impSeq function.
data("AutoClaim")
A data frame with 10296 observations on the following 35 variables.
CLM_AMT5: Aggregate claim loss of policy (in thousands)
KIDSDRIV: Number of child passengers
TRAVTIME: Commute time
CAR_USE: (1) Private or (2) commercial use
BLUEBOOK: (log) car value
NPOLICY: Number of policies
RED_CAR: Whether the color of the car is (2) red or (1) not
REVOLKED: Whether the policyholder's license was (2) revoked in the past or (1) not
MVR_PTS: Number of motor vehicle record points
HOMEKIDS: Number of children at home
GENDER: Gender of policyholder: (1) female or (2) male
MARRIED: Whether the policyholder is (2) married or (1) not
PARENT1: Whether (2) the policyholder grew up in a single-parent family or (1) not
AREA: (1) Rural or (2) urban area
CAR_TYPE_2-6: (0-1 dummy variables) Type of car: (base) Panel Truck, (2) Pickup, (3) Sedan, (4) Sports Car, (5) SUV, (6) Van
JOBCLASS_2-9: (0-1 dummy variables) Job class of policyholder: (base) Unknown, (2) Blue Collar, (3) Clerical, (4) Doctor, (5) Home Maker, (6) Lawyer, (7) Manager, (8) Professional, (9) Student
MAX_EDUC_2-5: (0-1 dummy variables) Maximal level of education of policyholder: (base) less than High School, (2) Bachelors, (3) High School, (4) Masters, (5) PhD
AGE_CAT_2-5: (0-1 dummy variables) Age category of policyholder: (base) <30, (2) [30,40), (3) [40,50), (4) [50,60), (5) 60+
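The 0-1 dummy blocks above can be collapsed back into a single factor when a categorical view is more convenient. The following is a minimal sketch under stated assumptions: it runs on a small hand-made data frame rather than on the real data, the helper decode_dummies is hypothetical (it is not part of any package), and the column names CAR_TYPE_2, ..., CAR_TYPE_6 are assumed from the "CAR_TYPE_2-6" label and should be checked against names(AutoClaim).

```r
# Toy stand-in for four rows of the car-type dummy block.
# Column names are an assumption; verify them with names(AutoClaim).
toy <- data.frame(
  CAR_TYPE_2 = c(0, 1, 0, 0),
  CAR_TYPE_3 = c(0, 0, 1, 0),
  CAR_TYPE_4 = c(0, 0, 0, 0),
  CAR_TYPE_5 = c(0, 0, 0, 1),
  CAR_TYPE_6 = c(0, 0, 0, 0)
)

# Level labels in the order given in the Format section; the first is the base level.
levels_car <- c("Panel Truck", "Pickup", "Sedan", "Sports Car", "SUV", "Van")

# Hypothetical helper: map a block of 0-1 dummy columns back to a factor.
decode_dummies <- function(df, prefix, level_names) {
  # Dummy columns are numbered 2..K, where K is the number of levels.
  cols <- paste0(prefix, "_", seq_along(level_names)[-1])
  stopifnot(all(cols %in% names(df)))
  m <- as.matrix(df[, cols, drop = FALSE])
  # An all-zero row is the base level (index 1); otherwise the position
  # of the 1 within the block, shifted by one, gives the level index.
  idx <- ifelse(rowSums(m) == 0, 1L, max.col(m, ties.method = "first") + 1L)
  factor(level_names[idx], levels = level_names)
}

car_type <- decode_dummies(toy, "CAR_TYPE", levels_car)
print(car_type)
```

The same helper would apply to the JOBCLASS, MAX_EDUC and AGE_CAT blocks by changing the prefix and the vector of level labels, again assuming the columns follow the same naming pattern.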
Simon Fontaine, Yi Yang, Bo Fan, Wei Qian and Yuwen Gu.
Maintainer: Simon Fontaine <fontaines@dms.umontreal.ca>
cplm package.
Fontaine, S., Yang, Y., Fan, B., Qian, W. and Gu, Y. (2018). "A Unified Approach to Sparse Tweedie Model with Big Data Applications to Multi-Source Insurance Claim Data Analysis," to be submitted.
Zhang, Y. (2013). "cplm: Compound Poisson Linear Models." A vignette for R package cplm. Available from http://cran.r-project.org/web/packages/cplm.
Todorov, V. (2016). "rrcovNA: Scalable Robust Estimators with High Breakdown Point for Incomplete Data." A vignette for R package rrcovNA. Available from https://cran.r-project.org/web/packages/rrcovNA.