# Data sets in the cplm package

### Description

The data sets included in the package are described here.

### Usage

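A minimal sketch of loading the three data sets described under Format, assuming the cplm package is installed and that the data sets are available through R's standard `data()` mechanism:

```r
# Assumes the cplm package is installed.
library(cplm)

# Load each data set into the workspace
data(FineRoot)
data(ClaimTriangle)
data(AutoClaim)

# Inspect the structure of one of them
str(FineRoot)
```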

### Format

`FineRoot`

: a data set used for the study of the fine root length density of plants. It is a data frame with 511 records and 5 variables:

`Plant`

:identifier of the apple tree, 1-8

`Stock`

:root stocking, one of three different root stocks: Mark, MM106 and M26

`Spacing`

:between-row $\times$ within-row spacings, one of the following two: $4 \times 2$ meters and $5 \times 3$ meters

`Zone`

:inner or outer

`RLD`

:root length density

`ClaimTriangle`

: a data set from an insurance loss reserving triangle. It is a data frame with 55 records and 3 variables:

`year`

:the year when the accident occurs

`lag`

:development lag

`increLoss`

:incremental insurance loss in 1000s
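Because `ClaimTriangle` stores one record per accident year and development lag, it can be reshaped into the familiar triangle layout. A minimal sketch using base R's `xtabs()`, assuming the cplm package is installed; note that cells with no record show up as 0 in the cross-tabulation rather than as missing:

```r
# Assumes the cplm package is installed.
library(cplm)
data(ClaimTriangle)

# Cross-tabulate incremental losses:
# rows are accident years, columns are development lags
tri <- xtabs(increLoss ~ year + lag, data = ClaimTriangle)
tri
```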

`AutoClaim`

: a motor insurance data set retrieved from the SAS Enterprise Miner database. It is a data frame with 10296 records and 29 variables:

`POLICYNO`

:"character", the policy number

`PLCYDATE`

:"Date", policy effective date

`CLM_FREQ5`

:"integer", the number of claims in the past 5 years

`CLM_AMT5`

:"integer", the total claim amount in the past 5 years

`CLM_AMT`

:"integer", the claim amount in the current insured period

`KIDSDRIV`

:"integer", the number of driving children

`TRAVTIME`

:"integer", the distance to work

`CAR_USE`

:"factor", the primary use of the vehicle: "Commercial", "Private".

`BLUEBOOK`

:"integer", the value of the vehicle

`RETAINED`

:"integer", the number of years as a customer

`NPOLICY`

:"integer", the number of policies

`CAR_TYPE`

:"factor", the type of the car: "Panel Truck", "Pickup", "Sedan", "Sports Car", "SUV", "Van".

`RED_CAR`

:"factor", whether the color of the car is red: "no", "yes".

`REVOLKED`

:"factor", whether the driver's license was revoked in the past 7 years: "No", "Yes".

`MVR_PTS`

:"integer", MVR violation records

`CLM_FLAG`

:"factor", whether a claim is reported: "No", "Yes".

`AGE`

:"integer", the age of the driver

`HOMEKIDS`

:"integer", the number of children

`YOJ`

:"integer", years at current job

`INCOME`

:"integer", annual income

`GENDER`

:"factor", the gender of the driver: "F", "M".

`MARRIED`

:"factor", married or not: "No", "Yes".

`PARENT1`

:"factor", single parent: "No", "Yes".

`JOBCLASS`

:"factor", the job class: "Unknown", "Blue Collar", "Clerical", "Doctor", "Home Maker", "Lawyer", "Manager", "Professional", "Student".

`MAX_EDUC`

:"factor", max education level: "<High School", "Bachelors", "High School", "Masters", "PhD".

`HOME_VAL`

:"integer", the value of the insured's home

`SAMEHOME`

:"integer", years at the current address

`DENSITY`

:"factor", home/work area: "Highly Rural", "Highly Urban", "Rural", "Urban".

`IN_YY`

:"logical", whether the record is used in the Yip and Yau (2005) paper.

### Source

de Silva, H. N., Hall, A. J., Tustin, D. S. and Gandar, P. W. (1999). Analysis of distribution
of root length density of apple trees on different dwarfing rootstocks. *Annals of
Botany*, 83: 335-345.

Dunn, P.K. and Smyth, G.K. (2005). Series evaluation of Tweedie exponential dispersion model densities. *Statistics and Computing*, 15, 267-280.

Peters G. W., Shevchenko P. V. and Wuthrich M. V. (2009). Model Uncertainty in Claims Reserving within Tweedie's Compound Poisson Models. *Astin Bulletin*, 39(1), 1-33.

Yip, K. C. H. and Yau, K. K. W. (2005). On Modeling Claim Frequency Data In General
Insurance With Extra Zeros. *Insurance: Mathematics and Economics*, 36(2), 153-163.