This is the complete caterpillar dataset from a 1973 study on pine_full
processionary caterpillars. It assesses the influence of some forest
settlement characteristics on the development of caterpillar colonies. The
response variable is the logarithmic transform of the average number of
nests of caterpillars per tree in an area of 500 square meters (
There are k=10 potentially explanatory variables defined on n=55 areas.
A data frame with 55 observations on the following 11 variables.
altitude (in meters)
slope (en degrees)
number of pine_fulls in the area
height (in meters) of the tree sampled at the center of the area
diameter (in meters) of the tree sampled at the center of the area
index of the settlement density
orientation of the area (from 1 if southbound to 2 otherwise)
height (in meters) of the dominant tree
number of vegetation strata
mix settlement index (from 1 if not mixed to 2 if mixed)
logarithmic transform of the average number of nests of caterpillars per tree
These caterpillars got their names from their habit of moving over the ground in incredibly long head-to-tail processions when leaving their nest to create a new colony.
Tomassone R., Audrain S., Lesquoy-de Turckeim E., Millier C. (1992), “La régression, nouveaux regards sur une ancienne méthode statistique”, INRA, Actualités Scientifiques et Agronomiques, Masson, Paris.
J.-M. Marin, C. Robert. (2007). Bayesian Core: A Practical Approach to Computational Bayesian Statistics. Springer, New-York, pages 48-49.
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.