The forest fire data were collected from January 2000 to December 2003 for fires in the Montesinho natural park, located in the northeast region of Portugal. The response variable of interest was the area burned in hectares (ha). When the area burned was less than one-tenth of a hectare, the response variable was set to zero. In all, there were 517 fires, 247 of which were recorded as zero. The region was divided into a 10-by-10 grid with coordinates X and Y running from 1 to 9. The categorical variable xyarea indicates the region in this grid where the fire occurred.
A data frame with 517 observations on the following 12 variables. All quantitative variables have been standardized.
a factor with 36 levels
an ordered factor with 12 levels
an ordered factor with 7 levels
fine fuel moisture code
duff moisture code
initial spread index
average ambient temperature
a numeric vector
log(x+1), x is burned area with x=0 for small fires
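The response transformation described above can be illustrated with a minimal Python sketch. The function name `transform_area`, the `threshold` parameter, and the sample areas are hypothetical and chosen only for illustration. Only the 0.1 ha cutoff and the log(x+1) form come from the description.

```python
import math

def transform_area(area_ha, threshold=0.1):
    """Return log(x + 1), where x is the burned area in hectares
    and x is set to 0 for fires smaller than `threshold` ha."""
    # Small fires (below one-tenth of a hectare) are recorded as zero.
    x = 0.0 if area_ha < threshold else area_ha
    # log1p(x) computes log(1 + x) accurately for small x.
    return math.log1p(x)

# Illustrative (made-up) burned areas in hectares.
areas = [0.0, 0.05, 0.36, 10.0]
transformed = [transform_area(a) for a in areas]
```

Under this transformation a zero area maps to zero, so the 247 small fires stay at zero on the transformed scale.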
The original data, together with an analysis, may be found at the
website below.
The quantitative variables in this dataset have been standardized.
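As a rough illustration of what standardization typically means, the Python sketch below centers a variable at its mean and scales it by its sample standard deviation. This is an assumption: the documentation does not state which centering and scaling were used. The function name `standardize` and the sample values are hypothetical.

```python
import statistics

def standardize(values):
    """Center values at their mean and scale by the sample standard
    deviation (n - 1 denominator). Assumed convention, not confirmed
    by the dataset documentation."""
    mean = statistics.fmean(values)
    sd = statistics.stdev(values)
    return [(v - mean) / sd for v in values]

# Illustrative (made-up) temperature readings.
temps = [8.2, 18.0, 14.6, 8.3, 11.4]
z = standardize(temps)
```

After this step each variable has mean 0 and sample standard deviation 1, so regression coefficients are comparable across variables measured in different units.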
For convenience, the original data is provided in
P. Cortez and A. Morais, 2007. A Data Mining Approach to Predict Forest Fires using Meteorological Data. In J. Neves, M. F. Santos and J. Machado Eds., New Trends in Artificial Intelligence, Proceedings of the 13th EPIA 2007 - Portuguese Conference on Artificial Intelligence, December, Guimaraes, Portugal, pp. 512-523, 2007.