powerplant: The Combined Cycle Power Plant Data Set

Description Usage Format Note Source Examples

Description

This dataset has 9568 records collected over 6 years (2006 - 2011). The data come from a Combined Cycle Power Plant (CCPP), which has operated under a full load. The data are available on the UCI repository, but not in an easy way to read into R. This provides a convenient form to access the data. A CCPP is composed of gas turbines (GT), steam turbine (ST), and heat recovery STs. The vacuum has effect on ST, while the others affect GT performance.

The authors of the orginal data performed a 5x2 fold statistical test on 5 shuffles (random Permutations). The first shuffle is in the order that this information is presented. The remaining shuffle indices are provided as the shuffle attribute. These shuffles preserve the original data generators 2-fold CV format for 10 measuresments which they used for statistical testing.

Usage

1

Format

AT: Temperatures in the range of 1.81 to 37.11 Celsius.

V: Exhuast vacuum in the range of 25.36 to 81.56 cm Hg.

AP: Ambient presure in the range of 992.89 to 1033.30 milibar.

RH: Relative humidity in the range of 25.56% to 100.16%.

PE: Net Hourly electorial energy output 40.26 to 4895.76 MW.

shuffle2: CV shuffle 2 indices (refer below for an example).

shuffle3: CV shuffle 3 indices.

shuffle4: CV shuffle 4 indices.

shuffle5: CV shuffle 5 indices.

Note

The averages are taken from various sensors located around the plant that record the ambient variables every second. The variables are given without any normalization.

Source

The data was taken direclty from the Excel files on the UCI repository. This allows for a convienent access to these data.

Examples

1
2
	data(powerplant)
	x<-powerplant[attr(powerplant,"shuffle")[,1],]

SemiSupervised documentation built on May 11, 2018, 5:03 p.m.