ptg_stud_data: Student Performance Data Set

ptg_stud_dataR Documentation

Student Performance Data Set

Description

Data from a student achievement in secondary education of two Portuguese schools. Full attribute description could be found in the source webpage.

Usage

ptg_stud_data

Format

An object of class data.frame with 649 rows and 31 columns.

Details

The data used is taken from the Student Performance Data. The original data consists of 30 covariates (13 binary, 11 ordinal, 4 categorical, 2 numerical) and a numerical output variable indicating the students final grade in Portuguese Language course.

The data was split by gender (F/M) n_f=383, n_m=266. The target variable G3 was converted to binary, final_fail which indicates the cases where G3 < 10.

Next, each sub-population was divided into training and testing data, using a 4:1 ratio.

Source

https://archive.ics.uci.edu/ml/datasets/student+performance

References

P. Cortez and A. Silva. Using Data Mining to Predict Secondary School Student Performance. In A. Brito and J. Teixeira Eds., Proceedings of 5th FUture BUsiness TEChnology Conference (FUBUTEC 2008) pp. 5-12, Porto, Portugal, April, 2008, EUROSIS, ISBN 978-9077381-39-7.

See Also

http://www3.dsi.uminho.pt/pcortez/student.pdf


LogRegEquiv documentation built on March 18, 2022, 6:13 p.m.