ohio: Ohio elementary school dataset

Description Format Source References

Description

Data set on 1,965 Ohio elementary school buildings for 2001-2002.

Format

The ‘ohio’ dataset contains 6 objects as follows.

ohioSchools

Original data ohioschool.dat from http://www.spatial-econometrics.com/ (J. LeSage and R. Pace 2009). The data set contains information on, for instance, school building ID, Zip code of the location of the school, proportion of passing on five subjects, number of teacher, number of student, etc. The variables are:
col 1: zip code
col 2: lattitude (zip centroid)
col 3: longitude (zip centroid)
col 4: buidling irn
col 5: district irn
col 6: # of teachers (FTE 2001-02)
col 7: teacher attendance rate
col 8: avg years of teaching experience
col 9: avg teacher salary
col 10: Per Pupil Spending on Instruction
col 11: Per Pupil Spending on Building Operations
col 12: Per Pupil Spending on Administration
col 13: Per Pupil Spending on Pupil Support
col 14: Per Pupil Spending on Staff Support
col 15: Total Expenditures Per Pupil
col 16: Per Pupil Spending on Instruction % of Total Spending Per Pupil
col 17: Per Pupil Spending on Building Operations % of Total Spending Per Pupil
col 18: Per Pupil Spending on Administration % of Total Spending Per Pupil
col 19: Per Pupil Spending on Pupil Support % of Total Spending Per Pupil
col 20: Per Pupil Spending on Staff Support % of Total Spending Per Pupil
col 21: irn number
col 22: avg of all 4th grade proficiency scores
col 23: median of 4th grade prof scores
col 24: building enrollment
col 25: short-term students < 6 months
col 26: 4th Grade (or 9th grade) Citizenship % Passed 2001-2002
col 27: 4th Grade (or 9th grade) math % Passed 2001-2002
col 28: 4th Grade (or 9th grade) reading % Passed 2001-2002
col 29: 4th Grade (or 9th grade) writing % Passed 2001-2002
col 30: 4th Grade (or 9th grade) science % Passed 2001-2002
col 31: pincome per capita income in the zip code area
col 32: nonwhite percent of population that is non-white
col 33: poverty percent of population in poverty
col 34: samehouse % percent of population living in same house 5 years ago
col 35: public % of population attending public schools
col 36: highschool graduates, educ attainment for 25 years plus
col 37: associate degrees, educ attainment for 25 years plus
col 38: college, educ attainment for 25 years plus
col 39: graduate, educ attainment for 25 years plus
col 40: professional, educ attainment for 25 years plus

ohioGrades

The derived dataset for analyzing the percentage passed based on Zip codes. The variables are:
y: the percentage passed (4th or 9th grade) in each school
TchExp: average Teacher's experience
Subjects: for five study subjects of Citizenship, Maths, Reading, Writing and Science
Stu.Tch: student by teacher ratio
School: school index
Zip: Zip code

ohioMedian

The derived dataset for analyzing the median of 4th grade scores based on school districts. The variables are:
MedianScore: the median of 4th grade prof scores
district: school districts

ohioShape

A SpatialPolygonsDataFrame object (see package sp) containing the map information of ohio school districts.

ohioZipDistMat

The spatial distance matrix based on Zip codes. The codes generated this matrix are:
Zsp <- model.matrix(~ factor(Zip) - 1, data = ohioGrades)
uzipC <- matrix(0, nrow = ncol(Zsp), ncol = 2)
Zip <- as.numeric(substr(colnames(Zsp), start = 12, stop = 16))
for (i in 1: ncol(Zsp)) {
Cord <- as.matrix(ohioSchools[(ohioSchools$V1 == Zip[i]), 2:3])
uzipC[i,] <- Cord[1,]
}
Dst <- as.matrix(dist(uzipC))
for(i in 1:nrow(Dst)) {
x <- Dst[i,]
x <- ifelse(x == 0, 0, 1/x)
Dst[i,] <- ifelse(x > 4, 4, x)
}
ohioZipDistMat <- Dst/4

ohioDistrictDistMat

The spatial distance matrix based on school districts. The codes generated this matrix are:
ccNb <- poly2nb(ccShape)
W <- matrix(0, 616, 616)
for (i in 1:nrow(W)) {
tmp <- as.numeric(ccNb[[i]])
for (k in tmp) W[i,k] <- 1
}
W[353,] <- W[,353] <- 0
districtShape <- as.numeric(substr(as.character(ohioShape@data$UNSDIDFP), 3, 7))
dimnames(W) <- list(districtShape, districtShape)
districtSchool <- floor(ohioSchools[,5]/10)
districtSchool <- factor(districtSchool[districtSchool %in% districtShape])
levelsShape <- levels(factor(districtShape))
levelsSchool <- levels(districtSchool)
levels(districtSchool) <- c(levelsSchool, levelsShape[!(levelsShape %in% levelsSchool)])
ohioDistrictDistMat <- W[levels(districtSchool),levels(districtSchool)]

Source

J. LeSage and R. Pace (2009). Introduction to Spatial Econometrics. Chapman \& Hall/CRC, Boca Raton.

References

J. LeSage and R. Pace (2009). Introduction to Spatial Econometrics. Chapman \& Hall/CRC, Boca Raton.

M. Alam, L. Ronnegard, X. Shen (2014). Fitting spatial models in hglm. Submitted.


hglm.data documentation built on May 1, 2019, 6:34 p.m.