hsfull | R Documentation |
This is a classical data set from the field of education used to illustrate
multilevel data and models. It is used in the first edition of Bryk and Raudenbush.
hsfull
is the complete data set with 160 high schools,
hs
is a random subset of 40 high schools, hs1
is a random subset of
80 schools and h2
contains the complement of hs1
. These two subsets
can be used to illustrate split sample validation: develop a model on one half of the data
and assess its performance on the other.
hsfull
A data frame with 7185 observations on the following 9 variables.
school
school id
mathach
measure of math achievment
ses
socio-economic status of family
Sex
a factor with levels Female
Male
Minority
a factor with levels No
Yes
Size
the size of the school
Sector
a factor with levels Catholic
Public
PRACAD
a measure of the priority given by the school to academic subjects
DISCLIM
a measure of the disciplinary climate in the school
Each row consists of the data for one student. hsfull
is the complete data set. hs1
and hs2
are complementary split halves of the schools in the data. hs
is a selection of 40 schools
which seems to be a good number of clusters for presentations in class.
Bryk and Raudenbush (COMPLETE)
Raudenbush, Stephen and Bryk, Anthony (2002), Hierarchical Linear Models: Applications and Data Analysis Methods, Sage (chapter 4).
## Not run:
xqplot(hsfull)
xqplot( up( hsfull, ~ school) )
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.