LanguageData | R Documentation |
This is an example used in Chapter 7 of Almond, et. al. (2015), based on a conceptual
assessment described in Mislevy (1995). The assessment has four reporting variables:
_Reading_, _Writing_, _Speaking_, and _Listening_, each of which can take on states
'Novice', 'Intermediate', or 'Advanced'.
To estimate the reliability 1000 students were simulated from
the network (see NetworkTester
and accuracy, using Maximum A Prior
estimates, ('Langauge_modal') and expected accuracy ('Language_exp') estimates were calculated.
data("Language_modal")
data("Language_exp")
data("language16")
data("language24")
The format for both 'Language_modal' and 'Language_exp' is: List of 4 $ Reading A 3x3 matrix giving the accuracy of the Reading measure. $ Writing A 3x3 matrix giving the accuracy of the Writing measure. $ Speaking A 3x3 matrix giving the accuracy of the Speaking measure. $ Listening A 3x3 matrix giving the accuracy of the Listening measure.
All cases, the table has been normalized so that all entries sum to 1.
The data sets 'language16' and 'language24' are simulated data (1000 cases each) from both the 16 task test described above, and a 24 task variant which has 6 copies of each task. Both data frames have the following columns (not necessarily in this order).
A case number.
The "true" (simulated) ability.
The simulated responses for the type D (Listening) tasks. 'TaskD5' is only in 'language24'.
The simulated reponses for the type C (Speaking) tasks. 'TaskC3', 'TaskC4', and 'TaskC5' are only in the 'langauge24' data set.
The simulated responses for the type B (Writing) tasks. 'TaskB3', 'TaskB4', and 'TaskB5' are only in the 'language24' data set.
The simulated responses for the type A (Reading) tasks. 'TaskA5' is only available in the 'langauge24' data set.
The estimated probability that this subject is in the 'Novice', 'Intermediate' and 'Advanced' categories respectively of the Listening skill.
The estimated probability that this subject is in the 'Novice', 'Intermediate' and 'Advanced' categories respectively of the Speaking skill.
The estimated probability that this subject is in the 'Novice', 'Intermediate' and 'Advanced' categories respectively of the Writing skill.
The estimated probability that this subject is in the 'Novice', 'Intermediate' and 'Advanced' categories respectively of the Reading skill.
The most likely state of the skill variables given the relevant data.
The simulation code used to build 'language16' and 'language24' is in 'vignette("SimulationStudies",package="RNetica")'.
The sample language assessment contains:
5 Reading tasks which only tap the Reading attribute.
3 Reading/Writing integrated tasks which require a written response to a textual prompt.
3 Reading/Speaking/Listening Integrated tasks which require a spoken response to an aural stimulus, with textual instructions.
5 Listening tasks which have both aural stimulus and instructions.
Because different numbers of tasks (which have different strengths of evidence) are available, different amounts of information are available about each of the four target variables. A number of different measures of reliability can be calculated from the (expected) accuracy matrixes.
Almond, R.G., Mislevy, R.J., Steinberg, L.S., Williamson, D.M. and Yan, D. (2015) Bayesian Networks in Educational Assessment. Springer. Chapter 7.
Mislevy, R. J. (1995) Test theory and language learning in assessment. Language Testing, 12, 341–369.
data(Language_modal)
fcKappa(Language_modal$Reading)
gkLambda(Language_modal$Reading)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.