ACED.scores | R Documentation |
ACED (Adaptive Content with Evidence-Based Diagnosis; Shute, Hansen and Almond, 2008) is a Bayes net based assessment system which featured: (a) adaptive item selection and (b) extended feedback for incorrect items. This data contains both item level and pretest/posttest data from a field trial of the ACED system.
data("ACED")
ACED contains 3 primary data.frame
objects and some supplementary data.
All of the data tables have two variables which can serve as keys. SubjID
and AltID
.
Either can be used as a primary key in joins. Note that the first two digits of the AltID
gives the session (i.e., class) that the student was in. Note also that students in
the control group have only pretest and posttest data; hence they do not appear in
ACED.items
, ACED.scores
or ACED.splitHalves
.
ACED.scores
is data frame with 230 observations on 74
variables. These are mostly high-level scores from the Bayesian
network.
Cond_code
a factor giving the experimental condition for this student, the levels are “adaptive_acc”, “adaptive_full”, “linear_full”, and “control”. Note that there are no control students in this data set.
Sequencing
a factor describing whether the sequence of items
was Linear
or Adaptive
Feedback
a factor describing whether the feedback for
incorrect items was Extended
or AccuracyOnly
All_Items
a numeric vector giving the number of items in ACED
Correct
a numeric vector giving the number of items the student got correct
Incorr
a numeric vector giving the number of items the student got incorrect
Remain
a numeric vector giving the number of items not reached or skipped
ElapTime
a difftime vector giving the total time spent on ACED (in seconds)
The next group of columns give “scores” for each of the nodes in
the Bayesian network. Each node has four scores, and the columns are
names p
nodeScoreType where node is
replaced by one of the short codes in ACED.allSkills
.
p
nodeH
a numeric vector giving the probability node is in the high state
p
nodeM
a numeric vector giving the probability node is in the medium state
p
nodeL
a numeric vector giving the probability node is in the low state
EAP
nodethe expected a posteriori value of node
assuming an equal interval scale, where L=0
, M=1
and
H=2
MAP
nodea factor vector giving maximum a
posteriori value of node, i.e.,
which.max(p
nodeH, p
nodeM, p
nodeL)
.
ACED.skillNames
a list with two components, long
and short
giving
the long (spelled out in CamelCase) and short names for the skills is a character vector giving the abbreviations
used for the node/skill/attributes names.
ACED.items
is data frame with 230 observations on 73
variables. These are mostly item-level scores from the field trial.
The first two columns are SubjID
and AltID
. The remaining columns
correspond to ACED internal tasks, and are coded 1 for correct, 0 for incorrect, and
NA
for not reached.
ACED.taskNames
is essentially the row names from ACED.items
.
The naming scheme for the tasks reflect the skills measured by the task.
The response variable names
all start with t
(for task) followed by the name of one or more
skills tapped by the task (if there is more than one, then the first one is
“primary”.) This is followed by a numeric code, 1, 2 or 3,
giving the difficulty (easy, medium or hard) and a letter (a, b or
c) used to indicate alternate tasks following the same task model.
ACED.prePost
is data frame with 290 observations on 32
variables giving the results of the pretest and posttest.
SubjID
ID assigned by study team, “S” followed by 3 digits. Primary key.
AltID
ID assigned by the ACED software vendor. Pattern is “sXX-YY”, where XX is the session and YY is a student with the session.
Classroom
A factor correpsonding to the student's class.
Gender
A factor giving the student's gender (I'm not sure if this is self-report or administrative records.)
Race
A factor giving the student's self-reported race. The codebook is lost.
Level_Code
a factor variable describing the academic
track of the student with levels Honors
, Academic
,
Regular
, Part 1
, Part 2
and ELL
. The
codes Part 1
and Part 2
refer to special education
students in Part 1 (mainstream classroom) or Part 2 (sequestered).
pre_scaled
scale score (after equating) on pretest
post_scaled
scale score (after equating) on posttest
gain_scaled
post_scaled - pre_scaled
Form_Order
a factor variables describing whether
(AB
) Form A was the pretest and Form B was the posttest or
(BA
) vise versa.
PreACorr
number of correct items on Form A for students who took Form A as a pretest
PostBCorr
number of correct items on Form B for students who took Form B as a posttest
PreBCorr
number of correct items on Form B for students who took Form B as a pretest
PostACorr
number of correct items on Form A for students who took Form A as a posttest
PreScore
a numeric vector with either the non-missing
value from PreACorr
and PreBCorr
PostScore
a numeric vector with either the non-missing
value from PostACorr
and PostBCorr
Gain
PostScore - PreScore
preacorr_adj
PreACorr
adjusted to put forms A
and B on the same scale
postbcorr_adj
PostBCorr
adjusted to put forms A
and B on the same scale
prebcorr_adj
PreBCorr
adjusted to put forms A
and B on the same scale
postacorr_adj
PostACorr
adjusted to put forms A
and B on the same scale
Zpreacorr_adj
standardized version of preacorr_adj
Zpostbcorr_adj
standardized version of postbcorr_adj
Zprebcorr_adj
standardized version of prebcorr_adj
Zpostacorr_adj
standardized version of postacorr_adj
scale_prea
score on Form A for students who took Form A as a pretest scaled to range 0-100
scale_preb
score on Form B for students who took Form B as a pretest scaled to range 0-100
pre_scaled
scale score on pretest (whichever form)
scale_posta
score on Form A for students who took Form A as a posttest scaled to range 0-100
scale_postb
score on Form B for students who took Form B as a posttest scaled to range 0-100
Flagged
a logical variable (codebook lost)
Cond
a factor describing the experimental condition
with levels Adaptive/Accuracy
, Adaptive/Extended
,
Linear/Extended
and Control
. Note that controls are included in this data set.
Sequencing
a factor describing whether the sequence of items
was Linear
or Adaptive
Feedback
a factor describing whether the feedback for
incorrect items was Extended
or AccuracyOnly
ACED.Qmatrix
is a logical matrix whose rows correspond to skills (long names)
and whose columns correspond to tasks which is true if the skill is required for solving
the task (according to the expert).
ACED.QEM
is a reduced Q-matrix containing the 15 evidence models
(unique rows in the $Q$-matrix). The entries are character values with "0" indicating skill not needed, "+"
indicating skill is needed and "++" indicating skill is primary. The Tasks
column lists
the tasks corresponding to this evidence model (1, 2 and 3 again represent difficulty level, and the
letters indicating variants). The Anchor
column is used to identify subscales for scale identification.
ACED.splithalves
is a list of two datasets labeled “A” and “B”. Both
have the same structure as ACED.scores
(less the datas giving study condition). These were created
by splitting the 62 items into 2 subforms with 31 items each. For the most part, each item was paired with
an variant which differed only by the last letter. The scores are the results of Bayes net scoring with
half of the items.
ACED.pretest
and ACED.posttest
are raw data from the external pretest and posttest given
before and after the study. Each is a list with four components:
Araw
Unscored responses for students who took that form as pre(post)test. The first row is the key.
Ascored
The scored responses for the Araw
students; correct is 1, incorrect is 0.
Braw
Unscored responses for students who took that form as pre(post)test. The first row is the key.
Bscored
The scored responses for the Araw
students; correct is 1, incorrect is 0.
Because of the counterbalancing each student should appear in either Form A or Form B in the pretest
and in the other group in the posttest. Note that the A and B forms here have no relationship with the A and B
forms in ACED.splithalves
.
ACED is a Bayesian network based Assessment for Learning learning system, thus it served as both a assessment and a tutoring system. It had two novel features which could be turned on and off, elaborated feedback (turned off, it provided accuracy only feedback) and adaptive sequencing of items (turned off, it scheduled items in a fixed linear sequence).
It was originally built to cover all algebraic sequences (arithmetic, geometric and other recursive), but only the branch of the system using geometric sequences was tested. Shute, Hansen and Almond (2008) describe the field trial. Students from a local middle school (who studied arithmetic, but not geometric sequences as part of their algebra curriculum) were recruited for the study. The students were randomized into one of four groups:
Adaptive/Accuracy
Adaptive sequencing was used, but students only received correct/incorrect feedback.
Adaptive/Extended
Adaptive sequencing was used, but students received extended feedback for incorrect items.
Linear/Extended
The fixed linear sequencing was used, but students received extended feedback for incorrect items.
Control
The students did independent study and did not use ACED.
Because students in the control group were not exposed to the ACED
task, neither the Bayes net level scores nor the item level scores are
available for those groups, and those students are excluded from
ACED.scores
and ACED.items
. The students are in the
same order in all of the data sets, with the 60 control students
tacked onto the end of the ACED.prePost
data set.
All of the students (including the control students) were given a
25-item pretest and a 25-item posttest with items similar to the ones
used in ACED. The design was counterbalanced, with half of the
students receiving Form A as the pretest and Form B as the posttest
and the other half the other way around, to allow the two forms to be
equated using the pretest data. The details are buried in
ACED.prePost
.
Note that some irregularities were observed with the English Language
Learner (ACED.prePost$Level_code=="ELL"
) students. Their
teachers were allowed to translated words for the students, but in
many cases actually wound up giving instruction as part of the
translation.
Shute, V. J., Hansen, E. G., & Almond, R. G. (2008). You can't fatten a hog by weighing it—Or can you? Evaluating an assessment for learning system called ACED. International Journal of Artificial Intelligence and Education, 18(4), 289-316.
Thanks to Val Shute for permission to use the data.
ACED development and data collection was sponsored by National Science Foundation Grant No. 0313202.
A more detailed description, including a Q-matrix can be found at the ECD Wiki: http://ecd.ralmond.net/ecdwiki/ACED/ACED.
data(ACED)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.