Number of Guttman errors

Description

van der Flier's statistics based on the number of Guttman errors.

Usage

1
2
3
4
5
6
7
8
9
G(matrix,
  NA.method = "Pairwise", Save.MatImp = FALSE, 
  IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
  mu = 0, sigma = 1)

Gnormed(matrix,
        NA.method = "Pairwise", Save.MatImp = FALSE,
        IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
        mu = 0, sigma = 1)

Arguments

matrix

Data matrix of dichotomous item scores: Persons as rows, items as columns, item scores are either 0 or 1, missing values allowed.

NA.method

Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, simple imputation methods are also available. The options available are "Hotdeck", "NPModel" (default), and "PModel".

Save.MatImp

Logical. Save (imputted) data matrix to file? Default is FALSE.

IP

Matrix with previously estimated item parameters: One row per item, and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower-asymptote, also referred to as pseudo-guessing parameter).

In case no item parameters are available then IP=NULL.

IRT.PModel

Specify the IRT model to use in order to estimate the item parameters (only if IP=NULL). The options available are "1PL", "2PL" (default), and "3PL".

Ability

Vector with previoulsy estimated latent ability parameters, one per respondent, following the order of the row index of matrix.

In case no ability parameters are available then Ability=NULL.

Ability.PModel

Specify the method to use in order to estimate the latent ability parameters (only if Ability=NULL). The options available are "ML" (default), "BM", and "WL".

mu

Mean of the apriori distribution. Only used when method="BM". Default is 0.

sigma

Standard deviation of the apriori distribution. Only used when method="BM". Default is 1.

Details

Consider the items' proportion-correct scores, p1,p2,...,pI (I = number of items). A Guttman error consists of an item score pair (Xi,Xj)=(0,1) with pi>pj. Hence, there is a Guttman error when an easier item is answered incorrectly and a more difficult item is answered correctly.

G counts the number of (0,1) pairs given that the items are ordered in decreasing proportion-correct scores order. However, G depends on the total number of items for a given number-correct score. In particular, for a number-correct S, G has maximum equal to S(I-S). Gnormed was created to bound G between 0 and 1 by dividing it by its maximum (conditional on the number-correct score). Hence, (potentially) aberrant response behavior is indicated by large values of G/Gnormed (i.e., in the right tail of the sampling distribution).

Gnormed is perfectly linearly related to Tatsuoka and Tatsuoka's (1982, 1983) NCI statistic (NCI = 1-2Gnormed).

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee which is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors with similar total score than the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).

Value

An object of class "PerFit", which is a list with 12 elements:

PFscores

A list of length N (number of respondents) with the values of the person-fit statistic.

PFstatistic

The person-fit statistic used.

PerfVects

A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.

ID.all0s

Row indices of all-0s response vectors removed from the analysis (if applicable).

ID.all1s

Row indices of all-1s response vectors removed from the analysis (if applicable).

matrix

The data matrix after imputation of missing values was performed (if applicable).

Ncat

The number of response categories (2 in this case).

IRT.PModel

The parametric IRT model used in case NA.method="PModel", otherwise NULL.

IP

The Ix3 matrix of estimated item parameters in case NA.method="PModel", otherwise NULL.

Ability.PModel

The method used to estimate abilities in case NA.method="PModel", otherwise NULL.

Ability

The vector of N estimated ability parameters in case NA.method="PModel", otherwise NULL.

NAs.method

The imputation method used (if applicable).

Author(s)

Jorge N. Tendeiro j.n.tendeiro@rug.nl

References

Karabatsos, G. (2003) Comparing the Aberrant Response Detection Performance of Thirty-Six Person-Fit Statistics. Applied Measurement In Education, 16(4), 277–298.

Meijer, R. R. (1994) The number of Guttman errors as a simple and powerful person-fit statistic. Applied Psychological Measurement, 18(4), 311–314.

Meijer, R. R., and Sijtsma, K. (2001) Methodology review: Evaluating person fit. Applied Psychological Measurement, 25(2), 107–135.

Tatsuoka, K. K., and Tatsuoka, M. M. (1982) Detection of aberrant response patterns and their effect on dimensionality. Journal of Educational Statistics, 7(3), 215–231.

Tatsuoka, K. K., and Tatsuoka, M. M. (1983) Spotting erroneous rules of operation by the individual consistency index. Journal of Educational Measurement, 20(3), 221–230.

van der Flier, H. (1977) Environmental factors and deviant response patterns. In Y. H. Poortinga (Ed.), Basic problems in cross-cultural psychology. Amsterdam: The Netherlands.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also

NCI

Examples

1
2
3
4
5
6
7
8
# Load the inadequacy scale data (dichotomous item scores):
data(InadequacyData)

# Compute the G scores:
G.out <- G(InadequacyData)

# Compute the Gnormed scores:
Gnormed.out <- Gnormed(InadequacyData)