Ht: Ht person-fit statistic




Sijtsma's Ht person-fit statistic.


Ht(matrix,
   NA.method = "Pairwise", Save.MatImp = FALSE,
   IP = NULL, IRT.PModel = "2PL", Ability = NULL, Ability.PModel = "ML",
   mu = 0, sigma = 1)



matrix: Data matrix of dichotomous item scores, with persons as rows and items as columns. Item scores are either 0 or 1; missing values are allowed.


NA.method: Method to deal with missing values. The default is pairwise elimination ("Pairwise"). Alternatively, three simple imputation methods are available: "Hotdeck", "NPModel", and "PModel".


Save.MatImp: Logical. Save the (imputed) data matrix to file? Default is FALSE.


IP: Matrix with previously estimated item parameters: one row per item and three columns ([,1] item discrimination; [,2] item difficulty; [,3] lower asymptote, also referred to as the pseudo-guessing parameter).

If no item parameters are available, use IP = NULL.


IRT.PModel: The IRT model used to estimate the item parameters (only if IP = NULL). The options available are "1PL", "2PL" (default), and "3PL".


Ability: Vector with previously estimated latent ability parameters, one per respondent, following the row order of matrix.

If no ability parameters are available, use Ability = NULL.


Ability.PModel: The method used to estimate the latent ability parameters (only if Ability = NULL). The options available are "ML" (default), "BM", and "WL".


mu: Mean of the a priori distribution. Only used when Ability.PModel = "BM". Default is 0.


sigma: Standard deviation of the a priori distribution. Only used when Ability.PModel = "BM". Default is 1.


Sijtsma (1986) adapted a statistic introduced by Mokken (1971), which was originally designed to assess the scalability of an item with respect to the Guttman (1944, 1950) model. Sijtsma applied the same statistic to the transposed data in order to detect respondents who do not comply with the Guttman model. Assume, without loss of generality, that the rows of the data matrix are sorted in increasing order of total score Sn (n = 1,..., N). The statistic is given by the ratio

Ht=Cov(Xn,R(n)) / Cov.max(Xn,R(n)),

where R(n) is the vector of total item scores computed excluding respondent n, and the denominator is the maximum covariance given the marginals. Hence, Ht is similar to Sato's C.Sato statistic.

Ht attains its maximum value of 1 for respondent n when no respondent with a total score smaller than Sn answers correctly an item that respondent n answered incorrectly, and no respondent with a total score larger than Sn answers incorrectly an item that respondent n answered correctly. Ht equals zero when the average covariance of the response pattern of respondent n with the other response patterns equals zero. Hence, (potentially) aberrant response behavior is indicated by small values of Ht (i.e., in the left tail of the sampling distribution). The Ht statistic was shown to perform relatively well in several simulation studies (Karabatsos, 2003; Sijtsma, 1986; Sijtsma and Meijer, 1992; Tendeiro and Meijer, 2014).
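As a rough illustration of the ratio above, the following base-R sketch computes Ht from pairwise covariances between respondents. It is not the PerFit implementation: the function name ht_sketch is hypothetical, and the sketch assumes a complete 0/1 matrix (no missing values) from which all-0s and all-1s response vectors have already been removed. It uses the fact that Cov(Xn, R(n)) is the sum of the covariances between respondent n and every other respondent, and it takes the maximum covariance of a pair, given their proportions correct, to be the smaller proportion minus the product of the two.

```r
# Hypothetical helper, not part of PerFit: Ht for each row of a complete
# 0/1 matrix that contains no all-0s or all-1s response vectors.
ht_sketch <- function(x) {
  x <- as.matrix(x)
  n_items <- ncol(x)
  p <- rowMeans(x)                       # proportion correct per respondent
  joint <- tcrossprod(x) / n_items       # proportion of items both answered correctly
  cov_obs <- joint - outer(p, p)         # covariance between each pair of respondents
  cov_max <- outer(p, p, pmin) - outer(p, p)  # maximum covariance given the marginals
  diag(cov_obs) <- 0                     # exclude each respondent's pairing with itself
  diag(cov_max) <- 0
  rowSums(cov_obs) / rowSums(cov_max)
}

# A perfect Guttman pattern: every respondent reaches the maximum value of 1.
guttman <- rbind(c(1, 0, 0, 0),
                 c(1, 1, 0, 0),
                 c(1, 1, 1, 0))
ht_guttman <- ht_sketch(guttman)

# Adding a respondent who only answers the hardest item correctly
# yields a negative Ht for that respondent (last row).
aberrant <- rbind(guttman, c(0, 0, 0, 1))
ht_aberrant <- ht_sketch(aberrant)
```

Because numerator and denominator use the same divisor, the ratio is the same whether population or sample covariances are used.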

Missing values in matrix are dealt with by means of pairwise elimination by default. Alternatively, single imputation is also available. Three single imputation methods exist: Hotdeck imputation (NA.method = "Hotdeck"), nonparametric model imputation (NA.method = "NPModel"), and parametric model imputation (NA.method = "PModel"); see Zhang and Walker (2008).

  • Hotdeck imputation replaces missing responses of an examinee ('recipient') by item scores from the examinee who is closest to the recipient ('donor'), based on the recipient's nonmissing item scores. The similarity between nonmissing item scores of recipients and donors is based on the sum of absolute differences between the corresponding item scores. The donor's response pattern is deemed to be the most similar to the recipient's response pattern in the group, so item scores of the former are used to replace the corresponding missing values of the latter. When multiple donors are equidistant to a recipient, one donor is randomly drawn from the set of all such donors.

  • The nonparametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities defined by donors whose total score is similar to that of the recipient (based on all items except the NAs).

  • The parametric model imputation method is similar to the hotdeck imputation, but item scores are generated from Bernoulli distributions with probabilities estimated by means of parametric IRT models (IRT.PModel = "1PL", "2PL", or "3PL"). Item parameters (IP) and ability parameters (Ability) may be provided for this purpose (otherwise the algorithm finds estimates for these parameters).
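The hotdeck step described in the first bullet can be sketched in base R as follows. This is only an illustration under stated assumptions, not the code PerFit runs: the function name hotdeck_impute_row is hypothetical, and donors are assumed to be the respondents with complete response vectors (at least one such donor must exist).

```r
# Hypothetical helper, not part of PerFit: impute the missing item scores of
# row i of a 0/1 matrix x from the nearest complete-case donor.
hotdeck_impute_row <- function(x, i) {
  miss <- is.na(x[i, ])
  if (!any(miss)) return(x[i, ])         # nothing to impute
  # Assumption: only respondents without missing values can act as donors.
  donors <- setdiff(which(stats::complete.cases(x)), i)
  # Sum of absolute differences on the recipient's nonmissing items.
  diffs <- sweep(x[donors, !miss, drop = FALSE], 2, x[i, !miss])
  dist <- rowSums(abs(diffs))
  nearest <- donors[dist == min(dist)]
  # Break ties at random; the guard avoids sample() treating a single
  # index k as the range 1:k.
  donor <- if (length(nearest) > 1) sample(nearest, 1) else nearest
  out <- x[i, ]
  out[miss] <- x[donor, miss]
  out
}

scores <- rbind(c(1, 1, 0, NA),   # recipient: last item score is missing
                c(1, 1, 0, 1),    # distance 0 from the recipient
                c(0, 0, 1, 0))    # distance 3 from the recipient
imputed <- hotdeck_impute_row(scores, 1)
```

Here the second respondent is the unique nearest donor, so the recipient's missing score is replaced by that respondent's score on the last item.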


An object of class "PerFit", which is a list with 12 elements:


A list of length N (number of respondents) with the values of the person-fit statistic.


The person-fit statistic used.


A message indicating whether perfect response vectors (all-0s or all-1s) were removed from the analysis.


Row indices of all-0s response vectors removed from the analysis (if applicable).


Row indices of all-1s response vectors removed from the analysis (if applicable).


The data matrix after imputation of missing values was performed (if applicable).


The number of response categories (2 in this case).


The parametric IRT model used in case NA.method="PModel", otherwise NULL.


The I x 3 matrix of estimated item parameters (I = number of items) in case NA.method="PModel", otherwise NULL.


The method used to estimate abilities in case NA.method="PModel", otherwise NULL.


The vector of N estimated ability parameters in case NA.method="PModel", otherwise NULL.


The imputation method used (if applicable).


Jorge N. Tendeiro


Guttman, L. (1944) A basis for scaling qualitative data. American Sociological Review, 9, 139–150.

Guttman, L. (1950) The basis for scalogram analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeld, S. A. Star & J. A. Clausen (Eds.), Measurement and prediction (pp. 60–90). Princeton NJ: Princeton University Press.

Karabatsos, G. (2003) Comparing the aberrant response detection performance of thirty-six person-fit statistics. Applied Measurement in Education, 16(4), 277–298.

Mokken, R. J. (1971) A theory and procedure of scale analysis. Berlin, Germany: De Gruyter.

Sijtsma, K. (1986) A coefficient of deviance of response patterns. Kwantitatieve Methoden, 7, 131–145.

Sijtsma, K., and Meijer, R. R. (1992) A method for investigating the intersection of item response functions in Mokken's nonparametric IRT model. Applied Psychological Measurement, 16(2), 149–157.

Tendeiro, J. N., and Meijer, R. R. (2014) Detection of invalid test scores: The usefulness of simple nonparametric statistics. Journal of Educational Measurement, 51(3), 239–259.

Zhang, B., and Walker, C. M. (2008) Impact of missing data on person-model fit and person trait estimation. Applied Psychological Measurement, 32(6), 466–479.

See Also



# Load the inadequacy scale data (dichotomous item scores):
library(PerFit)
data(InadequacyData)

# Compute the Ht scores:
Ht.out <- Ht(InadequacyData)

PerFit documentation built on May 21, 2017, 4:04 a.m.
