| scoring_bias_test | R Documentation |
For each item, computes the change in item intercept from human to AI scoring within each group, then tests whether this scoring shift differs significantly across groups. A significant result indicates the AI scoring engine introduces a group-dependent parameter distortion — i.e., the AI does not merely re-scale all items uniformly but disfavours (or favours) one group at specific items.
scoring_bias_test(human_mle, ai_mle, fun = "d_fun3")
human_mle |
Output of |
ai_mle |
Output of |
fun |
Scaling function (passed to
the internal scaling function) to use when normalising shifts.
Default: |
Estimand. Define the scoring shift in group g for item
i threshold j as:
\delta_{igj} = d_{igj}^{\text{AI}} - d_{igj}^{\text{Human}}
The DASB is \delta_{i2j} - \delta_{i1j}. Under
H_0: \text{DASB}_{ij} = 0 and independence across scoring
conditions and groups,
\widehat{\mathrm{Var}}(\text{DASB}_{ij}) =
(\sigma_{i1j}^{H})^2 + (\sigma_{i2j}^{H})^2 +
(\sigma_{i1j}^{AI})^2 + (\sigma_{i2j}^{AI})^2
where each \sigma^2 is the diagonal element of the corresponding
group-specific covariance matrix.
A data.frame with one row per item (per threshold for
polytomous items) and columns:
shift_g1Scoring shift \delta_{i1} = d_{i1}^{AI} - d_{i1}^{H}.
shift_g2Scoring shift \delta_{i2} = d_{i2}^{AI} - d_{i2}^{H}.
DASBDifferential AI Scoring Bias: \delta_{i2} - \delta_{i1}.
seStandard error of DASB under the delta method.
zWald z-statistic.
p_valTwo-tailed p-value.
fit_aidif, ai_effect_summary
eg <- make_aidif_eg()
scoring_bias_test(eg$human, eg$ai)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.