irt_simulate: Run an IRT Monte Carlo Simulation
In irtsim: Monte Carlo Simulation-Based Sample-Size Planning for Item Response Theory

irt_simulate

R Documentation

Run an IRT Monte Carlo Simulation

Description

Execute a Monte Carlo simulation study based on an irt_study specification. For each iteration and sample size, data are generated, missing values applied, the IRT model is fitted, and parameter estimates are extracted and stored.

Usage

irt_simulate(
  study,
  iterations,
  seed,
  progress = TRUE,
  parallel = FALSE,
  se = TRUE,
  compute_theta = TRUE
)

Arguments

`study`	An irt_study object specifying the design and study conditions.
`iterations`	Positive integer. Number of Monte Carlo replications.
`seed`	Integer. Base random seed for reproducibility. Each iteration uses `seed + iteration - 1`.
`progress`	Logical. Print progress messages? Default `TRUE`.
`parallel`	Logical. Run iterations in parallel using `future.apply::future_lapply()`? Default `FALSE`. Requires users to set up a future plan (e.g., `future::plan(multisession)`) before calling. See Details.
`se`	Logical. Compute standard errors and confidence intervals for item parameter estimates? Default `TRUE`. Set to `FALSE` for significant speed improvement when only point estimates are needed (e.g., MSE, bias, RMSE criteria). When `FALSE`, `se`/`ci_lower`/`ci_upper` columns in `item_results` are `NA`.
`compute_theta`	Logical. Compute EAP theta estimates and recovery metrics (correlation, RMSE)? Default `TRUE`. Set to `FALSE` to skip the `mirt::fscores()` call when theta recovery is not needed. When `FALSE`, `theta_cor` and `theta_rmse` in `theta_results` are `NA` (but `converged` is still tracked).

Details

The returned irt_results object stores raw per-iteration estimates. Use summary.irt_results() to compute performance criteria (bias, MSE, RMSE, coverage, etc.) and plot.irt_results() to visualize results.

Parallelization

When parallel = TRUE, the Monte Carlo loop over iterations is parallelized via future.apply::future_lapply(). Each parallel task processes one iteration across all sample sizes sequentially.

Important: This function does NOT configure a future plan. Users must set their own plan before calling with parallel = TRUE:

library(future)
plan(multisession, workers = 4)  # or your preferred backend
results <- irt_simulate(study, iterations = 100, seed = 42, parallel = TRUE)

Without an explicit plan, future defaults to sequential execution (no parallelism).

Reproducibility contract

Reproducibility is guaranteed within a given dispatch mode, not across modes:

Serial mode (parallel = FALSE) uses deterministic per-cell seeds under the session's default RNG kind (Mersenne-Twister). Re-running with the same base seed reproduces identical results bit-for-bit.
Parallel mode (parallel = TRUE) delegates RNG management to future.apply::future_lapply(..., future.seed = TRUE), which assigns each iteration a formally independent L'Ecuyer-CMRG substream. Re-running with the same base seed reproduces identical results bit-for-bit across parallel runs, including across different worker counts.
Across modes, numerical results will differ because the two paths use different RNG algorithms and different seeding strategies. Both are statistically valid; the parallel path has the stronger formal guarantee of independent substreams, which is the standard for Monte Carlo work.

Progress messages are suppressed in parallel mode (workers cannot stream to stdout safely). Set progress = FALSE in serial mode to suppress messages (they appear every 10% of iterations).

Value

An S3 object of class irt_results containing:

item_results: Data frame with per-iteration item parameter estimates (columns: iteration, sample_size, item, param, true_value, estimate, se, ci_lower, ci_upper, converged).
theta_results: Data frame with per-iteration theta recovery summaries (columns: iteration, sample_size, theta_cor, theta_rmse, converged).
study: The original irt_study object.
iterations: Number of replications run.
seed: Base seed used.
elapsed: Elapsed wall-clock time in seconds.
se: Logical flag indicating whether SEs and CIs were computed.
compute_theta: Logical flag indicating whether theta recovery metrics were computed.

Examples


# Minimal example (iterations and sample sizes reduced for speed;
# use iterations >= 100 and 3+ sample sizes in practice)
design <- irt_design(
  model = "1PL", n_items = 5,
  item_params = list(b = seq(-2, 2, length.out = 5))
)
study <- irt_study(design, sample_sizes = c(200, 500))
results <- irt_simulate(study, iterations = 10, seed = 42)
summary(results)
plot(results)

irtsim documentation built on April 24, 2026, 1:07 a.m.