Selected.QIF: Function to Select a Type of Time-Dependent Covariate Through...

Selected.QIF    R Documentation

Function to Select a Type of Time-Dependent Covariate Through a Quadratic Inference Function Model

Description

Selects a type of time-dependent covariate through a marginal quantile regression model using the quadratic inference function (QIF) estimation method for longitudinal exposure data with values less than the limit of detection (LOD).

Usage

Selected.QIF(id, y, x, lod, substitue, corstr, beta, maxiter)

Arguments

id

A column matrix of subject IDs. The number of rows is the total number of observations. Data must be sorted by IDs.

y

A column matrix of the observed outcome values or responses.

x

A matrix of covariate values, for which the number of columns is the number of covariates.

lod

A numeric value giving the limit of detection (LOD).

substitue

A character string specifying the substitution approach, including "None", "LOD", "LOD2", "LODS2", "BetaMean", "BetaGM", "MIWithID", "MIWithIDRM", and "QQplot".

corstr

A character string specifying the working correlation structure, given by either "exchangeable" or "AR-1".

beta

A matrix of initial parameter estimates. These could come, for example, from a general linear model or from generalized estimating equations (GEE) with an independence working structure.

maxiter

The maximum number of iterations.
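
As an illustration of the two options for "corstr", the following base R sketch builds an exchangeable and an AR-1 working correlation matrix by hand. The helper name "working_corr" and the values of "n" and "rho" are hypothetical and not part of marlod; the sketch only shows what each structure assumes about within-subject correlation, not how Selected.QIF uses it internally.

```r
## Illustrative only: builds the two working correlation matrices by hand.
## working_corr, n, and rho are hypothetical and are not part of marlod.
working_corr <- function(n, rho, corstr = c("exchangeable", "AR-1")) {
  corstr <- match.arg(corstr)
  lag <- abs(outer(seq_len(n), seq_len(n), "-"))  # |j - k| for each pair of time points
  if (corstr == "exchangeable") {
    R <- matrix(rho, n, n)  # the same correlation for every pair of time points
    diag(R) <- 1
    R
  } else {
    rho^lag                 # correlation decays with the lag between time points
  }
}

working_corr(3, 0.5, "exchangeable")
working_corr(3, 0.5, "AR-1")
```

With three time points, the exchangeable matrix has the same off-diagonal value everywhere, while the AR-1 matrix has a smaller value for the pair of time points that are two steps apart.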

Details

The function modifies the supplementary R function for QIF in Westgate (2014). Within the marginal modeling framework, Chen et al. (2024) incorporate fill-in methods, including single and multiple value imputation techniques, so that any measurement less than the limit of detection (LOD) is assigned a value. Based on Chen and Westgate (2017, 2019), this function also allows an empirical MSE minimization criterion (EMMC) to be used to select a working type of time-dependent covariate.

See the Details of the "Fillin" function for an introduction to the available fill-in or substitution methods. The multiple random value imputation technique provides an alternative for environmental exposure and biomonitoring data with non-detects: the imputed values are generated from a regression of an exposure measurement on covariate(s) ("MIWithID" and "MIWithIDRM") (Lubin et al., 2004). "MIWithID" uses the subject identification (ID) as the covariate, e.g., "id" in "simdata58", while "MIWithIDRM" uses both the ID and the order of the observation within the cluster (time point) as covariates, e.g., "id" and "visit" in "simdata58". Note that the function "impute.boot" and its corresponding functions, which are used to apply the multiple random value imputation, are from the package "miWQS" (version 0.4.4). Please cite "miWQS" when publishing results obtained with "MIWithID" or "MIWithIDRM".
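
To make the idea of single value substitution concrete, here is a minimal base R sketch. The helper "simple_fill" is hypothetical and is not a marlod function. Reading "LOD2" as LOD/2 and "LODS2" as LOD/sqrt(2) is an assumption based on the option names; the "Fillin" documentation remains the authoritative description of each method.

```r
## Illustrative only: single-value substitution for values below the LOD.
## simple_fill is a hypothetical helper, not a marlod function. Reading "LOD2"
## as LOD/2 and "LODS2" as LOD/sqrt(2) is an assumption based on the names.
simple_fill <- function(v, lod, method = c("LOD", "LOD2", "LODS2")) {
  method <- match.arg(method)
  fill <- switch(method,
                 LOD   = lod,
                 LOD2  = lod / 2,
                 LODS2 = lod / sqrt(2))
  v[!is.na(v) & v < lod] <- fill  # replace only the non-detects
  v
}

v <- c(0.2, 0.7, 0.4, 1.3)        # made-up measurements
simple_fill(v, lod = 0.5, method = "LOD2")
```

Values at or above the LOD are left unchanged; only the two measurements below 0.5 receive the substituted value.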

Value

An object of class "Selected.QIF" representing the fit.

Note

The function can analyze one measurement or multiple repeated measurements per subject. Unbalanced repeated measurements are also permitted.
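
Because the data must be sorted by ID and cluster sizes may differ, a quick check before fitting can help. The sketch below uses a made-up long-format data frame (the object "dat" and its values are hypothetical) to sort by subject and visit, count observations per subject, and form the column matrices that the arguments expect.

```r
## Illustrative only: toy long-format data with unbalanced clusters.
## The data frame and its values are made up for this sketch.
dat <- data.frame(id    = c(2, 1, 2, 3, 1, 2),
                  visit = c(1, 1, 2, 1, 2, 3),
                  y     = c(0.9, 0.3, 1.1, 0.6, 0.4, 1.4))

dat <- dat[order(dat$id, dat$visit), ]  # sort by subject, then by visit
cluster_sizes <- table(dat$id)          # observations per subject (may differ)

id <- as.matrix(dat$id)                 # column matrices, one row per observation
y  <- as.matrix(dat$y)

stopifnot(!is.unsorted(id[, 1]))        # IDs are now in non-decreasing order
cluster_sizes
```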

Author(s)

Philip M. Westgate and I-Chen Chen

References

Chen, I-C., Bertke, S. J., Estill, C. F. (2024). Compare the Marginal Effects for Environmental Exposure and Biomonitoring Data with Repeated Measurements and Values Below the Limit of Detection. Journal of Exposure Science and Environmental Epidemiology. doi:10.1038/s41370-024-00640-7

Chen, I-C., Westgate, P. M. (2017). Improved methods for the marginal analysis of longitudinal data in the presence of time-dependent covariates. Statistics in Medicine, 36, 2533–46.

Chen, I-C., Westgate, P. M. (2019). A novel approach to selecting classification types for time-dependent covariates in the marginal analysis of longitudinal data. Statistical Methods in Medical Research, 28, 3176–86.

Lubin, J. H., Colt, J. S., Camann, D., et al. (2004). Epidemiologic evaluation of measurement data in the presence of detection limits. Environmental Health Perspectives, 112, 1691–6.

Westgate, P. M. (2014). Criterion for the simultaneous selection of a working correlation structure and either generalized estimating equations or the quadratic inference function approach. Biometrical Journal, 56, 461–476.

See Also

Modified.QIF, MQIF.

Examples

## Uses the simdata58 data set to run the marginal models.
library(marlod)
library(MASS)

data(simdata58)

id=as.matrix(as.vector(t(simdata58$id)))
y=as.matrix(as.vector(t(simdata58$y)))
x1=as.matrix(as.vector(t(simdata58$x1)))

## LOD=0.5 is equivalent to detection proportion=50.7% (censoring proportion=49.3%).
lod=0.5

## Gets initial estimates for the QIF approach from a model with an independence structure
initial=glm(y ~ x1, data=simdata58, family=gaussian)
beta_initial=as.matrix(initial$coefficients)

## The intercept is not included in "x1"
Selected.QIF(id, y, x1, lod, "None", "exchangeable", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "LOD", "AR-1", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "LOD2", "exchangeable", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "LODS2", "AR-1", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "BetaMean", "exchangeable", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "BetaGM", "AR-1", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "MIWithID", "exchangeable", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "MIWithIDRM", "AR-1", beta_initial, 1000)

Selected.QIF(id, y, x1, lod, "QQplot", "exchangeable", beta_initial, 1000)

marlod documentation built on June 8, 2025, 10:32 a.m.