glca: Fits Latent Class Models for Data Containing Group Variable...
In glca: An R Package for Multiple-Group Latent Class Analysis

View source: R/glca.R

glca	R Documentation

Fits Latent Class Models for Data Containing Group Variable and Covariates

Description

Function for fitting latent class models with multiple groups, which may or may not include latent class structure for group variable.

Usage

glca(
  formula,
  group = NULL,
  data = NULL,
  nclass = 3,
  ncluster = NULL,
  std.err = TRUE,
  measure.inv = TRUE,
  coeff.inv = TRUE,
  init.param = NULL,
  n.init = 10,
  decreasing = FALSE,
  testiter = 50,
  maxiter = 5000,
  eps = 1e-06,
  na.rm = FALSE,
  seed = NULL,
  verbose = TRUE
)

Arguments

`formula`	a formula for specifying manifest items and covariates using the "`item`" function.
`group`	an optional vector specifying a group of observations. Given group variable, group covariates can be incorporated.
`data`	a data frame containing the manifest item, covariates and group variable.
`nclass`	number of level-1 (individual-level) latent classes.
`ncluster`	number of level-2 (group-level) latent classes. When `group` and `ncluster` (>1) are given the multilevel latent class models will be fitted.
`std.err`	a logical value for whether calculating standard errors for estimates.
`measure.inv`	a logical value of the measurement invariance assumption across groups.
`coeff.inv`	a logical value of the coefficient invariance assumption across groups (random intercept model).
`init.param`	A set of model parameters to be used as the user-defined initial values for the EM algorithm. It should be `list` with the named parameters and have same structure of `param` of the `glca` output. In default, initial parameters are randomly generated.
`n.init`	number of randomly generated initial parameter sets to be used for avoiding the problem of local maxima.
`decreasing`	a logical value for whether reordering the parameters by descending order responding probability for first-category of first manifest item.
`testiter`	number of iterations in the EM algorithm for each initial parameter set. The initial parameter set that provides the largest log-likelihood will be selected for estimating the model.
`maxiter`	maximum number of iterations for the EM algorithm.
`eps`	a convergence tolerance value. When the largest absolute difference between former estimates and current estimates is less than `eps`, the algorithm will stop updating and consider the convergence to be reached.
`na.rm`	a logical value for deleting the lines that have at least one missing manifest item. If `na.rm = FALSE`, MAR procedure will be conducted.
`seed`	In default, the set of initial parameters is drawn randomly. As the same value for seed guarantees the same initial parameters to be drawn, this argument can be used for reproducibility of estimation results.
`verbose`	a logical value indicating whether `glca` should print the estimation procedure onto the screen.

Details

The glca is the function for implementing LCA consist of two-type latent categorical variables (i.e., level-1 and level-2 latent class). The level-1 (individual-level) latent class is identified by the association among the individuals' responses to multiple manifest items, but level-2 (group-level) latent class is categorized by the prevalence of level-1 latent class for group variable. The function glca can handle two types of covariates: level-1 and level-2 covariates. If covariates vary across individuals, they are considered as level-1 covariates. When group and ncluster (>1) are given, covariates which are varying across groups are considered as level-2 covariates. Both types of covariates have effect on level-1 class prevalence.

The formula should consist of an ~ operator between two sides. Manifest items should be indicated in LHS of formula using item function and covariates should be specified in RHS of formula. For example,
item(y1, y2, y3) ~ 1
item(y1, y2, y3) ~ x1 + x2
where the first fomula indicates LCA with three manifest variables (y1, y2, and y3) and no covariate, and the second formula includes two covariates (x1 and x2). Two types of covariates (i.e., level-1 and level-2 covariates) will be automatically detected by glca.

The estimated parameters in glca are rho, gamma, delta, and beta. The set of item response probabilities for each level-1 class is rho. The sets of prevalences for level-1 and level-2 class are gamma and delta, respectively. The prevalence for level-1 class (i.e., gamma) can be modeled as logistic regression using level-1 and/or level-2 covariates. The set of logistic regression coefficients is beta in glca output.

Value

glca returns an object of class "glca".

The function summary prints estimates for parameters and glca.gof function gives goodness of fit measures for the model.

An object of class "glca" is a list containing the following components:

`call`	the matched call.
`terms`	the `terms` object used.
`model`	a `list` of model description.
`var.names`	a `list` of names of data.
`datalist`	a `list` of data used for fitting.
`param`	a `list` of parameter estimates.
`std.err`	a `list` of standard errors for estimates.
`coefficient`	a `list` of logistic regression coefficients for prevalence of level-1 class.
`posterior`	a `data.frame` or a `list` of posterior probablities of each individaul for latent classes and each group for latent clusters.
`gof`	a `list` of goodness of fit measures.
`convergence`	a `list` containing information about convergence.

References

Vermunt, J.K. (2003) Multilevel latent class models. Sociological Methodology, 33, 213–239. \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1111/j.0081-1750.2003.t01-1-00131.x")}

Collins, L.M. and Lanza, S.T. (2009) Latent Class and Latent Transition Analysis: With Applications in the Social, Behavioral, and Health Sciences. John Wiley & Sons Inc.

Examples

##
## Example 1. GSS dataset
##
data("gss08")
# LCA
lca = glca(item(DEFECT, HLTH, RAPE, POOR, SINGLE, NOMORE) ~ 1,
            data = gss08, nclass = 3, n.init = 1)
summary(lca)

# LCA with covariate(s)
lcr = glca(item(DEFECT, HLTH, RAPE, POOR, SINGLE, NOMORE) ~ AGE,
           data = gss08, nclass = 3, n.init = 1)
summary(lcr)
coef(lcr)


# Multiple-group LCA (MGLCA)
mglca = glca(item(DEFECT, HLTH, RAPE, POOR, SINGLE, NOMORE) ~ 1,
             group = DEGREE, data = gss08, nclass = 3, n.init = 1)
summary(mglca)

# Multiple-group LCA with covariate(s) (MGLCR)
mglcr = glca(item(DEFECT, HLTH, RAPE, POOR, SINGLE, NOMORE) ~ SEX,
             group = DEGREE, data = gss08, nclass = 3, n.init = 1)
summary(mglcr)
coef(mglcr)


##
## Example 2. NYTS dataset
##
data("nyts18")
# Multilevel LCA (MLCA)
mlca = glca(item(ECIGT, ECIGAR, ESLT, EELCIGT, EHOOKAH) ~ 1,
            group = SCH_ID, data = nyts18, nclass = 3, ncluster = 2, n.init = 1)
summary(mlca)

# MLCA with covariate(s) (MLCR)
# (SEX: level-1 covariate, SCH_LEV: level-2 covariate)
mlcr = glca(item(ECIGT, ECIGAR, ESLT, EELCIGT, EHOOKAH) ~ SEX + SCH_LEV,
            group = SCH_ID, data = nyts18, nclass = 3, ncluster = 2, n.init = 1)
coef(mlcr)

glca documentation built on Sept. 12, 2024, 7:40 a.m.