library(lcmm)
knitr::opts_chunk$set(
  collapse = TRUE,
  comment = "#>"

)

 

We list here some recurrent problems reported by users. Other issues, questions, concerns may have been reported in github: https://github.com/CecileProust-Lima/lcmm/issues/ .

Please refer to github, both closed and opened issues, before sending any question. And please ask the questions via github only.

 

Why my model did not converge?

It sometimes happens that a model does not converge correctly. This is due most of the time to the very stringent criterion based on the "derivatives". This criterion uses both the first derivatives and the inverse of the matrix of the second derivatives (Hessian). It ensures that the program converges at a maximum. When it can't be computed correctly (most of the time because the Hessian is not definite positive), the program reports "second derivatives = 1".

There are several reasons that may induce a non convergence, e.g.:

  1. When the time variable (or more generally a variable with random effects) is in a unit which induces too small associated parameters (for very small changes per day). In that case, changing the scale (for instance with months or years) may solve the problem.

  2. In models with splines in the link function (lcmm, multlcmm, Jointlcmm, mpjlcmm) or with splines in the baseline risk function (Jointlcmm, mpjlcmm), a parameter associated with splines very close to zero may prevent for correct convergence as it is at the border of the parameter space. In that case, this parameter can be fixed to 0 and convergence should be reached immediately.

  3. When the data are not rich enough and/or the model is too complicated. In that case, this is a problem of numerical non identifiability. There are not many solutions (other than simplifying the model) but some directions may be :

    • Be patient, it happens that after some iterations the derivatives might be invertible and the model converges (you can change maxiter or rerun the program from the estimates at the non convergence point). But this is usually no necessary to specify more than 100 or 200 iterations. The iterative algorithm is used to converge in a few dozen of iterations.
    • You can try to assume a less stringent threshold (e.g., 0.01) but be careful, convergence might be of lower quality.
    • Run the model from different initial values.

 

How to choose the number of latent classes?

Selection of the number of latent classes is a complex question. In some cases, the number is known. When not, different tools can be used to guide the decision:

Finally, it can be useful to present and contrast models with different numbers of latent classes.

The complexity of the selection of the optimal number of latent classes is illustrated in the hlme vignette. Indeed, all the criteria may not be concordant in practice.

 

How to evaluate the quality of a classification?

Good discrimination of classes is usually sought when fitting latent class mixed models. Discriminatory power can be assessed using the entropy criterion (provided in summarytable) but also using the classification table (with command postprob). The description of the classes may also help comprehend the latent class structure.

(see hlme vignette for further details)

 

How to evaluate the fit of a model?

Different techniques can be used in this package to evaluate the goodness of fit. As in mixed models, one can compare the subject-specific predictions with the observations or plot the subject-specific residuals.

The comparison with more flexible models can also be useful (more flexible link functions, more flexible baseline risk functions, more flexible functions of time, etc.)

Each vignette includes a section on the evaluation of the model.

 

How to pre-normalize a variable using lcmm?

This is detailed in vignette on pre-normalizing.

 

How to change the latent class in reference?

The order of the latent classes can be changed in any function (hlme, lcmm, Jointlcmm, mpjlcmm) using the permut function. Here is an example with the estimation of a two class linear mixed model:

mhlme <- hlme(IST ~ I(age-age_init),random=~ I(age-age_init),subject="ID",data=paquid)
set.seed(1234)
mhlme2 <- hlme(IST ~ I(age-age_init),random=~ I(age-age_init),subject="ID",data=paquid,ng=2,
               mixture=~ I(age-age_init),classmb =~ CEP , B=random(mhlme))
summary(mhlme2)

The order of the latent classes can be changed by running:

mhlme2perm <- permut(mhlme2, order=c(2,1))
summary(mhlme2)

The models in objects mhlme2 and mhlme2perm are the same except for the permutation of the classes as shown with the cross-table:

xclass(mhlme2perm,mhlme2)

 

How to make predictions on external data?

An object stemmed from a estimation function of lcmm package (hlme, lcmm, Jointlcmm, mpjlcmm) provides predictions on the data on which the model was estimated.

Different functions allows the same type of computations but on external data:

Classification with PredictClass

predictClass function computes the posterior classification and the posterior class-membership probabilities from any latent class model estimated in package lcmm.

For instance, using the 2-class model estimated above, the posterior probabilities and classification can be computed for the newdata, here the data of the second subject of paquid dataset:

predictClass(mhlme2, newdata=paquid[2:6,])

 

Random-effects with predictRE

predictRE function computes the predicted random-effects of any model estimated within lcmm package for a new subject whose data (i.e., covariates and outcomes) are provided in newdata:

predictRE(mhlme2, newdata=paquid[2:6,])

By default, the random-effects are aggregated over the latent classes (by a weighted sum using the posterior probabilities). The argument classpredRE = TRUE can be used to get class-specific random-effects:

predictRE(mhlme2, newdata=paquid[2:6,], classpredRE = TRUE)

 

Subject-specific predictions using predictY or predictL

For hlme models and Jointlcmm models without link function, subject-specific predictions can be obtained using predictY. This requires a first call to predictRE with classpredRE = TRUE to get the individual random-effects, which can then be passed to the predictY function using option predRE.

predRE_ID2 <- predictRE(mhlme2, paquid[2:6,], classpredRE = TRUE)
predictY(mhlme2, newdata = data.frame(age = seq(70, 85, by = 0.25), age_init = 70, CEP = 0), predRE = predRE_ID2)

For model defining a latent process (lcmm, multlcmm, Jointlcmm with a link function), subject-specific predictions can be obtained in latent process scale using predictL.

mlcmm <- lcmm(CESD ~ I(age - age_init), random = ~ I(age - age_init), subject = 'ID', data = paquid, link = '5-quant-splines')

predRE_ID3 <- predictRE(mlcmm, paquid[7:8,], classpredRE = TRUE)
predictL(mlcmm, newdata = data.frame(age = seq(70, 85, by = 0.25), age_init = 70), predRE = predRE_ID3)

Predictions of outcomes in their natural scale using predictYcond and predictYback

Predictions of the outcomes can be directly computed for a latent process value. This is useful when nonlinear link functions are used in the models. This is obtained with function predictYcond.

When an outcome is transformed and the transformed values are then included in a hlme model, predictions in the natural scale of the outcome can be obtained using the predictYback function. Only marginal predictions can be obtained in this setting.



CecileProust-Lima/lcmm documentation built on June 13, 2025, 8:43 a.m.