logistic4p: Logistic Regressions with Misclassification Correction

Description Usage Arguments Details Value Author(s) References Examples

Description

logistic4p is used to fit logistic regressions with correction of the misclassifications in the binary dependent variable. It is specified by

Usage

1
2
logistic4p(x, y, initial, model = c("lg", "fp.fn", "fp", "fn", "equal"),
max.iter = 1000, epsilon = 1e-06, detail = FALSE)

Arguments

x, y

x is a data frame or data matrix containing the predictor variables and y is the vector of outcomes. The number of rows in x must be the same as the length of y.

initial

starting values for the parameters in the model(FP,FN misclassification parameters and those in the linear predictor); if not specified, the default initials are 0 for the misclassification parameters and estimates obtained from the logistic regression for the parameters in the linear predictor.

model

a character string specifying the model to be used in the analysis.

Currently available options are "lg" (logistic regression), "fp.fn" (logistic regression with both FP and FN parameters), "fp" (logistic regression with the FP parameter), "fn" (logistic regression with the FN parameter), "equal" (logistic regression with FN=FN).

If it is not specified, the default one ('lg') will be used.

max.iter

a positive integer giving the maximal number of iterations; if it is reached, the algorithm will stop.

epsilon

a positive convergence tolerance epsilon; the iterations converge when max(|par-par_old|)<epsilon.

detail

logical indicating if the itermediate output should be printed after each iteration.

Details

This package implements the logistic regressions with misclassification corrections. There are five different models which can be specified by 'model'.

In the specification, x is a matrix of data frame of predictors fitted to the model; y is a numeric vector taking either 0 or 1.

The 'initial' is the vector of starting values for both misclassification and regression coefficients parameters in the model. It is suggested to provide 'initial', however if not, the default one will be used.

For the background to warning messages about 'fitted probabilities numerically 0 or 1 occurred', when the fitted probabilities of some individuals are either 0 or 1.

The package cannot handle missing data problems currently. If there are missing values in either x or y, there will be warning message.

Value

logistic4p returns a list of values inheriting from "logistic4p".

estimates

a named matrix of estimates including parameter estimates, standard errors, z-scores, and p-values.

n.iter

an integer giving the number of iteration used

d

the actual max absolute difference of the parameters of the last two iterations, d=max(|par.final-par_old|).

loglike

loglikelihood evaluated at the parameter estimates.

AIC

Akaike Information Criterion.

BIC

Bayesian Information Criterion.

converged

logical indicating whether the current procedure converged or not.

Author(s)

Haiyan Liu and Zhiyong Zhang

References

Liu, H. and Zhang, Z. (2016) Logistic Regression with Misclassification in Dependent Variables: Method and Software.(In preparation.)

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
## Not run:  
data(nlsy)
y=nlsy[, 1]
x=nlsy[,-1]

mod1=logistic4p(x,y)
mod1
mod1$estimates

mod2=logistic4p(x,y, model='fp.fn')

mod3=logistic4p(x,y, model='fn')

## End(Not run)

logistic4p documentation built on May 2, 2019, 2:18 a.m.