prop.reg: Quasi binomial regression for proportions In Rfast: A Collection of Efficient and Extremely Fast R Functions

Description

Quasi binomial regression for proportions.

Usage

 1 2 prop.reg(y, x, varb = "quasi", tol = 1e-09, maxiters = 100) prop.regs(y, x, varb = "quasi", tol = 1e-09, logged = FALSE, maxiters = 100)

Arguments

 y A numerical vector proportions. 0s and 1s are allowed. x For the "prop.reg" a matrix with data, the predictor variables. This can be a matrix or a data frame. For the "prop.regs" this must be a numerical matrix, where each columns denotes a variable. tol The tolerance value to terminate the Newton-Raphson algorithm. This is set to 10^{-9} by default. varb The type of estimate to be used in order to estimate the covariance matrix of the regression coefficients. There are two options, either "quasi" (default value) or "glm". See the references for more information. logged Should the p-values be returned (FALSE) or their logarithm (TRUE)? maxiters The maximum number of iterations before the Newton-Raphson is terminated automatically.

Details

We are using the Newton-Raphson, but unlike R's built-in function "glm" we do no checks and no extra calculations, or whatever. Simply the model. The "prop.regs" is to be used for very many univariate regressions. The "x" is a matrix in this case and the significance of each variable (column of the matrix) is tested. The function accepts binary responses as well (0 or 1).

Value

For the "prop.reg" function a list including:

 iters The number of iterations required by the Newton-Raphson. varb The covariance matrix of the regression coefficients. phi The phi parameter is returned if the input argument "varb" was set to "glm", othwerise this is NULL. info A table similar to the one produced by "glm" with the estimated regression coefficients, their standard error, Wald test statistic and p-values.

For the "prop.regs" a two-column matrix with the test statistics (Wald statistic) and the associated p-values (or their loggarithm).

Michail Tsagris

References

Papke L. E. & Wooldridge J. (1996). Econometric methods for fractional response variables with an application to 401(K) plan participation rates. Journal of Applied Econometrics, 11(6): 619–632.

McCullagh, Peter, and John A. Nelder. Generalized linear models. CRC press, USA, 2nd edition, 1989.