N1binomial | R Documentation |
Estimate the four parameters of
the (bivariate) N_1
–binomial copula
mixed data type model
by maximum likelihood estimation.
N1binomial(lmean = "identitylink", lsd = "loglink",
lvar = "loglink", lprob = "logitlink", lapar = "rhobitlink",
zero = c(if (var.arg) "var" else "sd", "apar"),
nnodes = 20, copula = "gaussian", var.arg = FALSE,
imethod = 1, isd = NULL, iprob = NULL, iapar = NULL)
lmean , lsd , lvar , lprob , lapar |
Details at |
imethod , isd , iprob , iapar |
Initial values.
Details at |
zero |
Details at |
nnodes |
Number of nodes and weights for the Gauss–Hermite (GH) quadrature. While a higher value should be more accurate, setting an excessive value runs the risk of evaluating some special functions near the boundary of the parameter space and producing numerical problems. |
copula |
Type of copula used. Currently only the bivariate normal is used but more might be implemented in the future. |
var.arg |
See |
The bivariate response comprises Y_1
from the linear model having parameters
mean
and sd
for
\mu_1
and \sigma_1
,
and the binary
Y_2
having parameter
prob
for its mean \mu_2
.
The
joint probability density/mass function is
P(y_1, Y_2 = 0) = \phi_1(y_1; \mu_1, \sigma_1)
(1 - \Delta)
and
P(y_1, Y_2 = 1) = \phi_1(y_1; \mu_1, \sigma_1)
\Delta
where \Delta
adjusts \mu_2
according to the association parameter
\alpha
.
The quantity \Delta
is
\Phi((\Phi^{-1}(\mu_2) - \alpha Z_1)/
\sqrt{1 - \alpha^2})
.
The quantity Z_1
is (Y_1-\mu_1) / \sigma_1
.
Thus there is an underlying bivariate normal
distribution, and a copula is used to bring the
two marginal distributions together.
Here,
-1 < \alpha < 1
, and
\Phi
is the
cumulative distribution function
pnorm
of a standard univariate normal.
The first marginal
distribution is a normal distribution
for the linear model.
The second column of the response must
have values 0 or 1,
e.g.,
Bernoulli random variables.
When \alpha = 0
then \Delta=\mu_2
.
Together, this family function combines
uninormal
and
binomialff
.
If the response are correlated then
a more efficient joint analysis
should result.
This VGAM family function cannot handle
multiple responses. Only a two-column
matrix is allowed.
The two-column fitted
value matrix has columns \mu_1
and \mu_2
.
An object of class "vglmff"
(see vglmff-class
).
The object is used by modelling functions
such as vglm
and vgam
.
This VGAM family function is fragile.
Because the EIMs are approximated by
GH quadrature it is found that convergence
may be a little slower than for other models
whose EIM is tractable.
Also, the log-likelihood may be flat at the MLE
with respect to \alpha
especially
because the correlation
between the two mixed data types may be weak.
It pays to set trace = TRUE
to
monitor convergence, especially when
abs(apar)
is high.
T. W. Yee
Song, P. X.-K. (2007). Correlated Data Analysis: Modeling, Analytics, and Applications. Springer.
rN1binom
,
N1poisson
,
binormalcop
,
uninormal
,
binomialff
,
pnorm
.
nn <- 1000; mymu <- 1; sdev <- exp(1)
apar <- rhobitlink(0.5, inverse = TRUE)
prob <- logitlink(0.5, inverse = TRUE)
mat <- rN1binom(nn, mymu, sdev, prob, apar)
nbdata <- data.frame(y1 = mat[, 1], y2 = mat[, 2])
fit1 <- vglm(cbind(y1, y2) ~ 1, N1binomial,
nbdata, trace = TRUE)
coef(fit1, matrix = TRUE)
Coef(fit1)
head(fitted(fit1))
summary(fit1)
confint(fit1)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.