selm.fit: Fitting functions for 'selm' models
In sn: The Skew-Normal and Related Distributions Such as the Skew-t and the SUN

selm.fit

R Documentation

Fitting functions for `selm` models

Description

A call to selm activates a call to selm.fit and from here to some other function which actually performs the parameter search, among those listed below. These lower-level functions can be called directly for increased efficiency, at the expense of some more programming effort and lack of methods for the returned object.

Usage

 
selm.fit(x, y, family = "SN", start = NULL, w, fixed.param = list(), 
   offset = NULL, selm.control=list())

sn.mple(x, y, cp = NULL, w, penalty = NULL, trace = FALSE, opt.method =
   c("nlminb",  "Nelder-Mead", "BFGS", "CG", "SANN"), control = list()) 

st.mple(x, y, dp = NULL, w, fixed.nu = NULL, symmetr = FALSE, penalty = NULL, 
   trace = FALSE, opt.method = c("nlminb", "Nelder-Mead", "BFGS", "CG", "SANN"), 
   control = list()) 

msn.mle(x, y, start = NULL, w, trace = FALSE, opt.method = c("nlminb", 
   "Nelder-Mead", "BFGS", "CG", "SANN"), control = list())

msn.mple(x, y, start = NULL, w, trace = FALSE, penalty = NULL, 
   opt.method = c("nlminb", "Nelder-Mead", "BFGS", "CG", "SANN"), 
   control = list()) 

mst.mple(x, y, start = NULL, w, fixed.nu = NULL, symmetr=FALSE, 
   penalty = NULL, trace = FALSE, 
   opt.method = c("nlminb", "Nelder-Mead", "BFGS", "CG", "SANN"), 
   control = list())

Arguments

`x`	a full-rank design matrix with the first column of all 1's.
`y`	a vector or a matrix of response values such that `NROW(y)=nrow(x)`.
`family`	a character string which selects the parametric family of distributions assumed for the error term of the regression model. It must one of `"SN"` (default), `"ST"` or `"SC"`, which correspond to the skew-normal, the skew-t and the skew-Cauchy family, respectively. See `makeSECdistr` for more information on these families and the skew-elliptically contoured (SEC) distributions; notice that family `"ESN"` is not allowed here.
`start, dp, cp`	a vector or a list of initial parameter values, depeding whether `y` is a vector or a matrix. It is assumed that `cp` is given in the CP parameterization, `dp` and `start` in the DP parameterization. For `st.mple` and `mst.mple`, see also the paragraph about `start` in the documentation ‘Details’ of `selm`.
`w`	a vector of non-negative integer weights of length equal to `NROW(y)`; if missing, a vector of all 1's is generated.
`fixed.param`	a list of assignments of parameter values to be kept fixed during the optimization process. Currently, there is only one such option, namely `fixed.param=list(nu='value')`, to fix the degrees of freedom at the named `'value'` when `family="ST"`, for instance `list(nu=3)`. Setting `fixed.param=list(nu=1)` is equivalent to select `family="SC"`.
`penalty`	an optional character string with the name of the penalty function of the log-likelihood; default value `NULL` corresponds to no penalty.
`offset`	this can be used to specify an a priori known component to be included in the linear predictor during fitting. This should be `NULL` or a numeric vector of length equal to the number of cases. One or more `offset` terms can be included in the formula instead or as well, and if more than one are specified their sum is used.

`trace`	a logical value which regulates printing of successive calls to the target function; default value is `FALSE` which suppresses printing.
`fixed.nu`	a positive value to keep fixed the parameter `nu` of the ST distribution in the optimization process; with default value `NULL`, `nu` is estimated like the other parameters.
`symmetr`	a logical flag indicating whether a contraint of symmetry is imposed on the slant parameter; default is `symmetr=FALSE`.
`opt.method`	a character string which selects the optimization method within the set `c("nlminb", "Nelder-Mead", "BFGS", "CG", "SANN")`; the last four of these are `"methods"` of function `optim`.
`selm.control`	a list whose components regulate the working of `selm.fit`; see ‘Details’ for their description;
`control`	a list of control items passed to the optimization function.

Details

A call to selm produces a call to selm.fit which selects the appropriate function among sn.mple, st.mple, msn.mle, msn.mple, mst.mple, depending on the arguments of the calling statement. In the adopted scheme for function names, msn refers to a multivariate skew-normal distribution and mst refers to a multivariate skew-t distribution, while mle and mple refers to maximum likelihood and maximum penalized likelihood estimation, respectively. Of these functions, sn.mple works in CP space; the others in the DP space. In all cases, a correspondig mapping to the alternative parameter space is performed before exiting selm.fit, in addition to the selected parameter set.

The components of selm.control are as follows:

method: the estimation method, "MLE" or "MPLE".
penalty: a string with the name of the penalty function.
info.type: a string with the name of the information matrix, "observed" or "expected"; currently fixed at "observed".
opt.method: a character string which selects the optimization method.
opt.control: a list of control parameters of opt.method.

Function msn.mle, for MLE estimation of linear models with SN errors, is unchanged from version 0.4-x of the package. Function msn.mple is similar to msn.mle but allows to introduce a penalization of the log-likelihood; when penalty=NULL, a call to msn.mle is more efficient. Functions sn.mple and mst.mple work like sn.mle and mst.mle in version 0.4-x if the argument penalty is not set or it is set to NULL, except that mst.mple does not handle a univariate response (use st.mple for that).

Value

A list whose specific components depend on the named function. Typical components are:

`call`	the calling statement
`dp`	vector or list of estimated DP parameters
`cp`	vector or list of estimated CP parameters
`logL`	the maximized (penalized) log-likelihood
`aux`	a list with auxiliary output values, depending on the function
`opt.method`	a list produced by the numerical `opt.method`

Background

Computational aspects of maximum likelihood estimation for univariate SN distributions are discussed in Section 3.1.7 of Azzalini and Capitanio (2014). The working of sn.mple follows these lines; maximization is performed in the CP space. All other functions operate on the DP space.

The technique underlying msn.mle is based on a partial analytical maximization, leading implicitly to a form of profile log-likelihood. This scheme is formulated in detail in Section 6.1 of Azzalini and Capitanio (1999) and summarized in Section 5.2.1 of Azzalini and Capitanio (2014). The same procedure is not feasible when one adopts MPLE; hence function msn.mple has to maximize over a larger parameter space.

When the SN family is fitted with the constraint alpha=0, this amounts to adopt a classical linear model with Gaussian distributional assumption. The corresponding MLE's are the same as those produced by lm, except that the denominator the of the MLE variance (matrix) has the ‘uncorrected’ form. In the multivariate case, the covariance matrix of MLE is computed using expression (10) in Section 15.8 of Magnus and Neudecker (2007).

Maximization of the univariate ST log-likelihood is speeded-up by using the expressions of the gradient given by DiCiccio and Monti (2011), reproduced with inessential variants in Section 4.3.3 of Azzalini and Capitanio (2014).

The working of mst.mple is based on a re-parameterization described in Section 5.1 of Azzalini and Capitanio (2003). The expressions of the corresponding log-likelihood derivatives are given in Appendix B of the full version of the paper.

Author(s)

Adelchi Azzalini

References

Azzalini, A. and Capitanio, A. (1999). Statistical applications of the multivariate skew normal distribution. J.Roy.Statist.Soc. B 61, 579–602. Full-length version available at https://arXiv.org/abs/0911.2093

Azzalini, A. and Capitanio, A. (2003). Distributions generated by perturbation of symmetry with emphasis on a multivariate skew t distribution. J.Roy. Statist. Soc. B 65, 367–389. Full-length version available at https://arXiv.org/abs/0911.2342

Azzalini, A. with the collaboration of Capitanio, A. (2014). The Skew-Normal and Related Families. Cambridge University Press, IMS Monographs series.

DiCiccio, T. J. and Monti, A. C. (2011). Inferential aspects of the skew t-distribution. Quaderni di Statistica 13, 1–21.

Magnus, J. R. and Neudecker, H. (2007). Matrix Differential Calculus with Applications in Statistics and Econometrics, third edition. John Wiley & Sons.

Examples

 
data(wines, package="sn")
X <- model.matrix(~ phenols + wine, data=wines)
fit <- msn.mle(x=X, y=cbind(wines$acidity, wines$alcohol), opt.method="BFGS")
fit <- st.mple(x=X, y = wines$acidity, fixed.nu=4, penalty="Qpenalty")

sn documentation built on April 5, 2023, 5:15 p.m.