stablereg: Stable Generalized Regression Models

View source: R/stable.r

stableregR Documentation

Stable Generalized Regression Models

Description

stablereg fits user specified generalized linear and nonlinear regression models based on the stable distribution to (uncensored, right and/or left censored) data. This allows the location, the dispersion, the skewness and the tails of the fitted stable distribution to vary with explanatory variables.

Usage

stablereg(
  y = NULL,
  loc = 0,
  disp = 1,
  skew = 0,
  tail = 1.5,
  oloc = TRUE,
  odisp = TRUE,
  oskew = TRUE,
  otail = TRUE,
  noopt = FALSE,
  iloc = NULL,
  idisp = NULL,
  iskew = NULL,
  itail = NULL,
  loc_h = NULL,
  disp_h = NULL,
  skew_h = NULL,
  tail_h = NULL,
  weights = 1,
  exact = FALSE,
  delta = 1,
  envir = parent.frame(),
  integration = "Romberg",
  eps = 1e-06,
  up = 10,
  npoint = 501,
  hessian = TRUE,
  llik.output = FALSE,
  print.level = 0,
  ndigit = 10,
  steptol = 1e-05,
  gradtol = 1e-05,
  fscale = 1,
  typsize = abs(p0),
  stepmax = sqrt(p0 %*% p0),
  iterlim = 100
)

Arguments

y

The response vector or a repeated data object. If the repeated data object contains more than one response variable, give that object in envir and give the name of the response variable to be used here.

For censored data, two columns with the second being the censoring indicator (1: uncensored, 0: right censored, -1: left censored.)

loc, loc_h, oloc, iloc

Describe the regression model fitted for the location parameter of the stable distribution, perhaps after transformation by the link function loc_g (set to the identity by default. The inverse link function is denoted by loc_h. Note that these functions cannot contain unknown parameters).

Two specifications are possible:

(1) loc is a linear or nonlinear language expression beginning with ~ or an R function, describing the regression function for the location parameter (after transformation by loc_g, the link function).

iloc is a vector of initial conditions for the parameters in the regression for this parameter.

oloc is a boolean indicating if an optimization of the likelihood has to be carried out on these parameters. If oloc is set to TRUE, a default zero value is considered for the starting values iloc. But if no optimization is desired on the location parameters, i.e. when the likelihood has to be evaluated or optimized at a fixed location, then iloc has to be explicitely specified.

(2) loc is a numeric expression (i.e. a scalar or a vector of the same size as the data vector y, or y[,1] when censoring is considered).

If oloc is set to TRUE, i.e. when an optimization of the likelihood has to be carried out on the location parameter, then the location parameter (after transformation by the link function loc_g) is set to an unknown parameter with initial value equal to iloc[1] or loc[1] when iloc is not specified.

But when oloc is set to FALSE, i.e. when the likelihood has to be evaluated or optimized at a fixed location, then the transformed location is assumed to be equal to loc when it is of the same length as the data vector y (or y[,1] when censoring is considered), and to loc[1] otherwise.

Specification (1) is especially useful in ANOVA-like situations where the location is assumed to change with the levels of some factor variable.

disp, disp_h, odisp, idisp

describe the regression model for the dispersion parameter of the fitted stable distribution, after transformation by the link function disp_g (set to the log function by default). The inverse link function is denoted by disp_h. Again these functions cannot contain unknown parameters. The same rules as above apply when specifying the generalized regression model for the dispersion parameter.

skew, skew_h, oskew, iskew

describe the regression model for the skewness parameter of the fitted stable distribution, after transformation by the link function skew_g (set to log{(1 + [.])/(1 - [.])} by default). The inverse link function is denoted by skew_h. Again these functions cannot contain unknown parameters. The same rules as above apply when specifying the generalized regression model for the skewness parameter.

tail, tail_h, otail, itail

describe the regression model considered for the tail parameter of the fitted stable distribution, after transformation by the link function tail_g (set to log{([.] - 1)/(2 - [.])} by default. The inverse link function is denoted by tail_h. Again these functions cannot contain unknown parameters). The same rules as above apply when specifying the generalized regression model for the tail parameter.

noopt

When set to TRUE, it forces oloc, odisp, oskew and otail to FALSE, whatever the user choice for these last three arguments. It is especially useful when looking for appropriate initial values for the regression model parameters, before undertaking the optimization of the likelihood.

weights

Weight vector.

exact

If TRUE, fits the exact likelihood function for continuous data by integration over intervals of observation, i.e. interval censoring.

delta

Scalar or vector giving the unit of measurement for each response value, set to unity by default. For example, if a response is measured to two decimals, delta=0.01. If the response is transformed, this must be multiplied by the Jacobian. For example, with a log transformation, delta=1/y. (The delta values for the censored response are ignored.) The transformation cannot contain unknown parameters.

envir

Environment in which model formulae are to be interpreted or a data object of class, repeated, tccov, or tvcov; the name of the response variable should be given in y. If y has class repeated, it is used as the environment.

integration, eps, up, npoint

integration indicates which algorithm must be used to evaluate the stable density when the likelihood is computed with exact set to FALSE. See the man page on stable for extra information.

hessian

Arguments controlling the optimization procedure nlm.

llik.output

is TRUE when the likelihood has to be displayed at each iteration of the optimization.

print.level

Arguments controlling the optimization procedure nlm.

ndigit

Arguments controlling the optimization procedure nlm.

steptol

Arguments controlling the optimization procedure nlm.

gradtol

Arguments controlling the optimization procedure nlm.

fscale

Arguments controlling the optimization procedure nlm.

typsize

Arguments controlling the optimization procedure nlm.

stepmax

Arguments controlling the optimization procedure nlm.

iterlim

Arguments controlling the optimization procedure nlm.

Value

A list of class stable is returned. The printed output includes the -log-likelihood, the corresponding AIC, the maximum likelihood estimates, standard errors, and correlations. It also include all the relevant information calculated, including error codes.

Warning

Because of the numerical integrations involved, convergence can be very sensitive to the initial parameter values supplied and to the settings of the arguments controlling nlm. If nlm feeds extreme parameter values in the tails of the distribution to the likelihood function, the integration may hang for a long time.

Author(s)

Philippe Lambert (Catholic University of Louvain, Belgium, phlambert@stat.ucl.ac.be) and Jim Lindsey.

References

Lambert, P. and Lindsey, J.K. (1999) Analysing financial returns using regression models based on non-symmetric stable distributions. Applied Statistics 48, 409-424.

See Also

lm, glm, stable and stable.mode.

Examples


## Share return over a 50 day period (see reference above)
# shares
y <- c(296,296,300,302,300,304,303,299,293,294,294,293,295,287,288,297,
305,307,307,304,303,304,304,309,309,309,307,306,304,300,296,301,298,
295,295,293,292,297,294,293,306,303,301,303,308,305,302,301,297,299)

# returns
ret <- (y[2:50]-y[1:49])/y[1:49]
# hist(ret, breaks=seq(-0.035,0.045,0.01))

day <- seq(0,0.48,by=0.01) # time measured in days/100
x <- seq(1,length(ret))-1

# Classic stationary normal model tail=2
print(z1 <- stablereg(y = ret, delta = 1/y[1:49],
	loc = ~1, disp= ~1, skew = ~1, tail = tail_g(1.9999999),
	iloc = 0, idisp = -3, iskew = 0, oskew = FALSE, otail = FALSE))

# Normal model (tail=2) with dispersion=disp_h(b0+b1*day)
print(z2 <- stablereg(y = ret, delta = 1/y[1:49], loc = ~day,
	disp = ~1, skew = ~1, tail = tail_g(1.999999), iloc = c(0.003,0),
	idisp = -4.5, iskew = 0, oskew = FALSE, otail = FALSE))

# Stable model with loc(ation)=loc_h(b0+b1*day)
print(z3 <- stablereg(y = ret, delta = 1/y[1:49],
	loc = ~day, disp = ~1, skew = ~1, tail = ~1,
	iloc = c(0.001,-0.004), idisp = -4.8, iskew = 0, itail = 0.6))

# Stable model with disp(ersion)=disp_h(b0+b1*day)
print(z4 <- stablereg(y = ret, delta = 1/y[1:49],
	loc = ~1, disp = ~day, skew = ~1, tail = ~1,
	iloc = 0.003, idisp = c(-4.8,0), iskew = -0.03, itail = 1.6))

# Stable model with skew(ness)=skew_h(b0+b1*day)
# Evaluation at fixed parameter values (because noopt is set to TRUE)
print(z5 <- stablereg(y = ret, delta = 1/y[1:49],
	loc = ~1, disp = ~1, skew = ~day, tail = ~1,
	iloc = 5.557e-04, idisp = -4.957, iskew = c(2.811,-2.158),
	itail = 1.57, noopt=TRUE))

# Stable model with tail=tail_h(b0+b1*day)
print(z6 <- stablereg(y = ret, delta = 1/y[1:49], loc = ret ~ 1,
	disp = ~1, skew = ~1, tail = ~day, iloc = 0.002,
	idisp = -4.8, iskew = -2, itail = c(2.4,-4), hessian=FALSE))


stable documentation built on March 18, 2022, 7:48 p.m.