mvregtolregion: Multivariate (Multiple) Linear Regression Tolerance Regions

mvregtol.regionR Documentation

Multivariate (Multiple) Linear Regression Tolerance Regions

Description

Determines the appropriate tolerance factor for computing multivariate (multiple) linear regression tolerance regions based on Monte Carlo simulation.

Usage

mvregtol.region(mvreg, new.x = NULL, alpha = 0.05, P = 0.99, 
                B = 1000)

Arguments

mvreg

A multivariate (multiple) linear regression fit, having class mlm.

new.x

An optional data frame of new values for which to approximate k-factors. This must be a data frame with named columns that match those in the data frame used for the mvreg fitted object.

alpha

The level chosen such that 1-alpha is the confidence level.

P

The proportion of the population to be covered by this tolerance region.

B

The number of iterations used for the Monte Carlo algorithm which determines the tolerance factor. The number of iterations should be at least as large as the default value of 1000.

Details

A basic sketch of how the algorithm works is as follows:

(1) Generate independent chi-square random variables and Wishart random matrices.

(2) Compute the eigenvalues of the randomly generated Wishart matrices.

(3) Iterate the above steps to generate a set of B sample values such that the 100(1-alpha)-th percentile is an approximate tolerance factor.

Value

mvregtol.region returns a matrix where the first column is the k-factor, the next q columns are the estimated responses from the least squares fit, and the final m columns are the predictor values. The first n rows of the matrix pertain to the raw data as specified by y and x. If values for new.x are specified, then there is one additional row appended to this output for each row in the matrix new.x.

Note

As of tolerance version 2.0.0, the arguments to this function have changed. This function no longer depends on inputted y and x matrices or an int argument. Instead, the function requires mvreg, which is of class "mlm", and provides all of the necessary components for the way the output is formatted. Also, new.x must now be a data frame with columns matching those from the data frame used in the mvreg fitted object.

References

Anderson, T. W. (2003) An Introduction to Multivariate Statistical Analysis, Third Edition, Wiley.

Krishnamoorthy, K. and Mathew, T. (2009), Statistical Tolerance Regions: Theory, Applications, and Computation, Wiley.

Krishnamoorthy, K. and Mondal, S. (2008), Tolerance Factors in Multiple and Multivariate Linear Regressions, Communications in Statistics - Simulation and Computation, 37, 546–559.

Examples

 
## 95%/95% multivariate regression tolerance factors using
## a fertilizer data set presented in Anderson (2003, p. 374). 

grain <- c(40, 17, 9, 15, 6, 12, 5, 9)
straw <- c(53, 19, 10, 29, 13, 27, 19, 30)
fert <- c(24, 11, 5, 12, 7, 14, 11, 18)
DF <- data.frame(grain,straw,fert)
new.x <- data.frame(fert = c(10, 15, 20))
mvreg <- lm(cbind(grain, straw) ~ fert + I(fert^2), data = DF)

set.seed(100)
out <- mvregtol.region(mvreg, new.x = new.x, alpha = 0.05,
                       P = 0.95, B = 5000)
out

tolerance documentation built on May 29, 2024, 7:38 a.m.