tunelocal: Dimension selection for three-dimensional correspondence...

View source: R/tunelocal.R

tunelocalR Documentation

Dimension selection for three-dimensional correspondence biplot using convex hull.

Description

This function allows to select the optimal dimension number for correspondence biplot, given the set of possible dimension combination of the original data. It determines the models that are located on the boundary of the convex hull and selects an optimal model by means of the scree test values (st). For exploring, it is also possible to check the optimal model dimension by using boostrap samples which have the same marginal proportions and the total number of the original table. When the input parameter boots = T, it does bootstrap sampling. There are three kinds of possible bootstrap sampling. When boottype = "bootnp" it performs a non parametric bootstrap sampling. When boottype = "bootpsimple" it performs a parametric simple bootstrap sampling. When boottype = "bootpstrat", it performs a parametric stratified bootstrap sampling. In particular in case of parametric bootstrap types, when resamptype=1 it considers a multinomial distribution, and when resamptype = 2 it considers a poisson distribution.

Usage

tunelocal(Xdata, ca3type = "CA3", resp = "row", norder = 3, digits = 3, boots = FALSE,  
nboots = 0, boottype= "bootpsimple", resamptype = 1, PercentageFit = 0.01) 

Arguments

Xdata

The three-way data. It can be a R object array or raw data (n individuals by three categorical variables, for an example, see museum data). When a three-way non-symmetrical variant is performed, by default, the response variable is the row variable when an array is given, or the first of three columns when a raw data set is given. For changing, consider the parameter resp = "col" or resp = "tube".

ca3type

The specification of the analysis to be performed. If ca3type = "CA3", then a three-way (symmetrical) correspondence analysis will be performed (default analysis). If ca3type = "NSCA3", then three-way non-symmetrical correspondence analysis will be performed. If ca3type = "OCA3", then ordered three-way symmetrical correspondence analysis will be performed. If ca3type = "ONSCA3", then ordered three-way non-symmetrical correspondence analysis will be performed.

resp

The input parameter for specifying in non-symmetrical three-way correspondence analysis variants (ca3type = "NSCA3" and ca3type = "ONSCA3") what is the response variable (logically antecendent to the others). By default resp = "row", but it could be the column variable resp = "col" or the tube variable resp = "tube".

norder

The input parameter for specifying the number of ordered variable when ca3type = "OCA3" or ca3type = "ONSCA3".

digits

The input parameter specifying the digital number. By default, digits = 3.

boots

The flag parameter to perform the search of optimal dimensions using bootstrap samples. By defaults, boots = FALSE.

nboots

The number of bootstrap samples to generate when boots = TRUE. Note that when boots = FALSE,
by default nboots = 0, but when boots = TRUE, by default nboots = 100.

boottype

The specification of the kind of bootstrap sampling to be performed. If boottype = "bootpsimple", then a parametric bootstrap using a simple sampling scheme will be performed (default sampling). If boottype = "bootpstrat", then a parametric bootstrap using a stratified sampling scheme will be performed. If boottype = "bootnp", then a non-parametric bootstrap using a simple sampling scheme will be performed.

resamptype

When the kind of bootstrap is parametric you can set the data distribution using the input parameter resamptype according to two distribution: resamptype=1 corresponds to multinomial distribution and resamptype=2 to Poisson distribution.

PercentageFit

Required proportion of increase in fit of a more complex model. By default, PercentageFit = 0.01.

Value

output1

Chi-square criterion and df of models on the convex hull. It gives the criterion values of the models that are located on the boundary of the convex hull and selects the optimal model by means of the scree test values (st). When using boots = FALSE, it gives the set of possible dimension combination of the original data using only the original data array. When using boots = TRUE, it gives the set of possible dimension combination of the original data using bootstrapped data arrays.

Author(s)

Rosaria Lombardo, Michel van de Velden, Eric J Beh.

References

Wilderjans T F, Ceulemans E, and Meers K (2013) CHull: A generic convex hull based model selection method. Behavior Research Methods, 45, 1-15.
Ceulemans E, and Kiers H A L (2006) Selecting among three-mode principal component models of different types and complexities: A numerical convex hull based method. British Journal of Mathematical & Statistical Psychology, 59, 133-150.

Examples


tunelocal(Xdata = happy, ca3type = "CA3")




CA3variants documentation built on Oct. 10, 2022, 5:07 p.m.