Two Scatter Matrix ICS Transformation
This function implements the 2 scatter matrix transformation to obtain an invariant coordinate sytem or independent components, depending on the underlying assumptions.
numeric data matrix or dataframe.
name of the first scatter matrix function or a scatter matrix. Default is the regular covariance matrix.
name of the second scatter matrix or a scatter matrix. Default is the covariance matrix based on forth order moments. Note that the type of S2 must be the same as S1.
list with optional additional arguments for S1. Only considered if S1 is a function.
list with optional additional arguments for S2. Only considered if S2 is a function.
either "B" or "Z". Defines the way to standardize the matrix B. Default is "Z". Details are given below.
Logical, either "TRUE" or "FALSE". Specifies weather the product of the kurtosis values is 1 or not.
a function which indicates what should happen when the data contain 'NA's. Default is to fail.
Seeing this function as a tool for data transformation the result is an invariant coordinate selection which can be used for testing and estimation. And if needed the results can be easily retransformed to the original scale. It is possible to use it also for dimension reduction, finding outliers or when searching for clusters in the data. The function can, however, also be used in a modelling framework. In this case it is assumed that the data were created by mixing independent components which have different kurtosis values. If the two scatter matrices used have then the so-called independence property the function can recover the independet components by estimating the unmixing matrix.
By default S1 is the regular covariance matrix
cov and S2 the matrix of fourth moments
cov4. However those can be replaced with any other
scatter matrix the user prefers. The package ICS offers for example also
tM and in the ICSNP
are for example further scatters as
But of course also scatters from any other package can be used.
Note that when function names are submitted, the function should return only a scatter matrix. If the function returns more, the scatter should be computed in advance or
a wrapper written that yields the required output. For example
tM returns a list with four elements where the scatter estimate is called V. A simple wrapper would then be
my.tm <- function(x, ...) tM(x, ...)$V.
For a given choice of S1 and S2 the general idea of the
ics function is to find the unmixing matrix B and the invariant coordinates (independent coordinates)
Z in such a way, that:
The elements of Z are standardized with respect to S1 (S1(Z)=I).
The elements of Z are uncorrelated with respect to S2. (S2(Z)=D, where D is a diagonal matrix).
The elements of Z are ordered according to their kurtosis.
Given those criteria, B is unique up to sign changes of its rows. The function provides two options to decide the exact form of B.
Method 'Z' standardizes B such, that all components are right skewed. The criterion used, is the sign of each componentwise difference of mean vector and transformation retransformation median. This standardization is prefered in an invariant coordinate framework.
Method 'B' standardizes B independent of Z such that the maximum element per row is positive and each row has norm 1. Usual way in an independent component analysis framework.
In principal if S1 and S2 are true scatter matrices the order does not matter. It will just reverse and invert the kurtosis value vector. This is however not true when not both of them are scatter matrices but one or both are shape matrices. In this case the order of the kurtosis values is also reversed, the ratio however then is not 1 but only constant. This is due to the fact that when shape matrices are used, the kurtosis values are only relative ones. Therefore by the default the kurtosis values are standardized such that their product is 1. If no standardization is wanted, the 'stdKurt' argument should be used.
an object of class
Tyler, D.E., Critchley, F., Dümbgen, L. and Oja, H. (2009), Invariant co-ordinate selecetion, Journal of the Royal Statistical Society,Series B, 71, 549–592.
Oja, H., Sirkiä, S. and Eriksson, J. (2006), Scatter matrices and independent component analysis, Austrian Journal of Statistics, 35, 175–189.
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 56 57 58 59 60 61 62 63 64 65 66 67 68 69 70 71
# example using two functions set.seed(123456) X1 <- rmvnorm(250, rep(0,8), diag(c(rep(1,6),0.04,0.04))) X2 <- rmvnorm(50, c(rep(0,6),2,0), diag(c(rep(1,6),0.04,0.04))) X3 <- rmvnorm(200, c(rep(0,7),2), diag(c(rep(1,6),0.04,0.04))) X.comps <- rbind(X1,X2,X3) A <- matrix(rnorm(64),nrow=8) X <- X.comps %*% t(A) ics.X.1 <- ics(X) summary(ics.X.1) plot(ics.X.1) # compare to pairs(X) pairs(princomp(X,cor=TRUE)$scores) # slow: # library(ICSNP) # ics.X.2 <- ics(X, tyler.shape, duembgen.shape, S1args=list(location=0)) # summary(ics.X.2) # plot(ics.X.2) rm(.Random.seed) # example using two computed scatter matrices for outlier detection library(robustbase) ics.wood<-ics(wood,tM(wood)$V,tM(wood,2)$V) plot(ics.wood) # example using three pictures library(pixmap) fig1 <- read.pnm(system.file("pictures/cat.pgm", package = "ICS")) fig2 <- read.pnm(system.file("pictures/road.pgm", package = "ICS")) fig3 <- read.pnm(system.file("pictures/sheep.pgm", package = "ICS")) p <- dim(fig1@grey) fig1.v <- as.vector(fig1@grey) fig2.v <- as.vector(fig2@grey) fig3.v <- as.vector(fig3@grey) X <- cbind(fig1.v,fig2.v,fig3.v) set.seed(4321) A <- matrix(rnorm(9), ncol = 3) X.mixed <- X %*% t(A) ICA.fig <- ics(X.mixed) par.old <- par() par(mfrow=c(3,3), omi = c(0.1,0.1,0.1,0.1), mai = c(0.1,0.1,0.1,0.1)) plot(fig1) plot(fig2) plot(fig3) plot(pixmapGrey(X.mixed[,1],ncol=p)) plot(pixmapGrey(X.mixed[,2],ncol=p)) plot(pixmapGrey(X.mixed[,3],ncol=p)) plot(pixmapGrey(ics.components(ICA.fig)[,1],ncol=p)) plot(pixmapGrey(ics.components(ICA.fig)[,2],ncol=p)) plot(pixmapGrey(ics.components(ICA.fig)[,3],ncol=p)) par(par.old) rm(.Random.seed)