`ARSyNseq` filters the noise associated with identified or unidentified batch effects, taking the experimental design into account and applying Principal Component Analysis (PCA) to the ANOVA parameters and residuals.


| Argument | Description |
|---|---|
| `data` | A Biobase eSet object created with the `readData` function. |
| `factor` | Name of the factor (as it was given to the … |
| `batch` | TRUE to indicate that the … |
| `norm` | Type of normalization to be used. One of “rpkm” (default), “uqua”, “tmm” or “n” (if data are already normalized). If length was provided through the … |
| `logtransf` | If FALSE, a log-transformation is applied to the data before computing the ARSyN model, to improve the results of PCA on count data. |
| `Variability` | Parameter for selecting the Principal Components (PCs) of the ANOVA model effects. This is the desired proportion of variability explained by the PCs of the main effects (time and experimental group). Default: `Variability = 0.75`. |
| `beta` | Parameter for selecting the PCs of the residual model. The selected components are those that explain more than `beta` times the average component variability, computed as the total data variability divided by the rank of the matrix associated with the factor. Default: `beta = 2`. |

When the batch is identified with one of the factors described in the argument `factor` of the `data` object, `ARSyNseq` estimates this effect and removes it by estimating the main PCs of the associated ANOVA effects. The selected PCs are those that explain more than the variability proportion specified in `Variability`.
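The `Variability` rule can be illustrated with a minimal base-R sketch. It assumes the rule is read as "keep the smallest number of leading PCs whose cumulative explained variability reaches `Variability`"; that reading, the toy matrix `effect_toy`, and the variable names are assumptions made for illustration, not code taken from the package.

```r
# Toy stand-in for an ANOVA effect matrix (hypothetical data, not NOISeq output).
set.seed(1)
effect_toy <- matrix(rnorm(200), nrow = 20, ncol = 10)

# PCA on the toy matrix; proportion of variability explained by each PC.
pca <- prcomp(effect_toy, center = TRUE, scale. = FALSE)
explained <- pca$sdev^2 / sum(pca$sdev^2)

# Assumed reading of the rule: smallest number of PCs whose cumulative
# explained variability reaches the requested proportion.
variability <- 0.75
n_pcs <- which(cumsum(explained) >= variability)[1]

cat("PCs kept:", n_pcs, "\n")
cat("Cumulative variability:", round(cumsum(explained)[n_pcs], 3), "\n")
```

Raising `variability` towards 1 keeps more components in this sketch, so more of the effect would be treated as batch-related.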

When the batch is not identified, the model estimates the effects associated with each factor of interest and checks whether the residuals contain systematic noise. If there is a batch effect, it will be identified with the main PCs of these residuals. The selected PCs are those that explain more than `beta` times the average component variability.
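The `beta` rule for residuals can be sketched in base R as follows. This is an illustration under stated assumptions, not the package's implementation: the residual matrix `residuals_toy` is simulated, and the rank of the centered toy matrix is used as a stand-in for "the rank of the matrix associated to the factor".

```r
# Simulated residuals: random noise plus one strong systematic pattern
# shared by the first four samples versus the last four (hypothetical data).
set.seed(2)
n_genes <- 50
n_samples <- 8
noise <- matrix(rnorm(n_genes * n_samples, sd = 0.5), n_genes, n_samples)
hidden_batch <- rep(c(-1, 1), each = n_samples / 2)
gene_loading <- rnorm(n_genes, sd = 3)
residuals_toy <- noise + outer(gene_loading, hidden_batch)

# Samples in rows, genes in columns, centered by gene.
centered <- scale(t(residuals_toy), center = TRUE, scale = FALSE)
sv <- svd(centered)
comp_var <- sv$d^2               # variability carried by each component
total_var <- sum(centered^2)     # total data variability

# Assumption: rank of the centered toy matrix replaces the factor-matrix rank.
avg_var <- total_var / qr(centered)$rank

beta <- 2
selected <- which(comp_var > beta * avg_var)

# Remove the selected components to obtain the filtered residuals.
systematic <- sv$u[, selected, drop = FALSE] %*%
  diag(sv$d[selected], nrow = length(selected)) %*%
  t(sv$v[, selected, drop = FALSE])
filtered <- centered - systematic

cat("Components flagged as systematic noise:", selected, "\n")
```

A larger `beta` makes the threshold stricter in this sketch, so fewer residual components are treated as systematic noise.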

Returns the Biobase eSet object created with the `readData` function that was given as input, with the expression data replaced by the filtered expression data matrix.

Maria Jose Nueda, mj.nueda@ua.es

Nueda, M.J.; Ferrer, A. and Conesa, A. (2012) ARSyN: a method for the identification and removal of
systematic noise in multifactorial time-course microarray experiments. *Biostatistics* 13(3), 553-566.

```
library(NOISeq)

# Generating an artificial batch effect from Marioni's data
data(Marioni)
set.seed(123)
mycounts2 = mycounts
mycounts2[,1:4] = mycounts2[,1:4] + runif(nrow(mycounts2)*4, 3, 5)
myfactors = data.frame(myfactors, "batch" = c(rep(1,4), rep(2,6)))
mydata2 = readData(mycounts2, factors = myfactors)

# Exploring the batch effect with PCA
myPCA = dat(mydata2, type = "PCA")
par(mfrow = c(1,2))
explo.plot(myPCA, factor = "Tissue")
explo.plot(myPCA, factor = "batch")

# Removing the batch effect when the batch is identified for each sample,
# then exploring the results with PCA
mydata2corr1 = ARSyNseq(mydata2, factor = "batch", batch = TRUE, norm = "rpkm", logtransf = FALSE)
myPCA = dat(mydata2corr1, type = "PCA")
par(mfrow = c(1,2))
explo.plot(myPCA, factor = "Tissue")
explo.plot(myPCA, factor = "batch")

# If a batch effect is suspected but not identified (the batch information is unknown):
mydata2corr2 = ARSyNseq(mydata2, factor = "Tissue", batch = FALSE, norm = "rpkm", logtransf = FALSE)
myPCA = dat(mydata2corr2, type = "PCA")
par(mfrow = c(1,2))
explo.plot(myPCA, factor = "Tissue")
explo.plot(myPCA, factor = "batch")
```
