homogenizeData: Homogenizes two expression matrices

homogenizeDataR Documentation

Homogenizes two expression matrices

Description

This function takes two gene expression matrices (like trainExprMat and testExprMat) and returns homogenized versions of the matrices by employing the homogenization method specified. By default, the Combat method from the sva library is used. In both matrices, genes are row names and samples are column names. It will deal with duplicated gene names, as it subsets and orders the matrices correctly.

Usage

homogenizeData(
  testExprMat,
  trainExprMat,
  batchCorrect = "eb",
  selection = -1,
  printOutput = TRUE
)

Arguments

testExprMat

A gene expression matrix for samples on which we wish to predict a phenotype.Genes are rows, samples are columns.

trainExprMat

A gene expression matrix for samples for which the phenotype is already known.Genes are rows, samples are columns.

batchCorrect

The type of batch correction to be used. Options are 'eb' for Combat, 'none', or 'qn' for quantile normalization. #The default is 'eb'.

selection

This parameter can be used to specify how duplicates are handled. The default value of -1 means to ask the user. #Other options include '1' to summarize duplicates by their mean, and '2'to discard all duplicated genes.

printOutput

To suppress output, set to false. Default is TRUE.

Value

A list containing two entries $train and $test, which are the homogenized input matrices.


oncoPredict documentation built on May 29, 2024, 6:05 a.m.