merge: General method to merge different ExpressionSets

Description Usage Arguments Details Value References Examples

Description

General method to merge different ExpressionSets by applying different techniques to remove inter-study bias.

Usage

1
merge(esets, method='NONE');

Arguments

esets

List of ExpressionSet objects.

method

Merging method aimed at removing inter-study bias. Possible options are: BMC, COMBAT, GENENORM and XPN. If none are specified, the merging More information about each method is given below in the details.

Details

Currently the following different merging techniques are provided:

'BMC':

In [1] they successfully applied a technique similar to z-score normalization for merging breast cancer datasets. They transformed the data by batch mean-centering, which means that the mean is subtracted.

'COMBAT':

Empirical Bayes [2] (also called EJLR or COMBAT) is a method that estimates the parameters of a model for mean and variance for each gene and then adjusts the genes in each batch to meet the assumed model. The parameters are estimated by pooling information from multiple genes in each batch.

'GENENORM':

One of the simplest mathematical transformations to make datasets more comparable is z-score normalization. In this method, for each gene expression value in each study separately all values are altered by subtracting the mean of the gene in that dataset divided by its standard deviation.

'NONE':

Combine esets without any additional transformation. Similar to 'combine' function.

'XPN':

The basic idea behind the cross-platform normalization [4] approach is to find blocks (clusters) of genes and samples in both studies that have similar expression characteristics. In XPN, a gene measurement can be considered as a scaled and shifted block mean.

Note that after using any of those methods the resulting merged dataset only contains the common list of genes/probes between all studies.

Value

A (merged) ExpressionSet object.

References

[1] A. Sims, et al., The removal of multiplicative, systematic bias allows integration of breast cancer gene expression datasets - improving meta-analysis and prediction of prognosis, BMC Medical Genomics, vol. 1, no. 1, p. 42, 2008.

[2] C. Li and A. Rabinovic, Adjusting batch effects in microarray expression data using empirical bayes methods, Biostatistics, vol. 8, no. 1, pp. 118-127, 2007.

[3] M. Benito, et al., Adjustment of systematic microarray data biases, Bioinformatics, vol. 20, no. 1, pp. 105-114, 2004.

[4] A. A. Shabalin, et al., Merging two gene-expression studies via cross-platform normalization, Bioinformatics, vol. 24, no. 9, pp. 1154-1160, 2008.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
# retrieve two datasets:
library(inSilicoDb);
InSilicoLogin("rpackage_tester@insilicodb.com", "5c4d0b231e5cba4a0bc54783b385cc9a");
eset1 = getDataset("GSE18842", "GPL570", norm="FRMA", features="GENE");
eset2 = getDataset("GSE31547", "GPL96",  norm="FRMA", features="GENE");
esets = list(eset1,eset2);

# merge them using different methods:
library(inSilicoMerging);
eset_FRMA = merge(esets);
eset_COMBAT = merge(esets, method="COMBAT");

inSilicoMerging documentation built on Oct. 5, 2016, 4:16 a.m.