golub: Subset of a gene expression dataset from Golub et al. (1999)

golubR Documentation

Subset of a gene expression dataset from Golub et al. (1999)

Description

A subset of 200 genes from the gene expression data (38 tumor mRNA samples) from the leukemia microarray study of Golub et al. (1999). Pre-processing was done as described in Dudoit et al. (2002). These data are obtained from the hopach package by Pollard and Van der Laan, October 2007; the 200 genes exhibiting the most variation have been selected, as described in the manual of the hopach package.

Usage

data(golub)

Value

golub

matrix of gene expression levels for the 38 tumor mRNA samples, rows correspond to genes (200 genes) and columns to mRNA samples.

golub.cl

numeric vector indicating the tumor class, 27 acute lymphoblastic leukemia (ALL) cases (code 0) and 11 acute myeloid leukemia (AML) cases (code 1).

golub.gnames

a matrix containing the names of the 200 genes for the expression matrix golub. The three columns correspond to the gene index, ID, and Name, respectively.

Source

Golub et al. (1999). Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, Vol. 286:531-537.
http://www-genome.wi.mit.edu/MPR/ .

References

S. Dudoit, J. Fridlyand, and T. P. Speed (2002). Comparison of discrimination methods for the classification of tumors using gene expression data. Journal of the American Statistical Association, Vol. 97, No. 457, p. 77–87.


rwehrens/PCA documentation built on Jan. 27, 2024, 2:33 p.m.