Ramaswamy2001: Multiclass cancer prediction involving 14 tumor types

Ramaswamy2001R Documentation

Multiclass cancer prediction involving 14 tumor types

Description

The optimal treatment of cancer patients depends on establishing accurate diagnoses using clinical and histopathologic data. In some instances, particularly involving metastatic tumors, this is difficult to atypical clinical presentation or histopathology. The purpose of this study was to determine whether the diagnosis of common adult malignancies could be achieved purely by molecular classification. The data consists of 218 tumor samples, spanning 14 common tumor types. For each tumor, the expression of 16,063 genes were measured.

Format

  • y: A 14-level factor indicating the type of tumor

  • X: Gene expression measurement

  • y.test: Same as above, but for independent testing

  • X.test: Same as above, but for independent testing

Dimensions

  • n = 144 (train), 54 (test)

  • p = 16,063

Annotation

  • The object fData contains the associated gene names and gene symbols for the (mapped) probes in X. Rows of fData correspond to columns of X, and are named accordingly.

References

I obtained the raw data from Trevor Hastie; I am not sure what sort of preprocessing/normalization was done to it. I added the annotation, both with respect to the tumor types and the feature data.

Original citation:

Ramaswamy S, Tamayo P, Rifkin R, Mukherjee S, Yeang C, Angelo M, Ladd C, Reich M, Latulippe E, Mesirov J and others (2001). Multiclass cancer diagnosis using tumor gene expression signatures. Proceedings of the National Academy of Sciences, 98: 15149-15154.


pbreheny/hdrm documentation built on May 24, 2024, 9:30 p.m.