NCI60: NCI60: Gene Expression in Cancer Cell Lines

NCI60R Documentation

NCI60: Gene Expression in Cancer Cell Lines

Description

This is NCI microarray data and consists of 6830 gene expression measurements for each of 64 cancer cell lines derived from 14 types of human tumours. The researchers wanted to classify the cell lines based on the observed gene expression patterns.

Usage

NCI60

Format

A data frame with 64 observations (rows) and 6832 variables (columns).

Column name Data type Description Values
[,1] ID factor Unique ID for each cell line (V1...V64)
[,2] Labs factor 14 types of cancer (BREAST...UNKNOWN)
[,3:6832] Expr1...Expr6830 numeric Gene expression data (-6.939981 - 8.66)

Details

CNS central nervous system
NSCLC non-small cell lung cancer
K562 chronic myeloid leukaemia
MCF7 metastatic adenocarcinoma (breast cancer)

Source

The data is from https://vincentarelbundock.github.io/Rdatasets/articles/data.html

References

James, G., Witten, D., Hastie, T., and Tibshirani, R. (2013) An Introduction to Statistical Learning with applications in R, Springer-Verlag:New York.

Ross, D.T., et al. (2000) Systematic variation in gene expression patterns in human cancer cell lines. Nat Genet, 24, 227–235.

Examples


# Summary of some of the variables
summary(NCI60[, 1:6])

# Number of cases per cancer type
table(NCI60$Labs)


thoree/stat340 documentation built on June 30, 2024, 4:04 p.m.