R/data.R

#' Clinical and microarray data from the TCGA dataset
#' A dataset containing the survival outcome in lung cancer TCGA trial
#' Lung adenocarcinomas were profiled by Beer et al. (2002) using Affymetrix hu6800 microarrays. The data here were normalized from raw .CEL files by RMAExpress (v0.3). The expression matrix contains expression data for 86 patients with 7,129 probe sets.
#'
#' @format Gene expression matrix and survival outcome data frame (time, cens).
#' \describe{
#'   \item{time}{Actual survival months observed}
#'   \item{cens}{censoring indicator 1 observed, 0 censored}
#'   \item{featurenames}{feature names in the gene expression matrix}
#'   ...
#' }
"lungdata"

#' GBM data
#'
#'This is a subset of the glioblastoma dataset from the cancer genome atlas (TCGA) GBM study (2009) used in Shen et al. (2012).
#'
#' @format A list object containing three data matrices: copy number, methylation and mRNA expression in 55 samples.
#' \describe{
#'   \item{time}{Actual survival months observed}
#'   \item{cens}{censoring indicator 1 observed, 0 censored}
#'   \item{featurenames}{feature names in the gene expression matrix}
#'   ...
#' }
#' @references Ronglai Shen, Qianxing Mo, Nikolaus Schultz, Venkatraman E. Seshan, Adam B. Olshen, Jason Huse, Marc Ladanyi, Chris Sander. (2012). Integrative Subtype Discovery in Glioblastoma Using iCluster. PLoS ONE 7, e35236
"gbm"

#' Metabric clinical data
#'
#'This dataset includes the clinical data for the study METABRIC.
#'
#' @format A datset with 3 covariates and 2 survival variables.
#' \describe{
#'   \item{os_months}{Actual survival months observed}
#'   \item{os_status}{censoring indicator 1 observed, 0 censored}
#'   ...
#' }
#' @references Curtis C, Shah SP, Chin SF, Turashvili G, Rueda OM, Dunning MJ, Speed D, Lynch AG, Samarajiwa S, Yuan Y, et al. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature. 2012; 486:346–352. [PubMed: 22522925]
"metabric_clinical_data"
csetraynor/predsurv documentation built on May 17, 2020, 3:43 a.m.