Novartis CCLE resources

knitr::opts_chunk$set(fig.width=12, fig.height=8, fig.path='Figs/',message=FALSE)

title: "CCLE_data" author: "Lavanya Kannan" date: "r Sys.Date()" output: html_document

Introduction

The CCLE project has data about 1036 cellines covering various cancer types. In this vignette, we will see how we can download a toy dataset, define containers and finally to integrate the toy example based on common genes and or disease types. Here we describe the toy dataset of 10 cellines that includes pharmacological dataset of 5 compounds on each of the cellines and expression dataset of the same set of 500 genes. Also included is the copy number variation data.

Copy Number Variation

Both segmental average values and by gene values are available in the CCLE page. Note the gene level data can be derived from the segmental values and hence we load the segmental dataset and convert it to a GRanges object as follows:

library(ccleWrap)
seg = read.delim(system.file("CCLE_toy/CCLE_copynumber_2013-12-03.seg.txt", package="ccleWrap"))
library(GenomicRanges)
ccleCNSeg = GRanges(paste("chr", seg$Chromosome, sep=""), IRanges(seg$Start, seg$End)) 
mcols(ccleCNSeg) = seg[,-c(2,3,4)]
ccleCNSeg$CCLE_name = as.character(ccleCNSeg$CCLE_name)

Expression Sets

The expression and the corresponding pdata can be loaded as follows:

expr.dat_toy <- read.table(system.file("CCLE_toy/expr.dat_toy.csv",package="ccleWrap"), skip=0, sep=",", as.is=TRUE,header = TRUE, check.names=FALSE)

esif_toy <- read.table(system.file("CCLE_toy/esif_toy.csv",package="ccleWrap"), skip=0, sep=",", as.is=TRUE,header = TRUE, check.names=FALSE)

With the ExpressionSet function in the Biobase package, these data can be converted to an expression set.

library(Biobase)
ccleEx = ExpressionSet(as.matrix(expr.dat_toy))
pData(ccleEx) = esif_toy
annotation(ccleEx) = "hgu133plus2hsentrezg.db"

Pharmacological Dataset

The pharmacological data can be loaded as below.

ddata = read.csv(system.file("CCLE_toy/pharma_toy.csv",package="ccleWrap"), stringsAsFactors=FALSE)
nstring2vec = function(x, sep=",") {
  as.numeric(strsplit(x, sep)[[1]])
  }
dosemat = sapply(ddata[,5], nstring2vec)
activityMeds = sapply(ddata[,"Activity.Data..median."], nstring2vec)
activitySDs = sapply(ddata[,"Activity.SD"], nstring2vec)
lines = sub("_", ",", ddata[,1])
lineName = sapply(strsplit(lines, ","), "[", 1)
lineOrg = sapply(strsplit(lines, ","), "[", 2)
df = read.csv(system.file("CCLE_toy/pharma_toy.csv",package="ccleWrap"), stringsAsFactors=FALSE,h=TRUE)
nr = nrow(df)

csvname="CCLE_NP24.2009_Drug_data_2012.02.20.csv"
csvhash.md5="b64295ef99912d1d4bead76461d0e2a1"

recs = lapply(1:nr, function(x) parseCCLEline(df[x,]))
ccleRx = new("ccleSet", expts=recs, dateCreated=date(),
  csvname=csvname, csvhash.md5=csvhash.md5)

lwaldron/ccleWrap documentation built on May 21, 2019, 9 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

lwaldron/ccleWrap
wrapper structures and functions for Broad/Novartis CCLE resources

In lwaldron/ccleWrap: wrapper structures and functions for Broad/Novartis CCLE resources

Introduction

Copy Number Variation

Expression Sets

Pharmacological Dataset

R Package Documentation

Browse R Packages

We want your feedback!

lwaldron/ccleWrap wrapper structures and functions for Broad/Novartis CCLE resources

In lwaldron/ccleWrap: wrapper structures and functions for Broad/Novartis CCLE resources

Introduction

Copy Number Variation

Expression Sets

Pharmacological Dataset

R Package Documentation

Browse R Packages

We want your feedback!

lwaldron/ccleWrap
wrapper structures and functions for Broad/Novartis CCLE resources