Home

/

GitHub

/

waldronlab/cBioPortalData

/

cBioDataPack: Obtain pre-packaged data from cBioPortal and represent as a...

cBioDataPack: Obtain pre-packaged data from cBioPortal and represent as a...
In waldronlab/cBioPortalData: Exposes and Makes Available Data from the cBioPortal Web Resources

View source: R/cBioDataPack.R

cBioDataPack

R Documentation

Obtain pre-packaged data from cBioPortal and represent as a MultiAssayExperiment object

Description

The cBioDataPack function allows the user to download and process cancer study datasets found in MSKCC's cBioPortal. Output datasets use the MultiAssayExperiment data representation to faciliate analysis and data management operations.

Usage

cBioDataPack(
  cancer_study_id,
  use_cache = TRUE,
  names.field = c("Hugo_Symbol", "Entrez_Gene_Id", "Gene"),
  cleanup = TRUE,
  ask = interactive(),
  check_build = TRUE
)

Arguments

`cancer_study_id`	`character(1)` The study identifier from cBioPortal as seen in the dataset links at https://www.cbioportal.org/datasets.
`use_cache`	`logical(1)` (default TRUE) create the default cache location and use it to track downloaded data. If data found in the cache, data will not be re-downloaded. A path can also be provided to data cache location.
`names.field`	`character()` Possible column names for the column that will used to label ranges for data such as mutations or copy number (defaults: "Hugo_Symbol", "Entrez_Gene_Id", "Gene", and "Composite.Element.REF"). Values are cycled through and eliminated when no data present, or duplicates are found. Values in the corresponding column must be unique in each row.
`cleanup`	`logical(1)` whether to delete the `untar`-red contents from the `exdir` folder (default TRUE)
`ask`	`logical(1)` Whether to prompt the the user before downloading and loading study `MultiAssayExperiment` that is not currently building based on previous testing. Set to `interactive()` by default. In a non-interactive session, data download will be attempted; equivalent to `ask = FALSE`. The argument will also be used when a cache directory needs to be created when using `downloadStudy`.
`check_build`	logical(1L) Whether to check the build status of the `studyId` using an internal dataset. This argument should be set to `FALSE` if using alternative `hostnames`, e.g., 'pedcbioportal.kidsfirstdrc.org'

Details

The full list of study identifiers (studyIds) can obtained from getStudies(). Currently, only ~ 72% of datasets can be represented as MultiAssayExperiment data objects from the data tarballs. Refer to getStudies(..., buildReport = TRUE) and its "pack_build" column to see which study identifiers are not building. Users who would like to prioritize particular datasets should open GitHub issues at the URL in the DESCRIPTION file. For a more fine-grained approach to downloading data from the cBioPortal API, refer to the cBioPortalData function.

Value

A MultiAssayExperiment object

cBio_URL

The cBioDataPack function accesses data from the cBio_URL option. By default, it points to an Amazon S3 bucket location. Previously, it pointed to 'http://download.cbioportal.org'. This recent change (> 2.1.17) should provide faster and more reliable downloads for all users. See the URL using cBioPortalData:::.url_location. This can be changed if there are mirrors that host this data by setting the cBio_URL option with getOption("cBio_URL", "https://some.url.com/") before running the function.

Author(s)

Levi Waldron, Marcel R., Ino dB.

Examples


cbio <- cBioPortal()

head(getStudies(cbio)[["studyId"]])

mae <- cBioDataPack("acc_tcga")

waldronlab/cBioPortalData documentation built on April 14, 2025, 12:11 a.m.

waldronlab/cBioPortalData index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

waldronlab/cBioPortalData
Exposes and Makes Available Data from the cBioPortal Web Resources

cBioDataPack: Obtain pre-packaged data from cBioPortal and represent as a...
In waldronlab/cBioPortalData: Exposes and Makes Available Data from the cBioPortal Web Resources

Obtain pre-packaged data from cBioPortal and represent as a MultiAssayExperiment object

Description

Usage

Arguments

Details

Value

cBio_URL

Author(s)

See Also

Examples

Related to cBioDataPack in waldronlab/cBioPortalData...

R Package Documentation

Browse R Packages

We want your feedback!

waldronlab/cBioPortalData Exposes and Makes Available Data from the cBioPortal Web Resources

cBioDataPack: Obtain pre-packaged data from cBioPortal and represent as a... In waldronlab/cBioPortalData: Exposes and Makes Available Data from the cBioPortal Web Resources

Obtain pre-packaged data from cBioPortal and represent as a MultiAssayExperiment object

Description

Usage

Arguments

Details

Value

cBio_URL

Author(s)

See Also

Examples

Related to cBioDataPack in waldronlab/cBioPortalData...

R Package Documentation

Browse R Packages

We want your feedback!

waldronlab/cBioPortalData
Exposes and Makes Available Data from the cBioPortal Web Resources

cBioDataPack: Obtain pre-packaged data from cBioPortal and represent as a...
In waldronlab/cBioPortalData: Exposes and Makes Available Data from the cBioPortal Web Resources