drugSensitivitySig-PharmacoSet-method: Creates a signature representing the association between gene...

drugSensitivitySig,PharmacoSet-methodR Documentation

Creates a signature representing the association between gene expression (or other molecular profile) and drug dose response, for use in drug sensitivity analysis.

Description

Given a Pharmacoset of the sensitivity experiment type, and a list of drugs, the function will compute a signature for the effect gene expression on the molecular profile of a cell. The function returns the estimated coefficient, the t-stat, the p-value and the false discovery rate associated with that coefficient, in a 3 dimensional array, with genes in the first direction, drugs in the second, and the selected return values in the third.

Usage

## S4 method for signature 'PharmacoSet'
drugSensitivitySig(
  object,
  mDataType,
  drugs,
  features,
  cells,
  tissues,
  sensitivity.measure = "auc_recomputed",
  molecular.summary.stat = c("mean", "median", "first", "last", "or", "and"),
  sensitivity.summary.stat = c("mean", "median", "first", "last"),
  returnValues = c("estimate", "pvalue", "fdr"),
  sensitivity.cutoff,
  standardize = c("SD", "rescale", "none"),
  molecular.cutoff = NA,
  molecular.cutoff.direction = c("less", "greater"),
  nthread = 1,
  parallel.on = c("drug", "gene"),
  modeling.method = c("anova", "pearson"),
  inference.method = c("analytic", "resampling"),
  verbose = TRUE,
  ...
)

Arguments

object

PharmacoSet a PharmacoSet of the perturbation experiment type

mDataType

character which one of the molecular data types to use in the analysis, out of dna, rna, rnaseq, snp, cnv

drugs

character a vector of drug names for which to compute the signatures. Should match the names used in the PharmacoSet.

features

character a vector of features for which to compute the signatures. Should match the names used in correspondant molecular data in PharmacoSet.

cells

character allows choosing exactly which cell lines to include for the signature fitting. Should be a subset of sampleNames(pSet)

tissues

character a vector of which tissue types to include in the signature fitting. Should be a subset of sampleInfo(pSet)$tissueid

sensitivity.measure

character which measure of the drug dose sensitivity should the function use for its computations? Use the sensitivityMeasures function to find out what measures are available for each PSet.

molecular.summary.stat

character What summary statistic should be used to summarize duplicates for cell line molecular profile measurements?

sensitivity.summary.stat

character What summary statistic should be used to summarize duplicates for cell line sensitivity measurements?

returnValues

character Which of estimate, t-stat, p-value and fdr should the function return for each gene drug pair?

sensitivity.cutoff

numeric Allows the user to binarize the sensitivity data using this threshold.

standardize

character One of "SD", "rescale", or "none", for the form of standardization of the data to use. If "SD", the the data is scaled so that SD = 1. If rescale, then the data is scaled so that the 95% interquantile range lies in [0,1]. If none no rescaling is done.

molecular.cutoff

Allows the user to binarize the sensitivity data using this threshold.

molecular.cutoff.direction

character One of "less" or "greater", allows to set direction of binarization.

nthread

numeric if multiple cores are available, how many cores should the computation be parallelized over?

parallel.on

One of "gene" or "drug", chooses which level to parallelize computation (by gene, or by drug).

modeling.method

One of "anova" or "pearson". If "anova", nested linear models (including and excluding the molecular feature) adjusted for are fit after the data is standardized, and ANOVA is used to estimate significance. If "pearson", partial correlation adjusted for tissue of origin are fit to the data, and a Pearson t-test (or permutation) test are used. Note that the difference is in whether standardization is done across the whole dataset (anova) or within each tissue (pearson), as well as the test applied.

inference.method

Should "analytic" or "resampling" (permutation testing + bootstrap) inference be used to estimate significance. For permutation testing, QUICK-STOP is used to adaptively stop permutations. Resampling is currently only implemented for "pearson" modelling method.

verbose

logical 'TRUE' if the warnings and other informative message shoud be displayed

...

additional arguments not currently fully supported by the function

Value

array a 3D array with genes in the first dimension, drugs in the second, and return values in the third.

Examples

data(GDSCsmall)
drug.sensitivity <- drugSensitivitySig(GDSCsmall,
  mDataType = "rna",
  nthread = 1, features = fNames(GDSCsmall, "rna")[1]
)
print(drug.sensitivity)


bhklab/PharmacoGx documentation built on April 18, 2024, 3:13 a.m.