summary.subsamples: calculate summary statistics for each subsampled depth in a...

In StoreyLab/subSeq: Subsampling of high-throughput sequencing count data

Given a subsamples object, calculate a metric for each depth that summarizes the power, the specificity, and the accuracy of the effect size estimates at that depth.

## S3 method for class 'subsamples'
summary(object, oracle = NULL, FDR.level = 0.05,
  average = FALSE, p.adjust.method = "qvalue", ...)

`object`	a subsamples object
`oracle`	a subsamples object of one depth showing what each depth should be compared to; if NULL, each will be compared to the highest depth
`FDR.level`	A false discovery rate used to calculate the number of genes found significant at each level
`average`	If TRUE, averages over replications at each method+depth combination before returning
`p.adjust.method`	Method to correct p-values in order to determine significance. By default "qvalue", but can also be given any method that can be given to p.adjust.
`...`	further arguments passed to or from other methods.
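
As a rough illustration of how `FDR.level` and `p.adjust.method` interact, the sketch below counts significant genes from a vector of p-values using base R's `p.adjust`. The p-values are made up, and the `<=` comparison is an assumption; the package's internal thresholding may differ.

```r
# Hypothetical p-values for six genes (illustrative only, not subSeq output)
p <- c(0.001, 0.008, 0.02, 0.04, 0.3, 0.9)
FDR.level <- 0.05

# Benjamini-Hochberg adjustment, one of the methods accepted by p.adjust
adjusted <- p.adjust(p, method = "BH")

# Number of genes called significant at the chosen FDR level
n.significant <- sum(adjusted <= FDR.level)
print(n.significant)
```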

These calculations compare each depth to an "oracle" depth, which, if not given explicitly, is assumed to be the highest subsampling depth. The summary therefore shows how closely each depth agrees with the full experiment: if even very low-depth subsamples agree with it, the sequencing depth is high enough that reducing it makes little qualitative difference.

The concordance correlation coefficient is described in Lin 1989. Its advantage over the Pearson correlation is that it takes into account not only whether the coefficients, plotted against the oracle, lie close to a straight line, but also whether that line is close to the x = y line.
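
The difference between the two correlations can be seen in a small sketch. The helper `ccc` below is hypothetical (it is not the function subSeq uses internally) and follows Lin's definition with 1/n variances; the coefficient vectors are invented for illustration.

```r
# Lin's concordance correlation coefficient (hypothetical helper, 1/n moments)
ccc <- function(x, y) {
  n <- length(x)
  mx <- mean(x)
  my <- mean(y)
  sxy <- sum((x - mx) * (y - my)) / n  # covariance
  sx2 <- sum((x - mx)^2) / n           # variance of x
  sy2 <- sum((y - my)^2) / n           # variance of y
  2 * sxy / (sx2 + sy2 + (mx - my)^2)
}

# Made-up oracle coefficients and estimates that lie on a perfect line,
# but not on the x = y line
oracle <- c(-2, -1, 0, 1, 2)
estimates <- 2 * oracle + 1

pearson.value <- cor(oracle, estimates)      # perfect linear relationship
concordance.value <- ccc(oracle, estimates)  # penalised for the shift and scale
print(c(pearson = pearson.value, concordance = concordance.value))
```

Here the Pearson correlation is 1 because the points fall exactly on a line, while the concordance is lower because that line is not x = y.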

Note that selecting average=TRUE averages the depths of the replicates (as two subsamplings with identical proportions may have different depths by chance). This may lead to depths that are not integers.
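
A minimal sketch of why averaging can produce non-integer depths, using base R's `aggregate` on an invented table. The column names and the grouping by `proportion` are assumptions for illustration, not the package's actual implementation.

```r
# Hypothetical per-replicate results: two replicates at each proportion
reps <- data.frame(
  proportion  = c(0.1, 0.1, 0.5, 0.5),
  replication = c(1, 2, 1, 2),
  depth       = c(1003, 1010, 5001, 4998),  # depths differ by chance
  significant = c(12, 15, 40, 44)
)

# Average depth and significant counts over replicates at each proportion
averaged <- aggregate(cbind(depth, significant) ~ proportion,
                      data = reps, FUN = mean)
print(averaged)
```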

A summary object, which is a data.table with one row for each subsampling depth, containing the metrics

`significant`	number of genes found significant at the given FDR
`pearson`	Pearson correlation of the coefficient estimates with the oracle
`spearman`	Spearman correlation of the coefficient estimates with the oracle
`concordance`	Concordance correlation of the coefficient estimates with the oracle
`MSE`	mean squared error between the coefficient estimates and the oracle
`estFDP`	estimated FDP: the estimated false discovery proportion, as calculated from the average oracle local FDR within genes found significant at this depth
`rFDP`	relative FDP: the proportion of genes found significant at this depth that were not found significant in the oracle
`percent`	the percentage of genes found significant in the oracle that were found significant at this depth
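
To make the definitions of `MSE`, `rFDP` and `percent` concrete, the sketch below computes them by hand on invented data for six genes. All values are hypothetical, and `percent` is shown as a fraction between 0 and 1, which is an assumption about the scale.

```r
# Hypothetical coefficient estimates for six genes
oracle.coef <- c(2.0, 1.5, 1.0, 0.8, 0.1, 0.0)
depth.coef  <- c(2.0, 1.0, 1.0, 0.8, 0.6, 0.0)

# Hypothetical significance calls in the oracle and at a lower depth
oracle.sig <- c(TRUE, TRUE, TRUE, TRUE, FALSE, FALSE)
depth.sig  <- c(TRUE, TRUE, FALSE, FALSE, TRUE, FALSE)

# Mean squared error between the estimates and the oracle
MSE <- mean((depth.coef - oracle.coef)^2)

# Number of genes found significant at this depth
significant <- sum(depth.sig)

# Relative FDP: significant at this depth but not in the oracle
rFDP <- sum(depth.sig & !oracle.sig) / sum(depth.sig)

# Share of oracle-significant genes recovered at this depth
percent <- sum(depth.sig & oracle.sig) / sum(oracle.sig)

print(c(MSE = MSE, significant = significant, rFDP = rFDP, percent = percent))
```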

Lawrence I-Kuei Lin (March 1989). "A concordance correlation coefficient to evaluate reproducibility". Biometrics (International Biometric Society) 45 (1): 255-268.

# load the package that provides the example data and the summary method
library(subSeq)
# see the subsample function for how ss is generated
data(ss)
# summarise the subsamples object
ss.summary = summary(ss)

StoreyLab/subSeq documentation built on June 4, 2019, 12:09 a.m.
