scoring: Score Copy Number Profile

View source: R/scoring.R

scoringR Documentation

Score Copy Number Profile

Description

Returns quantification of copy number profile and events including tandem duplication and Chromothripisis etc. Only copy number data from autosome is used here. Some of the quantification methods are rough, you use at your risk. You should do some extra work to check the result scores.

Usage

scoring(object, TD_size_cutoff = c(1000, 1e+05, 2e+06), TD_cn_cutoff = Inf)

Arguments

object

a object of CopyNumber.

TD_size_cutoff

a length-3 numeric vector used to specify the start, midpoint, end segment size for determining tandem duplication size range, midpoint is used to split TD into short TD and long TD. Default is 1Kb to 100Kb for short TD, 100Kb to 2Mb for long TD.

TD_cn_cutoff

a number defining the maximum copy number of TD, default is Inf, i.e. no cutoff.

Value

a data.table with following scores:

  • cnaBurden: CNA burden representing the altered genomic fraction as previously reported.

  • cnaLoad: CNA load representing the quantity of copy number alteration.

  • MACN: mean altered copy number (MACN) reflecting the property of altered copy number segments, calculated as

    MACN = \frac{\sum_{i} CN_i}{N_{cnv}}

    where CN_i is the copy number of altered segment i, N_{cnv} is the number of CNV.

  • weightedMACN: same as MACN but weighted with segment length.

    MACN_{weighted} = \frac{\sum_{i} (CN_i \times L_{i})}{ \sum_{i} L_{i} }

    where L_{i} is the length of altered copy number segment i.

  • Ploidy: ploidy, the formula is same as weightedMACN but using all copy number segments instead of altered copy number segments.

  • TDP_pnas: tandem duplication phenotype score from ⁠https://www.pnas.org/doi/10.1073/pnas.1520010113⁠, the threshold k in reference is omitted.

    TDP = - \frac{\sum_{chr} |TD_{obs}-TD_{exp}|}{TD_{total}}

    where TD_{total} is the number of TD, TD_{obs} and TD_exp are observed number of TD and expected number of TD for each chromosome.

  • TDP: tandem duplication score used defined by our group work, TD represents segment with copy number greater than 2.

    TD = \frac{TD_{total}}{\sum_{chr} |TD_{obs}-TD_{exp}|+1}

  • sTDP: TDP score for short TD.

  • lTDP: TDP score for long TD.

  • TDP_size : TDP region size (Mb).

  • sTDP_size: sTDP region size (Mb).

  • lTDP_size: lTDP region size(Mb).

  • Chromoth_state: chromothripsis state score, according to reference \Sexpr[results=rd]{tools:::Rd_expr_doi("10.1016/j.cell.2013.02.023")}, chromothripsis frequently leads to massive loss of segments on the affected chromosome with segmental losses being interspersed with regions displaying normal (disomic) copy-number (e.g., copy-number states oscillating between copy-number = 1 and copy-number = 2), form tens to hundreds of locally clustered DNA rearrangements. Most of methods use both SV and CNV to infer chromothripsis, here we roughly quantify it with

    \sum_{chr}{N_{OsCN}^2}

    where N_{OsCN} is the number of oscillating copy number pattern "2-1-2" for each chromosome.

Examples

# Load copy number object
load(system.file("extdata", "toy_copynumber.RData",
  package = "sigminer", mustWork = TRUE
))

d <- scoring(cn)
d

d2 <- scoring(cn, TD_cn_cutoff = 4L)
d2

sigminer documentation built on May 29, 2024, 3:11 a.m.