getASCN: Getting Allele-specific DNA Copy Number

Description Usage Arguments Value See Also Examples

Description

Given a set of breakpoints where parent-specific copy number changes, this function estimates the parent-specific copy number for each segment, and the haplotype for the major chromosome on segments where the two homologous chromosomes have different copy numbers. You are recommended to specify the parameter "rdep", the case-control genome-wide average coverage ratio. Usually, a good estimate of rdep is (total mapped reads in tumor)/(total mapped reads in normal).

Usage

1
2
getASCN(readMatrix, rdep=NULL, tauhat=NULL, threshold=0.15, pOri=c(0.49,0.51), 
error=1e-5, maxIter=1000)

Arguments

readMatrix

A data frame with four columns and the column names are "AN", "BN", "AT" and "BT". They are A-allele coverage in the tumor (case) sample, B-allele coverage in the tumor (case) sample, A-allele coverage in the normal (control) sample, and B-allele coverage in the normal (control) sample, respectively.

rdep

The case-control coverage ratio, i.e., the ratio of the total number of mapped reads in case sample versus that in the control sample. If this is not specified (NULL), then the value median(AT+BT)/median(AN+BN) will be used.

tauhat

The estimated break points. If it is not specified (NULL), then this function will first estimate the break points by calling the function "gettau", and then estimate the parent-specific DNA copy number for each segment.

threshold

The estimated copy number are set to be 1 if it differs from 1 by less than this threshold.

pOri, error, maxIter

Parameters used in estimating the success probabilities of the mixed binomial distribution. See the manuscript by Chen and Zhang for more details. "pOri" provides the initial success probabilities. The two values in pOri needs to be different. "error" provides the stopping criterion. "maxIter" is the maximum iterating steps if the stopping criterion is not achieved.

Value

tauhat

A vector holding the estimated break points in terms of the index in the coverage vectors.

ascn

The estimated parent-specific copy numbers in the segments between the break points in tauhat.

Haplotype

The estimated haplotype for the major chromosome (the chromosome has a higher copy number compared to its homologous chromosome) on segments where the two homologous chromosomes have different copy numbers.

See Also

getChangepoints, view

Examples

1
2
3
4
5
6
data(Example) 
cn = getASCN(readMatrix, tauhat=tauhat)
 # cn$tauhat would give the indices of change-points.  
 # cn$ascn would give the estimated allele-specific copy numbers for each segment.
 # cn$Haplotype[[i]] would give the estimated haplotype for the major chromosome in segment i 
 # if this segment has different copy numbers on the two homologous chromosomes.

Example output



falcon documentation built on May 2, 2019, 3:30 p.m.

Related to getASCN in falcon...