Compute.input.variables: Compute Mutation Counts from Tumor Exome Somatic Mutations

Description Usage Arguments Details Value Author(s) See Also Examples

Description

This function returns a data frame of mutation counts from the mutation information in the data argument.

Usage

1
Compute.input.variables(data, repeats, uniform.seq.len=38, seq.len = NULL)

Arguments

data

A data frame in the NGS "mutation annotation file" format. See NGStestdata for details.

repeats

A data frame indicating the genome coordinates of simple sequence repeats.

uniform.seq.len

The length of the capture sequence. This argument should be used when the capture sequences for all the samples have the same length. This argument will be ignored when seq is supplied.

seq.len

A data frame with two columns: Tumor_Sample_Barcode and the corresponding Sequence_Length (Mb). This should be provided if the capture sequences for the tumors have different lengths.

Details

This function computes 9 variables (listed below) from data. Mutations are one of two types: single nucleotide substitutions (SNSs) and short insertions/deletions (indels).

Value

A data frame with 9 columns that are the 9 variables listed above.

Author(s)

Mini Huang

See Also

MSIseq.train, MSIseq.classify,

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
16
17
18
19
## Not run: 
## load sample data: NGStestdata, NGStestseqLen

data(NGStestdata)
data(NGStestseqLen)

## download the Hg19repeats annotation file and load it
url <- 
"http://steverozen.net/data/Hg19repeats.rda"
file <- basename(url)
download.file(url, file)
load("Hg19repeats.rda")

## get mutation counts for test data 

test.mutationNum<-Compute.input.variables(NGStestdata, 
	repeats=Hg19repeats, seq.len = NGStestseqLen)

## End(Not run)

MSIseq documentation built on May 2, 2019, 7:23 a.m.