revCountDnaKmers: revCountDnaKmers: Counting K-mers in DNA sequence.

Description Usage Arguments Details Value Author(s) Examples

Description

Counts DNA K-mers for reverse complement of given DNA sequence. The k-mers are counted in a set of search windows, which are defined by start and width parameter. From each position of the search window, a DNA k-mer is identified on the left hand side on the reverse complement of the given DNA sequence. Each value in the start vector defindes the right border of a search window. The size of the search window is given by the appropriate value in the width vector.

Usage

1

Arguments

dna

character. Single DNA sequence (vector of length 1). dna must not contain other characters as "ATCGN". Capitalization does not matter. When a 'N' character is found, the ongoing identification of a DNA k-mer is terminated.

k

numeric. Number of nucleotides in tabled DNA motifs.

start

numeric. Vector of (1-based) start positions for reading frames.

width

numeric. Defines number of k-mers (size of search window) for each start position. Must have the same length as start or length 1 (in which case the values of width are recycled.)

Details

The start positions for identification of DNA k-mers are all positions in {start-width+1,...,start}. In order to prevent counting before the first nucleotide of the DNA sequence, all start values must be >= width+k. The function throws an error when this border is exceeded.

Value

matrix. Each colum containts the motif-count values for one frame. Each row represents one DNA motif. The DNA sequence of the DNA motif is given as row.name.

Author(s)

Wolfgang Kaisers

Examples

1
2
rseq<-"TATTAT"
revCountDnaKmers(rseq,2,6:4,2)

Example output

Loading required package: zlibbioc
   1 2 3
AA 0 1 1
AC 0 0 0
AG 0 0 0
AT 1 0 1
CA 0 0 0
CC 0 0 0
CG 0 0 0
CT 0 0 0
GA 0 0 0
GC 0 0 0
GG 0 0 0
GT 0 0 0
TA 1 1 0
TC 0 0 0
TG 0 0 0
TT 0 0 0

seqTools documentation built on May 2, 2019, 4:45 p.m.