fastqKmerLocs: fastqKmerLocs function: Counts DNA k-mers position wise from...


Description

Reads (optionally gzip-compressed) FASTQ files and counts DNA k-mers separately for each start position in the sequences.

Usage

fastqKmerLocs(filenames,k=4)

Arguments

filenames

Vector of FASTQ file names. Files can be gzip-compressed.

k

Length of counted DNA k-mers.

Details

The maximum allowed value for k is 12.

Value

A list with one element per given file name. Each element is a matrix with 4^k rows (one per k-mer) and (maxSeqLen-k+1) columns (one per k-mer start position). Each entry holds the count of that k-mer starting at that position.
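To make the shape of the returned matrices concrete, the following base-R sketch counts k-mers per start position for a few in-memory sequences. It is an illustration only, not the seqTools implementation: the helper name count_kmer_locs, the lexicographic row order over A, C, G, T, and the choice to skip k-mers containing N are all assumptions of this sketch.

```r
# Naive position-wise k-mer counting in base R (illustration only).
# count_kmer_locs is a hypothetical helper, not part of seqTools.
count_kmer_locs <- function(seqs, k = 2) {
  bases <- c("A", "C", "G", "T")
  # Enumerate all 4^k k-mers; row order is an assumption of this sketch
  grid <- expand.grid(rep(list(bases), k), stringsAsFactors = FALSE)
  kmers <- sort(apply(grid, 1, paste, collapse = ""))
  max_len <- max(nchar(seqs))
  n_pos <- max_len - k + 1
  counts <- matrix(0L, nrow = length(kmers), ncol = n_pos,
                   dimnames = list(kmers, seq_len(n_pos)))
  for (s in toupper(seqs)) {
    n <- nchar(s) - k + 1
    if (n < 1) next  # sequence shorter than k: nothing to count
    for (pos in seq_len(n)) {
      kmer <- substr(s, pos, pos + k - 1)
      # Assumption: k-mers with non-ACGT symbols (e.g. N) are skipped
      if (kmer %in% kmers) counts[kmer, pos] <- counts[kmer, pos] + 1L
    }
  }
  counts
}

toy <- count_kmer_locs(c("ACGTAC", "ACGNAC"), k = 2)
dim(toy)      # 16 rows (4^2) and 5 columns (6 - 2 + 1)
toy["AC", ]   # counts of "AC" at each start position
```

With two sequences of length 6 and k=2, the toy matrix has 4^2 = 16 rows and 6-2+1 = 5 columns, matching the dimensions described above.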

Note

The returned k-mer array has a static size of 4^k rows.
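Because the row count is fixed at 4^k, the matrix grows quickly with k. The short sketch below tabulates the number of rows for each allowed k and derives a rough memory figure. The 4 bytes per count and the 100 start positions are assumptions chosen for illustration, not documented properties of the package.

```r
# Number of matrix rows (4^k) for each allowed value of k
k <- 1:12
n_rows <- 4^k
data.frame(k = k, rows = n_rows)

# Rough memory estimate under assumed values (not documented by seqTools):
# 4 bytes per count and 100 k-mer start positions per file
bytes_per_count <- 4
n_positions <- 100
mib <- n_rows * n_positions * bytes_per_count / 2^20
round(mib[k == 12])
```

Under these assumed values, k=12 gives 16,777,216 rows and about 6400 MiB per file, so small values of k are the practical choice for long reads.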

Author(s)

Wolfgang Kaisers

References

Cock PJA, Fields CJ, Goto N, Heuer ML, Rice PM. The Sanger FASTQ file format for sequences with quality scores, and the Solexa/Illumina FASTQ variants. Nucleic Acids Research 2010, 38(6):1767-1771.

Examples

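library(seqTools)
# Locate the example FASTQ files shipped with the package
basedir <- system.file("extdata", package = "seqTools")
setwd(basedir)
# Count 2-mers per start position; each call returns a list with one matrix per file
res <- fastqKmerLocs("test_l10_ATCGN.fq", k = 2)
res <- fastqKmerLocs("test_l10_atcg.fq", k = 2)
res <- fastqKmerLocs("test_l6_multi_line.fq", k = 2)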

seqTools documentation built on May 2, 2019, 4:45 p.m.