Description Usage Arguments Details Value Author(s) References Examples
Reads (compressed) fastq files and counts for given DNA k-mer subset for each position in sequence. The k-mer subset is given by a vector of k-mer indices. k-mer indices can be obtained from DNA k-mers with the function getKmerIndex
.
1 | fastqKmerSubsetLocs(filenames,k=4,kIndex)
|
filenames |
|
k |
|
kIndex |
|
Maximal allowed value for k is 12.
list
. The length of the list equals the number of given filenames. Contains for each given file a matrix. Each matrix has one row for each given kIndex
and an additional row with counts for all other DNA k-mers (labeled other
). The number of columns equals the maximal sequence length in the fastq file.
Wolfgang Kaisers
Cock PJA, Fields CJ, Goto N, Heuer ML, Rice PM The sanger fastq file format for sequences with quality scores and the Solexa/Illumina fastq variants. Nucleic Acids Research 2010 Vol.38 No.6 1767-1771
1 2 3 4 5 6 | basedir<-system.file("extdata",package="seqTools")
setwd(basedir)
k<-4
kMers<-c("AAAA","AACC","AAGG")
kIdx<-kMerIndex(kMers)
res<-fastqKmerSubsetLocs("test_l6.fq",k,kIdx)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.