writeSimContFastq: writeSimContFastq: Create fastq files with simulated k-mer...

Description Usage Arguments Details Value Author(s) References Examples

Description

Writes compressed fastq files where sequence sections contain concatenated k-mers which are uniformly distributed in the range of k-mers for given k. A fraction of the reads can be contaminated with one or more deterministic k-mers.

Usage

1
writeSimContFastq(k=6,nk=5,nSeq=10,pos=1,kIndex=1,nContam=nSeq,filename="simc.fq.gz")

Arguments

k

numeric. Length of k-mer. Default value is 6.

nk

numeric. Number of k-mers in each fastq read. Default value is 5.

nSeq

numeric. Number of simulated reads in fastq-file. Default value is 10.

pos

numeric. Determines at which position in sequence the k-mer is inserted. 1-based (1=first position).

kIndex

numeric. k-mer index of inserted k-mer. The k-mer index can be retreaved for a given k-mer with 'getKmerIndex'.

nContam

numeric. Absolute number of contaminated reads. The k-mer's are inserted at the firsts 'nContam' reads of the sequence array.

filename

character. Name of written (compressed) fastq file.

Details

The read headers are consequtive numbered. The phred quality values are equally set to 46 (='.') which represents a phred value of 13. This function is not designed for routine use. The random content fastq files can be used in order to measure the separation capabilities of hierarchical clustering mechanisms.

Value

None.

Author(s)

Wolfgang Kaisers

References

Cock PJA, Fields CJ, Goto N, Heuer ML, Rice PM The sanger fastq file format for sequences with quality scores and the Solexa/Illumina fastq variants. Nucleic Acids Research 2010 Vol.38 No.6 1767-1771

Examples

1

seqTools documentation built on May 2, 2019, 4:45 p.m.