sim_fq: sim_fq: Performs an experimental series of separation...
In wokai/seqTools: Analysis of nucleotide, sequence and quality content on fastq files.

Description Usage Arguments Details Value Author(s) Examples

Writes compressed FASTQ files where sequence sections contain concatenated k-mers which are uniformly distributed in the range of k-mers for given k. The function first writes a batch of randomly FASTQ files containing randomly simulated DNA sequence. In a second step the function repeatedly writes FASTQ files with random DNA sequence where a fraction of the reads is 'contaminated' with given DNA k-mers. In a third step, for each set of simulated and contaminated files, a hierarchical cluster (HC) tree based on DNA k-mers is calculated. For each set of files, the size of the smaller fraction in the first half of the tree is counted (perc). The value can be used as measure for separation capability of the HC algorithm.

1 2	sim_fq(nRep=2, nContamVec=c(100, 1000), grSize=20, nSeq=1e4, k=6, kIndex=1365, pos=20)

nRep

numeric. Number of replicates for each combination of each nContamVec value

nContamVec

numeric. Vector with nContam (absolute number of contaminated reads) values.

grSize

numeric. Number FASTQ files in control and contamination group.

nSeq

numeric. Number of reads per FASTQ file.

`k`	`numeric`. k value used in fastqq function.

kIndex

numeric. k-mer index of inserted k-mer(s). The k-mer index can be retreaved for a given k-mer with 'kMerIndex'. Default value is 1365 (="CCCCCC").

pos

numeric. Determines at which position in sequence the k-mer is inserted. 1-based (1=first position).

The function is intended to be used as explorative tool (not for routine quality assessment). There are some files written and there will be a lot of output on the terminal. It is therefore recommended to switch to a separate working directory and to run this function on a separate terminal. The function is not exported.

data.frame containing results of the counted perc values for each repetition of the simulation.

Wolfgang Kaisers

kMerIndex("CCCCCC")
## Not run: res <- seqTools:::sim_fq(nRep=2, nContamVec=c(10, 100),
                    grSize=4, nSeq=1e2)
## End(Not run)

wokai/seqTools documentation built on May 4, 2019, 9:46 a.m.

wokai/seqTools index

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

wokai/seqTools
Analysis of nucleotide, sequence and quality content on fastq files.

sim_fq: sim_fq: Performs an experimental series of separation...
In wokai/seqTools: Analysis of nucleotide, sequence and quality content on fastq files.

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Related to sim_fq in wokai/seqTools...

R Package Documentation

Browse R Packages

We want your feedback!

wokai/seqTools Analysis of nucleotide, sequence and quality content on fastq files.

sim_fq: sim_fq: Performs an experimental series of separation... In wokai/seqTools: Analysis of nucleotide, sequence and quality content on fastq files.

Description

Usage

Arguments

Details

Value

Author(s)

Examples

Related to sim_fq in wokai/seqTools...

R Package Documentation

Browse R Packages

We want your feedback!

wokai/seqTools
Analysis of nucleotide, sequence and quality content on fastq files.

sim_fq: sim_fq: Performs an experimental series of separation...
In wokai/seqTools: Analysis of nucleotide, sequence and quality content on fastq files.