dada_to_fasta: Write DADA sequences to fasta.
In vmikk/metagMisc: Miscellaneous functions for metagenomic analysis

dada_to_fasta

R Documentation

Write DADA sequences to fasta.

Description

Rename sequences to their hash values and write them to fasta file.

Usage

dada_to_fasta(seqtab, out = "DADA2.fasta", hash = "sha1", ...)

Arguments

`seqtab`	dada-class or derep-class objects
`out`	Output file name (fasta)
`hash`	Hash function to use: "sha1" (default), "sha256", "md5"
`...`	Additional parameters passed on to `uniquesToFasta`

Details

This function relabels sequences using diffetent message digest algorithms applied to each sequence. This approach guarantees (with a very high probability) that FASTA entries from different projects with identical names will also have identical sequences. MD5 algorithm generates a 128-bit digest that is represented by 32 hexadecimal characters. SHA1 generates a 160-bit digest that is represented by 40 hexadecimal characters. SHA256 generates a 256-bit digest that is represented by 64 hexadecimal characters. The probability of a collision (two non-identical sequences resulting in the same digest) is smaller for the SHA-algorithms than it is for the MD5 algorithm. Default hash function is SHA1 which should produce identical results with "–relabel_sha1" function of VSEARCH.

Value

Invisible returns sequence names in the VSEARCH/USEARCH style.

Examples

derep1 = derepFastq(system.file("extdata", "sam1F.fastq.gz", package="dada2"))
dada1 <- dada(derep1, err=tperr1)
seqtab <- getUniques(dada1)
dada_to_fasta(seqtab, out = "DADA2.fasta", hash = "sha1")

vmikk/metagMisc documentation built on April 13, 2025, 7:54 p.m.