trim_fasta_names: Trim Representative Sequence Names

Description Usage Arguments Details Value Author(s) Examples

View source: R/trim_fasta_names.R

Description

This function trims the fasta IDs of the representative sequences to include only the machine name.

Usage

1
trim_fasta_names(repseq_file, trimmed_names, strip = FALSE)

Arguments

repseq_file

The name of the combined fasta file of representative sequences.

trimmed_names

The name of the modified fasta file written to disk.

strip

A logical. If TRUE, then the output fasta file is unaligned and any model sequences are removed. The default is FALSE.

Details

This function is a necessary prerequisite to renaming representative sequences with their corresponding OTU names.

Unaligning the sequences results in a smaller file size and is acceptable if the sequences are to be classified only. It may be desirable if using the web-based classifier because the smaller file size makes for shorter upload time. Unaligning must not be done if the sequences are to be treed.

Value

This function operates on files. It returns a message that it has completed. It is not normally assigned to a variable; if so, the variable contains only the message.

Author(s)

John Quensen

Examples

1
2
repseq.file <- system.file("extdata", "all_seq_complete.clust_rep_seqs.fasta", package="RDPutils")
trim_fasta_names(repseq_file = repseq.file, trimmed_names = "names_trimmed.fasta", strip = FALSE)

jfq3/RDPutils documentation built on Nov. 8, 2019, 1:05 p.m.