ThresholdExtract: Utility for retrieving species sequences givin a dataframe of...

Description Usage Arguments Value Author(s) Examples

View source: R/ThresholdExtract.R

Description

User defines a threshold and is returned a dataframe containing only the samples/species with at least that many sequences. This dataframe can be fed into ExtractBuscoSeqs() as the busco_table parameter

Usage

1
ThresholdExtract(aligned_dir, missing_df, threshold_fasta_folder)

Arguments

aligned_dir

Directory where fasta alignments can be found

missing_df

Dataframe created by ParseBuscoResults() or MissingDataTable()

threshold_fasta_folder

Folder to store extracted fasta files

Value

Reads through a dataframe of missing and extracts sequences from the aligned_dir and writes to threshold_fasta_folder. Fasta files must be realigned and this can be accomplished with MafftOrientAlign()

Author(s)

Dustin J Wcisel, djwcisel@ncsu.edu

James Thomas Howard, jthowar3@ncsu.edu

Jeffrey A Yoder, jayoder@ncsu.edu

Alex Dornburg, dornburgalex@gmail.com

Examples

1
ThresholdExtract(aligned_dir = ad, missing_df = threshold_df, threshold_fasta_folder = "/path/to/store/fastas/threshold100")

djwcisel/toast documentation built on Nov. 4, 2019, 10:55 a.m.