head_derep: Dereplicate fasta file based on fasta headers

View source: R/MDOP.R

Description

This function dereplicates sequences in a fasta file based on headers.

Usage

head_derep(target_dir, Seq_file)

Arguments

target_dir

A string giving the path to the target folder, e.g. "C:/A_file"

Seq_file

A string giving the path to the target file, e.g. "C:/seqfile.fas"

Details

The function runs with or without arguments on Windows or macOS.

Value

No value is returned. The results of head_derep are printed to the screen, and a file is created containing the unique records based on header.
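
To illustrate what dereplication by header means, here is a minimal base R sketch. It is not the MDOP implementation: the helper name derep_by_header and its arguments are hypothetical, and it assumes that two records are duplicates when their header lines match exactly and that the first occurrence is the one kept. head_derep may differ on both points.

```r
# Illustrative helper (hypothetical; not the MDOP implementation).
# Reads a fasta file, keeps the first record for each distinct header,
# and writes the kept records to out_file.
derep_by_header <- function(in_file, out_file) {
  lines <- readLines(in_file, warn = FALSE)
  lines <- lines[nzchar(trimws(lines))]       # drop blank lines
  is_header <- startsWith(lines, ">")         # header lines start with ">"
  if (length(lines) == 0 || !is_header[1]) {
    stop("Input does not look like fasta: first line must start with '>'")
  }
  record_id <- cumsum(is_header)              # record index for every line
  headers <- lines[is_header]
  keep_record <- !duplicated(headers)         # TRUE for first occurrence only
  kept <- lines[keep_record[record_id]]       # headers plus their sequence lines
  writeLines(kept, out_file)
  message(sum(keep_record), " unique of ", length(headers),
          " records written to ", out_file)
  invisible(sum(keep_record))
}

# Usage on a small temporary file with one repeated header
in_file <- tempfile(fileext = ".fas")
out_file <- tempfile(fileext = ".fas")
writeLines(c(">seq1", "ACGT", "ACGT", ">seq2", "GGCC", ">seq1", "TTTT"), in_file)
n_unique <- derep_by_header(in_file, out_file)
result <- readLines(out_file)
```

In this sketch the second ">seq1" record is dropped even though its sequence differs, because only the header is compared. Multi-line sequences stay attached to their header.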

Note

This function was created and tested using R version 3.5.1 (R Core Team 2018). It was tested on Windows 7 and 10 and on macOS Mojave version 10.14.6. Aside from base functions for this R version, no other packages were used.

Author(s)

Written by Rob Young and Jiaojia Yu at the University of Guelph in Ontario, Canada, October 2019.

References

https://github.com/HannerLab

Young, R. G., Yu, J., Cote, M. J., Hanner, R. H. (Submitted January 2020). Introducing MDOP: an R package to aid Molecular Data Organization for Publication to shared databases. Biodiversity Data Journal.

See Also

target_file_list(), recursive_copy(), max_packs(), copy_by_list(), degap(), rank_seq(), head_derep(), seq_derep(), multi_to_single_fasta()


rgyoung6/Molecular-Data-Organization-for-Publication-MDOP documentation built on Jan. 21, 2020, 12:12 a.m.