remove_duplicates: Removes multiple copies of one gene

Description Usage Arguments Details

View source: R/remove_duplicates.R

Description

remove_duplicates

Usage

1
remove_duplicates(df_to_update)

Arguments

df_to_update

data frame with a column name '$gene_name'. Expected to be 'GB_data', the result of pipeline so far.

Details

Any instances of one gene being recorded multiple times for a single accession number are found and reduced to a unique occurance.

Due to the structure of NCBI xml tree, after standardise_gene_names has been used, there are likely to be multiple occurances of one gene associated with a single accession number, these need to be removed.


EvolEcolGroup/mtDNAcombine documentation built on July 8, 2021, 10:30 p.m.