View source: R/tr_anno_cleaner.R
tr_anno_cleaner | R Documentation |
Given an input CSV or TSV annotation file from
https://pseudomonas.com, separates and cleans the data, returning a tidy
tibble with the following columns: "locus_tag", "gene_name", and
"product_name". Setting the extra_cols
argument to TRUE
will add the
columns "start", "end", and "strand". Enabling fill_names
will populate
missing gene names with the corresponding locus tag.
tr_anno_cleaner(input_file, extra_cols = FALSE, fill_names = FALSE)
input_file |
Path to the input TSV or CSV file. |
extra_cols |
Logical to determine if start, end, and strand columns
should be included. Defaults to |
fill_names |
Logical to determine if blank/NA genes names should be
filled in with corresponding locus tag. Defaults to |
A data frame (tibble) of the cleaned input file
Download annotation files from https://pseudomonas.com
https://www.github.com/travis-m-blimkie/tRavis
tr_anno_cleaner(
input_file = paste0(
"https://pseudomonas.com/downloads/pseudomonas/pgd_r_22_1/",
"Pseudomonas_aeruginosa_PAO1_107/Pseudomonas_aeruginosa_PAO1_107.csv.gz"
)
)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.