View source: R/translate_cds_to_protein_all.R
translate_cds_to_protein_all | R Documentation |
A helper function that takes fasta
files storing coding sequences
as input and translates these coding sequences into amino acid sequences
storing them as fasta
output files.
translate_cds_to_protein_all(
input_folder,
output_folder,
delete_corrupt_cds = FALSE
)
input_folder |
file path to folder storing the coding sequences |
output_folder |
name or file path to a folder that that shall be generated to store the output |
delete_corrupt_cds |
delete_corrupt_cds a logical value indicating whether sequences with corrupt base triplets should be removed from the input |
Hajk-Georg Drost
## Not run:
# install.packages("biomartr")
# download coding sequences of Arabidopsis thaliana, Arabidopsis lyrata, and Capsella rubella
org_list <- c("Arabidopsis thaliana", "Arabidopsis lyrata", "Capsella rubella")
biomartr::getCDSSet(db = "refseq",
organism = org_list,
gunzip = TRUE,
path = "cds_examples")
# translate coding sequences into amino acid sequences
translate_cds_to_protein_all(input_folder = "cds_examples",
output_folder = "translated_seqs",
delete_corrupt_cds = FALSE)
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.