mineSeq | R Documentation |
An ape-based function that connects to the GenBank database, reads nucleotide sequences by accession number, and writes them to a FASTA-format file.
mineSeq(inputdf = NULL,
        gb.colnames = NULL,
        as.character = FALSE,
        verbose = TRUE,
        save = TRUE,
        dir = "RESULTS_mineSeq",
        filename = "GenBanK_seqs")
inputdf |
A data frame containing the taxon names in a 'Species' column, the voucher information in a 'Voucher' column, and the GenBank accessions for each gene in separate columns named after the corresponding gene. If the 'Species' and 'Voucher' columns are not provided in the data frame, the function uses the taxonomy of the retrieved sequences as originally available in GenBank. |
gb.colnames |
A vector with column names within the |
as.character |
A logical controlling whether to return the sequences as an object of class "DNAbin" (the default). |
verbose |
Logical, if |
save |
Logical, if |
dir |
The path to the directory where the mined DNA sequences in a fasta format file will be saved provided that the argument |
filename |
Name of the output file to be saved. The default is to create a file entitled GenBanK_seqs. |
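As a rough illustration of the layout that the inputdf argument described above expects, the sketch below builds a small data frame by hand. The taxon names, vouchers, and accession strings are hypothetical placeholders (not real GenBank records), and the gene column names are chosen only for the example.

```r
# Hypothetical input table: every value below is a placeholder,
# not a real taxon, voucher, or GenBank accession.
accessions <- data.frame(
  Species = c("Genus_speciesA", "Genus_speciesB"),
  Voucher = c("Collector 001", "Collector 002"),
  ITS     = c("XX000001", "XX000002"),
  matK    = c("XX000003", "XX000004"),
  stringsAsFactors = FALSE
)

# The gene columns are all columns other than 'Species' and 'Voucher';
# a vector like this is what would be passed as gb.colnames.
gene_cols <- setdiff(names(accessions), c("Species", "Voucher"))
print(gene_cols)
```

A table of this shape could then be passed as inputdf, with gene_cols as gb.colnames, once the placeholders are replaced with real accessions.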
A list of DNA sequences made of vectors of class 'DNAbin', or of single characters (if as.character = TRUE) with two attributes (species and description).
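To give a feel for the kind of object described above, the following sketch builds a toy 'DNAbin' list offline with ape and inspects it. It does not call mineSeq or GenBank: the sequences, names, and species labels are made up for illustration, and the 'species' attribute is set by hand on the assumption that it holds one label per sequence.

```r
library(ape)

# Toy sequences keyed by placeholder accession strings (not real records).
seqs <- as.DNAbin(list(
  XX000001 = c("a", "c", "g", "t"),
  XX000002 = c("a", "c", "g", "g", "t")
))

# Assumption: a 'species' attribute with one label per sequence.
attr(seqs, "species") <- c("Genus_speciesA", "Genus_speciesB")

class(seqs)            # the object is of class "DNAbin"
attr(seqs, "species")  # the species labels attached above

# Convert to plain single-character vectors, comparable to what
# as.character = TRUE is described as returning.
chars <- as.character(seqs)

# Write the toy sequences to a FASTA file in a temporary directory.
out <- file.path(tempdir(), "toy_seqs.fasta")
write.FASTA(seqs, file = out)
```

The same inspection steps (class(), attr(), as.character()) should apply to a list of 'DNAbin' sequences obtained from a real query, though the exact attributes present depend on the function's output.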
Domingos Cardoso
## Not run:
library(catGenes)
data(GenBank_accessions)
mineSeq(inputdf = GenBank_accessions,
        gb.colnames = c("ETS", "ITS", "matK", "petBpetD", "trnTF", "Xdh"),
        as.character = FALSE,
        verbose = TRUE,
        save = TRUE,
        dir = "RESULTS_mineSeq",
        filename = "GenBanK_seqs")
## End(Not run)