alignSeqs | R Documentation |
Perform automated multiple sequence alignment with msa package based either on ClustalW or Muscle algorithms. The function uses one or multiple FASTA-formatted files to perform alignments and may save the aligned sequences in FASTA, NEXUS or PHYLIP format.
alignSeqs(filepath = GenBank_accessions,
method = NULL,
gapOpening = "default",
format = "NEXUS",
verbose = TRUE,
dir = "RESULTS_alignSeqs",
filename = NULL)
filepath |
Path to the directory where the FASTA-formatted DNA alignments are stored. |
method |
Specifies the multiple sequence alignment to be used. Currently, "ClustalW" and "Muscle" are supported. |
gapOpening |
Gap opening penalty; the defaults are specific to the
algorithm (see |
format |
Define either "NEXUS", "FASTA" or "PHYLIP" for writing the resulting aligned DNA sequences in such formats. The default is to save the aligned sequences in a NEXUS-formatted file. |
verbose |
Logical, if |
dir |
The path to the directory where the mined DNA sequences
in a fasta format file will be saved provided that the argument |
filename |
A name or a vector of names of the output file(s) to be saved. The default is to create output file(s) named based on the original name of the input file(s) but also including an identifier suffix "aligned". |
Domingos Cardoso
mineSeq
## Not run:
library(catGenes)
data(GenBank_accessions)
folder_name_mined_seqs <- paste0("RESULTS_mineSeq/", todaydate)
mineSeq(inputdf = GenBank_accessions,
gb.colnames = c("ETS", "ITS", "matK", "petBpetD", "trnTF", "Xdh"),
as.character = FALSE,
verbose = TRUE,
save = TRUE,
dir = "RESULTS_mineSeq",
filename = "GenBanK_seqs")
alignSeqs(filepath = folder_name_mined_seqs,
method = "ClustalW",
gapOpening = "default",
format = "NEXUS",
verbose = TRUE,
dir = "RESULTS_alignSeqs")
## End(Not run)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.