alignSeqs: Automated multiple sequence alignment
In domingoscardoso/catGenes: Concatenating multiple DNA alignments and other phylogenetic features

alignSeqs

R Documentation

Automated multiple sequence alignment

Description

Perform automated multiple sequence alignment with msa package based either on ClustalW or Muscle algorithms. The function uses one or multiple FASTA-formatted files to perform alignments and may save the aligned sequences in FASTA, NEXUS or PHYLIP format.

Usage

alignSeqs(filepath = GenBank_accessions,
          method = NULL,
          gapOpening = "default",
          format = "NEXUS",
          verbose = TRUE,
          dir = "RESULTS_alignSeqs",
          filename = NULL)

Arguments

`filepath`	Path to the directory where the FASTA-formatted DNA alignments are stored.
`method`	Specifies the multiple sequence alignment to be used. Currently, "ClustalW" and "Muscle" are supported.
`gapOpening`	Gap opening penalty; the defaults are specific to the algorithm (see `msaClustalW` and `msaMuscle)`. Note that the sign of this parameter is ignored. The sign is automatically adjusted such that the called algorithm penalizes gaps instead of rewarding them.
`format`	Define either "NEXUS", "FASTA" or "PHYLIP" for writing the resulting aligned DNA sequences in such formats. The default is to save the aligned sequences in a NEXUS-formatted file.
`verbose`	Logical, if `FALSE`, a message showing each step during the multiple sequence alignment will not be printed in the console in full.
`dir`	The path to the directory where the mined DNA sequences in a fasta format file will be saved provided that the argument `save` is set up in `TRUE`. The default is to create a directory named RESULTS_alignSeqs and the sequences will be saved within a subfolder named after the current date.
`filename`	A name or a vector of names of the output file(s) to be saved. The default is to create output file(s) named based on the original name of the input file(s) but also including an identifier suffix "aligned".

Author(s)

Domingos Cardoso

Examples

## Not run: 
library(catGenes)

data(GenBank_accessions)

folder_name_mined_seqs <- paste0("RESULTS_mineSeq/", todaydate)

mineSeq(inputdf = GenBank_accessions,
        gb.colnames = c("ETS", "ITS", "matK", "petBpetD", "trnTF", "Xdh"),
        as.character = FALSE,
        verbose = TRUE,
        save = TRUE,
        dir = "RESULTS_mineSeq",
        filename = "GenBanK_seqs")

alignSeqs(filepath = folder_name_mined_seqs,
          method = "ClustalW",
          gapOpening = "default",
          format = "NEXUS",
          verbose = TRUE,
          dir = "RESULTS_alignSeqs")

## End(Not run)

domingoscardoso/catGenes documentation built on March 29, 2025, 9:51 p.m.

domingoscardoso/catGenes index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

domingoscardoso/catGenes
Concatenating multiple DNA alignments and other phylogenetic features

alignSeqs: Automated multiple sequence alignment
In domingoscardoso/catGenes: Concatenating multiple DNA alignments and other phylogenetic features

Automated multiple sequence alignment

Description

Usage

Arguments

Author(s)

See Also

Examples

Related to alignSeqs in domingoscardoso/catGenes...

R Package Documentation

Browse R Packages

We want your feedback!

domingoscardoso/catGenes Concatenating multiple DNA alignments and other phylogenetic features

alignSeqs: Automated multiple sequence alignment In domingoscardoso/catGenes: Concatenating multiple DNA alignments and other phylogenetic features

Automated multiple sequence alignment

Description

Usage

Arguments

Author(s)

See Also

Examples

Related to alignSeqs in domingoscardoso/catGenes...

R Package Documentation

Browse R Packages

We want your feedback!

domingoscardoso/catGenes
Concatenating multiple DNA alignments and other phylogenetic features

alignSeqs: Automated multiple sequence alignment
In domingoscardoso/catGenes: Concatenating multiple DNA alignments and other phylogenetic features