import.msf | R Documentation |
Reads a Multiple Sequence Alignment (MSA) file in MSF format (.msf extension).
import.msf(file, aa.to.upper = TRUE, gap.to.dash = TRUE)
file |
a string of characters to indicate the name of the MSA file to be read. |
aa.to.upper |
a logical value indicating whether amino acids should be converted to upper case (TRUE) or not (FALSE). Default is TRUE. |
gap.to.dash |
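a logical value indicating whether the dot (.) and tilde (~) gap symbols should be converted to the dash (-) character (TRUE) or not (FALSE). Default is TRUE. |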
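The effect of the two logical arguments can be pictured with base R alone. The snippet below is only a sketch of the kind of conversion described above, not the package's own code; the vector `raw_seq` is a made-up example.

```r
# Hypothetical raw sequence as it might appear in an MSF file:
# lower-case residues, with '.' and '~' used as gap symbols.
raw_seq <- c("a", "c", "d", "~", "f", ".", "g")

# Analogue of aa.to.upper = TRUE: upper-case the residues.
# toupper() leaves '.' and '~' unchanged.
seq_upper <- toupper(raw_seq)

# Analogue of gap.to.dash = TRUE: replace '.' and '~' by '-'.
seq_clean <- ifelse(seq_upper %in% c(".", "~"), "-", seq_upper)

print(paste(seq_clean, collapse = ""))
```

With both options set to FALSE, the sequence would presumably be returned as read, that is, as in `raw_seq`.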
Multiple Sequence Format (MSF) was originally the multiple sequence alignment format of the Wisconsin Package (WP), also known as GCG (Genetics Computer Group). The Wisconsin Package is a suite of over 130 sequence analysis programs for database searching, secondary structure prediction and sequence alignment. Today, many multiple sequence alignment editors (Jalview and GeneDoc, for example) can read and write MSF files.
An MSF file has three parts:
a header containing the sequence identifiers and their characteristics (length, check and weight).
a separator made of two slashes (//).
the sequences, each preceded by its identifier, displayed in consecutive blocks.
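The three parts listed above can be illustrated with a toy file and a few lines of base R. This is a simplified sketch, not the implementation of import.msf: the file content is invented, the Check values are placeholders rather than real checksums, and the parsing assumes that every sequence line starts with its identifier.

```r
# Toy MSF content (invented; Check values are placeholders).
msf_lines <- c(
  "!!AA_MULTIPLE_ALIGNMENT 1.0",
  "",
  " toy.msf  MSF: 12  Type: P  Check: 0  ..",
  "",
  " Name: seq1  Len: 12  Check: 0  Weight: 1.00",
  " Name: seq2  Len: 12  Check: 0  Weight: 1.00",
  "",
  "//",
  "",
  "seq1  acdef.ghik",
  "seq2  acd~fgghik",
  "",
  "seq1  lm",
  "seq2  l."
)

# 1. The '//' separator splits the header from the sequence blocks.
sep    <- which(trimws(msf_lines) == "//")[1]
header <- msf_lines[seq_len(sep - 1)]
body   <- msf_lines[-seq_len(sep)]

# 2. Identifiers are read from the 'Name:' lines of the header.
name_lines <- grep("^[[:space:]]*Name:", header, value = TRUE)
ids <- sub("^[[:space:]]*Name:[[:space:]]*([^[:space:]]+).*$", "\\1", name_lines)

# 3. For each identifier, the fragments found in successive blocks
#    are concatenated and split into single characters.
body   <- trimws(body)
body   <- body[nzchar(body)]
fields <- strsplit(body, "[[:space:]]+")
toy_parse <- lapply(ids, function(id) {
  frags <- vapply(fields, function(f) {
    if (f[1] == id) paste(f[-1], collapse = "") else ""
  }, character(1))
  strsplit(paste(frags, collapse = ""), "")[[1]]
})
names(toy_parse) <- ids

str(toy_parse)
```

Real MSF files may contain additional lines (position rulers, for example) that this sketch does not handle.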
An object of class 'align', which is a named list whose elements correspond to sequences, in the form of character vectors.
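Because the returned value is a named list of character vectors, it can be inspected with ordinary list tools. The object below is built by hand to mimic that structure (the names and residues are invented), so the snippet runs without any alignment file.

```r
# Hand-made stand-in for an 'align' object: a named list of
# character vectors, one element per sequence (invented data).
aln_like <- structure(
  list(seq1 = c("A", "C", "-"),
       seq2 = c("A", "-", "D")),
  class = "align"
)

# Sequence identifiers and lengths.
ids  <- names(aln_like)
lens <- vapply(unclass(aln_like), length, integer(1))

# One row per sequence, one column per alignment position
# (valid only when all sequences have the same length).
mat <- do.call(rbind, unclass(aln_like))

print(ids)
print(lens)
print(mat)
```

The matrix form is convenient for column-wise operations, such as counting gaps at each alignment position with `colSums(mat == "-")`.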
The import.msf function was developed for the bios2mds R package (Julien PELE [aut], Jean-Michel BECU [aut], Marie CHABBERT [cre]).
It checks for duplicated identifiers in the header. Sequences whose identifiers are missing from the header are ignored.
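The two checks mentioned in this note can be sketched with base R. How import.msf reports a duplicate (error, warning or message) is not stated here, so the snippet only shows how such identifiers could be detected; `header_ids` and `body_ids` are hypothetical vectors, not taken from the same file.

```r
# Hypothetical identifiers read from a header in which 'seq1'
# is declared twice.
header_ids <- c("seq1", "seq2", "seq1")
dup_ids <- unique(header_ids[duplicated(header_ids)])

# Hypothetical identifiers found in the sequence blocks;
# 'seq3' is absent from the header and would be ignored.
body_ids <- c("seq1", "seq2", "seq3")
kept    <- body_ids[body_ids %in% header_ids]
ignored <- setdiff(body_ids, header_ids)

print(dup_ids)
print(kept)
print(ignored)
```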
Julien PELE
read.alignment function from seqinr package.
read.GDoc function from aaMI package (archived).
# Import an MSA file in MSF format
aln <- import.msf(system.file("msa/toy_align.msf", package = "Bios2cor"))