Description Usage Arguments Details Value Author(s) References See Also Examples
View source: R/StaggerAlignment.R
Staggers overlapping characters in a multiple sequence alignment that are better explained by multiple insertions than multiple deletions.
1 2 3 4 5 6 | StaggerAlignment(myXStringSet,
tree = NULL,
threshold = 3,
fullLength = FALSE,
processors = 1,
verbose = TRUE)
|
myXStringSet |
An |
tree |
A bifurcating |
threshold |
Numeric giving the ratio of insertions to deletions that must be met to stagger a region of the alignment. Specifically, the number of insertions divided by deletions must be less than threshold to stagger. |
fullLength |
Logical specifying whether the sequences are full-length ( |
processors |
The number of processors to use, or |
verbose |
Logical indicating whether to display progress. |
Multiple sequence aligners typically maximize true homologies at the expense of increased false homologies. StaggerAlignment
creates a “staggered alignment” which separates regions of the alignment that are likely not homologous into separate regions. This re-balances the trade-off between true positives and false positives by decreasing the number of false homologies at the loss of some true homologies. The resulting alignment is less aesthetically pleasing because it is widened by the introduction of many gaps. However, in an evolutionary sense a staggered alignment is more correct because each aligned position represents a hypothesis about evolutionary events: overlapping characters between any two sequences represent positions common to their ancestor sequence that may have evolved through substitution.
The single parameter threshold
controls the degree of staggering. Its value represents the ratio of insertions to deletions that must be crossed in order to stagger a region. A threshold
of 1
would mean any region that could be better explained by separate insertions than deletions should be staggered. A higher value for threshold
makes it more likely to stagger, and vise-versa. A very high value would conservatively stagger most regions with gaps, resulting in few false homologies but also fewer true homologies. The default value (3
) is intended to remove more false homologies than it eliminates in true homologies. It may be preferable to tailor the threshold
depending on the purpose of the alignment, as some downstream procedures (such as tree building) may be more or less sensitive to false homologies.
An XStringSet
of aligned sequences.
Erik Wright eswright@pitt.edu
Coming soon!
AdjustAlignment
, AlignSeqs
, IdClusters
1 2 3 4 5 | db <- system.file("extdata", "Bacteria_175seqs.sqlite", package="DECIPHER")
dna <- SearchDB(db, remove="all")
alignedDNA <- AlignSeqs(dna)
staggerDNA <- StaggerAlignment(alignedDNA)
BrowseSeqs(staggerDNA, highlight=1)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.