| tirClust | R Documentation |
Takes transposable elements clustered by VSEARCH,
packClust, and produces consensus sequences
for the terminal inverted repeats of each. Allows for the
visualisation of TIR similarities between clusters for both
forward and reverse strands.
tirClust( packMatches, Genome, tirLength = 25, plot = TRUE, plotSavePath = NULL, k = 5, output = "consensus" )
packMatches |
A dataframe containing genomic ranges and names referring
to sequences to be extracted. This dataframe is in the format
produced by coercing a
Must contain the following features:
|
Genome |
A DNAStringSet object containing sequences referred to in
|
tirLength |
The TIR size to be considered. Consensus sequences will
be generated based on the first and last |
plot |
Argument specifying whether the TIR consensus sequences should be plottted as a dendrogram. |
plotSavePath |
File path for the dendrogram plot. If unspecified, the dendrogram plot is not saved. |
k |
The k-mer size to be used for calculating a distance
matrix between TIR consensus sequences. See
|
output |
Controls the output of |
If output is specified as "consensus" (default),
returns a list of consensus sequences for each cluster
specified in packMatches as a
DNAStringSet.
Else if output is specified as "dendrogram",
returns a dendrogram object used to create hierarchical
clustering diagrams.
Jack Gisby
codepackClust, codepackAlign,
kdistance,
DNAStringSet,
as.alignment, packSearch
data(arabidopsisThalianaRefseq) data(packMatches) tirClust(packMatches, arabidopsisThalianaRefseq)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.