tirClust | R Documentation |
Takes transposable elements clustered by VSEARCH,
packClust
, and produces consensus sequences
for the terminal inverted repeats of each. Allows for the
visualisation of TIR similarities between clusters for both
forward and reverse strands.
tirClust( packMatches, Genome, tirLength = 25, plot = TRUE, plotSavePath = NULL, k = 5, output = "consensus" )
packMatches |
A dataframe containing genomic ranges and names referring
to sequences to be extracted. This dataframe is in the format
produced by coercing a
Must contain the following features:
|
Genome |
A DNAStringSet object containing sequences referred to in
|
tirLength |
The TIR size to be considered. Consensus sequences will
be generated based on the first and last |
plot |
Argument specifying whether the TIR consensus sequences should be plottted as a dendrogram. |
plotSavePath |
File path for the dendrogram plot. If unspecified, the dendrogram plot is not saved. |
k |
The k-mer size to be used for calculating a distance
matrix between TIR consensus sequences. See
|
output |
Controls the output of |
If output
is specified as "consensus" (default),
returns a list of consensus sequences for each cluster
specified in packMatches
as a
DNAStringSet
.
Else if output
is specified as "dendrogram",
returns a dendrogram object used to create hierarchical
clustering diagrams.
Jack Gisby
codepackClust, codepackAlign,
kdistance
,
DNAStringSet
,
as.alignment
, packSearch
data(arabidopsisThalianaRefseq) data(packMatches) tirClust(packMatches, arabidopsisThalianaRefseq)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.