simRank | R Documentation |
Computes the SimRank similarity (number of shared unique k-mers over the smallest number of unique k-mers.)
simRank(x, k = 7)
x |
an object of class DNAStringSet containing the sequences. |
k |
size of used k-mers. |
distSimRank()
returns 1-simRank()
.
simRank()
returns a similarity object of class "simil" (see proxy).
distSimRank()
returns a dist object.
Michael Hahsler
Santis et al, Simrank: Rapid and sensitive general-purpose k-mer search tool, BMC Ecology 2011, 11:11
### load sequences
sequences <- readDNAStringSet(system.file("examples/DNA_example.fasta",
package="rMSA"))
sequences
### compute similarity
simil <- simRank(sequences)
### use hierarchical clustering
hc <- hclust(distSimRank(sequences))
plot(hc)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.