clusterKmers: Cluster k-mers
In sarks: Suffix Array Kernel Smoothing for discovery of correlative sequence motifs and multi-motif domains

Takes a set of k-mer sequences and returns a list partitioning the input k-mers into clusters of similar k-mers. Hierarchical clustering (average linkage) is performed using the Jaccard distance, treating each k-mer as the set of all tetramers (or, more generally, all substrings of length k) that occur within it.
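The description above can be sketched in base R. This is a minimal illustration of the stated technique (tetramer sets, Jaccard distance, average-linkage clustering), not the package's actual implementation; the helpers `subKmers` and `jaccardDist` are hypothetical names introduced here, and the sketch assumes every input k-mer is at least `k` characters long.

```r
# Hypothetical helper: the set of all length-k substrings of one k-mer
subKmers <- function(kmer, k = 4) {
    starts <- seq_len(max(0, nchar(kmer) - k + 1))
    unique(substring(kmer, starts, starts + k - 1))
}

# Hypothetical helper: Jaccard distance = 1 - |intersection| / |union|
jaccardDist <- function(a, b) {
    1 - length(intersect(a, b)) / length(union(a, b))
}

kmers <- c('CAGCCTGG', 'CAGCCTG', 'CCTGGAAC', 'CTGGAAC')
sets <- lapply(kmers, subKmers)

# Pairwise distance matrix over the tetramer sets
n <- length(kmers)
d <- matrix(0, n, n, dimnames = list(kmers, kmers))
for (i in seq_len(n)) {
    for (j in seq_len(n)) {
        d[i, j] <- jaccardDist(sets[[i]], sets[[j]])
    }
}

# Average-linkage hierarchical clustering, cut into two clusters
tree <- hclust(as.dist(d), method = "average")
clusters <- cutree(tree, k = 2)
split(kmers, clusters)
```

Here 'CAGCCTGG' and 'CAGCCTG' share four of their five distinct tetramers, so their Jaccard distance is 0.2, which is why they end up in the same cluster.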

clusterKmers(kmers, k = 4, nClusters = NULL, maxClusters = NULL, directional = TRUE)

`kmers`	character vector or XStringSet of k-mers to partition into clusters
`k`	length of the sub-k-mers (default k=4, i.e. tetramers) used to calculate Jaccard distances for clustering
`nClusters`	number of clusters to partition kmers into; if NULL (the default), the number of clusters is chosen to maximize the average silhouette score (https://en.wikipedia.org/wiki/Silhouette_(clustering)).
`maxClusters`	if nClusters is not specified, optional maximum number of clusters allowed in the silhouette score optimization.
`directional`	logical value: if FALSE, each k-mer is treated as equivalent to its reverse-complement. This makes sense only when applied to DNA sequences!
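To make the `nClusters = NULL` behaviour more concrete, the following base-R sketch picks the number of clusters that maximizes the mean silhouette width. It is an illustration under assumptions, not the package's code: `meanSilhouette` is a hypothetical helper, singleton clusters are scored 0 by convention, and the candidate range (2 up to `maxClusters`, capped at one less than the number of k-mers) is a guess at a reasonable search range.

```r
kmers <- c('CAGCCTGG', 'CAGCCTG', 'CCTGGAAC', 'CTGGAAC', 'CACCTGC', 'ACCTGC')

# Tetramer sets and pairwise Jaccard distances
tetramers <- lapply(kmers, function(s) {
    unique(substring(s, 1:(nchar(s) - 3), 4:nchar(s)))
})
idx <- seq_along(kmers)
d <- outer(idx, idx, Vectorize(function(i, j) {
    1 - length(intersect(tetramers[[i]], tetramers[[j]])) /
        length(union(tetramers[[i]], tetramers[[j]]))
}))

# Hypothetical helper: mean silhouette width of a labelling
meanSilhouette <- function(d, labels) {
    widths <- vapply(seq_len(nrow(d)), function(i) {
        own <- labels == labels[i]
        if (sum(own) == 1) return(0)  # convention: singletons score 0
        a <- sum(d[i, own]) / (sum(own) - 1)              # mean distance within own cluster
        b <- min(tapply(d[i, !own], labels[!own], mean))  # mean distance to nearest other cluster
        (b - a) / max(a, b)
    }, numeric(1))
    mean(widths)
}

tree <- hclust(as.dist(d), method = "average")

maxClusters <- 5  # assumed cap, playing the role of the maxClusters argument
candidates <- 2:min(maxClusters, length(kmers) - 1)
scores <- vapply(candidates, function(nc) {
    meanSilhouette(d, cutree(tree, k = nc))
}, numeric(1))
bestN <- candidates[which.max(scores)]
bestN
```

For these six k-mers the three natural pairs give the highest mean silhouette, so `bestN` is 3.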

A list of character vectors (or XStringSet objects, matching the class of the kmers argument) that partitions kmers into clusters: the i-th element of the output list contains the elements of kmers assigned to cluster i.
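Because the return value is a list with one element per cluster, it can be convenient to flatten it into a lookup from k-mer to cluster index. The sketch below uses a hand-written list with the same shape as the documented return value; it is illustrative data, not actual `clusterKmers` output.

```r
# Illustrative list shaped like the documented return value
# (NOT actual clusterKmers output)
clusters <- list(
    c('CAGCCTGG', 'CAGCCTG'),
    c('CCTGGAAC', 'CTGGAAC', 'CCTGGAA')
)

# Named integer vector: k-mer -> index of the cluster containing it
membership <- rep(seq_along(clusters), lengths(clusters))
names(membership) <- unlist(clusters)

membership[['CTGGAAC']]  # cluster index for one k-mer
lengths(clusters)        # cluster sizes
```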

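library(sarks)

## k-mers of mixed lengths to be grouped into clusters
kmers <- c(
    'CAGCCTGG', 'CCTGGAA', 'CAGCCTG', 'CCTGGAAC', 'CTGGAACT',
    'ACCTGC', 'CACCTGC', 'TGGCCTG', 'CACCTG', 'TCCAGC',
    'CTGGAAC', 'CACCTGG', 'CTGGTCTA', 'GTCCTG', 'CTGGAAG', 'TTCCAGC'
)
## directional=FALSE: treat each k-mer as equivalent to its reverse-complement
clusterKmers(kmers, directional=FALSE)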
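The example above sets `directional=FALSE`. One way to see why that matters is to compare Jaccard distances with and without reverse-complement information. The sketch below is an assumption about how strand-insensitive comparison could be done (pooling the tetramers of a k-mer with those of its reverse-complement), not a description of the package's internals; `revComp`, `tetramerSet`, and `jaccardDist` are hypothetical helpers.

```r
# Hypothetical helper: reverse-complement of DNA strings (A/C/G/T only)
revComp <- function(kmers) {
    vapply(kmers, function(s) {
        paste(rev(strsplit(chartr("ACGT", "TGCA", s), "")[[1]]), collapse = "")
    }, character(1), USE.NAMES = FALSE)
}

# Hypothetical helper: tetramer set, optionally pooled with the
# tetramers of the reverse-complement (assumed strand-insensitive variant)
tetramerSet <- function(kmer, directional = TRUE) {
    seqs <- if (directional) kmer else c(kmer, revComp(kmer))
    unique(unlist(lapply(seqs, function(s) {
        substring(s, 1:(nchar(s) - 3), 4:nchar(s))
    })))
}

jaccardDist <- function(a, b) {
    1 - length(intersect(a, b)) / length(union(a, b))
}

revComp('TTCCAGC')

# Strand-sensitive: no shared tetramers
dDirectional <- jaccardDist(tetramerSet('TTCCAGC'), tetramerSet('CTGGAAC'))

# Strand-insensitive: the reverse-complement of TTCCAGC overlaps CTGGAAC
dUndirected <- jaccardDist(tetramerSet('TTCCAGC', directional = FALSE),
                           tetramerSet('CTGGAAC', directional = FALSE))
```

Under this assumed scheme, 'TTCCAGC' and 'CTGGAAC' are maximally distant when strand matters but become close neighbours once reverse-complements are taken into account.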

sarks documentation built on Nov. 8, 2020, 6:54 p.m.
