| cluster_sequences | R Documentation |
Performs clustering on sequence data using specified dissimilarity measures
and clustering methods. The sequences are first converted to strings
and compared using the stringdist package.
cluster_sequences(
data,
k,
dissimilarity = "hamming",
method = "pam",
na_syms = c("*", "%"),
weighted = FALSE,
lambda = 1,
...
)
## S3 method for class 'tna_clustering'
print(x, ...)
data |
A |
k |
An |
dissimilarity |
A |
method |
A |
na_syms |
A |
weighted |
A |
lambda |
A |
... |
Additional arguments passed to |
x |
A |
A tna_clustering object which is a list containing:
data: The original data.
k: The number of clusters.
assignments: An integer vector of cluster assignments.
silhouette: Silhouette score measuring clustering quality.
sizes: An integer vector of cluster sizes.
method: The clustering method used.
distance: The distance matrix.
data <- data.frame(
T1 = c("A", "B", "A", "C", "A", "B"),
T2 = c("B", "A", "B", "A", "C", "A"),
T3 = c("C", "C", "A", "B", "B", "C")
)
# PAM clustering with optimal string alignment (default)
result <- cluster_sequences(data, k = 2)
print(result)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.