tidy_clara: Tidy CLARA (Clustering Large Applications)

View source: R/unsupervised-clustering.R

tidy_clara    R Documentation

Tidy CLARA (Clustering Large Applications)

Description

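Performs CLARA clustering, a scalable version of PAM (Partitioning Around Medoids) that runs PAM on random subsamples of the data and keeps the best set of medoids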

Usage

tidy_clara(data, k, metric = "euclidean", samples = 50, sampsize = NULL)

Arguments

data

A data frame or tibble

k

Number of clusters

metric

Distance metric (default: "euclidean")

samples

Number of samples to draw (default: 50)

sampsize

Number of observations in each sample (default: NULL, which resolves to min(n, 40 + 2*k), where n is the number of rows in data)
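
To make the default sample size concrete, the formula can be evaluated by hand. This is only a sketch: default_sampsize is a hypothetical helper written for illustration, not a tidylearn function, and it assumes n is the number of rows in data.

```r
# Hypothetical helper (not part of tidylearn): evaluates the documented
# default, min(n, 40 + 2*k), for a dataset with n rows and k clusters.
default_sampsize <- function(n, k) min(n, 40 + 2 * k)

default_sampsize(n = 1500, k = 3)  # 46: each sample holds 40 + 2*k rows
default_sampsize(n = 30, k = 3)    # 30: small data are used in full
```

With the default, each sample stays small regardless of how many rows data has, which is what keeps CLARA cheap on large inputs.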

Value

A list of class "tidy_clara" containing clustering results

Examples


# CLARA for large datasets
large_data <- iris[rep(1:nrow(iris), 10), 1:4]
clara_result <- tidy_clara(large_data, k = 3, samples = 50)
print(clara_result)
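
For comparison, the same data can be clustered with clara() from the cluster package, a reference CLARA implementation. This page does not say which implementation tidy_clara uses internally or how its result is structured, so treat the following as an independent sketch rather than a description of tidy_clara itself.

```r
library(cluster)

# Same replicated iris data as in the example above (1500 rows, 4 columns)
large_data <- iris[rep(seq_len(nrow(iris)), 10), 1:4]

# 3 clusters, 50 subsamples, Euclidean distance
fit <- clara(large_data, k = 3, metric = "euclidean", samples = 50)

fit$medoids            # one representative observation per cluster
table(fit$clustering)  # cluster sizes across all rows
```

The medoids are actual rows of the data, which makes the clusters easy to interpret compared with the averaged centers produced by k-means.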



tidylearn documentation built on Feb. 6, 2026, 5:07 p.m.