kMeansCalcClusters: Function to mathematically calculate the optimal number of...

Description Usage Arguments Value

Description

It should come close to calculating the elbow point mathematically for the optimal number of clusters using K-Means clustering.

It essentially draws a line from the least number of clusters to the max number of clusters and then calculates the point that's the farthest away from that line.

It assumes all columns are already numeric values.The columns used in the dataframe must already be numeric values, and they should already be standardized.

It's heavily dependent on the max number of clusters. A larger nc will have a higher accuracy.

Usage

1

Arguments

data

The name of the dataframe whose columns will be clustered.

nc

The number of clusters to try. Defaults either to 40 or the number of rows in the dataframe.

Value

Number.


RonGuymon/ronsFunctions documentation built on May 8, 2019, 11:42 a.m.