ahfoss/kamila: Methods for Clustering Mixed-Type Data

Implements methods for clustering mixed-type data, specifically combinations of continuous and nominal data. Special attention is paid to the often-overlooked problem of equitably balancing the contribution of the continuous and categorical variables. This package implements KAMILA clustering, a novel method for clustering mixed-type data in the spirit of k-means clustering. It does not require dummy coding of variables, and is efficient enough to scale to rather large data sets. Also implemented is Modha-Spangler clustering, which uses a brute-force strategy to maximize the cluster separation simultaneously in the continuous and categorical variables.

Getting started

Package details

AuthorAlexander Foss [aut, cre], Marianthi Markatou [aut]
MaintainerAlexander Foss <[email protected]>
LicenseGPL-3 | file LICENSE
Version0.1.1.2
URL https://github.com/ahfoss/kamila
Package repositoryView on GitHub
Installation Install the latest version of this package by entering the following in R:
install.packages("devtools")
library(devtools)
install_github("ahfoss/kamila")
ahfoss/kamila documentation built on Sept. 28, 2018, 3:02 p.m.