hclustvfreq: Fast hierarchical, agglomerative clustering of frequency data

Description Usage Arguments Details See Also Examples

View source: R/hclust.vector.R View source: R/hclust.vector.R

Description

This function implements a version of the hierarchical, agglomerative clustering hclust.vector focused on table of frequencies.

Usage

1
2
3
4
hclustvfreq(data, freq = NULL, method = "single", metric = "euclidean",
  p = NULL)

.hclustvfreq(tfq, method = "single", metric = "euclidean", p = NULL)

Arguments

data

any object that can be coerced into a double matrix

freq

a one-sided, single term formula specifying frequency weights

method

the agglomeration method to be used. This must be (an unambiguous abbreviation of) one of "single", "ward", "centroid" or "median".

metric

the distance measure to be used. This must be one of "euclidean", "maximum", "manhattan", "canberra", "binary" or "minkowski"

p

parameter for the Minkowski metric.

tfq

a frequency table

Details

Any variables in the formula are removed from the data set.

This function is a wrapper of hclust.vector to be used with tables of frequencies. It use the frequency weights as parameter members.

See Also

hclust.vector, link{tablefreq}

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
library(dplyr)
library(fastcluster)

data <- iris[,1:3,drop=FALSE]
hc <- hclustvfreq(data, method="centroid",metric="euclidean")
cutree(hc,3) ## Different length than data

tfq <- tablefreq(iris[,1:3])
hc <- .hclustvfreq(tfq, method="centroid",metric="euclidean")
tfq$group <- cutree(hc,3)

Example output

Attaching package: 'dplyr'

The following objects are masked from 'package:stats':

    filter, lag

The following objects are masked from 'package:base':

    intersect, setdiff, setequal, union


Attaching package: 'fastcluster'

The following object is masked from 'package:stats':

    hclust

  [1] 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 2 2 1 1 1 2 2 1 1 1 1 1 1 1 1 2 1 1 1 1 1 1
 [38] 1 2 1 1 1 1 2 1 1 1 1 1 2 2 2 2 2 1 1 2 2 2 2 2 2 2 2 2 2 2 2 1 1 2 2 2 2
 [75] 2 1 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2
[112] 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 2 3 3 3 3 3 3 3 3 3 3 3 3

freqweights documentation built on May 29, 2017, 12:01 p.m.