find_split: Find the Best Split

Description Usage Arguments Value

View source: R/monoclust.R

Description

Find the best split in terms of reduction in inertia for the transferred node, indicate by row. Find the terminal node with the greatest change in inertia and bi-partition it.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
find_split(
  data,
  cuts,
  frame_row,
  cloc,
  dist,
  variables,
  minsplit,
  minbucket,
  ncores
)

Arguments

data

Original data set.

cuts

Cuts data set, which has the next higher value of each variable in the original data set.

frame_row

One row of the split tree as data frame.

cloc

Vector of current cluster membership.

dist

Distance matrix of all observations in the data. exported function yet. Vector of 1 for all observations.

variables

List of variables selected for clustering procedure. It could be a vector of variable indexes, or a vector of variable names.

minsplit

The minimum number of observations that must exist in a node in order for a split to be attempted.

minbucket

The minimum number of observations in any terminal leaf node. Default is minsplit/3.

ncores

Number of CPU cores on the current host.

Value

The updated frame_row with the next split updated.


monoClust documentation built on Feb. 15, 2021, 5:07 p.m.