nroAggregate | R Documentation |
Estimate district averages based on assigned map locations for each data point.
nroAggregate(topology, districts, data = NULL)
topology |
A data frame with K rows and six columns, see details. |
districts |
An integer vector of M best-matching districts. |
data |
A vector of M elements or an M x N matrix of data values. |
Topology can be either the output from nroKohonen()
or a
data frame in the same format as the element topology
within the
the output from nroKohonen()
.
The input argument districts
is expected to be the output from
nroMatch()
.
If the input argument data
is empty, the histogram of the data points
on the map is returned (a vector of K elements).
If data are available, a matrix of K rows and N columns that contains the average district values after smoothing is returned. The output includes the attribute 'histogram' that contains data point counts over each data column. Column names and the attribute 'binary' are copied from the input.
# Import data.
fname <- system.file("extdata", "finndiane.txt", package = "Numero")
dataset <- read.delim(file = fname)
# Prepare training data.
trvars <- c("CHOL", "HDL2C", "TG", "CREAT", "uALB")
trdata <- scale.default(dataset[,trvars])
# K-means clustering.
km <- nroKmeans(data = trdata)
# Self-organizing map.
sm <- nroKohonen(seeds = km)
sm <- nroTrain(map = sm, data = trdata)
# Assign data points into districts.
matches <- nroMatch(centroids = sm, data = trdata)
# District averages for one variable.
chol <- nroAggregate(topology = sm, districts = matches,
data = dataset$CHOL)
print(chol)
# District averages for all variables.
planes <- nroAggregate(topology = sm, districts = matches, data = dataset)
print(head(planes))
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.