Ascendant hierarchical clustering of a set of variables. Variables can be quantitative, qualitative or a mixture of both. The aggregation criterion is the decrease in homogeneity for the clusters being merged. The homogeneity of a cluster is the sum of the correlation ratios (for qualitative variables) and the squared correlations (for quantitative variables) between the variables and the center of the cluster, which is the first principal component of PCAmix. PCAmix is defined for a mixture of qualitative and quantitative variables and includes ordinary principal component analysis (PCA) and multiple correspondence analysis (MCA) as special cases. Missing values are replaced by means for quantitative variables and by zeros in the indicator matrix for qualitative variables.
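For the purely quantitative case, the homogeneity and the aggregation criterion described above can be sketched in base R. This is only an illustration of the definition, not the package's internal code: the helper `homogeneity` is hypothetical, and the built-in `mtcars` data and the two variable groups are arbitrary choices.

```r
# Hypothetical helper (not part of ClustOfVar): homogeneity of a cluster of
# quantitative variables, computed directly from the definition.
homogeneity <- function(X) {
  Z <- scale(X)              # standardize each variable
  pc1 <- prcomp(Z)$x[, 1]    # first principal component = cluster center
  sum(cor(Z, pc1)^2)         # sum of squared correlations with the center
}

# Two arbitrary clusters of quantitative variables from a built-in data set.
A <- mtcars[, c("mpg", "disp", "hp")]
B <- mtcars[, c("drat", "wt")]

h_A  <- homogeneity(A)
h_B  <- homogeneity(B)
h_AB <- homogeneity(cbind(A, B))

# For quantitative variables the homogeneity equals the largest eigenvalue
# of the cluster's correlation matrix.
lambda_A <- eigen(cor(A))$values[1]

# Aggregation criterion for merging A and B: the decrease in homogeneity.
delta <- h_A + h_B - h_AB
```

A small `delta` means that little homogeneity is lost by merging the two clusters, so such pairs are merged first.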
X.quanti | a numeric matrix of data, or an object that can be coerced to such a matrix (such as a numeric vector or a data frame with all numeric columns).
X.quali | a categorical matrix of data, or an object that can be coerced to such a matrix (such as a character vector, a factor or a data frame with all factor columns).
init | an initial partition (a vector of integers indicating the cluster to which each variable is allocated).
If the quantitative and qualitative data are in the same data frame, the function PCAmixdata::splitmix can be used to automatically extract the qualitative and the quantitative data into two separate data frames.
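The idea behind such a split can be sketched in base R. This is an assumption-laden illustration of the concept, not the implementation of PCAmixdata::splitmix; the built-in `iris` data stands in for a mixed data frame.

```r
# Sketch only: separate numeric columns from non-numeric ones by hand.
# iris has four numeric columns and one factor column (Species).
is_num <- vapply(iris, is.numeric, logical(1))

X.quanti <- iris[, is_num, drop = FALSE]    # quantitative variables
X.quali  <- iris[, !is_num, drop = FALSE]   # qualitative variables
```

`drop = FALSE` keeps each part a data frame even when only one column is selected.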
height | a set of p-1 non-decreasing real values: the values of the aggregation criterion.
clusmat | a p by p matrix with group memberships where each column k corresponds to the elements of the partition in k clusters.
merge | a p-1 by 2 matrix. Row i of
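The returned components can be inspected as follows. This is a minimal sketch that assumes the ClustOfVar package is installed and uses its `decathlon` data with p = 10 variables; the expected dimensions in the comments are taken from the descriptions above.

```r
library(ClustOfVar)

data(decathlon)
tree <- hclustvar(X.quanti = decathlon[, 1:10])

length(tree$height)   # p-1 values of the aggregation criterion
dim(tree$clusmat)     # p by p matrix of group memberships
dim(tree$merge)       # p-1 by 2 matrix

# Column k of clusmat holds the partition in k clusters; here k = 3.
part3 <- tree$clusmat[, 3]
table(part3)          # cluster sizes
```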
Chavent, M., Liquet, B., Kuentz, V., Saracco, J. (2012), ClustOfVar: An R Package for the Clustering of Variables. Journal of Statistical Software, Vol. 50, pp. 1-16.
cutreevar, plot.hclustvar, stability
library(ClustOfVar)

#quantitative variables
data(decathlon)
tree <- hclustvar(X.quanti=decathlon[,1:10], init=NULL)
plot(tree)

#qualitative variables with missing values
data(vnf)
tree_NA <- hclustvar(X.quali=vnf)
plot(tree_NA)
#same data after dropping the rows with missing values
vnf2 <- na.omit(vnf)
tree <- hclustvar(X.quali=vnf2)
plot(tree)

#mixture of quantitative and qualitative variables
data(wine)
#split once and reuse the result
wine_split <- PCAmixdata::splitmix(wine)
X.quanti <- wine_split$X.quanti
X.quali <- wine_split$X.quali
tree <- hclustvar(X.quanti,X.quali)
plot(tree)
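The tree_NA example relies on the missing-value rule stated in the description. A minimal base R sketch of that rule, on made-up toy vectors, might look like this; it illustrates the rule only and is not the package's internal code.

```r
# Quantitative variable: replace NA by the mean of the observed values.
x <- c(2, NA, 4, 6)
x_filled <- ifelse(is.na(x), mean(x, na.rm = TRUE), x)

# Qualitative variable: build the indicator matrix by hand, leaving an
# all-zero row for the observation with a missing value.
f <- factor(c("a", NA, "b", "a"))
G <- sapply(levels(f), function(l) as.integer(!is.na(f) & f == l))
```

Here the second row of `G` contains only zeros, so the missing observation contributes to no category.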