clustCPS: CPS Analysis for cluster validation..

Description Usage Arguments Value Examples

Description

Covering Point Set Analysis for validating clustering results. It conducts alignment among different results and then calculates the covering point set. The return contains several statistics which can be directly used as input for mplot or cplot. If you want to design your own workflow, you can use function CPS instead.

Usage

1
2
3
clustCPS(data, k, l = TRUE, pre = TRUE, noi = "after",
  cmethod = "kmeans", dimr = "PCA", vis = "tsne", ref = NULL,
  nPCA = 50, nEXP = 100)

Arguments

data

– data given in a matrix format, where rows are samples, and columns are variables.

k

– number of clusters.

l

– logical. If True, log-transforamtion will be carried out on the data.

pre

– logical. If True, pre-dimension reduction will be carried out based on the variance.

noi

– adding noise before or after the dimension reduction, choosing between "before" and "after", default "after".

cmethod

– clustering method, choosing from "kmeans" and "mclust", default "kmeans".

dimr

– dimension reduction technique, choose from "none" and "PCA", default "PCA".

vis

– the visualization method to be used, such as "tsne" and "umap", default "tsne". Also, you can provide your own visualization coordinates in a numeric matrix of two columns.

ref

– optional, clustering reuslt in a vector format and the first cluster is labeled as 1. If provided it will be used as the reference, if not we will generate one.

nPCA

– number of principal conponents to use, default 50.

nEXP

– number of perturbed clustering reuslts for CPS Analysis, default 100.

Value

a list used for mplot or cplot, in which tight_all is the overall tightness, member is the matrix used for the membership plot, set is the matrix for the covering point set plot, tight is the vector of cluster-wise tightness, vis is the visualization coordinates, and ref is the reference labels.

Examples

1
2
3
4
5
6
7
8
9
# CPS Analysis on validation of clustering result
data(YAN)
# Suppose you generate the visualization coordinates on your own
x1=matrix(seq(1,nrow(YAN),1),ncol=1)
x2=matrix(seq(1,nrow(YAN),1),ncol=1)
# Using nEXP=50 for illustration, usually use nEXP greater 100
y=clustCPS(YAN[,1:100], k=7, l=FALSE, pre=FALSE, noi="after",vis=cbind(x1,x2), nEXP = 50)
# visualization of the results
mplot(y,4)

OTclust documentation built on May 6, 2019, 9 a.m.