qualitySet: Goodness of classifications for a set of k clusters.
In neobernad/evaluomeR: Evaluation of Bioinformatics Metrics

qualitySet

R Documentation

Goodness of classifications for a set of k clusters.

Description

The goodness of the classifications are assessed by validating the clusters generated for a range of k values. For this purpose, we use the Silhouette width as validity index. This index computes and compares the quality of the clustering outputs found by the different metrics, thus enabling to measure the goodness of the classification for both instances and metrics. More precisely, this measurement provides an assessment of how similar an instance is to other instances from the same cluster and dissimilar to the rest of clusters. The average on all the instances quantifies how the instances appropriately are clustered. Kaufman and Rousseeuw suggested the interpretation of the global Silhouette width score as the effectiveness of the clustering structure. The values are in the range [0,1], having the following meaning:

There is no substantial clustering structure: [-1, 0.25].
The clustering structure is weak and could be artificial: ]0.25, 0.50].
There is a reasonable clustering structure: ]0.50, 0.70].
A strong clustering structure has been found: ]0.70, 1].

Usage

qualitySet(
  data,
  k.set = c(2, 4),
  cbi = "kmeans",
  all_metrics = FALSE,
  getImages = FALSE,
  seed = NULL,
  ...
)

Arguments

`data`	A `SummarizedExperiment`. The SummarizedExperiment must contain an assay with the following structure: A valid header with names. The first column of the header is the ID or name of the instance of the dataset (e.g., ontology, pathway, etc.) on which the metrics are measured. The other columns of the header contains the names of the metrics. The rows contains the measurements of the metrics for each instance in the dataset.
`k.set`	A list of integer values of `k`, as in c(2,4,8). The values must be contained in [2,15] range.
`cbi`	Clusterboot interface name (default: "kmeans"): "kmeans", "clara", "clara_pam", "hclust", "pamk", "pamk_pam", "pamk". Any CBI appended with '_pam' makes use of `pam`. The method used in 'hclust' CBI is "ward.D2".
`all_metrics`	Boolean. If true, clustering is performed upon all the dataset.
`getImages`	Boolean. If true, a plot is displayed.
`seed`	Positive integer. A seed for internal bootstrap.

Value

A list of SummarizedExperiment containing the silhouette width measurements and cluster sizes from k.set.

References

\insertRef

kaufman2009findingevaluomeR

Examples

# Using example data from our package
data("rnaMetrics")
# Without plotting
dataFrameList = qualitySet(rnaMetrics, k.set=c(2,3), getImages = FALSE)

neobernad/evaluomeR documentation built on March 29, 2025, 3:35 p.m.

neobernad/evaluomeR index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

neobernad/evaluomeR
Evaluation of Bioinformatics Metrics

qualitySet: Goodness of classifications for a set of k clusters.
In neobernad/evaluomeR: Evaluation of Bioinformatics Metrics

Goodness of classifications for a set of k clusters.

Description

Usage

Arguments

Value

References

Examples

Related to qualitySet in neobernad/evaluomeR...

R Package Documentation

Browse R Packages

We want your feedback!

neobernad/evaluomeR Evaluation of Bioinformatics Metrics

qualitySet: Goodness of classifications for a set of k clusters. In neobernad/evaluomeR: Evaluation of Bioinformatics Metrics

Goodness of classifications for a set of k clusters.

Description

Usage

Arguments

Value

References

Examples

Related to qualitySet in neobernad/evaluomeR...

R Package Documentation

Browse R Packages

We want your feedback!

neobernad/evaluomeR
Evaluation of Bioinformatics Metrics

qualitySet: Goodness of classifications for a set of k clusters.
In neobernad/evaluomeR: Evaluation of Bioinformatics Metrics