DianaParamSelection: DianaParamSelection

Description Usage Arguments Examples

View source: R/cluster_functions.R

Description

Runs DIANA (DIvisive ANAlysis) clustering with differing numbers of partitions and returns validation metrics.

Usage

1
2
3
4
5
6
7
8
DianaParamSelection(
  dataset = NULL,
  distance = NULL,
  k = c(2, 5, 10),
  metric = "euclidean",
  nthreads = 4,
  scale = TRUE
)

Arguments

dataset

A transcriptomics dataset. Preferably filtered first. First columns should be gene names. All other columns should be expression levels.

distance

A distance matrix. If a distance matrix has already been created (such as by using the DistanceGen function), the matrix can be passed to this function to save time. If a distance matrix is not provided then it will be generated by the function.

k

A numeric vector giving the number of clusters to be evaluated.

metric

The distance metric to be used to calculate the distances between genes. See parallelDist::parDist for all accepted arguments. Also allows the option of 'abs.correlation'. Not used if a distance matrix is provided.

nthreads

The number of threads to be used for parallel computations.If NULL then the maximum number of threads available will be used.

scale

Logical. If TRUE then each gene will be scaled

Examples

1
2
3
k.options <- seq(2,10)
diana.validation <- DianaParamSelection(Laurasmappings, k=k.options,
                                         nthreads = 2)

nathansam/CircadianTools documentation built on Dec. 26, 2019, 11:30 a.m.