Description Usage Arguments Value Examples
View source: R/cluster_functions.R
Applies DIANA (DIvisive ANAlysis) clustering to a transcriptomics dataset and appends a cluster column to this dataset for all genes.
1 2 3 4 5 6 7 8 9 |
dataset |
A transcriptomics dataset. First columns should be gene names. All other columns should be expression levels. Not needed if an argument to distance is given. |
distance |
A distance matrix. If a distance matrix has already been created (such as by using the DistanceGen function), the matrix can be passed to this function to save time. If a distance matrix is not provided then it will be generated by the function. |
k |
The total number of clusters. |
metric |
The distance metric to be used to calculate the distances between genes. See parallelDist::parDist for all accepted arguments. Also allows the option of 'abs.correlation'. Not used if a distance matrix is provided. |
nthreads |
Number of processor threads to be used for calculating the distance matrix. If not specifed then the maximum number of logical cores are used. |
scale |
Logical. If TRUE then each gene will be scaled |
center |
If the gene activity should be centered before clustering. |
Returns transcriptomics dataset provided with additional cluster column appended denoted which cluster each gene belongs to.
1 2 | t.filter <-TFilter(Laurasmappings, nthreads = 2)
diana.df <- DianaClustering(t.filter, k= 10, nthreads = 2)
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.