Description Usage Arguments Value Author(s) References Examples
View source: R/external_TMixClust.R
analyse_stability
Performs multiple clustering runs
with TMixClust, analyses the agreement between runs
with the Rand index and returns the clustering solution with the largest
likelihood.
A plot of agreement probability between all the runs and the run with the
maximum likelihood is produced.
1 2 3 | analyse_stability(time_series_df, time_points = seq_len(ncol(time_series_df)),
nb_clusters = 2, em_iter_max = 1000, mc_em_iter_max = 10,
em_ll_convergence = 0.001, nb_clustering_runs = 3, nb_cores = 1)
|
time_series_df |
data frame containing the time series. Each row is a time series comprised of the time series name which is also the row name, and the time series values at each time point. |
time_points |
vector containing numeric values for the time points.
Default: |
nb_clusters |
desired number of clusters |
em_iter_max |
maximum number of iterations for the expectation-maximization (EM) algorithm. Default: 1000. |
mc_em_iter_max |
maximum number of iterations for Monte-Carlo resampling. Default is 10. |
em_ll_convergence |
convergence threshold for likelihood improvement. Default is 0.001. |
nb_clustering_runs |
number of times the clustering procedure is repeated on the input data. Default is 3. |
nb_cores |
number of cores to be used to run the separate clustering operations in parallel. Default is 1. |
TMixClust object with the highest likelihood. Renders a plot showing the overall distribution of the Rand index, which allows the user to assess clustering stability.
Monica Golumbeanu, monica.golumbeanu@bsse.ethz.ch
Golumbeanu M, Desfarges S, Hernandez C, Quadroni M, Rato S, Mohammadi P, Telenti A, Beerenwinkel N, Ciuffi A. (2017) Dynamics of Proteo-Transcriptomic Response to HIV-1 Infection.
1 2 3 4 5 6 7 8 9 10 11 12 13 | # Load the toy time series data provided with the TMixClust package
data(toy_data_df)
# Identify the most optimal clustering solution with 3 clusters
best_clust_obj = analyse_stability(toy_data_df, nb_clusters = 3,
nb_clustering_runs = 4, nb_cores = 1)
# Plot the time series from each cluster
for (i in seq_len(3)) {
# Extract the time series in the current cluster and plot them
c_df=toy_data_df[which(best_clust_obj$em_cluster_assignment==i),]
plot_time_series_df(c_df, plot_title = paste("cluster",i))
}
|
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.