README.md

CRAN_Status_Badge CRAN RStudio mirror downloads CRAN RStudio mirror downloads

Databionic Swarm

Algorithms implementing populations of agents that interact with one another and sense their environment may exhibit emergent behavior such as self-organization and swarm intelligence. Here, a swarm system called Databionic swarm (DBS) is introduced which was published in Thrun, M.C., Ultsch A.: "Swarm Intelligence for Self-Organized Clustering" (2020), Artificial Intelligence, . DBS is able to adapt itself to structures of high-dimensional data such as natural clusters characterized by distance and/or density based structures in the data space. The first module is the parameter-free projection method called Pswarm (Pswarm()), which exploits the concepts of self-organization and emergence, game theory, swarm intelligence and symmetry considerations. The second module is the parameter-free high-dimensional data visualization technique, which generates projected points on the topographic map with hypsometric tints defined by the generalized U-matrix (GeneratePswarmVisualization()). The third module is the clustering method itself with non-critical parameters (DBSclustering()). Clustering can be verified by the visualization and vice versa. The term DBS refers to the method as a whole. It enables even a non-professional in the field of data mining to apply its algorithms for visualization and/or clustering to data sets with completely different structures drawn from diverse research fields. The comparison to common projection methods can be found in the book of Thrun, M.C.: "Projection Based Clustering through Self-Organization and Swarm Intelligence" (2018) . A comparison to 26 common clustering algorithms on 15 datasets is presented on the website.

Exemplary Topographic Map

The generalized U-matrix generates the visualization of a topographic map with hypsometric tints, which can be vividly described as a virtual 3D landscape with a specific color scale chosen with an algorithm defining the contour lines [24]. The topographic map addresses the central problem in clustering, i.e., the correct estimation of the number of clusters. It allows the assessment of the number of clusters [24] by inspecting the 3D landscape. The color scale and contour lines imitate valleys, ridges, and basins: blue colors indicate small distances (sea level), green and brown colors indicate middle distances (low hills), and shades of gray and white indicate vast distances (high mountains covered with snow and ice). Valleys and basins represent clusters, and the watersheds of hills and mountains represent the borders between clusters. In this 3D land-scape, the borders of the visualization are cyclically connected with a periodicity defined by two parameters (L,C).

Fig. 1: The Topographic map of the normalized generalized U-matrix visualizes the distance-based structures of 7447 Dimensions. Each point is a patient colored by their diagnosis. Patients with the same diagnosis lie in the same valley.

Download Figure as Interactive HTML

Note: You have to download the raw source file by clicking in github "View raw", and then saving it with right click as *.html locally and then re-opening this file in your browser.

Installation

Installation using CRAN

Install automatically with all dependencies via

install.packages("DatabionicSwarm",dependencies = T)

Installation using Github

Please note, that dependecies have to be installed manually.

remotes::install_github("Mthrun/DatabionicSwarm")

Installation using R Studio

Please note, that dependecies have to be installed manually.

Tools -> Install Packages -> Repository (CRAN) -> DatabionicSwarm

Tutorial Examples

The tutorial with several examples can be found on in the vignette on CRAN:

https://cran.r-project.org/web/packages/DatabionicSwarm/vignettes/DatabionicSwarm.html

Manual

The full manual for users or developers is available here: https://cran.r-project.org/web/packages/DatabionicSwarm/DatabionicSwarm.pdf



Mthrun/DatabionicSwarm documentation built on Nov. 2, 2023, 6:51 a.m.