hotspot_analysis: function to find areas of high density (so called "Hotspots")...
In pmusfeld/HotspotAnalysis: Find Hotspots (areas of high denisty) in 2-dimensional data

Description Usage Arguments Value Author(s) Examples

View source: R/Hotspot_function.R

This function takes a two-dimensional dataset as it's input and and detetcts areas of high density in the data. The distribution of the input data is compared to randomly distirbuted data of the same structure. Therefore, for every data-point the distance to a specified proportion of other nearest neighbor data-points is calculated and compared to the randomly expected value. Points which are closer to each other than randomly expected are selected and clustered to Hotpots.

1	hotspot_analysis(point_id, column_x_coord, column_y_coord, prop_nneighbours = 0.05, criterion_NNI = 0.8, number_cluster = NULL, prefer_more_cluster = TRUE)

`point_id`	Input column which contains a point identifier for every point (x/y) in the data
`column_x_coord`	Input column (a numeric vector) which contains the x-coordinates of the data
`column_y_coord`	Input column (a numeric vector) which contains the y-coordinates of the data
`prop_nneighbours`	A number < 1, specifying the proportion (percentage from the total number of data points) of nearest neighbors that should be considered when calculating the distances from every point to his sourrounded points. The default value is 5 percent (0.05) from the total number of points.
`criterion_NNI`	A number < 1, specifying the threshold for selecting points of high density. A Value of 1 indicates that a point is distributed as randomly expected. The closer the value is to 0, the stricter the selection of points. The default value is 0.8.
`number_cluster`	A number, manually specifying the number of cluster. By default, the number of cluster/hotspots is determined automatically by optimizing the silhouette width of different cluster solutions. Nevertheless, if the solution is not appropriate for personal needs or content-related issues, the number of needed Hotspots ca be specified manually.
`prefer_more_cluster`	A logical (TRUE/FALSE). By default, the cluster solution with the best fit (by optimizing silhouette width) is choosen. With regard to content-related issues, it sometimes can be more appropriate to prefer a higher number of smaller and close by cluster over less but bigger cluster. Therefore, all good cluster solutions in a small range from the optimum are selected and the solution with the highest number of cluster is chosen.

A list object, containing a data frame with the Hotspot assignment and 3 graphs to control the selection process:

`hotspot_assignment`	A data frame with 2 columns, containing the point identifier and the assigned Hotspot
`plot_result`	A scatter-plot of the data, showing the hotspot-assignment
`plot_point_selection`	A scatter plot, showing just the points which where selected to be distributed in areas of high density
`plot_random_data`	A scatter plot, showing the randomly distributed points which the data is compared to

Philipp Musfeld

##create data with Hotspots:
x_0_100_H1 <- rnorm(n = 100, mean = 20, sd = 5)
y_0_100_H1 <- rnorm(n = 100, mean = 20, sd = 5)

x_0_100_H2 <- rnorm(n = 100, mean = 50, sd = 5)
y_0_100_H2 <- rnorm(n = 100, mean = 50, sd = 5)

x_0_100_H3 <- rnorm(n = 100, mean = 90, sd = 5)
y_0_100_H3 <- rnorm(n = 100, mean = 90, sd = 5)

x_0_100_H4 <- rnorm(n = 50, mean = 25, sd = 2.5)
y_0_100_H4 <- rnorm(n = 50, mean = 95, sd = 2.5)

x_0_100_H5 <- rnorm(n = 50, mean = 70, sd = 5)
y_0_100_H5 <- rnorm(n = 50, mean = 30, sd = 5)

x_0_100_kH <- runif(n = 200, min = 0, max = 100)
y_0_100_kH <- runif(n = 200, min = 0, max = 100)

x_coord <- c(x_0_100_H1, x_0_100_H2,  x_0_100_H3,  x_0_100_H4,  x_0_100_H5,  x_0_100_kH)
y_coord <- c(y_0_100_H1, y_0_100_H2,  y_0_100_H3,  y_0_100_H4,  y_0_100_H5,  y_0_100_kH)

data <- data.frame(x_coord, y_coord)
data$point <- paste("point_", rownames(data), sep = "")

hotspot_analysis(point_id = data$point, column_x_coord = data$x_coord, column_y_coord = data$y_coord, prop_nneighbours = 0.05, criterion_NNI = 0.8, number_cluster = NULL, prefer_more_cluster = TRUE)

pmusfeld/HotspotAnalysis documentation built on Oct. 19, 2020, 12:56 a.m.

pmusfeld/HotspotAnalysis index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

pmusfeld/HotspotAnalysis
Find Hotspots (areas of high denisty) in 2-dimensional data

hotspot_analysis: function to find areas of high density (so called "Hotspots")...
In pmusfeld/HotspotAnalysis: Find Hotspots (areas of high denisty) in 2-dimensional data

Description

Usage

Arguments

Value

Author(s)

Examples

Related to hotspot_analysis in pmusfeld/HotspotAnalysis...

R Package Documentation

Browse R Packages

We want your feedback!

pmusfeld/HotspotAnalysis Find Hotspots (areas of high denisty) in 2-dimensional data

hotspot_analysis: function to find areas of high density (so called "Hotspots")... In pmusfeld/HotspotAnalysis: Find Hotspots (areas of high denisty) in 2-dimensional data

Description

Usage

Arguments

Value

Author(s)

Examples

Related to hotspot_analysis in pmusfeld/HotspotAnalysis...

R Package Documentation

Browse R Packages

We want your feedback!

pmusfeld/HotspotAnalysis
Find Hotspots (areas of high denisty) in 2-dimensional data

hotspot_analysis: function to find areas of high density (so called "Hotspots")...
In pmusfeld/HotspotAnalysis: Find Hotspots (areas of high denisty) in 2-dimensional data