estimate_boxcount: Box-counting Dimension
In Rdimtools: Dimension Reduction and Estimation Methods

est.boxcount

R Documentation

Box-counting Dimension

Description

Box-counting dimension, also known as Minkowski-Bouligand dimension, is a popular way of figuring out the fractal dimension of a set in a Euclidean space. Its idea is to measure the number of boxes required to cover the set repeatedly by decreasing the length of each side of a box. It is defined as

dim(S) = \lim \frac{\log N(r)}{\log (1/r)}

as r\rightarrow 0, where N(r) is the number of boxes counted to cover a given set for each corresponding r.

Usage

est.boxcount(X, nlevel = 50, cut = c(0.1, 0.9))

Arguments

`X`	an (n\times p) matrix or data frame whose rows are observations.
`nlevel`	the number of `r` (radius) to be tested.
`cut`	a vector of ratios for computing estimated dimension in (0,1).

Value

a named list containing containing

estdim: estimated dimension using cut ratios.
r: a vector of radius used.
Nr: a vector of boxes counted for each corresponding r.

Determining the dimension

Even though we could use arbitrary cut to compute estimated dimension, it is also possible to use visual inspection. According to the theory, if the function returns an output, we can plot plot(log(1/output$r),log(output$Nr)) and use the linear slope in the middle as desired dimension of data.

Automatic choice of r

The least value for radius r must have non-degenerate counts, while the maximal value should be the maximum distance among all pairs of data points across all coordinates. nlevel controls the number of interim points in a log-equidistant manner.

Author(s)

Kisung You

References

\insertRef

hentschel_infinite_1983Rdimtools

\insertRef

ott_chaos_2002Rdimtools

Examples


## generate three different dataset
X1 = aux.gensamples(dname="swiss")
X2 = aux.gensamples(dname="ribbon")
X3 = aux.gensamples(dname="twinpeaks")

## compute boxcount dimension
out1 = est.boxcount(X1)
out2 = est.boxcount(X2)
out3 = est.boxcount(X3)

## visually verify : all should have approximate slope of 2.
opar <- par(no.readonly=TRUE)
par(mfrow=c(1,3))
plot(log(1/out1$r), log(out1$Nr), main="swiss roll")
plot(log(1/out2$r), log(out2$Nr), main="ribbon")
plot(log(1/out3$r), log(out3$Nr), main="twinpeaks")
par(opar)

Rdimtools documentation built on Dec. 28, 2022, 1:44 a.m.