eval.stress: Evaluates the Sammon stress of an embedding

Description Usage Arguments Details Value References See Also

View source: R/spe.R

Description

Given an N dimensional dataset embedded M dimensions, this function will evaluate the Sammon stress of the embedding, via probability sampling

Usage

1
2
3
eval.stress( x, coord,
             ndim = 0, edim = 0, nobs = 0,
             samplesize = 1e6)

Arguments

x

The embedded data in matrix form. If present in a data.frame it will be coerced to a matrix

coord

The input data in matrix form. If present in a data.frame it will be coerced to a matrix

nobs

The number of observations (rows of the input matrix should be the same as the rows of the embedding matrix) If it is not specified nobs will be taken as nrow(coord)

ndim

The number of input dimensions. If not specified it will be taken as ncol(coord)

edim

The number of dimensions to embed in. If not specified it will be taken as ncol(x)

samplesize

The number of iterations for probability sampling. For a dataset of 6070 observations there will be 6070x6069/2 pairwise distances. The default value gives a close approximation and runs fast. If you want a better approximation 1e7 is a good value. YMMV

Details

The Sammon stress is given by

S = ∑_{i < j} \frac{ (d_{ij} - r_{ij} )^2 }{r_{ij}} / ∑_{i<j} r_{ij}

where d_{ij} is the Euclidean distance between two observations in the embedded data and r_{ij} is the relationship (in this case it is the Euclidean distance but could be a similarity value) between the two observations in the input data

Value

Returns the value of the Sammon stress as a single number

References

Stochastic Proximity Embedding, J. Comput. Chem., 2003, 24, 1215-1221

See Also

spe


spe documentation built on May 1, 2019, 10:17 p.m.

Related to eval.stress in spe...