cem: Conditional Expectation Manifolds


View source: R/cem.R

Description

This package computes principal surfaces based on the approach described in Gerber et al. 2009 and Gerber and Whitaker 2011.

Principal surfaces are typically found by minimizing E[ ||Y - g(λ_g(Y))||^2 ] over the functions g: R^m -> R^n, with m < n, and λ_g: R^n -> R^m defined as an orthogonal projection onto g.

In Gerber et al. 2009 the opposite approach is described: fixing g_λ(x) = E[Y | λ(Y) = x] and minimizing over λ, i.e. optimizing the conditional expectation manifold (CEM) given λ. Gerber et al. 2009 called this approach kernel map manifolds (KMM) since both mappings are defined by kernel regression.

In Gerber and Whitaker 2011 the same formulation is exploited to provide an approach that solves the model selection problem for principal curves. The orthogonal projection distance minimization E[ ||Y - g_λ(Y)||^2 ] yields principal curves that are saddle points, and thus model selection (i.e. bandwidth selection in this implementation) is not possible even with cross-validation. The approach in Gerber and Whitaker 2011 formulates an alternate optimization problem, minimizing the orthogonality E[ <Y - g_λ(Y), d/ds g(s)|_{s=λ(Y)}>^2 ], which leads to principal curves at minima.

This package implements the approach in Gerber et al. 2009 for both formulations, i.e. minimizing either the projection distance E[ ||Y - g_λ(Y)||^2 ] or the orthogonality E[ <Y - g_λ(Y), d/ds g(s)|_{s=λ(Y)}>^2 ]. The implementation is based on kernel regression for λ and g and uses a numerical gradient descent for minimization. The gradient descent includes an optimization of the bandwidth, i.e. model selection. For minimizing the projection distance this does not lead to good results, since principal curves are saddle points; thus stepBW should be set to 0 in this case.
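
For illustration, here is a minimal sketch (not part of the original examples) of fitting both formulations on a noisy half circle; the data setup mirrors the Examples section and the risk, stepBW and optimalSigmaX arguments are documented below:

library(cems)
phi <- runif(100) * pi
arc <- cbind(cos(phi), sin(phi)) * (1 + rnorm(100) * 0.1)

## orthogonality formulation (risk = 2, the default): bandwidth optimization is feasible
pc.ortho <- cem(y = arc, x = phi, knnX = 10, iter = 10, risk = 2, optimalSigmaX = TRUE)

## projection distance formulation (risk = 0): disable bandwidth optimization
pc.mse <- cem(y = arc, x = phi, knnX = 10, iter = 10, risk = 0, stepBW = 0)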

Usage

cem(y, x, knnX = 50, sigmaX = 1/3, iter = 100, nPoints = nrow(y),
    stepX = 0.25, stepBW = 0.1, verbose = 1, risk = 2, penalty = 0,
    sigmaAsFactor = TRUE, optimalSigmaX = FALSE, quadratic = FALSE)
cem.optimize(object, iter = 100, nPoints = nrow(object$y), stepX = 1,
    stepBW = 0.1, verbose = 1, optimalSigmaX = FALSE)
## S3 method for class 'cem'
predict(object, newdata = object$y, type=c("coordinates",
      "curvature"), ...)
cem.geodesic(object, xs, xe, iter = 100, step = 0.01,
    verbose=1, ns=100)

Arguments

y

n-dimensional data to compute the conditional expectation manifold for.

x

Initialization for the low dimensional mapping λ, for example an isomap, lle, or PCA embedding of y.

knnX

Number of nearest neighbors for the kernel regression of g, i.e. the regression is truncated to the knnX nearest neighbors.

sigmaX

Initialize the bandwidth of g to sigmaX. If sigmaAsFactor is set to true, the bandwidth is computed as sigmaX times the average knnX nearest neighbor distance.

iter

Number of optimization iterations, i.e. number of gradient descent with line search steps.

stepX

Gradient descent step size for optimizing coordinate mapping

stepBW

Gradient descent step size for optimizing bandwidths

verbose

Report gradient descent information. 1 reports the iteration number and mean squared projection distance. 2 adds information on step size and line search.

sigmaAsFactor

Use sigmaX and sigmaY as multipliers of the average nearest neighbor distances in Y and λ(Y), respectively.

optimalSigmaX

If true, optimizes sigmaX before every iteration. This does not work for MSE minimization (sigmaX will go to 0), but works well for the orthogonality formulation and speeds up computation significantly.

risk

Which objective function should be minimized. 0 = E[ ||Y - g_λ(Y)||^2 ]. 1 = E[ <g(λ(Y)) - Y, g'(λ(Y))>^2 ]. 2 = as 1, but with g'(λ(Y)) orthonormalized. 3 = as 2, but with g(λ(Y)) - Y normalized.

penalty

0 = No penalty, 1 = Deviation from arc length parametrization

quadratic

Use a locally quadratic regression instead of linear for g

nPoints

Number of points that are sampled for computing gradient descent directions

object

Object of class "cem", as returned by cem, to do prediction (or further optimization) for.

newdata

Data to do prediction for. If ncol(newdata) == m, g(x) is computed for each point x in newdata. If ncol(newdata) == n, λ(y) is computed for each point y in newdata. See the sketch after this argument list.

type

Prediction type: coordinates or curvature of the manifold model.

...

Additional arguments have no effect.

xs

Start point for geodesic

xe

End point for geodesic

step

Step size for optimizing the geodesic.

ns

Number of segments for discretizing the geodesic.
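
The two directions of predict can be sketched as follows, assuming a fitted object pc as in the Examples section (pc$x and pc$y are the components used by the package's own usage and examples; the exact return structure is otherwise not shown here):

## forward: low dimensional coordinates -> points on the manifold, g(x)
xt <- seq(min(pc$x), max(pc$x), length.out = 50)
yt <- predict(pc, newdata = xt)

## inverse: data points -> low dimensional coordinates, lambda(y)
xnew <- predict(pc, newdata = pc$y)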

Value

An object of class "cem".

Author(s)

Samuel Gerber

References

Samuel Gerber, Tolga Tasdizen, Ross Whitaker, Dimensionality Reduction and Principal Surfaces via Kernel Map Manifolds, In Proceedings of the 2009 International Conference on Computer Vision (ICCV 2009).

Samuel Gerber and Ross Whitaker, Regularization-Free Principal Curve Estimation, Journal of Machine Learning Research, 2013.

See Also

cem.example.arc cem.example.sr

Examples

##Noisy half circle example
phi <- runif(100)*pi
arc <- cbind(cos(phi), sin(phi)) * (1+rnorm(100) * 0.1)

pc <-  cem(y=arc, x=phi, knnX=10, iter=10, optimalSigmaX=TRUE, verbose=2)

#predict original data
y <- predict(pc, pc$x)

#predict new data
xt <- seq(min(pc$x), max(pc$x), length.out=100)
yt <- predict(pc, xt)

#plot things
arc0 <- cbind(cos(phi), sin(phi))
o0 <- order(phi) 

par(mar=c(5,5,4,2))
plot(arc, xlab=expression(y[1]), ylab=expression(y[2]), col = "#00000020",
     pch=19, asp=1, cex.lab=1.5, cex.axis=1.5, cex=2, bty="n")

lines(arc0[o0,], lwd=4, col="black", lty=6)
lines(yt$y, col="dodgerblue", lwd=4, lty=1)
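
The fit can be refined further with cem.optimize, and geodesics on the resulting model can be computed with cem.geodesic. A minimal sketch continuing the example above (it assumes cem.optimize returns the updated "cem" object; the structure of the cem.geodesic return value is not used here):

#continue the gradient descent on the existing fit
pc <- cem.optimize(pc, iter = 20, optimalSigmaX = TRUE, verbose = 1)

#geodesic between the two ends of the fitted coordinate range
gd <- cem.geodesic(pc, xs = min(pc$x), xe = max(pc$x), ns = 100)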
