R/ace.R
In acepack: ACE and AVAS for Selecting Multiple Regression Transformations

Documented in ace ace.default ace.formula plot.ace print.ace summary.ace

  #############################################################################
 #
# This file is part of acepack.
#
# Copyright 1985,2007 Jerome H. Friedman
# Copyright 2016,2025 Shawn Garbett, Vanderbilt University Medical Center
#
# Permission to use, copy, modify, distribute, and sell this software and
# its documentation for any purpose is hereby granted without fee,
# provided that the above copyright notice appear in all copies and that
# both that copyright notice and this permission notice appear in
# supporting documentation. No representations are made about the
# suitability of this software for any purpose.  It is provided "as is"
# without express or implied warranty.
###############################################################################

#' @name ace
#' @title Alternating Conditional Expectations
#'
#' @param x matrix; A matrix containing the independent variables.
#' @param y numeric; A vector containing the response variable.
#' @param wt numeric; An optional vector of weights.
#' @param cat integer; An optional integer vector specifying which variables
#'   assume categorical values.  Positive values in \code{cat} refer to columns
#'   of the \code{x} matrix and zero to the response variable.  Variables must
#'   be numeric, so a character variable should first be transformed with
#'   as.numeric() and then specified as categorical.
#' @param mon integer; An optional integer vector specifying which variables are
#'   to be transformed by monotone transformations.  Positive values
#'   in \code{mon} refer to columns of the \code{x} matrix and zero to the
#'   response variable.
#' @param lin integer; An optional integer vector specifying which variables are
#'   to be transformed by linear transformations. Positive values in \code{lin}
#'   refer to columns of the \code{x} matrix and zero to the response variable.
#' @param circ integer; An integer vector specifying which variables assume
#'   circular (periodic) values.  Positive values in \code{circ} refer to
#'   columns of the \code{x} matrix and zero to the response variable.
#' @param delrsq numeric(1); termination threshold. Iteration stops when
#'   R-squared changes by less than \code{delrsq} in 3 consecutive iterations
#'   (default 0.01).
#' @param control named list; control parameters to set. Documented at 
#' \code{\link{set_control}}.
#' @param on.error function; call back for when ierr is not equal to zero. Defaults to warning.
#' @param formula formula; an object of class "\code{\link{formula}}": a
#'    symbolic description of the model to be smoothed.
#' @param data an optional data frame, list or environment (or object coercible
#'   by \code{\link{as.data.frame}} to a data frame) containing the variables in
#'   the model. If not found in data, the variables are taken from
#'   \code{environment(formula)}, typically the environment from which
#'   \code{ace} is called.
#' @param subset an optional vector specifying a subset of observations to be
#'   used in the fitting process. Only used when a \code{formula}
#'   is specified.
#' @param na.action a function which indicates what should happen when the data
#'   contain NAs. The default is set by the \code{na.action} setting of
#'   \code{\link{options}}, and is \code{\link{na.fail}} if that is unset.
#'   The ‘factory-fresh’ default is \code{\link{na.omit}}. Another possible
#'   value is NULL, no action. Value \code{\link{na.exclude}} can be useful.
#' @param ... additional arguments which go ignored for ace call. Included for S3 dispatch
#'   consistency. They are utilized when using print as they get passed to cat. 
#'   Also when plotting an ace object they are passed to plot.
#' @param digits rounding digits for summary/print
#' @param object an S3 ace object
#' @param which when plotting an ace object which plots to produce.
#' @param caption a list of captions for a plot. 
#' @param xlab the x-axis label when plotting.
#' @param ylab the y-axis label when plotting.
#' @param ask when plotting should the terminal be asked for input between plots.
#' @return
#'   A structure with the following components:
#'    \item{x}{the input x matrix.}
#'    \item{y}{the input y vector.}
#'    \item{tx}{the transformed x values.}
#'    \item{ty}{the transformed y values.}
#'    \item{rsq}{the multiple R-squared value for the transformed values.}
#'    \item{l}{the codes for cat, mon, ...}
#'
#' @description
#'    Uses the alternating conditional expectations algorithm to find the
#'    transformations of y and x that maximize the proportion of variation
#'    in y explained by x. When x is a matrix, it is transformed so that
#'   its columns are equally weighted when predicting y.
#'
#' @references
#'    Breiman and Friedman, Journal of the American Statistical
#'    Association (September, 1985).
#'
#'    The R code is adapted from S code for avas() by Tibshirani, in the
#'    Statlib S archive; the FORTRAN is a double-precision version of
#'   FORTRAN code by Friedman and Spector in the Statlib general archive.
#'
#' @examples
#' 
#' TWOPI <- 8*atan(1)
#' x <- runif(200,0,TWOPI)
#' y <- exp(sin(x)+rnorm(200)/2)
#' a <- ace(x,y)
#' par(mfrow=c(3,1))
#' plot(a$y,a$ty)  # view the response transformation
#' plot(a$x,a$tx)  # view the carrier transformation
#' plot(a$tx,a$ty) # examine the linearity of the fitted model
#' 
#' # example when x is a matrix
#' X1 <- 1:10
#' X2 <- X1^2
#' X <- cbind(X1,X2)
#' Y <- 3*X1+X2
#' a1 <- ace(X,Y)
#' par(mfrow=c(1,1))
#' plot(rowSums(a1$tx),a1$y)
#' (lm(a1$y ~ a1$tx)) # shows that the colums of X are equally weighted
#' 
#' # From D. Wang and M. Murphy (2005), Identifying nonlinear relationships
#' # regression using the ACE algorithm.  Journal of Applied Statistics,
#' # 32, 243-258.
#' X1 <- runif(100)*2-1
#' X2 <- runif(100)*2-1
#' X3 <- runif(100)*2-1
#' X4 <- runif(100)*2-1
#' 
#' # Original equation of Y:
#' Y <- log(4 + sin(3*X1) + abs(X2) + X3^2 + X4 + .1*rnorm(100))
#' 
#' # Transformed version so that Y, after transformation, is a
#' # linear function of transforms of the X variables:
#' # exp(Y) = 4 + sin(3*X1) + abs(X2) + X3^2 + X4
#' 
#' a1 <- ace(cbind(X1,X2,X3,X4),Y)
#' 
#' # For each variable, show its transform as a function of
#' # the original variable and the of the transform that created it,
#' # showing that the transform is recovered.
#' par(mfrow=c(2,1))
#' 
#' plot(X1,a1$tx[,1])
#' plot(sin(3*X1),a1$tx[,1])
#' 
#' plot(X2,a1$tx[,2])
#' plot(abs(X2),a1$tx[,2])
#' 
#' plot(X3,a1$tx[,3])
#' plot(X3^2,a1$tx[,3])
#' 
#' plot(X4,a1$tx[,4])
#' plot(X4,a1$tx[,4])
#' 
#' plot(Y,a1$ty)
#' plot(exp(Y),a1$ty)
#' 
#' @rdname ace
#' @export
#' @useDynLib acepack, .registration=TRUE
#' 
ace <- function(...) UseMethod("ace")

# Internal function to handle error
ace_error <- function(ierr, FUN)
{
  if(!inherits(FUN, 'function')) return(NULL)
  if(ierr==1 || ierr==2) FUN("Weights must be greater than zero or Y has no variance.")
  if(ierr==3 || ierr==4 || ierr==5 || ierr==6) FUN("Internal error. Variable category misspecified or no variance in a dimension.")
  if(ierr != 0) FUN(paste("Internal error. Unknown ierr", ierr, "returned."))
}

#' @rdname ace
#' @export
ace.default  <- function(
  x,
  y,
  wt      = NULL,
  cat     = NULL, 
  mon     = NULL, 
  lin     = NULL,
  circ    = NULL,
  delrsq  = 0.01,
  control = NULL,
  on.error = warning,
  ...) 
{
  if(!is.null(control)) do.call(set_control, control)
  
  x  <- as.matrix(x)
  
  if(is.null(wt)) wt <- rep(1, nrow(x))
  
  if (delrsq <= 0) stop("delrsq must be positive")

  iy <- ncol(x) + 1
  l  <- matrix(1, ncol = iy)
  
  if (!is.null(circ))
  {
    for (i in 1:length(circ))
    {
      if (circ[i] < 0 || circ[i] > ncol(x)) stop("bad circ= specification")

      nncol <- if (circ[i] == 0) iy else circ[i]

      if (l[nncol] != 2 & l[nncol] != 1) 
        stop("conflicting transformation specifications")

      l[nncol] <- 2
    }
  }
    
  if (length(mon) > 0)
  {
    for (i in 1:length(mon))
    {
      if (mon[i] < 0 || mon[i] > ncol(x)) stop("bad mon= specification")

      nncol <-  if (mon[i] == 0) iy else mon[i]

      if (l[nncol] != 3 && l[nncol] != 1) 
        stop("conflicting transformation specifications")

      l[nncol] <- 3
    }
  }
  
  if (length(lin)>0)
  {
    for (i in 1:length(lin))
    {
      if (lin[i] < 0 || lin[i] > ncol(x)) 
        stop("bad lin= specification")

      nncol <- if (lin[i] == 0) iy else lin[i]

      if (l[nncol] != 4 && l[nncol] != 1)
        stop("conflicting transformation specifications")
      
      l[nncol] <- 4
    }
  }
  
  if (length(cat))
  {
    for (i in 1:length(cat))
    {
      if (cat[i] < 0 || cat[i] > ncol(x)) stop("bad cat= specification")

      nncol <- if (cat[i] == 0) iy else cat[i]

      if (l[nncol] != 5 && l[nncol] != 1) 
        stop("conflicting transformation specifications")

      l[nncol] <- 5
    }
  }
  
  tx <- x
  ty <- y
  m  <- matrix(0, nrow = nrow(x), ncol = iy)
  z  <- matrix(0, nrow = nrow(x), ncol = 12)
  ns <- 1
  
  mode(x)      <- "double"
  mode(y)      <- "double"
  mode(tx)     <- "double"
  mode(ty)     <- "double"
  mode(wt)     <- "double"
  mode(delrsq) <- "double"
  mode(z)      <- "double"
  
  results <- structure(
    .Fortran("mace",
      p       = as.integer(ncol(x)),
      n       = as.integer(nrow(x)), 
      x       = t(x),
      y       = y,
      w       = as.double(wt),
      l       = as.integer(l), 
      delrsq  = delrsq,
      ns      = as.integer(ns),
      tx      = tx,
      ty      = ty, 
      rsq     = double(1),
      ierr    = integer(1),
      m       = as.integer(m), 
      z       = z,
      PACKAGE = "acepack"),
    class=c("ace","list"))
  
  if(results$ierr != 0) ace_error(results$ierr, on.error)
  
  # Find original R^2
  results$orig_rsq <- summary(lm(results$y ~ t(results$x)))$r.squared

  results
}

#' @rdname ace
#' @importFrom stats model.frame
#' @export
ace.formula  <- function(
  formula,
  data      = NULL,
  subset    = NULL,
  na.action = getOption('na.action'),
  ...)
{
  # Copied from lm()
  mf <- match.call(expand.dots = FALSE)
  m  <- match(c("formula", "data", "subset", "na.action"), names(mf), 0L)
  mf <- mf[c(1L, m)]
  
  mf$drop.unused.levels <- TRUE
  mf[[1L]] <- quote(stats::model.frame)
  
  mf <- eval(mf, parent.frame())

  ace(mf[,2:ncol(mf)],mf[,1],...)
}

#' @rdname ace
#' @export
summary.ace <- function(object, ...)
{
  object$print_summary <- TRUE

  object
}

#' @rdname ace
#' @importFrom stats lm
#' @export
print.ace <- function(x, ..., digits=4)
{
  x$rsq      <- round(x$rsq,      digits)
  x$orig_rsq <- round(x$orig_rsq, digits)
  
  cat('\nAlternating Conditional Expections\n\n', ...)

  cat('p =', x$p, ', N =', x$n, '\n\n', ...)
  cat('Raw Multiple R-squared:', x$orig_rsq, '\n', ...)
  cat('Transformed Multiple R-squared:', x$rsq, '\n', ...)
  
  cat('\n', ...)
  
  if(!is.null(x$print_summary) && x$print_summary)
  {
    cat('Original Y\n', ...)
    print(summary(x$y))
    cat('\nTransformed Y\n', ...)
    print(summary(x$ty))
    cat('\nOriginal X\n', ...)
    print(summary(t(x$x)))
    cat('\nTransformed X\n', ...)
    print(summary(x$tx))
  }
}

#' @rdname ace
#' @importFrom graphics par
#' @importFrom grDevices as.graphicsAnnot
#' @importFrom grDevices dev.flush
#' @importFrom grDevices dev.hold
#' @importFrom grDevices dev.interactive
#' @importFrom grDevices devAskNewPage
#' @export
plot.ace <- function(
  x, 
  ...,
  which=1:(x$p+1),
  caption=c(list("Response Y ACE Transformation"),
    as.list(paste("Carrier", rownames(x$x), "ACE Transformation"))),
  xlab = "Original",
  ylab = "Transformed",
  ask = prod(par("mfcol")) < length(which) && dev.interactive()
)
{
  show <- rep(FALSE, x$p+1)
  show[which] <- TRUE
  
  getCaption <- function(k) # allow caption = "" , plotmath etc
    if(length(caption) < k) NA_character_ else as.graphicsAnnot(caption[[k]])
  
  if (ask)
  {
  	oask <- devAskNewPage(TRUE)
  	on.exit(devAskNewPage(oask))
  }
  
  if(show[1L])
  {
    dev.hold()
    plot(x$y, x$ty, main=getCaption(1), xlab=xlab, ylab=ylab, ...)
    dev.flush()
  }
  
  for(i in 1L:(x$p))
    if(show[i+1])
    {
      dev.hold()
      plot(x$x[i,], x$tx[,i], main=getCaption(i+1), xlab=xlab, ylab=ylab, ...)
      dev.flush()
    }
}

Any scripts or data that you put into this service are public.

acepack documentation built on April 4, 2025, 2:06 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

acepack
ACE and AVAS for Selecting Multiple Regression Transformations

R/ace.R
In acepack: ACE and AVAS for Selecting Multiple Regression Transformations

Defines functions plot.ace print.ace summary.ace ace.formula ace.default ace_error ace

Documented in ace ace.default ace.formula plot.ace print.ace summary.ace

Try the acepack package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

acepack ACE and AVAS for Selecting Multiple Regression Transformations

R/ace.R In acepack: ACE and AVAS for Selecting Multiple Regression Transformations

Defines functions plot.ace print.ace summary.ace ace.formula ace.default ace_error ace

Documented in ace ace.default ace.formula plot.ace print.ace summary.ace

Try the acepack package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

acepack
ACE and AVAS for Selecting Multiple Regression Transformations

R/ace.R
In acepack: ACE and AVAS for Selecting Multiple Regression Transformations