R/partitioning.R
In ecospace: Simulating Community Assembly and Ecological Diversification Using Ecospace Frameworks

Documented in partitioning

#' Use Partitioning Rule to Simulate Ecological Diversification of a Biota.
#'
#' Implement Monte Carlo simulation of a biota undergoing ecological
#' diversification using the partitioning rule.
#'
#' @param nreps Vector of integers (such as a sequence) specifying sample number
#'   produced. Only used when function is applied within \code{lapply} or
#'   related function. Default \code{nreps=1} or any other integer produces a
#'   single sample.
#' @param Sseed Integer giving number of species (or other taxa) to use at start
#'   of simulation.
#' @param ecospace An ecospace framework (functional trait space) of class
#'   \code{ecospace}.
#' @param Smax Maximum number of species (or other taxa) to include in
#'   simulation.
#' @param method Distance measure to use when calculating functional distances
#'   between species. Default is \code{Euclidean} using
#'   \code{stats::\link[stats]{dist}}. \code{Gower} or any other value uses
#'   Gower distance (using \code{FD::\link[FD]{gowdis}}). Presence of factor or
#'   ordered factor character types forces use of Gower distance.
#' @param rule The partitioning implementation to use in the simulation. Default
#'   \code{strict} implements the 'minimum distant neighbor' rule;
#'   \code{relaxed} implements the 'maximum nearest neighbor' rule. See
#'   'Details' below for further explanation.
#' @param strength Strength parameter controlling probability that partitioning
#'   rule is followed during simulation. Values must range between
#'   \code{strength = 1} (default, rules always implemented) and \code{strength = 0}
#'   (rules never implemented).
#'
#' @details Simulations are implemented as Monte Carlo processes in which
#'   species are added iteratively to assemblages, with all added species having
#'   their character states specified by the model rules, here the
#'   'partitioning' rule. Simulations begin with the seeding of \code{Sseed}
#'   number of species, chosen at random (with replacement) from either the
#'   species pool (if provided in the \code{weight.file} when building the
#'   ecospace framework using \code{create_ecospace}) or following the
#'   neutral-rule algorithm (if a pool is not provided). Once seeded, the
#'   simulations proceed iteratively (character-by-character,
#'   species-by-species) by following the appropriate algorithm, as explained
#'   below, until terminated at \code{Smax}.
#'
#'   \strong{Partitioning rule algorithm:} Measure distances between all pairs
#'   of species, using \code{Euclidean} or \code{Gower} distance method
#'   specified by \code{method} argument. Use either of the following rules to
#'   identify the position of each additional species. \describe{
#'   \item{\code{strict} (minimum distant neighbor) rule}{Identify the maximum
#'   distances between all pairs of species (the most-distant neighbors); the
#'   space to be partitioned is the minimum of these distances. This
#'   implementation progressively fills in the largest parts of the ecospace
#'   that are least occupied between neighboring species, and eventually
#'   partitions the ecospace in straight-line gradients between seed species.}
#'   \item{\code{relaxed} (maximum nearest neighbor) rule}{Identify
#'   nearest-neighbor distances between all pairs of species; the space to be
#'   partitioned is the maximum of these distances. This implementation places
#'   new species in the most unoccupied portion of the ecospace that is within
#'   the cluster of pre-existing species, often the centroid.}} In both rules,
#'   each new species is created as a resampled combination of the character
#'   states of the identified neighbors. If multiple pairs meet the specific
#'   criteria, one of these pairs is chosen at random. Ordered, multistate
#'   character partitioning (such as ordered factors or ordered numeric character
#'   types) can include any state equal to or between the observed states of
#'   existing species. The probability of following the partitioning rule is
#'   determined by the \code{strength} parameter. Default \code{strength = 1}
#'   always implements the rule, whereas \code{strength = 0} never implements it
#'   (essentially making the simulation follow the \code{\link{neutral}} rule).
#'
#'   Each newly assigned character is compared with the ecospace framework
#'   (\code{ecospace}) to confirm that it is an allowed state combination before
#'   proceeding to the next character. If the newly built character is
#'   disallowed from the ecospace framework (i.e., because it has "dual
#'   absences" [0,0], has been excluded based on the species pool
#'   [\code{weight.file} in \code{create_ecospace}], or is not allowed by the
#'   ecospace \code{constraint} parameter), then the character-selection
#'   algorithm is repeated until an allowable character is selected. When
#'   simulations proceed to very large sample sizes (>100), this confirmatory
#'   process can require long computational times, and produce "new"
#'   intermediate species that are functionally identical to pre-existing
#'   species. This can occur, for example, when no life habit, or perhaps only
#'   one, exists that forms an allowable intermediate between the selected
#'   neighbors.
#'
#'   Partitioning rules tend to produce ecospaces displaying linear gradients
#'   between seed species (in the \code{strict} implementation) or concentration
#'   of life habits near the functional centroid (in the \code{relaxed}
#'   implementation). Additional details on the partitioning simulation are
#'   provided in Novack-Gottshall (2016a,b), including sensitivity to
#'   ecospace framework (functional trait space) structure, recommendations for
#'   model selection, and basis in ecological and evolutionary theory.
#'
#' @return Returns a data frame with \code{Smax} rows (representing species) and
#'   as many columns as specified by number of characters/states (functional
#'   traits) in the ecospace framework. Columns will have the same data type
#'   (numeric, factor, ordered numeric, or ordered factor) as specified in the
#'   ecospace framework.
#'
#' @note A bug exists within \code{FD::\link[FD]{gowdis}} where nearest-neighbor
#'   distances cannot be calculated when certain characters (especially numeric
#'   characters with values other than 0 and 1) share identical traits across
#'   species. The nature of the bug is under investigation, but the current
#'   implementation is reliable under most uses. If you run into problems
#'   because of this bug, a work-around is to manually change the function to
#'   call \code{cluster::\link[cluster]{daisy}} using \code{metric = "gower"}
#'   instead.
#'
#'   The function has been written to allow usage (using \code{\link{lapply}} or
#'   some other list-apply function) in 'embarrassingly parallel' implementations
#'   in a high-performance computing environment.
#'
#' @author Phil Novack-Gottshall \email{pnovack-gottshall@@ben.edu}
#'
#' @references Bush, A. and P.M. Novack-Gottshall. 2012. Modelling the
#'   ecological-functional diversification of marine Metazoa on geological time
#'   scales. \emph{Biology Letters} 8: 151-155.
#' @references Novack-Gottshall, P.M. 2016a. General models of ecological
#'   diversification. I. Conceptual synthesis. \emph{Paleobiology} 42: 185-208.
#' @references Novack-Gottshall, P.M. 2016b. General models of ecological
#'   diversification. II. Simulations and empirical applications.
#'   \emph{Paleobiology} 42: 209-239.
#'
#' @seealso \code{\link{create_ecospace}}, \code{\link{neutral}},
#'   \code{\link{redundancy}}, \code{\link{expansion}}
#'
#' @examples
#' # Create an ecospace framework with 15 3-state factor characters
#' # Can also accept following character types: "numeric", "ord.num", "ord.fac"
#' nchar <- 15
#' ecospace <- create_ecospace(nchar = nchar, char.state = rep(3, nchar),
#'   char.type = rep("factor", nchar))
#'
#' # Single (default) sample produced by partitioning function (with strength = 1 and
#' #   "strict" partitioning rules):
#' Sseed <- 5
#' Smax <- 40
#' x <- partitioning(Sseed = Sseed, Smax = Smax, ecospace = ecospace, rule = "strict")
#' head(x, 10)
#'
#' # Plot results, showing order of assembly
#' # (Seed species in red, next 5 in black, remainder in gray)
#' # Notice the 'strict' partitioning model produces an ecospace with life-habit gradients
#' #   between seed species
#' seq <- seq(nchar)
#' types <- sapply(seq, function(seq) ecospace[[seq]]$type)
#' if(any(types == "ord.fac" | types == "factor")) pc <- prcomp(FD::gowdis(x)) else
#'   pc <- prcomp(x)
#' plot(pc$x, type = "n", main = paste("Partitioning model,\n", Smax, "species"))
#' text(pc$x[,1], pc$x[,2], labels = seq(Smax), col = c(rep("red", Sseed), rep("black", 5),
#'   rep("slategray", (Smax - Sseed - 5))), pch = c(rep(19, Sseed), rep(21, (Smax - Sseed))),
#'   cex = .8)
#'
#' # Same, but following "relaxed" partitioning rules:
#' # Notice the 'relaxed' partitioning model only fills in the ecospace between seed species
#' x <- partitioning(Sseed = Sseed, Smax = Smax, ecospace = ecospace, rule = "relaxed")
#' if(any(types == "ord.fac" | types == "factor")) pc <- prcomp(FD::gowdis(x)) else
#'   pc <- prcomp(x)
#' plot(pc$x, type = "n", main = paste("Partitioning model,\n", Smax, "species"))
#' text(pc$x[,1], pc$x[,2], labels = seq(Smax), col = c(rep("red", Sseed), rep("black", 5),
#'   rep("slategray", (Smax - Sseed - 5))), pch = c(rep(19, Sseed), rep(21, (Smax - Sseed))),
#'   cex = .8)
#'
#' # Change strength parameter so rules followed 95% of time:
#' x <- partitioning(Sseed = Sseed, Smax = Smax, ecospace = ecospace, strength = 0.95, rule = "strict")
#' if(any(types == "ord.fac" | types == "factor")) pc <- prcomp(FD::gowdis(x)) else
#'   pc <- prcomp(x)
#' plot(pc$x, type = "n", main = paste("Partitioning model,\n", Smax, "species"))
#' text(pc$x[,1], pc$x[,2], labels = seq(Smax), col = c(rep("red", Sseed), rep("black", 5),
#'   rep("slategray", (Smax - Sseed - 5))), pch = c(rep(19, Sseed), rep(21, (Smax - Sseed))),
#'   cex = .8)
#'
#' # Create 5 samples using multiple nreps and lapply (can be slow)
#' nreps <- 1:5
#' samples <- lapply(X = nreps, FUN = partitioning, Sseed = 5, Smax = 40,
#'   ecospace = ecospace)
#' str(samples)
#'
#' @export
partitioning <- function(nreps = 1, Sseed, Smax, ecospace, method = "Euclidean", rule = "strict", strength = 1) {
  if (strength < 0 | strength > 1)
    stop("strength must have a value between 0 and 1\n")
  nchar <- length(ecospace) - 1
  seq <- seq_len(nchar)
  pool <- ecospace[[length(ecospace)]]$pool
  state.names <-
    unlist(sapply(seq, function(seq)
      colnames(ecospace[[seq]]$char.space)[seq_len(ncol(ecospace[[seq]]$char.space) - 3)]))
  char.type <- sapply(seq, function(seq)
    ecospace[[seq]]$type)
  if (method != "Euclidean" | any(char.type == "factor") | any(char.type == "ord.fac"))
    method <- "Gower"
  cs <-
    sapply(seq, function(seq)
      ncol(ecospace[[seq]]$char.space) - 3)
  c.start <- c(1, cumsum(cs)[seq_len(nchar - 1)] + 1)
  c.end <- cumsum(cs)
  data <- prep_data(ecospace, Smax)
  for (sp in 1:Smax) {
    if (sp <= Sseed) {
      if (!is.logical(pool)) {
        data[sp, ] <- pool[sample2(seq_len(nrow(pool)), 1), ]
      } else {
        for (ch in 1:nchar) {
          c.sp <- ecospace[[ch]]$char.space
          data[sp, c.start[ch]:c.end[ch]] <-
            c.sp[c.sp[(rmultinom(1, 1, prob = c.sp$pro) == 1), ncol(c.sp)], seq_len(cs[ch])]
        }
      }
    } else {
      # Choose ecospace-appropriate distance metric.
      if (method == "Gower") {
        dist <-
          FD::gowdis(data[seq_len(sp - 1), ])
      } else {
        dist <- dist(data[seq_len(sp - 1), ])
      }
      d <- as.matrix(dist)
      d[row(d) == col(d)] <- NA  # Make diagonals missing
      # Apply the chosen rule to find the target neighbor distance
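      # "strict": minimum of the most-distant-neighbor distances
      if (rule == "strict")
        nnd <- min(apply(d, 1, max, na.rm = TRUE))
      # "relaxed": maximum of the nearest-neighbor distances
      if (rule == "relaxed")
        nnd <- max(apply(d, 1, min, na.rm = TRUE))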
      pairs <- arrayInd(which(d == nnd), dim(d))
      pick <- rbind(data[pairs[sample2(seq_len(nrow(pairs)), 1), ], ])
      for (ch in 1:nchar) {
        c.sp <- ecospace[[ch]]$char.space
        st <- seq_len(cs[ch])
        opts.l <- length(unique(unlist(c.sp[, st])))
        opts <-
          apply(as.matrix(c.sp[, st]), 2, unique2, length = opts.l)
        ps <- as.matrix(pick[, c.start[ch]:c.end[ch]])
        # Repeat until yields "allowable" ecospace combination
        repeat {
          if (runif(1, 0, 1) <= strength) {
            if (ecospace[[ch]]$type == "factor") {
              data[sp, c.start[ch]:c.end[ch]] <-
                ps[sample2(seq_len(nrow(ps)), 1), 1]
            } else {
              data[sp, c.start[ch]:c.end[ch]] <-
                sapply(st, function(st)
                  sample2(opts[which(opts[, st] <= max(ps[, st]) &
                                       opts[, st] >= min(ps[, st])), st], 1))
            }
          } else {
            data[sp, c.start[ch]:c.end[ch]] <-
              c.sp[c.sp[(rmultinom(1, 1, prob = c.sp$pro) == 1), ncol(c.sp)], seq_len(cs[ch])]
          }
          if (paste(data[sp, c.start[ch]:c.end[ch]], collapse = ".") %in% ecospace[[ch]]$allowed.combos)
            break
        }
      }
    }
  }
  return(data)
}
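
The neighbor-selection step described under 'Details' can be sketched on a toy assemblage using base R only. This is a minimal illustration under stated assumptions, not the package's implementation: the four species, the allowed states `0:5`, and the helper `find_pairs()` are all hypothetical and are not part of ecospace. It follows the documented definitions, with `strict` taking the minimum of each species' most-distant-neighbor distance and `relaxed` taking the maximum of each species' nearest-neighbor distance.

```r
# Toy assemblage: 4 species described by 2 ordered numeric characters.
# (Values are made up for illustration only.)
toy <- data.frame(
  char1 = c(0, 1, 5, 5),
  char2 = c(0, 0, 0, 2)
)

# Pairwise Euclidean distances, with the diagonal set to NA so that a
# species is never treated as its own neighbor.
d <- as.matrix(dist(toy))
diag(d) <- NA

# Hypothetical helper (not part of ecospace): return the target distance
# and the row/column indices of the species pairs matching it.
find_pairs <- function(d, rule = c("strict", "relaxed")) {
  rule <- match.arg(rule)
  target <- if (rule == "strict") {
    # Minimum of each species' most-distant-neighbor distance
    min(apply(d, 1, max, na.rm = TRUE))
  } else {
    # Maximum of each species' nearest-neighbor distance
    max(apply(d, 1, min, na.rm = TRUE))
  }
  list(target = target, pairs = arrayInd(which(d == target), dim(d)))
}

strict <- find_pairs(d, "strict")    # selects species 2 and 4
relaxed <- find_pairs(d, "relaxed")  # selects species 3 and 4

# Build one new species from the 'strict' pair: for each ordered character,
# draw any allowed state lying between (or equal to) the neighbors' states.
states <- 0:5  # assumed set of allowed states for every character
set.seed(1)
pick <- toy[strict$pairs[1, ], ]
new_sp <- sapply(pick, function(x) {
  opts <- states[states >= min(x) & states <= max(x)]
  # sample.int() avoids the surprise of sample() on a length-1 vector
  opts[sample.int(length(opts), 1)]
})
new_sp
```

Because the distance matrix is symmetric, each matching pair appears twice in `pairs` (once in each order); picking either row yields the same two neighbors. The sketch omits the `strength` parameter and the check against allowed state combinations that the function performs for each character.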

ecospace documentation built on July 8, 2020, 5:54 p.m.