DNCImper: Assembly process identification based on SIMPER analysis

Documented in DNCI_multigroup

#' DNCI multigroup Function : Pairwise Dispersal-Niche Continuum Index on 2 groups or more
#'
#' This function is a wraper for DNCI.ses() function. Can be used with 2 group or more.
#' Warning: if you are comparing groups of different size, use symmetrize = TRUE and repeat the computation N times.
#'
#' Quantitative identification of the main assembly process. Pairwise analysis.
#' This function is based on DNCI_ses (for 2 group analysis) and PerSIMPER function E index return().
#' The three distributions of E index (corresponding to the three hypothesis: niche, dispersal, niche+dispersal) are used to compute the DNCI index.
#' If DNCI is significantly < 0 : dispersal || DNCI significantly > 0 : niche || DNCI +- CI ~~ 0 : dispersal+niche
#' See Vilmi, Gibert et al. 2021 Ecography for DNCI computation and more information on process identification as well as example with Chinese and US river communities.
#' More information in code and comments inside function file.
#' @param x Sample/Taxa matrix with sample in row and taxa in column
#' @param grouping Grouping vector, ex : c(1,1,1,1,2,2,2,2,2) : 2 groups or more ex2 : c(1,1,1,1,1,2,2,2,2,3,3,3,3)
#' @param id Name of the dataset, default = "no_name"
#' @param symmetrize By random sampling of the largest groups, analyzed pairs of group are made even. Strong heterogeneity in group length can impact DNCI values. More information in function file and Vilmi et al. 2021.
#' @param count Display the number of permutation done, can be usefull with very large or small matrix, default = TRUE
#' @param dataTYPE Need to be set for presence/absence or abundance data ("count"), default = "prab" (presence_absence)
#' @param Nperm Number of permutation, default = 1000, should be change to 100 for robustness analysis
#' @param plotSIMPER Display the SIMPER, PerSIMPER and E index plots, default = TRUE
#' @param parallelComputing Run PerSIMPER on half of the available cores/nodes
#' @examples A <- DNCImper:::DNCI_multigroup(Matrix, Group)
#' @examples #where Matrix is a presence/absence matrix with taxa in column and sample in row
#' @examples #and Group is a vector with length() == number of rows/samples in Matrix, 2 groups, 1 pair
#' @examples #
#' @examples B <- DNCImper:::DNCI_multigroup(DNCImper::Matrix_4groups, DNCImper::Group4, Nperm = 100, count = FALSE, plotSIMPER = FALSE)
#' @examples #In this example, four groups (so 6 pairs) are analysed, with 100 permutations, with no countdown and no plots
#' @importFrom graphics legend lines title
#' @importFrom stats median quantile sd
#' @importFrom utils combn
#'
#'



#############################################################
######### DNCI Function for 2 groups AND more  ##############
################# ANALYSIS ON PAIRS #########################
###### This function calls PerSIMPER and DNCI functions #####

## x = matrix with taxa in columns and localities in rows
## grouping = grouping vector for localities e.g. group <- c(1, 1, 1, 1, 1, 2, 2, 2, 2, 2, 2, 2, 2)
## grouping need to have the same length as X number of rows
## id = Name of your dataset
## Nperm and count arguments are for PerSIMPER calling, same argument as PerSIMPER fun()

## SYMMETRIZE : [IMPORTANT] : this argument make group even by subsampling the largest
##                            group to reduce it (or them) to the smallest group size
##              [IMPORTANT] : Repeat computation X (e.g. 1000) times to obtain mean values
##                            Effect can be strong if groups are strongly uneven

DNCI_multigroup <- function(x, grouping,id = "no_name", Nperm = 1000, count = TRUE,
                            symmetrize = FALSE, plotSIMPER = TRUE, dataTYPE = "prab", parallelComputing = FALSE) {
  group.combinations <- combn(unique(sort(grouping)),2)
  warning("This function is based on resampling algorithm, it MUST be repeated in order to obtain mean/median DNCI values")
  ddelta <- NULL

  for(i in 1:NCOL(group.combinations)) {
    splitx <- split(x,grouping)

    #Ici symmetrize:
    if(symmetrize == TRUE)
    {
      Add <- which(c(NROW(splitx[[group.combinations[1,i]]]),
                     NROW(splitx[[group.combinations[2,i]]])) == max(c(NROW(splitx[[group.combinations[1,i]]]),
                                                                       NROW(splitx[[group.combinations[2,i]]]))))
      if(Add == 1)
      {

        sampled_lines <- sample(1:length(splitx[[group.combinations[1,i]]][,1]),
                                length(splitx[[group.combinations[2,i]]][,1]))
        splitx[[group.combinations[1,i]]] <- splitx[[group.combinations[1,i]]][sampled_lines,]
      }

      if(Add == 2)
      {

        sampled_lines <- sample(1:length(splitx[[group.combinations[2,i]]][,1]),
                                length(splitx[[group.combinations[1,i]]][,1]))
        splitx[[group.combinations[2,i]]] <- splitx[[group.combinations[2,i]]][sampled_lines,]
      }

    }

    paired.x <- rbind(splitx[[group.combinations[1,i]]],
                      splitx[[group.combinations[2,i]]])

    # remove empty species
    ifzero <- which(apply(paired.x, 2, sum) == 0)
    if(length(ifzero > 0)){
      paired.x <- paired.x[,-which(colSums(paired.x)==0)]}
    if(length(which(rowSums(paired.x) == 0)) != 0){stop("ERROR : A row/sample is empty")}
    group.pair <- c(rep(group.combinations[1,i], NROW(splitx[[group.combinations[1,i]]])),
                    rep(group.combinations[2,i], NROW(splitx[[group.combinations[2,i]]])))
    ddelta <- rbind(ddelta, DNCImper:::DNCI.ses(x=paired.x,grouping=group.pair,id=id, Nperm = Nperm,
                                                count = count, plotSIMPER = plotSIMPER, dataTYPE = dataTYPE, parallelComputing = parallelComputing)) #here is the part that calculates the index based on PERSIMPER
  }

  return(ddelta)
}

### return : similar to DNCI results with the exception that returned variable is a multiples rows dataframe().

Corentin-Gibert-Paleontology/DNCImper documentation built on Feb. 8, 2025, 10:20 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Corentin-Gibert-Paleontology/DNCImper
Assembly process identification based on SIMPER analysis

R/DNCI_multigroup.R
In Corentin-Gibert-Paleontology/DNCImper: Assembly process identification based on SIMPER analysis

Defines functions DNCI_multigroup

Documented in DNCI_multigroup

R Package Documentation

Browse R Packages

We want your feedback!

Corentin-Gibert-Paleontology/DNCImper Assembly process identification based on SIMPER analysis

R/DNCI_multigroup.R In Corentin-Gibert-Paleontology/DNCImper: Assembly process identification based on SIMPER analysis

Defines functions DNCI_multigroup

Documented in DNCI_multigroup

R Package Documentation

Browse R Packages

We want your feedback!

Corentin-Gibert-Paleontology/DNCImper
Assembly process identification based on SIMPER analysis

R/DNCI_multigroup.R
In Corentin-Gibert-Paleontology/DNCImper: Assembly process identification based on SIMPER analysis