README.md

ZIR (Zero-Inflated Rank Test)

Introduction

When the underlying distribution of data is unknown, usually we use nonparametric test to test the difference between groups. For example, Wilcoxon rank sum test (Mann-Whitney U test) is often used for two group comparison, and Kruskal Wallis test is used for multiple group comparison. However, sometimes the data may contain many zeroes and the traditional rank based test, such as Wilcoxon rank sum test and Kruskal Wallis test have low power due to the ties in the data.

Here we developed a R package called ZIR (Zero-Inflated Rank Test), which has more power than the traditional rank based test when the data contain many zeroes, and have similar power as the traditional rank based test when the data don't have too many zeros.

Installation

You can install our ZIR package from Github

install.packages("devtools")
devtools::install_github("chvlyl/ZIR")
library(ZIR)

Basic Usage

Modified Wilcoxon rank sum test (ZIW) for zero-inflated data

x <- c(rep(0,5),rlnorm(20, meanlog = 0, sdlog = 1))
y <- c(rep(0,10),rlnorm(20, meanlog = 2, sdlog = 1))
ziw(x, y, perm = FALSE)

If you want to use permutations to generate pvalues, you can set perm to TRUE

ziw(x, y, perm = TRUE)

Modified Kruskal Wallis test (ZIKW) for zero-inflated data

x <- list(group1 = c(rep(0,5),rlnorm(20, meanlog = 0, sdlog = 1)),
      group2=c(rep(0,10),rlnorm(20, meanlog = 1, sdlog = 1)),
      group3=c(rep(0,15),rlnorm(20, meanlog = 2, sdlog = 1)))
zikw(x, perm = FALSE)

If you want to use permutations to generate pvalues, you can set perm to TRUE

zikw(x, perm = TRUE)

Citation

Wanjie Wang, Eric Z. Chen and Hongzhe Li (2021). Truncated Rank-Based Tests for Two-Part Models with Excessive Zeros and Applications to Microbiome Data. Submitted.



chvlyl/ZIR documentation built on Oct. 14, 2021, 12:50 a.m.