Home

/

CRAN

/

mMPA

/

Package: mMPA
In mMPA: Implementation of Marker-Assisted Mini-Pooling with Algorithm

Introduction

The mMPA provides functions to determine the number of quantitative assays needed for a sample of data using pooled testing methods, which include mini-pooling (MP), MP with algorithm (MPA), and marker-assisted MPA (mMPA). To estimate the number of assays needed, the package also provides a tool to conduct Monte Carlo (MC) to simulate different orders in which the sample would be collected to form pools. Using MC avoids the dependence of the estimated number of assays on any specific ordering of the samples to form pools.

Example

The following R code example demonstrates the use of the mMPA package.

Estimate the average number of assays required by mMPA

Let us assume that blood samples of n = 300 HIV+ individuals are collected for HIV viral load (VL) testing. We simulate the VL test results using a Gamma (shape = 2.8, scale = 150) distribution, and generate their corresponding risk scores by adding a uniform random noise to the percentile of VL. The resulting VL has a median of 392 and an interquartile range (IQR) from 224 to 565; the resulting risk score and VL have a Spearman’s correlation of 0.69.

n = 300
set.seed(100)
pvl = rgamma(n, shape = 2.8, scale = 150)

summary(pvl)
riskscore = (rank(pvl)/n) * 0.5 + runif(n) * 0.5
cor(pvl, riskscore, method = "spearman")

We use mMPA to do a pooled VL testing with a pool size of K = 5. A total of 60 pools are formed.

# Pool size K is set to 5
K = 5
# so, the number of pools = 60
n.pool  = n/K; n.pool

Of course, there are many ways to form pools. Using Monte Carlo simulation, we permute the data perm_num = 100 time to mimic situations that the individuals came to the clinics in different orders. Thus, different choices of five blood samples are pooled.

The mMPA package includes a function called pooling_mc(v, s, K, perm_num, method, ...), which takes five main arguments as function inputs: Values of test results (v), corresponding risk scores (s), pool size (K), the number of Monte Carlo simulations (perm_num), and the method for pooling (which by default use method = "mMPA"). The function outputs the total number of VL assays needed for each of the 60 pools from each permutation.

 library(mMPA)
 foo = pooling_mc(pvl, riskscore, K, perm_num = 100)

The output foo is a 60x100 matrix, of which each column stores the numbers of VL tests needed by the 60 pools that are formed for each permutation.

The average number of VL tests needed per pool is then calculated to be 3.35.

 mean(foo)

The average number of VL tests needed per individual is then calculated as 0.67.

 mean(foo)/K

So the Average number of VL Tests Required per 100 individuals (ATR) is estimated to be 67.

Comparison with other pooling algorithms

If we use mini-pooling (MP) for VL testing, we need an average of 1.192 assays for each individual.

foo_mp = pooling_mc(pvl, riskscore, perm_num = 100, method = "minipool")
mean(foo_mp)

mean(foo_mp)/K

If we use mini-pooling with algorithm (MPA) (c.f. May et al, 2010), we need 0.79 assay per individual on average.

foo_mpa = pooling_mc(pvl, riskscore, perm_num = 100, method = "mpa")
mean(foo_mpa)
mean(foo_mpa)/K

The ATRs for MP, MPA, and mMPA are 119, 79, and 67, respectively. Graphically, the efficiency of the three pooling algorithms is illustrated by the following graph.

boxplot(cbind(MP=apply(foo_mp, 2, mean),
              MPA=apply(foo_mpa, 2, mean),
              mMPA=apply(foo, 2, mean))/K*100,
        ylab = "Number of assays required per 100 individuals")

Calculation of 95% confidence interval using bootstrap

Above we use Monte Carlo simulation (with 100 permutations) to obtain a point estimate of ATR for each of the three pooling methods. It is a "point" estimate because all permutations are carried out on one sample, and not taking into account the sampling variability. In the following, we provide a code example to illustrate how to use the bootstrap method to calculate the 95% confidence intervals for the estimated ATRs.

### we use 500 bootstrap resamples ###
 n_bt = 500

### For each bootstrap resample, we use Monte Carlo simulation to 
### estimate the ATR for each pooling method. The results are saved 
### in a 500x3 matrix. 
 bt_result = matrix(NA, nrow = n_bt, ncol = 3)

 for(i in 1:n_bt){
   bt_index = sample(size = n, x = 1:n, replace = T)
   bt_pvl = pvl[bt_index]
   bt_riskscore = riskscore[bt_index]

   ### bt_pvl is a bootstrap sample of PVL; the corresponding risk is bt_riskscore
   bt_result[i, 1] = mean(pooling_mc(bt_pvl, bt_riskscore, perm_num = 100, method = "minipool"))/K*100
   bt_result[i, 2] = mean(pooling_mc(bt_pvl, bt_riskscore, perm_num = 100, method = "mpa"))/K*100
   bt_result[i, 3] = mean(pooling_mc(bt_pvl, bt_riskscore, perm_num = 100, method = "mmpa"))/K*100
}

In the following, we define a function called ci_foo() to calculate 2.5% and 97.5% tiles.

> ci_foo = function(x) quantile(x, probs = c(0.025, 0.975))
> apply(bt_result, 2, ci_foo)
          [,1]      [,2]      [,3]
2.5%  118.2737  75.47583  63.10633
97.5% 119.6833  82.05275  70.74525

The 95% confidence intervals are shown in the following table. So for this simulated data set, mMPA requires an average of 67 VL assays per 100 individual with a 95% CI of (63, 71). mMPA requires significantly less assays than direct individual testing (IND) and pooled testing using MP and MPA.

| Method | ATR | 95% CI | |---------|:-----:|:--------------:| | IND | 100 | --- | | MP | 119.2 | (118.3, 119.7) | | MPA | 79 | (75, 82) | | mMPA | 67 | (63, 71) |

Appendix: GitHub Installation

The latest version of the mMPA package is available at GitHub mMPA. It requires the devtools package to be installed in R. If you do not have devtools in your R program, use the code install.packages("devtools") to install the devtools package first. Then run the following codes to install the mMPA package.

devtools::install_github("taotliu/mMPA")
library(mMPA)

Any scripts or data that you put into this service are public.

mMPA documentation built on May 1, 2019, 6:33 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

mMPA
Implementation of Marker-Assisted Mini-Pooling with Algorithm

Package: mMPA
In mMPA: Implementation of Marker-Assisted Mini-Pooling with Algorithm

Introduction

Example

Estimate the average number of assays required by mMPA

Comparison with other pooling algorithms

Calculation of 95% confidence interval using bootstrap

Appendix: GitHub Installation

Try the mMPA package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

mMPA Implementation of Marker-Assisted Mini-Pooling with Algorithm

Package: mMPA In mMPA: Implementation of Marker-Assisted Mini-Pooling with Algorithm

Introduction

Example

Estimate the average number of assays required by mMPA

Comparison with other pooling algorithms

Calculation of 95% confidence interval using bootstrap

Appendix: GitHub Installation

Try the mMPA package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

mMPA
Implementation of Marker-Assisted Mini-Pooling with Algorithm

Package: mMPA
In mMPA: Implementation of Marker-Assisted Mini-Pooling with Algorithm