README.md

Multi-batch integration with scPSM

1 Overview

This repository contains the code for the paper Propensity Score Matching enables batch effect corrected imputation in single-cell RNA-seq analysis by Xu et al.

ScPSM (a propensity score matching method for scRNA-seq data) is a statistical tool useful for simultaneously correcting batch effect, imputing dropout and denoising gene expression.

2 Installation

devtools::install_github("eleozzr/scPSM")

The installation should only take a few seconds. The dependencies of the package are listed in the DESCRIPTION file of the package.

3 Usage

We provide two ways to run our methods. For the fist one, the authors can install scPSM package and then use the internal function psm_integrate. For the second one, the authors can run external R script.

3.1 using scPSM package

Start with the vignette online in ./vignettes/README.md.

3.2 using the main function directly

To perform scPSM, first run the help function in file utils.R, then run the script scPSM_main.R. Then the function psm_integrate will be loaded, we can refer to tutorial_pancreas.md for detail running steps.

psm_integrate <- function(batches, markers, hvg, k.self=10, k.mnn=10,
                          correct.all=TRUE, merge.order=1:4){
                          ...
                          }

Arguments

Value

Note: To run the example require the software R >= 4.0.0, batchelor >= 1.4.0, BiocNeighbors >= 1.6.0 and BiocParallel >= 1.22.0.

4 Simulation Data

The code of generating simulation datasets for Fig 2 and Fig S4 is available in simulation_data.md.

5 Real Data

The original data for the toy example is available in the inst/extdata folder

6 Cheatsheet

You can also refer to this cheatsheet to undersand a common workflow



eleozzr/scPSM documentation built on May 23, 2022, 12:05 a.m.