README.md
In BatadaLab/scID: geneset-guided identification of cell types using single cell RNA-seq data

scID

The power of single cell RNA sequencing (scRNA-seq) stems from its ability to uncover cell type-dependent phenotypes, which rests on the accuracy of cell type identification. However, resolving cell types within and, thus, comparison of scRNA-seq data across conditions is challenging due to technical factors such as sparsity, low number of cells and batch effect. To address these challenges we developed scID (Single Cell IDentification), which uses the framework of Fisher's Linear Discriminant Analysis to identify transcriptionally related cell types between scRNA-seq datasets. Detailed information on the method and performance evaluation is demostrated in the publication Boufea et al., iScience 2020. By increasing power to identify transcriptionally similar cell types across datasets, scID enhances investigator's ability to extract biological insights from scRNA-seq data.

scID classifies cells of a given target dataset based on their transcriptional similarity to given reference clusters in 4 steps. As a first step, scID extracts cluster-specific gene sets from the reference data and calculates weights (based on Fisher's Linear Discriminant Analysis) that represent their discriminative power to identify the cluster of interest. Next, scID scores all target cells based on the expression of the cluster-specific gene sets and, finally, identifies equivalent target cells by fitting a mixture of Gaussian distributions.

On May 17, 2019, we released the new version of scID (v2.0.0) that uses negative markers together with positive for identifying equivalent cells. We have seen that this improves classification in presense of very simlar cell types in the dataset.

scID can be installed using the devtools R package:

install.packages('devtools')
devtools::install_github("BatadaLab/scID")

Given two single-cell RNA-seq gene expression datasets with one of them having known groups of cells (clusters), scID can be used to identify transcriptionally similar cells in the second dataset.

scID_output <- scID::scid_multiclass(target_gem, reference_gem, reference_clusters, ...)

Input

target_gem An nxm data frame of n genes (rows) in m cells (columns) of the dataset with unknown grouping, where each entry is library-depth or column normalized gene expression. Cell names are expected to be unique
reference_gem An NxM data frame of N genes (rows) in M cells (columns) of the dataset with known grouping, where each entry is library-depth or column normalized gene expression
reference_clusters A list of cluster labels for the reference cells

Output

scID_output is a list of four objects

scID_output$labels A named list of cluster labels for the target cells
scID_output$markers A data frame of signature genes extracted from the reference clusters
scID_output$weights A list of the estimated weights for all cluster-specific genes
scID_output$scores A data frame of scores of target cells (columns) for each reference cluster-specific geneset (rows)

Identification of equivalent cells across single-cell RNA-seq datasets

To report bugs or ask any questions please use the GitHub issues tracker.

BatadaLab/scID documentation built on Oct. 21, 2021, 3:06 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

BatadaLab/scID
geneset-guided identification of cell types using single cell RNA-seq data

README.md
In BatadaLab/scID: geneset-guided identification of cell types using single cell RNA-seq data

scID

Installation

Usage

Input

Output

Vignettes

Need help?

R Package Documentation

Browse R Packages

We want your feedback!

BatadaLab/scID geneset-guided identification of cell types using single cell RNA-seq data

README.md In BatadaLab/scID: geneset-guided identification of cell types using single cell RNA-seq data

scID

Installation

Usage

Input

Output

Vignettes

Need help?

R Package Documentation

Browse R Packages

We want your feedback!

BatadaLab/scID
geneset-guided identification of cell types using single cell RNA-seq data

README.md
In BatadaLab/scID: geneset-guided identification of cell types using single cell RNA-seq data