In rnabioco/clustifyrdata: External data sets for clustifyr

knitr::opts_chunk$set(
  message = FALSE,
  warning = FALSE,
  collapse = TRUE,
  comment = "#>",
  fig.align = "center"
)

Benchmark 1. MCA lung dataset annotation using `ref_tabula_muris_drop` reference

library(clustifyr)
library(clustifyrdata)

l_mat <- clustifyrdata::MCA_lung_mat
l_meta <- clustifyrdata::MCA_lung_meta

# find lung references, remove generic terms
lung_cols <-  grep("-Lung",
                   colnames(ref_tabula_muris_drop), 
                   value = TRUE)

tml_ref <- ref_tabula_muris_drop[, lung_cols]
tml_ref <- tml_ref[, -c(8, 13)]

# default with all genes
start <- proc.time()

res <- clustify(
  input = l_mat,
  ref_mat = tml_ref,
  metadata = l_meta,
  cluster_col = "Annotation"
)

res_allgenes <- cor_to_call(
  cor_mat = res,
  metadata = l_meta,
  cluster_col = "Annotation"
)

end <- proc.time()

names(res_allgenes) <- c("MCA annotation", "clustifyr call", "r")
print(end - start)
print(res_allgenes, n = nrow(res_allgenes))

benchmark 2. Using sorted microarray data to classify 10x PBMC example data, available in `clustifyrdata` package

full_pbmc_matrix <- clustifyrdata::pbmc_matrix
full_pbmc_meta <- clustifyrdata::pbmc_meta
microarray_ref <- clustifyrdata::ref_hema_microarray

start <- proc.time()

res <- clustify(
  input = full_pbmc_matrix,
  ref_mat = microarray_ref,
  metadata = full_pbmc_meta,
  query_genes = pbmc_vargenes[1:500],
  cluster_col = "classified"
)

res2 <- cor_to_call(res, threshold = 0.5)

end <- proc.time()

names(res2) <- c("manual annotation", "clustifyr call", "r")
print(end - start)
print(res2, n = nrow(res2))

Please see manuscript for full benchmarking.

Comparison with other methods

using Tablua Muris (drop and facs samples) 12 shared tissues, which can be downloaded as seurat objects

Building reference and then mapping:

default clustify, with all genes

clustify, pulling var.genes from seurat objects

clustify, using M3Drop for feature selection

clustify, using per_cell = TRUE option, and then assign cluster consensus ident with collapse_to_cluster = TRUE

clustify, after ALRA imputation, using per_cell = TRUE option, and then assign cluster consensus ident with collapse_to_cluster = TRUE

scmap-cluster

Mapping from prebuilt all-encompassing references to the drop samples:

clustify, using ref_tabula_muris_facs

singleR, using default built-in mouse references without fine tuning

Generate marker gene list (of 30 genes per reference identity), and then mapping

default clustify_list

knitr::include_graphics("img/test.png")

rnabioco/clustifyrdata documentation built on Nov. 8, 2022, 4:26 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

rnabioco/clustifyrdata
External data sets for clustifyr

In rnabioco/clustifyrdata: External data sets for clustifyr

Benchmark 1. MCA lung dataset annotation using `ref_tabula_muris_drop` reference

benchmark 2. Using sorted microarray data to classify 10x PBMC example data, available in `clustifyrdata` package

Comparison with other methods

R Package Documentation

Browse R Packages

We want your feedback!

rnabioco/clustifyrdata External data sets for clustifyr

In rnabioco/clustifyrdata: External data sets for clustifyr

Benchmark 1. MCA lung dataset annotation using ref_tabula_muris_drop reference

benchmark 2. Using sorted microarray data to classify 10x PBMC example data, available in clustifyrdata package

Comparison with other methods

R Package Documentation

Browse R Packages

We want your feedback!

rnabioco/clustifyrdata
External data sets for clustifyr

Benchmark 1. MCA lung dataset annotation using `ref_tabula_muris_drop` reference

benchmark 2. Using sorted microarray data to classify 10x PBMC example data, available in `clustifyrdata` package