Process_OnescRNAseq: process one scRNAseq dataset to generate QC metadata

Description Usage Arguments Value See Also Examples

View source: R/scRNABatchQC.R

Description

Generate technical and biological metadata for one single cell RNAseq dataset

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
Process_OnescRNAseq(
  input,
  sf = 10000,
  mincounts = 500,
  mingenes = 200,
  maxmito = 0.2,
  mtRNA = "^mt-|^MT-",
  rRNA = "^Rp[sl][[:digit:]]|^RP[SL][[:digit:]]",
  nHVGs = 1000,
  nPCs = 10,
  PCind = 1,
  organism = "mmusculus",
  chunk.size = NULL
)

Arguments

input

string of file or path name, a SingleCellExperiment or Seurat v3 object;
input can be the file name (or a URL starting http://, file://, etc.) of gene-by-cell count matrix, the rowname should be gene symbol; the file should be regular delimited file; Compressed files ending .gz and .bz2 are supported.
input can be the path name, which contains the barcodes.tsv.gz, features.tsv.gz, and matrix.mtx.gz provided by 10X from CellRanger >=3.0
input can also be a SingleCellExperiment or Seurat v3 object

sf

integer; Scale factor to normalize the single cell RNA-seq data (default: 10000)

mincounts

integer; the cutoff of filtering the cell if the total number of counts in the cell less than the mincounts (default:500)

mingenes

integer; the cutoff of filtering the cell if the total number of genes detected in the cell less than the mingenes (default: 200)

maxmito

float; the cutoff of filtering the cell if the percentage of mtRNA reads in the cell larger than the minmito; (default: 0.2);

mtRNA

string; the pattern of genenames for mitochondrial encoded RNAs ; (default: "^mt-|^MT-", the default is mtRNA genenames in human or mouse); If not human or mouse, input the gene name pattern of mtRNA

rRNA

string; the pattern of genenames for ribosomal proteins; (default: "^Rp[sl][[:digit:]]|^RP[SL][[:digit:]]", the default is ribosomal protein genenames in human or mouse); If not human or mouse, input the gene name pattern of ribosomal proteins

nHVGs

integer; the number of highly variable genes (default: 1000)

nPCs

integer: the number of principal components (default: 10)

PCind

integer; which principal component for exploring biological featues (default: 1; the first principal component will be used to find genes highly correlated with PCA 1); PCind should be less than nPC

organism

string; the organism of single cell RNAseq datasets; if supported by WebGestaltR, functional enrichment analysis will be performed (defeault: mmusculus)

chunk.size

NULL or integer; default is NULL, suggesting data will be loaded into memory at one time, otherwise, the data will be loaded into memory by chunks with chunk.size

Value

a SingleCellExperiment object with several slots:

See Also

Process_scRNAseq, Combine_scRNAseq

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
library(scRNABatchQC)
sce<-Process_OnescRNAseq(input="https://github.com/liuqivandy/scRNABatchQC/raw/master/bioplar1.csv.gz")
head(sce@rowRanges@elementMetadata)
head(colData(sce))
counts(sce)[1:5,1:5]
logcounts(sce)[1:5,1:5]
sce@metadata$rawmeta$ngenes
head(sce@metadata$rawmeta$CellData)
head(sce@metadata$pc1Pathway)
sces=list(sce=sce)
plotDensity(sces)

liuqivandy/scRNABatchQC documentation built on March 24, 2021, 11:01 p.m.