Process_scRNAseq: process scRNAseq datasets one by one to generate QC metadata

Description Usage Arguments Value See Also Examples

View source: R/scRNABatchQC.R

Description

Generate technical and biological metadata for one or multiple single-cell RNAseq datasets;each dataset is processed one by one

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
14
15
Process_scRNAseq(
  inputs,
  names = NULL,
  nHVGs = 1000,
  nPCs = 10,
  sf = 10000,
  mincounts = 500,
  mingenes = 200,
  maxmito = 0.2,
  PCind = 1,
  mtRNA = "^mt-|^MT-",
  rRNA = "^Rp[sl][[:digit:]]|^RP[SL][[:digit:]]",
  organism = "mmusculus",
  chunk.size = NULL
)

Arguments

inputs

a string vector of file or path names, or a list of SingleCellExperiment or Seurat v3 objects;
inputs can be a string vector of file names (or a URL starting http://, file://, etc.) of gene-by-cell count matrices, the rowname should be gene symbol; each file should be regular delimited file; Compressed files ending .gz and .bz2 are supported.
inputs can be a string vector of path names, each of which contains the barcodes.tsv.gz, features.tsv.gz, and matrix.mtx.gz provided by 10X from CellRanger >=3.0
inputs can also be a list of SingleCellExperiment or Seurat v3 objects

names

string vector; giving the names of single-cell RNAseq datasets (default: NULL); names should have the same length of inputs; if NULL, the names are S1, S2...

nHVGs

integer; the number of highly variable genes (default: 1000)

nPCs

integer; the number of principal components (default: 10)

sf

integer; Scale factor to normalize the data (default: 10000)

mincounts

integer; the cutoff of filtering the cell if the total number of counts in the cell less than the mincounts (default:500)

mingenes

integer; the cutoff of filtering the cell if the total number of genes detected in the cell less than the mingenes (default: 200)

maxmito

float; the cutoff of filtering the cell if the percentage of mtRNA reads in the cell larger than the minmito (default: 0.2)

PCind

integer; which principal component for exploring biological featues (default: 1; the first principal component will be used to find genes highly correlated with PCA 1); PCind should be less than nPC

mtRNA

string; the pattern of gene names for mitochondrial encoded RNAs ; (default: "^mt-|^MT-", the default is mtRNA gene names in human or mouse); If not human or mouse, give the gene name pattern of mtRNA

rRNA

string; the pattern of gene names for ribosomal proteins; (default: "^Rp[sl][[:digit:]]|^RP[SL][[:digit:]]", the default is ribosomal protein gene names in human or mouse); If not human or mouse, give the gene name pattern of ribosomal proteins

organism

string; the organism of single cell RNAseq datasets; if supported by WebGestaltR, functional enrichment analysis will be performed (defeault: mmusculus)

chunk.size

NULL or integer; default is NULL, suggesting data will be loaded into memory at one time, otherwise, the data will be loaded into memory by chunks with chunk.size

Value

a list of SingleCellExperiment objects ;
each SingleCellExperiment object containing metadata for one single cell RNAseq dataset;
each SingleCellExperiment object containing several slots:

See Also

Combine_scRNAseq , generateReport

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
13
library(scRNABatchQC)
sces<-Process_scRNAseq(inputs=c("https://github.com/liuqivandy/scRNABatchQC/raw/master/bioplar1.csv.gz","https://github.com/liuqivandy/scRNABatchQC/raw/master/bioplar5.csv.gz"))
names(sces)
class(sces[[1]])
head(sces[[1]]@rowRanges@elementMetadata)
head(sces[[2]]@rowRanges@elementMetadata)
head(colData(sces[[1]]))
sces[[1]]@metadata$rawmeta$ngenes
head(sces[[1]]@metadata$rawmeta$CellData)
head(sces[[1]]@metadata$pc1Pathway)
plotDensity(sces,"total_counts")
plotVarianceTrend(sces)
plotPCPathways(sces)

liuqivandy/scRNABatchQC documentation built on March 24, 2021, 11:01 p.m.