SCATE: Perform SCATE

Description Usage Arguments Details Value Author(s) Examples

View source: R/SCATE.R

Description

Single-cell ATAC-seq signal Extration and Enhancement

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
SCATE(
  satac,
  type = "reads",
  peakOverlapMethod = "full",
  genome = "hg19",
  cluster = NULL,
  clusterid = NULL,
  clunum = NULL,
  datapath = NULL,
  verbose = TRUE,
  ncores = 1
)

Arguments

satac

If type='reads', satac should be a GRanges object or list of GRanges object of scATAC-seq reads. Each element corresponds to one single cell. The GRanges should be the middle point of the reads with length of 1 base pair. Use 'satacprocess' to preprocess raw reads. If type='peaks', satac should be a data frame or list of data frames of scATAC-seq peaks. For each data frame, first column is chromsome name, second column is start site, third column is end site, and fourth column is the number of reads of the peak.

type

Character variable of either 'reads' or 'peaks'.

peakOverlapMethod

Character variable of either 'full' or 'middle'. Only effective when type = 'peaks'. If peakOverlapMethod='full', then the full range of the peak will be used to find overlap with bins, and all bins overlapping with this peak will be assigned the read counts of this peak. If peakOverlapMethod='middle', only the middle base pair of the peak will be used to find overlap with bins.

genome

Character variable of either "hg19" or "mm10". Default is 'hg19'.

cluster

Numeric vector specifying the cluster of cells. Needs to be named and include all cells in satac. If NULL, SCATE will be run on all cells in satac.

clusterid

Numeric number specifying the single cluster to run SCATE. If NULL SCATE will be run on all clusters. Ignored if cluster is NULL. The cluster id must be included in variable 'cluster'.

clunum

Numeric value specifying number of CRE clusters. If NULL, SCATE automatically chooses number of CRE clusters.

datapath

Character variable of the path to the customized database (eg myfolder/database.rds). The database can be made using 'makedatabase' function. If not null, 'genome' is ignored.

verbose

Either TRUE or FALSE. If TRUE, progress will be displayed.

ncores

Numeric variable of number of cores to use. If NULL, the maximum number of cores is used.

Details

This function takes as input the scATAC-seq reads and generates enhanced signals. Users can either perform SCATE on clusters of cells or a single group of cells.

Value

A numeric vector or matrix of values generated by SCATE, depending on the number of clusters. The length of the vector or the number of rows of the matrix is the same as the number of bins in the genome. If a matrix, the column names indicate the cluster id.

Author(s)

Zhicheng Ji, Weiqiang Zhou, Wenpin Hou, Hongkai Ji* <whou10@jhu.edu>

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
#Reads as input, setting CRE cluster number as 156 to increase speed. Users need to set it to be NULL in real applications.
gr <- GRanges(seqnames="chr1",IRanges(start=seq_len(100)+1e6,end=seq_len(100)+1e8))
SCATE(gr,clunum=156,type='reads',genome="mm10") 
## Not run: 
peak <- data.frame(seqnames="chr1",start=seq_len(100)+1e6,end=seq_len(100)+1e8,count=1)
#Peak as input, peakOverlapMethod=full
SCATE(satac=peak,clunum=156,type='peaks',genome="mm10") 
#Peak as input, peakOverlapMethod=middle
SCATE(satac=peak,clunum=156,type='peaks',peakOverlapMethod='middle',genome="mm10") 

## End(Not run)

SCATE documentation built on Nov. 8, 2020, 5:56 p.m.