CENA: The Cell Niche Associations (CENA) algorithm

Description Usage Arguments Value Examples

View source: R/CENA.R

Description

CENA is a method for a joint identification of pairwise association together with the particular subset of cells in which the association is detected. In this implementation, CENA is limited to associations between the genes' expression levels (data from scRNA-sequencing) and an additional cellular meta-data of choice. Note that python3 should be installed on the machine, with the following libraries: numpy, igraph, leidenalg.

Usage

1
2
3
4
CENA(geneExpressionDataMatrix, phenotypeData, cellSpace,
  resolution_parameter, genesToRun = row.names(geneExpressionDataMatrix),
  no_cores = NULL, k1 = NULL, k2 = 10, Tw = 30,
  python_path = NULL)

Arguments

geneExpressionDataMatrix

A matrix containing the single-cell RNA-seq data. Each row corresponds to a certain gene and each column to a certain cell. The algorithm assumes the order of the cells in this scRNA-seq data matrix is the same as the order in the meta-data ('phenotypeData') and the cell-state space ('cellSpace').

phenotypeData

A vector containing the meta-data levels of each of the cells.

cellSpace

Cell coordinates matrix for the single-cell data. Each row represent a cell and each column corresponds to a specific dimension. Only 2-dim spaces are allowed.

resolution_parameter

The resolution parameter of Leiden. Cutoff as desired; Higher values provide smaller clusters.

genesToRun

A vector of genes for which associations should be inferred. The default value is all genes in the input scRNA-seq matrix. However, to reduce running times, it is recommended to use a small subset of the genes.

no_cores

The number of cores which will be used for the analysis. The defalt (NULL) is total number of cores minus 1.

k1

The K1 parameter. The default value is 1 percentage of the cells. k1 should be kept relatively small.

k2

The K2 paremeter. The default value is 10. k2 have relatively small effect on results.

Tw

The subset size cutoff (Tw). The default value is 30. Cutoff as desired; high values filter out small subsets.

python_path

The location of python on the machine. If not specified, the default python path will be selected. Note that python3 and the following libraries should be installed: numpy, igraph, leidenalg.

Value

cluster_information

Association statistics for cell subsets obtained by CENA for each of the genes (rows). Columns corresponds to the r.squared, slope, intercept and p.value of these associations.

cluster

A list containing, for each gene, the identified cell subset in which the association occurs.

parameters

The initial parameters used by CENA.

Examples

1
2
3
4
5
data(cellSpace)
data(geneExpressionDataMatrix)
data(phenotypeData)
# running CENA on 5 genes
results = CENA(geneExpressionDataMatrix, phenotypeData, cellSpace, resolution_parameter = 8, no_cores = 1)

mayalevy/CENA documentation built on Jan. 29, 2020, 4:42 p.m.