GENIE3: GENIE3

Description Usage Arguments Value Examples

Description

GENIE3 Infers a gene regulatory network (in the form of a weighted adjacency matrix) from expression data, using ensembles of regression trees.

Usage

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
GENIE3(
  exprMatrix,
  regulators = NULL,
  targets = NULL,
  treeMethod = "RF",
  K = "sqrt",
  nTrees = 1000,
  nCores = 1,
  verbose = FALSE
)

Arguments

exprMatrix

Expression matrix (genes x samples). Every row is a gene, every column is a sample. The expression matrix can also be provided as one of the Bioconductor classes:

  • ExpressionSet: The matrix will be obtained through exprs(exprMatrix)

  • RangedSummarizedExperiment: The matrix will be obtained through assay(exprMatrix), wich will extract the first assay (usually the counts)

regulators

Subset of genes used as candidate regulators. Must be either a vector of indices, e.g. c(1,5,6,7), or a vector of gene names, e.g. c("at_12377", "at_10912"). The default value NULL means that all the genes are used as candidate regulators.

targets

Subset of genes to which potential regulators will be calculated. Must be either a vector of indices, e.g. c(1,5,6,7), or a vector of gene names, e.g. c("at_12377", "at_10912"). If NULL (default), regulators will be calculated for all genes in the input matrix.

treeMethod

Tree-based method used. Must be either "RF" for Random Forests (default) or "ET" for Extra-Trees.

K

Number of candidate regulators randomly selected at each tree node (for the determination of the best split). Must be either "sqrt" for the square root of the total number of candidate regulators (default), "all" for the total number of candidate regulators, or a stricly positive integer.

nTrees

Number of trees in an ensemble for each target gene. Default: 1000.

nCores

Number of cores to use for parallel computing. Default: 1.

verbose

If set to TRUE, a feedback on the progress of the calculations is given. Default: FALSE.

Value

Weighted adjacency matrix of inferred network. Element w_ij (row i, column j) gives the importance of the link from regulatory gene i to target gene j.

Examples

 1
 2
 3
 4
 5
 6
 7
 8
 9
10
11
12
## Generate fake expression matrix
exprMatrix <- matrix(sample(1:10, 100, replace=TRUE), nrow=20)
rownames(exprMatrix) <- paste("Gene", 1:20, sep="")
colnames(exprMatrix) <- paste("Sample", 1:5, sep="")

## Run GENIE3
set.seed(123) # For reproducibility of results
weightMatrix <- GENIE3(exprMatrix, regulators=paste("Gene", 1:5, sep=""))

## Get ranking of edges
linkList <- getLinkList(weightMatrix)
head(linkList)

Example output

  regulatoryGene targetGene    weight
1          Gene3     Gene18 0.6187213
2          Gene4      Gene1 0.5933340
3          Gene1     Gene15 0.5584180
4          Gene3     Gene11 0.5402384
5          Gene2     Gene16 0.4975430
6          Gene1     Gene13 0.4925515

GENIE3 documentation built on Nov. 8, 2020, 5:52 p.m.