compare_genesets: Compare two lists of gene sets

View source: R/compare_gene_sets.R

compare_genesetsR Documentation

Compare two lists of gene sets

Description

This function compares two lists of gene sets using either the Jaccard index or the hypergeometric test.

Usage

compare_genesets(
  set_1 = NULL,
  set_2 = NULL,
  stat = c("jaccard", "hypergeom", "intersection", "union", "size_set_1", "size_set_2",
    "diff_set_1", "diff_set_2"),
  background = NULL
)

Arguments

set_1

A list containing gene sets to be compared.

set_2

A list containing gene sets to be compared.

stat

The statistics to be computed between gene sets. It can be either "jaccard", "hypergeom", "intersection" "size_set_1", "size_set_2", "diff_set_1" (specific to set_1), "diff_set_2" (specific to set_2). The background is taken into account. Note that hypergeometric tests check for enrichment.

background

The background (universe) to consider. Default to the non-redundant list of elements merged from set_1 and set2. You may provide a vector with all genes of the genome for instance.

Details

The Jaccard index is a measure of similarity between two sets defined as the size of the intersection divided by the size of the union of the sets. The hypergeometric test is used to determine whether the overlap between two sets is more significant than expected by chance. The 'intersection' method, simply computes the size of the intersection between a and b. The "union", "size_set_1", "size_set_2", "diff_set_1" and "diff_set_2" compute the union of the two sets, the size of gene sets from set_1, the size of gene sets from set_2, the gene that are specific to set_1, the gene that are specific to set_2, respectively.

Value

A matrix of comparison results where each row corresponds to a gene set in set_1, and each column corresponds to a gene set in set_2.

Examples

set.seed(123)
set_1 <- list(letters[1:10], letters[11:20])
x <- sample(letters[1:20])
set_2 <- list(x[1:5], x[6:20])
comp <- compare_genesets(set_1, set_2, stat = "jaccard")
comp <- compare_genesets(set_1, set_2, stat = "hypergeom")


dputhier/dbfmcl documentation built on May 31, 2024, 8:57 a.m.