GO2list: Filter GO and KEGG database

Description Usage Arguments Details Value Examples

View source: R/littlehelpers.R

Description

Filter GO and KEGG database and transform database to list

Usage

1
2
3
4
GO2list(dbase, go.cat = NULL, rm = NULL, keep = NULL)
KEGG2list(dbase, rm = NULL, keep = NULL)
GO2offspring(x)
GO2level(x, go.level=-1, relation=c("is_a"))

Arguments

dbase

A datastructure storing identifieres of GO/KEGG terms and assigned genes. Can be one of

database

usually of class ‘ProbeGo3AnnDbBimap’ (as defined in package “AnnotationDbi”)

named list

with keys being the identifiers and values being genes

dataframe

with first column being the identifiers and second column being genes. Additional columns are ignored.

x

a list with keys being the identifiers and values being genes (e.g. output of GO2list)

go.cat

GO category ("MF", "BP", "CC") that should be returned and filtered

go.level

Level in the DAG of GO terms. Defaults to “-1” for pass through without modification. Otherwise: a positive integer giving the level at which GO terms should be grouped together.

rm

remove these terms

keep

keep only these terms

relation

relationships in GO hierarchy that should be considered. Defaults to “is_a”

Details

The settings for “rm” and “keep” can be combined, allowing for efficient reduction of the number of GO terms and KEGG pathways, respectively.

Providing a named list instead of a database can be useful for non-model organisms, where only a draft Blast2GO-annotation is available. In this case, the names of the list are the GO terms (or KEGG pathways) and the content of each list item is a character vector with tag-ids.

The function GO2offspring does the same as the databaseGO2ALLPROBES function does (e.g. hgu133plus2GO2ALLPROBES). I.e. instead of representing only features (probe sets, genes, ...) assigned to the GO terms directly, it also contains all features assigned to all children (offsprings).

The function GO2level groups GO terms together at a more general level to simplify data interpretation and speed up runtime. This function works according to the level option provided by DAVID, but the number of levels is not restricted.

Value

A named list with each slot containing the ids for the term or pathway.

Examples

1
2
3
library(hgu133plus2.db)
x <- GO2list(dbase=hgu133plus2GO2PROBE, go.cat="CC",
	rm=c("GO:0000139", "GO:0000790", "GO:0005730", "GO:0005739"))

geecc documentation built on April 28, 2020, 8:19 p.m.