getGeneFeatures: Get simple feature vector representation of genes

Description Usage Arguments Details Value Note Author(s) References See Also Examples

View source: R/geneSimilarity.R

Description

Computes feature vectors for list of genes: Each gene is represented by a vector describing the presence/absence of all GO terms. The absence of each GO term is additionally weighted by its information content.

Usage

1
getGeneFeatures(genelist, pca=FALSE, normalization=FALSE, verbose=FALSE)

Arguments

genelist

character vector of Entrez gene IDs

pca

perform PCA on feature vectors to reduce dimensionality

normalization

scale the feature vectors to norm 1

verbose

print out additional information

Details

The PCA postprocessing determines the principal components that can explain at least 95% of the total variance in the feature space.

Value

matrix with rows being genes and columns being GO terms.

Note

The result depends on the currently set ontology ("BP","MF","CC").

Author(s)

Holger Froehlich

References

M. Mistry, P Pavlidis, Gene Ontology term overlap as a measure of gene functional similarity, BMC Bioinformatics, 9:327, 2008.

See Also

getGeneSimPrototypes, selectPrototypes, getGeneSim, getTermSim, setOntology

Examples

1
	# see selectPrototypes

Example output

Loading required package: GO.db
Loading required package: AnnotationDbi
Loading required package: stats4
Loading required package: BiocGenerics
Loading required package: parallel

Attaching package:BiocGenericsThe following objects are masked frompackage:parallel:

    clusterApply, clusterApplyLB, clusterCall, clusterEvalQ,
    clusterExport, clusterMap, parApply, parCapply, parLapply,
    parLapplyLB, parRapply, parSapply, parSapplyLB

The following objects are masked frompackage:stats:

    IQR, mad, sd, var, xtabs

The following objects are masked frompackage:base:

    anyDuplicated, append, as.data.frame, basename, cbind, colnames,
    dirname, do.call, duplicated, eval, evalq, Filter, Find, get, grep,
    grepl, intersect, is.unsorted, lapply, Map, mapply, match, mget,
    order, paste, pmax, pmax.int, pmin, pmin.int, Position, rank,
    rbind, Reduce, rownames, sapply, setdiff, sort, table, tapply,
    union, unique, unsplit, which.max, which.min

Loading required package: Biobase
Welcome to Bioconductor

    Vignettes contain introductory material; view with
    'browseVignettes()'. To cite Bioconductor, see
    'citation("Biobase")', and for packages 'citation("pkgname")'.

Loading required package: IRanges
Loading required package: S4Vectors

Attaching package:S4VectorsThe following object is masked frompackage:base:

    expand.grid


Loading required package: annotate
Loading required package: XML

GOSim documentation built on Nov. 8, 2020, 11:05 p.m.