formatGOdb.curated: Generates new GO database from curated evidence-codes only...

View source: R/Utilities.R

formatGOdb.curatedR Documentation

Generates new GO database from curated evidence-codes only for functional enrichment using run.topGO.meta.

Description

Generates new GO database from curated evidence-codes only for functional enrichment using run.topGO.meta.

Usage

formatGOdb.curated(url, organism = "Pf", plasmoDB_version = "")

Arguments

url

url or filepath to a .gaf or gz-compressed gaf file. Note that zip archives won't work directly with this function. Zip files need to be downloaded and unzipped, then the 'url' argument should be updated to the local unzipped file path.

organism

optional string to include in your output filename. Defaults to "Pf".

plasmoDB_version

= optional string to include the PlasmoDB release version in your output filename (e.g. "v66").

Details

Outputs a dated file named "*_GOdb.out" to the ./Routput folder. ./Routput will be created if it doesn't already exist.

You will need to run topGO::readmappings() on the file generated with formatGOdb to use it as the geneID2GO parameter.

Example: geneID2GO <- topGO::readmappings("./Routput/Pf_Mar022022_GOdb.out")

Notes on gaf.gz format

The .gaf or .gaf.gz file should be in tabular format with 17 columns, one row for each GO term associated with a geneID. No formatting is necessary when using the provided url.

A GOdb from Plasmo DB's P. falciparum annotation (from PlasmoDB release 66; accessed Sep 28, 2023) pre-formatted using this function and ready for run.topGO.meta is included in this package (Pfal_geneID2GO_curated).

See Also

formatGOdb()


oberstal/pfGO documentation built on April 22, 2024, 7:15 a.m.