geneSymbolsForDataset: geneSymbolsForDataset


View source: R/GeneSurvey.R

Description

Take a vector of purported gene names and a vector of valid gene names and return a list containing matched and unmatched names. The matched names may also be corrected, such as when the RNASeq data adds a pipe and Entrez id to a gene symbol.

Usage

geneSymbolsForDataset(theRequestVector, theGeneSymbolVector)

Arguments

theRequestVector

Vector of purported gene symbols.

theGeneSymbolVector

Vector of valid gene symbols.

Details

Take a vector of purported gene symbols and a vector of valid gene names and return a list containing matched and unmatched names. The matched names may also be corrected, such as when the RNASeq data adds a pipe and Entrez id to a gene symbol.

For example, a purported gene symbol of TP53 is returned as simply TP53, a direct match. A purported gene symbol of FBXW7 is returned as matched to FBXW7|55294: the TCGA Standardized Data from MD Anderson keeps the Entrez id because the FBXW7 gene symbol was not in the translation file when this data set was processed. A purported gene symbol of FOO is returned as unmatched, since (at present) there is no such gene symbol.
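
The matching behavior described above can be approximated with a short sketch. This is not the GeneSurvey source. The function name matchGeneSymbolsSketch, the sample vector validSymbols, and the BRCA1 entry are hypothetical and used only for illustration. The sketch assumes that a corrected name is simply the valid entry whose text before the first pipe equals the requested symbol.

```r
# Hypothetical re-implementation for illustration only; not the package's code.
matchGeneSymbolsSketch <- function(theRequestVector, theGeneSymbolVector) {
  # Drop everything from the first pipe onward, e.g. "FBXW7|55294" -> "FBXW7".
  baseSymbols <- sub("\\|.*$", "", theGeneSymbolVector)
  # For each request, locate the first valid entry with the same base symbol.
  idx <- match(theRequestVector, baseSymbols)
  list(
    # Return the valid entry as stored, so the Entrez id suffix is preserved.
    matched = theGeneSymbolVector[idx[!is.na(idx)]],
    # Requests with no corresponding valid entry.
    unmatched = theRequestVector[is.na(idx)]
  )
}

# Assumed sample of valid symbols, mirroring the cases described in the text.
validSymbols <- c("TP53", "FBXW7|55294", "BRCA1")
result <- matchGeneSymbolsSketch(c("TP53", "FBXW7", "FOO"), validSymbols)
print(result$matched)    # "TP53" "FBXW7|55294"
print(result$unmatched)  # "FOO"
```

Under these assumptions, TP53 matches directly, FBXW7 resolves to the piped form, and FOO ends up in unmatched. The real function may apply additional rules that this sketch does not cover.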

Value

A list with two named elements, matched and unmatched. The matched element gives the valid/known genes for the given gene list, while unmatched lists the genes for which no match was found.

Examples

#See examples in tests directory in the install.

MD-Anderson-Bioinformatics/GeneSurvey documentation built on May 7, 2019, 2:04 p.m.