keywordfreq: KeyWordFreq


Description

Frequencies of common biology keywords appearing in search results

Usage

KeyWordFreq(searchtable, category = c("celltype", "disease", "tissue"))

Arguments

searchtable

The direct output of the function "GEOSearchTerm".

category

A character vector specifying which categories of the common biology keyword list to use. Should contain one or more of "celltype", "disease" and "tissue".

Details

This function calculates the frequency with which each common biology keyword appears in the given search table. The list of common biology keywords is compiled from http://www.atcc.org/ and contains three categories: cell types, diseases and tissues. Users can specify which categories to use. The function also returns the log fold change and the FDR from Fisher's exact test, which indicate whether each keyword appears significantly more often than its base frequency. The base frequency is defined as the number of appearances of the keyword across all samples (roughly 40000) included in the GEO database.
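The calculation described above can be illustrated with a minimal sketch in base R. It is not the package's implementation. The counts are hypothetical, and the log base, the absence of a pseudocount, the one-sided test and the layout of the 2x2 table are all assumptions made for illustration.

```r
# Hypothetical counts, for illustration only (not real GEO data).
n_search <- 200      # samples returned by the search
n_base   <- 40000    # samples in the base set ("roughly 40000" in GEO)

# Number of samples mentioning each keyword, in the search and in the base set.
counts <- data.frame(
  keyword = c("stem cell", "liver", "carcinoma"),
  search  = c(60, 5, 2),
  base    = c(1200, 900, 1500)
)

# Log fold change of the search frequency over the base frequency.
# Assumption: log base 2 and no pseudocount; the package may differ.
counts$logFC <- log2((counts$search / n_search) / (counts$base / n_base))

# One-sided Fisher's exact test per keyword: is the keyword over-represented?
# Assumption: the search results are treated as a subset of the base set.
counts$pvalue <- mapply(function(k, K) {
  tab <- matrix(c(k, n_search - k,                        # in search: with / without keyword
                  K - k, (n_base - n_search) - (K - k)),  # rest of base: with / without keyword
                nrow = 2)
  fisher.test(tab, alternative = "greater")$p.value
}, counts$search, counts$base)

# Adjust the p-values for multiple testing (Benjamini-Hochberg FDR).
counts$FDR <- p.adjust(counts$pvalue, method = "fdr")

# Most significant keywords first.
counts[order(counts$FDR), ]
```

In this sketch a keyword with a positive log fold change and a small FDR appears more often in the search results than its base frequency would suggest.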

Value

A data.frame with the frequency of each common biology keyword. First column: keyword name; second column: keyword frequency; third column: log fold change of the frequency; fourth column: FDR from Fisher's exact test.

Author(s)

Zhicheng Ji, Hongkai Ji <zji4@zji4.edu>

Examples

library(GEOsearch)
KeyWordFreq(GEOSearchTerm("Oct4 RNA-seq"))

zji90/GEOsearch documentation built on May 4, 2019, 11:23 p.m.