get_sig_genes: Output text files with lists of significant genes

Description Usage Arguments Details

View source: R/get_sig_genes.R

Description

This function retrieves lists of gene identifiers based on the results of a differential expression analysis. It provides options to retrieve based on FDR and logFC thresholds, positive and negative logFC, and ranked lists.

Usage

1
2
3
4
5
6
get_sig_genes(
  topGenes,
  method=,
  adj_p_cut=0.01, fc_cut=log2(1.5),
  p_col="P.Value", adj_p_col="adj.P.Val", fc_col="logFC",
  threshold_col=NULL)

Arguments

topGenes

a data frame, typically the output of a call to topTable. Must contain genes, log2 fold-change, and adjusted p-values.

method

character, specifying the type of gene lists to return. Recognized values are "ranked_list", "combined", "directional", "up", "down". "ranked_list" returns a vector of all genes in ranked order by p-value (from smallest to largest). "combined" outputs a list of all significant genes meeting the threshold. "up" and "down" return lists of significant genes meeting the threshold that are up- or down-regulated, respectively. "directional" returns a list containing two vectors, one "up" and one "down", which are identical to those returned by the "up" and "down" methods. Partial matches are allowed.

adj_p_cut

numeric, the cutoff for adjusted p-value. Genes with adjusted p-values greater than or equal to this value are not included in the result. Defaults to 0.01.

fc_cut

numeric, the absolute value cutoff for log2 fold change. Genes with absolute value log2-FC less than or equal to this value are not included in the result. Defaults to log2(1.5). To include all genes, set to 0. To ignore logFC, set to NULL.

p_col

name or number of the column in topGenes on which to sort. Generally the raw p-values, as adjusted p-values are often homogenized across a range of raw p-values. Defaults to "P.Value", which corresponds to the output from topTable. To include all genes, set to >1.

adj_p_col

name or number of the column in topGenes containing the p-values to compare to p_cut. Defaults to "adj.P.Val", which corresponds to the output from topTable.

fc_col

name or number of the column in topGenes containing the fold-change values to compare to fc_cut. Defaults to "logFC", which corresponds to the output from topTable.

threshold_col

name or number of the column in topGenes containing the logical values indicating which genes meet thresholds. This is an alternate way to determine signficance of genes. If specified, p_cut and fc_cut are ignored.

Details

This function writes out lists of genes to text files. By default, it outputs a list ranked by p-value, lists of genes significant based on FDR and logFC thresholds (all, up, and down).


BenaroyaResearch/limmaTools documentation built on Dec. 17, 2021, 10:49 a.m.