gene_filter | R Documentation |
This function filters out genes that show a low heterogeneity, as measured by Shannon's entropy.
## S4 method for signature 'matrix'
gene_filter(
x,
from = min(x, na.rm = TRUE),
to = max(x, na.rm = TRUE),
nBins = 20,
heterogeneity_threshold = 0.1
)
## S4 method for signature 'SummarizedExperiment'
gene_filter(
x,
from = min(assay(x, awst_values), na.rm = TRUE),
to = max(assay(x, awst_values), na.rm = TRUE),
nBins = 20,
heterogeneity_threshold = 0.1,
awst_values = "awst"
)
x |
a matrix of transformed gene expression counts (typically the
results of |
from |
the minimum value from which to start binning data. |
to |
the maximum value for the binning of the data. |
nBins |
the number of bins. |
heterogeneity_threshold |
the trheshold used for the filtering. |
awst_values |
integer scalar or string indicating the assay that contains the awst-transformed values to use as input. |
Shannon's entropy is computed on the categorized data after AWST transformation. Those genes that show a lower entropy than the predefined threshold are deemed to carry too low information to be useful for the classification of the samples, and are hence removed.
if 'x' is a matrix, it returns a filtered matrix. If 'x' is a 'SummarizedExperiment', it returns a filtered 'SummarizedExperiment'
matrix
: the input is a matrix of awst-transformed values.
SummarizedExperiment
: the input is a SummarizedExperiment with
awst-transformed values in one of its assays.
Risso and Pagnotta (2019). Within-sample standardization and asymmetric winsorization lead to accurate classification of RNA-seq expression profiles. Manuscript in preparation.
set.seed(222)
x <- matrix(rpois(75, lambda=5), ncol=5, nrow=15)
a <- awst(x)
gene_filter(a)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.