normalize: Normalization of 'AnyHermesData' Objects
In insightsengineering/hermes: Preprocessing, analyzing, and reporting of RNA-seq data

normalize,AnyHermesData-method

R Documentation

Normalization of `AnyHermesData` Objects

Description

The normalize() method is normalizing the input AnyHermesData according to one or more specified normalization methods. The results are saved as additional assays in the object.

Possible normalization methods (which are implemented with separate helper functions):

cpm: Counts per Million (CPM). Separately by sample, the original counts of the genes are divided by the library size of this sample, and multiplied by one million. This is the appropriate normalization for between-sample comparisons.
rpkm: Reads per Kilobase of transcript per Million reads mapped (RPKM). Each gene count is divided by the gene size (in kilobases) and then again divided by the library sizes of each sample (in millions). This allows for within-sample comparisons, as it takes into account the gene sizes - longer genes will always have more counts than shorter genes.
tpm: Transcripts per Million (TPM). This addresses the problem of RPKM being inconsistent across samples (which can be seen that the sum of all RPKM values will vary from sample to sample). Therefore here we divide the RPKM by the sum of all RPKM values for each sample, and multiply by one million.
voom: VOOM normalization. This is essentially just a slight variation of CPM where a prior_count of 0.5 is combined with lib_sizes increased by 1 for each sample. Note that this is not required for the corresponding differential expression analysis, but just provided as a complementary experimental normalization approach here.
vst: Variance stabilizing transformation. This is to transform the normalized count data for all genes into approximately homoskedastic values (having constant variance).
rlog: The transformation to the log2 scale values with approximately homoskedastic values.

Usage

## S4 method for signature 'AnyHermesData'
normalize(
  object,
  methods = c("cpm", "rpkm", "tpm", "voom", "vst"),
  control = control_normalize(),
  ...
)

h_cpm(object, control = control_normalize())

h_rpkm(object, control = control_normalize())

h_tpm(object, control = control_normalize())

h_voom(object, control = control_normalize())

h_vst(object, control = control_normalize())

h_rlog(object, control = control_normalize())

Arguments

`object`	(`AnyHermesData`) object to normalize.
`methods`	(`character`) which normalization methods to use, see details.
`control`	(named `list`) settings produced by `control_normalize()`.
`...`	not used.

Value

The AnyHermesData object with additional assays containing the normalized counts. The control is saved in the metadata of the object for future reference.

Functions

h_cpm(): calculates the Counts per Million (CPM) normalized counts.
h_rpkm(): calculates the Reads per Kilobase per Million (RPKM) normalized counts.
h_tpm(): calculates the Transcripts per Million (TPM) normalized counts.
h_voom(): calculates the VOOM normalized counts.
h_vst(): variance stabilizing transformation (vst) from DESeq2 package.
h_rlog(): regularized log transformation (rlog) from DESeq2 package.

Examples

a <- hermes_data

# By default, log values are used with a prior count of 1 added to original counts.
result <- normalize(a)
assayNames(result)
tpm <- assay(result, "tpm")
tpm[1:3, 1:3]

# We can also work on original scale.
result_orig <- normalize(a, control = control_normalize(log = FALSE))
tpm_orig <- assay(result_orig, "tpm")
tpm_orig[1:3, 1:3]

# Separate calculation of the CPM normalized counts.
counts_cpm <- h_cpm(a)
str(counts_cpm)

# Separate calculation of the RPKM normalized counts.
counts_rpkm <- h_rpkm(a)
str(counts_rpkm)

# Separate calculation of the TPM normalized counts.
counts_tpm <- h_tpm(a)
str(counts_tpm)

# Separate calculation of the VOOM normalized counts.
counts_voom <- h_voom(a)
str(counts_voom)

# Separate calculation of the vst transformation.
counts_vst <- h_vst(a)
str(counts_vst)

# Separate calculation of the rlog transformation.
counts_rlog <- h_rlog(a)
str(counts_rlog)

insightsengineering/hermes documentation built on Jan. 25, 2025, 6:21 a.m.

insightsengineering/hermes index

README.md

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

insightsengineering/hermes
Preprocessing, analyzing, and reporting of RNA-seq data

normalize: Normalization of 'AnyHermesData' Objects
In insightsengineering/hermes: Preprocessing, analyzing, and reporting of RNA-seq data

Normalization of `AnyHermesData` Objects

Description

Usage

Arguments

Value

Functions

See Also

Examples

Related to normalize in insightsengineering/hermes...

R Package Documentation

Browse R Packages

We want your feedback!

insightsengineering/hermes Preprocessing, analyzing, and reporting of RNA-seq data

normalize: Normalization of 'AnyHermesData' Objects In insightsengineering/hermes: Preprocessing, analyzing, and reporting of RNA-seq data

Normalization of AnyHermesData Objects

Description

Usage

Arguments

Value

Functions

See Also

Examples

Related to normalize in insightsengineering/hermes...

R Package Documentation

Browse R Packages

We want your feedback!

insightsengineering/hermes
Preprocessing, analyzing, and reporting of RNA-seq data

normalize: Normalization of 'AnyHermesData' Objects
In insightsengineering/hermes: Preprocessing, analyzing, and reporting of RNA-seq data

Normalization of `AnyHermesData` Objects