Linnorm.Norm: Linnorm Normalization Function

Description Usage Arguments Details Value Examples

Description

This function performs batch effect and library size difference normalization on the input dataset.

Usage

1
2
3
4
Linnorm.Norm(datamatrix, RowSamples = FALSE, spikein = NULL,
  spikein_log2FC = NULL, showinfo = FALSE, output = "XPM",
  minNonZeroPortion = 0.3, BE_F_p = 0.3173, BE_F_LC_Genes = "Auto",
  BE_F_HC_Genes = 0.01, BE_strength = 0.5, max_F_LC = 0.75)

Arguments

datamatrix

The matrix or data frame that contains your dataset. Each row is a feature (or Gene) and each column is a sample (or replicate). Raw Counts, CPM, RPKM, FPKM or TPM are supported. Undefined values such as NA are not supported. It is not compatible with log transformed datasets.

RowSamples

Logical. In the datamatrix, if each row is a sample and each row is a feature, set this to TRUE so that you don't need to transpose it. Linnorm works slightly faster with this argument set to TRUE, but it should be negligable for smaller datasets. Defaults to FALSE.

spikein

character vector. Names of the spike-in genes in the datamatrix. Defaults to NULL.

spikein_log2FC

Numeric vector. Log 2 fold change of the spike-in genes. Defaults to NULL.

showinfo

Logical. Show algorithm running information. Defaults to FALSE.

output

character. "Raw" or "XPM". Output's total count will be approximately the median of the inputs' when set to "Raw". Output CPM (if input is raw counts or CPM) or TPM (if input is RPKM FPKM or TPM) when set to "XPM".

minNonZeroPortion

Double >=0, <= 1. Minimum non-Zero Portion Threshold. Genes not satisfying this threshold will be removed. For exmaple, if set to 0.75, genes without at least 75 percent of the samples being non-zero will be removed. Defaults to 0.75.

BE_F_p

Double >=0, <= 1. Filter genes with standard deviation and skewness less than this p value before applying Linnorm's batch effect normalization algorithm. Defaults to 0.3173.

BE_F_LC_Genes

Double >= 0.01, <= 0.95 or Character "Auto". Filter this portion of the lowest expressing genes before applying Linnorm's batch effect normalization algorithm. It can be determined automatically by setting to "Auto". Defaults to "Auto".

BE_F_HC_Genes

Double >=0, <= 1. Filter this portion of the highest expressing genes before applying Linnorm's batch effect normalization algorithm. Defaults to 0.01.

BE_strength

Double >0, <= 1. How strongly should Linnorm normalize batch effects? Defaults to 0.5.

max_F_LC

Double >=0, <= 0.95. When L_F_LC or B_F_LC is set to auto, this is the maximum threshold that Linnorm would assign. Defaults to 0.75.

Details

This function normalizes the input dataset using the Linnorm algorithm.

Value

This function returns a normalized data matrix.

Examples

1
2
3
4
#Obtain example matrix:
data(LIHC)
#Normalization:
normalizedExp <- Linnorm(LIHC)

Linnorm documentation built on Nov. 8, 2020, 6:48 p.m.