scaleNotCenter: Scale genes by root-mean-square across cells
In rliger: Linked Inference of Genomic Experimental Relationships

scaleNotCenter

R Documentation

Scale genes by root-mean-square across cells

Description

This function scales normalized gene expression data after variable genes have been selected. We do not mean-center the data before scaling in order to address the non-negativity constraint of NMF. Computation applied to each normalized dataset matrix can form the following equation:

S_{i,j}=\frac{N_{i,j}}{\sqrt{\sum_{p}^{n}\frac{N_{i,p}^2}{n-1}}}

Where N denotes the normalized matrix for an individual dataset, S is the output scaled matrix for this dataset, and n is the number of cells in this dataset. i, j denotes the specific gene and cell index, and p is the cell iterator.

Please see detailed section below for explanation on methylation dataset.

Usage

scaleNotCenter(object, ...)

## S3 method for class 'dgCMatrix'
scaleNotCenter(object, features, scaleFactor = NULL, ...)

## S3 method for class 'DelayedArray'
scaleNotCenter(
  object,
  features,
  scaleFactor = NULL,
  geneRootMeanSq = NULL,
  overwrite = FALSE,
  chunk = getOption("ligerChunkSize", 20000),
  verbose = getOption("ligerVerbose", TRUE),
  ...
)

## S3 method for class 'ligerDataset'
scaleNotCenter(
  object,
  features = NULL,
  scaleFactor = NULL,
  chunk = getOption("ligerChunkSize", 20000),
  verbose = getOption("ligerVerbose", TRUE),
  ...
)

## S3 method for class 'ligerMethDataset'
scaleNotCenter(
  object,
  features = NULL,
  verbose = getOption("ligerVerbose", TRUE),
  ...
)

## S3 method for class 'liger'
scaleNotCenter(
  object,
  useDatasets = NULL,
  features = varFeatures(object),
  verbose = getOption("ligerVerbose", TRUE),
  remove.missing = NULL,
  ...
)

## S3 method for class 'Seurat'
scaleNotCenter(
  object,
  assay = NULL,
  layer = "ligerNormData",
  save = "ligerScaleData",
  datasetVar = "orig.ident",
  features = NULL,
  ...
)

Arguments

`object`	liger object, ligerDataset object, dgCMatrix-class object, or a Seurat object.
`...`	Arguments passed to other methods. The order goes by: "liger" method calls "ligerDataset" method", which then calls "dgCMatrix" method. "Seurat" method directly calls "dgCMatrix" method.
`features`	Character, numeric or logical index that choose the variable feature to be scaled. "liger" method by default uses `varFeatures(object)`. "ligerDataset" method by default uses all features. "Seurat" method by default uses `Seurat::VariableFeatures(object)`.
`scaleFactor`	Numeric vector of scaling factor to normalize the raw counts to unit sum. This pre-calculated at liger object creation (stored as `object$nUMI` and internally specified in S3 method chains, thus is generally not needed to be specified by users.
`geneRootMeanSq`	Numeric vector of root-mean-square of unit-normalized expression for each gene. This is pre-calculated at the call of `selectBatchHVG` (stored at `featureMeta(dataset(object, "datasetName"))$rootMeanSq` and internally specified in S3 method chains, thus is generally not needed to be specified by users.
`overwrite`	Logical. When writing newly computed HDF5 array to a separate HDF5 file, whether to overwrite the existing file. Default `FALSE` raises an error when the file already exists.
`chunk`	Integer. Number of maximum number of cells in each chunk, when scaling is applied to any HDF5 based dataset. Default `20000`.
`verbose`	Logical. Whether to show information of the progress. Default `getOption("ligerVerbose")` or `TRUE` if users have not set.
`useDatasets`	A character vector of the names, a numeric or logical vector of the index of the datasets to be scaled but not centered. Default `NULL` applies to all datasets.
`remove.missing`	Deprecated. The functionality of this is covered through other parts of the whole workflow and is no long needed. Will be ignored if specified.
`assay`	Name of assay to use. Default `NULL` uses current active assay.
`layer`	For Seurat>=4.9.9, the name of layer to retrieve normalized data. Default `"ligerNormData"`. For older Seurat, always retrieve from `data` slot.
`save`	For Seurat>=4.9.9, the name of layer to store normalized data. Default `"ligerScaleData"`. For older Seurat, stored to `scale.data` slot.
`datasetVar`	Metadata variable name that stores the dataset source annotation. Default `"orig.ident"`.

Value

Updated object

dgCMatrix method - Returns scaled dgCMatrix object
ligerDataset method - Updates the scaleData and scaledUnsharedData (if unshared variable feature available) slot of the object
liger method - Updates the scaleData and scaledUnsharedData (if unshared variable feature available) slot of chosen datasets
Seurat method - Adds a named layer in chosen assay (V5), or update the scale.data slot of the chosen assay (<=V4)

Methylation dataset

Because gene body mCH proportions are negatively correlated with gene expression level in neurons, we need to reverse the direction of the methylation data before performing the integration. We do this by simply subtracting all values from the maximum methylation value. The resulting values are positively correlated with gene expression. This will only be applied to variable genes detected in prior. Please make sure that argument modal is set accordingly when running createLiger. In this way, this function can automatically detect it and take proper action. If it is not set, users can still manually have the equivalent processing done by doing scaleNotCenter(lig, useDataset = c("other", "datasets")), and then reverseMethData(lig, useDataset = c("meth", "datasets")).

Note

Since the scaling on genes is applied on a per dataset base, other scaling methods that apply to a whole concatenated matrix of multiple datasets might not be considered as equivalent alternatives, even if options like center are set to FALSE. Hence we implemented an efficient solution that works under such circumstance, provided with the Seurat S3 method.

Examples

pbmc <- selectBatchHVG(pbmc, n = 10)
pbmc <- scaleNotCenter(pbmc)

rliger documentation built on June 8, 2025, 1:56 p.m.

rliger index

Package overview README.md Data integration with LIGER

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

rliger
Linked Inference of Genomic Experimental Relationships

scaleNotCenter: Scale genes by root-mean-square across cells
In rliger: Linked Inference of Genomic Experimental Relationships

Scale genes by root-mean-square across cells

Description

Usage

Arguments

Value

Methylation dataset

Note

Examples

Related to scaleNotCenter in rliger...

R Package Documentation

Browse R Packages

We want your feedback!

rliger Linked Inference of Genomic Experimental Relationships

scaleNotCenter: Scale genes by root-mean-square across cells In rliger: Linked Inference of Genomic Experimental Relationships

Scale genes by root-mean-square across cells

Description

Usage

Arguments

Value

Methylation dataset

Note

Examples

Related to scaleNotCenter in rliger...

R Package Documentation

Browse R Packages

We want your feedback!

rliger
Linked Inference of Genomic Experimental Relationships

scaleNotCenter: Scale genes by root-mean-square across cells
In rliger: Linked Inference of Genomic Experimental Relationships