rpa.online: rpa.online

rpa.onlineR Documentation

rpa.online

Description

RPA-online for preprocessing very large expression data sets.

Usage

rpa.online(
  cel.path = NULL,
  cel.files = NULL,
  sets = NULL,
  cdf = NULL,
  bg.method = "rma",
  probe.parameters = list(alpha = 1, beta = 1),
  epsilon = 0.01,
  mc.cores = 1,
  verbose = TRUE,
  shuffle = TRUE,
  batch.size = 100,
  batches = NULL,
  save.batches.dir = ".",
  keep.batch.files = FALSE,
  unique.run.identifier = paste("RPA-run-id-", rnorm(1), sep = ""),
  rseed = 23,
  speedup = TRUE,
  summarize.with.affinities = FALSE
)

Arguments

cel.path

Path to CEL file directory

cel.files

List of CEL files to preprocess

sets

Probesets for which RPA will be computed

cdf

Specify an alternative CDF environment

bg.method

Specify background correction method. See bgcorrect.methods() for options.

probe.parameters

Can be used to set user-specified priors for the model parameters alpha, beta. Not used tau2.method = "var". The prior parameters alpha and beta are prior parameters for inverse Gamma distribution of probe-specific variances. Noninformative prior is obtained with alpha, beta -> 0. Not used with tau2.method 'var'. Scalar alpha and beta specify an identical inverse Gamma prior for all probes, which regularizes the solution. Can be also specified as lists, each element corresponding to one probeset. May also include quantile.basis, which should be provided at log2 domain.

epsilon

Convergence tolerance. The iteration is deemed converged when the change in all parameters is < epsilon.

mc.cores

Number of cores for parallel computation

verbose

Print progress information during computation

shuffle

Form random batches

batch.size

Batch size for online mode (rpa.online); the complete list of CEL files will be preprocessed in batches with this size using Bayesian online-updates for probe-specific parameters.

batches

User-defined CEL file batches

save.batches.dir

Output directory for temporary batch saves.

keep.batch.files

Logical. Keep (TRUE) or remove (FALSE) the batch files after preprocessing.

unique.run.identifier

Define identifier for this run for naming the temporary batch files. By default, a random id is generated.

rseed

Random seed.

speedup

Speed up computations with approximations.

summarize.with.affinities

Use affinity estimates in probe summarization step. Default: FALSE.

Details

rpa.online is used to preprocess very large expression data collections based on a Bayesian hyperparameter update procedure. Returns an expressionSet object preprocessed with RPA. Gives an estimate of the probeset-level mean parameter d of the RPA model, and returns these in an expressionSet object. The CEL files are handled in batches to obtain Bayesian updates for probe-specific hyperpriors; after sweeping through the database in batches the results are combined. The online mode is useful for preprocessing very large expression data sets where ordinary preprocessing algorithms fail, without compromises in modelling stage.

Value

List with two elements: an instance of the 'expressionSet' class and probe parameters. For probe.parameters contents, see the probe.parameters input argument.

Author(s)

Leo Lahti leo.lahti@iki.fi

References

See citation("RPA")

See Also

rpa, AffyBatch, ExpressionSet

Examples

# eset <- rpa.online(cel.file.path) 

microbiome/RPA documentation built on April 9, 2023, 10:59 a.m.