sccomp_remove_outliers: sccomp_remove_outliers main
In stemangiola/sccomp: Tests differences in cell-type proportion for single-cell data, robust to outliers

sccomp_remove_outliers

R Documentation

sccomp_remove_outliers main

Description

The sccomp_remove_outliers function takes as input a table of cell counts with columns for cell-group identifier, sample identifier, integer count, and factors (continuous or discrete). The user can define a linear model using an input R formula, where the first factor is the factor of interest. Alternatively, sccomp accepts single-cell data containers (e.g., Seurat, SingleCellExperiment, cell metadata, or group-size) and derives the count data from cell metadata.

Usage

sccomp_remove_outliers(
  .estimate,
  percent_false_positive = 5,
  cores = detectCores(),
  inference_method = attr(.estimate, "inference_method"),
  output_directory = "sccomp_draws_files",
  verbose = TRUE,
  mcmc_seed = sample(1e+05, 1),
  max_sampling_iterations = 20000,
  enable_loo = FALSE,
  sig_figs = 9,
  approximate_posterior_inference = NULL,
  variational_inference = NULL,
  ...
)

Arguments

`.estimate`	A tibble including a cell_group name column, sample name column, read counts column (optional depending on the input class), and factor columns.
`percent_false_positive`	A real number between 0 and 100 (not inclusive), used to identify outliers with a specific false positive rate.
`cores`	Integer, the number of cores to be used for parallel calculations.
`inference_method`	Character string specifying the inference method to use ('pathfinder', 'hmc', or 'variational').
`output_directory`	A character string specifying the output directory for Stan draws.
`verbose`	Logical, whether to print progression details.
`mcmc_seed`	Integer, used for Markov-chain Monte Carlo reproducibility. By default, a random number is sampled from 1 to 100000 (`sample(1e+05, 1)`).
`max_sampling_iterations`	Integer, limits the maximum number of iterations in case a large dataset is used, to limit computation time.
`enable_loo`	Logical, whether to enable model comparison using the R package LOO. This is useful for comparing fits between models, similar to ANOVA.
`sig_figs`	Number of significant figures to use for Stan model output. Default is 9.
`approximate_posterior_inference`	DEPRECATED, use the `variational_inference` argument.
`variational_inference`	DEPRECATED. Logical, whether to use variational Bayes for posterior inference, which is faster. Setting this argument to `FALSE` runs full Bayesian (Hamiltonian Monte Carlo) inference, which is slower but the gold standard.
`...`	Additional arguments passed to the `cmdstanr::sample` function.
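
The default for `mcmc_seed` shown in Usage is `sample(1e+05, 1)`, so a fresh seed is drawn on every call unless one is supplied. The following is a minimal base-R sketch of what that default expression produces and of pinning a seed for repeatable runs. The value `42L` and the variable names are illustrative assumptions, and the final call is left as a comment because it needs a fitted `estimate` and a working CmdStan installation.

```r
# Reproduce the documented default expression for `mcmc_seed` many times.
set.seed(1)  # only so that this illustration itself is repeatable
default_seeds <- replicate(1000, sample(1e+05, 1))

# Each draw is a single integer between 1 and 100000.
seed_range <- range(default_seeds)
stopifnot(seed_range[1] >= 1, seed_range[2] <= 1e+05)

# To make a run repeatable, choose a seed once and pass it explicitly.
my_seed <- 42L  # hypothetical value, any integer can be used

# Sketch of the call (not run here; `estimate` comes from sccomp_estimate()):
# estimate |> sccomp_remove_outliers(cores = 1, mcmc_seed = my_seed)
```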

Value

A tibble (tbl), with the following columns:

cell_group - The cell groups being tested.
parameter - The parameter being estimated from the design matrix described by the input formula_composition and formula_variability.
factor - The covariate factor in the formula, if applicable (e.g., not present for Intercept or contrasts).
c_lower - Lower (2.5%) quantile of the posterior distribution for a composition (c) parameter.
c_effect - Mean of the posterior distribution for a composition (c) parameter.
c_upper - Upper (97.5%) quantile of the posterior distribution for a composition (c) parameter.
c_n_eff - Effective sample size, the number of independent draws in the sample. The higher, the better.
c_R_k_hat - R statistic, a measure of chain equilibrium, should be within 0.05 of 1.0.
v_lower - Lower (2.5%) quantile of the posterior distribution for a variability (v) parameter.
v_effect - Mean of the posterior distribution for a variability (v) parameter.
v_upper - Upper (97.5%) quantile of the posterior distribution for a variability (v) parameter.
v_n_eff - Effective sample size for a variability (v) parameter.
v_R_k_hat - R statistic for a variability (v) parameter, a measure of chain equilibrium.
count_data - Nested input count data.
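
As a rough illustration of how the columns above might be read together, the sketch below builds a small mock table with a subset of the documented column names and keeps rows whose `c_R_k_hat` is within 0.05 of 1.0 and whose 2.5% to 97.5% interval excludes zero. All values, cell-group labels, and the parameter name are invented for the example, and the interval rule is a simple heuristic assumed here, not the test that sccomp itself applies.

```r
# Hypothetical stand-in for the returned table; every value below is invented.
result <- data.frame(
  cell_group = c("B cell", "T cell", "NK cell"),
  parameter  = c("typeB", "typeB", "typeB"),  # hypothetical parameter name
  c_lower    = c(0.20, -0.30, 0.10),
  c_effect   = c(0.55,  0.05, 0.40),
  c_upper    = c(0.90,  0.40, 0.70),
  c_R_k_hat  = c(1.01,  1.00, 1.12)
)

# Convergence check described above: R statistic within 0.05 of 1.0.
converged <- abs(result$c_R_k_hat - 1) <= 0.05

# Assumed heuristic: the 2.5% to 97.5% interval lies entirely on one side of zero.
excludes_zero <- result$c_lower > 0 | result$c_upper < 0

# Rows that pass both checks.
candidates <- result[converged & excludes_zero, ]
print(candidates)
```

With real output the same filters can be applied directly to the tibble returned by the function; the `v_` columns can be screened the same way.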

Examples


print("cmdstanr is needed to run this example.")
# Note: Before running the example, ensure that the 'cmdstanr' package is installed:
# install.packages("cmdstanr", repos = c("https://stan-dev.r-universe.dev/", getOption("repos")))

if (instantiate::stan_cmdstan_exists()) {
  data("counts_obj")

  estimate = sccomp_estimate(
    counts_obj,
    ~ type,
    ~1,
    sample,
    cell_group,
    count,
    cores = 1
  ) |>
    sccomp_remove_outliers(cores = 1)
}

stemangiola/sccomp documentation built on June 1, 2025, 7:03 p.m.
