remove_outliers: Remove Outliers Based on Cluster Information

View source: R/utils.R

remove_outliers    R Documentation

Remove Outliers Based on Cluster Information

Description

The remove_outliers function removes rows that are flagged as outliers from a dataset. Outliers are identified by a specific cluster ID (typically 100) in the pandora_cluster column. The function is intended for internal use during downstream dataset analysis, where it filters out data points that were identified as outliers during clustering.

Usage

remove_outliers(dataset, settings)

Arguments

dataset

A data frame that includes clustering results, particularly a pandora_cluster column.

settings

A list of settings. Must contain the logical value datasetAnalysisRemoveOutliersDownstream. If datasetAnalysisRemoveOutliersDownstream is TRUE, outliers (rows where pandora_cluster == 100) will be removed from the dataset.

Value

A data frame with the outlier rows (pandora_cluster == 100) removed when datasetAnalysisRemoveOutliersDownstream is TRUE.
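
Examples

The filtering described above can be illustrated with a minimal base R sketch. Because remove_outliers is meant for internal use, this example does not call the package function. Instead it defines a hypothetical stand-in, remove_outliers_sketch, that mirrors the documented behavior. The toy data frame, the sample_id column, and the handling of a FALSE flag, a missing pandora_cluster column, or NA cluster values are assumptions made for illustration and may differ from the actual implementation.

```r
# Hypothetical stand-in for the documented behavior; not the package source.
remove_outliers_sketch <- function(dataset, settings) {
  # Assumption: a FALSE or missing flag, or a missing pandora_cluster
  # column, leaves the dataset unchanged.
  if (isTRUE(settings$datasetAnalysisRemoveOutliersDownstream) &&
      "pandora_cluster" %in% names(dataset)) {
    # Keep rows whose cluster ID is not the outlier marker (100).
    # Assumption: rows with an NA cluster ID are kept.
    keep <- is.na(dataset$pandora_cluster) | dataset$pandora_cluster != 100
    dataset <- dataset[keep, , drop = FALSE]
  }
  dataset
}

# Toy data: two regular clusters plus two rows marked as outliers.
toy <- data.frame(
  sample_id = paste0("s", 1:6),
  pandora_cluster = c(1, 1, 2, 100, 2, 100)
)

filtered <- remove_outliers_sketch(
  toy, list(datasetAnalysisRemoveOutliersDownstream = TRUE)
)
unchanged <- remove_outliers_sketch(
  toy, list(datasetAnalysisRemoveOutliersDownstream = FALSE)
)

nrow(filtered)   # 4 rows remain after dropping the two outliers
nrow(unchanged)  # 6 rows, since filtering is switched off
```

Using isTRUE() for the flag means that a NULL or NA setting is treated as "do not filter", which is a conservative choice for this sketch rather than a documented guarantee.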


immunaut documentation built on April 12, 2025, 1:22 a.m.