specleanr: Detecting Environmental Outliers in Data Analysis Pipelines

A framework used to detect and handle outliers during data analysis workflows. Outlier detection is a statistical concept with applications in data analysis workflows, highlighting records that are suspiciously high or low. Outlier detection in distribution models was initiated by Chapman (1991) (available at <https://www.researchgate.net/publication/332537800_Quality_control_and_validation_of_point-sourced_environmental_resource_data>), who developed the reverse jackknifing method. The concept was further developed and incorporated into different R packages, including 'flexsdm' (Velazco et al., 2022, <doi:10.1111/2041-210X.13874>) and 'biogeo' (Robertson et al., 2016 <doi:10.1111/ecog.02118>). We compiled various outlier detection methods obtained from the literature, including those elaborated in Dastjerdy et al. (2023) <doi:10.3390/geotechnics3020022> and Liu et al. (2008) <doi:10.1109/ICDM.2008.17>. In this package, we introduced the ensembling aspect, where multiple outlier detection methods are used to flag the record as either an absolute outlier. The concept can also be applied in general data analysis, as well as during the development of species distribution models.

Package details

AuthorAnthony Basooma [aut, cre] (ORCID: <https://orcid.org/0000-0002-8994-9989>), Thomas Hein [ctb, fnd, ths] (ORCID: <https://orcid.org/0000-0002-7767-4607>), Astrid Schmidt-Kloiber [ctb, fnd, dtc] (ORCID: <https://orcid.org/0000-0001-8839-5913>), Merret Buurman [ctb], Sami Domisch [ctb], Martin Tschikof [ctb], Florian Borgwardt [ctb, fnd] (ORCID: <https://orcid.org/0000-0002-8974-7834>)
MaintainerAnthony Basooma <anthony.basooma@boku.ac.at>
LicenseGPL (>= 3)
Version1.0.0
URL https://anthonybasooma.github.io/specleanr/
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("specleanr")

Try the specleanr package in your browser

Any scripts or data that you put into this service are public.

specleanr documentation built on Nov. 26, 2025, 1:07 a.m.