bioLeak: Leakage-Safe Modeling and Auditing for Genomic and Clinical Data

Prevents and detects information leakage in biomedical machine learning. Provides leakage-resistant split policies (subject-grouped, batch-blocked, study leave-out, time-ordered), guarded preprocessing (train-only imputation, normalization, filtering, feature selection), cross-validated fitting with common learners, permutation-gap auditing, batch and fold association tests, and duplicate detection.

Package details

AuthorSelcuk Korkmaz [aut, cre] (ORCID: <https://orcid.org/0000-0003-4632-6850>)
Bioconductor views Classification GeneExpression QualityControl Regression Reproducibility Software Survival Workflow
MaintainerSelcuk Korkmaz <selcukorkmaz@gmail.com>
LicenseMIT + file LICENSE
Version0.3.0
URL https://github.com/selcukorkmaz/bioLeak
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("bioLeak")

Try the bioLeak package in your browser

Any scripts or data that you put into this service are public.

bioLeak documentation built on March 6, 2026, 1:06 a.m.