synthesizer: Fast, Robust, and High-Quality Synthetic Data Generation with a Tuneable Privacy-Utility Trade-Off

Synthesize numeric, categorical, mixed, and time series data. Characteristics of the data, including mixed (or zero-inflated) distributions and missing data patterns, are reproduced in the synthetic data. A single parameter balances high-quality synthetic data that preserves the correlations of the original data against lower-quality but more privacy-safe synthetic data without correlations. Tuning can be done per variable or for the whole dataset.

Package details

Author: Mark van der Loo [aut, cre] (ORCID: <https://orcid.org/0000-0002-9807-4686>)
Maintainer: Mark van der Loo <mark.vanderloo@gmail.com>
License: EUPL
Version: 0.6.0
URL: https://github.com/markvanderloo/synthesizer
Package repository: CRAN
Installation: Install the latest version of this package by entering the following in R:
install.packages("synthesizer")
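
Once installed, a first run might look like the sketch below. It assumes the package exports a `synthesize()` function that takes a data frame as its first argument and returns a synthetic data frame with the same columns and, by default, the same number of rows; the built-in `iris` data set is used purely for illustration. The name and range of the privacy-utility tuning parameter are not given on this page, so the sketch leaves it at its default; see `?synthesize` for the exact argument.

```r
library(synthesizer)

# Assumption: synthesize() accepts a data frame and returns a synthetic
# data frame with the same variables. The tuning parameter is left at
# its default because its name is not documented on this page.
synth_iris <- synthesize(iris)

# Inspect the result: same column names and types as the original.
str(synth_iris)

# Compare rank correlations of the numeric columns in the original
# and the synthetic data to judge how much structure was preserved.
cor(iris[, 1:4], method = "spearman")
cor(synth_iris[, 1:4], method = "spearman")
```

Comparing the two correlation matrices is a quick way to see where a given setting sits on the trade-off: closer matrices mean higher utility, while weaker correlations in the synthetic data mean less of the original structure is exposed.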

synthesizer documentation built on Nov. 19, 2025, 1:07 a.m.