synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

A tool for producing synthetic versions of microdata containing confidential information so that they are safe to be released to users for exploratory analysis. The key objective of generating synthetic data is to replace sensitive original values with synthetic ones causing minimal distortion of the statistical information contained in the data set. Variables, which can be categorical or continuous, are synthesised one-by-one using sequential modelling. Replacements are generated by drawing from conditional distributions fitted to the original data using parametric or classification and regression trees models. Data are synthesised via the function syn() which can be largely automated, if default settings are used, or with methods defined by the user. Optional parameters can be used to influence the disclosure risk and the analytical quality of the synthesised data. For a description of the implemented method see Nowok, Raab and Dibben (2016) <doi:10.18637/jss.v074.i11>. Functions to assess identity and attribute disclosure for the original and for the synthetic data are included in the package, and their use is illustrated in a vignette on disclosure (Practical Privacy Metrics for Synthetic Data).

Package overview Disclosure Inference in synthpop Using synthpop Utility

Vignettes Man pages API and functions Files

Package details
Author	Beata Nowok [aut, cre], Gillian M Raab [aut], Chris Dibben [ctb], Joshua Snoke [ctb], Caspar van Lissa [ctb], Lotte Pater [ctb]
Maintainer	Beata Nowok <beata.nowok@gmail.com>
License	GPL-2 \| GPL-3
Version	1.9-1.1
URL	<https://www.synthpop.org.uk/>
Package repository	View on CRAN
Installation	Install the latest version of this package by entering the following in R: `install.packages("synthpop")`

Any scripts or data that you put into this service are public.

synthpop documentation built on June 8, 2025, 1:31 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

synthpop
Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

Getting started

Browse package contents

Package details

Try the synthpop package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

synthpop Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

Getting started

Browse package contents

Package details

Try the synthpop package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

synthpop
Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control