synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

A tool for producing synthetic versions of microdata containing confidential information so that they are safe to be released to users for exploratory analysis. The key objective of generating synthetic data is to replace sensitive original values with synthetic ones causing minimal distortion of the statistical information contained in the data set. Variables, which can be categorical or continuous, are synthesised one-by-one using sequential modelling. Replacements are generated by drawing from conditional distributions fitted to the original data using parametric or classification and regression trees models. Data are synthesised via the function syn() which can be largely automated, if default settings are used, or with methods defined by the user. Optional parameters can be used to influence the disclosure risk and the analytical quality of the synthesised data. For a description of the implemented method see Nowok, Raab and Dibben (2016) <http://doi.org/10.18637/jss.v074.i11>.

Author
Beata Nowok, Gillian M Raab, Joshua Snoke and Chris Dibben
Date of publication
2016-11-23 14:09:47
Maintainer
Beata Nowok <beata.nowok@gmail.com>
License
GPL-2 | GPL-3
Version
1.3-1

View on CRAN

Man pages

compare
Comparison of synthesised and observed data
compare.fit.synds
Compare model estimates based on synthesised and observed...
compare.synds
Compare univariate distributions of synthesised and observed...
glm.synds
Fitting (generalized) linear models to synthetic data
multi.compare
Multivariate comparison of synthesised and observed data
read.obs
Importing original data sets form external files
replicated.uniques
Replications in synthetic data
SD2011
Social Diagnosis 2011 - Objective and Subjective Quality of...
sdc
Tools for statistical disclosure control (sdc)
summary.fit.synds
Inference from synthetic data
summary.synds
Synthetic data object summaries
syn
Generating synthetic data sets
syn.bag
Synthesis with bagging
syn.cart
Synthesis with classification and regression trees (CART)
syn.lognorm
Synthesis by linear regression after transformation of a...
syn.logreg
Synthesis by logistic regression
syn.nested
Synthesis for a variable nested within another variable.
syn.norm
Synthesis by linear regression
syn.normrank
Synthesis by normal linear regression preserving the marginal...
syn.passive
Passive synthesis
syn.pmm
Synthesis by predictive mean matching
syn.polr
Synthesis by ordered polytomous regression
syn.polyreg
Synthesis by unordered polytomous regression
syn.rf
Synthesis with random forest
syn.sample
Synthesis by simple random sampling
syn.survctree
Synthesis of survival time by classification and regression...
synthpop-package
Generating synthetic versions of sensitive microdata for...
tab.utility
[EXPERIMENTAL] Tabular utility
utility.synds
Distributional comparison of synthesised and observed data
write.syn
Exporting synthetic data sets to external files

Files in this package

synthpop
synthpop/inst
synthpop/inst/CITATION
synthpop/inst/doc
synthpop/inst/doc/synthpop.Rnw
synthpop/inst/doc/synthpop.pdf
synthpop/inst/doc/synthpop.R
synthpop/NAMESPACE
synthpop/NEWS
synthpop/data
synthpop/data/SD2011.rda
synthpop/R
synthpop/R/sampler.syn.r
synthpop/R/syn.r
synthpop/R/methods.syn.r
synthpop/R/padModel.syn.r
synthpop/R/padMis.syn.r
synthpop/R/IO.r
synthpop/R/compare.syn.r
synthpop/R/multi.compare.syn.r
synthpop/R/utility.syn.r
synthpop/R/syn.strata.r
synthpop/R/functions.syn.r
synthpop/vignettes
synthpop/vignettes/synthpop.bib
synthpop/vignettes/Figure2income.pdf
synthpop/vignettes/synthpop.Rnw
synthpop/vignettes/Figure3Z.pdf
synthpop/vignettes/Figure1ls.pdf
synthpop/MD5
synthpop/build
synthpop/build/vignette.rds
synthpop/build/partial.rdb
synthpop/DESCRIPTION
synthpop/man
synthpop/man/syn.norm.Rd
synthpop/man/SD2011.Rd
synthpop/man/compare.synds.Rd
synthpop/man/syn.survctree.Rd
synthpop/man/read.obs.Rd
synthpop/man/synthpop-package.Rd
synthpop/man/syn.Rd
synthpop/man/syn.cart.Rd
synthpop/man/syn.normrank.Rd
synthpop/man/syn.pmm.Rd
synthpop/man/summary.fit.synds.Rd
synthpop/man/syn.logreg.Rd
synthpop/man/glm.synds.Rd
synthpop/man/syn.polr.Rd
synthpop/man/tab.utility.Rd
synthpop/man/summary.synds.Rd
synthpop/man/syn.bag.Rd
synthpop/man/multi.compare.Rd
synthpop/man/compare.Rd
synthpop/man/syn.nested.Rd
synthpop/man/write.syn.Rd
synthpop/man/sdc.Rd
synthpop/man/replicated.uniques.Rd
synthpop/man/syn.passive.Rd
synthpop/man/syn.sample.Rd
synthpop/man/syn.polyreg.Rd
synthpop/man/syn.lognorm.Rd
synthpop/man/utility.synds.Rd
synthpop/man/compare.fit.synds.Rd
synthpop/man/syn.rf.Rd