synthpop: Generating Synthetic Versions of Sensitive Microdata for Statistical Disclosure Control

A tool for producing synthetic versions of microdata containing confidential information so that they are safe to be released to users for exploratory analysis. The key objective of generating synthetic data is to replace sensitive original values with synthetic ones causing minimal distortion of the statistical information contained in the data set. Variables, which can be categorical or continuous, are synthesised one-by-one using sequential modelling. Replacements are generated by drawing from conditional distributions fitted to the original data using parametric or classification and regression trees models. Data are synthesised via the function syn() which can be largely automated, if default settings are used, or with methods defined by the user. Optional parameters can be used to influence the disclosure risk and the analytical quality of the synthesised data. For a description of the implemented method see Nowok, Raab and Dibben (2016) <http://doi.org/10.18637/jss.v074.i11>.

AuthorBeata Nowok, Gillian M Raab, Joshua Snoke and Chris Dibben
Date of publication2016-11-23 14:09:47
MaintainerBeata Nowok <beata.nowok@gmail.com>
LicenseGPL-2 | GPL-3
Version1.3-1

View on CRAN

Man pages

compare: Comparison of synthesised and observed data

compare.fit.synds: Compare model estimates based on synthesised and observed...

compare.synds: Compare univariate distributions of synthesised and observed...

glm.synds: Fitting (generalized) linear models to synthetic data

multi.compare: Multivariate comparison of synthesised and observed data

read.obs: Importing original data sets form external files

replicated.uniques: Replications in synthetic data

SD2011: Social Diagnosis 2011 - Objective and Subjective Quality of...

sdc: Tools for statistical disclosure control (sdc)

summary.fit.synds: Inference from synthetic data

summary.synds: Synthetic data object summaries

syn: Generating synthetic data sets

syn.bag: Synthesis with bagging

syn.cart: Synthesis with classification and regression trees (CART)

syn.lognorm: Synthesis by linear regression after transformation of a...

syn.logreg: Synthesis by logistic regression

syn.nested: Synthesis for a variable nested within another variable.

syn.norm: Synthesis by linear regression

syn.normrank: Synthesis by normal linear regression preserving the marginal...

syn.passive: Passive synthesis

syn.pmm: Synthesis by predictive mean matching

syn.polr: Synthesis by ordered polytomous regression

syn.polyreg: Synthesis by unordered polytomous regression

syn.rf: Synthesis with random forest

syn.sample: Synthesis by simple random sampling

syn.survctree: Synthesis of survival time by classification and regression...

synthpop-package: Generating synthetic versions of sensitive microdata for...

tab.utility: [EXPERIMENTAL] Tabular utility

utility.synds: Distributional comparison of synthesised and observed data

write.syn: Exporting synthetic data sets to external files

Functions

compare Man page
compare.fit.synds Man page
compare.synds Man page
glm.synds Man page
lm.synds Man page
multi.compare Man page
print.compare.fit.synds Man page
print.compare.synds Man page
print.fit.synds Man page
print.summary.fit.synds Man page
print.summary.synds Man page
print.synds Man page
print.tab.utility Man page
print.utility.synds Man page
read.obs Man page
replicated.uniques Man page
SD2011 Man page
sdc Man page
summary.fit.synds Man page
summary.synds Man page
syn Man page
syn.bag Man page
syn.cart Man page
syn.cartbboot Man page
syn.ctree Man page
syn.cubertnorm Man page
syn.lognorm Man page
syn.logreg Man page
syn.nested Man page
syn.norm Man page
syn.normrank Man page
syn.passive Man page
syn.pmm Man page
syn.pmm.proper Man page
syn.polr Man page
syn.polr.proper Man page
syn.polyreg Man page
syn.polyreg.proper Man page
syn.rf Man page
syn.sample Man page
syn.sqrtnorm Man page
syn.strata Man page
syn.survctree Man page
syn.survctree.proper Man page
synthpop Man page
synthpop-package Man page
tab.utility Man page
utility.synds Man page
write.syn Man page

Files

synthpop
synthpop/inst
synthpop/inst/CITATION
synthpop/inst/doc
synthpop/inst/doc/synthpop.Rnw
synthpop/inst/doc/synthpop.pdf
synthpop/inst/doc/synthpop.R
synthpop/NAMESPACE
synthpop/NEWS
synthpop/data
synthpop/data/SD2011.rda
synthpop/R
synthpop/R/sampler.syn.r
synthpop/R/syn.r
synthpop/R/methods.syn.r
synthpop/R/padModel.syn.r
synthpop/R/padMis.syn.r
synthpop/R/IO.r
synthpop/R/compare.syn.r
synthpop/R/multi.compare.syn.r
synthpop/R/utility.syn.r
synthpop/R/syn.strata.r
synthpop/R/functions.syn.r
synthpop/vignettes
synthpop/vignettes/synthpop.bib
synthpop/vignettes/Figure2income.pdf
synthpop/vignettes/synthpop.Rnw
synthpop/vignettes/Figure3Z.pdf
synthpop/vignettes/Figure1ls.pdf
synthpop/MD5
synthpop/build
synthpop/build/vignette.rds
synthpop/build/partial.rdb
synthpop/DESCRIPTION
synthpop/man
synthpop/man/syn.norm.Rd synthpop/man/SD2011.Rd synthpop/man/compare.synds.Rd synthpop/man/syn.survctree.Rd synthpop/man/read.obs.Rd synthpop/man/synthpop-package.Rd synthpop/man/syn.Rd synthpop/man/syn.cart.Rd synthpop/man/syn.normrank.Rd synthpop/man/syn.pmm.Rd synthpop/man/summary.fit.synds.Rd synthpop/man/syn.logreg.Rd synthpop/man/glm.synds.Rd synthpop/man/syn.polr.Rd synthpop/man/tab.utility.Rd synthpop/man/summary.synds.Rd synthpop/man/syn.bag.Rd synthpop/man/multi.compare.Rd synthpop/man/compare.Rd synthpop/man/syn.nested.Rd synthpop/man/write.syn.Rd synthpop/man/sdc.Rd synthpop/man/replicated.uniques.Rd synthpop/man/syn.passive.Rd synthpop/man/syn.sample.Rd synthpop/man/syn.polyreg.Rd synthpop/man/syn.lognorm.Rd synthpop/man/utility.synds.Rd synthpop/man/compare.fit.synds.Rd synthpop/man/syn.rf.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.