preproviz: Tools for Visualization of Interdependent Data Quality Issues

Data quality issues such as missing values and outliers are often interdependent, which makes preprocessing time-consuming and can lead to suboptimal performance in knowledge discovery tasks. This package supports preprocessing decisions by visualizing interdependent data quality issues by means of feature construction. Users can define their own domain-specific constructed features that express the quality of a data point, such as the number of missing values in the point, or use the nine default features. The outcome can be explored with the plot methods, and the feature-constructed data can be retrieved with the get methods.
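To make the idea of a constructed feature concrete, here is a minimal sketch in base R of the example named above: the number of missing values in each data point, followed by min-max normalization. It does not use the preproviz API. The data frame `df` and the helper `min_max` are hypothetical names introduced only for illustration, and the package's own implementation may differ.

```r
# Hypothetical example data: a small data frame with some missing values.
df <- data.frame(
  x = c(1.0, NA, 3.0, 4.0),
  y = c(NA, NA, 2.5, 1.0),
  z = c("a", "b", NA, "d"),
  stringsAsFactors = FALSE
)

# Constructed feature: number of missing values in each data point (row).
missing_count <- rowSums(is.na(df))

# Hypothetical helper: min-max normalize a numeric vector to [0, 1].
# A constant vector is mapped to zeros to avoid division by zero.
min_max <- function(v) {
  rng <- range(v)
  if (rng[1] == rng[2]) return(rep(0, length(v)))
  (v - rng[1]) / (rng[2] - rng[1])
}

# Collect the raw and the normalized constructed feature side by side.
constructed <- data.frame(
  missing_count = unname(missing_count),
  missing_count_scaled = min_max(unname(missing_count))
)
print(constructed)
```

Putting constructed features on a common scale like this is what lets several of them be compared in one plot. In the package itself, the equivalent data would presumably be obtained through the get methods listed below rather than computed by hand.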

Author: Markus Vattulainen [aut, cre]
Date of publication: 2016-07-09 10:10:07
Maintainer: Markus Vattulainen <markus.vattulainen@gmail.com>
License: GPL-2
Version: 0.2.0
https://github.com/mvattulainen/preproviz


Man pages

AnalysisClass-class: An S4 class representing analysis data

BaseClass-class: An abstract S4 class representing constructed features

computeValue: generic function for computing constructed feature vectors

constructfeature: constructor function for adding constructed features to the...

ControlClass-class: An S4 class representing setups to be executed

DataClass-class: An S4 class representing data objects

defaultParameters: defaultParameters

getbasedata: getbasedata

getclasslabels: getclasslabels

getcmdsdata: get classical multidimensional scaling from minmaxconstructed...

getcombineddata: get basedata and constructed data combined

getconstructeddata: getconstructeddata

getlofscores: getlofscores

getlofsumdata: getlofsumdata

getlongformatconstructeddata: get constructed data in long format

getlongformatminmaxconstructeddata: getlongformatminmaxconstructeddata

getminmaxconstructeddata: get constructed data that have been min-max normalized

getname: get name of an object

getnumericbasedata: getnumericbasedata

getnumericombineddata: get numeric columns of combined data

getparameters: getparameters

getvariableimportancedata: get random forest variable importance data

initializecontrolclassobject: constructor function for initializing a ControlClass object

initializedataobject: constructor function for initializing a DataClass object

initializeparameterclassobject: constructor function for initializing a ParameterClass object

initializesetupclassobject: constructor function for initializing a SetUpClass object

ParameterClass-class: An S4 class representing selected constructed features

plotCMDS: generic function for plotting classical multidimensional...

plotDENSITY: generic function for plotting density estimates of...

plotHEATMAP: generic function for plotting heatmap

plotLOFSUM: generic function for plotting lof sum of constructed features

plotOUTLIERS: generic function for plotting density of LOF scores

plotVARCLUST: generic function for plotting variable clusters

plotVARIMP: generic function for plotting variable importance

preproviz: the MAIN execution function

ReportClass-class: An S4 class representing visualizations

RunClass-class: An S4 class representing preproviz output (data and...

SetUpClass-class: An S4 class representing setups

Files in this package

preproviz
preproviz/inst
preproviz/inst/doc
preproviz/inst/doc/preproviz.Rmd
preproviz/inst/doc/preproviz.R
preproviz/inst/doc/preproviz.html
preproviz/tests
preproviz/tests/testthat.R
preproviz/tests/testthat
preproviz/tests/testthat/test.R
preproviz/NAMESPACE
preproviz/R
preproviz/R/03AnalysisClass.R
preproviz/R/DefaultControl.R
preproviz/R/05ReportingClass.R
preproviz/R/01BaseClass.R
preproviz/R/06RunClass.R
preproviz/R/04ControlClass.R
preproviz/R/02DefaultFeatures.R
preproviz/R/00Utils.R
preproviz/vignettes
preproviz/vignettes/preproviz.Rmd
preproviz/MD5
preproviz/build
preproviz/build/vignette.rds
preproviz/DESCRIPTION
preproviz/man
preproviz/man/initializecontrolclassobject.Rd
preproviz/man/ParameterClass-class.Rd
preproviz/man/BaseClass-class.Rd
preproviz/man/getvariableimportancedata.Rd
preproviz/man/getlongformatconstructeddata.Rd
preproviz/man/getnumericbasedata.Rd
preproviz/man/getnumericombineddata.Rd
preproviz/man/constructfeature.Rd
preproviz/man/ReportClass-class.Rd
preproviz/man/getconstructeddata.Rd
preproviz/man/getbasedata.Rd
preproviz/man/plotHEATMAP.Rd
preproviz/man/plotDENSITY.Rd
preproviz/man/AnalysisClass-class.Rd
preproviz/man/initializeparameterclassobject.Rd
preproviz/man/plotVARCLUST.Rd
preproviz/man/getparameters.Rd
preproviz/man/plotCMDS.Rd
preproviz/man/defaultParameters.Rd
preproviz/man/plotLOFSUM.Rd
preproviz/man/getcombineddata.Rd
preproviz/man/getlongformatminmaxconstructeddata.Rd
preproviz/man/getminmaxconstructeddata.Rd
preproviz/man/getclasslabels.Rd
preproviz/man/computeValue.Rd
preproviz/man/getlofscores.Rd
preproviz/man/RunClass-class.Rd
preproviz/man/getcmdsdata.Rd
preproviz/man/DataClass-class.Rd
preproviz/man/ControlClass-class.Rd
preproviz/man/getlofsumdata.Rd
preproviz/man/initializedataobject.Rd
preproviz/man/SetUpClass-class.Rd
preproviz/man/plotOUTLIERS.Rd
preproviz/man/preproviz.Rd
preproviz/man/initializesetupclassobject.Rd
preproviz/man/plotVARIMP.Rd
preproviz/man/getname.Rd
