vtreat: A Statistically Sound 'data.frame' Processor/Conditioner

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). 'vtreat::prepare' should be used as you would use 'model.matrix'.

AuthorJohn Mount, Nina Zumel
Date of publication2017-01-21 23:09:51
MaintainerJohn Mount <jmount@win-vector.com>
LicenseGPL-3
Version0.5.30
https://github.com/WinVector/vtreat

View on CRAN

Functions

buildEvalSets Man page
catScore Man page
designTreatmentsC Man page
designTreatmentsN Man page
designTreatmentsZ Man page
format.vtreatment Man page
getSplitPlanAppLabels Man page
kWayCrossValidation Man page
kWayStratifiedY Man page
linScore Man page
makekWayCrossValidationGroupedByColumn Man page
mkCrossFrameCExperiment Man page
mkCrossFrameNExperiment Man page
oneWayHoldout Man page
prepare Man page
print.vtreatment Man page
problemAppPlan Man page
vnames Man page
vorig Man page
vtreat Man page
vtreat-package Man page

Files

vtreat
vtreat/inst
vtreat/inst/doc
vtreat/inst/doc/vtreatSplitting.Rmd
vtreat/inst/doc/vtreatGrouping.Rmd
vtreat/inst/doc/SavingTreamentPlans.html
vtreat/inst/doc/vtreatCrossFrames.R
vtreat/inst/doc/vtreat.html
vtreat/inst/doc/vtreat.Rmd
vtreat/inst/doc/vtreatRareLevels.html
vtreat/inst/doc/vtreatSignificance.html
vtreat/inst/doc/vtreatOverfit.R
vtreat/inst/doc/vtreatRareLevels.R
vtreat/inst/doc/vtreatVariableTypes.html
vtreat/inst/doc/vtreatScaleMode.R
vtreat/inst/doc/vtreatScaleMode.Rmd
vtreat/inst/doc/vtreatVariableTypes.Rmd
vtreat/inst/doc/vtreatCrossFrames.Rmd
vtreat/inst/doc/SavingTreamentPlans.Rmd
vtreat/inst/doc/vtreatOverfit.Rmd
vtreat/inst/doc/vtreatScaleMode.html
vtreat/inst/doc/vtreatGrouping.R
vtreat/inst/doc/vtreatSplitting.html
vtreat/inst/doc/vtreatGrouping.html
vtreat/inst/doc/vtreatCrossFrames.html
vtreat/inst/doc/vtreat.R
vtreat/inst/doc/SavingTreamentPlans.R
vtreat/inst/doc/vtreatSignificance.Rmd
vtreat/inst/doc/vtreatSplitting.R
vtreat/inst/doc/vtreatOverfit.html
vtreat/inst/doc/vtreatRareLevels.Rmd
vtreat/inst/doc/vtreatVariableTypes.R
vtreat/inst/doc/vtreatSignificance.R
vtreat/tests
vtreat/tests/testthat.R
vtreat/tests/testthat
vtreat/tests/testthat/testWeirdTypes.R
vtreat/tests/testthat/testW1.R
vtreat/tests/testthat/testdplyr.R
vtreat/tests/testthat/testParallel.R
vtreat/tests/testthat/testExpmtDesign.R
vtreat/tests/testthat/testNoY.R
vtreat/tests/testthat/testPC.R
vtreat/tests/testthat/testSig.R
vtreat/tests/testthat/uci.car.data.Rdata
vtreat/tests/testthat/testBO.R
vtreat/tests/testthat/testCar.R
vtreat/tests/testthat/testScale.R
vtreat/tests/testthat/testDataTable.R
vtreat/tests/testthat/testStability.R
vtreat/tests/testthat/testZW.R
vtreat/tests/testthat/testUniqValue.R
vtreat/NAMESPACE
vtreat/R
vtreat/R/utils.R vtreat/R/deviationFact.R vtreat/R/prevalenceFact.R vtreat/R/vtreatImpl.R vtreat/R/outOfSample.R vtreat/R/indicatorTreatment.R vtreat/R/effectTreatmentN.R vtreat/R/vtreat.R vtreat/R/cleanTreatment.R vtreat/R/isBadTreatment.R vtreat/R/effectTreatmentC.R
vtreat/vignettes
vtreat/vignettes/vtreatSplitting.Rmd
vtreat/vignettes/vtreatGrouping.Rmd
vtreat/vignettes/vtreat.Rmd
vtreat/vignettes/vtreatScaleMode.Rmd
vtreat/vignettes/vtreatVariableTypes.Rmd
vtreat/vignettes/vtreatCrossFrames.Rmd
vtreat/vignettes/SavingTreamentPlans.Rmd
vtreat/vignettes/vtreatOverfit.Rmd
vtreat/vignettes/superX.png
vtreat/vignettes/vtreatSignificance.Rmd
vtreat/vignettes/vtreatX.png
vtreat/vignettes/vtreatRareLevels.Rmd
vtreat/README.md
vtreat/MD5
vtreat/build
vtreat/build/vignette.rds
vtreat/DESCRIPTION
vtreat/man
vtreat/man/vtreat-package.Rd vtreat/man/designTreatmentsC.Rd vtreat/man/problemAppPlan.Rd vtreat/man/catScore.Rd vtreat/man/designTreatmentsZ.Rd vtreat/man/mkCrossFrameCExperiment.Rd vtreat/man/makekWayCrossValidationGroupedByColumn.Rd vtreat/man/format.vtreatment.Rd vtreat/man/buildEvalSets.Rd vtreat/man/print.vtreatment.Rd vtreat/man/designTreatmentsN.Rd vtreat/man/vorig.Rd vtreat/man/prepare.Rd vtreat/man/getSplitPlanAppLabels.Rd vtreat/man/oneWayHoldout.Rd vtreat/man/kWayStratifiedY.Rd vtreat/man/linScore.Rd vtreat/man/vnames.Rd vtreat/man/kWayCrossValidation.Rd vtreat/man/mkCrossFrameNExperiment.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.