vtreat: A Statistically Sound 'data.frame' Processor/Conditioner

A 'data.frame' processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. 'vtreat' prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: 'Inf', 'NA', too many categorical levels, rare categorical levels, and new categorical levels (levels seen during application, but not during training). 'vtreat::prepare' should be used as you would use 'model.matrix'.

AuthorJohn Mount, Nina Zumel
Date of publication2017-01-21 23:09:51
MaintainerJohn Mount <jmount@win-vector.com>
LicenseGPL-3
Version0.5.30
https://github.com/WinVector/vtreat

View on CRAN

Files in this package

vtreat
vtreat/inst
vtreat/inst/doc
vtreat/inst/doc/vtreatSplitting.Rmd
vtreat/inst/doc/vtreatGrouping.Rmd
vtreat/inst/doc/SavingTreamentPlans.html
vtreat/inst/doc/vtreatCrossFrames.R
vtreat/inst/doc/vtreat.html
vtreat/inst/doc/vtreat.Rmd
vtreat/inst/doc/vtreatRareLevels.html
vtreat/inst/doc/vtreatSignificance.html
vtreat/inst/doc/vtreatOverfit.R
vtreat/inst/doc/vtreatRareLevels.R
vtreat/inst/doc/vtreatVariableTypes.html
vtreat/inst/doc/vtreatScaleMode.R
vtreat/inst/doc/vtreatScaleMode.Rmd
vtreat/inst/doc/vtreatVariableTypes.Rmd
vtreat/inst/doc/vtreatCrossFrames.Rmd
vtreat/inst/doc/SavingTreamentPlans.Rmd
vtreat/inst/doc/vtreatOverfit.Rmd
vtreat/inst/doc/vtreatScaleMode.html
vtreat/inst/doc/vtreatGrouping.R
vtreat/inst/doc/vtreatSplitting.html
vtreat/inst/doc/vtreatGrouping.html
vtreat/inst/doc/vtreatCrossFrames.html
vtreat/inst/doc/vtreat.R
vtreat/inst/doc/SavingTreamentPlans.R
vtreat/inst/doc/vtreatSignificance.Rmd
vtreat/inst/doc/vtreatSplitting.R
vtreat/inst/doc/vtreatOverfit.html
vtreat/inst/doc/vtreatRareLevels.Rmd
vtreat/inst/doc/vtreatVariableTypes.R
vtreat/inst/doc/vtreatSignificance.R
vtreat/tests
vtreat/tests/testthat.R
vtreat/tests/testthat
vtreat/tests/testthat/testWeirdTypes.R
vtreat/tests/testthat/testW1.R
vtreat/tests/testthat/testdplyr.R
vtreat/tests/testthat/testParallel.R
vtreat/tests/testthat/testExpmtDesign.R
vtreat/tests/testthat/testNoY.R
vtreat/tests/testthat/testPC.R
vtreat/tests/testthat/testSig.R
vtreat/tests/testthat/uci.car.data.Rdata
vtreat/tests/testthat/testBO.R
vtreat/tests/testthat/testCar.R
vtreat/tests/testthat/testScale.R
vtreat/tests/testthat/testDataTable.R
vtreat/tests/testthat/testStability.R
vtreat/tests/testthat/testZW.R
vtreat/tests/testthat/testUniqValue.R
vtreat/NAMESPACE
vtreat/R
vtreat/R/utils.R vtreat/R/deviationFact.R vtreat/R/prevalenceFact.R vtreat/R/vtreatImpl.R vtreat/R/outOfSample.R vtreat/R/indicatorTreatment.R vtreat/R/effectTreatmentN.R vtreat/R/vtreat.R vtreat/R/cleanTreatment.R vtreat/R/isBadTreatment.R vtreat/R/effectTreatmentC.R
vtreat/vignettes
vtreat/vignettes/vtreatSplitting.Rmd
vtreat/vignettes/vtreatGrouping.Rmd
vtreat/vignettes/vtreat.Rmd
vtreat/vignettes/vtreatScaleMode.Rmd
vtreat/vignettes/vtreatVariableTypes.Rmd
vtreat/vignettes/vtreatCrossFrames.Rmd
vtreat/vignettes/SavingTreamentPlans.Rmd
vtreat/vignettes/vtreatOverfit.Rmd
vtreat/vignettes/superX.png
vtreat/vignettes/vtreatSignificance.Rmd
vtreat/vignettes/vtreatX.png
vtreat/vignettes/vtreatRareLevels.Rmd
vtreat/README.md
vtreat/MD5
vtreat/build
vtreat/build/vignette.rds
vtreat/DESCRIPTION
vtreat/man
vtreat/man/vtreat-package.Rd vtreat/man/designTreatmentsC.Rd vtreat/man/problemAppPlan.Rd vtreat/man/catScore.Rd vtreat/man/designTreatmentsZ.Rd vtreat/man/mkCrossFrameCExperiment.Rd vtreat/man/makekWayCrossValidationGroupedByColumn.Rd vtreat/man/format.vtreatment.Rd vtreat/man/buildEvalSets.Rd vtreat/man/print.vtreatment.Rd vtreat/man/designTreatmentsN.Rd vtreat/man/vorig.Rd vtreat/man/prepare.Rd vtreat/man/getSplitPlanAppLabels.Rd vtreat/man/oneWayHoldout.Rd vtreat/man/kWayStratifiedY.Rd vtreat/man/linScore.Rd vtreat/man/vnames.Rd vtreat/man/kWayCrossValidation.Rd vtreat/man/mkCrossFrameNExperiment.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.