vtreat: A Statistically Sound data.frame Processor/Conditioner

Share:

A data.frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Prepares variables so that data has fewer exceptional cases, making it easier to safely use models in production. Common problems 'vtreat' defends against: Inf, NA, too many categorical levels, rare categorical levels, new categorical levels (levels seen during application, but not during training). 'vtreat::prepare' should be used as you would use 'model.matrix'.

Author
John Mount, Nina Zumel
Date of publication
2016-10-25 00:18:32
Maintainer
John Mount <jmount@win-vector.com>
License
GPL-3
Version
0.5.28
URLs

View on CRAN

Man pages

buildEvalSets
Build set carve-up for out-of sample evaluation.
catScore
return significnace 1 variable logistic regression
designTreatmentsC
Build all treatments for a data frame to predict a...
designTreatmentsN
build all treatments for a data frame to predict a numeric...
designTreatmentsZ
Design variable treatments with no outcome variable.
format.vtreatment
Display treatment plan.
getSplitPlanAppLabels
read application labels off a split plan.
kWayCrossValidation
k-fold cross validation, a splitFunction in the sense of...
kWayStratifiedY
k-fold cross validation stratified on y, a splitFunction in...
linScore
Return in-sample linear stats and scaling.
makekWayCrossValidationGroupedByColumn
Build a k-fold cross validation splitter, respecting (never...
mkCrossFrameCExperiment
Run categorical cross-frame experiment.
mkCrossFrameNExperiment
Run numeric cross frame experiment.
oneWayHoldout
One way holdout, a splitFunction in the sense of...
prepare
Apply treatments and restrict to useful variables.
print.vtreatment
Print treatmentplan.
problemAppPlan
check if appPlan is a good carve-up of 1:nRows into nSplits...
vnames
New treated variable names from a treatmentplan$treatment...
vorig
Original variable name from a treatmentplan$treatment item.
vtreat-package
vtreat: a package for simple variable treatment

Files in this package

vtreat
vtreat/inst
vtreat/inst/doc
vtreat/inst/doc/vtreatSplitting.Rmd
vtreat/inst/doc/vtreatGrouping.Rmd
vtreat/inst/doc/vtreatCrossFrames.R
vtreat/inst/doc/vtreat.html
vtreat/inst/doc/vtreat.Rmd
vtreat/inst/doc/vtreatRareLevels.html
vtreat/inst/doc/vtreatSignificance.html
vtreat/inst/doc/vtreatOverfit.R
vtreat/inst/doc/vtreatRareLevels.R
vtreat/inst/doc/vtreatVariableTypes.html
vtreat/inst/doc/vtreatScaleMode.R
vtreat/inst/doc/vtreatScaleMode.Rmd
vtreat/inst/doc/vtreatVariableTypes.Rmd
vtreat/inst/doc/vtreatCrossFrames.Rmd
vtreat/inst/doc/vtreatOverfit.Rmd
vtreat/inst/doc/vtreatScaleMode.html
vtreat/inst/doc/vtreatGrouping.R
vtreat/inst/doc/vtreatSplitting.html
vtreat/inst/doc/vtreatGrouping.html
vtreat/inst/doc/vtreatCrossFrames.html
vtreat/inst/doc/vtreat.R
vtreat/inst/doc/vtreatSignificance.Rmd
vtreat/inst/doc/vtreatSplitting.R
vtreat/inst/doc/vtreatOverfit.html
vtreat/inst/doc/vtreatRareLevels.Rmd
vtreat/inst/doc/vtreatVariableTypes.R
vtreat/inst/doc/vtreatSignificance.R
vtreat/tests
vtreat/tests/testthat.R
vtreat/tests/testthat
vtreat/tests/testthat/testWeirdTypes.R
vtreat/tests/testthat/testW1.R
vtreat/tests/testthat/testdplyr.R
vtreat/tests/testthat/testParallel.R
vtreat/tests/testthat/testExpmtDesign.R
vtreat/tests/testthat/testNoY.R
vtreat/tests/testthat/testPC.R
vtreat/tests/testthat/testSig.R
vtreat/tests/testthat/uci.car.data.Rdata
vtreat/tests/testthat/testBO.R
vtreat/tests/testthat/testCar.R
vtreat/tests/testthat/testScale.R
vtreat/tests/testthat/testDataTable.R
vtreat/tests/testthat/testStability.R
vtreat/tests/testthat/testZW.R
vtreat/tests/testthat/testUniqValue.R
vtreat/NAMESPACE
vtreat/R
vtreat/R/utils.R
vtreat/R/deviationFact.R
vtreat/R/prevalenceFact.R
vtreat/R/vtreatImpl.R
vtreat/R/outOfSample.R
vtreat/R/indicatorTreatment.R
vtreat/R/effectTreatmentN.R
vtreat/R/vtreat.R
vtreat/R/cleanTreatment.R
vtreat/R/isBadTreatment.R
vtreat/R/effectTreatmentC.R
vtreat/vignettes
vtreat/vignettes/vtreatSplitting.Rmd
vtreat/vignettes/vtreatGrouping.Rmd
vtreat/vignettes/vtreat.Rmd
vtreat/vignettes/vtreatScaleMode.Rmd
vtreat/vignettes/vtreatVariableTypes.Rmd
vtreat/vignettes/vtreatCrossFrames.Rmd
vtreat/vignettes/vtreatOverfit.Rmd
vtreat/vignettes/superX.png
vtreat/vignettes/vtreatSignificance.Rmd
vtreat/vignettes/vtreatX.png
vtreat/vignettes/vtreatRareLevels.Rmd
vtreat/README.md
vtreat/MD5
vtreat/build
vtreat/build/vignette.rds
vtreat/DESCRIPTION
vtreat/man
vtreat/man/vtreat-package.Rd
vtreat/man/designTreatmentsC.Rd
vtreat/man/problemAppPlan.Rd
vtreat/man/catScore.Rd
vtreat/man/designTreatmentsZ.Rd
vtreat/man/mkCrossFrameCExperiment.Rd
vtreat/man/makekWayCrossValidationGroupedByColumn.Rd
vtreat/man/format.vtreatment.Rd
vtreat/man/buildEvalSets.Rd
vtreat/man/print.vtreatment.Rd
vtreat/man/designTreatmentsN.Rd
vtreat/man/vorig.Rd
vtreat/man/prepare.Rd
vtreat/man/getSplitPlanAppLabels.Rd
vtreat/man/oneWayHoldout.Rd
vtreat/man/kWayStratifiedY.Rd
vtreat/man/linScore.Rd
vtreat/man/vnames.Rd
vtreat/man/kWayCrossValidation.Rd
vtreat/man/mkCrossFrameNExperiment.Rd