Man pages for jeroenvdhoven/datapiper
datapiper

build_dockerBuild a docker image out of a model function
build_model_packageBuilds a package from your pipeline
create_statsCalculates stats based on custom functions on the response...
create_stats_predictUses previous statistics results to generate columns for a...
feature_categorical_filter_predictFilters categorical variables for new datasets
feature_finderAdd features one-by-one to find good small sets of features
feature_interactions_predictComputes interaction effects for a new dataset
feature_one_hot_encode_predictApply one-hot encoding
feature_transformer_post_predictUses the results of 'pipe_feature_transformer' on new...
feature_transformer_predictUses the results of 'pipe_feature_transformer' on new...
find_best_modelsSelect top models from the find_model function
find_expand_resultsExpands the params column of the result of 'find_model'
find_modelFind fitting models and test them using given metrics on the...
find_model_through_bayesHyper parameter search using bayesian optimisation
find_template_formula_and_dataA convenient wrapper function for find_model for models that...
find_xgbA convenient wrapper function for find_model for xgboost
flatten_pipelineFlattens a pipeline so it does not contain any more...
greedy_max_independent_setFinds a maximum independent set using greedy search
high_correlation_featuresDetermines which columns are too highly correlated.
impute_modelGenerate a model to impute missing data in a column
impute_predict_allUse the models from 'impute_all' to impute the selected...
invokeGeneric function to apply either a pipe or pipeline to new...
invoke.pipeApplies a pipe to new data
invoke.pipelineApplies a pipeline to new data
is.pipeTests if an object inherits from pipe
is.pipelineTests if an object inherits from pipeline
model_trainerWrapper function for model inputs to 'find_model'
NA_indicators_predictIndicate which fields are NA
pipeCreates a pipe object out of a function and a list of...
pipe_categorical_filterRemove values from categorical variables that do not occur...
pipe_checkCreate a pipeline step that learns what the data looks like
pipe_clusteringAdd cluster labels to a training set
pipe_create_statsGeneric function for creating statistics on the response...
pipe_dplyrWrapper function to turn a dplyr function into a pipeline...
pipe_feature_interactionsGenerates permutation interaction effects between sets of...
pipe_feature_transformerApplies different transformations to each numeric feature and...
pipe_functionWrapper for putting a single function into a pipeline
pipe_imputeImpute multiple missing columns using lm, mean, or xgboost,...
pipelineCreates a pipeline out of a set of pipes
pipe_mutateApplies mutate in a pipeline
pipe_NA_indicatorsIndicate which fields are NA
pipe_one_hot_encodeTrain one-hot encoding
pipe_pcaApply PCA to a subset of the columns in a dataset
pipe_range_classifierGenerates features for regression problems through...
pipe_remove_high_correlation_featuresRemoves highly correlated features whilst keeping as many as...
pipe_remove_single_value_columnsRemove all columns that have only a single value
pipe_scalerRescales data to standardised ranges
pipe_selectApplies select in a pipeline
plot_high_correlationsPlots highly correlated features as a graph
preserve_columns_predictOnly keep previously selected columns
range_predictUse generated models and scales to create features for a new...
segmentA simple wrapper for creating a pipe segment
standard_column_namesStandardises column names into an oft-acceptable format
test_dockerTest your docker image
train_pipelineCreate a train/test pipeline from individual functions
jeroenvdhoven/datapiper documentation built on July 14, 2019, 9:34 p.m.