Man pages for AlteryxLabs/datadoctor
Data Doctor

blanks_to_missing: Convert any blank values to missing (NA) values
bsplineImportance: Correlation based variable importance weights using B-splines...
change_factor_to_number: Change factor type columns to numeric.
change_int_to_factor: Change integer type of columns to factor type.
change_missing_to_na: Convert whitespace entries to NAs
change_numeric_to_int: Change a numeric column to integer, if it appears to be an...
chiSquaredImportance: Chi-squared statistic based variable importance weights
cols_collinear: Groups of collinear columns within a data frame
cols_continuous_duplicates: Find duplicate and perfectly correlated numeric and integer...
cols_continuous_fitness: Calculate the fitness measure for continuous columns.
cols_duplicates: Find duplicate and perfectly correlated integer columns in a...
cols_factor_duplicates: Find duplicate factor columns in a data frame
cols_factor_fitness: Calculate the fitness measure for factor columns.
cols_factor_is_number: Determine which factors might actually be numbers.
cols_imputation: Impute missing values, with factors getting the value...
cols_integer_is_numeric: Determine which numeric variables in a data frame are...
cols_int_is_categorical: Return column names, whose current data type is integer, but...
cols_int_sequential: Determine which integer columns have sequential values
cols_missing_blank: Report on the number and percentage of missing (NA) values...
cols_numeric_is_integer: Determine which numeric fields might actually be integers
cols_num_outliers: Conservatively detect outliers for continuous variables in a...
cols_num_sparse_levels: Convert any blank values to missing (NA) values
cols_with_unique_val: Return names of columns that have only one value
detect_outlier_single: Detect outliers for a single column
entropy: Entropy measure
entropyBins: Entropy target based binning
entropySplit: Finds the value of a numeric vector that results in the best...
equalIntervalBins: Bin a numeric variable into a set of equal interval bins.
formattedTextList: An English language centric helper function that converts a...
get_col_types: Return names of columns that are factor, integer or numeric...
get_HHI: Calculate the Herfindahl-Hirschman index for the levels of a...
intervalLabels: A helper function to construct interval labels
posLog: Produces logarithm values for strictly positive values, zero...
strip_helper: Strip off characters such as currency indicator and...
strongCor: Find strong bivariate correlations
trimNewlines: Strip out newlines and white space from elements of a...
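
Several entries above name standard measures (`entropy`, `get_HHI`, `equalIntervalBins`) without showing what they compute. The base R sketch below illustrates the general ideas only. The function names `hhi_sketch`, `entropy_sketch` and `equal_bins_sketch` are hypothetical and are not part of datadoctor, and the choices made here (shares as fractions rather than percentages, log base 2, NA values dropped) are assumptions, since this index does not state the package's actual signatures or conventions.

```r
# Hypothetical helpers for illustration; NOT the datadoctor implementations.

# Herfindahl-Hirschman index: sum of squared level shares.
# Assumes shares are fractions in [0, 1]; table() drops NA values by default.
hhi_sketch <- function(x) {
  shares <- prop.table(table(x))
  sum(shares^2)
}

# Shannon entropy of the level distribution, in bits (log base 2 is an assumption).
entropy_sketch <- function(x) {
  p <- as.numeric(prop.table(table(x)))
  p <- p[p > 0]  # unused factor levels contribute nothing
  -sum(p * log2(p))
}

# Equal-interval binning via base R's cut(), which splits the range of x
# into n_bins intervals of equal width.
equal_bins_sketch <- function(x, n_bins) {
  cut(x, breaks = n_bins, include.lowest = TRUE)
}

x <- factor(c("a", "a", "b", "b"))
hhi_sketch(x)      # two equally frequent levels: 0.5^2 + 0.5^2 = 0.5
entropy_sketch(x)  # two equally frequent levels: 1 bit
nlevels(equal_bins_sketch(c(1, 2, 3, 4, 5, 6), 3))  # 3 bins
```

A low HHI or a high entropy indicates that observations are spread evenly across many levels, while an HHI of 1 (entropy of 0) indicates a column with a single level.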
AlteryxLabs/datadoctor documentation built on May 28, 2017, 3:52 p.m.