Man pages for ncsa/DataHarmonizationPipeline
An automated statistical analysis pipeline in R for use on public health datasets

addIdentifyingInfoFunction that adds identifying info (unique to each row) to...
applyPCAFunction that applies PCA on the preprocessed data and...
check.integerFunction that checks if a provided variable is an integer...
checkinteger.columnFunction that checks if all the data in a column of a dataset...
dropColumnsWithMissingDataFunction that drops columns in the discrete and continuous...
dropNonNumericalDataFunction that drops non numerical data (not necessary for...
dropRowsWithMissingDataFunction that drops rows in the discrete and continuous...
getDiscreteAndContinuousIndicesFunction that gets the indices of the dataframe columns that...
loadDataFunction that loads data to be analyzed from the specified...
mainDriverFunction that is the main driver for the entire program that...
makeDataTypesUniformFunction that converts all the data in both datasets to be of...
missingPercentFunction that gets the percentage of missing data in the...
normalizeColFunction that normalizes data in the column of a dataset...
normalizeContinuousDataFunction that normalizes all the data in the continuous...
ncsa/DataHarmonizationPipeline documentation built on May 30, 2019, 2:05 p.m.