dprep: Data Pre-Processing and Visualization Functions for Classification

Data preprocessing techniques for classification. Functions for normalization, handling of missing values,discretization, outlier detection, feature selection, and data visualization are included.

AuthorEdgar Acuna and the CASTLE research group at The University of Puerto Rico-Mayaguez
Date of publication2015-11-24 07:46:38
MaintainerEdgar Acuna <edgar.acuna@upr.edu>
LicenseGPL
Version3.0.2

View on CRAN

Man pages

acugow: Gower distance from a vector to a matrix

arboleje: Predicting a bank's decision to give a loan for buying a car.

arboleje1: Predicting a bank's decision to give a loan for buying a car.

autompg: The Auto MPG dataset

baysout: Outlier detection using Bay and Schwabacher's algorithm.

breastw: The Breast Wisconsin dataset

bupa: The Bupa dataset

ce.impute: Imputation in supervised classification

ce.mimp: Mean or median imputation

census: census

chiMerge: Discretization using the Chi-Merge method

circledraw: circledraw

clean: Dataset's cleaning

colon: Alon et al.'s colon dataset

combinations: Constructing distinct permutations

crossval: Cross validation estimation of the misclassification error

crx: crx

cv10knn2: Auxiliary function for sequential feature selection

cv10lda2: Auxiliary function for sequential forward selection

cv10log: 10-fold cross validation estimation error for the classifier...

cv10mlp: 10-fold cross validation error estimation for the multilayer...

cv10rpart2: Auxiliary function for sequential feature selection

cvnaiveBayesd: Crossvalidation estimation error for the naive Bayes...

decscale: Decimal Scaling

diabetes: The Pima Indian Diabetes dataset

disc.1r: Discretization using the Holte's 1R method

disc2: Auxiliary function for performing discretization using equal...

disc.ef: Discretization using the method of equal frequencies

disc.ew: Discretization using the equal width method

disc.mentr: Discretization using the minimum entropy criterion

discretevar: Performs Minimum Entropy discretization for a given attribute

distancia: Vector-Vector Euclidiean Distance Function

distancia1: Vector-Vector Manhattan Distance Function

dist.to.knn: Auxiliary function for the LOF algorithm.

dprep-package: Data Preprocessing for supervised classification

ec.knnimp: Imputation using k-nearest neighbors.

eje1dis: Basic example for discriminant analysis

finco: FINCO Feature Selection Algorithm

heartc: The Heart Cleveland dataset

hepatitis: The hepatitis dataset

imagmiss: Visualization of Missing Data

inconsist: Computing the inconsistency measure

ionosphere: The Ionosphere dataset

knneigh.vect: Auxiliary function for computing the LOF measure.

knngow: K-nn classification using Gower distance

landsat: The landsat Satellite dataset

lofactor: Local Outlier Factor

lvf: Las Vegas Filter

mahaout: Multivariate outlier detection through the boxplot of the...

mardia: The Mardia's test of normality

maxlof: Detection of multivariate outliers using the LOF algorithm

midpoints1: Auxiliary function for computing minimun entropy...

mmnorm: Min-max normalization

mo3: The third moment of a multivariate distribution

mo4: The fourth moment of a multivariate distribution

moda: Calculating the Mode

near1: Auxiliary function for the reliefcont function

near3: Auxiliary function for the reliefcat function

nnmiss: Auxiliary function for knn imputation

outbox: Detecting outliers through boxplots of the features.

parallelplot: Parallel Coordinate Plot

radviz2d: Radial Coordinate Visualization

rangenorm: range normalization

reachability: Function for computing the reachability measure in the LOF...

redundancy: Finding the unique observations in a dataset along with their...

relief: RELIEF Feature Selection

reliefcat: Feature selection by the Relief Algorithm for datasets...

reliefcont: Feature selection by the Relief Algorithm for datasets with...

robout: Outlier Detection with Robust Mahalonobis distance

row.matches: Finding rows in a matrix equal to a given vector

sbs1: One-step sequential backward selection

score: Score function used in Bay's algorithm for outlier detection

sffs: Sequential Floating Forward Method

sfs: Sequential Forward Selection

sfs1: One-step sequential forward selection

Shuttle: The Shuttle dataset

signorm: Sigmoidal Normalization

softmaxnorm: Softmax Normalization

sonar: The Sonar dataset

srbct: Khan et al.'s small round blood cells dataset

star3d: Data Visuaization using star coordinates in three dimensions

starcoord: The star coordinates plot

surveyplot: Surveyplot

tchisq: Auxiliary function for the Chi-Merge discretization

top: Auxiliary function for Bay's Ouylier Detection Algorithm

unor: Auxiliary function for performing Holte's 1R discretization

vehicle: The Vehicle dataset

vvalen: The Van Valen test for equal covariance matrices

vvalen1: Auxiliary function for computing the Van Valen's...

znorm: Z-score normalization

Functions

acugow Man page
arboleje Man page
arboleje1 Man page
autompg Man page
baysout Man page
breastw Man page
bupa Man page
ce.impute Man page
ce.mimp Man page
census Man page
chiMerge Man page
circledraw Man page
clean Man page
colon Man page
combinations Man page
crossval Man page
crx Man page
cv10knn2 Man page
cv10lda2 Man page
cv10log Man page
cv10mlp Man page
cv10rpart2 Man page
cvnaiveBayesd Man page
decscale Man page
diabetes Man page
disc.1r Man page
disc2 Man page
disc.ef Man page
disc.ew Man page
disc.mentr Man page
discretevar Man page
distancia Man page
distancia1 Man page
dist.to.knn Man page
dprep Man page
dprep-package Man page
ec.knnimp Man page
eje1dis Man page
finco Man page
heartc Man page
hepatitis Man page
imagmiss Man page
inconsist Man page
ionosphere Man page
knneigh.vect Man page
knngow Man page
landsat Man page
lofactor Man page
lvf Man page
mahaout Man page
mardia Man page
maxlof Man page
midpoints1 Man page
mmnorm Man page
mo3 Man page
mo4 Man page
moda Man page
near1 Man page
near3 Man page
nnmiss Man page
outbox Man page
parallelplot Man page
radviz2d Man page
rangenorm Man page
reachability Man page
redundancy Man page
relief Man page
reliefcat Man page
reliefcont Man page
robout Man page
row.matches Man page
sbs1 Man page
score Man page
sffs Man page
sfs Man page
sfs1 Man page
Shuttle Man page
signorm Man page
softmaxnorm Man page
sonar Man page
srbct Man page
star3d Man page
starcoord Man page
surveyplot Man page
tchisq Man page
top Man page
unor Man page
vehicle Man page
vvalen Man page
vvalen1 Man page
znorm Man page

Files

dprep
dprep/src
dprep/src/Discrete.cpp
dprep/NAMESPACE
dprep/data
dprep/data/Shuttle.rda
dprep/data/eje1dis.rda
dprep/data/crx.rda
dprep/data/hepatitis.rda
dprep/data/bupa.rda
dprep/data/ionosphere.rda
dprep/data/sonar.rda
dprep/data/vehicle.rda
dprep/data/srbct.rda
dprep/data/heartc.rda
dprep/data/arboleje.rda
dprep/data/datalist
dprep/data/landsat.rda
dprep/data/breastw.rda
dprep/data/diabetes.rda
dprep/data/autompg.rda
dprep/data/arboleje1.rda
dprep/data/census.rda
dprep/data/colon.rda
dprep/R
dprep/R/crossval.R dprep/R/radviz2d.R dprep/R/maxlof.R dprep/R/knneigh.vect.R dprep/R/inconsist.R dprep/R/reachability.R dprep/R/decscale.R dprep/R/redundancy.R dprep/R/parallelplot.R dprep/R/chiMerge.R dprep/R/disc2.R dprep/R/softmaxnorm.R dprep/R/mardia.R dprep/R/clean.R dprep/R/mmnorm.R dprep/R/cv10rpart2.R dprep/R/dist.to.knn.R dprep/R/nnmiss.R dprep/R/sfs1.R dprep/R/ec.knnimp.R dprep/R/tchisq.R dprep/R/surveyplot.R dprep/R/score.R dprep/R/finco.R dprep/R/reliefcont.R dprep/R/sbs1.R dprep/R/cv10mlp.R dprep/R/distancia1.R dprep/R/star3d.R dprep/R/ce.mimp.R dprep/R/disc.ef.R dprep/R/reliefcat.R dprep/R/sffs.R dprep/R/combinations.R dprep/R/mo4.R dprep/R/vvalen.R dprep/R/rangenorm.R dprep/R/disc.ew.R dprep/R/starcoord.R dprep/R/lvf.R dprep/R/baysout.R dprep/R/unor.R dprep/R/signorm.R dprep/R/near3.R dprep/R/knngow.R dprep/R/top.R dprep/R/cv10log.R dprep/R/znorm.R dprep/R/near1.R dprep/R/mahaout.R dprep/R/ce.impute.R dprep/R/sfs.R dprep/R/cvnaiveBayesd.R dprep/R/outbox.R dprep/R/acugow.R dprep/R/moda.R dprep/R/relief.R dprep/R/mo3.R dprep/R/disc.mentr.R dprep/R/circledraw.R dprep/R/disc.1r.R dprep/R/imagmiss.R dprep/R/robout.R dprep/R/vvalen1.R dprep/R/row.matches.R dprep/R/midpoints1.R dprep/R/distancia.R dprep/R/lofactor.R dprep/R/cv10knn2.R dprep/R/cv10lda2.R dprep/R/discretevar.R
dprep/MD5
dprep/DESCRIPTION
dprep/man
dprep/man/ec.knnimp.Rd dprep/man/srbct.Rd dprep/man/knngow.Rd dprep/man/reliefcat.Rd dprep/man/robout.Rd dprep/man/colon.Rd dprep/man/chiMerge.Rd dprep/man/maxlof.Rd dprep/man/outbox.Rd dprep/man/baysout.Rd dprep/man/softmaxnorm.Rd dprep/man/score.Rd dprep/man/sbs1.Rd dprep/man/row.matches.Rd dprep/man/cv10lda2.Rd dprep/man/circledraw.Rd dprep/man/clean.Rd dprep/man/star3d.Rd dprep/man/hepatitis.Rd dprep/man/distancia.Rd dprep/man/ionosphere.Rd dprep/man/disc.1r.Rd dprep/man/reachability.Rd dprep/man/crx.Rd dprep/man/cv10knn2.Rd dprep/man/mahaout.Rd dprep/man/mmnorm.Rd dprep/man/sfs.Rd dprep/man/lofactor.Rd dprep/man/combinations.Rd dprep/man/cv10mlp.Rd dprep/man/distancia1.Rd dprep/man/disc.ew.Rd dprep/man/near1.Rd dprep/man/ce.impute.Rd dprep/man/mo3.Rd dprep/man/unor.Rd dprep/man/top.Rd dprep/man/disc.mentr.Rd dprep/man/discretevar.Rd dprep/man/redundancy.Rd dprep/man/starcoord.Rd dprep/man/autompg.Rd dprep/man/vvalen.Rd dprep/man/reliefcont.Rd dprep/man/cv10log.Rd dprep/man/vehicle.Rd dprep/man/znorm.Rd dprep/man/tchisq.Rd dprep/man/disc2.Rd dprep/man/finco.Rd dprep/man/sonar.Rd dprep/man/dist.to.knn.Rd dprep/man/ce.mimp.Rd dprep/man/disc.ef.Rd dprep/man/knneigh.vect.Rd dprep/man/census.Rd dprep/man/relief.Rd dprep/man/midpoints1.Rd dprep/man/inconsist.Rd dprep/man/landsat.Rd dprep/man/moda.Rd dprep/man/parallelplot.Rd dprep/man/dprep-package.Rd dprep/man/mo4.Rd dprep/man/signorm.Rd dprep/man/imagmiss.Rd dprep/man/eje1dis.Rd dprep/man/breastw.Rd dprep/man/cvnaiveBayesd.Rd dprep/man/sffs.Rd dprep/man/diabetes.Rd dprep/man/sfs1.Rd dprep/man/arboleje.Rd dprep/man/crossval.Rd dprep/man/vvalen1.Rd dprep/man/bupa.Rd dprep/man/radviz2d.Rd dprep/man/lvf.Rd dprep/man/decscale.Rd dprep/man/mardia.Rd dprep/man/acugow.Rd dprep/man/Shuttle.Rd dprep/man/rangenorm.Rd dprep/man/near3.Rd dprep/man/surveyplot.Rd dprep/man/nnmiss.Rd dprep/man/heartc.Rd dprep/man/cv10rpart2.Rd dprep/man/arboleje1.Rd

Questions? Problems? Suggestions? or email at ian@mutexlabs.com.

Please suggest features or report bugs with the GitHub issue tracker.

All documentation is copyright its authors; we didn't write any of that.