inomaly: my first package (On developing..)

Solution for big data

inomaly is an R package which contains:

R function for anomaly detection, filling in loss values of a variable from one data frame with the values from another variable, checking outliers then replacing them with other value or cluster value.




FillIn is function for filling missing values, outliersZ is function for checking outlier.

Create data set with missing values as follow.

naDF <- data.frame(a = sample(c(1,2), 100, rep=TRUE), 
                   b = sample(c(3,4), 100, rep=TRUE), 
                 fNA = sample(c(100, 200, 300, 400, NA), 100, rep=TRUE))

Create full dataset

fillDF <- data.frame(a = c(1,2,1,2), 
                     b = c(3,3,4,4),
                 fFull = c(100, 200, 300, 400))

Fill in missing f's from naDF with values from fillDF

FilledInData <- FillIn(naDF, fillDF, Var1 = "fNA", Var2 = "fFull", KeyVar = c("a", "b"))
df$column1<-outliersZ(df$column1, zCutOff = 1.96, replace = NA, values = FALSE, digits = 4)

irwannafly/inomaly documentation built on June 7, 2019, 2:37 p.m.