
DataMan


An R package for data cleaning, preliminary data analysis, and model assessment with visualisation.

Data Cleaning

Data cleaning has 2 functions at the moment:

Preliminary Data Analysis

library(MASS)
data("Insurance")
dataPlot(Insurance$Age,Insurance$Claims,exposure = Insurance$Holders,
         by=Insurance$District,xname="Age",byname="District")

Model Assessment

Pred = Insurance$Claims + runif(nrow(Insurance),min=0,max=10)
resiPlot(Insurance$Claims,Pred)

interPlot(Insurance$Age,Insurance$District,Insurance$Claims,xname="Age",yname="District")

library(gbm)        # provides gbm(), which is called below
library(networkD3)  # provides sankeyNetwork()
data(iris)
iris.mod <- gbm(Species ~ ., distribution="multinomial", data=iris, n.trees=2000,
                shrinkage=0.01, cv.folds=5, verbose=FALSE, n.cores=1)
tree_data <- tree2data(iris.mod,1)
sankeyNetwork(tree_data[[1]],tree_data[[2]],Source="src",Target="tar",Value="value",NodeID="name")

Getting Started

You can install DataMan from GitHub as follows:

devtools::install_github('SixiangHu/DataMan')
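
The examples in this README also rely on a few other packages (devtools for the install itself, plus MASS, gbm and networkD3). As a minimal sketch, not part of DataMan, the check below reports which of them are missing before you start. The helper name `missing_packages` is hypothetical and introduced here only for illustration, and the package list is an assumption based on the calls shown in this README.

```r
# Hypothetical helper (not part of DataMan): return the packages in `pkgs`
# that are not installed, using base R's requireNamespace().
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

# Assumed list, taken from the library() and gbm() calls in this README.
needed <- c("devtools", "MASS", "gbm", "networkD3")
to_install <- missing_packages(needed)

if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)  # uncomment to install them from CRAN
}
```

Once nothing is reported as missing, the `install_github` call above should be able to run, and the examples can then be tried after `library(DataMan)`.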

License

This package is free and open source software, licensed under GPL 2 or later.



SixiangHu/DataMan documentation built on May 9, 2019, 1:48 p.m.