nietodaniel/LargeDataExplorer: Powerful package to clean and reformat very large datasets after classifying their variables by their usefulness for machine learning

This package identifies the variables that are irrelevant for machine learning and excludes them, while keeping and transforming the useful ones. It can also explore a dataset, categorizing each variable as numeric, categorical (text), categorical (number), boolean, date, primary key, key, NA-variable, Univalue, plain text, or a variable with repeated information. It also calculates basic statistics for each variable.
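To make the idea of categorizing variables concrete, here is a minimal base R sketch. It does not use the package's API: `classify_column` and the sample data frame are hypothetical names introduced only for illustration, and the rules are a deliberately simplified assumption about how a few of the categories listed above might be told apart.

```r
# Hypothetical helper, NOT part of LargeDataExplorer: assign one label to a column.
classify_column <- function(x) {
  values <- x[!is.na(x)]                  # observed (non-missing) values
  n_unique <- length(unique(values))      # number of distinct observed values

  if (length(values) == 0) return("NA-variable")   # nothing but NA
  if (n_unique == 1) return("Univalue")            # a single constant value
  if (is.logical(x)) return("boolean")
  if (inherits(x, "Date")) return("date")
  # Assumption: a non-decimal column with no NA and no repeats acts as a primary key.
  if (n_unique == length(x) && !is.double(x)) return("primary key")
  if (is.numeric(x)) return("numeric")
  "categorical (text)"
}

# Small illustrative dataset (made up for this example).
df <- data.frame(
  id       = 1:4,
  height   = c(1.70, 1.82, 1.70, 1.65),
  city     = c("Lima", "Quito", "Lima", "Lima"),
  active   = c(TRUE, FALSE, TRUE, TRUE),
  empty    = c(NA, NA, NA, NA),
  constant = c("x", "x", "x", "x"),
  stringsAsFactors = FALSE
)

# One label per column, returned as a named character vector.
labels <- vapply(df, classify_column, character(1))
print(labels)
```

In a sketch like this, columns labelled NA-variable or Univalue would be natural candidates for exclusion, since they carry no information a model could learn from. The package itself may apply different or more detailed rules.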

Package details

Maintainer
License: GPL-3 + file LICENSE
Version: 0.0.0.9000
URL: https://github.com/nietodaniel/LargeDataExplorer

Installation

Install the latest version of this package by entering the following in R:
install.packages("remotes")
remotes::install_github("nietodaniel/LargeDataExplorer")
nietodaniel/LargeDataExplorer documentation built on Sept. 20, 2020, 7:57 p.m.