autodb: Automatic Database Normalisation for Data Frames

Automatic normalisation of a data frame to third normal form, with the intention of easing the process of data cleaning. (Usage to design your actual database for you is not advised.) Originally inspired by the 'AutoNormalize' library for 'Python' by 'Alteryx' (<https://github.com/alteryx/autonormalize>), with various changes and improvements. Automatic discovery of functional or approximate dependencies, normalisation based on those, and plotting of the resulting "database" via 'Graphviz', with options to exclude some attributes at discovery time, or remove discovered dependencies at normalisation time.

Package details

AuthorMark Webster [aut, cre] (<https://orcid.org/0000-0002-3351-0686>)
MaintainerMark Webster <markwebster204@yahoo.co.uk>
LicenseBSD_3_clause + file LICENSE
Version2.3.1
URL https://charnelmouse.github.io/autodb/ https://github.com/CharnelMouse/autodb
Package repositoryView on CRAN
Installation Install the latest version of this package by entering the following in R:
install.packages("autodb")

Try the autodb package in your browser

Any scripts or data that you put into this service are public.

autodb documentation built on April 4, 2025, 5:12 a.m.