library(framecleaner) library(dplyr)
Data imported from excel and csv in business situations can have messy characteristics and data formats. This package provides functions to tidy your data frame using the power of tidyselect
.
create sample data
tibble::tibble( date = c("20190101", "20190305", "20201012"), numeric_val = c(1, NA, 5), char_val = c("", " val ", "-") ) -> sample_table sample_table
Data occasionally has different ways to represent NA values. set_na
checks as default c("-", "", " ", "null")
but any values can be supplied to automatically be set to NA. This is helpful when you want to check the NA profile of a data frame using validata::diagnose
sample_table %>% make_na()
remove whitespace from the ends of character variables that may be otherwise undetectable by inspection.
sample_table %>% remove_whitespace()
automatically convert character columns that should be dates.
sample_table %>% set_date()
relocates an unorganized dataframe using heuristics such as putting character and date columns first, and organizing by alphabetical order.
sample_table %>% relocate_all()
Wrapper function to apply all cleaning operations to a data frame using sensible defaults.
sample_table %>% clean_frame()
use tidyselect to fill NAs with a single value
sample_table %>% fill_na()
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.