knitr::opts_chunk$set( collapse = TRUE, comment = "#>" )
The quickeda package is designed to take in a dataframe with mixed data types and produce an initial numerical exploratory data analysis (EDA). There are 7 functions that will produce individual sections of an EDA, and one function that will run all of them at once for quick analysis. Functions requiring numerical data types will automatically select the numerical columns from a dataframe, no initial slicing is necessary.
The functions are as follows:
The "overdose" data is a cleaned data set of county level data of drug overdose mortality rates and social, economic, and demographic indicators. Raw data used to assemble this data were acquired from "The County Health Rankings" dataset, a collaboration between the Robert Wood Johnson Foundation and the University of Wisconsin Population Health Institute.This database was built predominantly from the following: The Behavioral Risk Factor Surveillance System (BRFSS), the National Center for Health Statistics, and the CDC WONDER mortality data.
The database for the rankings is available for downloading as an Excel spreadsheet at: http://www.countyhealthrankings.org/explore-health-rankings/rankings-data-documentation
For a full account of how data for each variable was attained see: http://www.countyhealthrankings.org/sites/default/files/resources/2017_Measures_DataSources Years.pdf
Variables are all reported as percentages, rates per 100,000, ratios, or similar population adjusted measures. After data cleaning, they are all of numerical type float or integer. There are 74 variables in the dataset, however only 10 are selected here to showcase the package.
The following command will download the quickeda package from Github:
devtools::install_github("ValeryLynn/Quick_EDA")
library(quickeda)
#Get data overdose <- read.csv(file="overdose.csv", header=TRUE, sep=",") df_od <- overdose[,1:10] head(df_od)
data_describe(df_od)
data_types(df_od)
data_stats(df_od)
df = make_df(df_od) head(df)
dens_plots(df_od)
qq_plots(df_od)
correlations(df_od)
quick_eda(df_od)
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.