na_summary | R Documentation |
An all-in-one missingness report
na_summary(
  df,
  grouping_cols = NULL,
  sort_by = NULL,
  descending = FALSE,
  exclude_cols = NULL,
  pattern = NULL,
  pattern_type = NULL,
  regex_kind = "exclusion",
  round_to = NULL,
  reset_rownames = FALSE
)
df |
A valid R 'object' for which the percentage of missing values is required. |
grouping_cols |
A character vector of column names by which to group the data. Defaults to NULL. |
sort_by |
Determines whether the results are sorted by counts or by percentages. The examples pass an output column name such as "percent_missing", "percent_complete", or "variable". Defaults to NULL. |
descending |
Logical. Should the results be sorted in decreasing order, i.e. largest to smallest? Defaults to FALSE. |
exclude_cols |
A character vector indicating columns to exclude when returning results. |
pattern |
A pattern used to include or exclude columns, interpreted according to pattern_type. Defaults to NULL. |
pattern_type |
A regular expression type. One of "starts_with", "contains", or "regex". Defaults to NULL. Only use for selective inclusion. |
regex_kind |
One of "inclusion" or "exclusion". Defaults to "exclusion", which excludes the columns matching the pattern. |
round_to |
Number of decimal places to round to. Defaults to the user's digits option. |
reset_rownames |
Logical. Should the row names be reset in the output? Defaults to FALSE.
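The interplay of pattern, pattern_type and regex_kind can be sketched in base R. This is only an illustration of the idea, not the package's implementation: select_cols is a hypothetical helper, and case-sensitive matching is an assumption of the sketch.

```r
# Hypothetical helper (not part of the package): keep or drop columns whose
# names match a pattern. Matching here is case-sensitive by assumption.
select_cols <- function(df, pattern,
                        pattern_type = c("starts_with", "contains", "regex"),
                        regex_kind = c("exclusion", "inclusion")) {
  pattern_type <- match.arg(pattern_type)
  regex_kind <- match.arg(regex_kind)
  nms <- names(df)
  hit <- switch(pattern_type,
    # anchor every alternative in the pattern at the start of the name
    starts_with = grepl(paste0("^(", pattern, ")"), nms),
    contains = grepl(pattern, nms),
    regex = grepl(pattern, nms, perl = TRUE)
  )
  # inclusion keeps the matches, exclusion keeps everything else
  keep <- if (regex_kind == "inclusion") hit else !hit
  df[, keep, drop = FALSE]
}

kept <- names(select_cols(mtcars, "mpg|disp|wt", "contains", "inclusion"))
dropped <- names(select_cols(airquality, "Oz|Sol", "starts_with", "exclusion"))
```

With "inclusion" only the matching columns survive; with "exclusion" the matching columns are removed and the rest are summarised.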
na_summary(airquality)

# grouping
test2 <- data.frame(ID = c("A", "A", "B", "A", "B"),
                    Vals = c(rep(NA, 4), "No"),
                    ID2 = c("E", "E", "D", "E", "D"))
df <- data.frame(A = 1:5, B = c(NA, NA, 25, 24, 53), C = c(NA, 1, 2, 3, 4))
na_summary(test2, grouping_cols = c("ID", "ID2"))

# sort summary
na_summary(airquality, sort_by = "percent_missing", descending = TRUE)
na_summary(airquality, sort_by = "percent_complete")

# Include only via a regular expression
na_summary(mtcars, pattern_type = "contains", pattern = "mpg|disp|wt",
           regex_kind = "inclusion")
na_summary(airquality, pattern_type = "starts_with", pattern = "ozone",
           regex_kind = "inclusion")

# exclusion via a regex
na_summary(airquality, pattern_type = "starts_with", pattern = "oz|Sol",
           regex_kind = "exclusion")

# reset rownames when sorting by variable
na_summary(df, sort_by = "variable", descending = TRUE, reset_rownames = TRUE)
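For readers who want to see what a per-column missingness report amounts to, here is a minimal base-R sketch. It is not the package's code: na_counts is a hypothetical helper, and the output column names are assumptions chosen to mirror the sort_by values used in the examples.

```r
# Hypothetical helper (not part of the package): count and percentage of
# missing and complete values per column. Column names are assumed.
na_counts <- function(df, round_to = 2) {
  n <- nrow(df)
  missing <- vapply(df, function(col) sum(is.na(col)), integer(1))
  out <- data.frame(
    variable = names(df),
    missing = unname(missing),
    complete = unname(n - missing),
    stringsAsFactors = FALSE
  )
  out$percent_complete <- round(100 * out$complete / n, round_to)
  out$percent_missing <- round(100 * out$missing / n, round_to)
  out
}

df <- data.frame(A = 1:5, B = c(NA, NA, 25, 24, 53), C = c(NA, 1, 2, 3, 4))
res <- na_counts(df)

# sort by a percentage column, largest first
res_sorted <- res[order(res$percent_missing, decreasing = TRUE), ]
```

Sorting is left to order() on the chosen column, which is one plausible reading of what sort_by and descending control.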