R/iron.R

Defines functions waffle_iron

Documented in waffle_iron

#' Waffle Iron
#'
#' Pour your data into a waffle iron to get ready to cook a waffle chart.
#'
#' Prepare raw data so it is fit to create a waffle visualisation. The type of data transformation that is required does not gel
#' well with ggplot2 underlying mechanism. The way around this is to provide a function that does the preperation outside of ggplot.
#' @param data A dataframe to feed into the waffle iron
#' @param mapping A mapping as produce by \code{\link{aes_d}}, \code{\link{aes_d_}} or a character vector of a column present in the dataset
#' @param rows The number of rows in the waffle
#' @param sample_size The proportion of rows to sample the dataset (between 0 and 1). Useful when the dataset is too large to plot correctly.
#' @param na.rm A boolean flag to automatically remove NAs. Removing NAs will sometimes cause a missing notch in your waffle.
#' @export
#' @examples
#' waffle_iron(mpg, aes_d(group = class))
#'
#' waffle_iron(mpg, aes_d(group = class), sample_size = 0.75)
waffle_iron <- function(
  data,
  mapping,
  rows = 8,
  sample_size = 1,
  na.rm = T
){
  # sample the data
  if(!(sample_size>0 & sample_size <=1)){
    stop("Please use a sample value between 0 and 1", call. = F)
  }
  sample_rows <- sample(x = 1:nrow(data), size = (nrow(data) * sample_size))
  if(length(sample_rows) < rows){
    min_sample_size <- round(rows / nrow(data), 2)
    stop(paste0("The sample size is too low for this dataset, it must be at least ",min_sample_size) , call. = F)
  }

  data <- data[sample_rows,]

  # if a character mapping supplied, generate an `aes_d_` mapping instead
  if(inherits(mapping, "character")){
    mapping <- aes_d_(group = mapping)
  }

  # create the waffle dataset
  data <- aes_d_rename(data, mapping, c("group"))
  data <- data[order(data$group),]
  grid_data <- expand.grid(y = 1:rows, x = seq_len((ceiling(nrow(data) / rows))))
  grid_data$group <- c(data$group, as.factor(rep(NA, nrow(grid_data) - length(data$group))))
  # deal with NAs
  if(na.rm == T){
    grid_data <- grid_data[!is.na(grid_data$group),]
  }
  grid_data
}
liamgilbey/ggwaffle documentation built on Sept. 3, 2022, 4:53 p.m.