R/rmw_predict_nested_sets.R
In rmweather: Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

Documented in rmw_predict_nested_sets

#' Function to make predictions from a random forest models using a nested 
#' tibble.
#' 
#' @param df_nest Nested tibble created by \code{\link{rmw_model_nested_sets}}.
#' 
#' @param se Should the standard error of the predictions be calculated? 
#' 
#' @param n_cores Number of CPU cores to use for the model calculations. 
#' 
#' @param keep_vectors Should the prediction vectors be kept in the return? This
#' is usually not needed because these vectors have been added to the 
#' \code{observations} variable.
#' 
#' @param model_errors Should model error statistics between the observed and 
#' predicted values be calculated and returned? 
#' 
#' @param as_long For when \code{model_errors} is \code{TRUE}, should the model 
#' error unit be returned in "long format"? 
#' 
#' @param partial Should the model's partial dependencies also be calculated? 
#' This will increase the execution time of the function. 
#' 
#' @param verbose Should the function give messages? 
#' 
#' @param progress Should a progress bar be displayed?
#' 
#' @seealso \code{\link{rmw_nest_for_modelling}}, 
#' \code{\link{rmw_model_nested_sets}}, \code{\link{rmw_predict}}, 
#' \code{\link{rmw_calculate_model_errors}}, 
#' \code{\link{rmw_partial_dependencies}}
#' 
#' @author Stuart K. Grange
#' 
#' @return Nested tibble.
#' 
#' @export
rmw_predict_nested_sets <- function(df_nest, se = FALSE, n_cores = NULL, 
                                    keep_vectors = FALSE, model_errors = FALSE, 
                                    as_long = TRUE, partial = FALSE, 
                                    verbose = FALSE, progress = FALSE) {
  
  # Check input
  if (!all(c("observations", "model") %in% names(df_nest))) {
    cli::cli_abort("Input requires `observations` and `model` variables.")
  }
  
  # Make the predictions
  if (verbose) {
    cli::cli_alert_info(
      "{str_date_formatted()}: Predicting with `{nrow(df_nest)}` model{?s}..."
    )
  }
  
  # Use the vectors directly and put into a tibble
  df_predictions <- purrr::map2(
    df_nest$model, 
    df_nest$observations,
    ~rmw_predict(
      model = .x, df = .y, se = se, n_cores = n_cores, verbose = FALSE
    ),
    .progress = progress
  ) %>% 
    tibble(predictions = .)
  
  # Bind the predictions and add the predictions to the observations variable
  if (se) {
    df_nest <- df_nest %>% 
      dplyr::bind_cols(df_predictions) %>% 
      mutate(
        observations = list(
          mutate(
            observations, 
            value_predict = predictions$predictions,
            value_se = predictions$se,
            value_delta = value - value_predict
          )
        )
      )
  } else {
    df_nest <- df_nest %>% 
      dplyr::bind_cols(df_predictions) %>% 
      mutate(
        observations = list(
          mutate(
            observations, 
            value_predict = predictions,
            value_delta = value - value_predict
          )
        )
      )
  }
  
  # Drop prediction vectors, not usually needed after being put into observations
  if (!keep_vectors) {
    df_nest <- select(df_nest, -predictions)
  }
  
  # Calculate the errors between the observed and predicted values in the desired
  # format
  if (model_errors) {
    if (verbose) {
      cli::cli_alert_info(
        "{str_date_formatted()}: Calculating model error statistics..."
      )
    }
    df_nest <- df_nest %>% 
      mutate(
        model_errors = list(
          rmw_calculate_model_errors(
            observations, testing_only = TRUE, as_long = as_long
          )
        )
      )
  }
  
  # Calculate partial dependencies if desired, standard ifelse needed for NULL
  # logic
  if (partial) {
    df_nest <- df_nest %>% 
      mutate(
        partial = list(
          rmw_partial_dependencies(
            model, 
            observations, 
            variable = NA,
            training_only = TRUE,
            n_cores = ifelse(is.null(n_cores), NA, n_cores),
            verbose = verbose
          )
        )
      )
  }
  
  return(df_nest)
  
}

Any scripts or data that you put into this service are public.

rmweather documentation built on April 12, 2025, 1:39 a.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

rmweather
Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

R/rmw_predict_nested_sets.R
In rmweather: Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

Defines functions rmw_predict_nested_sets

Documented in rmw_predict_nested_sets

Try the rmweather package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

rmweather Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

R/rmw_predict_nested_sets.R In rmweather: Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

Defines functions rmw_predict_nested_sets

Documented in rmw_predict_nested_sets

Try the rmweather package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

rmweather
Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data

R/rmw_predict_nested_sets.R
In rmweather: Tools to Conduct Meteorological Normalisation and Counterfactual Modelling for Air Quality Data