help.md
In rplanes: Plausibility Analysis of Epidemiological Signals

How to use the `rplanes` Explorer

Introduction
Example Data
Analysis Steps
Inputs
Outputs
About

The rplanes Explorer is written as a Shiny web application to translate the R package API to point-and-click features. The app includes functionality to intuitively run plausibility analysis and view output. The processing depends on a combination of observed data uploaded and used as a "seed" for baseline characteristics along with designated data to evaluate. As with the rplanes R package, the app can handle varying geographic and temporal resolutions (i.e., daily, weekly, or monthly reporting).

To demonstrate usage, the app features an example data set. Users can select the "Example" option to load pre-populated forecast data for plausibility analysis. This data set contains 4 week-ahead forecasts for incident flu hospitalizations in select United States locations. The forecasts begin with the week ending 2022-11-05 and extend through the week of 2022-11-26. The baseline data used to generate the seed is loaded from HHS Protect flu hospitalizations that have been aggregated from daily to weekly reports at the state and national level. All of the data preparation is done internally. Users simply click "Analyze" to explore the kinds of outputs that rplanes generates.

The application allows users to run plausibility analysis with several steps:

Select the type of signal to be evaluated
Upload data to use for the plausibility analysis seed
Upload data containing the signal to be evaluated (or for an observed signal identify the number of points to evaluate)
Enter the resolution, outcome, and forecast horizon (if applicable)
Optionally modify default parameters used for analysis
Click "Analyze"

The steps above require that the user specify several inputs, each of which are described in detail below.

The rplanes package implements a plausibility analysis algorithm that can work on either observed or forecasted data signals. Users begin by entering the type of signal as "Forecast" or "Observed".

It is important to note that depending on the type of signal to be evaluated, some components may not apply.

The observed data uploaded is primarily used to seed the background characteristics used in plausibility analysis. The app internally finds the appropriate date for a cutoff to identify baseline features of the reported data. However, if a forecast is being evaluated then the uploaded data cannot have any gaps between the last report and the first horizon forecasted.

Data must be uploaded to the app in .csv format. At minimum it must include columns for location (geographic unit such as FIPS code) and date (date of reported value in yyyy-mm-dd format). Note that these columns must be named as "location" and "date" respectively. The observed data must also include a column that contains the outcome (e.g., case count). The name of this column is arbitrary so long as it matches the outcome name provided in the app input for "Outcome" (see below). The uploaded .csv file may contain other columns, however these will not be used in plausibility analysis.

The choice of the type of signal to evaluate will determine how the user specifies data to be evaluated.

If a forecast signal is selected, then the user must upload a .csv file containing forecast data. Forecasts must be a prepared in a "quantile" format. The format must be specified as "Legacy"^1 or "Hubverse"^2.

For "Legacy", the file must at minimum have following columns:

forecast_date: The date on which the forecast was generated (yyyy-mm-dd format)
location: Location code for the given forecast
target: Name of the forecast structured as "N wk ahead {forecasted outcome}" (e.g., "4 wk ahead inc flu hospitalizations")
target_end_date: The date corresponding to the forecasted target (yyyy-mm-dd format)
type: The type of forecast (either "point" or "quantile")
quantile: The quantile for the forecasted value; if the type is "point" then quantile is NA
value: The forecasted value for the given quantile, location, and target

For "Hubverse", the file must at minimum have following columns:

reference_date: The date for the week on which the forecast was generated (yyyy-mm-dd format)
location: Location code for the given forecast
horizon: The number of time points ahead for the given forecast
target: Name of the forecast (e.g., "inc flu hospitalizations")
target_end_date: The date corresponding to the forecasted target (yyyy-mm-dd format)
output_type: The type of forecast (e.g., "quantile")
output_type_id: If the output type is set to "quantile" then this will contain quantile for the forecasted value
value: The forecasted value for the given quantile, location, and target

If an observed signal is selected, then the user can select the number of most recent observations to evaluate. The number of values will determine the cutoff date to identify the baseline characteristics in the original uploaded observed data. In other words, there is no need to upload separate observed data to be evaluated since the initial upload will contain all data for seed and evaluation.

The app can accommodate data reported or forecasted at daily, weekly, or monthly cadence. The user selects the appropriate resolution to match the observed data and the data to be evaluated.

The user must enter the name of the outcome. For observed data, the outcome entry should match the name of the column that contains the signal in the uploaded .csv file.

For forecast evaluations, the user will enter the horizon as a number. The app defaults to 4 for this input.

Users can optionally modify the following parameters:

Prediction Interval: The prediction interval defines the space between upper and lower bounds and internally maps to the appropriate quantiles (centered on the median) in the forecast evaluated.
PLANES Components: By default the app will run all components available for the given signal. As noted elsewhere in the rplanes documentation, not all components are available for evaluating observed signals. The user can optionally select specific components to use in the analysis.
Weights: Unless modified, the app will deliver an overall score based on equal weights for all components. This input allows users to modify that behavior. If custom weighting scheme is preferred, then the app will display numeric inputs for each component selected.
Significance (Trend): The significance level to identify change points via the trend component. Default is 0.1.
Tolerance (Repeat): The number of tolerated repeats before flagging via the repeat component. Default is defined by the number of repeats observed for the given location in the seed.
Prepend Values (Repeat): The number of values to prepend to the evaluated signal from the seed during analysis with the repeat component. The default behavior is to use the maximum number of repeats observed for the given location in the seed.

The app includes output to view plausibility scoring results and the raw data used for analysis.

The plausibility scoring results are presented in "Overall" and "Individual Locations and Components" sections. The overall scores (i.e., all combinations of locations and components analyzed) are displayed in a tile plot and as a table with all scores. Users can download or copy the table contents. Additionally, for each location the user can view plots of individual components, each which shows the features that did or did not raise flags in scoring.

The raw data is also displayed for the users in two tables. The first shows the observed data used to seed background characteristics, and the second is a table with data to evaluated.

Primary developers of the tool are VP Nagraj, Desiree Williams, and Amy Benefield.

Any scripts or data that you put into this service are public.

rplanes documentation built on Sept. 11, 2024, 9:01 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

rplanes
Plausibility Analysis of Epidemiological Signals

inst/app/help.md
In rplanes: Plausibility Analysis of Epidemiological Signals

How to use the `rplanes` Explorer

Introduction

Example Data

Analysis Steps

Inputs

Type of Signal Evaluated

Observed Data

Data to be Evaluated

Resolution

Outcome

Forecast Horizon

Modify Defaults

Outputs

Scoring

Raw Data

About

Try the rplanes package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

rplanes Plausibility Analysis of Epidemiological Signals

inst/app/help.md In rplanes: Plausibility Analysis of Epidemiological Signals

How to use the rplanes Explorer

Introduction

Example Data

Analysis Steps

Inputs

Type of Signal Evaluated

Observed Data

Data to be Evaluated

Resolution

Outcome

Forecast Horizon

Modify Defaults

Outputs

Scoring

Raw Data

About

Try the rplanes package in your browser

R Package Documentation

Browse R Packages

We want your feedback!

rplanes
Plausibility Analysis of Epidemiological Signals

inst/app/help.md
In rplanes: Plausibility Analysis of Epidemiological Signals

How to use the `rplanes` Explorer