inst/app/help.md

How to use the rplanes Explorer

Introduction

The rplanes Explorer is written as a Shiny web application to translate the R package API to point-and-click features. The app includes functionality to intuitively run plausibility analysis and view output. The processing depends on a combination of observed data uploaded and used as a "seed" for baseline characteristics along with designated data to evaluate. As with the rplanes R package, the app can handle varying geographic and temporal resolutions (i.e., daily, weekly, or monthly reporting).

Example Data

To demonstrate usage, the app features an example data set. Users can select the "Example" option to load pre-populated forecast data for plausibility analysis. This data set contains 4 week-ahead forecasts for incident flu hospitalizations in select United States locations. The forecasts begin with the week ending 2022-11-05 and extend through the week of 2022-11-26. The baseline data used to generate the seed is loaded from HHS Protect flu hospitalizations that have been aggregated from daily to weekly reports at the state and national level. All of the data preparation is done internally. Users simply click "Analyze" to explore the kinds of outputs that rplanes generates.

Analysis Steps

The application allows users to run plausibility analysis with several steps:

  1. Select the type of signal to be evaluated
  2. Upload data to use for the plausibility analysis seed
  3. Upload data containing the signal to be evaluated (or for an observed signal identify the number of points to evaluate)
  4. Enter the resolution, outcome, and forecast horizon (if applicable)
  5. Optionally modify default parameters used for analysis
  6. Click "Analyze"

Inputs

The steps above require that the user specify several inputs, each of which are described in detail below.

Type of Signal Evaluated

The rplanes package implements a plausibility analysis algorithm that can work on either observed or forecasted data signals. Users begin by entering the type of signal as "Forecast" or "Observed".

It is important to note that depending on the type of signal to be evaluated, some components may not apply.

Observed Data

The observed data uploaded is primarily used to seed the background characteristics used in plausibility analysis. The app internally finds the appropriate date for a cutoff to identify baseline features of the reported data. However, if a forecast is being evaluated then the uploaded data cannot have any gaps between the last report and the first horizon forecasted.

Data must be uploaded to the app in .csv format. At minimum it must include columns for location (geographic unit such as FIPS code) and date (date of reported value in yyyy-mm-dd format). Note that these columns must be named as "location" and "date" respectively. The observed data must also include a column that contains the outcome (e.g., case count). The name of this column is arbitrary so long as it matches the outcome name provided in the app input for "Outcome" (see below). The uploaded .csv file may contain other columns, however these will not be used in plausibility analysis.

Data to be Evaluated

The choice of the type of signal to evaluate will determine how the user specifies data to be evaluated.

If a forecast signal is selected, then the user must upload a .csv file containing forecast data. Forecasts must be a prepared in a "quantile" format. The format must be specified as "Legacy"^1 or "Hubverse"^2.

For "Legacy", the file must at minimum have following columns:

For "Hubverse", the file must at minimum have following columns:

If an observed signal is selected, then the user can select the number of most recent observations to evaluate. The number of values will determine the cutoff date to identify the baseline characteristics in the original uploaded observed data. In other words, there is no need to upload separate observed data to be evaluated since the initial upload will contain all data for seed and evaluation.

Resolution

The app can accommodate data reported or forecasted at daily, weekly, or monthly cadence. The user selects the appropriate resolution to match the observed data and the data to be evaluated.

Outcome

The user must enter the name of the outcome. For observed data, the outcome entry should match the name of the column that contains the signal in the uploaded .csv file.

Forecast Horizon

For forecast evaluations, the user will enter the horizon as a number. The app defaults to 4 for this input.

Modify Defaults

Users can optionally modify the following parameters:

Outputs

The app includes output to view plausibility scoring results and the raw data used for analysis.

Scoring

The plausibility scoring results are presented in "Overall" and "Individual Locations and Components" sections. The overall scores (i.e., all combinations of locations and components analyzed) are displayed in a tile plot and as a table with all scores. Users can download or copy the table contents. Additionally, for each location the user can view plots of individual components, each which shows the features that did or did not raise flags in scoring.

Raw Data

The raw data is also displayed for the users in two tables. The first shows the observed data used to seed background characteristics, and the second is a table with data to evaluated.

About

The app is developed and maintained as part of the rplanes package, which is licensed under MIT License and Copyright (c) 2023 Signature Science LLC.

Primary developers of the tool are VP Nagraj, Desiree Williams, and Amy Benefield.



Try the rplanes package in your browser

Any scripts or data that you put into this service are public.

rplanes documentation built on Sept. 11, 2024, 9:01 p.m.