knitr::opts_chunk$set(echo = TRUE)
Boots
The analytic takes Defense Manpower Data Center(DMDC) data on troop numbers abroad for active duty military personnel and generates forecasts for the selected country or countries, provides accuracy statistics, a visual depiction of the forecasts including actual observed values, and projected troop numbers for the year or years of interest.
This analytic will be utilized by USTRANSCOM to better predict USTRANSCOM workload demands in coming years. The features which my analytic will perform when complete are to import the data, tidy the data in the format popularized by Wickham, group the data by country, branch, order the data by year, convert the data to a zoo/time series object, implement forecast functions, plot the forecasts, provide accuracy metrics, provide forecast values, and confidence intervals for years of interest. The typical end-users for whom this analytic is being developed are analysts at USTRANSCOM/JDPAC. Important knowledge/skills/abilities an end-user must have in order to utilize the analytic is a knowledge of forecasting techniques such as ARIMA, exponential smoothing, and possibly some dynamic regression models; however, the analytic will provide a basic description of what the model is and its key parameters and terms.
The analytic implements autoregressive integrated moving average (ARIMA) forecasting models, simple exponential smoothing models, Holt models, and forecasting accuracy metrics.
This analytic is built upon \textbf{R} packages: forecast, ggplot2 and tidyverse.
End-users will access the data analytic product via the internet. The analytic will be available via the github repository and potentially on the CRAN network.
All data are available from the unclassified, open-source database maintained by DMDC. There are no security concerns that need to be addressed.
There are no appearance/design constraints that the analytic must adhere to, but the goal is to provide a clean design with easy to use graphical user interface.
The following table gives several features which will be addressed in the analytic. Each row in the table represents a column. Information pertaining to the following points are also provided in each of the columns:
A rank of 1 is deemed most important with higher values being of lesser import.
| Feature | Description | Rank | Status | Value | Inputs/Outputs | Utilization | Sufficient Time? | Current or Future? | |-------------------|-----------------------------------------------------------------------------------------------------------------------------|------|-------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------|--------------------| | Download the Data | This feature downloads the data off of the DMDC website | 10 | Not Started | This makes the user not have to spend as much time having to download the data. | Inputs: PDF and/or Microsoft Excel Files Containing Troop ValuesOutputs: R Data Frame Containing Troop Data | Get new DMDC Data | Maybe | Future | | Tidy the Data | This feature tidys the data such that each variable is represented by a column and each observation is represented by a row | 7 | In Progress | This makes the data easy to work with since it is in the format popularized by Wickham | Inputs: R Data Frame in Non-Tidy FormatOutputs: R Data Frame in Tidy Format | Organizes the data in a convenient format | Yes | Current | | Group Data | This feature groups the data by country and military branch. | 6 | In Progress | This is valuable in that it allows for forecasts to be disaggregated for future use of forecasting in hopes that disaggregation will improve forecast accuracy. | Inputs: R Data Frame in Tidy FormatOutputs: R Data Frame grouped by country and branch. | Allows for disaggregated forecasting | Yes | Current | | Order Data | This feature orders the data in chronological order | 4 | In Progress | Ordering the data is necessary for forecasting the data. | Inputs: R Data FrameOutputs: R Data Frame | Necessary for the end product | Yes | Current | | Fill Missing Data | This feature fills the missing observations by interpolating between the nearest two points. | 8 | In Progress | Filling missing data will allow for forecasting functions to work properly and clearly explains how missing data is being handled. | Inputs: R Data FrameOutputs: R Data Frame | Necessary for the end product | Yes | Current | | Convert Data | This feature converts the ordered data into a time series object. | 5 | In Progress | Converting the data series into a Zoo object in R allows forecasting functions to operate. | Inputs: R Data FrameOutputs:Zoo Object | Necessary for the end product | Yes | Current | | Forecast | This feature takes the converted data and applies forecasting functions to it. | 1 | In Progress | Forecasting the data utilizing Rob Hyndman's forecast package allows for exponential smoothing, ARIMA, and several other forecasting functions to be implemented. | Inputs: Zoo ObjectOutputs: Forecast Object | Allows the end-user to generate their desired forecast models. | Yes | Current | | Plot Forecasts | This function takes the forecast and generates plots. | 3 | In Progress | Plotting the functions allows the end-user to have a visual depiction of how well the forecasting is predicting new observations. | Inputs: Forecast ObjectOutputs: Plot (graphical object) | Allows for user to visualize the forecasts | Yes | Current | | Assess Accuracy | This function assesses the accuracy of the forecasts. | 9 | In Progress | Assessing the accuracy of the forecasts indicates which forecasts are performing well and which are performing poorly. This will allow the user to select the best performing model. | Inputs: Forecast ObjectOutputs: List of forecast accuracy metrics | The output will enable the user to compare forecasts accuracies in order to determine which forecast is fitting the data best. This can then be compared to the parsimony of the model to determine which one is most efficient. | Yes | Current | | Provide Values | This function provides the actual values for the forecast for use in USTRANSCOM Models | 2 | Not Started | By providing the forecasted values, this analytic will enable USTRANSCOM to feed US troop data into its workload demand forecasts. | Inputs: Forecast ObjectOutputs: Vector | Providing values is necessary so that USTRANSCOM can incorporate forecasted values into its workload demand forecast models. | Yes | Current |
Add the following code to your website.
For more information on customizing the embed code, read Embedding Snippets.