mlr3forecasting: Forecasting for 'mlr3'

 Feature Name: `Forecasting Reduction Techniques`
 Start Date: 2019-11-16
 Target Date:
 ```

## Summary
[summary]: #summary

 - Lagged values of a timeseries can be used as features for a machine learning algorithm.
 - Two major apporaches: rolling window regression and expanding window regression

## Motivation
[motivation]: #motivation

Most machine learning algorithms are not designed for temporal data. To still use these methods, we have to put the data in the right shape for our algorithm.


## Guide-level explanation
[guide-level-explanation]: #guide-level-explanation



Basically there are two major approaches to feed timeseries data into a regression method: the "rolling window"-approach and the "expanding window"-approach. Both methods use the last k values as features to predict the next value(s) in the timeseries.

Example for k = 2: Predict X(t) given X(t-1) and X(t-2)

X(t) X(t-1) X(t-2) 1 1 NA NA 2 2 1 NA 3 3 2 1 4 4 3 2 5 5 4 3 6 6 5 4 7 7 6 5 8 8 7 6 9 9 8 7 10 10 9 8

``` The difference between these two approaches is the amount of data that is used for making the prediction. While the rolling window (with window size n) approach uses the last n values for training and then predicts the next values, the expanding window approach trains the model before every prediction on all the available data.

Example: Suppose we are at timestep 50 of a timeseries

Rolling window approach:

choose a window-size (for simplicity we choose 50)
train the model on observation 1 to 50
predict the value for timestep 51
for the next prediction train the model on observation 2 to 51
predict the value for timestep 52
repeat according to this scheme

Expanding window approach:

use all the available data to train the model (timesteps 1 to 50)
predict the value for timestep 51
use all the available data to train the model (timesteps 1 to 51)
predict the value for timestep 52
repeat according to this scheme

For multi-step-forecasts there are several options to tackle the problem.

1. Direct forecasting

For the direct forecasting method you build a seperate model for each timestep you want to predict.

prediction(t+1) = model1(x(t), x(t-1),..., x(t-n))
prediction(t+2) = model2(x(t), x(t-1),..., x(t-n))
etc.

An alternative to this approach is building a multi-output regression model that can handle the forecasts directly without the need for additional models.

2. Recursive forecasting

For the recursive forecasting method you build one model and predict timestep t+1. To forecast the next value (t+2) you feed this prediction back into the model.

prediction(t+1) = model1(x(t), x(t-1),..., x(t-n))
prediction(t+2) = model1(prediction(t+1), x(t), x(t-1),..., x(t-n-1))
etc.

3. Hybrid forecasting

It would also be possible to combine the two strategies: build a seperate model for each timestep, but feed the prediction of the previous timestep as input.

prediction(t+1) = model1(x(t), x(t-1),..., x(t-n))
prediction(t+2) = model2(prediction(t+1), x(t), x(t-1),..., x(t-n-1))
etc.

The main drawback of this two approaches are the hurt iid assumptions that are usually required for machine learning algorithms. Another point is how we want to handle multi-step ahead forecasts, as there are several options to do that.

mlr-org/mlr3forecasting documentation built on June 29, 2023, 11:57 p.m.

rdrr.io home R language documentation Run R code online

CRAN packages Bioconductor packages R-Forge packages GitHub packages

Note that we can't provide technical support on individual packages. You should contact the package authors for that.

Tweet to @rdrrHQ

GitHub issue tracker

ian@mutexlabs.com