This dataset is a random subset of a high-frequency trading dataset used to assess the performance of recurrent neural networks (RNNs) for prediction (Dixon, 2017).
A dataset of 30000 observations, each a sequence of length 10, with a single sequence per row.
The y data holds the class labels, coded as -1, 0 and 1.
The x data holds the time series sequences (numeric).
The feature represents the instantaneous liquidity imbalance, measured as the best bid to ask ratio. The labels represent the next-event mid-price movement: Y=1 is an up-tick, Y=-1 is a down-tick and Y=0 represents no movement. The sequence length is set to 10. In this package, the class 1 and -1 observations are randomly selected to yield 1200 non-zero observations, while class 0 has 28800 observations. Observations are ordered chronologically.
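To make the description above concrete, the following is a minimal sketch of how sequences of this kind could be assembled from limit order book events. It is not the code used to build the dataset. It assumes the imbalance feature is the ratio of the size at the best bid to the size at the best ask, and the names `build_sequences`, `bid_sizes`, `ask_sizes` and `mid_prices`, as well as all values, are hypothetical.

```python
def sign(x):
    """Return 1 for an up-tick, -1 for a down-tick and 0 for no movement."""
    return (x > 0) - (x < 0)


def build_sequences(bid_sizes, ask_sizes, mid_prices, seq_len=10):
    """Build (X, y) from per-event order book data.

    Assumption: the feature is best-bid size divided by best-ask size.
    Each row of X holds seq_len consecutive feature values; the matching
    label is the sign of the mid-price change at the next event.
    """
    imbalance = [b / a for b, a in zip(bid_sizes, ask_sizes)]
    X, y = [], []
    # The window ends at event t; the label compares event t + 1 with event t.
    for t in range(seq_len - 1, len(mid_prices) - 1):
        X.append(imbalance[t - seq_len + 1 : t + 1])
        y.append(sign(mid_prices[t + 1] - mid_prices[t]))
    return X, y


# Synthetic example with 12 events (values are made up for illustration).
bid_sizes = [5, 8, 6, 9, 4, 7, 10, 3, 6, 8, 12, 5]
ask_sizes = [10, 4, 6, 3, 8, 7, 5, 6, 12, 4, 6, 10]
mid_prices = [100.0] * 10 + [100.5, 100.5]

X, y = build_sequences(bid_sizes, ask_sizes, mid_prices)

# Share of non-zero labels, using the class counts stated in this page.
minority_share = 1200 / (1200 + 28800)
```

With 12 events and a sequence length of 10, the sketch yields two rows. The class counts stated above imply that only 4% of the observations carry a non-zero label, so the dataset is heavily imbalanced.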
hft: the dataset HFT
Matthew Dixon (2017). Sequence Classification of the Limit Order Book using Recurrent Neural Networks. arXiv:1707.05642.