Benchmarks for Out-of-Distribution Generalization in Time Series
View the Project on GitHub jc-audet/WOODS
View the Documentation on RtD woods.readthedocs.io
Motivation: Seasonality is the property of time series where recurring characteristics appear every cycle of a fixed period, e.g., weekly. A common practice in the forecasting field is to provide models with additional information, e.g., day of week in order to allow models to leverage seasonality for better predictions. However, holidays is a seasonality of time series that is very sparse which models often fail to capture. We study the performance of models on sparse seasonality with the AusElec dataset.
Problem: We consider the electricity consumption forecasting task. The dataset has 13 time domains, where each domain contains data from different months and holidays. The goal is to perform well on all seasonalities.
Downloads: The preprocessed data is available through Huggingface monash_tsf dataset and through Zenodo. It can also be directly downloaded through the WOODS repository by running the following command:
python -m woods.scripts.download_datasets AusElec --data_path /path/to/data
[1] Hyndman, Rob J., and George Athanasopoulos. Forecasting: principles and practice. OTexts, 2018.
[2] Godahewa, Rakshitha, et al. “Monash time series forecasting archive.” arXiv preprint arXiv:2105.06643 (2021).
This dataset is licensed under the Creative Commons Attribution 4.0 International License