Skip to content

Forecasting System for Continual Learning Scenarios based on Hoeffding Trees With Change Point Detection Mechanism

Notifications You must be signed in to change notification settings

rasvob/Hoeffding-Trees-with-CPD-Multistep-Forecasing

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

23 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

A Natural Gas Consumption Forecasting System for Continual Learning Scenarios based on Hoeffding Trees With Change Point Detection Mechanism

Forecasting natural gas consumption, considering seasonality and trends, is crucial in planning its supply and consumption and optimizing the cost of obtaining it, mainly by industrial entities. However, in times of threats to its supply, it is also a critical element that guarantees the supply of this raw material to meet individual consumers' needs, ensuring society's energy security.

This paper aims to propose and evaluate two novel approaches, Pure and Mixed, for multistep forecasting of natural gas consumption based on change point detection and model aggregation. We design a general methodology for multistep multivariate time series forecasting with continual learning capabilities using data stream processing. The performance of the defined change point divided model collection approaches is compared with two baseline approaches. We use three metrics and three settings of the PELT algorithm to measure the forecasting error and the effect of the number of change points. Our experiments show that the defined Pure approach outperforms all other approaches by each selected metric and for all tested parameter settings. We also demonstrate that fewer change points result in a lower error and that the Pure approach is more robust and suitable for continual learning tasks. We present the potential of our methodology for forecasting natural gas consumption in a dynamic and complex environment and suggest future work.

Jupyter notebooks

  • Reuired packages are in the requirements.txt, experiments were conducted in Jupyter notebooks in the repository.

  • NGC prefixed Jupyters use Natural gas consumption data, EL prefixed uses Electricity load data

  • PELT setting for Low, Medium and High are controlled by variables LOW, MED, HIGH for the PELT penalization factor in the notebooks, just change the multiplicator in the penalization variables, e.g. for LOW it is *pen = cp_params[model][47]LOW.

These notebooks contains source codes for the experiments:

  • NGC_HT_Day2Day_Online.ipynb: SMCA

  • NGC_HT_Day2Day_BySeason_2021: QDMDC

  • NGC_HT_Day2Day_ByCP_2021.ipynb: PCPDMC

  • NGC_HT_Day2Day_ByCP_ERR_Switch_2021.ipynb: MCPDMC-SW

  • NGC_HT_Day2Day_ByCP_ERR_WAVG_2021.ipynb: MCPDMC-WA

  • NGC_{FCN, LSTM, GRU}_Day2Day_Online.ipynb: SMCA using GRU/LSTM/FCN as a forecasting model

  • NGC_{FCN, LSTM, GRU}_Day2Day_BySeason_2021: QDMDC using GRU/LSTM/FCN as a forecasting model

  • NGC_{FCN, LSTM, GRU}_Day2Day_ByCP_2021.ipynb: PCPDMC using GRU/LSTM/FCN as a forecasting model

  • EL_HT_Day2Day_Online.ipynb: SMCA

  • EL_HT_Day2Day_BySeason_2021: QDMDC

  • EL_HT_Day2Day_ByCP_2021.ipynb: PCPDMC

  • EL_HT_Day2Day_ByCP_ERR_Switch_2021.ipynb: MCPDMC-SW

  • EL_HT_Day2Day_ByCP_ERR_WAVG_2021.ipynb: MCPDMC-WA

  • EL_{FCN, LSTM, GRU}_Day2Day_Online.ipynb: SMCA using GRU/LSTM/FCN as a forecasting model

  • EL_{FCN, LSTM, GRU}_Day2Day_BySeason_2021: QDMDC using GRU/LSTM/FCN as a forecasting model

  • EL_{FCN, LSTM, GRU}_Day2Day_ByCP_2021.ipynb: PCPDMC using GRU/LSTM/FCN as a forecasting model

Results folder

Folder contains .csv files with the experiments outputs.

List of change points for the Low, Medium and High PELT settings are in files:

  • BKPS_LOW.csv
  • BKPS_MED.csv
  • BKPS_HIGH.csv

And for electricity load data:

  • EL_BKPS_LOW.csv
  • EL_BKPS_MED.csv
  • EL_BKPS_HIGH.csv

These csv files contains the experiments outputs:

  • NGC_HT_Day2Day_Online_2021.csv: SMCA

  • NGC_HT_Day2Day_BySeason_2021.csv: QDMDC

  • NGC_HT_Day2Day_ByCP_{LOW, MED, HIGH}_2021.csv: PCPDMC

  • NGC_HT_Day2Day_ByCP_{LOW, MED, HIGH}_ERR_Switch_2021.csv: MCPDMC-SW

  • NGC_HT_Day2Day_ByCP_{LOW, MED, HIGH}_ERR_WAVG_2021.csv: MCPDMC-WA

  • NGC_{FCN, LSTM, GRU}_Day2Day_Online_2021.csv: SMCA using GRU/LSTM/FCN as a forecasting model

  • NGC_{FCN, LSTM, GRU}_Day2Day_BySeason_2021.csv: QDMDC using GRU/LSTM/FCN as a forecasting model

  • NGC_{FCN, LSTM, GRU}Day2Day_ByCP{LOW, MED, HIGH}_2021.csv: PCPDMC using GRU/LSTM/FCN as a forecasting model

  • EL_HT_Day2Day_Online_2021.csv: SMCA

  • EL_HT_Day2Day_BySeason_2021.csv: QDMDC

  • EL_HT_Day2Day_ByCP_{LOW, MED, HIGH}_2021.csv: PCPDMC

  • EL_HT_Day2Day_ByCP_{LOW, MED, HIGH}_ERR_Switch_2021.csv: MCPDMC-SW

  • EL_HT_Day2Day_ByCP_{LOW, MED, HIGH}_ERR_WAVG_2021.csv: MCPDMC-WA

  • EL_{FCN, LSTM, GRU}_Day2Day_Online_2021.csv: SMCA using GRU/LSTM/FCN as a forecasting model

  • EL_{FCN, LSTM, GRU}_Day2Day_BySeason_2021.csv: QDMDC using GRU/LSTM/FCN as a forecasting model

  • EL_{FCN, LSTM, GRU}Day2Day_ByCP{LOW, MED, HIGH}_2021.csv: PCPDMC using GRU/LSTM/FCN as a forecasting model

Publications

Title Journal Link Year
A Natural Gas Consumption Forecasting System for Continual Learning Scenarios based on Hoeffding Trees With Change Point Detection Mechanism Uunder rewiew in KNOSYS arXiv 2024
@misc{svoboda2023natural,
title={A Natural Gas Consumption Forecasting System for Continual Learning Scenarios based on Hoeffding Trees with Change Point Detection Mechanism}, 
author={Radek Svoboda and Sebastian Basterrech and Jędrzej Kozal and Jan Platoš and Michał Woźniak},
year={2023},
eprint={2309.03720},
archivePrefix={arXiv},
primaryClass={cs.LG}
}