
Feat/model deeptime #1329

Open
wants to merge 23 commits into master
Conversation

madtoinou
Collaborator

@madtoinou madtoinou commented Oct 31, 2022

Fixes #1152.

Summary

Implement the DeepTIMe model from https://arxiv.org/pdf/2207.06046.pdf, based on the original repository https://github.com/salesforce/DeepTime and the article pseudo-code.

Also implement some basic tests, inspired by the tests for N-BEATS.

Other Information

In the original article, distinct optimizers are defined for the three groups of parameters: the Ridge Regression regularization term, the bias/norm parameters of the Implicit Neural Representation (INR) network, and the weights of the INR. This was accomplished by overriding the configure_optimizers method, which partially breaks the logic behind the lr_scheduler_cls and lr_scheduler_kwargs arguments. To make the model easier to use out of the box, the default arguments correspond to the original article's parameters (including for the optimizer).
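For illustration, here is a minimal sketch of how such a parameter grouping could be built and returned from Lightning's configure_optimizers hook; the group names, learning rates and weight decays below are hypothetical, not the exact values used in this PR:

import torch
from torch import nn

def build_deeptime_optimizer(module: nn.Module) -> torch.optim.Optimizer:
    # Hypothetical split into three parameter groups: the ridge-regression
    # regularization term, the INR biases/normalization parameters, and the
    # remaining INR weights (the only group with weight decay).
    ridge_params, bias_norm_params, weight_params = [], [], []
    for name, param in module.named_parameters():
        if "ridge" in name:
            ridge_params.append(param)
        elif "bias" in name or "norm" in name:
            bias_norm_params.append(param)
        else:
            weight_params.append(param)
    return torch.optim.Adam([
        {"params": ridge_params, "lr": 1.0, "weight_decay": 0.0},
        {"params": bias_norm_params, "lr": 1e-3, "weight_decay": 0.0},
        {"params": weight_params, "lr": 1e-3, "weight_decay": 1e-4},
    ])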

All the modules necessary for this architecture were included in the same file to limit code fragmentation. The Ridge Regression and INR modules could, however, be extracted if other models require them.

Support for the nr_params functionality is not implemented yet.

@codecov-commenter

codecov-commenter commented Oct 31, 2022

Codecov Report

Base: 93.97% // Head: 94.03% // Increases project coverage by +0.05% 🎉

Coverage data is based on head (ea19348) compared to base (d712ce5).
Patch coverage: 96.95% of modified lines in pull request are covered.

Additional details and impacted files
@@            Coverage Diff             @@
##           master    #1329      +/-   ##
==========================================
+ Coverage   93.97%   94.03%   +0.05%     
==========================================
  Files          82       83       +1     
  Lines        8917     9102     +185     
==========================================
+ Hits         8380     8559     +179     
- Misses        537      543       +6     
Impacted Files Coverage Δ
darts/dataprocessing/transformers/scaler.py 97.56% <ø> (ø)
darts/utils/data/training_dataset.py 89.47% <ø> (ø)
darts/models/forecasting/deeptime.py 96.93% <96.93%> (ø)
...arts/models/forecasting/torch_forecasting_model.py 87.70% <100.00%> (-0.03%) ⬇️
darts/timeseries.py 91.94% <0.00%> (-0.06%) ⬇️
darts/models/forecasting/block_rnn_model.py 98.24% <0.00%> (-0.04%) ⬇️
darts/models/forecasting/nhits.py 99.27% <0.00%> (-0.01%) ⬇️


Contributor

@eliane-maalouf eliane-maalouf left a comment

A few comments on quick initial things I noticed.

Contributor

@hrzn hrzn left a comment

Looks very good, nice job @madtoinou !
After glancing at the paper, I also think it would be a nice addition to Darts.
I haven't looked into all the minute details of the processing being done, but I trust you :) I've got a few small comments. Perhaps the main one concerns nr_params, which we should try to exploit before we merge.

…_global_forecasting_models, reduced the number of epochs to the strict minimum, corrected typo in docstring, removed the mutable default argument, added check in TorchForecastingModel for n_epochs
…crease the length of the prediction from 2 to 3, the last version was relying on erroneous broadcasting
…eepTime, removed some comments in the forward method, corrected typo
@madtoinou madtoinou requested a review from hrzn November 17, 2022 09:07
@hrzn
Contributor

hrzn commented Nov 28, 2022

I find that the results do not seem fantastic in the probabilistic setting. E.g. when running the following code:

import numpy as np

from darts.datasets import AirPassengersDataset
from darts.dataprocessing.transformers import Scaler
from darts.models import DeepTimeModel
from darts.utils.likelihood_models import GaussianLikelihood

# Load and scale the series; hold out the last 36 points for validation
series = AirPassengersDataset().load().astype(np.float32)

scaler = Scaler()
train, val = scaler.fit_transform(series[:-36]), scaler.transform(series[-36:])

model = DeepTimeModel(input_chunk_length=24,
                      output_chunk_length=12,
                      likelihood=GaussianLikelihood())

model.fit(train, epochs=100)

# Sample 300 trajectories to inspect the predictive distribution
pred = model.predict(series=train, n=36, num_samples=300)
train.plot()
pred.plot()

I get this - the variance seems almost zero.
[image: forecast plot where the sampled trajectories show almost no variance]

I'm wondering whether this might be due to our treatment of the distribution parameters, which perhaps happens too early in the processing (when creating the time representations) and could (maybe?) cause degenerate results. Could we maybe find a way to "tile" the tensors somewhere later in the forward pass? WDYT @madtoinou ?

@madtoinou
Collaborator Author

This is indeed a bit disappointing; I should have spent more time looking at the variance of the resulting distribution.

There is not much room for tiling downstream: after the INR (fully connected network), there is only the ridge regression, which solves the equation AX = B, where A is the time representation transposed multiplied by itself and B is the time representation transposed multiplied by the observations. I don't see how we could tweak this part.
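For reference, a minimal sketch of that closed-form step (names and shapes are illustrative only; Phi stands for the INR time representation of the lookback window and Y for the corresponding observations):

import torch

def ridge_regression(Phi: torch.Tensor, Y: torch.Tensor, lam: float = 1.0) -> torch.Tensor:
    # Solve (Phi^T Phi + lam * I) W = Phi^T Y for the readout weights W.
    d = Phi.shape[-1]
    A = Phi.transpose(-1, -2) @ Phi + lam * torch.eye(d)  # A = Phi^T Phi + lam * I
    B = Phi.transpose(-1, -2) @ Y                         # B = Phi^T Y
    return torch.linalg.solve(A, B)

# Hypothetical shapes: 24 lookback steps, 64 INR features, 1 target component
Phi = torch.randn(24, 64)
Y = torch.randn(24, 1)
W = ridge_regression(Phi, Y)  # (64, 1) readout mapping future time representations to forecasts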

I am going to experiment with using different Fourier features for each distribution parameter (before the INR); it should add a bit of heterogeneity, but I am not sure it will solve the issue.
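Purely as an illustration of that idea (the actual feature construction in the PR may differ), one could draw an independent set of random Fourier frequencies per distribution parameter and stack the resulting time representations before feeding them to the INR:

import math

import torch

def fourier_features(t: torch.Tensor, freqs: torch.Tensor) -> torch.Tensor:
    # Map time coordinates t of shape (n, 1) to [sin, cos] features of shape (n, 2 * n_freqs).
    angles = 2 * math.pi * t * freqs
    return torch.cat([torch.sin(angles), torch.cos(angles)], dim=-1)

n_params = 2   # e.g. mean and scale of a Gaussian likelihood
n_freqs = 16
t = torch.linspace(0, 1, 24).unsqueeze(-1)  # normalized time coordinates of the lookback window

# One independent frequency draw per distribution parameter
reps = [fourier_features(t, torch.randn(1, n_freqs).abs()) for _ in range(n_params)]
inr_input = torch.stack(reps, dim=0)  # (n_params, 24, 2 * n_freqs)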

Development

Successfully merging this pull request may close these issues.

[new model] DeepTIMe
4 participants