Add Developments folder via upload #286

facundoallia · 2022-10-28T13:59:28Z

Developments folder contain Ensemble notebook

review-notebook-app · 2022-10-28T13:59:32Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

review-notebook-app · 2022-11-09T16:54:13Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-09T16:54:13Z
----------------------------------------------------------------

Suggested title: How to do an ensemble model for time series forecasting.

StatsForecast allows you to create ensemble models in a very easy way. First we need to import the data that we are going to use, in this case the M4 Dataset. After that, we´ll to (remove "to") generate the forecasts via the generate_forecast() function. Finally, we´ll create the ensemble models using the forecasts ~~previously~~ generated. In this notebook we’ll implement and benchmark a ensemble model of AutoARIMA, AutoETS and AutoCES:

review-notebook-app · 2022-11-09T16:54:14Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-09T16:54:13Z
----------------------------------------------------------------

Let's try to summarize the information in a table

|Frequency| Min # observations training set| Forecasting horizon|

|Yearly | 13 | 6 |

review-notebook-app · 2022-11-09T16:54:15Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-09T16:54:14Z
----------------------------------------------------------------

Give a brief introduction to this section, saying that we'll now generate the forecasts and the ensemble for every frequency.

review-notebook-app · 2022-11-09T16:54:16Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-09T16:54:15Z
----------------------------------------------------------------

Use overall instead of total.

review-notebook-app · 2022-11-09T16:54:16Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-09T16:54:16Z
----------------------------------------------------------------

Expand the conclusions and add the table comparing our results with the other competitors. Mention the ease of use of StatsForecast for generating multiple models in one go and for creating the ensembles.

review-notebook-app · 2022-11-15T07:20:38Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-15T07:20:37Z
----------------------------------------------------------------

Hay que checar esta tabla. Para el Naive2, se obtuvieron los siguientes valores:

sMAPE = 13.564
MASE = 1.912

Entonces

OWA = 1/2*( sMAPE_Nixtla/sMAPE_Naive2 + MASE_Nixtla/MASE_Naive2) = 0.841 != 0.853

Tomando los valores del Naive2 de arriba, para el primer lugar de M4 obtenemos

OWA = 1/2*(11.374/13.564+1.536/1.912) = 0.8209

Este es el valor que aparece en la tabla de resultados

https://www.sciencedirect.com/science/article/pii/S0169207019301128

Sugiero también agregar a los autores de los métodos ya que hay varias afiliaciones que aparecen como Individual.

review-notebook-app · 2022-11-18T06:27:51Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-18T06:27:51Z
----------------------------------------------------------------

Line #1.    # Get trarin dataframe:

Typo: Train. Seems to be in other frequencies as well.

review-notebook-app · 2022-11-18T06:27:52Z

View / edit / reply to this conversation on ReviewNB

MMenchero commented on 2022-11-18T06:27:52Z
----------------------------------------------------------------

I think we can compute the accuracy of every frequency and model more efficiently since the code for it seems to be the same, just with different names. One way of doing this is with this function. The only argument it takes is the name of the frequency.

def compute_accuracy(freq): 
   
  """""Computes the accuracy of a given frequency and model"""
   
  data = pd.read_csv(freq.lower()+'_df_forecast.csv')
  data = data.drop(columns=['Unnamed: 0']) # add 'Unnamed: 0.1' if requiered
  data['ds'] = data.groupby('unique_id')['ds'].transform(lambda data: np.arange(1, len(data) + 1))
  data = pd.melt(data, id_vars=['unique_id', 'ds'], var_name='model')
  data = pd.pivot(data, index=['unique_id', 'model'], columns='ds', values='value').reset_index()
  models = ['Ensemble_median', 'AutoARIMA', 'ETS', 'CES', 'AutoTheta']

  res = {}
  for k in range(len(models)):

    df_mask=data['model'] == models[k]

    df = data[df_mask]

    df.rename(columns={'unique_id': 'id'}, inplace=True)

    df = df.set_index(df['id'])

    df = df.drop(['id', 'model'], axis=1)

    y_metrics = M4Evaluation.evaluate('data2', freq, df.sort_values('id').values)

    res[models[k]] = y_metrics

   

  metrics = [res[k] for k in models]

  total_metrics = pd.concat(metrics)

  total_metrics['model'] = models 

   

  return total_metrics

We can call it using

compute_accuracy('Yearly')

This should produce the same table as above.

I think the following are good practices to keep in mind:

- Keep the variable names as short as possible.

- If you need to do a process more than once, write a function for it if possible.

CLAassistant · 2022-12-05T22:03:39Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

Add files via upload

f08460f

MMenchero requested review from MMenchero and AzulGarza October 28, 2022 17:35

facundoallia added 5 commits November 2, 2022 12:29

Add files via upload

cb996d1

Delete forecast_hformat.rar

0a30128

Add files via upload

5a2ac2e

Add files via upload

46bf1c1

Add files via upload

76eaa69

facundoallia added 9 commits November 10, 2022 15:17

Add files via upload

712c1d8

Add files via upload

578111f

Delete m4_ranking.jpg

0ccfbb6

Add files via upload

72d32d7

Add files via upload

72da17c

Add files via upload

fe084ee

Delete ensemble_m4.ipynb

63f6940

Add files via upload

9282365

Add files via upload

15ac862

facundoallia added 6 commits November 16, 2022 12:09

Delete methods.jpg

fe305a6

Delete metrics.PNG

1b2b7f4

Add files via upload

948c96e

Delete hourly_df_forecast.zip

996259d

Add files via upload

f01e050

Delete ensemble_m4 ARIMA-ETS-CES-THETA.ipynb

83b3264

Add files via upload

e68d7a7

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Developments folder via upload #286

Add Developments folder via upload #286

facundoallia commented Oct 28, 2022

review-notebook-app bot commented Oct 28, 2022

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 15, 2022 •

edited

review-notebook-app bot commented Nov 18, 2022

review-notebook-app bot commented Nov 18, 2022

CLAassistant commented Dec 5, 2022

Add Developments folder via upload #286

Are you sure you want to change the base?

Add Developments folder via upload #286

Conversation

facundoallia commented Oct 28, 2022

review-notebook-app bot commented Oct 28, 2022

review-notebook-app bot commented Nov 9, 2022 • edited

review-notebook-app bot commented Nov 9, 2022 • edited

review-notebook-app bot commented Nov 9, 2022

review-notebook-app bot commented Nov 9, 2022 • edited

review-notebook-app bot commented Nov 9, 2022 • edited

review-notebook-app bot commented Nov 15, 2022 • edited

review-notebook-app bot commented Nov 18, 2022

review-notebook-app bot commented Nov 18, 2022

CLAassistant commented Dec 5, 2022

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 9, 2022 •

edited

review-notebook-app bot commented Nov 15, 2022 •

edited