Skip to content

Analysis of web-scrapped news articles of DIE ZEIT. Construction of a sentiment indicator out of the articles and employment in a state-of-the-art economic prediction set-up demonstrating its usefulness for predicting inflation in particular.

Notifications You must be signed in to change notification settings

dullibri/zeit-2

Repository files navigation

zeit-2

code file structure:

  • tests: includes R-code for statistical tests, mostly for comparing forecast accuracy

news analysis:

  • access newspaper articles of Die Zeit
  • clean them of html
  • create a document term matrix
  • evaluate sentiment using sentiws (publically available data base)
  • identify topics
  • create sentiment indices for each topic over time.

pseudo-out-of sample experiment:

  • making sets of bundesbank realtime data.R bindet die realtime daten zu bestimmten Tagen zu sets zusammen.
  • survey selecting.R collects survey data. in file data.
  • buba_selecting_data.R
  • oecd_selecting_data.R
  • Short_term_Indicators3.xlsx collects all data not available from source.

changing the realtime experiment

I take the approach of bubblesbreakdowns:

  • lag length selection is implemented based on bma methods (raftery et al.).
  • in contrast to the original bubblesbreakdowns approach, this will only be implemented for the last vintage.
  • starting with the last vintage, the lag length estimated there will be used for all other vintages.

the steps to implent this:

  1. step (getting process)
  • the file realtime experiment.Rmd will be used including the data.
  • it will be named "realtime experiment zeit.Rmd"
  • The sequence will be reversed, such that the loop starts at the latest vintage.
  • one modified version of bmafo will be used that returns the last vintages lag length and the forecast.
  • starting with the penultimate vintage, another modified version will use the lag length of the latest vintage and returns the forecast based on model estimated on the penultimate data.
  1. step (getting data)
  • bundesbank realtime data will be transformed to sets (containing one vintage of several variables)
  • the unrevised data are read in.
  • the revised and the unrevised data are plugged into the existing process from bubblesbreakdowns.

goldstandard testing

  • optimize zeit-index.Rmd contains the sentiment analysis used and compares an index created on its basis to a goldstandard.

About

Analysis of web-scrapped news articles of DIE ZEIT. Construction of a sentiment indicator out of the articles and employment in a state-of-the-art economic prediction set-up demonstrating its usefulness for predicting inflation in particular.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published