26–30 Jun 2022
Europe/Berlin timezone

Textual Data for Time Series Forecasting

28 Jun 2022, 15:10


Other/special session/invited session INVITED SFdS


David Obst (EDF R&D)


Traditional mid-term electricity forecasting models rely on calendar and meteorological information such as temperature and wind speed to achieve high performance. However depending on such variables has drawbacks, as they may not be informative enough during extreme weather. While ubiquitous, textual sources of information are hardly included in prediction algorithms for time series, despite the relevant information they may contain. In this work, we propose to leverage openly accessible weather reports for electricity demand and meteorological time series prediction problems. Our experiments on French and British load data show that the considered textual sources allow to improve overall accuracy of the reference model, particularly during extreme weather events such as storms or abnormal temperatures. Additionally we apply our approach to the problem of imputation of missing values in meteorological time series, and we show that our text-based approach beats standard methods. Furthermore, the influence of words on the time series' predictions can be interpreted for the considered encoding schemes of the text, leading to a greater confidence in our results.

Keywords Time series, Forecasting, Electricity consumption

Primary authors

David Obst (EDF R&D) Mrs Sandra Claudel (EDF R&D) Jairo Cugliari (Université de Lyon) Prof. Badih Ghattas (Aix-Marseille Université) yannig goude (EDF R&D) Prof. Georges Oppenheim (Université Paris-Est Marne-la-Vallée)

Presentation materials

There are no materials yet.