Forecaster's Dilemma: Extreme Events and Forecast Evaluation

Sebastian Lerch, Thordis L. Thorarinsdottir, Francesco Ravazzolo, Tilmann Gneiting
2017 Statistical Science  
In public discussions of the quality of forecasts, attention typically focuses on the predictive performance in cases of extreme events. However, the restriction of conventional forecast evaluation methods to subsets of extreme observations has unexpected and undesired effects, and is bound to discredit skillful forecasts when the signal-to-noise ratio in the data generating process is low. Conditioning on outcomes is incompatible with the theoretical assumptions of established forecast
more » ... on methods, thereby confronting forecasters with what we refer to as the forecaster's dilemma. For probabilistic forecasts, proper weighted scoring rules have been proposed as decision-theoretically justifiable alternatives for forecast evaluation with an emphasis on extreme events. Using theoretical arguments, simulation experiments and a real data study on probabilistic forecasts of U.S. inflation and gross domestic product (GDP) growth, we illustrate and discuss the forecaster's dilemma along with potential remedies. Key words and phrases: Diebold-Mariano test, hindsight bias, likelihood ratio test, Neyman-Pearson lemma, predictive performance, probabilistic forecast, proper weighted scoring rule, rare and extreme events.
doi:10.1214/16-sts588 fatcat:qdywcwyvyneo5hicqeargcq7aq