Spectral methods for imputation of missing air quality data

Shai Moshenberg, Uri Lerner, Barak Fishbain
2015 Environmental Systems Research  
Air quality is well recognized as a contributing factor for various physical phenomena and as a public health risk factor. Consequently, there is a need for an accurate way to measure the level of exposure to various pollutants. Longitudinal continuous monitoring however, is often incomplete due to measurement errors, hardware problems or insufficient sampling frequency. In this paper we introduce the discrete sampling theorem for the task of imputing missing data in longitudinal air-quality
more » ... e series. Within the context of the discrete sampling theorem, two spectral schemes for filling missing values are presented-a Discrete Cosine Transform (DCT) and Clustering Single Variable Decomposition (K-SVD) based methods. Results: The evaluation of the suggested methods in terms of accuracy and robustness showed that the spectral methods are comparable to the state of the art when the data is missing at random and do have the upper hand when data is missing in big chunks. The accuracy was evaluated using a complete very long air pollutants time series. Previous studies used incomplete shorter series, altering the results. The robustness of the imputation method was evaluated by examining its performance with increasing portions of missing data. Conclusions: Spectral methods are a great option for air quality data imputation, which should be considered especially when the missing data patterns are unknown.
doi:10.1186/s40068-015-0052-z fatcat:judllcs6vrfptcxifyvzcorcs4