Skip to main content

Advertisement

Log in

Air quality data series estimation based on machine learning approaches for urban environments

  • Published:
Air Quality, Atmosphere & Health Aims and scope Submit manuscript

Abstract

Air pollution is one of the main environmental problems in residential areas. In many cases, the effects of air pollution on human health can be prevented by forecasting the air quality in the next day. In order to predict the 1 day in advance air quality index (AQI) of Orumiyeh city, the hybrid single decomposition (HSD) and hybrid two-phase decomposition (HTPD) models were used. In the first step, the AQI data were decomposed by complementary ensemble empirical mode decomposition with adaptive noise (CEEMDAN) and was hybridized with general regression neural network (GRNN) and extreme learning machine (ELM) as HSD models. In the second step, using variational mode decomposition (VMD) technique the results of the first intrinsic mode functions (IMFs) of CEEMDAN model were decomposed into nine VMs and were predicted by GRNN and ELM models to obtain IMF1. Finally, in the third step, GRNN and ELM were used again to predict the IMFS as HTPD models. Results showed that in predicting AQI series data by HSD models both CEEMDAN-ELM and CEEMDAN-GRNN models were similarly accurate. Among all the models used, the accuracy of CEEMDAN-VMD-GRNN as the HTPD model was the highest in the training phase (R2 = 0.98, RMSE = 4.13 and MAE = 2.99) and in the testing phase (R2 = 0.74, RMSE = 5.45 and MAE = 3.87). It can be concluded that HTPD models have more accurate results to predict AQI data compared with HSD models.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Abbreviations

AQI:

air quality index

HSD:

hybrid single decomposition

HTPD:

hybrid two-phase decomposition

CEEMDAN:

complementary ensemble empirical mode decomposition with adaptive noise

GRNN:

general regression neural network

ELM:

extreme learning machine

VMD:

variational mode decomposition

IMFS :

intrinsic mode functions

ARIMA:

autoregressive integrated moving average

MLP:

multilayer perceptron

GLM:

generalized linear models

ANN:

artificial neural networks

MLR:

multiple linear regression

ANFIS:

adaptive neuro-fuzzy inference system

VIF:

variance inflation factor

References

  • Amanollahi J, Tzanis C, Ramli MF, Abdullah AM (2016) Urban heat evolution in a tropical area utilizing Landsat imagery. Atmos Res 167:175–182

  • Alimissis et al (2018) Spatial estimation of urban air pollution with the use of artificial neural network models. Atmos Environ 191:205–2013

    CAS  Google Scholar 

  • Amanollahi J, Ausati S (2020a) PM2.5 concentration forecasting using ANFIS, EEMD-GRNN, MLP, and MLR models: a case study of Tehran, Iran. Air Qual Atmos Health 13:161–171

    CAS  Google Scholar 

  • Amanollahi J, Ausati S (2020b) Validation of linear, nonlinear, and hybrid models for predicting particulate matter concentration in Tehran, Iran. Theor Appl Climatol 140:709–717

    Google Scholar 

  • Araujo L, Belotti JT, Alves TA, Tadano YD, Siqueira H (2020) Ensemble method based on artificial neural networks to estimate air pollution health risks. Environ Model Softw 123:104567

    Google Scholar 

  • Ausati S, Amanollahi J (2016) Assessing the accuracy of ANFIS, EEMD-GRNN, PCR, and MLR models in predicting PM2.5. Atmos Environ 142:465–474

    CAS  Google Scholar 

  • Bates JT, Pennington AF, Zhai X, Friberg MD, Metcalf F, Darrow L, Strickland M, Mulholland J, Russell A (2018) Application and evaluation of two model fusion approaches to obtain ambient air; pollutant concentrations at a fine spatial resolution (250m) in Atlanta. Environ Model Softw 109:182–190

    Google Scholar 

  • Bench G (2004) Measurement of contemporary and fossil carbon contents of PM2.5 aerosols: results from Turtleback Dome, Yosemite National Park. Environ Sci Technol 38:2424–2427

    CAS  Google Scholar 

  • Chattopadhyay G, Midya SK, Chattopadhyay S (2019) MLP based preductive model for surfce ozone concentration over an urban area in the Gangetic West Bengal during pre-monsoon season. J Atmos Sol Terres Phys 184:57–62

    CAS  Google Scholar 

  • Chen J, Lu J, Ning J, Yan Y, Li S, Zhou L (2019) Pollution characteristics, sources, and risk assessment of heavy metals and perfluorinated compounds in PM2.5 in the major industrial city of northern Xinjiang, China. Air Qual Atmos Health 12:909–918

    CAS  Google Scholar 

  • Dragomiretskiy K, Zosso D (2014) Variational mode decomposition. IEEE Trans. Signal Process. 62:531–544

  • Fallahi S, Amanollahi J, Tzanis CG, Ramli MF (2018) Estimating solar radiation using NOAA/AVHRR and ground measurement data. Atmos Res 199:93–102

    Google Scholar 

  • Ganguly ND, Tzanis C (2011) Study of stratosphere-troposphere exchange events of ozone in India and Greece using ozonesonde ascents. Meteorol Appl 18:467–474

    Google Scholar 

  • Ghasemi A, Amanollahi J (2019) Integration of ANFIS model and forward selection method for air quality forecasting. Air Qual Atmos Health 12:59–72

    CAS  Google Scholar 

  • Gomez-Losada A, Pires JCM, Pino-Mejias R (2016) Charactrization of background air pollution exposure in urban environments usng a ametric based on hidden Markov models. Atmos Environ 127:255–261

    CAS  Google Scholar 

  • Gumede PR, Savage MJ (2017) Respiratory health effects associated with indoor particulate matter (PM2.5) in children residing near a landfill site in Durban, South Africa. Air Qual Atmos Health 10:853–860

    CAS  Google Scholar 

  • Huang GB, Zhu QY, Siew CK (2006) Extreme learning machine: theory and application. Neurocomputing 70:489–501

  • Hwang S-L, Lin Y-C, Hsiao K-Y, Lin C-M, Chi M-C (2020) Spatiotemporal assessment of mortality attributable to ambient PM 2.5 exposure in Taiwan during 2008-2015. Air Qual Atmos Health 13:233–245

    Google Scholar 

  • Jian L, Zhao Y, Zhu YP, Zhang MB, Bertolatti D (2012) An application of ARIMA model to predict submicron particle concentrations from meteorological factors at a busy roadside in Hangzhou, China. Sci Total Environ 426:336–345

    CAS  Google Scholar 

  • Jie W, Wang J, Li W, Polivka T (2015) Using a deterministic time-lagged ensemble forecast with a probabilistic threshold for improving 6-15 day summer precipitation prediction in China. Atmos Res 156:142–159

    Google Scholar 

  • Kaboodvandpour S, Amanollahi J, Qhavami S, Mohammadi B (2015) Assessing the accuracy of multiple regressions, ANFIS, and ANN models in predicting dust storm occurrences in Sanandaj, Iran. Nat Hazards 78:879–893

    Google Scholar 

  • Kampa M, Castanas E (2008) Human health effects of air pollution. Environ Pollut 151:362–367

    CAS  Google Scholar 

  • Lei MT, Monjardino J, Mends L, Goncalves D, Ferreira F (2019) Macao air quality forecast using statistical methods. Air Qual Atmos Health 12:1049–1057

    CAS  Google Scholar 

  • Leung MT, Daock H, Chen A (2000) Forecasting stock indices: a comparison of classification and level estimation models. Int J Forecast 16:173–190

  • Mirzaei M, Amanollahi J, Tzanis CG (2019) Evaluation of linear, nonlinear, and hybrid models for predicting PM2.5 based on a GTWR model and MODIS AOD data. Air Qual Atmos Health 12:1215–1224

    CAS  Google Scholar 

  • Niu MF, Wang YF, Sun SL, Li YW (2016) A novel hybrid decomposition-and-ensemble model based on CEEMD and GWO for short-termPM2.5 concentration forecasting. Atmos Environ 134:168–180

    CAS  Google Scholar 

  • Niyogi et al (2007) Potential impact of aerosol-land-atmosphere interactions on the Indian monsoonal rainfall characteristics. Nat Hazards 42:345–359

    Google Scholar 

  • Ostro B, Chestnut L, Vichit-Vadakan N, Laixuthai A (1999) The impact of particulate matter on daily mortality in Bangkok, Thailand. J Air Waste Manage Assoc 49:100–107

    CAS  Google Scholar 

  • Paoin K, Ueda K, Seposo XT, Hayano J, Kiyono K, Ueda N, Kawamura T, Honda A, Takano H (2020) Association between PM2.5 exposure and heart rate variability for the patients with cardiac problems in Japan. Air Qual Atmos Health 13:339–347

    CAS  Google Scholar 

  • Ribeiro MC, Pinho P, Branquinho C, Llop E, Pereira MJ (2016) Geostatistical uncertainty of assessing air quality using high-spatial-resolution lichen data: a health study in the urban area of Sines, Portugal. Sci Total Environ 562:740–750

    CAS  Google Scholar 

  • Simaha CP, Devara PCS, Saha SK (2013) Aerosol pollution and its impact on regional climate during Holi festival inferred from ground-based and satellite remote sensing observations. Nat Hazards 69:889–903

    Google Scholar 

  • Singh A, Pant P, Pope FD (2019) Air quality during and after festivals: aerosol concentrations, composition and health effects. Atmos Res 227:220–232

    CAS  Google Scholar 

  • Tzanis C, Varotsos CA (2008) Tropospheric aerosol forcing of climate: a case study for the greater area of Greece. Int J Remote Sens 29(9):2507–2517

  • Tzanis et al (2019) Applying linear and nonlinear models for the estimation of particulate matter variability. Environ Pollut 246:89–98

    CAS  Google Scholar 

  • Ventura LMB, Pinto FO, Soares LM, Luna AS, Gioda A (2019) Forecast of daily PM2.5 concentrations applying artificial neural networks and Holt-Winters models. Air Qual Atmos Health 12:317–325

    CAS  Google Scholar 

  • Wang XX, Chen S, Lowe D, Harris CJ (2006) Sparse support vector regression basedon orthogonal forward selection for the generalised kernel model. Neurocomputing. 70:462–474

    Google Scholar 

  • Wang D, Wei S, Luo H, Yue C, Grunder O (2017) A novel hybrid model for air quality index forecasting based n two-phase decomposition technique and modified extreme learning machine. Sci Total Environ 580:719–733

    CAS  Google Scholar 

  • Xu X, Ren W (2019) A hybrid model based on a two-layer decomposition approach and an optimized neural network for chaotic time series prediction. Symmetry 2019(11):610. https://doi.org/10.3390/sym11050610

    Article  Google Scholar 

  • Xu Y, Liu H, Duan Z (2020) A novel hybrid model for multi-step daily AQI forecasting drive by air pollution big data. Air Qual Atmos Health 13:197–207

    CAS  Google Scholar 

  • Yeganeh B, Hewson MG, Clifford S, Tavassoli A, Knibbs LD, Morawska L (2018) Estmation the spatiotemporal variation of NO2 concentration using an adaptive neuro-fuzzy inference system. Environ Model Softw 100:225–235

    Google Scholar 

  • Yeh J, Shieh J, Huang NE (2010) Complementary ensemble empirical mode decomposition: a novel noise enhanced data analysis method. Advances in Adaptive Data Analysis .2:135–156

  • Zafra C, Angel Y, Torres E (2017) ARIMA analysis of the effect of land srface coverage on PM10 concentrations in a high-altitude megacity. Atmos Pollut Res 8(4):660–668

    Google Scholar 

  • Zhang H, Zhang WD, Palazoglu A, Sun W (2012) Prediction of ozone levels using a hidden Markov model (HMM) with gamma distribution. Atmos Environ 62:64–73

  • Zhang L, Lin J, Qiu R, Hu X, Zhang H, Chen Q, Tan H, Lin D, Wang J (2018) Trend analysis and forcast of PM2.5 in Fuzhou, China using the ARIMA model. Ecol Indic 95(Part1):702–710

    CAS  Google Scholar 

  • Zhang B, Zhang H, Zhao G, Lian J (2020) Constructing a PM2.5 concentration prediction model by combining auto-encoder with Bi-LSTM neural networks. Environ Model Softw 124:104600

    Google Scholar 

  • Zhou Q, Jiang H, Wang J, Zhou J (2014) A hybrid model for PM2.5 forecasting based on ensemble empirical mode decomposition and a general regression neural network. Sci Total Environ 496:264–274

    CAS  Google Scholar 

  • Zhu S, Lian X, Wei L, Che J, Shen X, Yang L, Qiu X, Liu X, Go W, Ren X, Li J (2018) PM2.5 forecasting using SVR with PSOGSA algorithm based on CEEMD, GRNN and GCA considering meteorological factors. Atmos Environ 183:20–32

    CAS  Google Scholar 

Download references

Acknowledgments

We thank Dr. Vahid Nimehchisalem, from the Department of English, Faculty of Modern Languages and Communication, Universiti Putra Malaysia, for editing our manuscript. The researchers are grateful to Mrs. Shadi Ausati for her effort of modeling in Matlab software.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jamil Amanollahi.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Rahimpour, A., Amanollahi, J. & Tzanis, C.G. Air quality data series estimation based on machine learning approaches for urban environments. Air Qual Atmos Health 14, 191–201 (2021). https://doi.org/10.1007/s11869-020-00925-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11869-020-00925-4

Keywords

Navigation