The estimation of hourly PM2.5 concentrations across China based on a Spatial and Temporal Weighted Continuous Deep Neural Network (STWC-DNN)

https://doi.org/10.1016/j.isprsjprs.2022.05.011Get rights and content

Abstract

The continuous distributions of PM2.5 concentrations and predictor variables in the surrounding regions influence the PM2.5 concentrations in the prediction positions notably, yet few machine learning models quantified the spatially continuous interactions between PM2.5 concentrations and predictor variations, which limits the prediction accuracy. To fill this gap, a Spatial and Temporal Weighted Continuous Deep Neural Network (STWC-DNN) was proposed. For STWC-DNN, three sub-networks, Single Pixel Network (SPN), Multiple Station Network (MSN), and Continuous Region Network (CRN) were designed to analyze the influence of predictor variables at the prediction position, the influence of PM2.5 concentrations from surrounding stations, and the influence of continuous raster predictor variables from surrounding pixels respectively. STWC-DNN was experimented using hourly Himawari AOD data and the outputs were compared with a series of advanced models. STWC-DNN achieved higher accuracy than existing models and the sample-based, time-based, and station-based 10-fold cross-validation (CV) R2 were 0.92, 0.90, and 0.79, respectively. The principle of establishing STWC-DNN sheds useful lights on the effective use of raster predictor variables and automatic spatiotemporal weight function to better estimate PM2.5 and other airborne pollutants based on multiple data sources. The codes of STWC-DNN are now available at https://github.com/wangzh2022/STWC-DNN.

Introduction

With the rapid urbanization and industrialization, PM2.5 (particulate matter with an aerodynamic diameter ≤ 2.5 μm) has become a severe environmental issue in China, some urban agglomerations in particular. PM2.5 concentrations have a significant negative effect on public health, especially the increase of cardiovascular and respiratory-related morbidity and mortality (Lim et al., 2011, Crouse et al., 2012, Kloog et al., 2013). Therefore, growing efforts have been made on a comprehensive understanding of spatiotemporal variations of PM2.5 concentrations (Ye et al. 2018) and their anthropogenic (Hagler et al. 2007) and meteorological drivers (Chen et al., 2017, Chen et al., 2018, Chen et al., 2019a, Chen et al., 2019b). These studies usually required long time-series and large-scale PM2.5 concentration data. Since 2013, with increasing haze episodes, ground PM2.5 observation stations have been widely made at 1497 stations across the country. However, high-resolution and high precision PM2.5 concentration products, which are widely required in a diversity of applications, cannot be reliably obtained based on the interpolation using sparsely and unevenly distributed ground observation stations (Zhang et al. 2018).

Thanks to the rapid progress of satellite remote sensing technology, it has become a promising approach for estimating PM2.5 concentrations using satellite-derived aerosol optical depth (AOD) products. Recently, a series of AOD products, mainly including the Moderate Resolution Imaging Spectroradiometer (MODIS) (Fang et al., 2016, He and Huang, 2018, Hu et al., 2014), GF-1 (Zhang et al. 2018), the Visible Infrared Imaging Radiometer Suite (VIIRS) (Wu et al., 2016, Hu et al., 2017, Yao et al., 2019), and the Multiangle Imaging SpectroRadiometer (MISR) (Liu et al., 2007, Ma et al., 2014, You et al., 2015), have been employed in previous studies to estimate PM2.5 concentrations. However, one major limitation for most AOD products is the relatively coarse temporal resolution. The temporal resolution for these AODs products is generally produced at a daily basis or even more. Retrieved daily PM2.5 products cannot effectively meet the requirement for exploring the effects of anthropogenic activities and meteorological conditions on PM2.5 concentrations, which generally present notable variations on an hourly basis (Chen et al. 2020).

As a geostationary satellite, Himawari-8 can provide AODs with a spatial resolution of 5 km and a temporal resolution of 10 min (Level 2) or 1 h (Level 3) (Bessho et al., 2016, Yumimoto et al., 2016). Based on the Himawari-8 Level 3 AODs products, growing scholars attempted to estimate PM2.5 concentrations using different algorithms. Wang et al. (2017) employed an improved linear mixed-effect model (LME) for the Beijing-Tianjin-Hebei (BTH) region and achieved a coefficient of determination (R2) of 0.86. Zhang et al. (2019) employed an improved seasonal LME model for Central China (CCH), BTH, Yangtze River Delta (YRD) and Pearl River Delta (PRD) and achieved R2 of 0.82, 0.84, 0.80 and 0.74 respectively. Chen et al (2019) employed a stacking model for Central and Eastern China and achieved a R2 of 0.85. Wei et al. (2021) proposed a Space-Time Light Gradient Boosting Machine (STLG) model for China and achieved a R2 of 0.85. Although these studies proved the feasibility of estimating hourly PM2.5 products at regional scales, major limitations remained. LME models cannot effectively estimate PM2.5 concentrations in areas with limited stations (Wang et al. 2017), and thus not suitable for large-scale and continuous estimation. Since the stacking model ignored spatiotemporal variations, PM2.5 products generated using this model suffered from abnormal values, similar to the salt-and-pepper noise, and caused large prediction biases (Wu et al. 2016). STLG model used the geographical distances of prediction position to stations to generate the spatial feature which would suffer from the uneven station distribution. Therefore, a robust and comprehensive model is required for better estimating PM2.5 concentrations at a national scale.

A key step for AOD-based PM2.5 estimation is the exploration of AOD-PM2.5 relationship. In early years, linear models (Engel-Cox et al., 2004, Gupta and Christopher, 2009) have been a major approach for estimating PM2.5 concentrations using AODs. However, AOD-PM2.5 relationship is complicated and non-linear under different emission and meteorological conditions (Yang et al. 2019). Therefore, advanced models were proposed to better explain AOD-PM2.5 relationship. Statistical models such as Geographically Weighted Regression (GWR) model (Hu et al., 2014, Song et al., 2015), LME model (Li et al., 2015), two-stage model (Hu et al., 2014, Ma et al., 2016), and Geographically and Temporally Weighted Regression (GTWR) model (Huang et al., 2010, He and Huang, 2018), have been employed to explain the nonlinear AOD-PM2.5 relationship by adding random effects or local effects to regression models. Nevertheless, it remains challenging for using statistical models to precisely express the complicated, uncertain nonlinear AOD-PM2.5 relationship. Compared with the statistical models, machine learning models can fit complicated relationships (Hu et al. 2017). Researchers employed such models as random forest (RF) models (Hu et al. 2017), and neural network (NN) models (Wu et al., 2012, Li et al., 2017a, Wang et al., 2019) to predict PM2.5 concentrations at a prediction pixel by considering various predictor variables, including AODs data, and other auxiliary variables, such as meteorological and land use data, and achieved improved prediction accuracies.

Previous studies demonstrated the potential of applying machine learning to PM2.5 concentration prediction. Nevertheless, spatiotemporal variations were limitedly considered in these machine learning models. On one hand, PM2.5 concentrations in surrounding stations were considered in Spatial and Temporal Random Forest (STRF) model (Wei et al. 2019), Deep Belief Network (Geoi-DBN) model (Li et al. 2017b), Geographically and Temporally Weighted Neural Network (GTWNN) model (Li et al. 2020a), and SLGT model (Wei et al. 2021). For STRF and Geoi-DBN models, the spatiotemporal variation items, calculated through the spatiotemporal weight function, are used as extra input predictor variables. For GTWNN, the spatiotemporal variations are represented in a similar approach to the GTWR, yet the linear model in GTWR model is replaced with the Neural Network (NN) model. However, for these models, the spatiotemporal weight function is fixed and not adaptive according to actual data sources. A fixed weight function may cause large uncertainties when summarizing remarkable spatiotemporal variations in large areas. The accuracy of PM2.5 concentration prediction would be affected significantly in regions where spatiotemporal variations do not fit the weight function. The missing data and the uneven station distribution would also result in low prediction accuracy. On the other hand, the diffusion of PM2.5 concentrations is controlled by the surrounding spatial distribution of meteorological conditions (Chen et al. 2020). Even if the predictor variables of positions are the same, different diffusion conditions in the surrounding regions would lead to different PM2.5 concentrations of the positions. The situation cannot be modeled in the most existing machine learning models which simply consider predictor variables at the prediction position, resulting in limited prediction accuracy.

To fill these gaps, we attempt to develop a Spatial and Temporal Weighted Continuous Deep Neural Network (STWC-DNN) for better estimating hourly PM2.5 concentrations across China using Himawari-8 AOD products. For STWC-DNN, specific networks are proposed to automatically establish spatiotemporal weight functions to better consider the discrete meteorological influences on PM2.5 concentrations. Moreover, in addition to discrete predictor variables, we further consider spatial continuous meteorological influences on PM2.5 concentrations at the prediction pixel by employing raster data sources. The accuracy of estimated PM2.5 concentrations is evaluated through sample-based, time-based, and station-based 10-fold cross-validation (CV) (Li et al. 2020b). To further evaluate the performance of STWC-DNN, the three 10-fold CV results of STWC-DNN are compared with some recent models, including the Multiple Line Regression (MLR) model, GTWR model, RF model, STRF model, Geoi-DBN model, GTWNN model, and SLTG model. Since previous models rarely considered spatially continuous variable data, the principle of establishing STWC-DNN sheds useful lights on the effective inclusion of multiple raster data for better estimating the concentration of PM2.5, ground ozone and other pollutants. The codes of STWC-DNN are now available at https://github.com/wangzh2022/STWC-DNN.

Section snippets

Data sources

The data used in the study mainly included ground observed PM2.5 concentrations, satellite-retrieved AOD with 5 km spatial resolution and 1 h time resolution, meteorological data (Boundary Layer Height (BLH), Relative Humidity (RH), Surface Pressure (SP), etc.) and such auxiliary data related to PM2.5 concentrations as Normalized Difference Vegetation Index (NDVI) and Digital Elevation Model (DEM). Details of employed data sources were introduced as follows.

Methods

Nonlinear PM2.5-meteorology interactions were highly complicated and presented notable spatiotemporal variations (Chen et al., 2020). The uncertainty of AOD-PM2.5 relationship was mainly caused by the complicated, underlying influence of predictor variables on PM2.5 concentrations. In addition to predictor variables at the prediction pixel, the spatiotemporal distribution of PM2.5 level and predictor variables can also have a strong influence on PM2.5 concentrations at the prediction pixel.

Descriptive statistics

Fig. 4 shows the histograms and descriptive statistics of variables (except for DEM that does not change over time) in the entire model fitting data set. With the removal of missing data, there was a total of 465,362 observed PM2.5 concentration samples where corresponding gridded variables were available in 2017 over China. In this data set, the annual mean PM2.5 concentration was 58.79 ± 44.35 μg/m3. For seasons, the highest value was in winter (77.40 μg/m3) and lower values were in autumn

Discussion

Despite a satisfactory accuracy, limitations remained for STWC-DNN. Firstly, since the temporal resolution of meteorological data was 6 h, which was much lower than the Himawari-8 AOD products, the hourly temporal variations of PM2.5 have yet been fully utilized. Some alternative sources such as the forecast products from the Goddard Earth Observing System (GEOS) (https://gmao.gsfc.nasa.gov/GMAO_products/NRT_products.php) or the meteorological data at stations, with a higher temporal resolution

Conclusions

To comprehensively understand the influence of predictor variables on PM2.5 concentrations, we proposed a Spatial and Temporal Weighted Continuous Deep Neural Network (STWC-DNN), which employed an automatic spatiotemporal weight function and a set of raster predictor variables. STWC-DNN included three sub-networks, Single Pixel Network (SPN), Multiple Station Network (MSN) and Continuous Region Network (CRN), which aimed to consider the influence of predictor variables at the target position,

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Acknowledgement

This work was supported by National Natural Science Foundation of China (Grant No. 42171399, Grant No. 41901414) and Beijing Municipal Natural Science Foundation, China (Grant No. 8202031).

References (57)

  • K. Bessho et al.

    An introduction to Himawari-8/9—Japan’s new-generation geostationary meteorological satellites

    J. Meteorol. Soc. Jpn

    (2016)
  • J. Chen et al.

    Stacking machine learning model for estimating hourly PM2.5 in China based on Himawari 8 aerosol optical depth data

    Sci. Total Environ.

    (2019)
  • Z. Chen et al.

    Detecting the causality influence of individual meteorological factors on local PM2.5 concentration in the Jing-Jin-Ji region

    Sci. Rep.

    (2017)
  • Z. Chen et al.

    Understanding meteorological influences on PM2.5 concentrations across China: a temporal and spatial perspective

    Atmos. Chem. Phys.

    (2018)
  • Z. Chen et al.

    The control of anthropogenic emissions contributed to 80% of the decrease in PM2.5 concentrations

    Atmos. Chem. Phys.

    (2019)
  • Z. Chen et al.

    Influence of meteorological conditions on PM2.5 concentrations across china: a review of methodology and mechanism

    Environ. Int.

    (2020)
  • D.L. Crouse et al.

    Risk of Nonaccidental and Cardiovascular Mortality in Relation to Long-term Exposure to Low Concentrations of Fine Particulate Matter: A Canadian National-Level Cohort Study

    Environ. Health Perspect.

    (2012)
  • Devlin, J., Chang, M., Lee, K., et al., 2018. BERT: Pre-training of Deep Bidirectional Transformers for Language...
  • J. Duchi et al.

    Adaptive subgradient methods for online learning and stochastic optimization

    J. Mach. Learn. Res.

    (2011)
  • J.A. Engel-Cox et al.

    Qualitative and quantitative evaluation of MODIS satellite sensor data for regional and urban scale air quality

    Atmos. Environ.

    (2004)
  • X. Fang et al.

    Satellite-based ground PM2.5 estimation using timely structure adaptive modeling

    Remote Sens. Environ.

    (2016)
  • S. Gao et al.

    Res2Net: A New Multi-scale Backbone Architecture

    IEEE Trans. Pattern Anal. Mach. Intell.

    (2019)
  • X. Glorot et al.

    Deep sparse rectifier neural networks

    J. Mach. Learn. Res.

    (2011)
  • P. Gupta et al.

    Particulate matter air quality assessment using integrated surface, satellite, and meteorological products: multiple regression approach

    J. Atmos. Res.: Atmos.

    (2009)
  • G.S.W. Hagler et al.

    Local and regional anthropogenic influence on PM2.5 elements in Hong Kong

    Atmos. Environ.

    (2007)
  • K. He et al.

    Deep Residual Learning for Image Recognition

    IEEE Conf. Comput. Vision Pattern Recogn.

    (2016)
  • Q. He et al.

    Satellite-based mapping of daily high-resolution ground PM2.5 in China via space-time regression modeling

    Remote Sens. Environ.

    (2018)
  • X. Hu et al.

    Estimating PM2.5 Concentrations in the Conterminous United States Using the Random Forest Approach

    Environ. Sci. Technol.

    (2017)
  • X. Hu et al.

    Estimating ground-level PM2.5 concentrations in the Southeastern United States using MAIAC AOD retrievals and a two-stage model

    Remote Sens. Environ.

    (2014)
  • B. Huang et al.

    Geographically and temporally weighted regression for modeling spatio-temporal variation in house prices

    Int. J. Geogr. Inf. Sci.

    (2010)
  • Krizhevsky A., Sutskever I., Hinton G., et al. 2012. ImageNet classification with deep convolutional neural networks....
  • M. Kikuchi et al.

    Improved Hourly Estimates of Aerosol Optical Thickness Using Spatiotemporal Variability Derived From Himawari-8 Geostationary Satellite

    IEEE Trans. Geosci. Remote Sens.

    (2018)
  • I. Kloog et al.

    Long- and Short-Term Exposure to PM 2.5 and Mortality: Using Novel Exposure Models

    Epidemiology

    (2013)
  • T. Lin et al.

    Feature Pyramid Networks for Object Detection

    Comput. Vision Pattern Recogn.

    (2017)
  • T. Li et al.

    Point-surface fusion of station measurements and satellite observations for mapping PM2.5 distribution in China: Methods and assessment

    Atmos. Environ.

    (2017)
  • T. Li et al.

    Estimating Ground-Level PM 2.5 by Fusing Satellite and Station Observations: A Geo-Intelligent Deep Learning Approach: Deep Learning for PM 2.5 Estimation

    Geophys. Res. Lett.

    (2017)
  • R. Li et al.

    Estimating ground-level PM2.5 using fine-resolution satellite data in the megacity of Beijing, China

    Aerosol Air Qual. Res.

    (2015)
  • T. Li et al.

    Geographically and temporally weighted neural networks for satellite-based mapping of ground-level PM2.5

    ISPRS J. Photogramm. Remote Sens.

    (2020)
  • Cited by (16)

    • A gap-filling hybrid approach for hourly PM<inf>2.5</inf> prediction at high spatial resolution from multi-sourced AOD data

      2022, Environmental Pollution
      Citation Excerpt :

      Extensive studies have been conducted to model daily PM2.5 at high spatial resolutions from polar-orbiting satellite AOD, including the prediction of PM2.5 concentrations at 1-km using MAIAC AOD (Bi et al., 2019; He et al., 2021; Kloog et al., 2015; Pu and Yoo, 2020; Xiao et al., 2017; Stafoggia et al., 2019). Recently, an increasing number of studies also have attempted to predict hourly PM2.5 using AOD from geostationary satellites (Chen et al., 2019; Jiang et al., 2021; Park et al., 2019; She et al., 2020; Sun et al., 2021; Wei et al., 2021a; Wang et al., 2022). However, it is worth noting that most previous studies, except Jiang et al. (2021), either ignored the missingness of AOD or solely relied on single-platformed AOD, and consequently were not able to maximize the utility of multi-platformed AOD products in the prediction of ambient PM2.5.

    View all citing articles on Scopus
    View full text