Next Article in Journal
Using a Statistical Crop Model to Predict Maize Yield by the End-Of-Century for the Azuero Region in Panama
Next Article in Special Issue
Spatial Distribution of Atmospheric Aerosol Physicochemical Characteristics in the Russian Sector of the Arctic Ocean
Previous Article in Journal
Historic and Modern Air Pollution Studies Conducted in Utah
Previous Article in Special Issue
The CO2 Emissions Drivers of Post-Communist Economies in Eastern Europe and Central Asia
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

VARMA-EGARCH Model for Air-Quality Analyses and Application in Southern Taiwan

by
Edward Ming-Yang Wu
1 and
Shu-Lung Kuo
2,*
1
College of Maritime, National Kaohsiung University of Science and Technology, Kaohsiung 811, Taiwan
2
Department of Technology Management, The Open University of Kaohsiung, Kaohsiung 806, Taiwan
*
Author to whom correspondence should be addressed.
Atmosphere 2020, 11(10), 1096; https://doi.org/10.3390/atmos11101096
Submission received: 4 August 2020 / Revised: 29 September 2020 / Accepted: 12 October 2020 / Published: 14 October 2020
(This article belongs to the Special Issue Air Pollution Estimation)

Abstract

:
This study adopted the Exponential Generalized Autoregressive Conditional Heteroscedasticity (EGARCH) model to analyze seven air pollutants (or the seven variables in this study) from ten air quality monitoring stations in the Kaohsiung–Pingtung Air Pollutant Control Area located in southern Taiwan. Before the verification analysis of the EGARCH model is conducted, the air quality data collected at the ten air quality monitoring stations in the Kaohsiung–Pingtung area are classified into three major factors using the factor analyses in multiple statistical analyses. The factors with the most significance are then selected as the targets for conducting investigations; they are termed “photochemical pollution factors”, or factors related to pollution caused by air pollutants, including particulate matter with particles below 10 microns (PM10), ozone (O3) and nitrogen dioxide (NO2). Then, we applied the Vector Autoregressive Moving Average-EGARCH (VARMA-EGARCH) model under the condition where the standardized residual existed in order to study the relationships among three air pollutants and how their concentration changed in the time series. By simulating the optimal model, namely VARMA (1,1)-EGARCH (1,1), we found that when O3 was the dependent variable, the concentration of O3 was not affected by the concentration of PM10 and NO2 in the same term. In terms of the impact response analysis on the predictive power of the three air pollutants in the time series, we found that the asymmetry effect of NO2 was the most significant, meaning that NO2 influenced the GARCH effect the least when the change of seasons caused the NO2 concentration to fluctuate; it also suggested that the concentration of NO2 produced in this area and the degree of change are lower than those of the other two air pollutants. This research is the first of its kind in the world to adopt a VARMA-EGARCH model to explore the interplay among various air pollutants and reactions triggered by it over time. The results of this study can be referenced by authorities for planning air quality total quantity control, applying and examining various air quality models, simulating the allowable increase in air quality limits, and evaluating the benefit of air quality improvement.

1. Introduction

According to air quality monitoring reports from the Environmental Protection Administration, major pollutants causing the Pollutant Standard Index (PSI) [1] to exceed the air quality standard are particulate matter (PM10) and ozone (O3). Taiwan’s air quality data are released to the public in the form of pollution standards index (PSI) values, following the procedure identical to that of the United States Environmental Protection Agency. The nation’s PSI was formulated and promulgated in 1994 by the Environmental Protection Administration (EPA) based on the daily concentrations of five air pollutants: particulate matter with particles below 10 microns (PM10), sulfur dioxide (SO2), nitrogen dioxide (NO2), carbon monoxide (CO), and ozone (O3). Air quality in central and southern Taiwan has deteriorated, especially after large-scale pollution sources were set up one after another. As population density continues to rise in tandem with the development of industries and transportation, it results in many pollutants affecting humans’ habitats; consequently, people have become more aware of pollution treatments [2,3]. Among the treatments, air quality management stands out as an imperative issue. Since air is highly dispersive or can be transported long range, if the total emission tolerance of pollutants in one area is not kept below that area’s capacity, the pollutants will not only influence the air quality in that area, but also in adjacent regions [4,5].
Air pollution is a well-known environmental problem associated with urban areas around the world [6,7]. Various monitoring programs have been used to determine air quality by generating vast amounts of data on the concentration of each of the previously mentioned air pollutant in different parts of the world. The large data sets often do not convey air quality status to the scientific community, government officials, policy makers, and in particular to the general public in a simple and straightforward manner [5]. Up to now, PSI has been developed and disseminated by many agencies in the U.S. Canada, Europe, Australia, China, Indonesia, Taiwan, etc.
Models analyzing the degree of fluctuation can largely be divided into two categories: predictable volatility and unpredictable volatility (or stochastic volatility). The former belongs to the Generalized Autoregressive Conditional Heteroscedasticity (GARCH) group (which includes the exponential GARCH (EGARCH) model adopted in our study), and can predict the fluctuations in the current term using all of the information acquired from the previous term. The biggest difference between the two models lies in their estimation procedures [8,9]. The EGARCH model is similar to the GARCH model as it can also capture other properties of a financial time series, such as volatility clustering. If the fluctuation rate at t-1 is high, it will be high at t as well. In other words, an impulse occurring at t-1 also has an effect on the fluctuation rate at t. Moreover, since the value adopted in the formula is the logarithm of the variance rather than the variance per se, there is no need to impose any limits on the parameters in the EGARCH model. As a result, the EGARCH model automatically meets the requirement that the variance must be positive, which is one of its main advantages. In general, applying maximum likelihood estimation under conditions allowed by the model helps to increase the optimization speed and makes the optimization results more reliable [8,10].
This study analyzed the data obtained from the ten automatic air quality monitoring stations in the Kaohsiung–Pingtung Air Pollutant Control Area via factor analysis for multivariate statistics, in order to determine the air pollution factors that most influenced the air quality in the Kaohsiung–Pingtung area, namely the photochemical pollution factors (PM10, O3, and NO2). Next, on the condition that a standardized residual existed, this study explored how air quality in the Vector Autoregressive Moving Average (VARMA)-EGARCH model fluctuated according to seasons. Since the analysis of results can reflect the relationships among air pollutants in real time, the authorities can refer to the results when applying and examining various air quality models, simulating the allowable increase in air quality limits, and evaluating the benefits of air quality improvement.

2. Experimental Method and Methodology

2.1. Selection of Air Quality Monitoring Stations

The Kaohsiung–Pingtung Air Pollutant Control Area, established by the Environmental Protection Administration in southern Taiwan, consists of ten ordinary air quality monitoring stations (Meinong station, Nanzi station, Qianjin station, Renwu station, Zuoying station, Xiaogang station, Daliao station, Linyuan station in Kaohsiung, and Pingtung station and Chaozhou station in Pingtung). In these ten ordinary air quality monitoring stations, Meinong, Pingtung, Daliao, and Chaochou are located in sub-urban belonging to non-industrial zone; Nanzi, Renwu, Zuoying, Xiaogang and Linyuan belong to industrial zone; Cianjin is located in the urban area.
The ten air quality monitoring stations are located on the Pingtung Plain, scattered across areas from 30 to 100 m above sea level. The mean annual temperature of the area is from 24 to 25 °C; the average temperature of the warmest months (July to August) exceeds 30 °C, while that of the coldest months (January to February) remains above 18 °C. The average annual rainfall is between 1500 and 2000 mm. We chose the Kaohsiung–Pingtung Air Pollutant Control Area because this region includes Kaohsiung City, which suffers from the most severe air pollution in Taiwan. The city, home to approximately 2.77 million people, has eight industrial areas and a large steel plant. We chose this Air Pollutant Control Area as our research subject because the air quality is usually poor due to its industrial activities in addition to the large number of cars and motorcycles in the city.
We collected 420 complete air pollutant monitoring statistics between 1 January 2019 and 30 May 2020 and analyzed seven pollutants (or seven variables): SO2, NO2, CO, PM10, O3, THC, and CH4 (please refer to Figure 1 for the locations of the ten air quality monitoring stations chosen for this study). These statistics were acquired by analyzing data collected over 24 h by the EPA’s autonomous monitoring stations in the area. The objective in studying the statistics of up to one and a half years is to reflect the temporality effect on air quality; thus, we were able to observe the EGARCH model during different periods with different observation frequencies and elucidate whether variables in different time series present different information. The tool adopted for corroboration was EVIEWS 10.0

2.2. Data Selection and Compilation

Prior to the analysis of the EGARCH model, we analyzed three factors using factor analyses in multiple statistical analyses that influences air quality in the Kaohsiung–Pingtung area: the photochemical pollution factor, the fuel pollution factor, and the organic pollution factor, employing factor analysis for multivariate statistics. Next, we selected the photochemical pollution factor, which is the most influential factor determining air quality in the Kaohsiung–Pingtung area, to study the VARMA-EGARCH model.

2.3. ARIMA Modeling

A time series { x t ; t = 0 , ± 1 , ± 2 , ..... } is ARMA (p, q) if it is covariance stationary and can be represented as
x t = φ 1 x t 1 + + φ x p t p + ε t + θ 1 ε t 1 + + θ q ε t q
where φ p ≠ 0, θ q ≠ 0, and ε t are the innovations with N(0, σ ε 2 ) and σ ε 2 > 0. The parameters p and q are called the autoregressive [AR(p)] and the moving average [MA(q)] orders, respectively. When a time series does not appear covariance stationary, the differencing procedure may be applied to make it stationary. Then, the ARMA (p, q) model can be applied to the stationary differenced time series and model so constructed is called ARIMA (p, d, q) model where d denotes the order of differencing [11,12]. The parameters φ and θ have been estimated using maximum likelihood method in the present study.
An inspection of autocorrelation function (ACF) and partial autocorrelation function (PACF) helps in identifying the orders AR(p) and MA(q). In addition, more objectively defined criterions such as Akaike Information Criterion (AIC), Hannan–Quinn Information Criterion (HQIC), Bayesian Information Criterion (BIC) and Final Prediction Error (FPE) can also be used to identify the correct orders p and q [12,13].

2.4. ARMA-EGARCH Modeling

Conditional mean formula:
  r t = a 0 + i = 1 m a i r t i + j = 1 n b j Ɛ t j   ,   Ɛ | Ω t 1 ~ N ( 0 , h t )
Conditional variance formula:
ln ( h t )   =   α 0 + i = 1 q ( γ i Ɛ t i h t i + α i ( | Ɛ t i | h t i E ( | Ɛ t i | h t i ) ) ) + j = 1 p β j ln ( h t j )
Ɛ t i h t i > 0 represents good news on the market while Ɛ t i h t i < 0 reflects bad news on the market; where, γi = 0 indicates that the degree of air pollution has a symmetrical effect in reaction to news impact.
If γi < 0, it suggests that the degree and fluctuation of air pollution incurred by air pollution shock are more significant than the situation when air pollution shock does not result in air pollution; in other words, a leverage effect can be observed when γi < 0. To determine whether the three GARCH models are applicable to the time series analysis, we first needed to confirm whether the time series has an ARCH effect; this was tested in this study using the Lagrange Multiplier (LM) test proposed by Engle [14]. Since it is necessary for the residual of the conditional mean formula in the GARCH model to follow the white noise process, all possible orders (p, q) of the GARCH model must be compared with each other via trial and error. To determine whether or not residuals reached white noise, we later adopted the modified Q-statistics proposed by Ljung and Box [15] to analyze the residual of each series. However, the p and q orders of the model selected via trial and error might be subject to over-fitting. To solve this issue, we used the Akaike Information Criteria (AIC) [16] and Schwartz Bayesian Criterion (SBC) [17] to analyze the model under the principle of parsimonious parameterization.

2.5. Setting of the Model

This study used the following statistical principles and methods to conduct its simulation and predictive modeling for the selected photochemical pollution factors. The purpose was to explain the essence and meanings of the fat tail test, the Ljung–Box sequence test, and the ARCH test.

2.5.1. Fat Tail Test

The main argument in this section is related to the last argument of the previous section—in the light of a non-explained empirical fact, it is preferable to try to describe it first with the available tools before increasing the model complexity. There are two main approaches to explain fat tails through Gaussian-based models. First, the fat tail test appears because the volatility of asset returns is dynamic. Therefore, looking at the unconditional distribution as described by histogram plots or kernel density plots, we can see heavy tails but they are mostly due to the time varying volatility. Second, fat tails appear because asset returns depend, in a non-linear fashion, on other factors which are distributed according to a Gaussian law. Results of examining the skewness, kurtosis, and Jarque–Bera normal distribution can be used for determining whether the distribution of modeling errors has fat tails.

2.5.2. Ljung–Box Sequence Test

It was necessary to test whether the residual items in the regression model have sequence correlation before estimating the ARCH and (E)GARCH models. If the residual items have sequence correlation, the squared residual items will be examined to see if it has an ARCH effect. As such, it is very important to check if the residual items have sequence correlation before estimating the ARCH and (E)GARCH models.

2.5.3. Examination of the ARCH Effectiveness

If the standard deviation of a time series is stationary, we can say that the variance of the time series is homogeneous, or heterogeneous, or vice versa. Before applying data to a heterogeneous variance model, we should examine data to see if they contain heterogeneous variance. This study adopted the Lagrange Multiplier (LM) proposed by Engle (1982) and Ljung and Box Q statistics [14] to test whether the variance of the time series is heterogeneous.

2.5.4. Impact Response Analyses

The impulse response function (IRF) is used to study the impact of the structural disturbances on the variables in a VAR model over time; in other words, other variables’ dynamic response pattern to an exogenous impulse when the said impulse impacts a variable. This study adopted the Cholesky method in dealing with orthogonalization, in order to analyze the intertemporal dynamic effects of indices across nations in the model, and the degree of dynamic interplay among those indices. The formulae are as follows:
Y t = u + i = 0 Φ i ε t i Φ i   =   I ,   i   =   1 ,   2 ,   ,  
In formula (3), i represents the variable responding to an impulse. Error εt−1 can be interpreted as unexpected impulses in ti. To examine how an impulse of a one-term delay influences the current term variables, one can refer to Formula (4).
Ə Y t Ə Ɛ t 1 = Φ j , k , z
In the above formula, the coefficient factor on row j, line k, formed a consecutive function over time, called the “impulse response function”.
This study adopted the AIC proposed by Akaike [16] to select suitable lagged variables, which were represented as follows:
AIC(m) = T × ln(SSR/T) + 2m
where, m indicates the number of variables in the model, T represents the number of samples of the three air pollutants, and SSR shows the sum of squares of the error terms.

3. Results and Discussion

3.1. Application of the Results of Factor Analysis

As mentioned above, this study divided air quality in the Kaohsiung–Pingtung area into three major factors according to the results of the factor analysis for multivariate statistics. It then chose the most important factor, the photochemical pollution factor, as its subject. This factor included three air pollutants: PM10, O3, and NO2 (listed by the degree of their factor loadings). Moreover, since the data series had seasonality and cyclicity, the seven pollutant variables were standardized during factor analysis. The formula for standardization is as follows:
Z ν , t = Y v , t μ t σ t   t   =   1 ,     w
where, Yv,t represents the original series, and μt, σt indicate the average and standard deviation in period 1~w.

3.2. Simulations of the Photochemical Pollution Factor with Models

3.2.1. Analysis of the Basic Properties of the Three Air Pollutants

The three air pollutants selected for the photochemical pollution factor of this study were PM10, O3, and NO2. Table 1 presents the analysis results of the three air pollutants’ basic properties between 1 January 2019 and 30 May 2020, including: average, standard deviation, skewness, kurtosis, and statistics obtained from the Jarque–Bera normality test. Firstly, in terms of kurtosis, the kurtosis coefficients of the three air pollutants were larger than the coefficient of normal distribution (which was 2), suggesting that each variant had the characteristic of a seasonal time series (in other words, pollutant concentrations vary according to seasons). In terms of skewness, the three pollutants’ skewness leaned to the right (meaning that their skewness coefficients were all positive). Among them, NO2 had the highest value of 4.3 when compared with PM10 and O3, which had values of 2.3 and 0.8, respectively (the smaller the value, the more stable the concentration). This result indicates that the concentration of NO2 normally remains low. However, many statistics showed sudden spikes in the past, which corresponded to the results of Kuo and Ho [18]. This corroborated with the view that NO2 had the strongest asymmetric effect, meaning that the concentration and degree of fluctuation of NO2 formed in Kaohsiung were lower than those of the other two pollutants. Since NO2 is least affected by the change of seasons, it is more difficult to predict its concentration fluctuations. Air pollutants that are non-compliant with the air quality standard in Kaohsiung and Pingtung are mainly PM10 and O3; hence, when the concentrations of PM10 and O3 increase, air pollution becomes more severe, especially during the winter. This result is reflected by the skewness of the two in Table 1, which is not high. Regarding NO2, an increase in its concentration can only be observed in Kaohsiung and Pingtung during certain periods in winter and early spring; since the usual contribution of NO2 to air pollution was less than PM10 and O3, it had a higher skewness coefficient. Kaohsiung City, home to approximately 2.77 million people, has eight industrial areas and a large steel plant. Air quality is usually poor due to the industrial activities in those areas and the large number of cars and motorcycles in the region. In addition, the Kaohsiung–Pingtung area is susceptible to pollutants from external sources brought by the north-eastern seasonal wind, and pollutants traveling from northern and central Taiwan. To make matters worse, pollutants are trapped in the area by an inversion caused by the seasonal wind as it descends from the Central Mountain Range. In addition, emissions from factories and cars concentrated in Greater Kaohsiung make poor air quality more likely in the region during winter. Base on the above reasons, the air pollution in this area has been quite serious over the years.
Table 1 also shows that the statistics obtained from the Jarque–Bera normality test are higher than the critical value (degree of freedom is 2, X 0.05 , 2 2 = 5.99) at a 5% significance level, reflecting the hypothesis that all variants refuse normal distribution. This means that all the three air pollutants had two fat tails, which proves that seasonality had a significant effect on them.

3.2.2. Examination of ARCH Effectiveness

The LM (Lagrance Multiplier) statistics [19] can be applied to examine whether the ARCH effect exists in a number sequence. The LM statistics is TR2 with T being the number of samples, and R2 being the determination coefficient value obtained using the ordinary least squares (OLS) regression; T × R2 obeys the chi-square distribution with P degrees of freedom. When the model LM statistics is obvious, the ARCH effect exists in the number sequence. The statistics of three air pollutants listed in Table 2 indicates that the conditional variance of the three air pollutants shows a strong ARCH effect (that is, all T × R2 values were significant at a 5% significant level). Hence, the ARCH effectiveness is appropriate for explaining these three air pollutants.

3.2.3. Ljung–Box Sequential Examination

Whether the residual in the regression model has serial correlation must be examined before the ARCH and EGARCH models are estimated. If the residual has serial correlation, the square of residual appears to have the ARCH and EGARCH effects. In this study, such examination was carried out using the Ljung–Box examination method; the results are listed in Table 3. All the examination statistics for L-B-Q (K) are smaller that the critical value so that null hypothesis, which is not conforming to alternative hypothesis, cannot be rejected. Hence, residuals of the various number sequences do not have serial correlation; this confirms to the phenomenon of white noise so that the model disposition is appropriate for these 3 air pollutants.
To run the Ljung Box test by hand, we must calculate the statistic Q. For a time series γ of length n:
Q ( m ) = n ( n + 2 ) j = 1 m γ j 2 n j
where: γj = the accumulated sample autocorrelations, m = the time lag.
We reject the null hypothesis and say that the model shows lack of fit if
Q   >   X 1 α , h 2
where X 1 α , h 2 is the 1 − α quantile of the chi-square distribution with h degrees of freedom.

3.2.4. Choosing the Best EGARCH Model

In order to subsequently simulate the best VARMA-EGARCH model, we tested different combinations with vector models VARMA and EGARCH. We chose the best one from the multiple VARMA (p, q)-EGARCH (p, q) models we created for the simulated analysis. Table 4 presents the analysis results. Among the models, VARMA (1,1)-EGARCH (1,1), the one with the smallest AIC and SC values, was chosen as the best model because it could best capture the fluctuations of air quality in different seasons.

3.3. Simulation Results of the Photochemical Pollution Factor VARMA (1,1)-EGARCH (1,1) Model

According to the results of an extra factor analysis for multivariate statistics, factors under the photochemical pollution factor can be listed, with their factor loadings from big to small, as PM10 > O3 > NO2. Photochemical reactions in the atmosphere are mainly triggered by radiation given off by the sun. Once pollutants (or precursors) absorb photons and enter an electronically excited state, they react with other pollutants and form O3. In light of this phenomenon, this study designated O3 as the dependent variable and PM10 and NO2 as independent variables and applied them to the VARMA (1,1)-EGARCH (1,1) model, in order to study how they change in the time series. Table 5 presents the relativity and trend of concentration of PM10, O3, and NO2 in the time series under simulation with the VARMA (1,1)-EGARCH (1,1) model. This vector model can prevent bias when estimating conditional variance because it reflects the structural changes of the three air pollutants in different seasons.
The simulation results in Table 5 show that the concentration of O3 (the t-statistic of b0 was −0.75, meaning that it did not reach the significance level since it was <1.96) in the term could not be estimated according to the concentration of PM10 in the same term. In contrast, the concentration of PM10 with a lag time of one and two terms influenced the concentration of O3 in the current term (the t-statistics of b1 and b2 were 2.56 and 2.04, respectively, meaning that they reached the significance level since they were >1.96). In terms of NO2, we found that the concentration of NO2 in the current term could not be used to estimate the concentration of O3 in the same term (the t-statistic of c0 was 0.32, meaning that it did not reach the significance level since it was <1.96). However, the concentration of NO2 with a lag time of one term influenced the concentration of O3 in the current term (the t-statistic of c1 was 3.79, meaning that it reached the significance level since it was >1.96). Moreover, the concentration of NO2 with a lag time of two terms still influenced the concentration of O3 in the current term (the t-statistic of C2 was 2.92, meaning that it reached the significance level since it was >1.96). However, the effect of NO2 with a lag time of two terms on the concentration of O3 was not as remarkable as that of NO2 with a lag time of one term. From the above analyses of the three air pollutants, we found that the concentration of O3 was not affected by that of PM10 and NO2 in the same term, but after a lag time of one term, the concentration of O3 was influenced by PM10 and NO2 with a lag time of one to two terms. The results help to explain why PM10 and NO2 are the most important precursors for photochemical reactions in the atmosphere. Upon being released into the air from various sources of emission, PM10 and NO2 do not immediately start photochemical reactions with the sun; rather, they only start reacting with the sun after a one-term lag and form the final product, O3. The photochemical reactions continue into the second term because PM10 and NO2 last longer in the atmosphere, thus being able to produce O3 in the second term. However, from the simulation results in Table 5, it is found that the t-statistics of b2 and c2 (2.04 and 2.92, respectively) with a lag time of two terms reached the significance level, but their significance was less remarkable than that of b1 and c1 (2.56 and 3.79, respectively) with a lag time of one term. From the results, we can conclude that when PM10 and NO2 form in the atmosphere, we cannot use their concentrations to estimate the concentration of O3 in the current term. Rather, we must wait until they have been involved in photochemical reactions for a while, as that is when O3 starts to form. In other words, only PM10 and NO2 with one- or two-term lag can influence the formation of O3 in the current term. Furthermore, one-term lagged PM10 and NO2 have stronger impacts on the concentration of O3 than those with a lag time of two terms. Regarding O3, its concentration in the current term was significantly impacted by O3 with a lag of one term (the t-statistic of a1 was 7.03, meaning that it reached the significance level since it was >1.96), but was not significantly impacted by O3 with a lag of two terms (the t-statistic of a2 was 1.14, meaning that it did not reach the significance level since it was <1.96). These results suggested that although the concentration of O3 in the current term is affected by that of O3 with a one-term lag, the concentration in the current term is less impacted by O3 with a two-term lag, as photochemical reactions start to decrease when the day proceeds into the evening. In this study, Kaohsiung–Pingtung Air Pollutant Control Area is located in the tropics. The annual average temperature is greater than 20 °C, and the photochemical reaction is the most obvious from noon to afternoon and in summer. In addition, the degree of air pollution generated in this area is also the most serious from noon to afternoon. As a result, O3 and PM10 concentration are usually higher during this time.
In regard to the analysis of γ i , t in the EGARCH model, the t-statistic of γ 1 in the VARMA (1,1)-EGARCH (1,1) model was a significant at 3.13, with a value of −0.082 (<0). This result suggests that when the concentrations of PM10 and O3 change drastically in winter, air quality in Kaohsiung and Pingtung will become noncompliant with the air quality standard. The Kaohsiung–Pingtung area is susceptible to air pollution because it is a populous industrial hub located in southern Taiwan. It has a large number of cars and scooters, as well as facilities that release pollutants regularly (namely chimneys). During winter, air quality is particularly prone to being noncompliant with the air quality standard set by the Environmental Protection Administration due to high PM10 concentration. On the other hand, air pollutants quickly disperse in the air in summer; thus, there are fewer days with poor air quality than in the winter. This phenomenon reflects the leverage effect (when γ 1 < 0) in the EGARCH model [20]. Taiwan has done a magnificent job in containing COVID-19 and has received critical acclaim globally. The pandemic barely has any impact on businesses and industries in the nation. According to statistics up to the end of August 2020, there were only 493 confirmed cases, and only seven people died from COVID-19, a tiny fraction of the nation’s 23 million population. The monitoring stations mentioned in this study were still operating during the pandemic and were not compromised by COVID-19 at all. That is, during this period, the monitoring results did not show any abnormal phenomena.

3.4. The Predictive Power of the VARMA (1,1)-EGARCH (1,1) Model on the Three Air Pollutants

Based on an impact responses analysis, we applied the VARMA (1,1)-EGARCH (1,1) model to estimate the three air pollutants of the photochemical pollution factor, in order to test the predictive power and degree of the model under the influence of leverage effect (Table 5 shows that γ 1 < 0). Figure 2, Figure 3 and Figure 4 display the estimated concentrations of the three air pollutants with the first 400 data from the sequence. The estimation results indicated that the correlation coefficients (r) of the predictive power of PM10, O3, and NO2 were 0.832, 0.814, and 0.773, respectively, fully reflecting the analysis results mentioned previously (Kuo and Ho, 2018). In other words, the concentration of NO2 did not change as drastically as the other two pollutants because NO2 was less affected by the change of seasons, suggesting that it had the strongest asymmetric effect, or NO2 has the least influence on the GARCH effect when its concentration fluctuations are triggered by the change of seasons, which makes it more difficult to predict its concentration fluctuations. Moreover, the EGARCH model could effectively reduce bias resulting from conditional heteroskedasticity when estimating correlation coefficients. The model enabled the presentation of diverse forms when it comes to dynamic variance modeling, and could accurately estimate conditional variances among the three air pollutants.

4. Conclusions

This study adopted the VARMA-EGARCH model to explore the changes and influences of variation of air pollutants in the time series. This approach is unprecedented because there are barely any published studies on how multiple air pollutants change and influence each other in an Air Pollutant Control Area with the EGARCH model. Moreover, since the parameters of the traditional multivariate GARCH model are too complex, while the multivariate GARCH model cannot fully represent multivariate analysis results, we applied dummy variables from the EGARCH model to the variant formula to enable it to accurately estimate the variances of the three air pollutants and capture the seasonal fluctuations of air quality in the VARMA-EGARCH model. This study also applied the VARMA (1,1)-EGARCH (1,1) model to explore the relativity and trend of concentration of PM10, O3, and NO2 in the time series. When PM10 and NO2 form in the atmosphere, their concentrations cannot be used to estimate the concentration of O3 in the current term. Rather, we must wait until they have been involved in photochemical reactions for a while, as that is when O3 starts to form. In other words, only PM10 and NO2 with one- or two-term lag can influence the formation of O3 in the current term; one-term lagged PM10 and NO2 have stronger impacts on the concentration of O3 than those with a lag time of two terms. Finally, the estimation of the three air pollutants using the VARMA (1,1)-EGARCH (1,1) model indicated that the concentration of NO2 did not change as drastically as the other two pollutants because NO2 was less affected by the change of seasons. This suggested that since NO2 had the strongest asymmetric effect, it was thus more difficult to predict its concentration fluctuations.
We adopted the Pollution Standards Index for evaluating the degree of air pollution. On the other hand, the Air Pollution Index (API) is based on the Ambient Air Quality Standard GB 3095-1996 and only assesses SO2, NO2, and PM10. It was abolished and supplanted by the Air Quality Index (AQI), which is based on GB 3095-2012 and evaluates SO2, NO2, PM10, PM2.5, O3, and CO; the AQI is mainly adopted for studying the degree of influence of air quality on human health. We did not study the variation of PM2.5 concentration because PM2.5 was included in PM10; the subject of this study was not air pollutants’ impact on human health, but rather the formation mechanism of various air pollutants and the interplay among these pollutants over time.
In this study, the EGARCH model made it possible to track the degree of change of air pollutants in each time series in real time. It also took the heterogeneity of each air pollutant variant into consideration, an aspect ignored in past studies. The research findings can serve as a reference for the government when it comes to the application and certification of air quality models, simulation of models for a maximum allowable limit of increments, evaluation of air quality improvement schemes, and planning of strategies. In addition, the results of this study can also be referenced by authorities for planning air quality total quantity control, applying and examining various air quality models, simulating the allowable increase in air quality limits, and evaluating the benefit of air quality improvement. Furthermore, we will refer to the API and AQI methods if we are going to delve deeper into studies related to air pollution in the future and could be useful in the future extension of their work.

Author Contributions

E.M.-Y.W. formulates the research of direction of this study clearly. In addition, he collects and compiles data of ten air quality monitoring stations in the Kaohsiung–Pingtung Air Pollutant Control Area. S.-L.K. mainly carries out the execution of EVIEWS 10.0 and evaluates the variations of photochemical pollution factors and the pattern of mutual influences for the various air pollution species with respect to time series. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. EPA (Environmental Protection Agency). Measuring Air Quality: The Pollutant Standards Index; EPA 451/K-94-001; EPA: Washington, DC, USA, 1994.
  2. Hsu, C.H.; Cheng, F.Y. Synoptic weather patterns and associated air pollution in Taiwan. Aerosol Air Qual. Res. 2019, 19, 1139–1151. [Google Scholar] [CrossRef] [Green Version]
  3. Balram, D.; Lian, K.Y.; Sebastian, N. Air quality warning system based on a localized PM2.5 soft sensor using a novel approach of Bayesian regularized neural network via forward feature selection. Ecotoxicol. Environ. Saf. 2019, 182, 109386. [Google Scholar] [CrossRef] [PubMed]
  4. Wang, C.W. Effects of Automobile Tailpipe in the Hsuehshan Tunnel of on the Air Quality of Neighboring Areas Using ADMS Model. Master’s Thesis, Environmental Engineering Research Institute, National Sun Yat-Sen University, Kaohsiung, Taiwan, 2009. (In Chinese). [Google Scholar]
  5. Kuo, S.L.; Wu, E.M.Y. Highway tunnel air quality assessment using multivariate statistical classification via factor, cluster, and discriminant analysis. Int. J. Eng. Technol. 2018, 7, 287–291. [Google Scholar]
  6. Balram, D.; Lian, K.Y.; Sebastian, N. A novel soft sensor based warning system for hazardous ground-level ozone using advanced damped least squares neural network. Ecotoxicol. Environ. Saf. 2020, 205, 111168. [Google Scholar] [CrossRef] [PubMed]
  7. Maurer, M.; Klemm, O.; Lokys, H.L.; Lin, N.H. Trends of fog and visibility in Taiwan: Climate change or air quality improvement? Aerosol Air Qual. Res. 2019, 19, 896–910. [Google Scholar] [CrossRef] [Green Version]
  8. Hansen, B.E. Autoregressive conditional density estimation. Int. Econ. Rev. 1994, 35, 705–730. [Google Scholar] [CrossRef]
  9. Wang, K.L.; Fawson, C.; Barrett, C.B.; McDonald, J. A flexible parametric GARCH model with an application to exchange rates. J. Appl. Econ. 2001, 16, 521–536. [Google Scholar] [CrossRef] [Green Version]
  10. Arestis, P.; Caporale, G.; Cipollini, A. Testing for Financial Contagion between developed and emerging markets during the 1997 East Asian crisis. Int. J. Finance Econ. 2005, 10, 359–367. [Google Scholar] [CrossRef] [Green Version]
  11. Shumway, R.H.; Stoffer, D.S. Time Series Analysis and Its Applications—With R Examples; Springer Science Business Media, LLC: Heidelberg, Germany, 2006. [Google Scholar]
  12. Brockwell, J.B.; Davis, R.A. Introduction to Time Series and Forecasting; Springer Inc.: New York, NY, USA, 2002. [Google Scholar]
  13. Kumar, U.; Prakash, A.; Jain, V.K. A multivariate time series approach to study the interdependence among O3, NOx and VOCs in ambient urban atmosphere. Environ. Model. Assess. 2008, 14, 631–643. [Google Scholar] [CrossRef]
  14. Engle, R. Autoregressive conditional heteroskedasticity with estimates of the variance of U.K. inflation. Econometrica 1982, 50, 987–993. [Google Scholar] [CrossRef]
  15. Ljung, G.M.; Box, G.E.P. On a measure of lack of fit in time series models. Biometrika 1978, 65, 297–303. [Google Scholar] [CrossRef]
  16. Akaike, H. A new look at the statistical model identification. IEEE Trans. Autom. Control 1974, 19, 716–723. [Google Scholar] [CrossRef]
  17. Schwartz, G.E. Estimating the dimension of a model. Ann. Stat. 1978, 6, 461–464. [Google Scholar] [CrossRef]
  18. Kuo, S.L.; Ho, C.L. The assessment of time series for an entire air quality control district in southern Taiwan using GARCH model. Int. J. Eng. Technol. 2018, 7, 119–124. [Google Scholar]
  19. Lee, C.L.; Lee, R.K.; Kao, Y.M. Lagrange Multiplier Optimization Synthesis of Long-Period Fiber Gratings. In Proceedings of the 2007 Asia Optical Fiber Communication and Optoelectronics Conference, Shanghai, China, 17–19 October 2007. [Google Scholar]
  20. Forbes, K.; Rigobon, R. No contagion, only interdependence: Measuring stock market co-movements. J. Financ. 2002, 5, 2223–2261. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure 1. Air quality monitoring locations in Kaohsiung–Pingtung area, Taiwan.
Figure 1. Air quality monitoring locations in Kaohsiung–Pingtung area, Taiwan.
Atmosphere 11 01096 g001
Figure 2. Estimation of the concentration of PM10 with the VARMA (1,1)-EGARCH (1,1) model.
Figure 2. Estimation of the concentration of PM10 with the VARMA (1,1)-EGARCH (1,1) model.
Atmosphere 11 01096 g002
Figure 3. Estimation of the concentration of O3 with the VARMA (1,1)-EGARCH (1,1) model.
Figure 3. Estimation of the concentration of O3 with the VARMA (1,1)-EGARCH (1,1) model.
Atmosphere 11 01096 g003
Figure 4. Estimation of the concentration of NO2 with the VARMA (1,1)-EGARCH (1,1) model.
Figure 4. Estimation of the concentration of NO2 with the VARMA (1,1)-EGARCH (1,1) model.
Atmosphere 11 01096 g004
Table 1. Basic properties of the three pollutants in the photochemical pollution factor.
Table 1. Basic properties of the three pollutants in the photochemical pollution factor.
ItemsO3 (ppb)PM10 (μg/m3)NO2 (ppb)
Mean35.78363.1
Median32.67859.3
Maximum204427488
Minimum8.62016.6
Std. Dev.0.80.80.4
Kurtosis7.511.76.1
Skewness0.82.34.3
Jarque–Bera25182054796
Probability0.00.00.0
Sum Sq. Dev.112.6124.488.3
Observations420420420
Table 2. Results of Autoregressive Conditional Heteroscedasticity (ARCH) effectiveness for 3 air pollutants.
Table 2. Results of Autoregressive Conditional Heteroscedasticity (ARCH) effectiveness for 3 air pollutants.
QO3 (ppb)PM10 (μg/m3)NO2 (ppb)Critical Value
(Lagged Variables)(TR2)(TR2)(TR2) x ( 0.05 , k ) 2
115.57356.21140.363.84
218.98367.29154.225.99
321.22381.00163.877.82
424.58390.54179.789.49
530.16393.58192.6511.07
633.20402.63216.6012.59
736.61411.29231.4714.07
838.14416.85249.5415.51
939.52438.20266.3316.92
1042.65468.13289.5119.68
Note: All TR2 values less than 5% indicate “obviousness”.
Table 3. Ljung–Box Sequence Test for 3 air pollutants.
Table 3. Ljung–Box Sequence Test for 3 air pollutants.
L-BQ (K)O3 (ppb)PM10 (μg/m3)NO2 (ppb)Critical Value
x ( 0.05 , k ) 2
11.920.840.573.84
22.622.010.955.99
35.183.362.647.82
46.795.694.269.49
59.007.176.6111.07
610.488.608.4712.59
712.4611.4310.6914.07
812.8912.5211.1215.51
913.6213.7712.0616.92
1016.3115.9512.9518.31
Table 4. Analysis results of the best combination of photochemical pollution factor Vector Autoregressive Moving Average (VARMA) model and Exponential Generalized Autoregressive Conditional Heteroscedasticity (EGARCH) model.
Table 4. Analysis results of the best combination of photochemical pollution factor Vector Autoregressive Moving Average (VARMA) model and Exponential Generalized Autoregressive Conditional Heteroscedasticity (EGARCH) model.
EGARCH TypeEGARCH (0.1)EGARCH (0.2)EGARCH (1,1)EGARCH (2,1)
Vector Model AICSCAICSCAICSCAICSC
VARMA (1,0)8.1218.1368.0538.0627.7138.0207.9527.993
VARMA (2,0)8.0548.1017.9598.0167.6207.7958.0518.071
VARMA (0,1)8.0128.0777.9948.0237.6027.7937.9528.032
VARMA (0,2)7.9537.9987.9217.9657.5777.7087.8938.011
VARMA (1,1)7.9027.9267.8657.9157.5237.5427.6217.779
VARMA (2,1)7.8837.9037.8397.8687.5317.6037.6597.724
Table 5. Parameter estimation during the process when photochemical pollution factor VARMA (1,1) was paired with EGARCH (1,1).
Table 5. Parameter estimation during the process when photochemical pollution factor VARMA (1,1) was paired with EGARCH (1,1).
Vector Modela0a1a2b0b1b2c0c1c2d1α0α1α2β1γ1
VARMA (1,0)0.96−2.31 3.1431.03 0.510.62−1.120.182.63−1.690.560.26−0.163
t-statistic1.240.96−0.772.14−3.14−1.550.780.660.533.221.164.554.42
VARMA (2,0)−3.441.99 0.873.46−2.590.97−1.520.531.320.642.211.580.33−0.035
t-statistic−1.252.551.57−1.961.412.640.321.630.09−3.111.14−0.762.142.56
VARMA (0,1)1.97 1.141.252.140.163.021.191.843.21−0.341.032.3−0.321
t-statistic0.94−3.63−0.882.45−1.162.461.51−0.712.650.09−2.031.991.93
VARMA (0,2)2.01 0.963.03 1.130.061.59−0.160.920.560.311.55
t-statistic2.510.942.170.88−2.123.121.221.17−2.77−0.264.02
VARMA (2,1)−0.292.0162.232.311.33.01−0.631.590.862.07−0.172.031.14−1.67
t-statistic1.383.46−1.65−1.102.051.161.542.621.142.63−0.33−1.680.822.31
VARMA (1,1)1.371.135.14−2.420.073.410.962.190.171.565.680.710.525.26−0.082
t-statistic0.317.031.14−0.752.562.040.323.792.923.413.022.650.78.163.13
Note: (1) O3 = ao + a1O3(t−1) + a2O3(t−2) + b0PM10(t) + b1PM10(t−1) + b2PM10(t−2) + c0NO2(t) + c1NO2(t−1) + c2NO2(t−2) + d1εt−1 ht = α0 + α1 ε t 1 2 + α2 ε t 2 2 + β1ht−1 with εt~N(0,1). A large ε t 1 2 or ht−1 gives rise to a large σ t 2 . This means that a large ε t 1 2 tends to be followed by another large ε t 2 , generating, again, the well-known behavior of volatility clustering in time series. (2)   γ i , t = c i 0 + j = 1 p i b i j γ i , t j + u i , t . (3) If γi = 0, it indicates that air quality has a symmetrical effect on the reaction to news shock. If γi < 0, it suggests that air quality has a leverage effect on the reaction to news shock. (4) A t-statistic value ≥ 1.96 means that the parameter reaches the significance level. In contrast, a t-statistic value < 1.96 suggests that the parameter does not reach the significance level.

Share and Cite

MDPI and ACS Style

Wu, E.M.-Y.; Kuo, S.-L. VARMA-EGARCH Model for Air-Quality Analyses and Application in Southern Taiwan. Atmosphere 2020, 11, 1096. https://doi.org/10.3390/atmos11101096

AMA Style

Wu EM-Y, Kuo S-L. VARMA-EGARCH Model for Air-Quality Analyses and Application in Southern Taiwan. Atmosphere. 2020; 11(10):1096. https://doi.org/10.3390/atmos11101096

Chicago/Turabian Style

Wu, Edward Ming-Yang, and Shu-Lung Kuo. 2020. "VARMA-EGARCH Model for Air-Quality Analyses and Application in Southern Taiwan" Atmosphere 11, no. 10: 1096. https://doi.org/10.3390/atmos11101096

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop