Interval Estimation of Random Coefficient Integer-Valued Autoregressive Model Based on Mean Empirical Likelihood Method

Xu, Xianghong; Wang, Dehui; Zhao, Zhiwen

doi:https://doi.org/10.1155/2021/8207375

Mathematical Problems in Engineering

On this page

Abstract Introduction Data Availability Conflicts of Interest Acknowledgments References Copyright Related Articles

Research Article | Open Access

Volume 2021 | Article ID 8207375 | https://doi.org/10.1155/2021/8207375

Interval Estimation of Random Coefficient Integer-Valued Autoregressive Model Based on Mean Empirical Likelihood Method

Xianghong Xu,¹Dehui Wang,¹and Zhiwen Zhao²

Academic Editor: Rafał Stanisławski

Received13 Jun 2020

Revised10 Dec 2020

Accepted08 Feb 2021

Published26 Feb 2021

Abstract

In this paper, we study the use of the mean empirical likelihood (MEL) method in a first-order random coefficient integer-valued autoregressive model. The MEL ratio statistic is established, its limiting properties are discussed, and the confidence regions for the parameter of interest are derived. Furthermore, a simulation study is presented to demonstrate the performance of the proposed method. Finally, a real data analysis of dengue fever is performed.

1. Introduction

Integer-valued time series data are commonly encountered in many fields, such as economics, finance, actuarial science, medicine, and epidemiology (e.g., the number of patients in a hospital at a specific point of time and the number of persons in a queue waiting for service at a certain moment). Related research on integer-valued time series started in the 1980s, in which two main methods are used: a state-space model based on an unobserved “state” process and a thinning model based on a thinning operation “.” Regarding state-space models, we can refer to the paper by Fukasawa and Basawa [1]. Regarding thinning models, we can refer to the paper by Steutel and Harn [2], which mainly proposed the binomial thinning operation. Let be a non-negative integer-valued random variable and . Then, the binomial thinning operator “” is defined aswhere is an i.i.d. Bernoulli random sequence where , which is also independent of . Based on the thinning operator “,” a first-order autoregressive process with count- or integer-valued data (INAR(1)) was defined by Al-Osh and Alzaid [3] as follows:where is a sequence of i.i.d. non-negative integer-valued random variables with mean and variance and is independent of . The INAR(1) model has been discussed by many authors. Al-Osh and Alzaid [4] introduced a family of models for a stationary sequence of dependent binomial random variables and discussed the existence of a stationary distribution for the binomial AR(1) process. Al-Osh and Aly [5] presented AR(1) models with negative binomial and geometric marginals and investigated some properties of the processes. McKenzie [6] described some simple models that may be used for modelling or generating sequences of dependent discrete random variates with negative binomial and geometric univariate marginal distributions. Later, McKenzie [7] discussed the problem of defining a practically useful representation for the innovation process of a first-order autoregression with a negative binomial marginal distribution. Moreover, McKenzie [8] demonstrated that the powerful Markov property, which greatly simplifies the distributional structure of finite autoregressions, is analogous to (non-Markovian) finite moving-average processes. Furthermore, McKenzie [9] developed and investigated a family of models for discrete-time processes with Poisson marginal distributions. Alzaid and Al-Osh [10] investigated some properties of INAR(1) processes.

In some practical applications, the parameter may vary with time. For example, let denote the number of unemployed people in month t. Here, could potentially satisfy an INAR model, where is the number of unemployed people in month t who were unemployed in the previous month and represents the number of newly unemployed people in the current month. Here, represents the unemployment rate, which may be affected by economic conditions and other factors and can vary randomly over time. Zheng et al. [11] introduced a first-order random coefficient integer-valued autoregressive (RCINAR(1)) process as follows:where is an i.i.d. sequence with cumulative distribution function (CDF) on ; is an i.i.d. non-negative integer-valued sequence with probability mass function (PMF) , in which ; and , , and are independent. , where is an i.i.d. Bernoulli random sequence and ; is also independent of .

Zheng et al. [11] established the ergodicity of the process, obtained the moments and autocovariance functions, and derived the conditional least-squares (CLS) and quasi-likelihood estimators of the model parameters. In recent years, RCINAR(1) models have been discussed in many studies. Roitershtein and Zhong [12] studied the asymptotic behaviour of the RCINAR(1) model in the case where the additive term in the underlying random linear recursion belongs to the domain of attraction of a stable law. Zhang and Wang [13] presented the explicit expressions for the higher-order moments and cumulants of the RCINAR(1) process. Zhao and Hu [14] applied the least-squares method to estimate the parameters in the RCINAR(1) process. Kang [15] considered the problem of testing for parameter changes in RCINAR models. Li et al. [16] introduced a first-order random coefficient integer-valued threshold autoregressive process based on binomial thinning. Bakouch et al. [17] introduced a new stationary first-order integer-valued autoregressive process with random coefficient and zero-inflated geometric marginal distribution. Zhang et al. [18] introduced the RCINAR(1) process with generalized negative binomial marginals. Yu et al. [19] proposed a new bivariate RCINAR(1) (BRCINAR(1)) process with dependent innovations.

Many research methods have been applied to INAR models, among which the empirical likelihood (EL) method has been the main focus in recent years. The EL method, introduced by Owen [20] and further studied by Owen [21] and others, is a nonparametric statistical method. The EL method is a useful tool for statistical inference and has been successfully applied to many areas, such as linear regression models [22], generalized linear models [23], generalized estimation equations [24], dependent processes [25], semiparametric varying-coefficient partially linear regression models [26], and the limit theory of RCINAR(1) processes [27]. Zhao and Yu [28] estimated the variance of the random coefficient in the RCINAR(1) process by the EL method.

Although the EL method has many advantages and has been widely applied in various scenarios, there are some problems with this method, for example, the empirical likelihood ratio confidence regions may have poor accuracy, especially in small-sample and multidimensional scenarios. The literature discusses many attempts to solve this problem. DiCiccio et al. [29] proved that the EL is Bartlett correctable. Chen et al. [30] introduced the adjusted EL (AEL), and Taso and Wu [31] introduced the extended EL (EEL). The above methods provide improved results in small-sample scenarios, but the calculation is complex and involves a new parameter estimation method. Liang et al. [32] introduced the mean empirical likelihood (MEL) method, which is simple and rapid to implement and much more accurate than the previous EL methods.

In this paper, we focus on the use of the MEL method for the RCINAR(1) model (3). The MEL ratio statistic is derived, and its limiting properties are discussed. Specifically, the confidence region is derived for the parameter of interest.

The rest of this paper is organized as follows. In Section 2, we introduce the main results. In Section 3, we present some simulation results. In Section 4, we apply our method in the dengue fever cases data. Finally, in Section 5, we prove the main results.

2. Mean Empirical Likelihood for an RCINAR(1) Process

In this section, we will discuss how to use the MEL method for RCINAR(1) models (3). Zheng et al. [11] noted that the process is an irreducible, aperiodic, and positive recurrent (and hence ergodic) Markov chain.

Let , and ; note that they are all assumed to be finite. Let . We use the MEL method to estimate the unknown parameter . Based on the sample , Zheng et al. [11] derived the conditional least-squares (CLS) estimator of the model parameter. The CLS estimator of is obtained by minimizing over , where

Note that ; then,

By taking the derivative of with respect to , we obtain the estimating equation:where

Let . The elements of set are denoted by ; let be equal to the number of elements in , and it is easy to understand that . We define the MEL ratio statistic of aswhere . According to the method of Lagrange multipliers, letwhere and are the Lagrange multipliers. Fromwe know that and . Then, from (10), we obtain . Hence, , where satisfies

Thus, the log EL ratio statistic has the form

Further, let . The MEL ratio statistic is defined as

To obtain the limiting properties of , we impose the following assumptions:(C1) is a strictly stationary and ergodic RCINAR(1) process(C2)

The limit distribution of is established in the following theorem.

Theorem 1. Under Assumptions (C1) and (C2), we havewhere is a chi-square distribution with 2 degrees of freedom.

According to Theorem 1, we can construct the confidence region for parameter . The confidence region of iswhere is the -quantile of the chi-square distribution with 2 degrees of freedom.

3. Simulation Results

In this section, we conduct simulation studies to compare the MEL confidence region with the EL, AEL, and EEL results.

Consider the RCINAR(1) modelwhere and .

We fixed at 1 and then used the above model to generate data. We take and . Four different sample sizes (n = 20, 30, 50, and 100) are investigated, and the nominal confidence levels are chosen as 0.95 and 0.90. All the simulations are based on 1000 replications. We evaluate the coverage probability of the confidence regions, and the results are summarized in Tables 1–4.

It can be seen from the Tables 1–4 that the coverage probability of the confidence region approaches the confidence levels (0.95 and 0.90) as the sample size n increases. The MEL method has similar performance to the EEL method. The MEL and EEL coverage probabilities are much larger than nominal levels when the sample size is small. In all cases, the MEL method is uniformly better than the EL method, and it is much more accurate when the sample size is small.

In order to further study the performance of MEL method, we give the figure of the confidence region for n = 20, 30, 50, and 100 when and (Figure 1). At the same time, we calculated the CI length of and , and the results are summarized in Table 5.

It can be seen from the Table 5 and Figure 1 that the confidence region is relatively large when the sample size is small, so the coverage probability of the confidence region is relatively large. However, as the sample size increases, the confidence region becomes smaller and the length of the confidence interval shortens.

4. Real Data Analysis

In this section, we apply our proposed methods to analyse the monthly counts of dengue fever cases in China from January 2004 through April 2012, as reported by the Chinese Center for Disease Control and Prevention (http://www.chinacdc.cn). The data are plotted in Figure 2 and consist of 100 observations, which are denoted by . The plots of the autocorrelation function (ACF) and partial autocorrelation function (PACF) for the series are given in Figures 3 and 4, respectively. The corresponding plots of the sample ACF and PACF indicate an AR(1)-like autocorrelation structure.

In Figure 5, based on the observation data , we give the figure of the MEL ratio confidence region when the confidence level is 0.95. Through the calculation, we have that the least-squares estimation , which is denoted by in Figure 5. From Figure 5, we can see that is in the MEL ratio confidence region.

5. Proof of Theorem 1

In this section, we present the proof of Theorem 1. To obtain the proof, we need the following lemmas.

Lemma 1. Assume that (C1) and (C2) hold. Then,

Proof. Note thatThus, according to Lemma 4.2 of Zhang et al. [33], we know that Lemma 1 holds.

Lemma 2. Assume that (C1) and (C2) hold. Then,

Proof. Note thatThus, by Assumption (C1), we know that Lemma 2 holds.

Lemma 3. Assume that (C1) and (C2) hold. Then,where , , , and .

Proof. Note thatThus, by Lemma 2.1 and Lemma 4.1 of Zhang et al. [33], we know that Lemma 3 holds.

Lemma 4. Assume that (C1) and (C2) hold. Then,

Proof. Note thatBy the strong law of large numbers and Assumptions (C1) and (C2), we have thatTherefore, by (24)–(26), we know that Lemma 4 holds.

Lemma 5. Assume that (C1) and (C2) hold. Then,

Proof. Let , where . From (11), we know thatwhere . By Lemma 3, we have , where and are the largest and smallest eigenvalues, respectively, of . Next, we provide the proof in three steps.(i)Step 1. We prove that To prove (29), we need to prove that Note that From Lemma 2.1 of Zhang et al. [33], we know that Then, we have , so (30) holds.(ii)Step 2. We prove that From (28), we have From Lemma 2.1 of Zhang et al. [33], it is easy to see that . Hence, according to (34), (33) holds.(iii)Step 3. We prove thatWe have proved that . Let , and we have . Note that , . Hence, (35) is proved, so Lemma 5 holds.

Lemma 6. Assume that (C1) and (C2) hold. Then,

Proof. From (11), we know thatHence,To prove Lemma 6, according to Lemma 3, we need to prove thatNote thatThe proof of Lemma 6 is complete.

Proof of Theorem 1. By the Taylor expansion, we know thatwhereTherefore,According to Lemma 2.1 and Lemma 4.1 of Zhang et al. [33] and Lemma 3, we know that Theorem 1 holds.

Data Availability

The data used to support the findings of this study have not been made available because our research data come from computer simulation.

Conflicts of Interest

The authors declare that they have no conflicts of interest.

Acknowledgments

This study was supported by the National Natural Science Foundation of China (nos. 11871028, 11731015, and 11901053), the Natural Science Foundation of Jilin Province (nos. 20170101057JC and 20180101216JC), and the Program for Changbaishan Scholars of Jilin Province (2015010).

References

T. Fukasawa and I. V. Basawa, “Estimation for a class of generalized state-space time series models,” Statistics & Probability Letters, vol. 60, no. 4, pp. 459–473, 2002.
View at: Publisher Site | Google Scholar
F. W. Stevtal and K. V. Harn, “Discrete analogues of self-decomposability and stability,” The Annals of Probability, vol. 7, no. 5, pp. 893–899, 1979.
View at: Google Scholar
M. A. Al-Osh and A. A. Alzaid, “First-order integer-valued autoregressive (Inar(1)) process,” Journal of Time Series Analysis, vol. 8, no. 3, pp. 261–275, 1987.
View at: Publisher Site | Google Scholar
M. A. Al-Osh and A. A. Alzaid, “Binomial autoregressive moving average,” Stochastic Models, vol. 7, no. 3, pp. 261–282, 1987.
View at: Google Scholar
M. A. Al-Osh and E. Aly, “Binomial autoregressive moving average models,” Stochastic Models, vol. 7, pp. 261–282, 1987.
View at: Google Scholar
E. McKenzie, “Autoregressive moving-average processes with negative-binomial and geometric marginal distributions,” Advances in Applied Probability, vol. 18, no. 3, pp. 679–705, 1986.
View at: Publisher Site | Google Scholar
E. McKenzie, “Innovation distributions for gamma and negative-binomial autoregressions,” Scandinavian Journal of Statistics, vol. 14, pp. 79–85, 1987.
View at: Google Scholar
E. McKenzie, “The distributional structure of finite moving-average processes,” Journal of Applied Probability, vol. 25, no. 02, pp. 313–321, 1988.
View at: Publisher Site | Google Scholar
E. McKenzie, “Some ARMA models for dependent sequences of Poisson counts,” Advances in Applied Probability, vol. 20, no. 4, pp. 822–835, 1988.
View at: Publisher Site | Google Scholar
A. Alzaid and M. Al-Osh, “First-order integer-valued autoregressive (INAR (1)) process: distributional and regression properties,” Statistica Neerlandica, vol. 42, no. 1, pp. 53–61, 1988.
View at: Publisher Site | Google Scholar
H. Zheng, I. V. Basawa, and S. Datta, “First-order random coefficient integer-valued autoregressive processes,” Journal of Statistical Planning and Inference, vol. 137, no. 1, pp. 212–229, 2007.
View at: Publisher Site | Google Scholar
A. Roitershtein and Z. Zhong, “On random coefficient INAR(1) processes,” Science China Mathematics, vol. 56, no. 1, pp. 177–200, 2013.
View at: Publisher Site | Google Scholar
H. Zhang and D. Wang, “Inference for random coefficient INAR(1) process based on frequency domain analysis,” Communications in Statistics - Simulation and Computation, vol. 44, no. 4, pp. 1078–1100, 2015.
View at: Publisher Site | Google Scholar
Z. Zhao and Y. Hu, “Statistical inference for first-order random coefficient integer-valued autoregressive processes,” Journal of Inequalities and Applications, vol. 359, no. 1, pp. 1–12, 2015.
View at: Google Scholar
J. Kang, “Detection of parameter change in random coefficient integer-valued autoregressive models,” Entropy, vol. 20, no. 2, 107 pages, 2018.
View at: Publisher Site | Google Scholar
H. Li, K. Yang, S. S. Zhao, and D. H. Wang, “First-order random coefficients integer-valued threshold autoregressive processes,” Advances in Statistical Analysis, vol. 102, pp. 305–331, 2018.
View at: Google Scholar
H. S. Bakouch, M. Mohammadpour, and M. Shirozhan, “A zero-inflated geometric INAR(1) process with random coefficient,” Applications of Mathematics, vol. 63, no. 1, pp. 79–105, 2018.
View at: Publisher Site | Google Scholar
J. Zhang, D. Wang, and K. Yang, “A study of RCINAR(1) Process with generalized negative binomial marginals,” Communications in Statistics–Simulation and Computation, vol. 49, pp. 1–24, 2019.
View at: Google Scholar
M. Yu, D. Wang, K. Yang, and Y. Liu, “Bivariate first-order random coefficient integer-valued autoregressive processes,” Journal of Statistical Planning and Inference, vol. 204, pp. 153–176, 2020.
View at: Publisher Site | Google Scholar
A. B. Owen, “Empirical likelihood ratio confidence intervals for a single functional,” Biometrika, vol. 75, no. 2, pp. 237–249, 1988.
View at: Publisher Site | Google Scholar
A. Owen, “Empirical likelihood ratio confidence regions,” The Annals of Statistics, vol. 18, no. 1, pp. 90–120, 1990.
View at: Publisher Site | Google Scholar
A. Owen, “Empirical likelihood for linear models,” The Annals of Statistics, vol. 19, no. 4, pp. 1725–1747, 1991.
View at: Publisher Site | Google Scholar
E. D. Kolaczyk, “Empirical likelihood for generalized linear models,” Statistica Sinica, vol. 4, no. 1, pp. 199–218, 1994.
View at: Google Scholar
J. Qin and J. Lawless, “Empirical likelihood and general estimating equations,” The Annals of Statistics, vol. 22, no. 1, pp. 300–325, 1994.
View at: Publisher Site | Google Scholar
Y. Kitamura, “Empirical likelihood methods with weakly dependent processes,” The Annals of Statistics, vol. 25, no. 5, pp. 2084–2102, 1997.
View at: Publisher Site | Google Scholar
J. You and Y. Zhou, “Empirical likelihood for semiparametric varying-coefficient partially linear regression models,” Statistics & Probability Letters, vol. 76, no. 4, pp. 412–422, 2006.
View at: Publisher Site | Google Scholar
Z.-W. Zhao, D.-H. Wang, and Y. Zhang, “Limit theory for random coefficient first-order autoregressive process under martingale difference error sequence,” Journal of Computational and Applied Mathematics, vol. 235, no. 8, pp. 2515–2522, 2011.
View at: Publisher Site | Google Scholar
Z. Zhao and W. Yu, “Empirical likelihood inference for first-order random coefficient integer-valued autoregressive processes,” Mathematical Problems in Engineering, vol. 2016, Article ID 9505794, 8 pages, 2016.
View at: Publisher Site | Google Scholar
T. Diciccio, P. Hall, and J. Romano, “Empirical likelihood is bartlett-correctable,” The Annals of Statistics, vol. 19, no. 2, pp. 1053–1061, 1991.
View at: Publisher Site | Google Scholar
J. Chen, A. M. Variyath, and B. Abraham, “Adjusted empirical likelihood and its properties,” Journal of Computational and Graphical Statistics, vol. 17, no. 2, pp. 426–443, 2008.
View at: Publisher Site | Google Scholar
M. Tsao and F. Wu, “Empirical likelihood on the full parameter space,” The Annals of Statistics, vol. 41, no. 4, pp. 2176–2196, 2013.
View at: Publisher Site | Google Scholar
W. Liang, H. Dai, and S. He, “Mean empirical likelihood,” Computational Statistics & Data Analysis, vol. 138, no. 2, pp. 155–169, 2019.
View at: Publisher Site | Google Scholar
H. Zhang, D. Wang, and F. Zhu, “The empirical likelihood for first-order random coefficient integer-valued autoregressive processes,” Communications in Statistics - Theory and Methods, vol. 40, no. 3, pp. 492–509, 2011.
View at: Publisher Site | Google Scholar

Copyright

Copyright © 2021 Xianghong Xu et al. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

PDF Download Citation

Download other formats

Order printed copies

Views

232

Downloads

572

Citations