Abstract
Estimation and diagnostic procedures for partially linear models with first-order autoregressive [AR(1)] skew-normal errors are proposed in this paper. An EM iterative process with analytic expressions for the M and E-steps, which combines back-fitting and Newton–Raphson algorithms, is developed for the parameter estimation. A linear smoother for the estimation of the effective degrees of freedom concerning the nonparametric component is derived from the iterative process. Local influence analysis is developed based on the conditional expectation of the complete-data log-likelihood function, used in the EM algorithm. A simulation study is also conducted to evaluate the efficiency of the EM algorithm. Finally, the methodology developed through the paper is illustrated with a real data set on daily ozone concentration.
Similar content being viewed by others
References
Atkinson A (1981) Two graphical displays for outlying and influential observations in regression. Biometrika 68:13
Azzalini A (1985) A class of distributions which includes the normal ones. Scand J Stat 12:171–178
Bondon P (2009) Estimation of autoregressive models with epsilon-skew-normal innovations. J Multivar Anal 100:1761–1776
Breiman L, Friedman JH (1985) Estimating optimal transformations for multiple regression and correlation (with discussion). J Am Stat Assoc 80(391):580–619
Cao CZ, Lin JG, Zhu LX (2010) Heteroscedasticity and/or autocorrelation diagnostics in nonlinear models with ar(1) and symmetrical errors. Stat Pap 51:813–836
Cook RD (1986) Assessment of local influence. J R Stat Soc Ser B 48:133–169
Cook RD, Weisberg S (1982) Residuals and influence in regression. Chapman and Hall/CRC, Boca Raton
de Boor C (2001) A practical guide to spline, revised. Springer, Berlin
Dempster A, Laird N, Rubin D (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc Ser B 39:1–38
Dunn PK, Smyth GK (1996) Randomized quantile residuals. J Comput Graph Stat 5(3):236–244
Eilers PHC, Marx BD (1996) Flexible smoothing with B-splines and penalties. Stat Sci 11(2):89–121
Ferreira CS, Bolfarine H, Lachos VH (2011) Skew scale mixtures of normal distributions: properties and estimation. Stat Methodol 8:154–171
Ferreira CS, Paula GA (2017) Estimation and diagnostic for skew-normal partially linear models. J Appl Stat 44(16):3033–3053
Ferreira G, Castro LM, Lachos VH, Dias R (2013) Bayesian modeling of autoregressive partial linear models with scale mixture of normal errors. J Appl Stat 40(8):1796–1816
Ferreira JT, Steel MFJ (2006) A constructive representation of univariate skewed distributions. J Am Stat Assoc 101:823–829
Genton MG (2004) Skew-elliptical distributions and their applications: a journey beyond normality. Chapman and Hall, London
Green PJ (1987) Penalized likelihood for general semi-parametric regression models. Int Stat Rev 55:245–259
Green PJ (1990) On use of the EM algorithm for penalized likelihood estimation. J R Stat Soc Ser B 52:443–452
Greene WH (2012) Econometric analysis. 7th edn. Prentice Hall, Upper Saddle River
Hastie T, Tibshirani R (1990) Generalized additive models. Chapman and Hall, London
Johnson NL, Kotz S, Balakrishnan N (1994) Continuous univariate distributions, vol 1. Wiley, New York
Lee SY, Xu L (2004) Influence analysis of nonlinear mixed-effects models. Comput Stat Data Anal 45:321–341
Liu S (2004) On diagnostics in conditionally heteroscedastic time series models under elliptical distributions. J Appl Probab 41A:393–405
Lu B, Song X (2006) Local influence of multivariate Probit latent variable models. J Multivar Anal 97:1783–1798
Nadarajah S, Kotz S (2003) Skewed distributions generated by the normal kernel. Stat Probab Lett 65:269–277
Paula GA, Medeiros MJ, Vilca-Labra FE (2009) Influence diagnostics for linear models with first-order autoregressive elliptical errors. Stat Probab Lett 79:339–346
R Core Team (2019) R: A language and environment for statistical computing. In: R foundation for statistical computing, Vienna, Austria. http://www.R-project.org/
Relvas CEM, Paula GA (2016) Partially linear models with first-order autoregressive symmetric errors. Stat Pap 57:795–825
Sahu SK, Dey DK, Branco MD (2003) A new class of multivariate distributions with applications to Bayesian regression models. Can J Stat 31:129–150
Segal MR, Bacchetti P, Jewell NP (1994) Variances for maximum penalized likelihood estimates obtained via the EM algorithm. J R Stat Soc Ser B 56:345–352
Sharafi M, Nematollahi AR (2016) Ar(1) model with skew-normal innovations. Metrika 79:1011–1029
Wood S (2017) Generalized additive models, an introduction with R, 2nd edn. Chapman and Hall, London
Zhu H, Lee S (2001) Local influence for incomplete-data models. J R Stat Soc Ser B 63:111–126
Acknowledgements
We thank two anonymous referees for comments that improved this manuscript. This work was partially supported by CNPq-Brazil and FAPEMIG-Brazil.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Appendix A: Approximate standard errors
The variance–covariance matrix of the \(\widehat{\varvec{\theta }}\), corresponding to the inverse of the observed information matrix, is obtained by treating the penalized likelihood as a usual likelihood Segal et al. (1994). Given the observed log-likelihood function \(\ell (\varvec{\theta })\) in (5), the correspondence penalized log-likelihood function of \(\varvec{\theta }=(\varvec{\beta }^\top ,\varvec{\gamma }^\top ,\sigma ^2,\delta ,\rho )^\top \) is of the form
The observed information matrix for \(\varvec{\theta }\) may be written as
with elements
where \(W_\Phi '(x)=-W_\Phi (x)(x+W_\Phi (x))\). The first and second derivatives of \(B_i\), \(i=1,\ldots ,n\), in relation to \(\varvec{\theta }\) are given by
where \(\mathbf {x}_0=\mathbf {n}_0=\mathbf {0}\) and \(y_0=\mu _0=0\).
Appendix B: Derivation of the matrices \(\mathbf {N}\) and \(\mathbf {K}\)
Let \(\mathbf {t}=(t_1,\ldots , t_m)\) and ndx the number of equidistant Knots desired by the user. It follows the commands in R (R Core Team 2019) for construction of the matrices \(\mathbf {N}\) and \(\mathbf {K}\):
Rights and permissions
About this article
Cite this article
Silva Ferreira, C.d., Paula, G.A. & Lana, G.C. Estimation and diagnostic for partially linear models with first-order autoregressive skew-normal errors. Comput Stat 37, 445–468 (2022). https://doi.org/10.1007/s00180-021-01130-2
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00180-021-01130-2