Abstract
Functional data analysis tools, such as function-on-function regression models, have received considerable attention in various scientific fields because of their observed high-dimensional and complex data structures. Several statistical procedures, including least squares, maximum likelihood, and maximum penalized likelihood, have been proposed to estimate such function-on-function regression models. However, these estimation techniques produce unstable estimates in the case of degenerate functional data or are computationally intensive. To overcome these issues, we proposed a partial least squares approach to estimate the model parameters in the function-on-function regression model. In the proposed method, the B-spline basis functions are utilized to convert discretely observed data into their functional forms. Generalized cross-validation is used to control the degrees of roughness. The finite-sample performance of the proposed method was evaluated using several Monte-Carlo simulations and an empirical data analysis. The results reveal that the proposed method competes favorably with existing estimation techniques and some other available function-on-function regression models, with significantly shorter computational time.
Similar content being viewed by others
References
Aguilera AM, Escabias M, Preda C, Saporta G (2010) Using basis expansions for estimating functional PLS regression: applications with chemometric data. Chemometr Intell Lab Syst 104(2):289–305
Alin A, Agostinelli C (2017) Robust iteratively reweighted SIMPLS. J Chemometr 31(3):e2881
Antoniadis A, Paparoditis E, Sapatinas T (2009) Bandwidth selection for functional time series prediction. Stat Prob Lett 79:733–740
Barnardi M, Catania L (2018) The model confidence set package for R. Int J Comput Econ Econometr 8(2):144–158
Boulesteix A-L, Durif G, Lambert-Lacroix S, Peyre J, Strimmer K (2018) plsgenomics: PLS Analyses for Genomics. R package version 1.5–2. https://CRAN.R-project.org/package=plsgenomics. Accessed 30 July 2019
Brumback BA, Rice JA (1998) Smoothing spline models for the analysis of nested and crossed samples of curves. J Am Stat Assoc 93(443):961–976
Cardot H, Sarda P (2008) Varying-coefficient functional linear regression models. Commun Stat Theory Methods 37(20):3168–3203
Chiou J-M, Yang Y-F, Chen Y-T (2016) Multivariate functional linear regression and prediction. J Multivar Anal 146:301–312
Cuevas A (2014) A partial overview of the theory of statistics with functional data. J Stat Plan Inference 147:1–23
Dayal BS, MacGregor JF (1997) Improved PLS algorithms. J Chemometr 11(1):73–85
de Jong S (1993) SIMPLS: an alternative approach to partial least squares regression. Chemometr Intell Lab Syst 18(3):251–263
Fan J, Zhang W (1999) Statistical estimation in varying coefficient models. Ann Stat 27(5):1491–1518
Febrero-Bande M, Oviedo de la Fuente M (2012) Statistical computing in functional data analysis: the R package fda.usc. J Stat Softw 51(4):1–28
Febrero-Bande M, Galeano P, Gozalez-Manteiga W (2017) Functional principal component regression and functional partial least-squares regression: an overview and a comparative study. Int Stat Rev 85(1):61–83
Ferraty F, Vieu P (2006) Nonparametric functional data analysis. Springer, New York
Goldsmith J, Scheipl F, Huang L, Wrobel J, Gellar J, Harezlak J, McLean MW, Swihart B, Xiao L, Crainiceanu C, Reiss PT (2018) Refund: regression with functional data. R package version 0.1–17. https://CRAN.R-project.org/package=refund. Accessed 30 July 2019
Golub G, Kahan W (1965) Calculating the singular values and pseudo-inverse of a matrix. SIAM J Numer Anal 2(2):205–224
Hansen PR, Lunde A, Nason JM (2011) The model confidence set. Econometrica 79(2):453–497
Happ C, Greven S (2018) Multivariate functional principal component analysis for data observed on different (dimensional) domains. J Am Stat Assoc Theory Methods 113(522):649–659
Hoover DR, Rice JA, Wu CO, Yang L-P (1998) Nonparametric smoothing estimates of time-varying coefficient models with longitudinal data. Biometrika 85(4):809–822
Horvath L, Kokoszka P (2012) Inference for functional data with applications. Springer, New York
Huang JZ, Wu CO, Zhou L (2002) Varying-coefficient models and basis function approximations for the analysis of repeated measurements. Biometrika 89(1):111–128
Huang JZ, Wu CO, Zhou L (2004) Polynomial spline estimation and inference for varying coefficient models with longitudinal data. Stat Sinica 14(3):763–788
Hyndman RJ, Shang HL (2009) Forecasting functional time series. J Korean Stat Soc 38(3):199–211
Ivanescu AE, Staicu A-M, Scheipl F, Greven S (2015) Penalized function-on-function regression. Comput Stat 30(2):539–568
Krämer N, Boulesteix A-L, Tutz G (2008) Penalized partial least squares with applications to B-spline transformations and functional data. Chemometr Intell Lab Syst 94(1):60–69
Martens H, Naes T (1989) Multivariate calibration. Wiley, New York
Matsui H, Kawano S, Konishi S (2009) Regularized functional regression modeling for functional response and predictors. J Math-for-Ind 1(A3):17–25
Müller H-G, Yao F (2008) Functional additive models. J Am Stat Assoc 103(484):1534–1544
Preda C, Saporta G (2005) PLS regression on a stochastic process. Comput Stat Data Anal 48(1):149–158
Preda C, Schiltz J (2011) Functional PLS regression with functional response: the basis expansion approach. In: Proceedings of the 14th applied stochastic models and data analysis conference. Universita di Roma La Spienza. p. 1126–33
R Core Team (2019) R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/. Accessed 30 July 2019
Racine J (2000) Consistent cross-validatory model-selection for dependent data: \(hv\)-block cross-validation. J Econometr 99:39–61
Ramsay JO, Dalzell CJ (1991) Some tools for functional data analysis. J R Stat Soc Ser B 53(3):539–572
Ramsay JO, Silverman BW (2002) Applied functional data analysis. Springer, New York
Ramsay JO, Silverman BW (2006) Functional data analysis. Springer, New York
Reiss PT, Ogden TR (2007) Functional principal component regression and functional partial least squares. J Am Stat Assoc 102(479):984–996
Sanchez G (2012) plsdepot: Partial Least Squares (PLS) data analysis methods. R package version 0.1.17. https://CRAN.R-project.org/package=plsdepot. Accessed 30 July 2019
Şentürk D, Müller H-G (2005) Covariate adjusted correlation analysis via varying coefficient models. Scand J Stat 32(3):365–383
Wang W (2014) Linear mixed function-on-function regression models. Biometrics 70(4):794–801
Wold H (1974) Causal flows with latent variables: partings of the ways in the light of NIPALS modelling. Eur Econ Rev 5(1):67–86
Wu CO, Chiang C-T (2000) Kernel smoothing on varying coefficient models with longitudinal dependent variable. Stat Sinica 10(2):443–456
Wu Y, Fand J, Müller H-G (2010) Varying-coefficient functional linear regression. Bernoulli 16(3):730–758
Yamanishi Y, Tanaka Y (2003) Geographically weighted functional multiple regression analysis: a numerical investigation. J Jpn Soc Comput Stat 15(2):307–317
Yao Q, Tong H (1998) Cross-validatory bandwidth selections for regression estimation based on dependent data. J Stat Plan Inference 68:387–415
Yao F, Müller H-G, Wang J-L (2005) Functional linear regression analysis for longitudinal data. Ann Stat 33(6):2873–2903
Zhu H, Fan J, Kong L (2014) Spatially varying coefficient model for neuroimaging data with jump discontinuities. J Am Stat Assoc 109(507):1084–1098
Acknowledgements
The authors acknowledge R code assistance from Dr. Fabian Scheipl at the Ludwig-Maximilians University of Munich and Ms. Julia Wrobel at Columbia University. The authors thank two anonymous referees for their careful reading of our manuscript and valuable suggestions and comments, which have helped us produce a much-improved paper. The second author also acknowledges the financial support from a research grant at the College of Business and Economics at the Australian National University.
Author information
Authors and Affiliations
Corresponding author
Additional information
Handling Editor: Bryan F. J. Manly
Rights and permissions
About this article
Cite this article
Beyaztas, U., Shang, H.L. On function-on-function regression: partial least squares approach. Environ Ecol Stat 27, 95–114 (2020). https://doi.org/10.1007/s10651-019-00436-1
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10651-019-00436-1