Skip to main content

Advertisement

Log in

Outlier detection under a covariate-adjusted exponential regression model with censored data

  • Original paper
  • Published:
Computational Statistics Aims and scope Submit manuscript

Abstract

Exponential regression models with censored data are most widely used in practice. In the modeling process, there exist situations where the covariates are not directly observed but are observed after being contaminated by unknown functions of an observable confounder in a multiplicative manner. The problem of outlier detection is a fundamental and important problem in applied statistics. In this paper, we use a nonparametric regression method to adjust the covariates and recast the outlier detection issue into a high-dimensional regularization regression issue in the covariate-adjusted exponential regression model with censored data. We propose a smoothly clipped absolute deviation (SCAD) penalized likelihood method to detect the possible outliers, which features that the proposed method can simultaneously deal with outlier detection and estimations for the regression coefficients. The coordinate descent algorithm is employed to facilitate computation. Simulation studies are conducted to evaluate the finite-sample performance of our proposed method. An application to a German breast cancer study demonstrates the utility of the proposed method in practice.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2

Similar content being viewed by others

References

  • Breheny P, Huang J (2011) Coordinate descent algorithms for nonconvex penalized regression, with applications to biological feature selection. Ann Appl Stat 5(1):232–253

    Article  MathSciNet  Google Scholar 

  • Chien L (2013) Multiple deletion diagnostics in beta regression models. Comput Stat 28:1639–1661

    Article  MathSciNet  Google Scholar 

  • Cox DR (1972) Regression models and life-tables. J R Stat Soc Ser B Methodol 34(2):187–202

    MathSciNet  MATH  Google Scholar 

  • Cui X, Guo W, Lin L et al (2009) Covariate-adjusted nonlinear regression. Ann Stat 37(4):1839–1870

    Article  MathSciNet  Google Scholar 

  • Fan J, Li R (2001) Variable selection via nonconcave penalized likelihood and its oracle properties. J Am Stat Assoc 96(456):1348–1360

    Article  MathSciNet  Google Scholar 

  • Friedman J, Hastie TH, Fling H et al (2007) Pathwise coordinate optimization. Ann Appl Stat 1(2):302–332

    Article  MathSciNet  Google Scholar 

  • Grambsch PM, Therneau TM (1994) Proportional hazards tests and diagnostics based on weighted residuals. Biometrika 81(3):515–526

    Article  MathSciNet  Google Scholar 

  • Inácio de Carvalho V, de Carvalho M, Branscum AJ (2017) Nonparametric Bayesian covariate-adjusted estimation of the Youden index. Biometrics 73(4):1279–1288

    Article  MathSciNet  Google Scholar 

  • Jiang H, Symanowski J, Qu Y, Ni X, Wang Y (2011) Covariate-adjusted non-parametric survival curve estimation. Stat Med 30(11):1243–1253

    Article  MathSciNet  Google Scholar 

  • Kim SS, Krzanowski WJ (2007) Detecting multiple outliers in linear regression using a cluster method combined with graphical visualization. Comput Stat 22:109–119

    Article  MathSciNet  Google Scholar 

  • Li X, Du J, Li G, Fan M (2014) Variable selection for covariate adjusted regression model. J Syst Sci Complex 27(06):1227–1246

    Article  MathSciNet  Google Scholar 

  • Lin D, Ying Z (1994) Semiparametric analysis of the additive risk model. Biometrika 81(1):61–71

    Article  MathSciNet  Google Scholar 

  • Nagaraju D, Shaik S, Shaik A, Hussain B (2014) Outliers detection in regression analysis using partial least square approach. Springer Int Publ 249:125–135

    Google Scholar 

  • Sawant P, Billor N, Shin H (2012) Functional outlier detection with robust functional principal component analysis. Comput Stat 27:83–102

    Article  MathSciNet  Google Scholar 

  • Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461–464

    Article  MathSciNet  Google Scholar 

  • Şentürk D, Müller HG (2005a) Covariate adjusted correlation analysis via varying coefficient models. Scand J Stat 32(3):365–383

    Article  MathSciNet  Google Scholar 

  • Şentürk D, Müller HG (2005b) Covariate-adjusted regression. Biometrika 92(1):75–89

    Article  MathSciNet  Google Scholar 

  • Şentürk D, Nguyen DV (2006) Estimation in covariate-adjusted regression. Comput Stat Data Anal 50(11):3294–3310

    Article  MathSciNet  Google Scholar 

  • She Y, Owen AB (2011) Outlier detection using nonconvex penalized regression. J Am Stat Assoc 106(494):626–639

    Article  MathSciNet  Google Scholar 

  • Wu T, Lange K (2008) Coordinate descent algorithms for lasso penalized regression. Ann Appl Stat 2(1):224–244

    Article  MathSciNet  Google Scholar 

  • Zhang J, Zhu LX, Liang H (2012) Nonlinear models with measurement errors subject to single-indexed distortion. J Multivar Anal 112:1–23

    Article  MathSciNet  Google Scholar 

  • Zhao J, Xie C (2018) A nonparametric test for covariate-adjusted models. Stat Probab Lett 133:65–70

    Article  MathSciNet  Google Scholar 

  • Zhou H, Hanson T, Jara A, Zhang J (2015) Modelling county level breast cancer survival data using a covariate-adjusted frailty proportional hazards model. Ann Appl Stat 9(1):43–68

    Article  MathSciNet  Google Scholar 

  • Zou H, Li R (2008) One-step sparse estimates in nonconcave penalized likelihood models. Ann Stat 36(4):1509–1533

    MathSciNet  MATH  Google Scholar 

Download references

Acknowledgements

This work was supported by the National Natural Science Foundation of China (NSFC) (No. 11901175) and Fundamental Research Funds for the Hubei Key Laboratory of Applied Mathematics, Hubei University (No. HBAM201907)

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Zhan Liu.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Pan, Y., Liu, Z. & Song, G. Outlier detection under a covariate-adjusted exponential regression model with censored data. Comput Stat 36, 961–976 (2021). https://doi.org/10.1007/s00180-020-01052-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00180-020-01052-5

Keywords

Navigation