Bias correction via outcome reassignment for cross-sectional data with binary disease outcome

Wang, Mei-Cheng; Zhu, Yuxin

doi:10.1007/s10985-022-09559-3

Published: 24 June 2022

Lifetime Data Analysis, Volume 28, pages 659–674 (2022)

Abstract

Cross-sectionally sampled data with a binary disease outcome are commonly analyzed in observational studies to identify the relationship between covariates and the disease outcome. A cross-sectional population is defined as the population of individuals who are alive at the sampling or observational time. It is generally understood that a binary disease outcome from cross-sectional data contains less information than longitudinally collected time-to-event data, but it is not well understood whether bias can exist in cross-sectional data and how that bias is related to the population risk of interest. Wang and Yang (2021) presented the complexity and bias in cross-sectional data with binary disease outcome with detailed analytical explorations into the data structure. Because the distribution of the cross-sectional binary outcome is quite different from the population risk distribution, bias can arise when cross-sectional data analysis is used to draw inference about population risk. In this paper we argue that the commonly adopted age-specific risk probability is biased for the estimation of population risk and propose an outcome reassignment (OR) approach, which reassigns a portion of the observed binary outcomes, 0 or 1, to the other disease category. A sign test and a semiparametric pseudo-likelihood method are developed for analyzing cross-sectional data using the OR approach. Simulations and an analysis based on Alzheimer’s Disease data are presented to illustrate the proposed methods.
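
The gap between a cross-sectional binary outcome and a cohort-level risk can be illustrated with a toy simulation. The sketch below is not the authors' model or their outcome reassignment estimator. It assumes a hypothetical illness-death setup with exponential onset and death times, takes "population risk" to mean the latent probability of onset by a given age, and uses made-up rates and a made-up function name (`simulate_cross_section`) purely for illustration.

```python
import random


def simulate_cross_section(n=200_000, age=70.0, onset_rate=0.02,
                           base_death_rate=0.01, extra_death_rate=0.05,
                           seed=0):
    """Toy illness-death simulation (all rates are illustrative assumptions).

    Returns a pair (cohort_risk, cross_sectional_prevalence) at `age`:
      cohort_risk: fraction of the birth cohort whose latent onset age
                   is <= age, whether or not they are still alive;
      cross_sectional_prevalence: fraction with onset <= age among
                   those still alive at `age`.
    """
    rng = random.Random(seed)
    onset_by_age = 0
    alive = 0
    alive_with_disease = 0
    for _ in range(n):
        # Latent age at disease onset.
        onset = rng.expovariate(onset_rate)
        # Death hazard is base_death_rate before onset and
        # base_death_rate + extra_death_rate after onset.
        death = min(rng.expovariate(base_death_rate),
                    onset + rng.expovariate(extra_death_rate))
        if onset <= age:
            onset_by_age += 1
        if death > age:
            # Only survivors can appear in a cross-sectional sample.
            alive += 1
            if onset <= age:
                alive_with_disease += 1
    return onset_by_age / n, alive_with_disease / alive


if __name__ == "__main__":
    risk, prevalence = simulate_cross_section()
    print(f"cohort risk by age 70:          {risk:.3f}")
    print(f"prevalence among those alive:   {prevalence:.3f}")
```

Under these assumed rates the cohort risk by age 70 is 1 − exp(−1.4) ≈ 0.75, while the proportion with disease among those alive at age 70 is about 0.37, because diseased individuals are removed from the living population faster. The size and direction of the gap depend entirely on the assumed hazards.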

References

Alexander L, Lopes B, Ricchetti-Masterson K, Yeatts KB, ERIC N (2015) Cross-sectional studies. Eric Noteb 2(6):1–5
Banerjee M, Wellner JA (2005) Confidence intervals for current status data. Scand J Stat 32(3):405–424
Cox DR (1972) Regression models and life-tables. J Royal Stat Soc: Series B (Methodol) 34(2):187–202
Edwards JK, Cole SR, Chu H, Olshan AF, Richardson DB (2014) Accounting for outcome misclassification in estimates of the effect of occupational asbestos exposure on lung cancer death. Am J Epidemiol 179(5):641–647
Efron B, Tibshirani RJ (1994) An introduction to the bootstrap. CRC Press, Cambridge
Finkelstein DM (1986) A proportional hazards model for interval-censored failure time data. Biometrics 42:845–854
Gilbert R, Martin RM, Donovan J, Lane JA, Hamdy F, Neal DE, Metcalfe C (2016) Misclassification of outcome in case-control studies: methods for sensitivity analysis. Stat Methods Med Res 25(5):2377–2393
Groeneboom P, Wellner JA (1992) Information bounds and nonparametric maximum likelihood estimation, vol 19. Springer Science & Business Media, Heidelberg
Ho DE, Imai K, King G, Stuart EA (2011) MatchIt: nonparametric preprocessing for parametric causal inference. J Stat Softw 42(8):1–28
Jewell NP, van der Laan M (2003) Current status data: review, recent developments and open problems. Handb Stat 23:625–642
Lin D, Oakes D, Ying Z (1998) Additive hazards regression with current status data. Biometrika 85(2):289–298
Mandel M (2015) Analyzing multiple cross-sectional samples with application to hospitalization time after surgeries. Stat Med 34(26):3415–3423
Mandel M, Fluss R (2009) Nonparametric estimation of the probability of illness in the illness-death model under cross-sectional sampling. Biometrika 96(4):861–872
Martinez BAF, Leotti VB, GdSe Silva, Nunes LN, Machado G, Corbellini LG (2017) Odds ratio or prevalence ratio? An overview of reported statistical methods and appropriateness of interpretations in cross-sectional studies with dichotomous outcomes in veterinary medicine. Front Vet Sci 4:193
McNemar Q (1947) Note on the sampling error of the difference between correlated proportions or percentages. Psychometrika 12(2):153–157
Müller M (2001) Estimation and testing in generalized partial linear models: a comparative study. Stat Comput 11(4):299–309
Rossini A, Tsiatis A (1996) A semiparametric proportional odds regression model for the analysis of current status data. J Am Stat Assoc 91(434):713–721
Severini TA, Staniswalis JG (1994) Quasi-likelihood estimation in semiparametric models. J Am Stat Assoc 89(426):501–511
Wang MC (1991) Nonparametric estimation from cross-sectional survival data. J Am Stat Assoc 86(413):130–143
Wang MC, Yang Y (2021) Complexity and bias in cross-sectional data with binary disease outcome in observational studies. Stat Med 40(4):950–962
Wisniewski T, Castano EM, Golabek A, Vogel T, Frangione B (1994) Acceleration of Alzheimer’s fibril formation by apolipoprotein E in vitro. Am J Pathol 145(5):1030

Author information

Authors and Affiliations

Department of Biostatistics, Johns Hopkins Bloomberg School of Public Health, Baltimore, USA
Mei-Cheng Wang
Department of Neurology, Johns Hopkins School of Medicine, Baltimore, USA
Yuxin Zhu

Corresponding author

Correspondence to Mei-Cheng Wang.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Wang, MC., Zhu, Y. Bias correction via outcome reassignment for cross-sectional data with binary disease outcome. Lifetime Data Anal 28, 659–674 (2022). https://doi.org/10.1007/s10985-022-09559-3

Received: 03 November 2021
Accepted: 06 June 2022
Published: 24 June 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s10985-022-09559-3
