Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV

Dickerman, Barbra A.; Dahabreh, Issa J.; Cantos, Krystal V.; Logan, Roger W.; Lodi, Sara; Rentsch, Christopher T.; Justice, Amy C.; Hernán, Miguel A.

doi:10.1007/s10654-022-00855-8

Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV

METHODS
Published: 22 February 2022

Volume 37, pages 367–376, (2022)
Cite this article

European Journal of Epidemiology Aims and scope Submit manuscript

Barbra A. Dickerman ORCID: orcid.org/0000-0003-2843-687X^1,2,
Issa J. Dahabreh^1,2,3,
Krystal V. Cantos¹⁰,
Roger W. Logan^1,2,
Sara Lodi^1,2,4,
Christopher T. Rentsch^5,6,7,
Amy C. Justice^6,7,8 &
…
Miguel A. Hernán^1,2,3,9

2000 Accesses
6 Citations
13 Altmetric
Explore all metrics

Abstract

The accuracy of a prediction algorithm depends on contextual factors that may vary across deployment settings. To address this inherent limitation of prediction, we propose an approach to counterfactual prediction based on the g-formula to predict risk across populations that differ in their distribution of treatment strategies. We apply this to predict 5-year risk of mortality among persons receiving care for HIV in the U.S. Veterans Health Administration under different hypothetical treatment strategies. First, we implement a conventional approach to develop a prediction algorithm in the observed data and show how the algorithm may fail when transported to new populations with different treatment strategies. Second, we generate counterfactual data under different treatment strategies and use it to assess the robustness of the original algorithm’s performance to these differences and to develop counterfactual prediction algorithms. We discuss how estimating counterfactual risks under a particular treatment strategy is more challenging than conventional prediction as it requires the same data, methods, and unverifiable assumptions as causal inference. However, this may be required when the alternative assumption of constant treatment patterns across deployment settings is unlikely to hold and new data is not yet available to retrain the algorithm.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Considerations for Developing Applied Health Policy Models: The Example of HIV Treatment Expansion in Resource-Limited Settings

Adjusting expected deaths for mortality displacement during the COVID-19 pandemic: a model based counterfactual approach at the level of individuals

Article Open access 18 October 2023

Masks and distancing during COVID-19: a causal framework for imputing value to public-health interventions

Article Open access 04 March 2021

References

Dickerman BA, Hernán MA. Counterfactual prediction is not only for causal inference. Eur J Epidemiol. 2020;35(7):615–7. https://doi.org/10.1007/s10654-020-00659-8.
Article PubMed PubMed Central Google Scholar
van Geloven N, Swanson SA, Ramspek CL, et al. Prediction meets causal inference: the role of treatment in clinical prediction models. Eur J Epidemiol. 2020;35:619–30.
Article Google Scholar
Schulam P, Saria S. Reliable decision support using counterfactual models. Adv Neural Inf Process Syst. 2017;30:1697–708.
Google Scholar
Subbaswamy A, Saria S. From development to deployment: dataset shift, causality, and shift-stable models in health AI. Biostatistics. 2020;21(2):345–52. https://doi.org/10.1093/biostatistics/kxz041.
Article PubMed Google Scholar
Dahabreh IJ, Hernán MA. Extending inferences from a randomized trial to a target population. Eur J Epidemiol. 2019;34(8):719–22. https://doi.org/10.1007/s10654-019-00533-2.
Article CAS PubMed Google Scholar
Dahabreh IJ, Robertson SE, Steingrimsson JA, Stuart EA, Hernán MA. Extending inferences from a randomized trial to a new target population. Stat Med. 2020;39(14):1999–2014. https://doi.org/10.1002/sim.8426.
Article PubMed Google Scholar
Finlayson SG, Subbaswamy A, Singh K, et al. The clinician and dataset shift in artificial intelligence. N Engl J Med. 2021;385(3):283–6. https://doi.org/10.1056/NEJMc2104626.
Article PubMed PubMed Central Google Scholar
Sperrin M, Martin GP, Pate A, Van Staa T, Peek N, Buchan I. Using marginal structural models to adjust for treatment drop-in when developing clinical prediction models. Stat Med. 2018;37(28):4142–54. https://doi.org/10.1002/sim.7913.
Article PubMed PubMed Central Google Scholar
Pajouheshnia R, Peelen LM, Moons KGM, Reitsma JB, Groenwold RHH. Accounting for treatment use when validating a prognostic model: a simulation study. BMC Med Res Methodol. 2017;17(1):103. https://doi.org/10.1186/s12874-017-0375-8.
Article PubMed PubMed Central Google Scholar
Lin L, Sperrin M, Jenkins DA, Martin GP, Peek N. A scoping review of causal methods enabling predictions under hypothetical interventions. Diagn Progn Res. 2021;5(1):3. https://doi.org/10.1186/s41512-021-00092-9.
Article PubMed PubMed Central Google Scholar
Hernán MA, Hsu J, Healy B. A second chance to get causal inference right: a classification of data science tasks. Chance. 2019;32(1):42–9.
Article Google Scholar
U.S. Department of Veterans Affairs. Veterans Affairs HIV Program Fact Sheet. 2020. https://www.hiv.va.gov/pdf/HIV-program-factsheet.pdf.
Justice AC, Dombrowski E, Conigliaro J, et al. Veterans Aging Cohort Study (VACS): overview and description. Med Care. 2006;44(8 Suppl 2):S13-24. https://doi.org/10.1097/01.mlr.0000223741.02074.66.
Article PubMed PubMed Central Google Scholar
Tibshirani R. Regression shrinkage and selection via the lasso. J Royal Stat Soc Series B (Methodol). 1996;58(1):267–88.
Google Scholar
Steyerberg EW, Eijkemans MJC, Habbema JDF. Application of shrinkage techniques in logistic regression analysis: a case study. Stat Neerl. 2001;55(1):76–88. https://doi.org/10.1111/1467-9574.00157.
Article Google Scholar
Tate JP, Justice AC, Hughes MD, et al. An internationally generalizable risk index for mortality after one year of antiretroviral therapy. AIDS. 2013;27(4):563–72. https://doi.org/10.1097/QAD.0b013e32835b8c7f.
Article PubMed Google Scholar
Tate JP, Sterne JAC, Justice AC. Veterans Aging Cohort Study and the Antiretroviral Therapy Cohort Collaboration. Albumin, white blood cell count, and body mass index improve discrimination of mortality in HIV-positive individuals. AIDS. 2019;33(5):903–12. https://doi.org/10.1097/QAD.0000000000002140.
Article CAS PubMed Google Scholar
Moons KGM, Altman DG, Reitsma JB, et al. Transparent Reporting of a multivariable prediction model for Individual Prognosis or Diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med. 2015;162(1):W1–73. https://doi.org/10.7326/M14-0698.
Article PubMed Google Scholar
Austin PC, Steyerberg EW. Graphical assessment of internal and external calibration of logistic regression models by using loess smoothers. Stat Med. 2014;33(3):517–35. https://doi.org/10.1002/sim.5941.
Article PubMed Google Scholar
Hernán MA, Robins JM. Using big data to emulate a target trial when a randomized trial is not available. Am J Epidemiol. 2016;183(8):758–64. https://doi.org/10.1093/aje/kwv254.
Article PubMed PubMed Central Google Scholar
Hernán MA, Robins JM. Per-protocol analyses of pragmatic trials. N Engl J Med. 2017;377(14):1391–8. https://doi.org/10.1056/NEJMsm1605385.
Article PubMed Google Scholar
Hernán MA, Robins JM. Causal inference: what if. Boca Raton: Chapman & Hall/CRC; 2020.
Google Scholar
Robins JM. A new approach to causal inference in mortality studies with a sustained exposure period—Application to the healthy worker survivor effect [published errata appear in Mathl Modelling 1987;14:917–21]. Math Model. 1986;7:1393–512.
Article Google Scholar
Taubman SL, Robins JM, Mittleman MA, Hernán MA. Intervening on risk factors for coronary heart disease: an application of the parametric g-formula. Int J Epidemiol. 2009;38(6):1599–611. https://doi.org/10.1093/ije/dyp192.
Article PubMed PubMed Central Google Scholar
Young JG, Cain LE, Robins JM, O’Reilly EJ, Hernán MA. Comparative effectiveness of dynamic treatment regimes: an application of the parametric g-formula. Stat Biosci. 2011;3(1):119–43. https://doi.org/10.1007/s12561-011-9040-7.
Article PubMed PubMed Central Google Scholar
Sugiyama M, Krauledat M, Müller KM. Covariate shift adaptation by importance weighted cross validation. J Mach Learn Res. 2007;8:985–1005.
Google Scholar
Gretton A, Smola A, Huang J, Schmittfull M, Borgwardt K, Schölkopf B. Covariate shift by kernel mean matching. In: Quiñonero-Candela J, Sugiyama M, Schwaighofer A, Lawrence ND, editors. Dataset shift in machine learning. Cambridge, MA: The MIT Press; 2008. p. 131–60.
Chapter Google Scholar
Steingrimsson JA, Gatsonis C, Dahabreh IJ. Transporting a prediction model for use in a new target population. 2021; https://arxiv.org/abs/2101.11182v2.
Subbaswamy A, Saria S. Counterfactual normalization: proactively addressing dataset shfit and improving reliability using causal mechanisms. Proceedings of the Thirty-Fourth Conference on Uncertainty in Artificial Intelligence. 2018. 947–57.
Subbaswamy A, Schulam P, Saria S. Preventing failures due to dataset shift: learning predictive models that transport. Artificial Intelligence and Statistics (AISTATS). 2019.
Dahabreh IJ, Robins JM, Haneuse S, Hernán MA. Generalizing causal inferences from randomized trials: counterfactual and graphical identification. 2019; https://arxiv.org/abs/1906.10792v1.
Robins J, Orellana L, Rotnitzky A. Estimation and extrapolation of optimal treatment and testing strategies. Stat Med. 2008;27(23):4678–721. https://doi.org/10.1002/sim.3301.
Article PubMed Google Scholar
Hernán MA, VanderWeele TJ. Compound treatments and transportability of causal inference. Epidemiology. 2011;22(3):368–77. https://doi.org/10.1097/EDE.0b013e3182109296.
Article PubMed PubMed Central Google Scholar
VanderWeele TJ, Hernán MA. Causal inference under multiple versions of treatment. J Causal Inference. 2013;1(1):1–20. https://doi.org/10.1515/jci-2012-0002.
Article PubMed PubMed Central Google Scholar

Download references

Funding

This research was supported by National Institutes of Health grants K99 CA248335 (B.A.D.) and R37 AI02634 and Providence/Boston Center for AIDS Research grant P30 AI042853 (S.L.).

Author information

Authors and Affiliations

CAUSALab, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Barbra A. Dickerman, Issa J. Dahabreh, Roger W. Logan, Sara Lodi & Miguel A. Hernán
Department of Epidemiology, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Barbra A. Dickerman, Issa J. Dahabreh, Roger W. Logan, Sara Lodi & Miguel A. Hernán
Department of Biostatistics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
Issa J. Dahabreh & Miguel A. Hernán
Department of Biostatistics, Boston University School of Public Health, Boston, MA, USA
Sara Lodi
Faculty of Epidemiology and Population Health, London School of Hygiene & Tropical Medicine, London, UK
Christopher T. Rentsch
Veterans Affairs Connecticut Healthcare System, West Haven, CT, USA
Christopher T. Rentsch & Amy C. Justice
Department of Internal Medicine, Yale School of Medicine, New Haven, CT, USA
Christopher T. Rentsch & Amy C. Justice
Center for Interdisciplinary Research on AIDS, Yale School of Public Health, New Haven, CT, USA
Amy C. Justice
Harvard-MIT Division of Health Sciences and Technology, Boston, MA, USA
Miguel A. Hernán
Epidemiology and Drug Safety, IQVIA, Cambridge, Durham, MA, USA
Krystal V. Cantos

Authors

Barbra A. Dickerman
View author publications
You can also search for this author in PubMed Google Scholar
Issa J. Dahabreh
View author publications
You can also search for this author in PubMed Google Scholar
Krystal V. Cantos
View author publications
You can also search for this author in PubMed Google Scholar
Roger W. Logan
View author publications
You can also search for this author in PubMed Google Scholar
Sara Lodi
View author publications
You can also search for this author in PubMed Google Scholar
Christopher T. Rentsch
View author publications
You can also search for this author in PubMed Google Scholar
Amy C. Justice
View author publications
You can also search for this author in PubMed Google Scholar
Miguel A. Hernán
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Barbra A. Dickerman.

Ethics declarations

Conflict of interest

The author declares that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (DOCX 365 kb)

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dickerman, B.A., Dahabreh, I.J., Cantos, K.V. et al. Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV. Eur J Epidemiol 37, 367–376 (2022). https://doi.org/10.1007/s10654-022-00855-8

Download citation

Received: 14 August 2021
Accepted: 14 February 2022
Published: 22 February 2022
Issue Date: April 2022
DOI: https://doi.org/10.1007/s10654-022-00855-8

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV

Abstract

Access this article

Similar content being viewed by others

Considerations for Developing Applied Health Policy Models: The Example of HIV Treatment Expansion in Resource-Limited Settings

Adjusting expected deaths for mortality displacement during the COVID-19 pandemic: a model based counterfactual approach at the level of individuals

Masks and distancing during COVID-19: a causal framework for imputing value to public-health interventions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 365 kb)

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Predicting counterfactual risks under hypothetical treatment strategies: an application to HIV

Abstract

Access this article

Similar content being viewed by others

Considerations for Developing Applied Health Policy Models: The Example of HIV Treatment Expansion in Resource-Limited Settings

Adjusting expected deaths for mortality displacement during the COVID-19 pandemic: a model based counterfactual approach at the level of individuals

Masks and distancing during COVID-19: a causal framework for imputing value to public-health interventions

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Supplementary Information

Supplementary file1 (DOCX 365 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation