Replication study design: confidence intervals and commentary

Kupper, Lawrence L.; Martin, Sandra L.

doi:10.1007/s00362-022-01291-2

Replication study design: confidence intervals and commentary

Regular Article
Published: 27 January 2022

Volume 63, pages 1577–1583, (2022)
Cite this article

Statistical Papers Aims and scope Submit manuscript

213 Accesses
2 Altmetric
Explore all metrics

Abstract

Methods for designing a comparable replication study have received considerable attention in the published literature, with both Bayesian and non-Bayesian methods having been developed from a hypothesis testing and associated P-value perspective. The purpose of this paper is to describe, using a maximum likelihood-based confidence interval framework, a new frequentist method for choosing the sample size for a comparable replication study. This new method is compared to the published “predictive power” (or “PP”) method. For each of these two methods, a new and easy-to-use formula is derived for computing the optimal comparable replication study sample size that guarantees satisfying a specific confidence interval criterion with a chosen high minimum probability. Connections to hypothesis testing are made, and the Discussion section provides further commentary and considers a numerical example involving published data.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Power priors for replication studies

Article Open access 21 September 2023

Samuel Pawel, Frederik Aust, … Eric-Jan Wagenmakers

Assessing evidence for replication: A likelihood-based approach

Article 21 May 2020

Peter Dixon & Scott Glover

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

Levi Kumle, Melissa L.-H. Võ & Dejan Draschkow

Data Availability

Not applicable.

Code Availability

Not applicable.

References

Abeler J et al (2011) Reference points and effort provision. Am Econ Rev 101:470–492
Article Google Scholar
Bonett DG (2021) Design and analysis of replication studies. Organ Res Methods 24(3):513–529
Article Google Scholar
Branch MN (2014) Malignant side effects of null hypothesis significance testing. Theory Psychol 24:256–277
Article Google Scholar
Camerer CF et al (2016) Evaluating replicability of laboratory experiments in economics. Science 351:1433–1436
Article Google Scholar
Cumming G (2008) Replication and p intervals: p values predict the future only vaguely, but confidence intervals do much better. Perspect Psychol Sci 3(4):286–300
Article MathSciNet Google Scholar
Errington TM et al (2021) Reproducibility in cancer biology: challenges for assessing replicability in preclinical cancer biology. eLife 10:e67995. https://doi.org/10.7554/eLife.67995
Article Google Scholar
Held L (2020) A new standard for the analysis and design of replication studies. J R Stat Soc A 2:431–448
Article MathSciNet Google Scholar
Held L, Pawel S, Schwab S (2020) Replication power and regression to the mean. Significance 17(6):10–11
Article Google Scholar
Lash TL (2017) The harm done to reproducibility by the culture of null hypothesis significance testing. Am J Epidemiol 186(6):627–635
Article Google Scholar
Miller J (2009) What is the probability of replicating a statistically significant effect? Psychon Bull Rev 16(4):617–640
Article Google Scholar
Patil P, Peng RD, Leek JT (2016) What should we expect when we replicate? A statistical view of replicability in psychological science. Perspect Psychol Sci 11(4):539–544
Article Google Scholar
Shrout PE, Rodgers JL (2018) Psychology, science, and knowledge construction: broadening perspectives of the replication crisis. Ann Rev Psychol 69:487–510
Article Google Scholar
Wasserman RL, Lazar NA (2016) The ASA statement on p-values: context, process, and purpose. Am Stat 70:129–133
Article MathSciNet Google Scholar
Yaffe J (2019) From the editor-do we have a replication crisis in social work research? J Soc Work Educ 55(1):1–4
Article MathSciNet Google Scholar

Download references

Acknowledgements

We want to acknowledge helpful email exchanges with Professor Leonhard Held and Ms. Charlotte Micheloud, Epidemiology, Biostatistics and Prevention Institute, Center for Reproducible Science, University of Zurich; and, we want to thank two journal referees for some very good suggestions that greatly improved the paper.

Funding

Not applicable.

Author information

Authors and Affiliations

Department of Biostatistics, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC, 27599, USA
Lawrence L. Kupper
Department of Maternal and Child Health, Gillings School of Global Public Health, University of North Carolina, Chapel Hill, NC 27599, USA
Sandra L. Martin

Authors

Lawrence L. Kupper
View author publications
You can also search for this author in PubMed Google Scholar
Sandra L. Martin
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lawrence L. Kupper.

Ethics declarations

Conflict of interest

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kupper, L.L., Martin, S.L. Replication study design: confidence intervals and commentary. Stat Papers 63, 1577–1583 (2022). https://doi.org/10.1007/s00362-022-01291-2

Download citation

Received: 03 May 2021
Revised: 06 January 2022
Accepted: 20 January 2022
Published: 27 January 2022
Issue Date: October 2022
DOI: https://doi.org/10.1007/s00362-022-01291-2

Keywords

Mathematics Subject Classification

62

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Replication study design: confidence intervals and commentary

Abstract

Access this article

Similar content being viewed by others

Power priors for replication studies

Assessing evidence for replication: A likelihood-based approach

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Data Availability

Code Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Mathematics Subject Classification

Navigation

Replication study design: confidence intervals and commentary

Abstract

Access this article

Similar content being viewed by others

Power priors for replication studies

Assessing evidence for replication: A likelihood-based approach

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Data Availability

Code Availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Mathematics Subject Classification

Search

Navigation