Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Tygert, Mark

doi:10.1007/s10444-023-10068-6

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Open access
Published: 04 September 2023

Volume 49, article number 70, (2023)
Cite this article

Download PDF

You have full access to this open access article

Advances in Computational Mathematics Aims and scope Submit manuscript

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Download PDF

Mark Tygert¹

246 Accesses
Explore all metrics

Abstract

The author’s recent research papers, “Cumulative deviation of a subpopulation from the full population” and “A graphical method of cumulative differences between two subpopulations” (both published in volume 8 of Springer’s open-access Journal of Big Data during 2021), propose graphical methods and summary statistics, without extensively calibrating formal significance tests. The summary metrics and methods can measure the calibration of probabilistic predictions and can assess differences in responses between a subpopulation and the full population while controlling for a covariate or score via conditioning on it. These recently published papers construct significance tests based on the scalar summary statistics, but only sketch how to calibrate the attained significance levels (also known as “P-values”) for the tests. The present article reviews and synthesizes work spanning many decades in order to detail how to calibrate the P-values. The present paper presents computationally efficient, easily implemented numerical methods for evaluating properly calibrated P-values, together with rigorous mathematical proofs guaranteeing their accuracy, and illustrates and validates the methods with open-source software and numerical examples.

Article PDF

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Article Open access 01 April 2016

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Article Open access 05 May 2021

Violating the normality assumption may be the lesser of two evils

Article Open access 07 May 2021

Data availability

The data sets generated during and/or analyzed during the current study are available in the following repositories: (1) https://github.com/facebookresearch/cdeets (for all synthetic data sets) and (2) https://www2.census.gov/programs-surveys/acs/data/pums/2019/1-Year (for California households file csv_hca.zip—which includes the file psam_h06.csv that our software processes—from the American Community Survey of the US Census Bureau); MIT-licensed open-source codes in Python 3 and shell scripts that automatically reproduce all figures and statistics of the present paper are publicly available in the repository cdeets at https://github.com/facebookresearch/cdeets

References

Tygert M.: Cumulative deviation of a subpopulation from the full population. J Big Data 8(117), 1–60 (2021b). https://arxiv.org/abs/2008.01779
Tygert M.: A graphical method of cumulative differences between two subpopulations. J Big Data 8(158), 1–29 (2021c). https://arxiv.org/abs/2108.02666
Kloumann I, Korevaar H, McConnell C, Tygert M, Zhao J.: Cumulative differences between paired samples. Tech. Rep. 2305.11323 (2023). arXiv: https://arxiv.org/abs/2305.11323
Tygert M.: Controlling for multiple covariates. Tech. Rep. 2112.00672 (2021a). arXiv: https://arxiv.org/abs/2112.00672
Arrieta-Ibarra I, Gujral P, Tannen J, Tygert M, Xu C.: Metrics of calibration for probabilistic predictions. J Mach Learn Res 23, 1–54 (2022). https://arxiv.org/abs/2205.09680
Lee D, Huang X, Hassani H, Dobriban E (2022) T-Cal: an optimal test for the calibration of predictive models. Tech. Rep. 2203.01850. arXiv
Delgado, M.A.: Testing the equality of nonparametric regression curves. Stat Probab Lett 17(3), 199–204 (1993)
Article MathSciNet MATH Google Scholar
Diebolt, J.: A nonparametric test for the regression function: asymptotic theory. J Stat Plan Inference 44(1), 1–17 (1995)
Article MathSciNet MATH Google Scholar
Stute, W.: Nonparametric model checks for regression. Ann Stat 25(2), 613–641 (1997)
Article MathSciNet MATH Google Scholar
Kuiper, N.H.: Tests concerning random points on a circle. Proc Koninklijke Nederlandse Akademie van Wetenschappen Series A 63, 38–47 (1962)
MATH Google Scholar
Kolmogorov, A.N.: Sulla determinazione empirica di una legge di distribuzione (On the empirical determination of a distribution function). Giorn Ist Ital Attuar 4, 83–91 (1933)
Google Scholar
Smirnov, N.: On the estimation of the discrepancy between empirical curves of distribution for two independent samples. Bulletin Mathématique de l’Université de Moscou 2(2), 3–11 (1939)
MathSciNet Google Scholar
Feller, W.: The asymptotic distribution of the range of sums of independent random variables. Ann Math Stat 22(3), 427–432 (1951)
Article MathSciNet MATH Google Scholar
Darling, D.A., Siegert, A.J.F.: The first passage problem for a continuous Markov process. Ann Math Stat 24(4), 624–639 (1953)
Article MathSciNet MATH Google Scholar
Ciesielski, Z., Taylor, S.J.: First passage times and sojourn times for Brownian motion in space and the exact Hausdorff measure of the sample path. Trans Am Math Soc 103(3), 434–450 (1962)
Article MathSciNet MATH Google Scholar
Masoliver J.: Extreme values and the level-crossing problem: an application to the Feller process. Phys Rev E 89(4), 042106 (2014)

Download references

Acknowledgements

We would like to thank Kamalika Chaudhuri, Imanol Arrieta Ibarra, Michael Rabbat, Jonathan Tannen, Susan Zhang, and the anonymous reviewers.

Author information

Authors and Affiliations

Fundamental Artificial Intelligence Research, Meta Platforms, Inc., 786 Coleman Ave. Apt. L, Menlo Park, CA, 94025-2440, USA
Mark Tygert

Authors

Mark Tygert
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark Tygert.

Ethics declarations

Conflict of interest

Meta Platforms, Inc. employs the author. The author receives a salary and stock from Meta.

Additional information

Communicated by: Akil Narayan

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tygert, M. Calibration of P-values for calibration and for deviation of a subpopulation from the full population. Adv Comput Math 49, 70 (2023). https://doi.org/10.1007/s10444-023-10068-6

Download citation

Received: 14 November 2022
Accepted: 11 July 2023
Published: 04 September 2023
DOI: https://doi.org/10.1007/s10444-023-10068-6

Keywords

MSC codes

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Abstract

Article PDF

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Violating the normality assumption may be the lesser of two evils

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

MSC codes

Navigation

Calibration of P-values for calibration and for deviation of a subpopulation from the full population

Abstract

Article PDF

Similar content being viewed by others

Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

Estimating power in (generalized) linear mixed models: An open introduction and tutorial in R

Violating the normality assumption may be the lesser of two evils

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

MSC codes

Search

Navigation