Abstract
In 1992, in a famous paper, Deville and Särndal proposed the calibration method in order to adjust samples on known population totals. This paper had a very important impact in the theory and practice of survey statistics. In this paper, we propose a rigorous formalization of the calibration problem viewed as an optimization problem. We examine the main calibration functions and we discuss the question of the existence of solutions. We also propose an alternate way of solving the optimization problem given by the calibration principle. We finally present a set of simulations in order to compare the different methods.
Similar content being viewed by others
References
Andersson C, Lennart N (1998) A user’s guide to clan 97: a SAS-program for computation of point- and standard error estimates in sample survey. Technical report, Statistics Sweden
Beaumont JF, Bocci C (2008) Another look at ridge calibration. Metron 66(1):5–20
Berger YG (2018) Empirical likelihood approaches in survey sampling. Surv Statistician 78:22–31
Berger YG, De La Riva Torres O (2016) Empirical likelihood confidence intervals for complex sampling designs. J Roy Stat Soc B78(2):319–341
Bethlehem JG, Keller WJ (1987) Linear weighting of sample survey data. J Off Stat 3:141–153
Boyd S, Vandenberghe L (2004) Convex optimization. Cambridge University Press, Cambridge
Brick MJ (2013) Unit nonresponse and weighting adjustments: a critical review. J Off Stat 29(3):329–353
Cassel CM, Särndal CE, Wretman JH (1976) Some results on generalized difference estimation and generalized regression estimation for finite population. Biometrika 63:615–620
Chang T, Kott PS (2008) Using calibration weighting to adjust for nonresponse under a plausible model. Biometrika 95:555–571
Chaudhuri S, Handcock MS, Rendall MS (2008) Generalized linear models incorporating population level information: an empirical-likelihood-based approach. J R Stat Soc Ser B (Stat Methodol) 70(2):311–328
Chen J, Qin J (1993) Empirical likelihood estimation for finite populations and the effective usage of auxiliary information. Biometrika 80:107–116
Chen J, Sitter RR (1999) A pseudo empirical likelihood approach to the effective use of auxiliary information in complex surveys. Statistica Sinica 9:385–406
Chen J, Wu C (1999) Estimation of distribution function and quantiles using the model-calibrated pseudo empirical likelihood method. Statistica Sinica 12:1223–1239
Chen J, Sitter RR, Wu C (2002) Using empirical likelihood methods to obtain range restricted weights in regression estimators for surveys. Biometrika 89(1):230–237
Cholakian V (1980) Un exemple d’application de diverses méthodes d’ajustement d’un tableau à des marges imposées. Les Cahiers de l’Analyse des Données 5:173–176
Cornfield J (1944) On samples from finite populations. J Am Stat Assoc 39:236–239
Davies G, Gillard J, Zhigljavsky A (2015) Calibration in survey sampling as an optimization problem. In: Migdalis A, Karakitsiou A (eds) Optimization, control, and applications in the information age. Springer, New York, pp 67–89
Deming WE (1948) Statistical adjustment of data. Wiley, New York
Deming WE (1950) Some theory of sampling. Dover Publications, New York
Deming WE, Stephan FF (1940) On a least square adjustment of sampled frequency table when the expected marginal totals are known. Ann Math Stat 11:427–444
Demnati A, Rao JNK (2004) Linearization variance estimators for survey data (with discussion). Surv Methodol 30:17–34
Deville JC (1988) Estimation linéaire et redressement sur informations auxiliaires d’enquêtes par sondage. In: Monfort A, Laffond JJ (eds) Mélanges économiques: Essais en l’honneur de Edmond Malinvaud. Economica, Paris, pp 915–927
Deville JC (1998) La correction de la non-réponse par calage ou par échantillonnage équilibré. Technical report, Insee, Paris, recueil de la Section des méthodes d’enquête
Deville JC (2000) Generalized calibration and application to weighting for non-response. In: Compstat - proceedings in computational statistics: 14th symposium held in Utrecht. The Netherlands, Springer, New York, pp 65–76
Deville JC (2002) La correction de la nonréponse par calage généralisé. Actes des Journées de Méthodologie Statistique. Insee-Méthodes, Paris, pp 3–20
Deville JC (2004) Calage, calage généralisé et hypercalage. Technical report, Insee, Paris
Deville JC, Särndal CE (1990) Estimateur par calage et technique de ratissage généralisé dans les enquêtes par sondage. Technical report, Insee, Paris
Deville JC, Särndal CE (1992) Calibration estimators in survey sampling. J Am Stat Assoc 87:376–382
Deville JC, Särndal CE, Sautory O (1993) Generalized raking procedure in survey sampling. J Am Stat Assoc 88:1013–1020
Dupont F (1994) Calibration used as a nonresponse adjustment. In: Diday E, Lechevallier Y, Schader M, Bertrand P, Burtschy B (eds) New approaches in classification and data analysis. Springer, Berlin, pp 539–548
Estevao VM, Särndal CE (2000) A functional form approach to calibration. J Off Stat 16:379–399
Estevao VM, Särndal CE (2006) Survey estimates by calibration on complex auxiliary information. Int Stat Rev 74:127–147
Estevao VM, Hidiroglou MA, Särndal CE (1995) Methodological principles for a generalized estimation system at statistics canada. J Off Stat 11:181–204
Froment R, Lenclud B (1976) Ajustement de tableaux statistiques. Annales de l’Insee 22–23:29–53
Fuller WA, Isaki CT (1981) Currents topics in survey sampling. In: Krewski D, Platek R, Rao JNK (eds) Survey design under superpopulation models. Academic Press, New York, pp 196–226
Fuller WA, Loughin MM, Baker HD (1994) Regression weighting in the presence of nonresponse with application to the 19871988 nationwide food consumption survey. Surv Methodol 20:75–85
Goga C, Shehzad MA (2010) Overview of ridge regression estimators in survey sampling. Technical report, Université de Bourgogne, Dijon, France
Graf M (2011) Use of survey weights for the analysis of compositional data. In: Pawlowsky-Glahn V, Buccianti A (eds) Compositional data analysis: theory and applications. Wiley, Chichester, pp 114–127
Guandalini A, Tillé Y (2017) Design-based estimators calibrated on estimated totals from multiple surveys. Int Stat Rev 85:250–269
Guggemos F, Tillé Y (2010) Penalized calibration in survey sampling: design-based estimation assisted by mixed models. J Stat Plan Inference 140(11):3199–3212
Güler O (2010) Foundations of optimization, vol 258. Graduate texts in mathematics. Springer, New York
Hartley HO, Rao JNK (1968) A new estimation theory for sample survey. Biometrika 55:547–557
Haziza D, Lesage É (2016) A discussion of weighting procedures for unit nonresponse. J Off Stat 32(1):129–145
Horvitz DG, Thompson DJ (1952) A generalization of sampling without replacement from a finite universe. J Am Stat Assoc 47:663–685
Huang ET, Fuller WA (1978) Non-negative regression estimation for sample survey data. In: Proceedings of the social statistics section of the American Statistical Association, pp 300–305
Isaki CT, Fuller WA (1982) Survey design under a regression population model. J Am Stat Assoc 77:89–96
Kim JK, Park H (2010) Calibration estimation in survey sampling. Int Stat Rev/Revue Internationale de Statistique 78(1):21–39
Kott PS (1994) A note on handling nonresponse in surveys. J Am Stat Assoc 89:693–696
Kott PS (2006) Using calibration weighting to adjust for nonresponse and coverage errors. Surv Methodol 32:133–142
Kott PS (2009) Calibration weighting: combining probability samples and linear prediction models. In: Pfeffermann D, Rao CR (eds) Handbook of statistics, Part B: sampling, vol 29. Elsevier/North-Holland, New York, Amsterdam, pp 55–82
Kott PS, Chang T (2010) Using calibration weighting to adjust for nonignorable unit nonresponse. J Am Stat Assoc 105(491):1265–1275. https://doi.org/10.1198/jasa.2010.tm09016
Le Guennec J, Sautory O (2002) CALMAR2: une nouvelle version de la macro CALMAR de redressement d’échantillon par calage. Actes des Journées de Méthodologie. Insee, Paris, Paris, pp 33–38
Lemel Y (1976) Une généralisation de la méthode du quotient pour le redressement des enquêtes par sondages. Annales de l’Insee 22–23:273–281
Lesage É, Haziza D, D’Haultfoeuille X (2018) A cautionary tale on instrumental calibration for the treatment of nonignorable unit nonresponse in surveys. J Am Stat Assoc 114:1–28
Lumley T (2010) Survey: analysis of complex survey samples. R package version 3.23-0, The Comprehensive R Archive Network
Lundström S, Särndal CE (1999) Calibration as a standard method for treatment of nonresponse. J Off Stat 15:305–327
Madre JL (1980) Méthodes d’ajustement d’un tableau à des marges. Les Cahiers de l’Analyse des Données 5:87–99
Matei A, Tillé Y (2007) Computational aspects of order \(\pi ps\) sampling schemes. Comput Stat Data Anal 51:3703–3717
Matei A, Tillé Y (2016) The R sampling package, Version 2.8. Université de Neuchâtel, Neuchâtel
Narain RD (1951) On sampling without replacement with varying probabilities. J Indian Soc Agric Stat 3:169–174
Nascimento Silva PLD, Skinner CJ (1997) Variable selection for regression estimation in finite populations. Surv Methodol 23(1):23–32
Nieuwenbroek NJ, Boonstra HJ (2002) Bascula 4. 0 for weighting sample survey data with estimation of variances. Surv Statistician Softw Rev 46:6–11
Owen AB (1988) Empirical likelihood ratio confidence intervals for a single functional. Biometrika 75(2):237–249
Park M, Yang M (2008) Ridge regression estimation for survey samples. Commun Stat Theory Methods 37(4):532–543
Rebecq A (2017) icarus: Calibrates and Reweights Units in Samples, likelihood, estimation. R package version 0.3.0. The Comprehensive R Archive Network
Roy G, Vanheuverzwyn A (2001) Redressement par la macro CALMAR : applications et pistes d’amélioration. Traitements des fichiers d’enquêtes, éditions PUG pp 31–46
Särndal CE (1980) On \(\pi \)-inverse weighting versus best linear unbiased weighting in probability sampling. Biometrika 67:639–650
Särndal CE (2007) The calibration approach in survey theory and practice. Surv Methodol 33:99–119
Särndal CE, Lundström S (2005) Estimation in surveys with nonresponse. Wiley, New York
Särndal CE, Swensson B, Wretman JH (1992) Model assisted survey sampling. Springer, New York
Sautory O (1993) La macro calmar, redressement d’échantillon par calage sur marges. Tech. rep., Séries des documents de travail de la Direction des Statistiques Démographiques et Sociales, F9310, Insee, Paris
Sautory O, Le Guennec J (2003) La macro CALMAR2: redressement d’un échantillon par calage sur marges - documentation de l’utilisateur. Technical report, Insee, Paris
Shah BV (1981) SESUDAAN, standard errors program for computing of standardized rates from sample survey data. Research Triangle Institute, Research Triangle Park
Shah BV, Holt MM, Folsom RE (1977) Inference about regression models from sample survey data. Bull Int Stat Inst 47(3):43–57
Shah BV, Folsom RE, Harrell F, Dillard C (1984) Survey data analysis software for logistic regression. Research Triangle Institute, Research Triangle Park
Shah BV, Folsom RE, LaVange LM, Wheeless SC, Boyle KE, Williams RL (1993) Statistical methods and mathematical algorithms used in SUDAAN. Research Triangle Institute, Research Triangle Park
Shah BV, Barnwell BG, Bieler GS et al (1997) SUDAAN user’s manual, release 7.5., vol 67. Research Triangle Institute, Research Triangle Park
Stephan FF (1942) An iterative method of adjusting sample frequency data tables when expected marginal totals are known. Ann Math Stat 13:166–178
Thionet P (1959) L’ajustement des résultats des sondages sur ceux des dénombrements. Revue de l’Institut International de Statistique 27:8–25
Thionet P (1976) Construction et reconstruction de tableaux statistiques. Annales de l’Insee 22–23:5–27
Tillé Y, Matei A (2016) sampling: Survey Sampling. http://CRAN.R-project.org/package=sampling, R package version 2.8
Vanderhoeft C (2001) Generalised calibration at Statistics Belgium SPSS module gCALIBS and current practices. Technical report, statistics Belgium working paper no. 3
Vanderhoeft C, Waeytens E, Museux JM (2001) Generalised calibration with SPSS 9. 0 for Windows baser. In: Droesbeke JJ, Lebart L (eds) Enquêtes. Modèles et Applications. Dunod, Paris, pp 404–415
Wu C, Rao JNK (2006) Pseudo-empirical likelihood ratio confidence intervals for complex surveys. Can J Stat/La revue canadienne de statistique 34(3):359–376
Wu C, Sitter RR (2001) A model-calibration approach to using complete auxiliary information from survey data. J Am Stat Assoc 96:185–193
Yule GU (1912) On the methods of measuring association between two attributes. J Roy Stat Soc 75(6):579–652
Acknowledgements
The authors thank the Swiss Federal Statistical Office (FSO) which partially supported this work as well as the three reviewers for their comments and efforts towards improving our manuscript.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This invited paper is discussed in comments available at: http://dx.doi.org/10.1007/s11749-019-00682-2; http://dx.doi.org/10.1007/s11749-019-00683-1; http://dx.doi.org/10.1007/s11749-019-00684-0; http://dx.doi.org/10.1007/s11749-019-00686-y; http://dx.doi.org/10.1007/s11749-019-00687-x
Rights and permissions
About this article
Cite this article
Devaud, D., Tillé, Y. Deville and Särndal’s calibration: revisiting a 25-years-old successful optimization problem. TEST 28, 1033–1065 (2019). https://doi.org/10.1007/s11749-019-00681-3
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11749-019-00681-3