Introduction

The United States Census Bureau will implement a formal differential privacy disclosure avoidance system to protect respondent privacy for the first time in publishing Census 2020 data. Historically, respondent confidentiality was protected through various disclosure avoidance techniques, including suppression, noise infusion, and data swapping (Lauger et al., 2014; McKenna, 2018). The census data that users know and depend on, even decennial “counts,” have for decades had some level of information reduction and perturbation included to protect privacy. Given recent technological advancements and the proliferation of data from third party sources, the Bureau determined after Census 2010 that a modernized system of disclosure avoidance is necessary to avoid the potential for attackers to reconstruct Census data and identify respondents (Abowd, 2019). The Bureau committed to publishing 2020 Census data using a new “formally private system built on a set of differentially private algorithms” (US Census Bureau, 2018), known as “differential privacy” (DP).

Differential privacy attempts to balance data accuracy with privacy, recognizing trade-offs between the two principles. It is based on a mathematical system of quantifying privacy risk and building a synthetic dataset that, while maintaining data structure, introduces formal perturbations to the raw data where the amount of privacy leaked can be quantified and set so as not to exceed a certain threshold. This threshold – referred to as epsilon, or the Privacy Loss Budget (PLB) – for Census 2020 is to be set by the Data Stewardship Executive Policy Committee in 2021. The committee allocates a specific amount of the privacy loss budget for each variable-by-geography combination, so that combinations with greater allocations of this budget will be more accurate than those with less budget. PLB for the PL 94-171 redistricting data file was set June 9, 2021. A decision on the PLB for the Demographic and Housing Characteristics (DHC file), which includes age-specific data necessary for net migration estimates, will come later.

Differentially private approaches are based in a growing body of mathematical theory and research but have not been applied previously at the scale of the US decennial census. Implementing a DP approach at this scale is ground-breaking work, the research and development for which the Census Bureau is leading (US Census Bureau, 2018). The level of complexity involved in applying such a technique, and determining all of the associated allocations of privacy loss budget to appropriately balance accuracy and privacy for the wide variety of data use cases associated with the US Census, is a monumental challenge.

The Census Bureau is currently working to determine how to allocate the privacy loss budget among various uses. They are doing so by reviewing and prioritizing existing “use cases” – uses of census data for specific purposes including political representation and redistricting, funding allocation, regulatory functions and meeting legal mandates, research, and planning. In an effort to engage with the user community, build transparency, and evaluate the impact of differential privacy applications for various uses, the Bureau has (as of May 20, 2021) published five versions of differentially private 2010 demonstration data that can be compared to the published Census 2010 Summary File 1 results. The first four versions maintained the same PLB, but improvements to processing algorithms have continually been made to reduce error (US Census Bureau, 2021a). The April 2021 version included an increased PLB to epsilon = 12.2, which is almost three times larger than prior demonstration products and shows considerable increases in data accuracy (US Census Bureau, 2021b).

The purpose of this research brief is to evaluate the impacts of differential privacy application on one important and widely used “use case”: Net Migration Estimates by Age, Sex, Race, and Hispanic-Origin for US Counties.Footnote 1 For each decade since the 1950s, demographers have generated high-quality net migration estimates by age, sex, and race for US counties using decennial census data as the starting and ending populations. Before the series can be extended to 2010–2020, it is critical to know the accuracy of the differentially private Census 2020 data and, correspondingly, what level of accuracy we can expect in the net migration estimates. In estimating these impacts, we aim to (1) inform decision making in the US Census Bureau for setting and allocating the PLB in the DHC file; (2) evaluate the level of detail (age-race-ethnic breakdown) that can be provided accurately in the 2010–2020 net migration estimates; (3) draw attention to the potential for DP errors to cluster in space; and (4) raise awareness among data users more broadly who are trying to understand the potential impact of DP implementation on data quality for various uses.

Net Migration Estimates for US Counties

As noted above, demographers have generated net migration estimates by age, sex, and various race and ethnicity categories for US counties from the 1950s through the 2000s (Bowles et al., 1965 and 1977; Fuguitt et al., 1993; Voss et al., 2004; White et al., 1992; Winkler et al., 2013a). Currently, our research team is working to extend this series by producing a similar dataset for the 2010–2020 decade. The resulting series will be a longitudinal dataset of 70 years of age-specific net migration for all US counties. The existing data are available for download through ICPSR and are served to the public on an interactive website at www.netmigration.wisc.edu where users can generate custom maps and charts, and download data. The series has been widely used among demographers studying patterns of migration, applied demographers generating population estimates and projections, and planners and analysts in the business community. The 2000–2010 data/documentation has been downloaded from ICPSR over 700 times since their release in 2013 (ICPSR, 2020), and the interactive website has had over 200,000 visits including over 10,500 data downloads.

Estimates for the 2010–2020 decade, as in prior decades, are being derived using a forward cohort residual method, subtracting a measure of natural increase from population change over the period to generate net migrants. For past decades, this process has produced highly accurate and reliable data given that they are generated using high-quality data that avoid sampling error, including enumerations from the decennial censuses conducted at the start and end of each decade and administrative records of US births and deaths from the National Center for Health Statistics. As such, the 2010–2020 estimates rely on Census 2020 for a count of the population at the end of the decade in each county. These final populations will be compared to an expected population, which the team is constructing by aging the starting population observed at Census 2010 forward ten years, adding in new births, and subtracting out deaths. In this approach, the difference between the final Census 2020 population and the expected 2020 population must be due to net migration. The central concern is that if differential privacy substantially alters the published Census 2020 counts at the county level, then the net migration estimates will be affected and their accuracy diminished.

The net migration data are unique in that they offer age-specific estimates of net migration at a relatively small scale (the county) for the entire country. Migration is highly selective by age, and age patterns of migration vary tremendously across space (Johnson & Winkler, 2015). Age-specific migration’s spatial distribution influences the age structure of communities, with significant implications for population aging, child health, education, and economic well-being. Selective migration by age, sex, and race/ethnicity alters counties’ population composition, having significant implications for health service provisioning and staffing, service infrastructure development, economic and labor market conditions, fertility, and mortality. Similar-quality data are not available from other sources. Migration estimates derived from the American Community Survey are the closest, but those estimates cannot be broken down by detailed demographic characteristics at the county level given the relatively small sample size and related margins of error.

Data/Methods

This study evaluates the impact of differential privacy on the accuracy of net migration estimates by age and Hispanic-origin for US counties. We take a counterfactual approach, comparing the net migration estimates published for the 2000–2010 decade with a hypothetical set of DP net migration estimates produced using differentially private Census 2010 data. In our approach, we compare the published estimates to what they would have been had the end-of-decade population been subjected to differential privacy. Given that this is the same scenario currently facing the Census 2020 counts, our approach should offer a realistic estimate of the impacts of differential privacy on the anticipated 2010–2020 net migration estimates.

In our analysis, we address the following questions:

  1. 1.

    How different (on average) are DP versions of net migration estimates than the originally published net migration estimates by five-year age group?

  2. 2.

    To what extent are Hispanic net migration estimates affected by DP application?

  3. 3.

    In how many counties are the DP estimates fit for use versus not fit for use, and how is fitness distributed by population size and across space?

  4. 4.

    To what extent does DP preserve the age signature in a county’s net migration estimates?

  5. 5.

    Would collapsing estimates into broader age groups (~ 20 years vs. 5 years) make DP estimates fit for use?

Data

There are three data sources required for comparison. First, we use published county-level net migration estimates by age, sex, race, and Hispanic origin for the 2000–2010 decade (Winkler et al., 2013a). We compare these original estimates to similar ones constructed using two sets of differentially private Census 2010 demonstration data released by the US Census Bureau in 2019 and 2020. The first demonstration data were released in Fall 2019, known as Sprint 1. The Bureau released the second data metrics in May 2020 (Sprint 2) and then re-released them as privacy-protected microdata in July 2020 (Van Riper et al., 2020). Although the Bureau released additional DP demonstration data in September and November 2020 and in April 2021, these most recent releases did not include age breakdowns, other than over/under 18. Consequently, we are not able to evaluate the most recent releases in terms of how DP would impact age-specific net migration estimates given the finer age granularity of the migration estimates.

Processing

We organized DP demonstration data into the same age groups as the original net migration estimates, applying the same process that was used to structure the 2010 final populations when the team created the 2000–2010 net migration estimates. Although the estimates have been published with more detailed race/ethnic and sex breakdowns, we have chosen to focus first on total net migration by five-year age group and then to examine data on the Hispanic population by five-year age group. This approach enables us to gain a sense of how race/ethnic breakdowns could be impacted by DP implementation. In order to maintain consistency with the 2010 migration estimates’ final population, the DP data were adjusted for net undercount and overcount. Drawing on findings from the US Census Bureau’s 2010 Demographic Analysis and Census Coverage Measurement studies, Winkler et al. (2013b) derived adjustment percentages specific to each sex and five-year age group for the migration estimates’ race and ethnicity categories. We apply those same adjustments to the DP data in our analysis. Finally, we exclude from this analysis counties that experienced boundary changes between 2000 and 2010 (n = 109, including Virginia independent cities), yielding a total of 3133 valid observations.

This harmonization generates a DP final 2010 population analogous to the final 2010 net migration estimates’ population. We calculate a DP version of the estimate of net migrants by substituting the DP value for the Summary File 1-based 2010 population count. We similarly construct a DP version of the net migration rate (per 100 expected population). The expected population for 2010 is the same for the original version and the DP version, as this value relies on the starting population from Census 2000 and natural increase observed over the 2000–2010 decade.

The original formulae for calculating net migrants and net migration rates, where M = net migrants and R = net migration rate:

$${\text{M }} = {\text{ final 2}}0{\text{1}}0{\text{ population }} - {\text{ expected 2}}0{\text{1}}0{\text{ population}},$$
$${\text{R }} = {\text{ }}\left( {{\text{M}}/{\text{expected 2}}0{\text{1}}0{\text{ population}}} \right) * {\text{1}}00.$$

The differentially private formulae for calculating net migrants and net migration rates, where DPM = differentially private net migrants and DPR = differentially private net migration rate:

$${\text{DPM }} = {\text{ DP final 2}}0{\text{1}}0{\text{ population }} - {\text{ expected 2}}0{\text{1}}0{\text{ population}},$$
$${\text{DPR }} = {\text{ }}\left( {{\text{DPM}}/{\text{expected 2}}0{\text{1}}0{\text{ population}}} \right) * {\text{1}}00.$$

Metrics for Comparison

To compare the hypothetical DP 2000–2010 net migration estimates to the actual ones, we use a series of metrics that are commonly used to evaluate error in population estimates and projections. These are summarized below and are similar to those the Census Bureau is using to compare differentially private Census 2010 data to released Summary File 1 Census 2010 data (US Census Bureau, 2020).

Median Absolute Error (MAE). Measures the “average” numerical difference between net migration estimates and the DP version of the same estimates. Calculated as Abs(DPM – M) for each of the 3133 counties, then take the median. Or, for rates, Abs(DPR - R) for each county, then take the median.

Median Absolute Percent Error (MAPE). Measures the “average” relative difference between net migration estimates and the DP version of the same estimates. Calculated as [Abs(DPM – M)/M] for each of the 3133 counties, then take the median.

Median Algebraic Percent Error (MALPE). Identifies systematic bias by showing the average direction of error (+ or −) and provides an alternative relative measure of error. The MALPE is generally closer to zero than the MAPE, because no absolute value is taken, and so positive and negative errors cancel out one another. Calculated as ((Σ((DPM – M)/M))/N)*100, where N= number of observations (3133 counties).

We choose to present the median versions, rather than the mean, because means are heavily skewed by a small number of outliers with extreme differences, particularly when it comes to relative error. These three measures – MAE, MAPE, and MALPE – allow us to answer our first two research questions about the difference (on average) between DP versions of net migration estimates and the originally published estimates, and to summarize differences by age and Hispanic origin.

The outlying cases of extreme differences (glossed over by the “average” metrics) are also important, because some counties could end up with wildly inaccurate net migration estimates and, consequently, problematic analytic conclusions and policy decisions. To answer our third research question concerning fitness of use, we identify where DPRs are particularly inaccurate in several ways. First, we report the number of counties where the difference between published and DP net migration rates is <5, 5–9, 10–14, 15–19, and 20 or more by five-year age group. Because these are rates per 100 expected population, a difference of more than 5 percentage points in a rate could lead to meaningful misinterpretations and presents a challenge to the data’s fitness of use. To ease interpretation, we use a difference in the net migration rate of >5 as an indicator that the DP version is not fit for use. For example, in Mercer County, Ohio (population 40,814) the expected population at age group 65–69 is 1,576. The published estimate of net migrants age 65–69 is 117 (NMR = 7). The DP estimate is 193 net migrants (DPR = 18), such that the difference in rates is 11 percentage points and, thus, not fit for use in many applications, such as making population estimates or projections.

Next, we focus on error distributions by county population size for two select age groups: age 25–29 and age 65–69. Young adult migration is important for planning and community development, and migration propensities are highest among young adults. Correspondingly, we see DPRs performing relatively well at age 25–29. Retirees (age 65–69) are another important age group for migration that exhibit very different spatial patterns in comparison to young adults (Johnson & Winkler, 2015). We investigate interquartile and 90th percentile ranges of the absolute error between originally published NMRs and DPRs for these two age groups to offer a sense of the range of error by county population size. Then, we map counties by absolute error in NMRs for these age groups and run local indicators of spatial autocorrelation (LISA statistics) to explore the possibility of spatial clustering in error.

Rather than focusing on any one age group, many net migration estimates’ data users are interested in each county’s specific age pattern of migration, referred to as a county’s “net migration signature” (Johnson & Winkler, 2015; Johnson et al., 2005). Our fourth research question concerns how DP could impact counties’ signature pattern of age-specific migration. We evaluate this possibility by calculating a correlation coefficient (Pearson’s R) between each county’s series of age-specific migration estimates as originally published in comparison to the DP version. We then review summary statistics of these correlations.

Combined, these analyses reveal concerns with using DP Census 2020 data to generate net migration estimates by five-year age group. Therefore, we explore our fifth research question concerning one possible remedy: collapsing age groups into broader categories and recalculating metrics to assess the extent to which aggregation by age could improve data fitness of use.

Results

Differences Between Published and DP Net Migration Estimates

The differences between the DP version of net migrants and the original estimates are substantial. Table 1 shows median absolute errors (MAE), median absolute percent errors (MAPE), and median algebraic percent errors (MALPE) by five-year age group and Hispanic origin using the Sprint 2 demonstration microdata release from July 2020. Median differences in the total number of net migrants range from 50 at age 80–84 to 88 net migrants at age 0–4. In relative terms, MAPEs range from 20% at ages 25–29 where migration is more common (larger denominator) to over 100% at the youngest and oldest ages. MALPE values indicate that the median direction of error varies by age group, but that DP estimates tend to underestimate net migrants at the youngest and oldest ages. Median percentage point errors in net migration rates generally range from 3 to 5 per 100 for most age groups and are higher at older ages.

Table 1 Difference between DP estimates and published net migration estimates

Effect on Hispanic Estimates

Our results show that errors are considerably higher for Hispanic migration estimates. Median relative errors on the number of net migrants by five-year age groups range from 57% to over 300%. NMR errors average between 30 and 50 percentage points for most age groups and are larger at older ages where the number of Hispanics is much smaller. MALPE values are consistently negative for Hispanics across age groups, indicating a negative bias. This means that the differentially private net migrant estimates are consistently lower than estimates from the original data version. Overall, the findings demonstrate that the most recent DP demonstration data broken down by five-year age group and Hispanic are not fit for use.

Number and Types of Counties Most Affected

Beyond these “average” differences, it is important to recognize the number and types of counties where errors are extreme. These are the cases in which applying differential privacy could have the greatest impact on data interpretation and related decision making. Figure 1 shows the distribution of absolute errors in net migration rates across counties by five-year age group. About half of all counties have DP rates that may not be fit for use, with errors greater than 5 percentage points for five-year age groups. About 25% of counties show errors greater than 10 percentage points, and about 10% of counties have errors of more than 20 percentage points, indicating that the original rate is 20 per 100 population different from the DP version and could lead to seriously erroneous interpretations. For example, Iron County, Wisconsin (population 5916 in 2010) had a rate error of 47 percentage points at age group 25–29. This county saw a net out-migration of 197 young adults according to the originally published estimates (NMR = −47 per 100), while the DP version estimates a net in-migration of 1 young adult (NMR = 0.25 per 100). Even more counties have questionable or unfit DP rates at ages 65 and over, and errors get worse as age increases so that almost 40% of counties have errors over 20 percentage points at age group 85 plus.

Fig. 1
figure 1

Distribution of absolute error in net migration rates (DPR—NMR) by age group

Figure 2 illustrates the distribution of net migration rate errors by county population size at ages 25–29 and ages 65–69. For all county size groups at ages 25–29, median errors are near zero, suggesting that errors cancel one another out and there is no systematic difference by county size between DP and original versions of net migration estimates. In contrast, for ages 65–69, DP rates generally underestimate net migration in smaller population counties and overestimate net migration in larger population counties. For both age groups, interquartile ranges for errors are small for counties with a population of 50,000 or more, and in these higher population counties, there are few outliers with extreme errors. However, several counties with smaller populations, even those in the 10,000 to 49,000 range, have errors that call into question the accuracy and usability of the DP rates.

Fig. 2
figure 2

Distribution of error in net migration rate (DPR-NMR) by county population size. a Age 25–29. Five counties with NMR errors greater than 100 are excluded from this chart. b Age 65–69. Eight counties with NMR errors greater than 100 are excluded from this chart

Figure 3 shows the spatial distribution of these errors in net migration rates for ages 25–29 and 65–69, and Fig. 4 shows the propensity for the error to cluster in space using LISA statistics. Among both 25–29 and 65–69 year olds, extreme rates are most common in more rural areas, particularly across the Great Plains. Errors for both age groups are especially clustered in the Great Plains, with groups of counties that both significantly underestimate and overestimate net migration rates across the region. High-high clusters are places where DP rates overestimate net migration for those counties and their neighbors, whether underestimating the rate that people are leaving or overestimating the rate that people are moving in. Low-low clusters are counties where DP rates underestimate net migration for those counties and their neighbors (i.e., counties where DP overestimates the rate of people leaving and/or underestimates those moving in). Low-high and High-low outliers are counties with low error relative to their neighbors.

Fig. 3
figure 3

Spatial distribution of error in net migration rate (DPR-NMR) by county. A Age 25–29. B Age 65–69

Fig. 4
figure 4

Spatial clustering of error in net migration rates. A Age 25–29.. B Age 65–69. LISA statistics calculated by authors using inverse distance-squared weights matrix. Statistical significance is based on a corrected 95 percent confidence level (Sensitivity analysis with alternative weights matrices (first-order Queen contiguity and inverse distance) revealed similar results.)

Our results demonstrate systematic spatial clustering in the impact of differential privacy. Global tests of spatial autocorrelation (Moran’s i) indicate that errors are not statistically significantly autocorrelated (age 25–29, i = 0.004; age 65–69, i = 0.012). Although the lack of global spatial autocorrelation is encouraging, local indicators of association show significant spatial clustering in some regions and demonstrate an uneven bias in estimates across the nation (see Fig. 4). Among age group 25–29, DP rates overestimate net migration in areas of both the northern and southern Great Plains but underestimate net migration in the central Great Plains (Kansas, Oklahoma, eastern Colorado). DPRs also underestimate net migration rates in parts of the Intermountain West, and in parts of southern Georgia and the Florida panhandle, but overestimate net migration in parts of rural Oregon and California. Among age group 65–69, it is nearly the opposite, with DPR underestimating net migration in the northern and southern Great Plains and overestimating net migration in the central Great Plains, the Mississippi Delta, southeast Georgia, some parts of Appalachia, and Michigan’s western Upper Peninsula.

Comparing Net Migration Signatures

County migration signatures show unique patterns of in-/out-migration rates that are correlated across age groups. For instance, central cities tend to attract young adults but lose middle-aged adults, children, and retirees, while suburban counties often attract middle-aged adults and children but lose young adults; and rural retirement destination counties attract retirees but lose young adults (Johnson & Winkler, 2015). We analyze the extent to which the DP migration estimates accurately reflect the correlation in migration patterns across age groups.

Across all counties, the average correlation coefficient (Pearson’s R) comparing original net migration age signatures by five-year age group to the DP versions we constructed was relatively high at 0.80, but with considerable deviations. The signatures we compared included age groups 0–4 through 80–84 and excluded the terminal age group (85 +) due to inconsistencies and uncertainties in net migration estimates at the most advanced ages. Our findings show that 42% of counties (n = 1329) had an r > 0.9, indicating strong agreement in the signatures. However, 15 counties (0.5 percent) had a negative correlation, which was unexpected and suggests profound disagreement between the DP and NMR signatures. An additional 304 counties (10 percent) had r < 0.5, indicating little agreement between the DP and NMR signatures. As in prior analyses, larger population counties performed better than smaller population counties. Figure 5 shows an example for Iron County, Wisconsin (discussed previously), where r = 0.61. While the DP signature is generally similar to the original net migration signature, the DP version exhibits higher peaks and valleys and a more erratic migration pattern.

Fig. 5
figure 5

Net migration signature comparison- Iron County, WI

Collapsing Across Age Groups

The error observed in the information presented thus far calls into question the fitness for use of net migration estimates by five-year age group. We investigate how fit might be improved by collapsing estimates into broader age groups. While migration is highly selective by age, it may be worth losing some granularity in order to provide more accurate estimates. The www.netmigration.wisc.edu website, which serves data from prior decades, includes interactive maps using age groups 0–14, 15–24, 25–29, 30–54, 55–74, and 75 plus. These broader groupings combine ages that tend to have similar place-based migration patterns.

Table 2 shows differences between DPRs and published rates by county population size, noting the median absolute error in percentage points and the percent of counties with absolute rate errors over 5 per 100 (unfit for use, as we have defined here). Although error is generally reduced, it remains problematic for smaller population counties and for some age groups, especially older age groups. For counties overall, median absolute errors range from 1.5 percentage points for age group 30–54 to 5.8 for age 75 plus. At the oldest ages, rates would be unfit for use in 54% of counties. Even for age group 30–54, 18% of counties still show rates unfit for use. In counties with populations less than 10,000 (n = 676), the majority of counties show DPRs that would be unfit for use. For example, 51% of counties with populations between 5000 and 9999 would be unfit for use for age group 55–74, which is a critical age for understanding retirement migration patterns in rural America. Even counties with populations greater than 50,000 or 100,000 show considerable proportions with unfit rates at age group 75 plus (37% and 13%, respectively).

Table 2 Net migration rate errors for broader age groups by county population size

Conclusions

Overall, our findings indicate a substantial and concerning impact of differential privacy on county-level net migration estimates. Median differences in the number of net migrants by five-year age group attributable to DP range from 50 to 88. Relative differences are also high, with median absolute percent errors ranging from 15% (for age group 20–24) to over 100% at ages 75 and over. Percentage point differences in net migration rates are particularly concerning. According to our estimates, about half of all counties would have net migration rates with errors over 5 percentage points and, thus, would not be fit for use at any given five-year age group. Moreover, 88% of counties have at least one five-year age group where data would not be fit for use. Errors are especially high for populations aged 65 and over and for the Hispanic population. The findings indicate that age breakdowns by Hispanic origin at the county level may not be feasible under current DP implementation plans.

Most concerning are the cases where we see extreme differences, numerically and relatively, when comparing the DP version of estimates to the originally published data (about 10% of counties). Such errors would seriously challenge the accuracy of the 70-year net migration data series, undermining their utility. Moreover, large errors (as in the Iron County, WI example) could lead to critically erroneous decision making and planning with detrimental implications for counties and their residents. The relative lack of consistency in the net migration signatures raises additional concern, with about 37% of counties showing a less than 0.8 correlation between the DP version and published rates.

We find that inaccuracies are greatest in smaller population counties. Yet, errors are sizeable in many counties with populations ranging from 10,000 to 50,000. Collapsing into broader age groups reduces error, but this strategy does not eliminate or adequately address concerns. Altogether our findings are similar to Hauer and Santos-Lozado's evaluation using age-specific DP data for calculating county-level covid mortality (Hauer and Santos-Lozado, 2021)

We also find evidence of systematic error by age, race, and geography. DP estimates tend to underestimate net migrants at the youngest and oldest ages, and among Hispanics at almost all ages. Moreover, there is evidence of systemic spatial clustering in some regions. The negative implications of drawing extraneous interpretations based on inaccurate data are exacerbated by DP results systematically under- or overestimating net migration in entire regions of the country. For example, if DP versions of net migration rates overestimate net migration at age group 25–29 across the northern Great Plains, the extent of young adult out-migration, and associated planning responses, from that region may be unclear. Similarly, if net migration among those ages 65–69 in the Mississippi Delta is overestimated, it may appear that these counties are attracting more retirement migrants than they actually are, potentially leading to extraneous investments in services or community and economic development strategies. In addition to applied consequences, erroneous estimates will challenge scholars’ abilities to empirically assess the causes and consequences of age-, sex-, and race-specific migration patterns in the contemporary United States. To our knowledge, there has been little attention paid to the potential for problematic spatial clustering in the application of DP more broadly. Finding evidence of the problem here serves as a warning signal that it could apply to other census data applications as well. Future analyses of various DP applications should closely study the potential for spatial clustering of error.

In this analysis, we have only had the opportunity to compare Sprint 1 and Sprint 2 versions of the DP application to Census 2010. Various users have discovered substantial differences between the DP estimates in Sprint 1 and the published 2010 data, and raised serious concerns at a National Academies workshop in December 2019 (National Academies, 2019) as well as through feedback provided to the Census Bureau. Our analysis of the Sprint 2 version indicates limited improvement in comparison to Sprint 1. The Bureau reports further improvements to algorithms in more recent releases but, regrettably, these releases do not include the five-year age groups necessary for the net migration estimates.

Ultimately, the results shown here could change dramatically, depending on final decisions the Bureau’s Data Stewardship Executive Policy (DSEP) committee makes about how to implement DP, including where to set the privacy loss budget (PLB) and how to allocate it across variables and geographies. The Bureau’s recent indication that they will release redistricting data with an epsilon= 19.6, which indicates a PLB almost five times greater than was used in the demonstration data evaluated here (US Census Bureau, 2021b) suggests that the errors observed in this study may overestimate the ultimate impact of DP on the net migration estimates program. Therefore, we might think of the estimates we present as a worst case scenario for data accuracy. Still, the Bureau has not yet indicated how they will handle the PLB for the Demographic and Housing Characteristics (DHC) file, which includes the age, sex, and race/ethnicity breakdowns that will inform net migration estimates as well as various other uses. The fact that the Bureau found they need to raise the epsilon for the PL file to 19.6, in order to meet accuracy standards for redistricting in the PL 94-171 data release, suggests that an even larger epsilon will be required to produce reasonably accurate data in the DHC file with much more detailed data. Yet, as the epsilon increases, privacy diminishes. This will likely be a source of tension as DSEP considers how to implement DP in the DHC file.