1 Introduction

The global COVID-19 pandemic in early 2020 led to severe disruptions in nearly all aspects of life. Stay-at-home orders closed schools and non-essential businesses, generating widespread unemployment and large declines in economic activity that cannot yet be fully assessed. The social and economic costs of this crisis have been widely felt. A United Nations report suggests that global GDP could decline by 1% (United Nations 2020), while the managing director of the International Monetary Fund stated that she anticipates that the pandemic will generate “...the worst economic fallout since the Great Depression,” with global GDP contracting by 3% (Gopinath 2020). According to UNESCO, over 1.5 billion students, almost 90% of all learners, have had their schooling disrupted by this crisis. At least 189 countries have closed schools in response to the COVID-19 threat, which may interrupt human capital accumulation in children and contributes to the social costs of the pandemic (UNESCO 2020).Footnote 1

While most people alive today have never witnessed this scale of the pandemic, research on other epidemics may provide insights into the possible magnitude of the disruption.Footnote 2 In this paper, we examine the 1916 polio epidemic to measure how it impacted the educational attainment of children during the outbreak. The 1916 epidemic is unique in that it was the first and largest major polio epidemic in the USA and was one of the first major public health crises many nascent public health departments faced.Footnote 3 The 1916 epidemic proved to be the first of many outbreaks that would continue until the advent of the Salk vaccine in 1954, although Trevelyan et al. (2005) suggest that the 1916 epidemic was more intense than subsequent outbreaks.Footnote 4 While smaller in magnitude than the more famous 1918–1919 Spanish flu, the polio epidemic started in New York City and New Jersey, but then spread across the USA, resulting in mass quarantines and prolonged school closures at the start of the 1916–1917 academic school year.

The polio epidemic may have affected educational attainment in two ways. First, infected children may have missed school during the period of their illness and recovery. Second, even children who did not have symptoms missed school during periods of closure or because their parents refused to send them to school. Thus, we leverage geographic variation in the severity of the epidemic, as well as the fact that polio differentially affected children of different ages, to measure how the disease affected the educational attainment reported by these children as adults in the 1940 census. Children living in areas with high polio morbidity were more likely to be infected and more likely to endure school closures than children in less-affected areas. Children under 10 were more susceptible to polio, yet children aged 14 and older may have been more likely to quit school during a prolonged closure and join the labor force.

Our results show that children born in states with more reported polio cases had lower educational attainment compared to slightly older birth cohorts who would have already completed schooling before the 1916–1917 school year and that the decline in educational attainment varied depending on their age during the outbreak. We find that a one-standard-deviation increase in polio intensity reduced years of completed schooling by 0.07 years for children between the ages of 14 and 17 in 1916. For children who were 10 and under during the epidemic (and who were most susceptible to polio), a one-standard-deviation increase led to a 0.1 year reduction in years of completed education. These results are robust to controlling for 1918–1919 influenza intensity.

2 The pathology of polio and the 1916 epidemic

Polio is caused by the poliovirus, an enterovirus transmitted fecal-orally. Evidence on Egyptian stele (depicting an adult with a withered leg and crutch) dating from 1580–1350 BC suggests that polio has existed for thousands of years (Nathanson and Kew 2010). Although the existence of polio dates to antiquity, it was not until the late 19th century that epidemics began occurring in the USA and Europe. The most plausible explanation for this pattern is that before the late 19th and early twentieth centuries, most people were probably exposed to polio during infancy when infants had circulating maternal antibodies that made the infection much less severe. Over the next 100 years, improved public sanitation delayed exposure to the virus, thus increasing both the age of primary infection and the severity of the illness, since maternal antibodies were no longer circulating. As a result, epidemics that became increasingly severe were reported each summer and fall, and the average age of persons infected increased (Nathanson and Kew 2010; Centers for Disease Control and Prevention 2015).Footnote 5

In 1916, there were over 5000 deaths and 23,000 more documented infections, numbers that eclipsed those of previous (and subsequent) epidemics (Lavinder et al. 1918; Trevelyan et al. 2005). Table 1 details the number of reported polio cases and deaths by state during the epidemic. Nearly every state reported polio cases, with Massachusetts, New York, New Jersey, and Connecticut reporting the highest numbers of reported polio infections. While much of the epidemic was in the northeastern USA, there were also sizable outbreaks in Minnesota, Montana and Mississippi.

Table 1 State-level polio epidemic intensity, 1916

It is important to note that while cities may be more heavily impacted by modern epidemics, polio was unusual in that it was precisely better hygiene that predisposed victims to a later, and more severe, onset. In places with poor sanitation, children would have been exposed at much younger ages (often infancy, without obvious symptoms) and be less likely to suffer a severe case. While many cities had improved sewer and sanitation (thus leading to the large flare of the epidemic in 1916), rural areas had traditionally been “healthier” places to live from a sanitation standpoint. Lavinder et al. (1918) note that the epidemic was more prevalent and intense in small towns and the country than in large cities, and “...not due to mere chance but to some fundamental law” (p. 19).

The outbreak generated substantial fear. Commonly known as “infantile paralysis,” polio often caused mild (or no) symptoms but could attack the nervous system and cause permanent paralysis. Little was known about how the disease was spread, and no vaccine for polio would be available for another forty years. The disease transcended class lines and affected the poor and the rich alike. In New York City, for example, a large share of cases occurred among native-born children living in single-family dwellings with clean surroundings and indoor toilets (Lavinder et al. 1918).Footnote 6

3 Actions taken to curb the spread of polio

Public health officials struggled to contain the outbreak, which began in June and lasted until November. As the disease spread, officials became increasingly desperate. To prevent the spread of the epidemic and allay public fear, officials implemented various non-pharmaceutical interventions, such as strict quarantines. Placards were also placed in front of homes of infected individuals to notify the community of the public health risk. In at least one case, a child suffering from polio was forcibly removed from his home to a hospital, against the wishes of his parents (Emerson 1917). In New York City, children ages 16 and under were prohibited from leaving New York City for travel unless they produced “... a certificate that the premises occupied by them were free from poliomyelitis, and had been free from this disease since January 1, 1916.” This was supplemented by a medical examination of such travelers at the point of departure (Emerson 1917).

In New York City and Philadelphia, health officials ordered the city streets to be washed with millions of gallons of water each day (Offit 2005; Rogers 1992).Footnote 7 Government policy also called for a ban on public gatherings in areas where numerous cases were reported (Emerson 1917). In the summer of 1916, New York public health officials barred children from movie theaters and libraries, closed Sunday schools and banned picnics (The New York Times 1916c, e; Rogers 1992). Fears that domesticated animals could spread the disease led to the extermination of 72,000 cats and 8000 dogs (The New York Times 1916b). Even though drastic measures were undertaken to control the spread of the disease, hundreds of new infections were reported every week.

As autumn approached, fears that afflicted youths would infect their peers and a belief that quarantines could limit the virus’ spread led many school districts to delay the start of the 1916 school year until the epidemic waned. As the New York Department of Health noted, “The unprecedented virulence and extent of the existing epidemic, and unfamiliarity with the disease, has engendered in the public such a state of mind that concession to public alarm seemed advisable” (Emerson 1917). The extent and nature of these closures varied across political and geographic divisions of the USA due to the decentralized structure of the nation’s public health system.Footnote 8 For example, Pennsylvania and Vermont postponed the start of the school year across the entire state, while states such as Connecticut and New Jersey left the decision up to local public health authorities or local school boards.

New York City schools opened on September 25, two weeks later than the expected September 11th open date (The New York Times 1916d). Newspaper accounts show numerous other cities nationwide joined New York in postponing the start of school. Philadelphia delayed opening schools until September 18th (and then to October 2nd), and both Washington, D.C., and Fort Wayne schools were not reopened until October 2nd (The Washington Times Company 1916; Evening Public Ledger 1916a, b).

Schools closed across the nation even in states with small numbers of cases. For example, some schools closed in Seattle, with only five cases reported (The Tacoma Times 1916). But, even when schools may not have been officially closed, there is evidence that parents worried about sending their children to school.Footnote 9 For example, even after the delayed reopening in New York City, worried parents withheld their children from school; up to 200,000 students were absent the first few weeks after schools opened, and the district announced “leniency” for parents who failed to send their children to school (The New York Times 1916a).

To gain insight into the extent of school closures nationally and when they occurred, we searched newspaper records contained in the Chronicling America Newspaper Archive (Library of Congress 2020). Cities in the most heavily affected areas are most frequently mentioned, but newspaper accounts demonstrate that school closures tended to occur wherever an area experienced a breakout of polio. The epidemic started in June, accelerated in July and persisted through the start of the academic school year. These data most likely represent a subset of school closures, since not all newspapers are covered in the Library of Congress archive. Moreover, our search procedure may have failed to reveal all closures.Footnote 10 Many newspapers referred to school closures in different cities, or even in major cities in other states. Although we found some articles in which a city expressly stated schools would remain open (including Chicago, Detroit and Milwaukee), for most cities we could not ascertain with certainty that they did not close.

In Fig. 1, we plot the frequency of school delay announcements in the Chronicling America Newspaper Archive for all newspapers from July 1, 1916, to November 1, 1916. School postponement notifications in US newspapers began to increase in the weeks preceding the start of the academic school year (normally around September 5 or September 11), peaked during the two first weeks of September and gradually decreased until the first week of October. The latest public school start date we observe in the Chronicling America Newspaper Archive (Library of Congress 2020) was October 2, 1916, although some New England preparatory schools advertised that they postponed their start dates into the middle of October. The newspaper articles also reveal that the persistence of the epidemic caused public health officials and school boards to repeatedly push back school start dates. Washington, DC initially planned to start on time, then pushed back the start two weeks to September 25, and then finally postponed until October 2. Boston, MA did the same. The entire state of Pennsylvania also postponed school starts multiple times until the first week of October.

To dig deeper into the relationship between city-level school closures and polio outbreaks, we further searched for announcements of school closures in 161 cities for which we have polio morbidity data digitized by Van Panhuis et al. (2018). These were cities that voluntarily participated in the disease reporting system conducted by the US Public Health Service, which published this information in its weekly bulletins. We were able to find newspaper accounts of school postponement for 38 of these cities.Footnote 11 Table 10 in the appendix provides more detailed information on the city names, the dates of postponement, and the name of the newspaper in which we found the information. Of these 38 cities, 84% opened over 2 weeks late, with 60.5% opening on October 2, 1916. Only three (Chicago, Detroit, and Milwaukee) opened on time.

Figure 2 presents the relationship between weekly polio morbidity in these 38 cities and the timing of school postponements using data from Project Tycho.Footnote 12 We highlight the relationship between the epidemic timing and school postponement by separating this information by the timing of the school starts: schools that started on time, schools that delayed their start dates to a later date in September, and schools that delayed school starts into October. A mass of polio outbreaks in August and September coincided with delayed school starts in many affected cities. Polio outbreaks continued in these cities throughout October, and the epidemic ended in November.

In Fig. 3, we focus in on a subset of the reported data for cases under 100 per week. For many cities with schools that opened in October there large numbers of reported weekly polio cases throughout September and well into October. Schools that postponed only a few weeks into September generally had fewer reported cases of acute polio than either the schools that started in October. In the few schools that we confirmed opened on time, most city week observations report low numbers of polio infections. The exception to this pattern is the city of Chicago, which opened schools on time even while polio still spread throughout the community.

4 Polio and educational attainment

There is a broad and well-documented literature in economics and in economic history about the impact of early childhood health and economic shocks on later life outcomes (Almond et al. 2018). Within this literature, there are a number of papers that demonstrate the long-run consequences of infectious disease shocks in childhood. Beach et al. (2016) show that reduced early-life exposure to typhoid improved later life educational attainment, perhaps by improving cognitive functioning.Footnote 13 Bhalotra and Venkataramani (2013) also find that reduced infant exposure to waterborne disease is associated with improved test scores for girls and improved heights for boys. There is also evidence that individuals who do not contract infectious diseases during epidemics can also be affected. For example, Parman (2015) examines the impact of the 1918 influenza pandemic on the siblings of those children born during the outbreak and finds that older siblings received an additional three months of education, while younger siblings received slightly less education relative to children who did not have a sibling born during the pandemic. To our knowledge, no one has examined the case of polio in the USA, which is unique in that educational attainment may have been affected even despite the fact that relatively small numbers of people contracted polio.

The polio epidemic may have reduced educational attainment through two channels. Children who contracted the disease may have missed school while ill, thus reducing educational attainment, although this might depend on whether they contracted a mild form of the disease or paralytic polio. Gensowski et al. (2019) find that children who contracted paralytic polio in Denmark in 1952 were more likely to earn a university degree than children who did not, suggesting that these children shifted investment into education as their comparative advantage moved from jobs requiring certain physical abilities to those requiring more cognitive skills.

Other children may have experienced reduced educational attainment if they missed school for short periods of time due to school closures or avoidance, or if they delayed entry into formal schooling (for example if the parents of a would-be kindergartner decided to wait until the fall of 1917 to send the child to school). Younger children who have their schooling disrupted even for short periods of time may suffer negative effects. For example, Marcotte and Hemelt (2008) find that unscheduled school closings due to weather negatively affect student performance on third-grade state assessment examinations. Middle-grade students may have lower educational attainment if their learning is disrupted during key periods of development (Lloyd 1978; Hernandez 2011). Whether these effects persist in the long run is less clear. Card and Krueger (1992) find that men educated in states with higher-quality schools (measured by term length, student-teacher ratio and teacher wage) have higher returns from their schooling. Pischke (2007) uses data from a German school district that had a one-time shortening of the school year and finds that the short school year had no negative effect on earnings and employment later in life, although it did lead to more students repeating a grade and fewer students entering higher secondary school tracks.

Other research does suggest that interruptions in schooling at crucial points of development may have long-run impacts. Hernandez (2011) suggests that children in third-grade experience a pedagogical shift from “learning to read” toward “reading to learn.” A failure to develop this proficiency in reading on time appears to inhibit human capital accumulation of students and limits the potential to keep pace with their peers. Lloyd (1978) suggests that students who perform at a lower level than their peers may be more likely to drop out of school, with third-grade academic performance accurately predicting whether an individual would drop out of school nearly 70% of the time.Footnote 14

In addition to the effect on younger children of missing school, prolonged school closures may have caused older children of legal working age to exit school and enter the labor force. A sizable literature on the effects of compulsory schooling laws suggests that educational attainment is affected by increases in the legal age of school exit, increases in the age at which children can legally work and decreases in the maximum age of school entrance.Footnote 15 In 1910, 22 states had legislated 14 as the minimum age for employment in manufacturing (Moehling 1999).Footnote 16 Around 16.8% of males and 5.8% of females between the ages of 10 and 15 were active participants of the workforce in 1920, suggesting that many school-age individuals in 1916 would have possessed workforce alternatives to education (Carter and Sutch 1996). Together, this evidence implies that a notably large portion of youth ages 14 and older had viable and legal employment alternatives available to them in 1916. Moreover, if worried parents delayed their youngest children’s entry into school for a year, research by Deming and Dynarski (2008) suggests that these “redshirted” children may have less educational attainment than their non-redshirted peers since they reached a legal working age having acquired fewer years of formal schooling.

5 Data and empirical methodology

Using the 1916 polio epidemic as a natural experiment, this paper examines the effect of epidemics on the educational attainment of children. Our primary identification strategy relies on interacting different age cohorts with polio morbidity rates in an individual’s birth state. Data on polio morbidity at the state level come from Lavinder et al. (1918), who provide comprehensive information pertaining to state-level polio morbidity covering most of the continental USA. We use the data from Lavinder et al because the US Public Health Reports do not include areas with only a few cases (US Surgeon General and US Public Health Service 1917).

In all specifications, we interact data on polio morbidity with dummy variables for an individual’s age in 1916, since children of different ages may have been differently affected by polio. Given that nearly 90% of children infected by polio before 1919 were under the age of 10, they may have been more likely to experience the effects of polio directly than children of older ages (Nathanson and Kew 2010). On the other hand, children who were 14 and over may have been more likely to leave school for the workforce than children of younger ages. A priori, we expect children under 10 and children ages 14 or older to have their educational attainment impacted at a greater rate than those between the ages of 11 and 13.

We use polio morbidity as our primary data for two reasons. First, complete data on school closures are not available, and state-level data on attendance is not reported for the 1916–17 school year.Footnote 17 Second, polio morbidity data enable us to capture both the direct and indirect impacts of polio on children. Polio may have directly affected school attendance for the children who contracted the disease. Since do we not have information on whether specific individuals contracted polio, we instead rely on the polio case (notification) rate in an individual’s birth state. In addition, the epidemic may have indirectly affected children who missed school either because schools closed, or because fearful parents kept them at home. Schools in areas with higher rates of polio were more likely to close, and parents were more reluctant to send their children to school even when schools were open. Relying on school closure data may fail to capture some of these “avoidance” behaviors. For example, data on average daily attendance for five boroughs in New York City (available only between 1911 and 1917) indicate that average daily school attendance in New York City fell in the 1916–17 school year, as shown in Fig. 4.

We measure 1916 polio epidemic exposure by reported infections at the state-level so that our main identifying assumption is that people who were school age in 1916 would be exposed in their state of birth. This assumption would be violated if children and adolescents migrated between states prior to the 1916 epidemic. While we can only speculate as to the number of children who moved across states during childhood, we do know that in 1940, 77% of people were residing in the same state in which they were born. Further, migration would bias the treatment effect toward zero if migration prior to the epidemic was not systematically correlated with polio morbidity. Using data from the 1920 US Census, we find that the difference between state of birth and state of residence in 1920 is less of a concern and that assigning polio virulence to state of birth is a reasonable proxy for exposure to the 1916 epidemic. In Fig. 5, we present the share of persons who were not observed in their state of birth in the 1920 Decennial Census. On average, 84% of people ages 0 to 21 in 1916 in states that reported polio still resided in their state of birth.

To test whether the epidemic influenced the educational attainment of exposed cohorts, we match a sample of white males born between 1895 and 1916 with the 1916 polio morbidity rate in their state of birth, and the years of education they report having in the 1940 US Census (Ruggles et al. 2019). We restrict the sample to males since they entered the workforce as permanent participants.Footnote 18 We also exclude blacks, since Margo (1986) finds that the highest grade of school reported in the 1940 Census does not accurately represent years of schooling received for African Americans. Table 2 reports summary statistics for the primary sample.

Table 2 Summary statistics

Equation (1) represents the reduced form empirical regression used to study the effects of the 1916 epidemic on educational attainment:

$$\begin{aligned} Y_{i,b,s,a} = \varvec{\alpha _{b}} + \varvec{\kappa _{s}} + \varSigma _{j} \beta _{j}*Polio_{b}*AgeBin_{a,j} + \varvec{X_{b}}*\varvec{\gamma _{a}} + \epsilon _{i,b,s,a} \end{aligned}$$
(1)

\(Y_{i,b,s,a}\) denotes years of completed education for individual i, born in state b, residing in state s in 1940, and born in birth year cohort a. To compare the relationship between polio morbidity in 1916 and educational attainment, we compare the educational attainment of persons born between 1899 and 1916 (who were between the ages of 0 and 17 at the time of the epidemic) to those who were born between 1895 and 1898 (who were between the ages of 18 and 21). This allows us to compare the educational attainment of persons who plausibly could have been in school with those who would have already completed school. In order to test our hypothesis that children of different ages may have been deferentially impacted by the polio epidemic based on their labor market alternatives, we bin them into four age groups: ages 14–17 (born 1899–1902), ages 11–13 (born 1903–1905), and children under age 10 (born 1906–1916). We interact 1916 polio morbidity per 1000 population (\(Polio_{b}\)) in an individual’s birth state with these age-specific cohort groups (\(AgeBin_{a,j}\)). The omitted cohort is individuals aged 18 to 21 (born 1895-1898).

The variables \(\varvec{\alpha _{b}}\), \(\varvec{\kappa _{s}}\), and \(\varvec{\gamma _{a}}\) denote state of birth, state of residence in 1940, and age cohort fixed effects. The identification of polio morbidity’s effect on educational attainment comes from comparing different age cohorts from the same birth state while controlling for current state of residence, and national shocks common across birth cohorts. State of birth fixed effects control for factors common across persons born in the same state, and state of residence fixed effects control for factors that are shared among persons residing in the same 1940 enumeration state. Common shocks shared across birth year cohorts, such as WWI, are controlled for using birth year fixed effects. In the full specification, we also include state-level economic and demographic controls for 1916 by state of birth, and control for schooling laws that applied to each birth year cohort from each state.Footnote 19 These controls are denoted by \(\varvec{X_{b}}\) and include doctors per capita in 1916, education expenditures per capita in 1916, the natural log of manufacturing wages per earner in 1916, and the natural log of population in 1916. Interacting these controls with age cohort fixed effects allows the effect of these state-level characteristics on educational attainment to vary across different age cohorts. These interactions allow the state-level treatment effect of the epidemic to vary across birth year cohorts. The schooling laws, constructed by Lleras-Muney (2002), denote the ages of mandatory school entry, age of school exit and age at which children could obtain work permits for each birth year cohort from each state of birth. These laws proxy for idiosyncratic changes in schooling regulations for each state of birth by birth year cohort.Footnote 20 Finally, \(\epsilon _{i,b,s,a}\) denotes a heteroskedastic error term clustered by state of birth.

6 Empirical results

In our analysis, we run three different regressions, reported in Table 3. Columns (1)–(3) report results from estimating equation (1) with the years of schooling as the dependent variable. Column (1) includes fixed effects for state of residence in 1940, state of birth, and birth year. Column (2) also includes the full set of fixed effects and birth state by age-cohort trends to control for potential underlying geographic trends common across cohorts born in different areas of the country, and column (3) replaces the trend variables with flexible state of birth economic and demographic controls for 1916 interacted with age cohort dummies, and adds schooling law controls. All coefficient results discussed refer to our preferred and most restrictive specification, (3), unless otherwise noted.

Table 3 Effect of 1916 polio notification rate per 1000 on educational attainment of age cohorts

6.1 Evidence from state-level polio morbidity

Results presented in Table 3 confirm our hypothesis that the polio epidemic of 1916 had different effects on educational attainment for children of different ages. School-aged children who were old enough to have labor market alternatives (i.e., those who were between ages 14 and 17 in 1916) and those who were living in areas more affected by the epidemic had lower educational attainment than similarly-aged children living in areas with lower polio morbidity rates. This result is robust even in our models with the most restrictive controls. A one-standard-deviation increase in the polio morbidity rate per 1000 persons (reported as 0.415 in Table 2) results in persons ages 14–17 having around 0.066 fewer years of educational attainment on average. The reference cohort of persons ages 18–21 had, on average, nine years of education. A back-of-the-envelope calculation suggests that this effect is equivalent to every fifteenth individual completing one less year of education. There is also a statistically significant and negative relationship between polio morbidity and educational attainment for children 10 and under. The estimated coefficient suggests that a one-standard-deviation increase in polio morbidity decreases educational attainment by 0.111 years—the equivalent of one in ten people of the cohort completing one less year of education.Footnote 21

In contrast, state-level polio morbidity in 1916 does not have a statistically significant relationship with educational attainment in individuals who were over 10, but too young to legally work in most states. The estimated coefficients on the interaction between polio morbidity and persons ages 11 to 13 in 1916 are negative, although the standard errors are large and they are not precise zeros. Relative to individuals born before 1898, the educational attainment of people born in those years do not appear to be strongly affected by polio morbidity rates.Footnote 22 We test whether our results are robust to using two-year age groupings. Results (reported in the appendix) are robust and confirm our main findings: that students with the ability to work legally, and children younger than 10 experienced reduced educational attainment relative to children in the middle grades.

6.2 Evidence from county level polio morbidity

In addition to state-level data, we have county-level data on polio morbidity from Lavinder et al. (1918) for a select set of states in the eastern USA, as shown in Fig. 6. These county data cover the Northeastern part of the USA, which experienced much higher rates of polio cases and deaths than other regions. While we do not have data on county of birth, we can connect this county-level measure of polio intensity to county of residence in 1940 and restrict the sample to persons residing in the same state they were born in. Essentially, we assume that people living in a particular county in 1940 (as long as it is in their birth state) were also born in this county. If we assume that migration within the USA between 1916 and 1940 is not systematically correlated with polio morbidity in 1916, then migration bias would just introduce classical measurement error in county-level exposure to the 1916 epidemic.Footnote 23

The results using the county-level polio morbidity in Table 4 mirror the results using state-level data. While the estimated coefficients are smaller, the relationship between education and polio is negative and statistically significant for all cohorts ages 0 to 17 in 1916. These results suggest that children living in counties with greater polio cases in 1916, on average, had lower levels of educational attainment than slightly older persons. A one-standard-deviation increase in county-level polio cases reduced the educational attainment of persons born between ages 14 to 17 by 0.07 years relative to the reference cohort born a few years earlier. Similarly, persons aged 11–13 also experienced reduced educational attainment; a one-standard-deviation increase in polio morbidity in this age group reduced educational attainment by 0.08 years. Children under 10 during the polio epidemic had the largest reductions in educational attainment, with a one-standard-deviation increase in polio morbidity associated with a 0.13 year reduction in average educational attainment.

Table 4 Effect of 1916 county polio notification rate per 1000 on educational attainment of age cohorts, 1895–1916

6.3 Robustness

6.3.1 Alternative specification: using age-specific infection rates instead of age bins

We also examine whether our results are robust to two alternative specifications. First, the 1916 polio outbreak primarily affected young children under the age of five. As sanitation improved, epidemics in later decades tended to infect older children more frequently. Using age-specific infection rates for the 1916 polio epidemic from a study by Dauer (1938) in Boston, we substitute our interaction of the polio notification rate variable with age category bins with a new variable created by interacting the polio notification rate for 1916 and the age-specific polio infection rate from Dauer (1938). We assign infection risks of 68% for persons under age four in 1916, 20% for persons ages 5 to 9, 8% for persons ages 10 to 19, and 4% for anyone above the age of 19. In Table 5, we present the main regressions specifications using this alternative measure of polio epidemic exposure. The negative and statistically significant coefficients in specifications (1) and (3) are consistent with our main findings and are larger in magnitude. According to specification (3), for persons aged four and under, an increase in polio cases by one per 1000 people decreased educational attainment by 0.505 years. For persons between ages five and nine, the reduction in educational attainment was 0.06 years, while educational attainment fell by 0.024 years for people aged 10 to 19 during the epidemic.

Table 5 Effect of 1916 polio notification rate per 1000 x age-specific polio infection risk on educational attainment of age cohorts, 1895–1916

6.3.2 Controlling for influenza

It is also plausible that persons exposed to the 1916 polio epidemic were also affected by the 1918 influenza pandemic, which killed over 500,000 people in the USA. The flu came in three waves. The first, with low mortality, came in the spring of 1918. A second wave associated with higher mortality came in fall/winter 1918, with a third wave arriving in early 1919. We include the 1918–19 flu mortality rates from Garrett (2008), the United States Bureau of the Census (1921), and the United States Bureau of the Census (1922) into our analysis, since public health departments were not required to report influenza cases. Table 6 provides state-level data on influenza deaths by state in 1918 and 1919.

Table 6 State level influenza pandemic deaths per 1000, 1918–1919

To control for the intensity of the 1918 influenza epidemic, we include the state-level 1918 influenza mortality rate interacted with age bins in our regression, even though the disease was most virulent and lethal in older populations than the children we focus on (i.e., adults of prime-working age). This exercise addresses potential concerns that the 1916 polio epidemic is spuriously correlated with 1918 influenza intensity.Footnote 24 We study the effects of influenza intensity on observed education in 1940 for a subset of states used in the main analysis (27 of 43).Footnote 25

Results reported in Table 7 show that including the influenza death rate and its interactions with age groups does not affect our finding that children of legal working age in states with greater numbers of polio cases had less educational attainment. It attenuates the statistical significance of the 1916 epidemic for children ages 0 to 10 in 1916 in specifications (1) and (2) but strengthens it in specification (3). The effect of influenza mortality itself on educational attainment is statistically insignificant in specifications (1) and (2). In specification (3), we find influenza mortality increases educational attainment for persons born between 1903 and 1916. This result is consistent with the findings in Parman (2015), who shows that families adjusted educational investments in children in response to reduced human capital caused by prenatal exposure to the 1918 flu: slightly older children with siblings who were prenatally exposed to the flu pandemic experienced increased educational attainment.

Table 7 Effect of 1916 polio and 1918–1919 influenza notification rate per 1000 on educational attainment of age cohorts

6.3.3 Falsification

To further substantiate the main results, we also perform a placebo test comparing men ages 22 to 33 in 1916 to reference cohort of those ages 18 to 21. Results, reported in Table 8, indicate that there is not a consistent pattern between reported polio morbidity and educational attainment. There is a small, negative and statistically significant relationship between polio intensity and the level of education observed for the age 22 to 24 cohort at the 10% level in specification (3). However, this relationship is not consistent across all specifications, suggesting that polio morbidity at the state level is relatively uncorrelated with unobserved factors that could affect observed educational attainment in 1940. In the appendix, we include an additional specification test using two-year birth cohorts, which do not substantively change our results.

Table 8 Placebo test: effect of 1916 polio notification rate per 1000 on educational attainment of age cohorts

6.3.4 Results using female-only sample

Our primary results focus on men because they had more workforce opportunities and may have responded differently to polio than women. Table 9 reports results from our primary specification on a sample of women in 1940. We find that the polio epidemic affected girls under age 10 in a similar way to men. Girls aged 14–17 also reported reduced educational attainment in high-polio areas, although not in our most saturated model. In our male sample, we find that boys aged 11–13 living in areas with higher polio did not have a statistically significant lower educational attainment than boys living in areas less affected by polio, although the standard errors are large and the results are an imprecise zero. For girls aged 11–13, the estimated coefficients are negative and statistically significant. We are not sure why. In general, males and females had different returns to education, and it might have been that girls, for whatever reason, did not return to school in the same numbers as their male counterparts.

Table 9 Women: effect of 1916 polio notification rate Per 1000 on educational attainment of age cohorts

7 Conclusions

The first major polio epidemic in the United States struck in the summer of 1916 and persisted into the fall. With over 23,000 cases of polio diagnosed, the epidemic tested the nascent system of public health departments. Officials engaged in a variety of measures to stem the outbreak, including quarantines, washing streets, and closing public schools. In this paper, we focus on the effect of the epidemic on years of schooling reported by individuals in the 1940 US Census. Educational attainment could have been reduced for infected children, or for children who spent less time in school because their schools were closed or because their parents refused to send them. While we do not have data on school closures, we leverage geographical variation in polio morbidity as a proxy and focus on whether children of differential ages were differently affected. Younger children were more likely to have the virus, and older children may have opted to join the workforce instead of stay in school. Our results show that children of legal working age living in areas with higher rates of polio infection had diminished educational attainment than similarly aged children living in states with lower infection rates. This result, which is robust and consistent across specifications, does not hold for age groups who were not of legal working age, nor does it hold for slightly older children who had already completed their secondary schooling. We find stronger negative educational effects for children who were most susceptible to the virus and who were age 10 and under during the epidemic.

Our results suggest that a one-standard-deviation increase in polio intensity during the 1916 epidemic reduced educational attainment by 0.07 years for the age 14–17 cohort and 0.11 years for the cohort age 10 and younger. Based on the average reported days in the school year, these numbers suggest that annual attendance declined by roughly 11 days in the age 14–17 group and 17 days among children aged 10 and under. These numbers are not trivial; they are within the range of Lleras-Muney’s (2002) finding that compulsory attendance and child labor laws increased educational attainment by about 18 days. Another way of looking at the impact is to look at the aggregate effect in the cohort. For the cohort of children aged 14–17 during the epidemic, these numbers suggest that on average, about 1 out of 15 of them had one year less of educational attainment in their lifetime. The impact is even larger among younger children, with one out of every 10 children attaining one less year of education.

These cohort-wide reductions in educational attainment suggest that public quarantines, school closures, and parental fear magnified the social cost of the 1916 poliomyelitis epidemic, and may shed light on how epidemics can affect even people who do not contract the illness. Clearly, school closures, even for short times, may reduce educational attainment among affected children. Today, with virtual instruction, it is likely that these effects may be smaller. It is also likely that the impact on high-school completion will be mitigated because many jobs require high school diplomas. Nevertheless, greater numbers of students are now enrolled in college. These results suggest that if these students leave school (either because they do not like virtual learning, or have a negative financial shock), they may be less likely to return and suffer from lower overall educational attainment as a result.

Fig. 1
figure 1

Frequency of newspapers in Chronicling America mentioning “school postponement” and polio on the same page, entire U.S.A.

Fig. 2
figure 2

Weekly city-level polio morbidity in 1916 by school start dates from Chronicling America. Subset of cities reporting polio cases to Public Health Service disease surveillance system in 1916

Fig. 3
figure 3

Subset of weekly city-level polio morbidity in 1916 by school start dates from Chronicling America, observations with less that 100 reported cases. Subset of cities reporting polio cases to Public Health Service disease surveillance system in 1916

Fig. 4
figure 4

Average daily attendance, NYC schools 1911 to 1917. Source: Annual report of the superintendent of Schools Board of Education, the City of New York

Fig. 5
figure 5

Mean share of the population observed residing in states other than birth state in 1920

Fig. 6
figure 6

County polio notifications per 1000 in 1916 for select counties, nine quantile bins. Collected from Lavinder et al. (1918)