Introduction

Tuberculosis (TB) is strongly related to social exclusion and inequality1. Although most TB cases in high-income countries are in migrants from high incidence countries, cases in the native-born population are often concentrated in vulnerable deprived and marginalised groups2. Poverty, lower education, reduced access to good healthcare, and poor living standards increase the risk of TB as well as deprivation-related behaviours harmful to health3, such as alcohol and controlled-drug misuse4 and tobacco smoking5. However, TB control strategies have often emphasised medical technologies (including preventive treatment and vaccination, early and better diagnosis, effective treatment, and management of co-morbidities), with less focus on tailoring strategies to the socioeconomic circumstances conducive of higher disease burden6.

The World Health Organization (WHO) 2015 strategy to eliminate TB by 2035 recognises this target cannot be achieved without also prioritising the social determinants of TB7, including in low-TB incidence countries, which are closer to TB elimination8. The pathways through which poverty affects TB risk have however received limited attention in these settings. Most studies have been ecological, comparing rates in geographic areas with different socio-economic levels9. Most individual-level studies in the setting have focused on either contrasting patients’ characteristics by groups (e.g. native vs foreign born, age groups, and drug sensitive vs drug resistant), or comparing the association between these social risk factors and treatment outcome, rather than disease risk2.

In England, social deprivation appears to contribute significantly to TB risk. Over 20% of notified UK-born TB cases report at least one of homelessness, drug misuse, alcohol abuse, or history of prison stay, compared to 10% in foreign-born cases10. Subjects with such social risk factors are younger (about 60% aged 15–44 years), more likely to be from White ethnic background, and present more commonly with pulmonary TB, the most communicable disease form10. Recurrent TB, drug-resistance and poor treatment outcomes are also more frequent in this group10. However, these social risk factors tend to cluster in individuals, therefore providing an additional challenge to disentangling their respective effects. A better understanding of the relative contribution of these social determinants of health to the risk of TB in the native White population could help tailor TB control efforts9.

We report on a case-control study investigating the associations between individual socio-economic status (SES) and a range of social determinants of health (including drug misuse, tobacco smoking, alcohol and homelessness), and the risk of TB in UK-born young adults of White ethnic background.

Methods

Study design, setting and participants

We used data from a community-based case-control study conducted between 2012 and 2014 across England measuring the duration of school-aged BCG (Bacillus Calmette-Guerin vaccine) protection against tuberculosis in England11.

Cases were subjects with a first episode of TB diagnosed at age 23 to 38 years between 2003 and 2012 notified to the UK Enhanced Tuberculosis Surveillance system (ETS). Controls were those never diagnosed with TB, frequency-matched to cases on sex and birth-year cohort (±2 years), with a ratio of approximately two controls per case. Controls were randomly sampled from the same communities as cases based on cluster-sampling with probability proportional to the population size of small areas, then simple random sampling of residential addresses from the UK Postcode Address File (Fig. 1). Cases with known HIV infection and those living in state institutions (e.g. prisons) were excluded. Data on social factors were collected as part of a study to measure the durability of BCG vaccine effectiveness 10–30 years post school aged-vaccination in the UK (median age of 13 years), hence the restriction to the 23–38 years age-group; details are reported elsewhere11.

Figure 1
figure 1

Summary of sampling strategy for cases and controls across England. 1MSOA = Mid-level Super-Output areas; contiguous LSOAs, constrained by the 2003 English local authority boundaries, average population 7,200 inhabitants. 2LSOA = Lower level Super-Output Areas (LSOAs); designed by the Office for National Statistics to have a socially homogeneous population with an average 1500 residents in each area. The census Output Areas are the smallest statistical enumeration level in England, and the Super Output Areas are the smallest grouping of contiguous output areas used to generate official statistics.

Ethics

Ethical approval was obtained from the UK National Health Services Research Ethics Committee (NHS-REC 11/H1102/11), and the London School of Hygiene and Tropical Medicine (LSHTM) Research Ethics Committee (Approval 5996). All participants provided written informed consent. All study methods were performed in accordance with the relevant guidelines and regulations.

Variables

Information was collected by interview on socio-demographic characteristics, a number of lifestyle variables and other TB risk factors. These included age, sex, residential postcode, and highest educational level attained. Residential postcodes linked to the Office for National Statistics data allowed their categorisation into quintiles of small-area indices of multiple deprivation (IMD).

Other potential TB risk factors included history of ≥12 weeks homelessness, prison stay (in the UK or abroad), stays of ≥3 months duration in high TB incidence regions (Africa and Asia), and BCG vaccination. To minimise information bias, subjects self-entered and electronically locked tobacco, alcohol and drug use, history of homelessness, and prison stay information on computer-assisted software, thus inaccessible to the interviewer.

Statistical analysis

Socio-economic variables were grouped into distal (educational level, residence based on area-level deprivation and household overcrowding) and intermediate social factors (tobacco smoking, alcohol drinking, drug misuse, history of homelessness and prison stay). We posited a causal framework of the relationship between these variables and the TB risk, as well as to each other, based on background knowledge represented in a directed acyclic graph (DAG), to inform the statistical models. The matching variables (birth cohort, sex) and other TB determinants (BCG vaccination and long stays in high TB-incidence) were considered a-priori confounders and included in all adjusted models.

We assumed that educational level affects employment opportunities and income in later life, and through these, area-level deprivation at the place of residence and household overcrowding level. The educational level and socio-economic status may also affect risk of tuberculosis via other pathways not measured in this study. However, we assumed that education level has a dual effect on the intermediate social determinants, by itself and via its effect on socio-economic status (Supplementary Fig. 1). This assumption allows control for indirect paths between the intermediate variables and TB that do not include education level, which could otherwise have been missed.

The distribution of variables was compared in cases and controls. Characteristics of study participants with data missing for at least one variable were compared to those with complete data. Correlations between related variables were also examined, notably between lifestyle variables (use of tobacco, alcohol and controlled drug), and between history of prison and homelessness.

Two groups of logistic regression models were built using Stata 14, respectively measuring the overall association between education level, then that of each intermediate factor in the DAG, and TB. Baseline models (controlling for birth-cohort and sex) and adjusted models (controlling for other confounders and appropriate variables from the DAG) were fitted. For the overall association between educational level the fully-adjusted logistic regression model controlled for birth-cohort, age and sex, BCG status and stays in high TB regions. For the separate investigation of other SDHs and TB risk, the fully adjusted model (based on the DAG) included all a-priori confounders, education level, as well as all other variables in the DAG. The variables Class B/C drug (e.g. Cannabis, Amphetamines, Benzodiazepines, Qat, Glue, Solvents, Speed or other amphetamines etc.) misuse and Class A drug (e.g. Ecstasy, Cocaine, Crack Cocaine, Heroin, LSD, Psychedelics (e.g. “magic” mushrooms)), misuse were strongly correlated, preventing their simultaneous inclusion in the model. These were therefore measured in turn in separate models.

The final models were based on observations with complete data for all variables. Significance testing and tests for trend for ordered categorical variables in which level-specific estimates suggested a trend were done using likelihood ratio tests.

We conducted sensitivity analyses using multiple imputation by chained equations (MICE) to impute missing data. MICE models included all variables in the fully-adjusted model, and case/ control status. Twenty datasets were imputed, repeating the complete-case regression models and estimates combined using Rubin’s rules.

For key modifiable risk factors, we estimated the population attributable fraction (PAF) for our study’s target population, using the formula PAF = ∑pi(aORi − 1)/aORi, where pi is the proportion of cases with exposure level i for a specific risk factor, and aORi is the adjusted odds ratio for level i of that risk factor12.

Results

Overview of study sample and characteristics

There was 9% more female control than cases, but comparable distributions by birth cohorts (Table 1). Complete data were available for 88% (1618/1864) of participants, with missing data mostly seen for alcohol drinking (3%), tobacco smoking (3%) and education level (3%). Over 40% of cases lived in the most deprived quintile of residence, and 20% reported less than 13 years of formal education compared to 5% of controls. About 25% of cases smoked daily with lifetime consumption >10 pack-years, and 18% reported alcohol drinking above 14 units/week, compared with 13% and 11% respectively in controls. Ten percent of cases reported using injectable class A drugs compared to 1% of controls. Using class B/C drugs was strongly correlated with using class A drugs (Spearman correlation coefficient = 0.76). Social risk factors clustering was more common in cases than controls (e.g. 15% of cases reported at least two of class A drug use, history of homelessness or prison stay, compared to 3% of controls – Supplementary Fig. 2).

Table 1 Characteristics of Study Participants, England.

Social determinants and risk of tuberculosis

The odds of TB in subjects with education level <13 years duration was nearly four times higher than in those with degree-level education but without a clear gradient among those with 13 years of education of more. The associations were little changed after adjusting for age, sex, BCG vaccination status and long stays in high TB areas, and after multiple imputation of missing data (Table 2).

Table 2 Association Between Education Level and Tuberculosis in White UK-born Subjects Aged 23–38 Years, England.

A strong association between area-level deprivation and TB was seen in subjects living in the most deprived fourth and fifth quintiles areas, respectively, compared to those living in the least deprived quintile, after adjustment for confounding ([aOR=1.74; 95%CI = 1.16, 2.59] and [aOR=3.30; 95%CI = 2.23, 4.88] respectively) (Table 3).

Table 3 Association Between Intermediate Social Determinants of Health and Tuberculosis in White UK-born Subjects Aged 23–38 Years, Englanda.

The strongest risk factor among the intermediate social determinants was misuse of class A injectable drugs, with five times higher TB odds (aOR = 5.67; 95%CI = 2·68, 11.98) compared to those who never misused class A drugs. The TB risk was also 50% higher in those who reported using either non-injectable class A or class B/C drugs within the past 10 years. There was strong evidence (P = 0.008) of a dose-response association between level of tobacco smoking and TB, compared to never-smokers, but no clear association between alcohol drinking levels and TB. History of homelessness of up to 12 weeks and >12 weeks were respectively associated with 66% (aOR = 1.66; 95%CI = 0.99, 2.79) and double (aOR = 2.01; 95%CI = 1·11, 3.63) higher TB odds.

BCG vaccination on average halved the risk of TB in those vaccinated 10 to 30 years earlier (aOR = 0.51; 95%CI = 0.38, 0.68), and travel for 3 months or more in high-TB burden parts of the world was associated with more than a doubling in the risk of disease.

Population attributable fractions

While the association between TB and tobacco smoking was not strong, avoiding smoking could reduce TB cases in our target population by 18% (Table 4) because of the high prevalence of smoking (16% controls smoked >10 pack-year daily - Table 1). Similarly, preventing Class-A drugs misuse could prevent 15% TB cases, 8% by stopping injectable Class-A drug abuse. BCG vaccination may have helped prevent about 12% cases and eliminating homelessness could reduce 7.6% of TB notifications. The respective impact of these risk factors is not assumed additive; the joint PAF for any combination of risk factors is not the sum of their respective PAFs.

Table 4 Estimates of Tuberculosis Population Attributable Fraction (PAF) for Specific Risk Factors in White UK-born Subjects Aged 23–38 Years, England.

Missing data and sensitivity analysis

The 226 subjects with data missing for at least one variable (12% overall, 15% cases and 10% controls, P = 0.003) were slightly less educated, more deprived, heavier tobacco smokers and alcohol drinkers and more likely to have a history of homelessness, compared to those 1638 (88%) with complete data (Supplementary Table 1). The results from statistical analyses after multiple imputation of missing data were similar to those of the complete case analyses (Table 3).

Discussion

Declines in TB incidence among UK-born in England since 2012 varied by ethnicity. In the White ethnic group TB rates have remained relatively stable10. Social deprivation may explain in part these relatively stagnant rates, with TB rates in White UK-born in the most deprived areas more than 4 times higher than least deprived areas, and a higher proportion of reporting at least one social risk factor compared to other ethnic groups10. We have, to our knowledge, made the first attempt at a causal framework analysis of how the association between overall socio-economic status (SES) and TB in the native population from a high-income low-TB burden setting may be mediated by intermediate social factors. We found that amongst UK-born White young adults in England, the risk of TB was four times higher in those with less than 13 years education than those with A-level and above. This association may be explained by the link between low educational level and small-area level deprivation, tobacco smoking, misuse/abuse of controlled drugs, and homelessness, and in turn their effect on TB risk.

In low-incidence settings, there is data showing that poverty and lower SES are associated with greater risk of Mycobacterium tuberculosis infection9, as well as delay to diagnosis and treatment13, and poorer treatment outcome14; but there is limited data measuring the association of individual SES to the risk of active TB. The direct comparison of our findings to other studies is complicated by variations in individual SES measures used across settings. We used highest educational attainment, which is assessed to be a good indicator of SES, because it is highly correlated with parental SES, as well as a strong predictor of future employment and earnings, marking both early life influences and adult SES15. Our findings that the risk of TB is nearly four-fold higher in individuals with lower education levels is consistent with the few published studies that have measured the association between individual-level SES and TB. In their case-control study of risk factors of TB in adults in Washington in 1988–90, Buskin et al. used family income, years of education and housing conditions to create a composite binary SES variable16; they found that the risk of TB in the lowest SES category was about four times higher than that of the highest SES group. A case-control study in Greenland, in 2004–06, using occupation as the proxy-measure for SES, noted that TB risk four times greater in those unemployed than in those in work or in studies17.

The association between tobacco smoking and higher TB risk is consistent with results from two UK case-control studies in which current smokers were found to be on average 60% more likely to develop TB than non-smokers18. Similar results were also reported from a USA study19, and systematic reviews20. It is not surprising that lower SES increases the risk of TB through tobacco smoking. Studies have consistently found higher prevalence of tobacco smoking and younger age at smoking initiation in subjects from lower SES5,21. Tobacco-related loss of mucosal immunity in the respiratory tract may increase the risk of TB infection in smokers; and impair both innate and adaptive immune responses leading to higher risk of progression to disease22,23. The dose-effect relationship between smoking and TB risk in our study further supports a direct effect of tobacco smoking on TB risk.

Subjects who admitted to using either of class C, B or non-injectable class A drugs in the past 10 years were more likely to develop TB, and the risk in those reporting injectable class A drugs use was much higher, than non-drug users. Reports from 1971, before the HIV pandemic, indicate that the prevalence and incidence of TB among drug users in Harlem, New York, were respectively 9 and 10 times higher than that of its general population24, similar to our unadjusted estimates; in that study, tuberculin skin test positivity rates were similar in drug-users and the general population, suggesting that drug use may increase the risk of active TB after infection rather than only Mycobacterium tuberculosis infection24. Studies from the USA25 and the UK26 have also found that smear positive pulmonary TB is twice more common in drug-users than other TB forms, and diagnosis delays are more frequent, contributing to more intensive transmission in this population. Recent TB infection is in itself a strong TB risk factor, with nearly 5–10% risk of TB in the 2–5 years following infection27. HIV co-morbidity and under-nutrition also likely contribute to the higher risk in those using injectable drugs.

We found an independent association between history of homelessness and higher TB risk, after accounting for the other TB risk factors, including tobacco smoking, harmful drug use and alcohol drinking. This can be explained by other TB determinants not explicitly included in our analyses, for instance poorer nutritional status.

The absence of an association between alcohol drinking and TB in our study was consistent with the results of the sole previous UK study in which this was measured28. Subjects who reported regularly drinking on average 14 units or more per week, were only about 50% more likely to have TB than non-drinkers, but alcohol was no longer associated with TB after adjusting for other social determinants. In contrast, the pooled relative risk from a previous systematic review suggests nearly three times more TB among heavy drinkers29. The difference to our results likely reflects duration of exposure; our study population is younger (23 to 38 years), hence less likely to have been exposed to long-term harmful drinking, whereas studies reporting stronger association included older adults, and/or subjects with recorded diagnosis of alcohol abuse29.

Household overcrowding and history of prison stay were both strongly associated with TB, although the associations were weaker after adjusting for other variables, especially tobacco smoking, drug use and homelessness. Overcrowding increases the risk of TB exposure and infection30, and the higher prevalence of latent tuberculosis infection among prisoners has been noted in several surveys31.

Our results are relatively conservative. We included social factors and behaviours perceived as undesirable and typically under-reported; therefore, the associations are more likely to be underestimates. Reasonable efforts were made to include all eligible cases and comparable controls, including for example, attempts to contact eligible TB cases with known contact details at shelters or temporary accommodations11. There are limitations to the data presented, however. The retrospective ascertainment of exposure may have led to some misclassification, although most likely non-differential between cases and controls, again leading to underestimation rather than overestimation of the associations measured. Lower success in enrolling TB cases in the most deprived areas11 may have further contributed to underestimating the association between lower SES and TB. Finally, a limitation of using DAGs in our analyses is the requirement of relatively simplistic causal assumptions to describe complex relationships between the various social determinants investigated. For example, drug-users are more likely to go to prison, but prisoners can develop drug addiction during their prison-stay32.

Overall, our results provide insights into some pathways through which social deprivation affects the risk of TB in the native population of a low-incidence high-income country. TB rates have relatively stagnated in the UK-born population for nearly a quarter of a century in spite of the scaling up of control efforts in recent years; new threats have also emerged, including multi-drug resistant TB strains. The WHO global TB strategy recognises additional actions to address the underlying social determinants of TB are needed to complement current TB control and prevention tools for elimination8. The potential impact of interventions like tobacco cessation programmes and accessible substance misuse services on TB rates, as well as the primary prevention of tobacco, alcohol and drug misuse, should be additional reason to allocate resources, besides their other health benefits. Attributable fraction estimates suggest tobacco smoking cessation in white UK-born young adults may help reduce about a fifth TB cases in the target population, while drug addiction prevention could help avert nearly 15% TB. A UK-based qualitative study had highlighted the need to integrate care across a number of social and health services to address the complex needs of TB patients to achieve better treatment outcome33. It is encouraging that the Collaborative TB control strategy for England 2015–202034 which is currently being updated, has given more pre-eminence to addressing social determinants of TB, including for example recommendations on closer collaboration between local authorities and health services to address socio-economic risk factors of TB, dedicated TB services for homeless and those attending substance misuse services, fast-track access to social care and accommodation for homeless for the duration of their treatment, continuity of TB care between prison and the community, and the systematic monitoring of social determinants of TB34. This study provides further arguments in support of such integrated approaches to reduce the disease risk.