Introduction

Preterm birth is defined as birth before 37 completed weeks of gestation and is accompanied by a spectrum of morbidities and brain injuries. The risk of preterm birth is increased with multiple gestations, young or advanced maternal age, previous preterm delivery, and maternal and fetal complications.1 Genetic factors may also contribute to preterm birth. A recent genome-wide association study identified variants at six gene loci that were associated with gestational duration and preterm labor.2 Previous studies have shown that preterm infants are at an increased risk of adverse neurodevelopmental outcomes, including poor cognitive, language and motor development, hearing and vision impairments, and brain injury.3,4,5,6,7 Risk factors associated with poor neurodevelopmental outcomes in preterm neonates include low gestational age, low birth weight, small head circumference, bronchopulmonary dysplasia (BPD), retinopathy of prematurity (ROP), infection, frequent invasive procedures, and brain injury.5,7,8,9,10,11,12,13 Brain injury that can occur in the neonatal period includes white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage. These preterm-birth related morbidities may be present at birth or acquired during the neonatal period. While some of these risk factors may have a genetic contribution (e.g., BPD, ROP),14,15 many of these risk factors are acquired insults.

In recent years, there has been an increase in multiple births in Canada, with twin and triplet births accounting for 2.2% of all live births in 1995 and 3.2% in 2012.16,17 This recent increase in multiple births has been attributed to older maternal age at conception, use of ovulation induction agents, and the use of assisted reproductive technology.18,19,20 Multiple gestation is a risk factor for preterm birth resulting in either spontaneous preterm labor or indicated preterm delivery for a maternal or fetal condition.1,21,22 In Canada, 57% of twins and 96% of higher-order multiple births were born preterm.22 Similarly, a large multicentre study in the United States by Gardner and colleagues found that 54% of twins were born preterm compared with 9.6% among singletons. While studies have examined neurodevelopmental outcomes among preterm singletons, longitudinal studies of preterm twins and higher order multiples are limited.23,24,25,26,27 Twin studies offer the advantage of studying the genetic and environmental contribution to a given trait. If monozygotic (MZ) twin pairs have greater agreement for the trait relative to dizygotic (DZ) pairs, this suggests a strong genetic contribution. However, if MZ and DZ twin pairs have similar levels of agreement, then environmental factors may have a stronger influence.

The objective of the present study was to examine the genetic and environmental influences on long-term neurodevelopmental outcomes in preterm MZ and DZ twins. Our specific aims were (1) to assess agreement among preterm twin pairs for perinatal morbidities that are recognized to have a negative impact on neurodevelopment (BPD, ROP, infection, and brain injury) and (2) to examine agreement in motor, cognitive, and language outcomes among MZ and DZ twin pairs and twin pairs concordant or discordant for neurodevelopment at 1.5, 3, and 4.5 years. We hypothesized that preterm MZ twin pairs will have greater agreement for perinatal morbidities and outcomes relative to DZ twin pairs due to their identical genetic makeup.

Methods

Participants

Study participants were selected from a prospective cohort of preterm infants born between 24 and 32 weeks’ gestational age from the tertiary-level neonatal intensive care unit (NICU) at the British Columbia’s Women’s Hospital as described previously.9 Neonates were excluded if they had congenital malformations, congenital infection, or ultrasound evidence of a large parenchymal hemorrhage (>2 cm). This preterm cohort included a total of 234 neonates born between April 2006 and August 2010. The final cohort for the present study comprised 24 MZ twins and 52 DZ twins. Inclusion as a twin in this study was based on placental data. All twins included in the study had placental data available to determine zygosity. There were two neonates in the overall cohort, who were single survivors of a twin gestation, and these two twin pregnancies were not included in our sample as there was only one surviving twin. This study was approved by the University of British Columbia Research Ethics Board. Informed consent was obtained from the parent or legal guardian.

Neonatal clinical data

Prospective medical chart review was conducted by an experienced neonatal research nurse. Clinical risk factors for adverse neurodevelopmental outcome were selected based on the strength of prior associations in this and other cohorts5,7,8,9,10,11,12 and included BPD (defined as requiring oxygen at 36 weeks), ROP, infection, and brain injury (white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage).

Magnetic resonance imaging (MRI) scans

Neonates were scanned with MRI as soon as clinically stable after birth using an MRI-compatible incubator and a specialized neonatal head coil (Advanced Imaging Research, Cleveland, OH) on a Siemens 1.5 T Avanto scanner (Erlangen, Germany). A second MRI scan was acquired at term equivalent age. MRI scans were performed without pharmacological sedation. Of the 225 neonates in this cohort, 186 had two MRI scans, and 39 had only the first MRI scan (there were no neonates with only the second scan). Detailed imaging methods for this study cohort have been previously described.28 The MRI images were reviewed by an experienced neuroradiologist who was blind to the neonate’s medical history. Both the initial and the term-equivalent MRI scans were assessed for the presence of white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage as reported previously; the maximum severity of injury was evident on the early scan and used in the analyses.29,30

Neurodevelopmental assessment

Children were assessed at 1.5 and 3 years with Bayley Scales of Infant and Toddler Development, Third edition (Motor, Cognitive, and Language Composite scores). Outcomes at 4.5 years was assessed with The Movement Assessment Battery for Children and The Wechsler Preschool and Primary Scale of Intelligence-III using Full Scale IQ [FSIQ] as the Cognitive outcome and Verbal IQ as the Language outcome score. Assessments were completed by experienced physiotherapy, occupational therapy, and psychology staff in the Neonatal Follow-up Program at BC Children’s & Women’s Health Centre, who had access to the participants’ clinical data but were blind to the MRI data. Inter-rater reliability measurements for neurodevelopmental outcomes were not obtained.

Data analysis

All statistical analyses were conducted using SPSS Statistics version 25. Clinical data for MZ and DZ twins were compared using unpaired t tests (gestational age, birth weight, head circumference) and chi-squared tests (BPD, ROP, infection, white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage). Shared demographic factors (e.g., gestational age, maternal and paternal education) were compared for twin pairs. Unshared demographic factors (e.g., birth weight, head circumference, BPD, ROP, infection, and brain injury) were compared for all twins, rather than pairs. To estimate the degree of agreement for categorical clinical variables within twin pairs beyond that expected by chance, we calculated Cohen’s Kappa with values of 0 indicating no agreement, 0–0.20 as slight agreement, 0.21–0.40 as fair agreement, 0.41–0.60 as moderate agreement, 0.61–0.80 as substantial agreement, and 0.81–1 as high agreement.31

Percent score difference was calculated for MZ and DZ twin motor, cognitive, and language scores at 1.5, 3, and 4.5 years. For this analysis, twins were assessed as part of a pair and not individually. Twin pairs were classified as concordant or discordant based on their initial motor, cognitive, and language scores at 1.5 years. Discordance was defined as a score difference >2 standard deviation (SD) from the mean percent score difference for the motor, cognitive, or language scores. This cut-off was selected based on our previous data using Bland Altman plots, which identified outliers that were outside the 95% limits of agreement (>2 SD of the mean percent score difference for the group). For subsequent analyses, we defined discordance as a percent score difference >2 SD of the mean percent score difference of the group. Clinical characteristics (gestational age, birth weight, head circumference) and the presence of perinatal morbidities (BPD, ROP, infection, white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage) were compared between concordant and discordant twins using unpaired t tests and chi-squared tests. Mean percent score differences for motor, cognitive, and language scores were generated for discordant and concordant twin groups at 1.5, 3, and 4.5 years.

Results

Clinical characteristics

Clinical characteristics for the study cohort are presented in Table 1 for 24 MZ twins and 52 DZ twins. MZ and DZ twins did not differ in sex, gestational age, birth weight, or head circumference. The presence of perinatal morbidities (BPD, ROP, infection, white matter injury, intraventricular hemorrhage, and cerebellar hemorrhage) was not significantly different between MZ and DZ twins. Assessments at 1.5 years were available for 20 MZ twins and 44 DZ twins (84% of initial sample). Assessments at 3 years were available for 24 MZ twins and 42 DZ twins (87% of initial sample). Assessments at 4.5 years were available for 18 MZ twins and 42 DZ twins (79% of initial sample).

Table 1 Characteristics of the study sample.

Agreement for risk factors among MZ and DZ twin pairs

MZ twin pairs had substantial agreement for the presence or absence of ROP (κ = 0.63) and moderate agreement for BPD (κ = 0.46) and infection (κ = 0.33). MZ twins had moderate agreement for cerebellar hemorrhage (κ = 0.44) and slight agreement for white matter injury (κ = −0.18) and intraventricular hemorrhage (κ = −0.11).

DZ twin pairs had substantial agreement for the presence of ROP (κ = 0.94), moderate agreement for BPD (κ = 0.43), and fair agreement for infection (κ = 0.40). DZ twins had moderate agreement for intraventricular hemorrhage (κ = 0.42), fair agreement for cerebellar hemorrhage (κ = 0.26), and slight agreement for white matter injury (κ = 0.02).

Agreement in neurodevelopmental outcomes over time

Motor, language, and cognitive scores for MZ and DZ twin pairs are listed in Table 2. Both MZ and DZ twin pairs were similar in cognitive and language outcomes. Bayley-III cognitive scores were lower in MZ twins at the 3-year follow-up. For all other outcomes, there were no significant differences in scores between MZ and DZ twins (Table 2). When plotted, agreement in motor outcomes between twin pairs decreased over time as indicated by increasing motor score differences for both MZ and DZ twins (Fig. 1). Agreement in cognitive and language scores remained stable over time for both MZ and DZ twin pairs (Figs. 2 and 3).

Table 2 Neurodevelopmental outcome scores for monozygotic and dizygotic twins.
Fig. 1: Similar neurodevelopmental outcomes for MZ and DZ twin pairs.
figure 1

Differences in motor outcomes increase over time.

Fig. 2: Similar neurodevelopmental outcomes for MZ and DZ twin pairs.
figure 2

Differences in cognitive outcomes stable over time.

Fig. 3: Similar neurodevelopmental outcomes for MZ and DZ twin pairs.
figure 3

Differences in language outcomes stable over time.

Twin pairs discordant in neurodevelopmental outcomes

Twin pairs with percent score differences >2 SD of the mean score difference were classified as discordant (N = 12, 6 pairs). Discordant twins had a significantly lower gestational age, birth weight, and head circumference relative to concordant twins (Table 3). A greater proportion of discordant twins had BPD (χ2 = 8.0, P = 0.01) and ROP (χ2 = 5.9, P = 0.02). Within discordant twin pairs, there were two pairs with a birth weight difference >20%. In terms of brain injury, 50% (N = 6) of all discordant twins had brain injury. Brain injury involved minimal white matter injury (N = 1), severe white matter injury (N = 1), and intraventricular hemorrhage (N = 5). Within discordant twin pairs, there was 1 pair in which only the lower scoring twin had brain injury (severe white matter injury and intraventricular hemorrhage), 2 pairs in which both twins had brain injury (intraventricular hemorrhage), 1 pair in which the higher scoring twin had brain injury (minimal white matter injury and intraventricular hemorrhage), and 2 pairs with no brain injury in either twin.

Table 3 Discordant and concordant twin pairs.

Motor, language, and cognitive scores between discordant and concordant twin pairs are listed in Table 4. Discordant twins had lower cognitive scores at 1.5-year follow-up and lower motor scores at 3-year follow-up relative to concordant twins. There were no significant differences in the other assessments between discordant and concordant twin pairs (Table 4). Agreement in motor outcomes between twin pairs decreased over time for both discordant and concordant twin pairs, as indicated by increasing motor score differences (Fig. 4). Discordant twin pairs had increasing agreement for cognitive and language outcomes as indicated by decreasing score differences over time (Figs. 5 and 6).

Table 4 Neurodevelopmental outcome scores for concordant and discordant twins.
Fig. 4: Neurodevelopmental outcomes in discordant and concordant twins.
figure 4

Differences in motor outcomes increase over time. Discordant = >2 standard deviations from the mean Bayley-III motor percent score difference at 1.5 years.

Fig. 5: Neurodevelopmental outcomes in discordant and concordant twins.
figure 5

Differences in cognitive outcomes decrease over time. Discordant = >2 standard deviations from the mean Bayley-III cognitive percent score difference at 1.5 years.

Fig. 6: Neurodevelopmental outcomes in discordant and concordant twins.
figure 6

Differences in language outcomes decrease over time. Discordant = >2 standard deviations from the mean Bayley-III cognitive percent score difference at 1.5 years.

Discussion

In our cohort of preterm twins, MZ and DZ twin pairs had similar long-term neurodevelopmental outcomes through early childhood. MZ twins did not show greater agreement in outcomes relative to DZ twins suggesting a stronger environmental, rather than genetic, influence on development. Preterm infants may have neurodevelopmental outcomes based on their exposure to early-life insults or environmental stressors rather than their genetic makeup. Differences in the presence of perinatal morbidities such as BPD, ROP, infection, and brain injury may lead to different outcomes in two genetically similar twins. In our study, twin pairs who were discordant in neurodevelopmental outcomes were born earlier with a lower birth weight and higher incidence of BPD and ROP relative to concordant twins. The mechanism underlying differential susceptibility to perinatal morbidities within twin pairs remains unclear. One twin may have had a compromised intrauterine development, difficult extraction, or lower birth weight relative to the other twin. These differences may cause one twin to be more vulnerable to perinatal morbidities such as BPD, ROP, infection, and brain injury, which in turn will lead to differences in neurodevelopmental outcome.

One third of preterm twins in our sample had brain injury detected on their neonatal MRI. It is well established that these perinatal morbidities can have a negative impact on neurodevelopment with worse outcomes being associated with an increasing number of these insults.3 The presence of neonatal brain injury has been shown to be a significant predictor of adverse neurodevelopmental outcome, including poor cognitive, language and motor development and an increased risk of cerebral palsy.6,9,30 When we examined motor outcomes over time, differences in motor scores increased over time for all groups (MZ, DZ, discordant, and concordant). This may be due to motor impairments in children who had sustained a brain injury becoming more apparent with increasing age and abilities.

When discordant twin pairs were followed over time, initial differences were minimized, and pairs become more similar in cognitive and language abilities. The lower scoring twin in the discordant pair appears to catch up, which may reflect the positive impact of early intervention programs or home environment. Developmental surveillance in preterm infants allows for early detection of delays and implementation of allied health services, such as physical therapy, occupational therapy, and speech language pathology. A recent systematic review found that early intervention programs for preterm infants have a positive impact on neurodevelopmental outcomes during infancy with cognitive benefits persisting into preschool age.32 Other environmental factors such as parental education, socioeconomic status, and parenting style have also been shown to be associated with neurodevelopmental outcomes in preterm populations.33,34,35,36 For instance, higher parental education is associated with increased cognitive and language scores at 18 months among preterm children.33 Similarly, a recent study found that cognitive outcomes at preschool age was associated with maternal education, with greater cognitive scores in children whose mothers had a post-secondary degree relative to children whose mothers had lower levels of education.36 Interestingly, this study also reported that the adverse cognitive outcomes associated with perinatal brain injury were mitigated in preterm children born to mothers with higher education.36 Other parenting behaviors such as positive affect, sensitivity, and parent–child synchrony have also been shown to be associated with better cognitive and social development in children born preterm.35 Our findings along with previous research indicates that the initial variability in outcomes among preterm infants is not fixed, and efforts made post-discharge from the NICU can have a substantial impact on long-term outcomes.

The strengths of this study include the high subject retention at follow-up assessments. We also had an adequate sample size of 76 twins for an MRI study, which is comparable to previous MRI studies of this age group.37,38 There are limitations of this study that should be acknowledged. Zygosity was determined based on placental pathology and was not confirmed with genetic testing. We had a small number of discordant twins relative to concordant twins. Neurodevelopment outcomes at all three time points cannot be assessed with the same scales. The Bayley-III subscales may not be directly comparable to the Movement Assessment Battery for Children and The Wechsler Preschool and Primary Scale of Intelligence-III. We recognize that increasing motor score differences may reflect the use of different measures. Our preterm twins were followed only until 4.5 years of age, and any developmental problems that become evident after this age would not have been captured. Although preterm twin pairs usually share similar postnatal home and family environments (e.g., family socioeconomic status, parental education), differences in care within pairs by family members or by teachers at daycare could not be controlled and may have contributed to differences in outcomes. We do not have data on the number of illnesses that twins acquired and differences in illness severity, and number of illnesses may have impacted neurodevelopmental outcomes between twin pairs. Interventions after discharge from the NICU were not directly examined; however, infant development programs and physiotherapy are available province-wide through government services to all developmentally high-risk infants and toddlers.

In our cohort of preterm twins, MZ and DZ twin pairs had similar outcomes through early childhood suggesting that neurodevelopmental outcomes may relate more strongly to environmental rather than genetic factors. Lower gestational age and a higher incidence of perinatal morbidities were associated with discordant development among twin pairs. Over time, discordant pairs become more similar in cognitive and language outcomes, which may reflect the positive impact of early intervention programs or home environment. These findings are encouraging; however further studies are needed to better understand the susceptibility for perinatal illness and factors contributing to discordant development in preterm twins.