Introduction

An estimated 15–30% of infants born with a birth defect will have more than one malformation.1,2,3,4,5 Some of these infants have a syndromic condition with a known genetic, chromosomal, or teratogenic etiology (e.g., trisomy 21, valproic acid) or an identifiable sequence of associated defects attributable to a single primary defect (e.g., Potter sequence). However, an estimated 50% of patients seen in medical genetics clinics do not receive a diagnosis encompassing the co-occurring birth defects.6 While co-occurrence of multiple defects in the same individual is common, the population-level landscape of different patterns of birth defect co-occurrence among infants without a syndromic diagnosis is not well understood. Describing patterns involving two or more defects may help researchers and clinicians better understand which combinations of defects co-occur more frequently than would be expected based on their prevalence in the population. This foundational work may also help to identify groups of cases with similar phenotypes, which could ultimately have implications related to shared etiological mechanisms that could be further investigated.7,8

To date, the characterization of new birth defect patterns (e.g., syndromes) has primarily relied on clinicians’ recognition of phenotypic or genetic patterns within relatively small, clinical populations; however, some recurrent phenotypic patterns may be so uncommon that a single clinical practice will not see sufficient patients to identify those patterns. Therefore, our group has recently developed a software platform for analyzing co-occurrence patterns that occur more frequently than expected in large data resources (i.e., birth defects registries), which could help identify new patterns.9

Our objective was to identify and characterize combinations of defects that co-occurred more frequently than expected in a large, population-based registry. To do so, we used statewide data from the Texas Birth Defects Registry (TBDR). For each combination of two to five co-occurring birth defects, we calculated an adjusted observed/expected ratio to identify the combinations that co-occurred more frequently than expected based on their population prevalence.

Methods

Data from the TBDR, which is managed by the Birth Defects Epidemiology and Surveillance Branch at the Texas Department of State Health Services, was obtained for deliveries to Texas residents occurring between 1999 and 2014. The TBDR conducts active surveillance for birth defects among live births, stillbirths, terminations, and fetal deaths at relevant hospitals, clinics, and birthing centers throughout the state.10 To be included in the registry, infants must have at least one of the monitored structural birth defects or a chromosomal abnormality diagnosed by one year of age. This project was approved by the Institutional Review Board of the Texas Department of State Health Services and the UTHealth Committee for the Protection of Human Subjects.

Birth defects in the TBDR are recorded using modified six-digit British Pediatric Association (BPA) codes.11,12 As our objective was to identify potentially novel syndromes and associations, we excluded infants with a documented syndromic diagnosis. Excluded syndromic diagnoses included chromosomal anomalies (e.g., trisomy 21, 22q11.2 deletion syndrome), syndromes (e.g., Marfan syndrome, Goldenhar syndrome), and sequences/associations (e.g., VACTERL association, Potter sequence) recorded in the abstracted record. We restricted analyses to infants with major structural birth defects based on input from medical geneticists at the TBDR and criteria used to define minor defects in the National Birth Defects Prevention Study.13 To analyze distinct groups of defects, BPA codes were grouped by the first 4 digits, as detailed in the Supplemental Appendix.9 Singleton pregnancies resulting in live births, stillbirths, fetal deaths, or terminations with two or more major birth defects were included in the analyses.

Maternal and infant demographic characteristics were tabulated in order to describe the study cohort. We then used the R-based software platform, Co-Occurring Defect Analysis (CODA),9 to analyze two, three, four, and five-way combinations of defects observed in the TBDR. CODA calculates an adjusted observed-to-expected (O/E) ratio, based on the method developed by Khoury et al.14 Briefly, the adjusted O/E ratio compares the observed prevalence of a birth defect combination to the prevalence that would be expected based on the population prevalence of each defect in the combination. The adjustment method accounts for the tendency of the birth defects in a given combination to co-occur with any other defects. An O/E ratio > 1 indicates that the combination occurs more frequently than would be expected if the defects occurred independently of one another.

After calculating adjusted O/E ratios for all observed two to five-way combinations in the study population, we reviewed the top 5% of patterns of defects with the highest adjusted O/E ratios that had five or more cases observed. Among these, combinations that consisted of only defects within the same system (e.g., two or more eye defects only) or were suggestive of heterotaxy (e.g., heart defect with only lung, spleen, and/or intestinal fixation anomalies)15,16 were tabulated and removed from further consideration. Two medical geneticists (A.E.S. and D.A.S.) reviewed the remaining combinations, including individual-level detailed birth defects descriptions, to assess whether the combinations were suggestive of known syndromes or patterns. Thus, the remaining combinations represented the top patterns of interest, defined as the top 5% of combinations that (1) included defects in multiple systems and (2) were not suggestive of known syndromes or patterns. A Venn diagram was constructed post hoc to visualize organ system overlap among these top patterns of interest (e.g., the number of the top combinations that involved both heart and spine defects). Due to a large number of observed combinations with heart, spine, and rib defects, we also conducted a post hoc tabulation of the full six-digit BPA codes present in these cases to determine if the vertebral defect level was spatially closer to the heart/ribs (i.e., thoracic level vertebral defects, BPA codes: 756.150–756.156) or at a different level among cases with these patterns.

Results

There were 40,771 infants with ≥2 major birth defects recorded in the TBDR delivered between 1999 and 2014 and there were 6,181,631 live births in Texas during the same time period. Maternal and infant demographic characteristics for the analytic sample are shown in Table 1. A majority of mothers of children with ≥2 defects were between 20 and 29 years old (53%). Fifty-one percent of mothers were Hispanic, 40% of mothers had at least some college education, and 50% of mothers were overweight or obese prior to pregnancy. After running CODA to analyze co-occurrence patterns for 138 major birth defects groups, there were 10,595 observed combinations of two to five defects with at least five individuals. We reviewed the 530 patterns with the highest adjusted O/E ratios (i.e., the top 5% of all eligible combinations).

Table 1 Maternal and infant demographic characteristics of infants with two or more major birth defects in the Texas Birth Defects Registry (n = 40,771), 1999–2014.

Of the 530 patterns we evaluated, 352 patterns (66%) were comprised of only two to five defects in the same organ system (Fig. 1). Of these same-system defect combinations, 312 of the combinations involved multiple heart defects. Eye and limb defect combinations were the second and third most frequent same-system patterns observed, with 17 and 9 combinations, respectively.

Fig. 1: Flow chart of our review of results.
figure 1

Summary of the review process for the birth defect combinations with the highest observed/expected ratios (O/E) among non-syndromic cases in the Texas Birth Defects Registry (1999–2014 deliveries).

Of the remaining combinations, which consisted of defects in multiple systems, 94 combinations (18%) were suggestive of heterotaxy (Fig. 1). The remaining 84 combinations were reviewed by two medical geneticists (A.E.S. and D.A.S.), and 57 (68%) of these combinations also included defects that likely reflected known syndromes or patterns. Patterns associated with midline developmental defects affecting the brain, eye, and face (e.g., nose defects, cleft lip) were the most frequently observed (26 patterns) and 50% of the cases with those patterns of defects had holoprosencephaly recorded in the TBDR. Patterns reflective of spina bifida co-occurring with associated anomalies, including limb (e.g., clubfoot), spine, and renal defects, were also frequently observed (9 patterns). Patterns suggestive of OEIS (Omphalocele, Exstrophy, Imperforate anus, and Spinal defects) and Bladder-Exstrophy-Epispadias Complex (4 patterns), VACTERL association (Vertebral defects, Anal atresia, Cardiac defects, TracheoEsophageal fistula, Renal, Limb; 3 patterns) and gastroschisis co-occurring with associated anomalies (gastroschisis with intestinal anomalies; 2 patterns) were also noted.

The remaining 27 top combinations that did not clearly represent an identifiable association or developmental sequence are shown in Table 2. The adjusted O/E ratios for these patterns ranged from 63.0 to 193.8 and the combination with the highest adjusted O/E ratio included atrial septal defects, spinal defects (e.g., vertebral defects), diaphragm defects (e.g., congenital diaphragmatic hernia), and spleen defects. This was one of only two top combinations involving diaphragm or spleen anomalies, with the other being combination 19 in Table 2, which included three of the same defect categories (spine, diaphragm, and spleen) and included five of the same infants, with one additional case.

Table 2 Top defect patterns, ranked by an adjusted observed-to-expected ratio (O/E), in the Texas Birth Defects Registry, 1999–2014.

Over half of the 27 top combinations included in Table 2 (n = 14) comprised one or more heart defects with co-occurring spine defects. For example, pattern 13 included heart and spine defects only, while pattern 16 consisted of heart, lung, and spine defects. In fact, four of the five cases with pattern 16 had scimitar syndrome, a type of partial anomalous pulmonary venous return. The other 12 heart-spine patterns comprised only heart, spine, and rib defects with no other systems involved (patterns 2–4, 6–7, 9, 14-15, 20–22, 26). In post hoc analyses of cases with only heart, spine, and rib defects in the pattern, we looked at the six-digit BPA codes for the 23 unique cases in these patterns to examine vertebral defect level in order to determine if the defect was spatially close to the heart/ribs (thoracic level). All 23 cases had defects of the thoracic vertebrae recorded (more than one vertebral defect could be recorded per case); 10 had thoracic hemivertebrae (43%) and 17 had other anomalies of the thoracic vertebrae (74%).

Ten additional patterns involved combinations with two defects in the following systems: heart, spine, limbs, anal atresia/stenosis, or renal. Considering these and the 14 combinations with heart and spine defects described above, 24 combinations involved two hallmark features of VACTERL (25 with the inclusion of pattern 1), but none included tracheoesophageal fistula, a common hallmark VACTERL feature. We constructed a Venn diagram post hoc to visualize organ system overlap among the five VACTERL component organ systems represented among these results: anomalies of the spine, anal atresia (atresia and stenosis of the large intestine, rectum, and anal canal), heart, renal, and limb defects (Fig. 2). The count displayed in the Venn diagram represents the number of top combinations that included the indicated defects. For example, among the top 27 combinations, heart and spine defects were represented the most frequently (present in 15 patterns), followed by heart and limb defects (present in three patterns).

Fig. 2: Venn diagram of defect system overlap.
figure 2

Diagram of the five VACTERL component organ systems represented in the top 27 birth defects patterns, where the count represents the number of combinations that include the indicated defects (e.g., 15 of the top 27 combinations include heart and spine defects) in the Texas Birth Defects Registry (1999–2014 deliveries).

Finally, one additional combination involved reduction anomalies of the brain, congenital hydrocephalus, anomalies of the spine, and anomalies of ribs and sternum. Our review of individual data did not reveal a consistent pattern among these cases; two of the seven individuals with this pattern of defects had holoprosencephaly recorded in the TBDR, while none of the individuals had spina bifida.

Discussion

We aimed to describe patterns of multiple birth defects observed much more frequently than expected and to identify unique patterns of co-occurring phenotypes, which may indicate a shared etiological basis for the observed pattern of defects. In this population-based analysis of birth defect co-occurrence patterns among infants without a syndromic diagnosis in the TBDR, we identified several interesting patterns, including birth defect combinations that may represent novel patterns.

The pattern with the highest adjusted O/E ratio consisted of heart, vertebral, diaphragmatic, and spleen anomalies. All of the infants with this pattern of defects had a congenital diaphragmatic hernia (CDH). Non-isolated CDH can occur with chromosomal abnormalities (e.g., trisomy 18) and genetic syndromes (e.g., Fryns syndrome, Cornelia de Lange syndrome),17,18,19 however, co-occurring heart, spine, and spleen anomalies are not defining features of these syndromes. While the majority of instances of CDH have no identified etiological basis,19,20 recent genetic studies investigating infants with multiple malformations and CDH have reported genetic variants associated with specific patterns of co-occurring defects. For example, infants with pathogenic variants in MYRF commonly have CDH, heart defects (including scimitar syndrome), and genitourinary abnormalities.21,22 Similarly, HLX has been reported as a candidate gene for a pattern of anomalies that included CDH, short bowel, and asplenia.23 While infants with patterns 1 and 19 had CDH with co-occurring heart and spleen defects, they were phenotypically different from the infants with shared genetic variants reported in these prior publications. More work is needed to better understand whether the co-occurrence of the defects observed in our data could reflect a shared etiological basis.

Another pattern of note involved heart (anomalies of great veins), lung, and vertebral defects (Table 2; pattern 16). Four of the five cases with this pattern of defects had scimitar syndrome, an anomaly involving partial anomalous pulmonary venous return directly to the systemic venous system.24 While lung malformation, specifically hypoplasia, is a component of scimitar syndrome, co-occurring vertebral defects are less common. Masrani et al. examined co-occurring defects in a cohort of 16 scimitar syndrome patients and noted vertebral anomalies in only one patient.25 There may be a shared etiological mechanism in this cluster of cases given the infrequency of co-occurring scimitar syndrome and vertebral defects reported previously. More work is needed to confirm and better understand this possible connection.

Many of the top observed patterns included combinations of heart, spine, and rib anomalies. Heart defects and vertebral anomalies can co-occur in syndromic conditions (e.g., Goldenhar syndrome)26 and have been reported to co-occur among infants without an identified syndrome. An estimated 10–37% of infants with vertebral malformations have cardiac anomalies.27,28,29 Rib defects also commonly co-occur with both. A study looking at co-occurring defects among infants with rib anomalies found that 72% of the infants without a syndromic diagnosis also had a vertebral anomaly and 40% had a heart defect.30 Disruptions of development during vertebral fusion or segmentation, perhaps via mutations in notch signaling pathway genes (DLL3), alterations in genes involved in vertebral development (e.g., PAX1, PAX9), or environmental factors (e.g., maternal hyperglycemia), can result in vertebral and rib malformations.31,32,33 Martínez-Frías described segmentation anomalies of the vertebrae and ribs as developmental field defects arising from failure of segmentation of the preaxial mesoderm at 4–6 weeks of gestation.34 Cardiac structures also arise from the mesoderm and these patterns of defects suggest that a disturbance during early development could contribute to a developmental field defect affecting these systems.35 Given the high rates of co-occurring defects in these systems and heterogeneous presentation among our results, there were likely multiple causes involved, and future work may shed further light on the potential developmental overlap.

Nearly all of the top observed patterns contained two features of the VACTERL association, which represents a non-random group of defects known to co-occur.36,37 VACTERL, a heterogeneous spectrum, is typically only diagnosed when three or more characteristic defects are present and other syndromic causes have been ruled out.36,37,38,39,40 There is some evidence for “caudal” VACTERL patterns, resulting in anal atresia, renal, and lower vertebral defects and “cranial” patterns involving tracheoesophageal fistula, upper vertebral, and preaxial limb defects.38,41 Moreover, some of our results may additionally support the possibility of a spectrum of VACTERL-like combinations. Despite the exclusion of 517 cases with VACTERL recorded in the medical record, 25 patterns in our top results contained two of the component defects. Further, none of the cases in any of our top 27 combinations had tracheoesophageal fistula, which is estimated in some studies to be the most common VACTERL defect, present in up to 80% of VACTERL cases.37,40 It is unclear if some of these two-way combinations of VACTERL components that occurred much more frequently than expected might be etiologically related to VACTERL. Since this was a registry study, no further clinical examination or screening could be completed, and it is possible that additional VACTERL component defects were missed in some of these cases. Regardless, our findings support and reinforce the importance of screening patients with patterns suggestive of VACTERL for additional defects.

As expected, many of the top 530 patterns reflected known patterns. For example, multiple defects within the same organ (e.g., heart or eye) occurred much more frequently than expected. This may be a reflection of the known co-occurrence of defects in the same organ system42 and the likelihood of finding additional defects when one defect is present and additional evaluations of the affected system are undertaken. It may also be a consequence of TBDR procedures that lead to the detailed recording of all reported defects. Midline developmental defects and known associations (e.g., spina bifida with associated defects) were also reflected in patterns with high adjusted O/E ratios. Several of the midline developmental patterns included orofacial clefts; though they are not presented in Table 2, defects co-occurring with orofacial clefts in the TBDR have been previously described.43 Analyses that group infants with birth defects into isolated and non-isolated cases without accounting for these established co-occurrence patterns may classify cases as multimalformed when they may more accurately be classified as isolated for some analyses. In our analyses, certain infants with one primary defect and additional secondary, related defects could have been reclassified as “isolated” cases. For example, an infant with spina bifida, clubfoot, and the tethered cord could be considered as isolated spina bifida. However, classification of isolated-sequence cases should be done after careful review of the recorded defects and their descriptions and after developing objective criteria establishing which defects would be considered secondary for all defects of interest.44 Many birth defects registries do not have resources for a manual clinical review for all cases (e.g., thousands of records) and criteria may vary from study to study. While this individual review and reclassification were beyond the scope of this study looking at all observed patterns for all defects, the large proportion of patterns with same-system defects and defects suggestive of sequences further reinforces the importance of considering the impact of etiologically related defects on the classification of multiple versus isolated cases.

Several factors may have affected our analyses. First, at the time of study onset, data were only available through 2014. While it is not unusual for the availability of registry data to lag for several years following birth due to ascertainment and quality control procedures, inclusion of additional recent cases may have changed our results. Secondly, the TBDR relies on abstracted medical records, including progress notes, genetic testing results, and operative reports, to classify defects and syndromic conditions. Our analyses could not account for potential differences in the type or number of clinical evaluations of the same infant, which may have generated additional recorded defects and more specific diagnoses, and some diagnoses may have been made after the first year of life, including genetic testing undertaken after the first year, or in settings that were not included in the Registry’s active surveillance. As a result, some infants with a syndrome may have been included in analyses of non-syndromic cases. Finally, no standard system exists for grouping defects and analyzing co-occurrence patterns. We grouped defects based on the first four digits of the BPA code, however, other more broad or specific groups could have been analyzed. Because these broad groupings have been given general names (e.g., anomalies of spine), additional details have been provided in the Supplemental Appendix. Differences in ascertainment procedures, coding of defects, and analytic grouping could affect the reproducibility of these results.

Strengths of this study included our utilization of a large, population-based birth defects registry with active surveillance to recognize rare birth defect patterns that could go undetected by clinical observation among individual clinicians, who might be unlikely to see enough cases to make the connection. Using data from a single registry ensured consistent ascertainment and classification protocols. Our approach also assessed patterns occurring much more frequently than would be expected given their observed prevalence in the population and adjusted for the frequency with which defects in a pattern tend to co-occur with other defects.

Conclusions

In this analysis of over 40,000 infants with multiple malformations, we found that the majority of patterns occurring more frequently than expected involved multiple defects within the same organ system and known associations. Nearly all of the top patterns (beyond the same system and those suggestive of known associations) involved organ systems that are components of the VACTERL association, with heart, spine, and rib defect patterns being the most common. Several patterns of potential interest for further investigation were noted and future work could be undertaken to replicate our findings and further assess groups of infants with CDH, heart, spine, and/or spleen defects and scimitar syndrome with vertebral defects.