Introduction

Child physical abuse has detrimental consequences across the life span and is recognized by the World Health Organization as an important target for preventive efforts [1]. A prevailing etiologic explanation of physical child abuse is family stressors overcoming supportive factors [2]. In family stress theory [3], family stress has been connected with chronic illness, both among adults and children [4]. Adult disease has been linked with perceived stress [5], and parental chronic illness has been associated with deficiencies in family cohesion and functioning and increased levels of conflict [6]. Additionally, a dose–response relationship between symptom severity and negative impact on family functioning has been indicated [6]. Parental mental health problems have been found to predict lethal child abuse [7, 8], but parental physical health has shown differing results [9,10,11]. In the only study investigating parental physical health severity, using the parental Charlson Comorbidity Index and published concurrently in this journal, our group found no association with documented physical child abuse [12]. Possibly, this result was due to few high index scores among parents, with only 1.7% scoring ≥ 2, and because heterogeneous categories of diseases were covered. Extending beyond the severe categories in the Charlson index, adult disease has been categorized, based on previous studies [5]. Despite high prevalence [13, 14], diagnoses not attributable to well-defined physical diseases were not included [5]. These diagnoses, comprising functional somatic syndromes [15], known to be associated with stress [16] and adverse life effects [17, 18], and diagnoses defined by their symptoms and not pathology, together represent unspecific somatic symptoms.

In this study, we use categories for physical disease, with the addition of unspecific somatic symptoms [19, 20], to explore the association between parental physical health problems and documented physical child abuse. As parental physical health severity did not show a causal association with the outcome, the current study is exploratory.

Methods

A prospective, observational cohort study was carried out. The data source used is also used in our article [12], published concurrently in this journal. All children living in Denmark between the 1st of January 1997 and the 31st of December 2018 and their legally registered parents were included, choosing this study period because among others the Fertility Database and the Medical Birth Register were available from this date. A parent was defined here as an adult registered as the current legal parent of a child, including unrelated adults in a legal parenting role. Any children without legally registered parents were excluded. Any children who emigrated and re-entered Denmark were censored at time of emigration to avoid immortality bias. Background data were drawn from other registries (see Table 1).

Table 1 List of variables, their study definitions, sources, and levels

Exposure

See Table 2 for an overview. Parental disease was categorized mainly according to Prior et al. [5]. To ensure optimal coverage in the dataset, diagnoses not categorized by Prior et al. were added to existing categories (see below). The sorting algorithm was inspired by other studies [19, 20]. The full algorithm, and the code for data analysis, is described in the pre-registration for this study [42].

Table 2 Disease clusters and categories, modified from Prior et al. [5]

In short, all ICD 10 diagnoses were extracted from the National Patient Register; accidents, infections, and procedures were left out, and all remaining diagnoses, except psychiatric diagnoses (F-diagnoses), were sorted into conventionally defined diagnoses and unspecific symptoms. The conventionally defined diagnoses were then filtered according to Prior et al. [5] but excluding the category HIV/AIDS as our focus did not include infections. Finally, a category of “Other diagnoses” was added, comprising all diagnoses that did not fit within the well-defined disease categories or the category of unspecific symptoms, were not F-diagnoses, and were not excluded as described above. This resulted in 384 diagnosis sorting rules in addition to Prior et al., made by the first author, who is a trained medical doctor. To ease the reproducibility of this study, the final classification of 11,866 diagnostic codes is available in the Online Resource 1. For some categories, additional information on redeemed prescriptions was used, inspired by Prior et al. [5]. This captured a number of diseases treated in the primary sector. Information on redeemed prescriptions was obtained from the Register of Medicinal Products Statistics. For all disease categories, a 3-year burn-in period before the study was established. Health information for parents were read in from the first of January 1994 and onward, and, for example, parents with a code for hypertension during the period 1994–1997 were already classified in this category at study entry. All categories were updated throughout the study period on a monthly basis. To ensure precise categorization, some prescriptions used in the original categorization by Prior et al. were left out. For example, Anatomical Therapeutic Chemical (ATC) code C02, which was used by Prior et al. to categorize hypertension, is also registered for use in attention-deficit hyperactivity disorder and prostate hyperplasia. Prior et al. included this code because of its most prevalent use (A. Prior, personal communications, 2020), but we have excluded it for greater specificity.

Outcome

The outcome was documented physical child abuse. All hospital and police codes that indicated violence against a child, including lethal violence, were combined into a dichotomous variable indicating the first incidence of abuse. Details are available in our description of the coding algorithm [42].

Covariates

Covariates were chosen based on the literature (see Table 1 for references and data sources). Parental psychiatric diseases were extended from Prior’s categories to also include personality disorder [7, 50].

Statistical methods

We used pseudoobservations [51] to estimate relative risks (RR) adjusted for covariates, with 95% confidence intervals (CI) and Bonferroni-corrected 95% confidence intervals (CIc) [52, 53]. A child with at least one parent in a specific disease category was identified as exposed and the index date set at parental time of diagnosis or birth date of the child, whichever came last. Then, for each child exposed, five children who were not exposed at the index date were drawn from the underlying source population, matching on date of inclusion (a child is identified as exposed and within 3 months of calendar time, five other children not exposed are drawn), reconstituted family (living with biological parents, living with one or more unrelated adults, adopted or in foster care), child ages/birth year (matched within 1 year), number of children in the family (either one child, two children, three to five children, or six or more children), and mean parental age (within 5 years). Thus, individual cohorts were created for all disease categories studied. Each family was followed until end of study period, first incident of documented physical child abuse, emigration, the child being without registered parents, dying or reaching their 18th birthday, whichever came first, allowing for at least 1 month and at most < 18 years between exposure and outcome. Children experiencing the outcome before the exposure were excluded. Each child could participate in more than one cohort, but not twice in the same cohort, and not after first incident of documented physical child abuse. A full model, containing pre-specified variables, and a parsimonious model, reduced to avoid collinearity and small cells, were produced for all analyses. To avoid computational overload, a maximum of 100,000 exposed children were allowed in each analysis, drawn randomly among those available. Only rows with information on all variables were included. The parental disease categories were not adjusted among each other. Pseudoobservations assume that censoring in the dataset is marginally independent. As this was not the case for the calendar time groups, the dataset was stratified on this variable and pseudovalues generated within these strata. The same child could be included both as exposed and un-exposed; Eicker-Huber-White standard errors were used to compensate for this. Siblings could be included, and therefore clustering was adjusted for on a family level. In all models, death not related to abuse was treated as a competing risk. Analyses were presented with and without Bonferroni corrections, correcting for tests of 33 categories.

Results

Samples, demographics, and events

Exposed and unexposed children were drawn from 2,705,770 children available in the dataset. First-time documented physical child abuse was experienced by 70,892 children during the study period, and lethal child abuse was experienced by 111 children. Table 3 describes the full population’s distribution across covariates; this table is derived from the table included in our article [12], published concurrently in this journal. Table 4 describes the risk time, events, number of children studied, and number and percentage of children exposed in the full cohort. Most disease categories had less than 10% exposed during the study period, except unspecific symptoms (72%), allergy (23%), painful condition (11%), and other diagnoses (69%) (Table 4).

Table 3 Source population characteristics at entry in the source population
Table 4 Follow-up time, events, and number of children in each model

Estimates of relative risk

Table 5 describes the results for each disease category, including a specification of covariates used for each model. After Bonferroni-adjustment, we found the following disease categories to be statistically significantly associated with documented physical child abuse: ischemic heart disease, RR 1.44 (CIc 1.13–1.84); peripheral artery occlusive disease, RR 1.39 (CIc 1.01–1.90); stroke, RR 1.19 (1.01–1.41); chronic pulmonary disease, RR 1.33 (CIc 1.18–1.51); ulcer/chronic gastritis, RR 1.23 (CIc 1.00–1.51); painful condition, 1.17 (CIc 1.00–1.37); epilepsy, RR 1.27 (CIc 1.03–1.56); unspecific symptoms within the last 2 years and ever, RR 1.31 (CIc 1.14–1.50) and RR 1.33 (CIc 1.17–1.52), respectively; and other diagnoses, RR 1.23 (CIc 1.09–1.39). Diabetes mellitus and chronic liver disease showed increased risks, but these were not statistically significant after Bonferroni correction.

Table 5 Model results for all disease categories (mark significant results with bold)

Discussion

In models adjusted for known risk factors for physical child abuse, the majority of disease categories explored were found to have no statistically significant association to documented physical child abuse. Ischemic heart disease, peripheral artery occlusive disease, stroke, chronic pulmonary disease, ulcer/chronic gastritis, painful condition, epilepsy, unspecific symptoms, and the broad category of “other diagnoses” showed varying degrees of associations. Diabetes mellitus and chronic liver disease showed increased risks, but results were not statistically significant after Bonferroni correction. Some findings of no association were expected based on our study on parental physical disease severity using the same dataset [12], published concurrently in this journal.

Motivating this study, we assumed that parental physical health influence family stress levels. As referenced earlier, this is indicated among parental unspecific symptoms [6] and functional somatic syndromes [16,17,18]. For many categories of disease, the specific impact on family stress levels is unknown, although there are studies on family functioning and its components. A previous study found no association between hypertension and family cohesion [54]. In diabetes, the association is influenced by treatment outcome. Among adults with non-insulin–dependent diabetes mellitus, good glycemic control was associated with lower family cohesion compared to those with lower quality glycemic regulation, while this relationship was reversed among adults with insulin-dependent diabetes mellitus [55]. Among chronic obstructive pulmonary disease patients, family functioning was found to be comparatively better among patients treated with oxygen in their home than hospitalized patients [56]. Lower family cohesion was found among individuals diagnosed with epilepsy compared to controls [57]. In a study of cancer, no differences were found in family functioning between patients and controls [58]. Headache symptom severity has been shown to adversely influence family cohesion [59]. Only cautious interpretations should be derived from single studies in each category. Nonetheless, it seems likely that disease categories’ influence on family functioning is not uniform, sometimes counterintuitive, and mediated by other factors, including treatment. Through this lens, the differing associations across categories in our results could be an expression of differences in how disease categories influence family stress.

The disease categories found to be associated with documented physical child abuse are all either chronic conditions or with considerable chronic subgroups. Consequently, part of the associations found could be an expression of the underlying chronicity of these categories. Nonetheless, a number of other disease categories representing chronic conditions show no association, possibly underlining the varying influences by different categories discussed above.

Residual confounding may also have influenced our findings. For example, adjusting for only two levels of education could result in strata with quite heterogeneous populations, and there may therefore be residual bias in the models. In addition, the model does not adjust for all possible risk factors. For example, the association with ischemic heart disease, peripheral artery occlusive disease, stroke, and chronic pulmonary disease may be affected by smoking. Smoking is a risk factor for all these conditions and has been associated with both childhood maltreatment among smokers [60] and as a risk factor for physical child abuse [29]. This is supported by the association before Bonferroni correction of diabetes mellitus, which is also associated with smoking. Thus, diseases resulting from smoking could be proxies for residual confounding from parental childhood maltreatment. Similarly, stomach ulcer/chronic gastritis is associated with substance abuse, which is also both a risk factor for [8] and a possible result of child maltreatment [61]. This is supported by the association of chronic liver disease without Bonferroni correction, also associated with substance abuse.

Epilepsy has been associated with depression and difficulties in emotional regulation [62], which may contribute to its influence on family cohesion. Affective disorders in parents have been linked to lethal physical child abuse [7, 8]. Another possible link could be that the risk of epilepsy is increased after trauma [63]. Thus, for a subgroup of parents with posttraumatic epilepsy, parental experiences of childhood maltreatment, which is a risk factor for maltreatment of offspring, could be proxied by adult epilepsy.

Chronic pain and childhood maltreatment have been previously studied, but Marin et al. in 2021 in a systematic review considered current evidence inconclusive [64]. The painful condition category in our study is based on the use of prescription analgesics at least four times within the last year. This category stands out by being defined by analgesics alone and thus possibly overlaps with other disease categories. Importantly, some diagnoses in the category unspecific symptoms describe manifestations of pain, and there might be a substantial overlap between the painful condition and unspecific symptoms categories.

Among unspecific symptoms, functional somatic syndromes have been shown to increase with childhood maltreatment [65]. This has also been shown for unspecific somatic complaints and childhood physical abuse, again invoking the possibility of residual confounding [66].

Unspecific symptoms stand out if their prevalence is taken into account. A better understanding of the individuals in the unspecific symptoms category, which encompassed 72% of the entire dataset, could potentially provide targets for preventive measures both to target populations and to inspire components for blended interventions. The current results do not warrant targeted interventions in their own right, but may inspire to future interventions such as integrating parental physical symptom checklists into existing home visiting programs or parenting programs or introducing tools to support parents in coping with their symptoms. However, further research is needed to study the associations found here and further specify their causal and/or predictive nature.

The association with the category “Other diagnoses,” diagnoses that do not fit with the remaining categories, is intriguing, but difficult to interpret due to their heterogeneous nature. Splitting this category into subcategories such as rare endocrine disorders, hereditary disorders, or similar may provide further insights.

Each disease category was not adjusted for other categories. This was inspired by our findings [12] as discussed above. Adjusting for other disease categories might consequently introduce more complexity to our models without a clear theoretical justification. Nonetheless, it is possible that some combinations, perhaps combinations of the categories found to be associated with documented physical child abuse, could have an additive or even multiplicative effect on the risk of abuse. This remains to be studied. Also, some diseases have common causes and may also be in each others’ causal pathways, for example, diabetes and heart disease [67]. This could explain the relatedness in our study of, for example, ischemic heart disease and stroke, and assumptions on the relations between categories should be remembered when designing future studies to elaborate on our results.

Strengths and weaknesses

To the best of our knowledge, this is the first study to describe a diverse set of disease categories in parents and their link to child risk of documented physical child abuse. The longitudinal nature of the data and size of the sample enable discovery of relevant candidates for further scrutiny in predictive and causal models. The categories applied have been utilized previously, and additional categorization was done based on clinical insight. The models were adjusted for a number of known risk factors, showing the relevance of the associations even after taking current knowledge into account. There are also some potential weaknesses. The categories used were not validated, although a number of diagnoses in the registries used have been validated [48]. Although the assignment of ICD 10 codes in the Danish healthcare system is done by trained health professionals, the rate of irregularities for many diagnoses is not known. Such misclassification would be expected to be independent of the outcome and hence would bias toward no association. As mentioned above, the model may contain residual bias, from unobserved or insufficiently observed variables. This is because it is based on registry data; for example, substance abuse is based on hospital diagnoses only. Thus, as argued above, the associations seen could be hypothesized to be based on residual confounding. On the contrary, this is useful in its own right for predictions. If, for example, stomach ulcer/chronic gastritis captures a part of substance abuse that is not otherwise visible through either registries or clinical observation, this category is useful for predictive modeling. Additional categorizations of diagnoses were only done by the first author. Although the categorizations were checked rigorously and backed up by clinical experience, a lack of validation might lead to unexpected associations. As noted for Table 5, the category hearing problems has an unexplained behavior in the time trend of number of diagnoses. Consequently, results from this category should be interpreted with special caution. Number of missing entries at entry in source population on neighborhood resources and reconstituted family were 20% and 23%, respectively. All cohorts were drawn using only complete cases, and there is a risk of bias if the risk of missing data is associated with differences in the underlying population, possibly affecting generalizability of the results. We used data sets drawn from all children living in Denmark up until 2018 in this study, and in some categories, we used all available children exposed to a parental disease category (for details, see Table 4). Nonetheless, a population correction factor was not used to correct the confidence intervals presented. This was because we regarded our population as a sub-population of all children living in settings with universal healthcare and because no analyses used data on all available non-exposed children. Also, some disease categories may represent a subset of families with parents in these categories, both because a maximum of 100,000 exposed families were allowed in each analysis and because some disease categories, for example, hypertension, may be underdiagnosed in patient registries not including the primary sector. Consequently, our analyses never represented the full population of interest. If the population of interest was limited to Danish children only and the variation in remaining non-exposed children assumed to be negligible, categories expected to be fully diagnosed within the registries available to us could be presented with more precise confidence intervals. However, as we find the estimated strength of associations presented as insignificant in our analyses to be moderate or less (see relative risks in Table 5), the usefulness of such categories for preventive and predictive purposes would be expected to be limited. Finally, as mentioned above, this study took place in a high-income country [68] with universal healthcare coverage. Results are likely to generalize to similar populations but would require replication elsewhere, in particular in low-income countries and healthcare systems that are organized differently. Additionally, most self-reported cases of maltreatment are not known to the sources used for the outcome [69]. Consequently, results may not generalize to the cases of physical child abuse unknown to healthcare or law enforcement.

Conclusion

A number of diverse disease categories were exploratively tested for an association with documented physical child abuse. A notable candidate for future studies was unspecific symptoms, presenting both a significant prevalence and an association with the outcome. Further candidates were ischemic heart disease, peripheral artery occlusive disease, stroke, chronic pulmonary disease, stomach ulcer/chronic gastritis, painful condition, and epilepsy. Further research into these is warranted and may inspire additions to preventive interventions.