Introduction

The medial (or sternal) clavicular epiphysis matures relatively slowly. In the human skeleton, it is generally the last long bone epiphysis to fuse. Its developmental stage is therefore useful when estimating age at death in the post-pubertal period [1] or the age of living persons involved in criminal proceedings [13, 15]. In the Netherlands, a radiological clavicle examination may be applied to verify the age of young asylum seekers without valid identification documents.

Various authors have studied clavicle development to determine the age interval in which fusion of the clavicular epiphyses occurs. Most of the published studies were based on the examination of dry bone specimens [3, 6, 9, 10, 17, 18]. When dealing with living individuals, examiners analyzed either X-rays [4, 13] or computed tomography (CT) scans [5, 15, 16]. In a number of studies, the results for different ethnic groups were evaluated or the results for males were compared with those for females. In most studies, such comparisons were, however, hampered by a relatively small sample size.

In this study, we aim to assess from a large sample whether or not variables such as sex, socioeconomic status, and ethnicity affect the probability of having completely fused clavicles at a certain age. We further aim to compare results from X-ray and CT scan examinations with those from studies based on dry bone specimens. To do so, we have combined data from different studies and performed a binomial logistic regression analysis. We then illustrate how the resulting model can be applied to estimate the probability of completely fused clavicles at a given age.

Data

Published and unpublished data on the individual subjects, taken from various studies on clavicle development [36, 8, 10, 13, 15, 18] were combined. For each individual, information on the calendar age in years was combined with data on the fusion status of the clavicle, i.e., exhibiting a complete bony fusion of epiphysis and diaphysis (i.e., mature) or not. When data for both left and right clavicles were published, we used the side for which most records were provided. Data on the sex and ethnicity of the individuals were copied when available. In the context of this study, we defined ethnicity as being either of European (Caucasian), Asian, or Afro-American descent. It was further recorded which method was used for determining the fusion status, i.e., using dry bone specimens, X-rays, or CT scans.

The socioeconomic status of individuals was estimated from the human development index (HDI) associated with the country in which the study took place. The HDI is a summary composite index that measures a country’s average achievements in three basic aspects of human development: longevity, knowledge, and the standard of living [19]. The HDI is used by the United Nations Development Programme, and data are available from 1975 onward. To accommodate data from two American studies that took place before this year [6, 18], we extrapolated the HDI by first calculating the linear regression of the HDI on time.

In total, data of 3,552 individuals aged 15–33 years were combined. Figure 1 shows the age distribution within the sample. Table 1 provides further information on the number of individuals per category.

Fig. 1
figure 1

Age distribution of individuals in the sample

Table 1 Number of individuals per category

Statistical analysis

Analysis of the data was performed using the SPSS (version 10.0) software package for statistical analysis. We carried out a binomial logistic regression analysis to assess the effect of various variables on the probability of mature clavicles. In logistic regression, the dependent variable is transformed into a logit variable, i.e., the natural log of the odds of the dependent occurring or not. This transformation ensures that the estimated probabilities are between 0 and 1. A logit model is a form of the generalized linear model.

Age, sex, ethnicity, and socioeconomic status were considered as potential predictor variables for the probability of mature clavicles. The method of clavicle examination was further considered as a factor influencing the probability of being diagnosed with mature clavicles. Age and socioeconomic status (i.e., HDI) were analyzed as covariates. Sex, ethnicity, and the method of clavicle examination were analyzed as fixed factors. Possible interaction between these fixed factors and the covariate age was also investigated. The initial (exploratory) logit model may be described by the following equation:

$${\text{ln}}{\left[ {p \mathord{\left/ {\vphantom {p {{\left( {{\text{1 - }}p} \right)}}}} \right. \kern-\nulldelimiterspace} {{\left( {{\text{1 - }}p} \right)}}} \right]} = \alpha {\text{ }} + {\text{ }}\beta _{{{\text{age}}}} {\text{ $ \times $ age }} + {\text{ }}\beta _{{{\text{HDI}}}} {\text{ $ \times $ HDI }} + {\text{ }}\gamma _{{{\text{sex}}}} {\text{ }} + {\text{ }}\gamma _{{{\text{ethnicity}}}} {\text{ }} + {\text{ }}\gamma _{{{\text{method}}}} {\text{ }} + {\text{ }}\beta _{{{\text{sex $ \times $ age}}}} {\text{ $ \times $ age}} + \beta _{{{\text{ethnicity}} * {\text{age}}}} {\text{ $ \times $ age}} + \beta _{{{\text{method}} * {\text{age}}}} {\text{ $ \times $ }}age$$

where p denotes the probability of the dependent, α the intercept, β the estimated effect of a covariate or factor-by-covariate interaction term on the logit of the probability, and γ the estimated effect of a fixed factor.

Variables to be kept in the model were selected by means of backward elimination using the Wald test. Insignificant interaction terms were successively dropped. After each alteration to the exploratory model, possible significance of the remaining variables was reassessed. A p value lower than 0.05 was considered to be statistically significant.

Not all studies that were used as a source for our data provided individual data on sex or ethnicity. Consequently, the described model (M1) could be tested on the data of a subgroup of 2,002 individuals only. No data from CT scans were available for this subgroup. To be able to include data from all available studies, we therefore also tested simplified models. For the first of these simplified models (M2), we omitted the variable ethnicity and the associated interaction term from the initial model. This model could be fitted to the data of a subgroup of 3,256 individuals. For this subgroup, data resulting from the evaluation of CT scans were drawn from a single study [15], as the second available study [5] did not provide details on the sex of individuals. As a next step, we therefore further omitted the variable sex and the associated interaction term from the initial model. The resulting model (M3) could be fitted to the data of all 3,552 individuals. As a final step, we assessed the significance of the change in −2 log likelihood between the resulting model and a nested model including the main terms only. We further compare the Nagelkerke pseudo R 2. As such, we compare the difference in model fit.

Results

Table 2 provides the results for the three different models. Model M1 resulted in a p value higher than 0.05 for ethnicity after all non-significant interaction terms had been removed. Tested for a subgroup of 2,002 individuals, ethnicity therefore did not appear to affect the probability of having mature clavicles. Including this variable in the model does not improve the model fit. Exploring model M1 therefore effectively resulted in model M2, which could be fitted to the data of 3,256 individuals.

Table 2 Estimated parameters of logistic regression models for the logit of the probability of mature clavicles

From model M2, it appeared that age, the HDI (i.e., socioeconomic status), sex, and the method of clavicle examination all significantly affected the probability of having completely fused clavicles. Exploring possible interaction between sex and age did not indicate a significant effect. The interaction term method × age, however, yielded a p value of 0.000. The subsequent assessment of the significance of the difference in −2 log likelihood between the resulting model and a nested model omitting the interaction term yielded a p value of 0.000. The Nagelkerke pseudo R 2 reduced from 0.815 to 0.795 after omission of the interaction term. All indicates a better model fit when the interaction term is included.

Model M3, which could be fitted to all available data on clavicle fusion, resulted in comparable model parameters for the dummy variable for CT scans. Confounding by an observer effect is therefore reduced.

Application of the model

The most appropriate model to predict the probability of being diagnosed with mature clavicles, i.e., providing the best model fit to the currently available data, appeared to be model M2. The resulting model—i.e., after the removal of non-significant main and interaction terms and using the parameters for covariates and the intercept—may be described by:

$${\text{ln}}{\left[ {p \mathord{\left/ {\vphantom {p {{\left( {1 - p} \right)}}}} \right. \kern-\nulldelimiterspace} {{\left( {1 - p} \right)}}} \right]} = {\text{ }} - {\text{18}}{\text{.638 }} + {\text{ 0}}{\text{.629 $ \times $ age }} + {\text{ 2}}{\text{.746 $ \times $ HDI }} + {\text{ }}\gamma _{{{\text{sex}}}} {\text{ }} + {\text{ }}\gamma _{{{\text{method}}}} {\text{ }} + {\text{ }}\beta _{{{\text{method $ \times $ age}}}} {\text{ $ \times $ age}}{\text{.}}$$

Parameters for the appropriate dummies for the categorical variables may be found in Table 2. From this equation, it follows that the predicted probability may be calculated using the following equation:

$${p{\left( {\operatorname{age} } \right)} = e^{ \wedge } {\left( { - 18.638 + 0.629{\text{ $ \times $ }}\operatorname{age} + {\text{ }}2.746{\text{ $ \times $ }}HDI{\text{ }} + {\text{ }}\gamma _{{sex}} + {\text{ }}\gamma _{{method}} {\text{ }} + {\text{ }}\beta _{{\operatorname{method} {\text{ $ \times $ }}\operatorname{age} }} {\text{ $ \times $ }}age} \right)}} \mathord{\left/ {\vphantom {{p{\left( {\operatorname{age} } \right)} = e^{ \wedge } {\left( { - 18.638 + 0.629{\text{ $ \times $ }}\operatorname{age} + {\text{ }}2.746{\text{ $ \times $ }}HDI{\text{ }} + {\text{ }}\gamma _{{sex}} + {\text{ }}\gamma _{{method}} {\text{ }} + {\text{ }}\beta _{{\operatorname{method} {\text{ $ \times $ }}\operatorname{age} }} {\text{ $ \times $ }}age} \right)}} {\left( {1 + e\,^{ \wedge } } \right.\,{\left( { - 18.638{\text{ }} + {\text{ }}0.629{\text{ $ \times $ }}age{\text{ }} + {\text{ }}2.746{\text{ $ \times $ }}HDI{\text{ }} + {\text{ }}\gamma _{{sex}} {\text{ }} + {\text{ }}\gamma _{{method}} {\text{ }} + {\text{ }}\beta _{{method{\text{ $ \times $ }}age}} {\text{ $ \times $ }}\operatorname{age} } \right)}}}} \right. \kern-\nulldelimiterspace} {\left( {1 + e\,^{ \wedge } } \right.\,{\left( { - 18.638{\text{ }} + {\text{ }}0.629{\text{ $ \times $ }}age{\text{ }} + {\text{ }}2.746{\text{ $ \times $ }}HDI{\text{ }} + {\text{ }}\gamma _{{sex}} {\text{ }} + {\text{ }}\gamma _{{method}} {\text{ }} + {\text{ }}\beta _{{method{\text{ $ \times $ }}age}} {\text{ $ \times $ }}\operatorname{age} } \right)}}.$$

Based on the currently available data on clavicle fusion, we can now predict the probability of having mature clavicles for different individuals. Here, we provide a few examples.

The predicted probability of mature clavicles is 0.016 (1.6%) for a contemporary 19-year-old German male (HDI is 0.932). For a 20- or 21-year-old German male, this probability increases to 0.029 (2.9%) and 0.054 (5.4%), respectively. For females, the predicted probability values are increased to 0.023 (2.3%), 0.042 (4.2%), and 0.076 (7.6%), respectively. However, if the individuals have a lower socioeconomic status, these values are decreased. For contemporary 19-, 20-, and 21-year-old males from India (HDI is 0.611), for instance, they are 0.007 (0.7%), 0.012 (1.2%), and 0.023 (2.3%), respectively. When diagnosed from X-ray or CT scan, the values increase. For the 21-year-old Indian male, it would be 0.040 (4%) if diagnosed from an X-ray and 0.076 (7.6%) if diagnosed from a CT scan (composed using a slice thickness that is far from optimal).

In Figs. 2 and 3, we show the differences in predicted probabilities at different ages between the various categories. In Fig. 2, we compare the predicted probability values for a German male with that for a German female and an Indian male. Figure 3 illustrates the difference in predicted probability for a German male depending on whether dry bone specimens, X-rays, or CT scans are examined.

Fig. 2
figure 2

Predicted probability of having completely fused clavicles: a comparison between German males, German females and Indian males

Fig. 3
figure 3

Predicted probability of being diagnosed with mature clavicles: a comparison between examination by means of dry bone specimens, X-rays and CT scans

Discussion

Ethnicity did not appear to significantly influence the probability of having completely fused clavicles, which is in line with the results of Schmeling et al. [12]. We should, however, keep in mind that the current database is not well balanced. In addition, a small number of individuals taken from McKern and Stewart [6] were misclassified in our analysis (see footnote Table 1). Additional data from future studies may contribute to the knowledge on the possible effect of ethnicity, if not strengthen our conclusion that there is no significant effect.

The effect of sex on the probability of having completely fused clavicles appeared significant. The estimated model parameter for the dummy variable for females being greater than zero indicates a positive effect on the logit of the probability. The predicted probability of mature clavicles is therefore greater for females than for males of similar ages. This would point to a generally earlier onset of fusion in females. A similar disparity has been reported for epiphysial closure of all other long bones [11]. There was no indication of significant interaction between sex and age. We may therefore assume similarly shaped probability curves for males and females, indicating an equal number of years during which complete fusion may generally be achieved.

A higher HDI, and therefore a higher socioeconomic status, appears to increase the probability of having mature clavicles. This would mean that under less preferable socioeconomic conditions, the process of fusion is generally delayed. This is in concordance with conclusions drawn by Schmeling et al. [12, 14], and was summarized before by Eveleth and Tanner [2].

The method of clavicle examination appeared to also be significant. The significant interaction term method × age indicates that we should assume a difference in the estimated effect of age depending on the method applied. For the age interval in which mature clavicles have been observed, the predicted probability of being diagnosed with mature clavicles is greater when X-rays or CT scans are used instead of dry bone specimens. It is noteworthy that the difference in the predicted probability is initially very small, but becomes increasingly greater with increasing age during a limited age interval. When keeping in mind that the curve for the predicted probability will be s-shaped, one may image that the difference is again only minimal for the older individuals.

A number of theories can be postulated to explain a difference in the predicted probabilities between methods of examination.

Inappropriate slice thickness

Regarding the results from CT scans, one could postulate that the slice thickness affected the diagnosis. Mühler et al. [7] concluded that reconstructions composed with a slice thickness of 1 mm should be used. The vast majority of the analyzed data were, however, obtained from reconstructions with a greater slice thickness (up to 8 mm) [5, 15]. One can foresee that an almost completely fused clavicle may be diagnosed as being mature when details are lost due to the greater slice thickness. Schulze et al. [16]—from whom no data were included in our analysis, as no individual data were provided—observed two 19-year-olds with completely fused clavicles. Both observations were done from reconstructions with a slice thickness of 7 mm (Schulze, personal communication). We therefore think there is a realistic possibility of these cases having been misdiagnosed.

Persistent small grooves or notches between diaphysis and epiphysis

When comparing the results from their study based on CT scans with two studies based on dry bone specimens [6, 9], Kreitner et al. [5] hypothesized that persistent small grooves or notches seen in dry bone specimens may be classified as partial fusions of the epiphysis with the clavicular shaft. This would reduce the probability of dry bone specimens being classified as completely fused.

Reduced visibility of the epiphyseal disc on X-rays

X-rays used in the published studies have not been made with the purpose of examining the clavicles. Possibly, this has hindered a correct classification of the clavicle at times.

Interobserver variability

As the sample included results from two studies based on X-rays and two studies based on CT scans, we hold it unlikely that what we perceive to be an effect of the applied method is in fact an observer effect.

Expectation bias

In some X-rays, the (nearly fused) epiphyseal disc will have been more difficult to judge than in others. In such cases, knowing the actual age of the individuals could have affected the interpretation of the observation. Clavicles of relatively young individuals would then more often be classified as “not completely fused”, while those of relatively old individuals would be classified as “completely fused” more often. We know, however, that in at least two out of four studies, one based on X-rays [13] and one on CT scans [15], the examiner was not aware of the age of the subjects when classifying the images. We therefore do not believe that expectation bias could have played a role of any great importance.

Statistical artifact

In theory, it may be possible that due to an unfortunate coincidence, there were no relatively late maturing individuals examined by X-ray or CT scan.

In theory, all of these aspects could have contributed to the significant effect of the variable method. However, if significance of the interaction between method and age is not explained as a statistical artifact, only expectation bias would have resulted in a significant change in the shape of the probability curve. In case of misclassifications of nearly fused clavicles or differences in the classification system among studies, we would have expected a higher number of individuals with completely fused clavicles in the relatively young subjects, even below the age of 20.

If we, however, were to conclude that significance of the interaction term is in fact a statistical artifact, the term should be removed from the model, and model parameters should be recalculated. It is our ambition to investigate the matter further by comparing high quality CT scans, expected to allow a correct diagnosis, with X-rays taken from the same person.

Conclusions

The probability of having mature clavicles at a certain age is affected by the sex and socioeconomic status of the individual. The probability of being diagnosed with mature clavicles further appears affected by the method of clavicle examination. Predicted probabilities from the model may be used to illustrate acceptance or rejection of individual age claims.

Please note that the odds on having mature clavicles given a certain age should not be confused with the (posterior) odds of having reached a certain age given that the clavicles have matured. These probabilities are only equal if we assume a prior odds of 1.