Introduction

Forensic age assessment encompasses a wide range of approaches and methods [1,2,3,4,5,6]. It is currently an extensively researched domain within forensic sciences [7,8,9,10,11,12,13,14,15,16].

Age assessment essentially involves deducing the chronological age of an individual based on observable traits. The data collected from the individual are subsequently compared with reference data from the literature, enabling an inference about the chronological age of the individual. The relevance of reliable reference data in forensic age assessment procedures is apparent [2, 17,18,19].

A diverse array of trait systems, encompassing aspects of sexual maturity, skeletal development, and genetic traits, is documented in the literature for the purpose of forensic age assessment [6, 20,21,22,23]. These traits can be either developmental, such as specific stages of skeletal development and dental development, or degenerative, such as age-related genome degeneration or degenerative dental traits [24,25,26,27,28,29]. The notable strength of developmental traits resides in their primary genetic determination [30,31,32]. In contrast, degenerative traits are often more dependent on external influences and the transition from age-associated trait to pathological alteration is gradual. However, degenerative traits must be accessed when development is finished.

Forensic age assessment can be fundamentally categorized into two distinct applications. Firstly, in cases involving unidentified deceased individuals, where it is necessary to determine the age at the time of death, forensic age assessment can assist in elucidating the circumstances of death and in facilitating identification [33,34,35,36,37,38]. The methods applicable in these cases are broadly defined, allowing, for instance, invasive procedures as the extraction and preservation of teeth or osteological examinations of bones. Considerations related to radiation protection are not required in the case of deceased individuals.

The second major application field of forensic age assessment procedures is age assessment in living individuals [39, 40]. In instances where the age of a person is unknown or legitimate doubts exist regarding the provided age information, forensic age assessment can contribute to determining the individual’s age and providing legal certainty [39]. It is essential to note that the limitations for such procedures are considerably greater than for those on deceased individuals. For instance, considerations related to radiation protection must be taken into account [41]. Furthermore, the extraction of teeth for the purpose of forensic age assessment is precluded.

In addition to the broad categorizations of age assessment procedures in age assessment based on developmental or degenerative characteristics and its application in deceased or living individuals, another crucial aspect of forensic age estimation can be derived from the underlying aim of the process. On the one hand, forensic age assessment can be used to estimate the most probable chronological age of a person [41]. The assessment of this most probable age is possible with an acceptable margin of uncertainty. This uncertainty usually spans a few years across all methods. However, the most probable age of a person can only be estimated as long as the underlying trait is not completely developed [42].

On the other hand, the exceeding of forensically relevant age limits can be verified. The term “minimal age concept” has become established for this purpose [41]. For this approach, the minimum age that was found for the corresponding trait expression, e.g. a certain stage of development, in the relevant reference population is crucial. If several trait systems are examined, the highest minimum age found is the deciding age according to current recommendations. For expert reports in forensic age assessment, it is useful to state the minimal estimated legal age and the most probable estimated age of the individual under assessment [41].

The present study focuses on the dental developmental trait of eruption of mandibular third molars in panoramic dental radiographs (PANs). This feature represents a well-examined trait of dental development. The fundamental suitability of this trait for forensic age assessment is substantiated by strong evidence [43,44,45,46,47,48].

The trait can be employed for forensic age assessment in both living and deceased individuals, with the limitation residing in the practical execution of a PAN in the case of a deceased person.

Moreover, the trait is suitable for determining both the most probable chronological age of an individual and for surpassing specific age thresholds with the highest degree of certainty.

Various staging scales have been introduced for the assessment of mandibular third molar eruption in PAN [47, 49,50,51].

The aim of the study was to collect reference data for the trait of mandibular third molar eruption in PAN in a Chinese population. The research question, in particular, was whether examination of mandibular third molar eruption in PAN is suitable, following the minimal age principle, to demonstrate the attainment of 16 and 18 years of age in Chinese people.

Material and method

The PANs used in this study were collected from the Department of Oral Radiology of the Stomatological Hospital of Xi’an Jiaotong University, Xi’an, Shaanxi, China. Importantly, all the PANs utilized in this study were originally acquired for medical purposes. The sample of PANs for this study was randomly selected for retrospective, blinded evaluation, stratified by each year of age between 15 and 25 years, which is in a line with comparable publications on the subject [47].

Age was defined in a way that the age group “15 years” includes, for example, people aged between 15.00 and 15.99 years. Leap years were taken into account for the calculation of age. The formula for calculating age was: (Date of X-ray - Date of Birth) / 365.25. For inclusion in the study, age of the participants at the time of the X-ray examination had to be known beyond doubt.

The first step was to collect the X-ray images. The selection criteria were the appropriate age and sex and the radiographic detectability of at least one mandibular third molar. The specific exclusion criteria for the individual teeth were then applied as part of the evaluations. Exclusion criteria involved inadequate image quality (e.g., artifact-related issues or misalignments in the X-ray machine resulting in image distortions), excessive retention of teeth, and pathological alterations in the teeth or jaw. The assessment of retentions adhered to established clinical criteria, such as an angle exceeding 30 degrees in the mesio-distal direction, which served as an exclusion criterion [52, 53]. Specifically, all PANs indicating pathologies such as bone fractures, cysts, carious lesions on the third molars, dental restorations on the third molars, or orthodontic appliances on the third molars were also excluded. Multiple PANs from the same individual were not permitted for inclusion.

The evaluations were performed according to the classification by Olze et al. (2012) [49]:

  1. A

    coverage of the occlusal surface with alveolar bone.

  1. B

    alveolar emerge, complete resorption of the alveolar bone over the occlusal surface.

  1. C

    at least half the crown length of the second molar has been reached, the occlusal plane has not been reached.

  1. D

    complete emerge in the occlusal plane.

Radiographs were examined using synedra Personal View software version 22.0.0 (synedra information technologies GmbH, Innsbruck, Austria) at appropriate workstations. The setup and ambient conditions were consistent for both examiners. During the evaluations, the software’s magnification tool and Gray-Level adjustment tool were utilized. The examiners, two board-certified dentists, possessed comprehensive familiarity with the methodology from prior studies [54]. The radiographs were independently evaluated by both examiners. Following the evaluations by both examiners, consensus was achieved through subsequent arbitration between the two examiners in cases where different stages had been determined.

Data management and statistical analyses were conducted using Stata, version 13.0 (Stata Corp LP, College Station, Texas, USA). Mean value, median, upper and lower quartile as well as the minimum and maximum age were determined for each stage.

Results

A total of 917 PANs were included in the study, comprising 430 females and 487 males, with ages ranging from 15.00 (male) to 25.99 (male) years. The images were acquired between January 2016 and April 2021. Table 1 illustrates the composition of the study population stratified by age and biological sex.

Table 1 Age and sex distribution of the study population

Based on the number of radiographs collected, a total of 1834 teeth could potentially have been analyzed. After strict application of the exclusion criteria, 1230 teeth (males: 630, females: 600), could finally be considered (Table 2). These 1230 teeth ultimately form the basis for the calculation of the reference data. The number of teeth examined for each third molar position in each year of age by sex ranged from n = 21 (tooth 48, age group 19 years, males and females) to n = 37 (tooth 38, age group 17 years, females) (Table 2).

Table 2 Number of teeth evaluated by age and sex

Mean age at stage increased with stages for both sexes. For the median, this does not apply for males at tooth 38 [FDI] in the transition from stage C to D.

Among the females, the minimum age at stage D for both teeth was 15.64y, provided by the same individual (Tables 3 and 4). The minimum age at stage D for both teeth came from the same individual at 16.05y (Tables 5 and 6). Thus, the minimum ages for stage D, which correspond to a completed eruption, were well below 18 years of age for both teeth and both sexes.

Table 3 Descriptive measures for each stage for tooth 38 [FDI] in females. Age in years, rounded to second decimal place. SD: standard deviation. LQ: lower quartile. UQ: upper quartile
Table 4 Descriptive measures for each stage for tooth 48 [FDI] in females. Age in years, rounded to second decimal place. SD: standard deviation. LQ: lower quartile. UQ: upper quartile
Table 5 Descriptive measures for each stage for tooth 38 [FDI] in males. Age in years, rounded to second decimal place. SD: standard deviation. LQ: lower quartile. UQ: upper quartile
Table 6 Descriptive measures for each stage for tooth 48 [FDI] in males. Age in years, rounded to second decimal place. SD: standard deviation. LQ: lower quartile. UQ: upper quartile

The mean and median age for females in stage B and higher was above 18 years. For males, this only applied to tooth 48. For tooth 38, the mean in stage B was above 18 years, but the median was below this at 17.67 years.

The minimum age of 15.00 years for males in stages A and B for both teeth represented the lower age limit of the study sample. For females, the minimum age for all stages in both teeth were close to 15 years.

It was also noteworthy that the maximum age for all stages, including stage A, was over 24.00 years for both teeth and both sexes.

Individuals per stage ranged from 31 cases (stage B, teeth 38 and 48, males) to 160 cases (stage D, tooth 48, males). Stage D was the most frequently detected in all teeth and both sexes, largely reflecting the age distribution of the sample (Tables 3, 4, 5 and 6). Females had the greater proportion of earlier staging with stage A being detected over 90 times for both teeth (Tables 3, 4, 5 and 6).

Discussion

The aim of generating current reference data from a northern Chinese population for eruption of mandibular third molars in PANs was achieved, and we expect that our study will contribute to a better understanding of the distribution of age within the trait and its legal relevance in the relevant population.

For northern Chinese people, few reference data are available for determining mandibular third molar eruption in PAN for forensic age assessment. In 2014, Guo et al. conducted a study on this topic [44]. They examined 1135 PANs from 506 males and 629 females aged 11–26 years. It should be emphasized that the images came from the same institution as those used for our study. Guo et al. examined X-ray images taken in 2012 and 2013 [44].

Table 7 includes the descriptive measures for stages from Guo et al. [44]. Looking at Table 7, it is evident that the values of Guo et al. for stages A and B are considerably lower than the current values of our study. Only the values for stage D, which represents a complete eruption, are around the age of 22 in both studies and are therefore quite close. It is also noteworthy that Guo et al. achieved significantly lower values for the standard deviation, except for stage D (Table 7).

Table 7 Comparison of the studies investigating eruption of mandibular third molars using the Olze et al. method in Asian populations and the present study. SD: standard deviation. LQ: lower quartile. UQ: upper quartile. - : No information given. Values for stage C in italics because the staging scale for stage C in the present study is not the same. Values rounded to one decimal place

As one conclusion of their study, Guo et al. 2014 stated that assessing the eruption of mandibular third molars in the PAN is suitable for detecting the completion of 16 years of age [44]. This was based on a binary logistic regression analysis. For females, they found a sensitivity of 100%. For males, this value was 98.9% (tooth 38) or 99.6% (tooth 48). Based on this data, it is evident that the minimum age in stage D was also below 16 years in the study by Guo et al. However, there is no specific information on this in the corresponding publication from 2014. With regard to our results, this needs to be assessed against the background of the minimum age concept [41]. Since the minimum age of females in stage D in our study was below 16 years, the result of Guo et al. for females cannot be confirmed. Assessing the eruption of mandibular third molars in the PAN cannot be used to verify the completion of 16 years of age in females. For males, the minimum age in stage D for both teeth was just over 16 years according to our results. Thus, following the minimum age concept [41], the characteristic would theoretically be suitable for proving that Chinese males had reached the age of 16. Summarizing the results of both studies, as the age in our results was only very slightly above 16 years and Guo et al. did not find 100% sensitivity in males in 2014, the completion of eruption of the mandibular third molars should not be relied upon as the exclusive marker for verifying the completion of 16 years of age in northern Chinese males. Rather, it should only be evaluated in combination with other traits.

In addition to the study by Guo et al. from 2014 in a northern Chinese population [44], there are a few other studies on assessing eruption of third molars in PANs in Asian populations. Yusof et al. conducted a study on a Malaysian population in 2015 [55]. In 2008, Olze et al. presented a study on a Japanese population [45]. Table 7 compares the studies with our present study.

In their 2015 study, Yusof et al. included a total of 714 PANs from individuals aged between 14.1 and 23.9 years [55]. Like Guo et al. [44], Yusof et al. carried out a binary logistic regression analysis and stated that stage D is particularly suitable for detecting the completion of the 18th year of life in males. Yusof et al. provided a value for a correct age prediction as a percentage of 93.6 for tooth 38 and 93.9 for tooth 48 for this age threshold. For females, these values were well below 90%. Yusof et al. did not present comparable values for the age threshold of 16, as presented by Guo et al. [44, 55]. Overall, Yusof et al. also found an accelerated eruption in females [55].

Furthermore, it is striking that the mean value in stage D of Yusof et al. is considerably lower than the values of the other studies included in Table 7.

Olze et al. conducted a study in 2008 with a total of 1300 PANs from people aged 14–26 years (Table 7) [45]. They provide comparable descriptive measures to those in our study. Olze et al. found a minimum age of over 17 years for the complete eruption of mandibular third molars (stage D) in both teeth and both sexes. The method would therefore have been suitable for proving the age of 16 years but not to prove completion of the 18th year of age. According to our results, the values for completed eruption are more than one year lower for Chinese males and two years lower for Chinese females.

Important aspects must be considered when comparing the descriptive measures of the studies in Asian populations in Table 7 to deduce any potential population-specific differences in the temporal patterns of the eruption.

These aspects include the fact that the different studies did not use the same staging scale. In the studies by Guo et al., Yusof et al. and Olze et al., a staging scale presented by Olze et al. in 2007 was used (Table 7) [47]. The scale only differs in the definition of stage C from the staging used in the present study [47, 49]. Olze et al. once noticed a shortcoming in their staging scale from 2007, so the staging scale used in the present study was then presented as a further development in 2012 [49]. The shortcoming of the 2007 staging scale concerns the definition of gingival eruption. In the 2007 scale, the criteria for assessing gingival eruption involved the penetration of at least one cusp tip of the erupting third molar through the gingival tissue [46]. Although the clinical identification of this stage presented no challenges in practical clinical settings, the radiographic evaluation of gingival eruption proved to be intricate, often rendering it unfeasible [49]. However, stages A, B and D are identical for both staging scales. This allows the reference values for stage D, which corresponds to a completed eruption in both scales, to be compared. Recently, our group demonstrated that the staging scale presented by Olze et al. in 2012 is particularly suited [54]. We were able to conclude that this staging scale should be employed for further studies in the field [54].

Other differences between the studies could explain the differing descriptive measures. Previous research has demonstrated that the age intervals of the study populations influence the mean values of the ages in the stages [17, 56]. The age range in the study by Guo et al. starts at 11 years and is therefore considerably lower than the 15 years in our study. It is obvious that this different age range alone results in lower mean values, especially for the lower stages [17, 56]. However, the age ranges between the studies by Yusof et al., Olze et al. and our study are more comparable.

In addition to the influence of different age ranges in the study populations, the influence of an uneven age distribution in the study population must also be taken into account [17]. The effect of uneven age distribution is qualitatively similar to the choice of age ranges in the study. A disproportionate number of younger individuals naturally leads to lower estimated ages for groups that include those individuals, while the greater number of older individuals increases the estimated ages [17].

Analyzing the age distribution in the study by Guo et al., it is apparent that there was a clear surplus of individuals in the 11–15 years age groups [44]. For the study by Yosuf et al., no such clear trend can be found for the age distribution in the study population, although the age groups are not represented equally here either [55]. The study by Olze et al. shows a clear overrepresentation of the 19–24 years age groups in both sexes, but particularly among females [45]. The margins of the study population, age groups 14, 15 and 26 years, are significantly underrepresented [45] in the Olze et al. study. For our study, a slight preponderance of males between the ages of 21 and 24 was found (Table 1). However, looking at the teeth examined in Table 2, the effect is not fully reflected. There is a greater underrepresentation of teeth in the 19 years age group.

The conclusion that different results between the studies can be attributed to a fundamental ethnic difference in the expression of the trait may therefore only be drawn after careful evaluation of the relevant study parameters. For studies designed to evaluate such relationships, the parameters of the studies would have to be comparable as far as possible.

When comparing the studies overall, according to our results, the assessment of mandibular third molars in PAN in females is not suitable for detecting the completion of the 16th year of life in northern Chinese individuals. With this result we could not contribute to a confirmation of the results of Guo et al. and Olze et al. for other Asian populations [44, 45].

In males, our minimum age in stage D was above 16 years, but only marginally so. However, considering the study by Guo et al. from 2014, the assessment of third molar eruption in the PAN in northern Chinese males should not be used as the decisive marker to prove the completion of 16 years of age [44]. This is in clear contrast to the results of Olze et al. from 2008 for a Japanese population [45]. When looking at the minimum ages of different studies, however, it must be noted that the lowest minimum age found across the various studies should be regarded as the “true” minimum age for the respective stage. The increase in the number of individuals examined added in the new studies increases the probability of finding the “true” minimum age.

Overall, compared to the existing literature in Asian populations, we demonstrated completion of the eruption (stage D) in both sexes at considerably younger ages [45, 47]. Future studies should therefore examine whether the postulate made by Olze et al. in 2007 that the eruption of the third molars in Asians is completed after that of Europeans and black Africans can still be assumed to be correct [47].

For our study, we chose the approach of consensual final evaluation. In existing studies on forensic age assessment, in contrast, it is common practice to determine examiner agreement and to specify corresponding coefficients [57,58,59] (Table 7). However, as there is never an exact agreement between the examiners in those cases, it must be assumed that “incorrect” findings are also included in the statistics and thus form the basis for the published reference values. In order to overcome this shortcoming, a consensual joint assessment was carried out in our study in cases where the two examiners had not determined the same stage. These determinations then form the basis for the reference data presented. This approach attempts to reduce the proportion of “incorrect” determinations to make the reference data more robust. However, even with this approach, it is not possible to ensure that the “true” stage was found in every case. A residual subjectivity also remains inherent in this approach.

In the study mentioned above, in which our group compared different staging scales to assess the eruption of mandibular third molars, we also examined observer agreement. The examiners were the same as in the current study.

Krippendorff coefficients of 0.904 (males) and 0.898 (females) were found for the intra-observer agreement for the Olze et al. method used in the present study. The comparable values for inter-observer agreement were 0.797 and 0.792 respectively [54]. ]. These data are within the range of comparable studies (Table 7).

For reference studies in age assessment, an equal distribution across the age groups is required [17, 39]. The required equal distribution in studies with dental traits therefore refers more to the equal distribution of teeth across the age groups. Since it is not directly possible to draw conclusions about the number of teeth examined when studying PANs and considering specific exclusion criteria, it makes sense to state the number of teeth actually examined. We have complied with this requirement and provide the relevant data in Table 2. Minor deviations observed across age cohorts within our population may be ascribed to methodological factors inherent in the study, such as the lack of a targeted pre-screening process for X-ray images prior to formal assessment. This must be mentioned as a limitation of the study. However, this approach was chosen because a targeted filling of the age groups contains the risk of bias. By specifically filling the age groups with suitable teeth, i.e. two healthy teeth each, there is a risk of eliminating the randomization and turning the sample into a biased one. Therefore, such an approach was not used in order to avoid distorting the results. The disadvantage of this approach is the resulting inevitable imbalance in the age groups.

The Chinese population is composed of many different ethnic groups [60]. The influence of these individual ethnic groups on the trait under study has not been clarified. This means that the significance of the present results is probably not representative of the entire Chinese population.

It is currently common practice in forensic age assessment studies of Chinese populations to further classify them as Northern, Southern or Western Chinese [44, 58, 61,62,63]. Differences between the various ethnic groups in China regarding third molar eruption should be studied in the future.

The main limitation of the present study relates to the reference sample. Prospective randomized population-based study populations would be desirable. However, prospective X-ray examinations are not possible for such studies for ethical reasons. Thus, in this field of science, retrospective use must regularly be made of existing data from medical facilities. Therefore, the study populations are not necessarily identical to the general population. This fact has already been discussed in detail in the relevant literature [64]. It is stated that the data can nevertheless be assumed to be reliable [64]. The need for strict inclusion and exclusion criteria is emphasized in the literature in order to keep the bias as low as possible [64]. For this reason, correspondingly strict criteria were applied in the present study.

Conclusion

Contrary to older literature, in northern Chinese people, the eruption of mandibular third molars can be completed before the age of 16 in females. As the results are inconsistent with previous literature data, they should be validated by further studies. Based on our findings and existing literature, the trait should not be used as the decisive marker to prove exceeding this age threshold in males either.

In the future, reference studies on the subject should be standardized regarding the staging scale used, the age range and age distribution of the study population.

Key points

  1. 1.

    The minimum age at completed eruption of the third molars in females was under 16 years.

  2. 2.

    The minimum age at completed eruption of the third molars in males was slightly over 16 years.

  3. 3.

    The trait should not be used as the decisive criterion to prove that the 16-years age limit has been reached for either sex.

  4. 4.

    Proof of majority is not possible with this trait in the population studied.