Introduction

Age estimation has been frequently applied in the field of orthodontics, pediatric dentistry, anthropology, and archaeology [1, 2]. In circumstances wherein the actual age is ambiguous, it is usually used to assess the profile and age of a person [3, 4]. Age estimation is mainly used to determine certain social responsibilities, such as criminal responsibility, legal ability, and employment, marriage, or immigration responsibility [3,4,5,6]. Moreover, assessment of the age is also used in the treatment plan and diagnostic tools in orthodontics and pediatric dentistry [7,8,9].

To date, there are various methods to estimate the chronological age (CA), including clinical findings like fusion of sterno-clavicular bones, molecular biomarkers such as DNA methylation, left hand imaging, and tooth age assessment. Compared with skeletal development indicators [10], teeth can be used as a more reliable indicator of maturity, because it is controlled by genes and is independent of the individual’s somatic growth [11]. The degree of tooth mineralization is less affected by mechanical or chemical corrosions and is virtually immune to physical factors such as lack of deciduous teeth, insufficient space, dental caries, and some orthodontic deformities [2, 10, 11].

The radiological method proposed by Demirjian et al. [12] in 1973 in the study of French-Canadian individuals is the most widely applied method in dental age (DA) assessment at present, due to its rationality, ease of application, and objectivity [3].

The age prediction performance of the Demirjian method has been validated in Spain [13], Tunisia [14], France [15, 16], Malaysia [17], and other countries [4, 10, 16, 18], leading to an overestimation of CA. The Willems method simplifies the transformation steps of the Demirjian method, creating a new table. It showed that, as tested in different populations, including Somali [16], South Australian [18], Navi Mumbai [19], and Malaysian [20], the accuracy of estimating DA was consistently higher than that of the Demirjian method [21, 22]. Ethnic differences are thought to be a significant factor in the development of dental maturity [23,24,25]. For example, the DA by the Demirjian method for Chinese northern children comparing with the Willems method obviously overestimates the dental of boys and girls, which is not accurate [26]. However, according to the estimation of the DA of children in northern China, the Demirjian method underestimated the age of children in northern China, similar to Willems method [27]. Due to the fact that ethnicity is dependent on economic, political, and nutrition, it is clear that one set of dental age estimation method does not fit all populations including Chinese eastern population. Therefore, this study aimed to establish a modified formula to assess dental age in eastern Chinese children based on an adaptation of the Demirjian method and then compared the accuracy of the modified method with the Demirjian method and Willems method.

Materials and methods

Sample

In this retrospective study, 2367 samples were derived from patients who underwent treatment in Shanghai Ninth People’s Hospital affiliated with Shanghai Jiao Tong University, School of Medicine. The patients were all from eastern China, including Jiangsu, Zhejiang, and Shanghai province, which all have the same eating habits, geographical environment, and economic development level. The dental panoramic radiographs were obtained from 1217 boys and 1150 girls, with age ranging from 5 to 16 years. Radiographs were collected from the hospital information sheet from October 1, 2016 to March 31, 2018, and the birth dates of these children were from January 1, 2000 to February 28, 2013. These data were divided into 12 age groups every other year. The ethics approval for this study was granted by the Independent Ethics Committee of the Shanghai Ninth People’s Hospital affiliated with Shanghai Jiao Tong University, School of Medicine.

The inclusion criteria were as follows:

  1. 1.

    Han ethnic origin.

  2. 2.

    Age between 5 and 16 years.

  3. 3.

    Panoramic radiographs with clear and high-quality images and no obvious development pathology.

  4. 4.

    Complete dental records of radiographs, including the date of birth, date of radiography, and sex.

The exclusion criteria were as follows:

  1. 1.

    Bilateral absence of the teeth in the mandible (except the third molar).

  2. 2.

    Malocclusion or maxillofacial abnormalities.

  3. 3.

    Previous orthodontic or endodontic treatment.

  4. 4.

    Systemic disease or history of dental trauma.

  5. 5.

    Shape and position anomalies.

  6. 6.

    Any existing pathological condition relevant to the jaw bone, such as cysts or tumors.

  7. 7.

    Congenital and genetic anomalies.

  8. 8.

    The radiographs were of poor quality or blurred, and patient records were incomplete.

Materials and methods

The CA of each subject was calculated by subtracting the date of birth from the date of panoramic radiography, and the results were expressed as years with two decimal places.

Dental age estimation by the Demirjian method and Willems method

The dental age estimation was performed based on the maturity of the seven permanent teeth on the left mandible in the radiographs. According to the Demirjian staging criterion [12], tooth developments were classified into eight distinct stages, which were identified by letters “A” to “H”. The designated stages started with initial crown formation and continued until closure of the root apex. Each tooth was rated on a scale, and each rating was then converted into a score according to the table provided by Demirjian. The DA was obtained from the total maturity score of the seven teeth as described by Demirjian [12, 28]. The Willems method is a simplified process of the Demirjian method [29]. After converting each rating to a score according to the Willems method, the DA was obtained by adding the scores of the seven teeth.

Modified dental estimation

After testing the Demirjian and Willems methods in the eastern Chinese children, when paired t test showed the significant difference between DA and CA, a mathematical model was constructed to modify the Demirjian method using the polynomial curve fitting method (PCFM) to estimate the relationship between the Demirjian maturity score and CA, so as to obtain more accurate results. Since the Willems method was modified from the Demirjian method (2001), the score obtained from the Demirjian method was adopted to establish a more suitable scale for the Chinese population.

PCFM was used to find the coefficients of CA of degree n that fits the maturity score where the degree n can be selected according to the requirement of errors. It can fit a sufficient number of data points, which had good predictive power for both known and unknown data. It matched by minimizing the square of the error and finding the best function for the data.

The form of it is as follows:

$$ \mathrm{p}\left(\mathrm{x}\right)={\mathrm{p}}_1{\mathrm{x}}^{\mathrm{n}}+{\mathrm{p}}_2{\mathrm{x}}^{\mathrm{n}-1}+\dots +{\mathrm{p}}_{\mathrm{n}}\mathrm{x}+{\mathrm{p}}_{\mathrm{n}+1} $$

Inter-examiner and intra-examiner reliability

The digital orthopantomograms were scored separately by three observers who were skilled in both Demirjian and Willems methods. To avoid bias, they all had no information on patients’ details, such as name, age, or sex. One hundred randomly selected radiographs were scored by the observers after 2 weeks. The inter- and intra-rater agreement was assessed using Cohen’s kappa analysis.

Statistical methods

The statistical data were analyzed using Statistical Analysis System (SAS 9.4) software, and data management was performed using Microsoft Excel 2010. The relationship between the CA and DA was analyzed for each sex and age group. Besides calculation of the mean and standard deviation of the CA and DA, the difference between the CA and DA was tested using paired Student’s t test at a significance level of p < 0.05. Moreover, we usually use integers for age, so we assumed that, when |CA-DA| ≤ 1, the result was applicative and meaningful; then, we calculated the number and percentage of accuracy within this specific time interval.

Results

Statistical analysis revealed no statistically significant intra- and inter-observer differences, with kappa coefficients of 0.756 and 0.728, respectively. The mean CAs of boys and girls were basically equal, which were 11.20 ± 3.29 years and 10.99 ± 3.12 years, respectively. The distribution of the 2367 samples by age and sex is shown in Table 1.

Table 1 Distribution of samples by age and sex

Comparison between chronological age and dental age

The difference between chronological age (CA) and dental age (DA) were calculated independently for boys and girls. The descriptive statistics of the age subtractions (CA-DA) using the Demirjian and Willems methods are presented in Tables 2 and 3, respectively.

Table 2 Differences between mean chronological age (CA) and calculated dental age (DA) using the Demirjian method and p- values of the differences in various age groups for both sexes
Table 3 Differences between mean chronological age (CA) and calculated dental age (DA) using the Williems method and p-values of the differences in various age groups for both sexes

Table 2 shows that, in the Demirjian method, the most frequently observed age difference was between 0 and 1 year in boys and between 0 and 0.5 year in girls. In boys between 6.00 and 15.99 years of age, the DA yielded an insignificant under- and overestimation compared with the CA, and a significant underestimation is noted in those aged 16 years. In girls, the DA showed a slight overestimation between 6.00 and 13.99 years and underestimation between 14.00 and 16.99 years. The largest validation was 1.36 years in age 5 years in boys and 1.30 years in age 16 years in girls.

Table 3 shows all samples using the Willems method. The most frequently observed difference was 0–0.5 years. The DA presented a slight difference between ages 5.00 and 14.99 years in both sexes and a significant underestimation between ages 15.00 and 16.99 years. The largest under- and overestimation of age was 1.06 years (boys) and 1.75 years (girls).

Using both methods, the most significant validation of CA-DA was 16 years for both sexes. However, the Demirjian method showed more than 1-year bias in individuals aged 5 and 16 years, which resulted in a larger difference compared with the Willems method. The mean absolute error (MAE) using the Willems method for each sex (0.7 years for boys and 0.79 years for girls) was similar with that using the Demirjian method (0.73 year for boys and 0.79 year for girls).

Table 4 shows the percentages of CA-DA values within ± 1 year for both sexes in all age groups. The accuracy using the Demirjian method was 73.9% for boys and 70.2% for girls and 75.0% for boys and 75.9% for girls by using the Willems methods. Between ages 5.00 and 6.99 years, using the Demirjian method, the percentage accuracy varied from 15.0 to 54.0% and was only 38.2% for girls aged 16 years. The Willems method was extremely accurate in ages between 5.00 and 6.99 years (varying from 80.0 to 97.8%), but accuracies varied from 32.7 to 65.1% in those aged between 15.00 and 16.99 years.

Table 4 Percentages of accuracy for dental age (DA) estimated using the Demirjian and Willems methods for difference values within 1 year from chronological age (CA) for both sexes

Therefore, the Willems method resulted in a more accurate estimation in lower, relative to higher, age groups in both sexes. The Demirjian method showed better accuracy in ages 8.00–15.99 years. However, in both sexes, the Demirjian method and the Willems method provided quite accuracy in eastern Chinese children.

Modified dental estimation methods

PCFM was employed to estimate the age based on the maturity score values in our study as follows: (x represents maturity score; p(x) represents age; p1-pn+1 are all constants)

$$ \mathrm{p}\left(\mathrm{x}\right)={\mathrm{p}}_1{\mathrm{x}}^{\mathrm{n}}+{\mathrm{p}}_2{\mathrm{x}}^{\mathrm{n}-1}+\dots +{\mathrm{p}}_{\mathrm{n}}\mathrm{x}+{\mathrm{p}}_{\mathrm{n}+1} $$

According to the eastern Chinese samples, two formulas are used:

$$ \mathrm{Boys}:p(x)= 0.000000304471040683471\ast {x}^5- 0.000104409502065903\ast {x}^4+ 0.0141677515859983\ast {x}^3- 0.948508480952154\ast {x}^2+ 31.3447464323149\ast x- 403.48344718101 $$
$$ \mathrm{Girls}:p(x)= 0.000000461590755570541\ast {x}^5- 0.000154896783268403\ast {x}^4+ 0.0203841924163241\ast {x}^3- 1.31156808447141\ast {x}^2+ 41.2383716718296\ast x- 501.400716931816 $$

The modified formula was used to establish a new relationship between the maturity score and DA between 5 and 17 years. This resulted in new tables for boys and girls with scores directly converting to age (Tables 5 and 6).

Table 5 New table to convert the maturity score to dental age for boys
Table 6 New table to convert the maturity score to dental age for girls

Comparison between chronological age and modified dental age

The accuracy of the modified method was tested to prove that the modified model is suitable for use in eastern Chinese children. The percentages of CA-DA values within ± 1 year of the modified method were 83.7% for boys and 79.6% for girls (Table 7). In the Demirjian method, the percentage accuracies were 73.9% for boys and 70.2% for girls for the absolute difference values within 1 year. The Willems method presented a better percentage of accuracy for the absolute difference values within 1 year, which were 75.0% for boys and 75.9% for girls. The accuracy was significantly improved compared with those in the Demirjian and Willems methods. Among them, the accuracy rate of the modified method in ages 5 and 6 years most significantly increased compared with that of the Demirjian method, and compared with that of the Willems method, the most obvious increase was in ages 13–16 years.

Table 7 Percentages of accuracy for dental age (DA) estimated using the Demirjian and Willems methods for difference values within 1 year from chronological age (CA) for both sexes

The descriptive statistics for the CA-DA is shown in Table 8 and Fig. 1. It resulted in a smaller underestimation compared with CA for both boys (0.04 year) and girls (0.09 year), which showed a high suitability to Chinese children to some extent. The age differences in both sexes were between 0 and 0.5 years, except in boys aged 12 years. The MAE of each age group ranged from 0.31 to 0.88 years, which was quite closer to the CA compared with those in the Demirjian (ranging from 0.36 to 1.36 years) and Willems methods (ranging from 0.47 to 1.68 years).

Fig. 1
figure 1

Comparison of dental age between the modified method, the Demirjian method, and the Willems method by gender

Discussion

Accurate age estimation techniques are increasingly needed due to the increasing number of unidentified bodies, as well as age-dependent criminal cases and absence of a valid birth certificate for individuals. In many developed and developing countries, including China, DA estimation of children is widely applied. Ages 14, 16, and 18 years are important limitations in legal and social spheres, which deciding criminal liability is directly responsible by themselves or by the guardian. Ages 5–12 years are mixed dentition period, in which orthodontics and pediatric dentistry treatment are needed.

As is well known, the advantages of clinical methods of DA estimation are based on the fact that tooth development is largely controlled by genes [30, 31]. Among various DA estimation methods, the Demirjian and Williams methods, as the methods using panoramic radiographs to depict the eight stages of tooth mineralization and apical closure, have gained wide attention in different regions because these methods are easy to perform and inexpensive and the subjects are accessible [12, 21]. Considering that dental development is partly regulated by some factors including ethnicity and social environment, the authors speculated that their estimation may be inaccurate in other populations. A series of publications have revealed its inaccuracies [19, 21,22,23, 25]. Hence, rather than setting a universal standard, a population-specific standard will achieve the most accurate age assessment.

The present study was conducted to establish a formula that is more applicable to Chinese children and supplement the accuracy and applicability of the three methods for dental estimation of children in eastern China. The age range of studied children was 5 to 16 years, and their nationalities are all Han. Applying the Demirjian method, we concluded an overestimation in DA in general (0.21 year for boys and 0.10 year for girls). In each age group, the dental estimation difference values vary from 0.07 to 1.36 years in boys and 0.01 to 1.30 years in girls. The largest difference was in ages 5 and 16 years. The Willems method simplifies the Demirjian method by allowing the estimated age to be calculated directly from the new score adapted from the original maturity score. In our study, for boys, the DA was more overestimated than CA with 0.07 year for boys and 0.24 year for girls, which was similar with those in the Demirjian method. The difference values for each age group were generally low, ranging from 0.04 to 1.06 years in boys and 0.02 to 1.75 years in girls. However, significant underestimation was expressed in older ages, especially in ages 15 and 16 years. Compared with it, the Demirjian method presented more accurate results in both sexes from ages 12 to 16 years. These results are consistent with those of previous studies. Zhai et al. [27] and Wang et al. [25] evaluated 11–18-year-old children and found that the Demirjian method was more accurate.

Studies have shown variations in both dental emergence and formation in different ethnic populations, since people of various nationalities differ in biological maturity [21]. A study compared the biological maturity from the hand-wrist radiographs of Asian, Hispanic, and African-American children and found that Asian and Hispanic children mature earlier than other children [27]. However, to date, there is no suitable formula for Chinese DA estimation. Hence, we analyzed the reasons behind the inaccuracy of the two methods used in Chinese children. The largest difference between the DA and CA of the Demirjian method was noted in lower ages, especially in ages 5 and 6 years, resulting in a significant overestimation. As for the Willems method, the largest difference was noted in ages 13–16 years, generally expressing underestimation. Thus, we, therefore, use polynomial fitting formula to build a new table. We reduced the error by increasing the order, but at the same time, overfitting became more and more serious. To solve this problem, we expanded the sample data to more than 2000. After using the polynomial curve fitting to reevaluate the relationship between the scores and CA, the scores corresponding to lower ages increased in the new table.

In analyzing each age cohort separately in the 2367 samples, the modified method proved to be more accurate than the Demirjian and Willems methods in most age groups (Table 8 and Fig. 1). The lowest mean differences in boys were 0.05, 0.03, 0.02, 0.04, and 0.00 years for ages 5, 6, 7, 9, and 14 years, respectively. Moreover, girls, especially those aged 6, 8, and 14 years, with mean differences of 0.00, had an accurate DA estimation. The MAEs were 0.63 year for boys and 0.64 year for girls, which were both less than those in the two methods. The percentages of accuracy within 1 year sharply increased in lower ages compared with those in the Demirjian method, especially in age of 5 years, from 15.0 to 82.5% for boys and from 34.8 to 93.5% for girls. It also significantly increased in ages 13–16 years compared with those in the Willems method.

Table 8 Differences between mean chronological age (CA) and calculated dental age (DA) using the modified method and p-values of the differences in various age groups for both sexes

The estimated age difference (CA-DA) using the modified method was − 0.04 year for boys and − 0.09 year for girls, and it estimated the age of 83.7% of boys and 79.6% of girls within ± 1 year range, showing it to be more accurate than Demirjian method and Willems method. For example, Wang et al. showed that the Demirjian method was found to be more accurate compared with the Willems method for estimation of the ages of children from eastern China, with the mean underestimations of 0.66 year for boys and 0.62 year for girls [25]. Yang et al. found that, in 8–16-year-old southern Chinese children, the Willems method underestimated DA by 0.03 year for girls and overestimated it by 0.03 year for boys, and the Demirjian method underestimated DA by 0.54 year for girls and 0.44 year for boys [25]. Jayaraman J et al. reported that for the southern Chinese specific dental reference datasets, the age difference was − 0.02 for both gender and 80% of subjects were within ± 12 months range, which accuracy is basically the same as the modified method. However, their number of samples is 254, which is relatively small, so the error is more likely to exist [32].

However, in either the Demirjian method or the modified method, the score intervals corresponding to the age at an interval of 0.1 year in older ages were quite close. Therefore, the inaccurate estimation of the root closure degree of any mandibular left tooth will lead to a large error of DA. It is suggested to subdivide the stages of tooth development and design a software to precisely evaluate the stages to reduce the error of the score.

According to the score system in the Demirjian method, if the root apices of the seven left mandibular teeth are all closed, the sum of the maturity score is 100 for boys and 98.4 for girls, which both correspond to the age of 16 years. It must be determined whether the Demirjian and Willems methods both have estimation errors in ages 16–18 years. Based on the Demirjian method, the third molar can be taken into consideration in deciding if an unidentified person has reached 18 years of age. Concurrently, cone beam computed tomography can be used to establish a 3D model and increase the measurement accuracy [20, 33].

Conclusion

The study was conducted to establish a modified dental age estimation suitable for eastern Chinese children. The Demirjian method was found to be unreliable in lower ages (5–11 years), while the Willems method was unreliable in older ages (12–16 years). Neither the Demirjian method nor the Willems method was accurate. Thus, PCFM was set to convert the maturity score in the Demirjian method to the DA of eastern Chinese children. A further study should analyze the applicability of the new table in other parts of China.