Background

The science of estimating stature from bones was well known since the 19th century (Pearson 1899) till present moment. Researchers all over the world developed regression for different populations. Long bones, metacarpals, metatarsals, vertebrae, talus, calcaneus, scapula, pelvis and skull were proven to show correlation with stature (Bidmos 2006; Karaman et al. 2008; Karakas et al. 2011; Mahakkanukrauh et al. 2011; Giurazza et al. 2012; Torimitsu et al. 2014a, 2014b; Zhang et al. 2015). Though a variety of bones were used to estimate stature, estimation of stature from long bones appeared to be most popularly used and provided accurate results (Dupertuis and Hadden 1951; Trotter and Gleser 1952; Mall et al. 2001; Radoinova et al. 2002; Hauser et al. 2005; Celbis and Agritmis 2006; Dayal et al. 2008; Didia et al. 2009; Mahakkanukrauh et al. 2011; Akhlaghi et al. 2012; Ahmed 2013a, 2013b; Farsinejad et al. 2014; Torimitsu et al. 2014c; Jeong and Jantz 2016; Zech et al. 2016).

Early publications on stature estimation developed by Pearson (1899), Trotter and Gleser (1952) and Trotter and Gleser (1958) were widely used by several researchers. However, in the present day, these regressions were no longer valid to represent different populations and generations. Their applications could lead to inaccuracy in stature estimation in the current forensic setting (Radoinova et al. 2002). The most precise results for estimating stature should be provided by the population, from which the equations were derived. Thus, each population should have their own regressions with a sampling of modern populations to account for any secular trends in stature (Mahakkanukrauh et al. 2011).

There were various methods used by researchers for stature estimation depending on the available resources. Most researchers used direct measurements on bones such as the Terry’s collection (Hunt and Albanese 2005), Dart’s collection (Dayal et al. 2009), Giraudi’s collection (Giraudi et al. 1984) and Hamann-Todd’s collection (Mensforth and Latimer 1989). Recent studies used x-ray and scan due to the new upcoming technology and availability of skeletal collections (Muñoz et al. 2001; Sarajlić et al. 2006; Petrovečki et al. 2007; Hasegawa et al. 2009; Amal Hayati et al. 2011; Farsinejad et al. 2014). The most advanced method was to perform post mortem computed tomography (PMCT), as it is in line with the growing use of virtual autopsy worldwide (Giurazza et al. 2012; Giurazza et al. 2013; Torimitsu et al. 2014a, 2014b; Hishmat et al. 2015; Zhang et al. 2015). Although direct measurements on bones gave more accurate results compared to that with radiography (Hauser et al. 2005), direct measurement has not been adopted in Malaysia due to the limited number of skeletal collections.

Estimation of stature is an important indicator to help in establishing the biological profile of skeletal remains in medicolegal investigations. In this study, predictive formulae were constructed based on radiographic measurements of maximum length of humerus, radius and ulna. The aim of this study was to develop regression equations to estimate stature of Malaysian adults from radiographic measurements of upper limbs.

Materials and methods

Subjects

A total of 90 healthy Malaysian adults comprised 50 males and 40 females, from 23 to 47 years of ages, were randomly selected from the Radiology Department. The ethical approval was obtained from the Institutional Ethics Committee (NMRR-09-1040-4976). Consent from all subjects was obtained prior to the study. Subjects with history of trauma, surgery of upper limb and/or past history of prolonged illness were excluded from the study. The living stature of the subject was measured by using the standard height measurement scale (Portable Leicester Stadiometer) taken in centimetre (cm) (in two decimal places). The subjects were required to stand erect in an anatomical position (Saladin 2007). As the diversity in demic groups is challenging, ambiguous or mixed demic groups were excluded as studies proved that genetic and dietary may contribute to the difference in body build and growth (Shields 2007).

Radiographic technique for bone measurement

The X-rays for left and right hands were taken for each subject. A custom-made aluminium ruler scale (49 cm long, 47 cm length) was placed on the scanning film alongside the upper limb for measurements. The position of the side marker (left or right) against the feature was marked, and the exposure limit was set at 40 to 50 kvp (kilovoltage peak) or 4 to 5 mAs (miliamperage per second).

The maximum length of humerus was measured from the head of the humerus to trochlea. The maximum length of radius was taken from radial head to the tip of the styloid process. The maximum length of ulna was taken from olecranon to the tip of styloid process. All measurements on both sides were recorded in cm (two decimal places) (Saladin 2007).

Statistical analysis

The data was analysed by SPSS version 23.0 for Windows. The descriptive statistics were analysed by One-way ANOVA (post-hoc) to evaluate, whether the means of all groups (Malays, Chinese and Indians) was different for different parameters (lengths of humerus, radius and ulna). The Shapiro Wilk test was used to determine if the three data sets showed a normal distribution. The Pearson’s correlation analysis was used to see the correlations between stature and the parameters. Single and multiple regression analysis were used to predict stature based on single and multiple parameters, respectively. Only measurements from the left side were accounted for in the regression equation analysis. The right side measurement was used, only when the left side was not available due to fractures or deformity. The study was validated by testing the regression against 10 randomly selected subjects for comparison with true stature.

Results

The Shapiro-Wilk test showed that the data (n = 90) were normally distributed. The males (n = 50) were subcategorised into three different demic groups (Malay = 20, Chinese = 7, Indian = 23). Similarly, the females (n = 40) were subcategorised into three demic groups (Malay = 20, Chinese = 7, Indian = 23). Results showed that stature was significantly different between male and female (p < 0.05). The mean stature of male was 168.24 cm, which was higher than the female (155.69 cm). However, there was no significant difference among the demic groups of Malay, Chinese and Indian in stature (p > 0.05).

There was no significant difference in stature with age or with different sides of the body. The minimum and maximum ages for males were 23 years and 47 years, respectively. In females, the minimum and maximum ages were 23 years and 44 years, respectively. The age distribution was normal. The results showed that with increased age, the stature did not increase in persons between 23 and 47 years of age.

The Pearson’s correlation showed that stature was correlated with all the parameters namely, maximum lengths of humerus, radius and ulna (p < 0.05). The minimum, maximum and mean values for all measurements and living stature were tabulated (Table 1). The regression equations for stature estimation were presented (Table 2). The single linear regression equations were developed if there was only one bone available for stature estimation. The multiple regression equations were developed for multiple bones to increase its accuracy.

Table 1 Descriptive Statistics of Measurements
Table 2 Regression equations for male and female

In different sex, the correlation differed in several aspects for instance, humerus showed the highest correlation (r = 0.64) with stature in female, but showed the least correlation (r = 0.44) in males for single linear regression equations. However, when humerus, radius and ulna were used in combination, the correlation coefficient was high in both sexes (r = 0.71 in male, r = 0.72 in female). The regression equation with the highest accuracy was indicated by the lowest standard error of estimate (SEE). In simple linear regression, the lowest SEE was based on radius in female (SEE = 10.33). In multiple regression, the highest correlation with stature was achieved in female (r = 0.72).

For comparison, the mean of estimated stature was compared with the mean of true stature by paired T-test (Table 3). The results showed no significant difference between true and estimated stature in both single and multiple regressions (Table 4). The statistical results indicated that estimated statures were in good approximation with true statures. The maximum difference between true and estimated stature in both single linear and multiple regression equations showed large differences i.e. 8.96 and 9.33 cm, respectively.

Table 3 Comparison of true stature with estimated stature using linear regression and multiple regression equations
Table 4 Paired T-test between true and estimated stature

Discussion

In the present study, Malaysian males were found to be taller than females. Results showed significant differences in stature between males and females. There was no difference among the demic groups of Malay, Chinese and Indian. The results were in accordance with a study in similar population using radiograph on lower limbs (Amal Hayati et al. 2011). The agreement of results between this study and the literature was probably attributed to similar population used and their small sample size. Interestingly, the demic groups in Malaysia are complex in nature as some Malaysians were descendants from mixed marriage. For instance, Chindian is an offspring of Chinese and Indian. This information was missed during the study as the family tree was excluded from the profile of each subject.

Small sample size (n = 90) was a limitation in this study. However, it is a common issue in stature estimation studies. Several studies were done on limited number of sample populations. For instance, studies were done in the Croatian population (n = 40), Bosnian (n = 50), Iranian (n = 101) and Spain population (n = 104) (Petrovečki et al. 2007; Sarajlić et al. 2006; Farsinejad et al. 2014; Muñoz et al. 2001). It is noteworthy that these were preliminary studies, which require further research for validation. Additionally, a study on the Japanese population used a relatively big sample size (n = 434), which is commendable (Hasegawa et al. 2009).

The present study proved that anthropometric measurements did not show any difference between the left or right arm. The literature showed no difference between different sides of the arm (Trotter and Gleser 1952; Petrovečki et al. 2007; Hasegawa et al. 2009; Amal Hayati et al. 2011). Thus, this research used only the left arm to develop regressions. Several studies used only one side measurement (Choi et al. 1997; Muñoz et al. 2001; Amal Hayati et al. 2011; Farsinejad et al. 2014). In the literature, the average values of both sides were used to develop regressions (Radoinova et al. 2002; Sarajlić et al. 2006; Petrovečki et al. 2007; Hasegawa et al. 2009). Also, a study proved that bilateral asymmetry existed whereby, the farmers tend to have stronger hands on the dominant side because of routine extraneous exercise (Krishan et al. 2010). Thus, it is feasible to analyse both sides of extremities before taking into account, whether to use a single side or an average of both sides in developing regressions for a specific population.

In the literature, lower limbs were highly correlated with stature than the upper limbs (Trotter and Gleser 1952; Choi et al. 1997; Muñoz et al. 2001; Duyar and Pelin 2003; Hasegawa et al. 2009; Amal Hayati et al. 2011; Giurazza et al. 2012). However, in most cases, the lower limbs were sometimes went missing, fractured and/or damaged at the time of discovery (Giurazza et al. 2012). In such conditions, upper limbs could provide alternatives for stature estimation. In this research, the best bone to be used was ulna in male (r = 0.68) and humerus in female (r = 0.64). Interestingly, Petrovečki et al. (2007) found similar results in female (r = 0.50).

The utility of regression equation depends heavily on SEE value. The SEE in this study ranged from 10.33 to 15.99, which was higher than that in the literature (Table 5). However, the SEE in this study was comparable with a study done in the Malaysian population (Amal Hayati et al. 2011). Both studies had limited sample size. Amal Hayati et al. (2011) utilised only 32 subjects, and this study used 90 subjects. This present study was a preliminary study in the Malaysian population. The regression equations are not suitable yet to be used practically in Malaysian population due to high standard error of estimates. However, this study showed a great effort to develop regression equations for the Malaysian population. The regression can be developed further with extended sample size to provide reliable regression for stature estimation in the Malaysian population.

Table 5 Comparison of Standard Error of Estimate (SEE) with other similar studies using x-ray on long bones

Conclusion

In conclusion, regression equations were developed for stature estimation in male and female based on humerus, radius and ulna. The best parameters were ulna (in male) and humerus (in female). However, the regressions could be improved in future with larger sample size to better represent the diversity of the population. We will continue this research to validate the results in an extended sample for identification of human remains in Malaysia.