Abstract
Purpose
This study aimed to validate the Willems Belgian Caucasian (Willems BC) age estimation model in a Kenyan sample, to develop and validate a Kenyan-specific (Willems KB) age estimation model and to compare the age prediction performances of both models.
Methods
Panoramic radiographs of 1038 (523 female, 515 male) Kenyan children without missing permanent teeth and without all permanent teeth fully developed (except third molars) were retrospectively selected. Tooth development of the seven lower-left permanent teeth was staged according to Demirjian et al. The Willems BC model, performed on a Belgian Caucasian sample and a constructed Kenyan-specific model (Willems KB) were validated on the Kenyan sample. Their age prediction performances were quantified and compared using the mean error (ME), mean absolute error (MAE) and root-mean-square error (RMSE).
Results
The ME with Willems BC method equalled zero. Hence, there was no systematic under- or overestimation of the age. For males and females separately, the ME with Willems BC was significantly different from zero, but negligible in magnitude (–0.04 and 0.04, respectively). Willems KB was found not to outperform Willems BC, since the MAE and RMSE were comparable (0.98 vs 0.97 and 1.31 vs 1.29, respectively). Although Willems BC resulted in a higher percentage of subjects with predicted age within a one-year difference of the true age (63.3% vs 60.4%, p=0.018), this cannot be considered as clinically relevant.
Conclusion
There is no reason to use a country-specific (Willems KB) model in children from Kenya instead of the original Willems (BC) model.
Keywords
Introduction
Estimating the age of an individual has proven useful for several purposes, such as pre- or post-mortem identification of unknown subjects, controlling migration issues such as child trafficking and abuse and other civil, criminal, socio-economic or administrative purposes. 1 The most used forensic methods for chronological age estimation in children are those based on radiographic registrations of dental and bone development.2,3 While bone development seems to be more influenced by environmental factors such as the diet, teeth have been reported to have a fairly stable maturation track. 4
Several methods for dental age estimation in children have been reported and validated, such as the ones by Demirjian, Nolla, Nicodemo or Willems,5–7 which use radiographically observed permanent tooth development and classify it into predefined stages. From these, the Willems model 7 has been found to provide the most accurate dental age estimations in children. 8 This model was developed in a Belgian Caucasian (Willems BC) sample of 2116 individuals and has been validated in multiple populations with different ethnical backgrounds.8–18
One of the limitations of the Willems BC method was its lack of validation in diverse black populations. Recently, this was achieved in a South African and a Somali black sample.12,15 However, the population of black Somali children was exceptional, since they had been living their whole life in Finland, which makes fair comparisons with other populations of black children difficult. On the other hand, the country-specific prediction method established on the South African black sample (Willems SAB) performed better compared to Willems BC, but the differences were too small to be clinically relevant, especially in males. These results should therefore be confirmed in another black population, since certain studies report differences in age estimation results between black and non-black populations.19,20 Therefore, in the present study, a sample of panoramic radiographs of Kenyan children was collected.
In Kenya, age estimations in children are particularly requested for a specific age threshold. Children under the age of eight cannot be held criminally responsible, while a child between 8 and 12 years of age can be held criminally responsible if his or her capacity to know right and wrong can be proven. A male person under the age of 12 is presumed incapable of having carnal knowledge, which prevents the prosecution of younger boys for certain sexual offences, while the official age of consent is 18 years old. 21 Migration of individuals to and from Kenya is also an issue where age estimation is desirable, especially in the case of unaccompanied minors. UNICEF shows in its 2018 Kenya Humanitarian Situation Report that in 2017, approximately 469,000 immigrants came to Kenya from the neighbouring countries of Somalia, South Sudan, Congo or Ethiopia, while 350,000 people migrated from Kenya to the USA, the UK or Canada. 22
The aims of the present study were (a) to use a sample of black children from Kenya to validate the Willems age estimation model (Willems BC), (b) to develop and validate a Kenyan-specific age estimation model (Willems KB) and (c) to quantify and compare the age prediction performances of both models.
Methods
Digital panoramic radiographs of 1399 children between 3 and 24 years old of Kenyan origin were collected (Mage=11.71±4.63 years, 720 females (F) and 679 males (M)) between 2015 and 2018. Data were collected from the following sites in Nairobi: the School of Dental Sciences of the University of Nairobi, the Dental and Maxillofacial Imaging Centre (DAMIC), the Kenyatta National Hospital and the private practices Millennium Dental Services and JB Dental Clinic. Approval was obtained from the Ethics and Research Committee (ERC) of the Kenyatta National Hospital/University of Nairobi (KNH/UON; approval no. P499/10/2013).
The following selection criteria were applied: age was used to ensure that a sufficient sample could be collected in each age category, which was confirmed by the birthdate on official papers, and radiographs with poor image quality or unclear images were excluded from the study, as well as those corresponding to patients with cleft lip and palate or other craniofacial affectations. The seven permanent teeth of the left mandible were staged on the panoramic radiographs according to the Demirjian technique. 5 Intra-observer reliability was tested by weighted kappa analysis. Statistical analyses were only performed on individuals without missing permanent teeth and without all permanent teeth on the left mandible fully developed, excluding third molars.
To validate the Willems BC model on the Kenyan sample, the obtained Demirjian stages were used as categorical predictors for age in a weighted analysis of variance (ANOVA) model. Weights were used to handle the non-constant variance (higher variability of age at higher stages). The same approach was used on all included Kenyan individuals to construct a new Kenyan-specific age prediction model, referred to as the Willems KB model. The age prediction performances of both approaches (Willems BC vs. Willems KB method) were quantified and compared in general and by sex and age categories. Therefore, the mean error (ME, calculated by subtracting the estimated age from the chronological age), the mean absolute error (MAE) and the root-mean-square error (RMSE) were used. Leave-one-out cross-validation was used in the validation of the Willems KB model. Note that the validation of the Willems BC model was external and hence did not require a cross-validation strategy. ME and MAE were compared using Wilcoxon signed-rank tests. A McNemar test was used to compare the proportion of children with a MAE ≤1 year. The association between age or mean score and the (absolute) error was evaluated using Spearman correlations. They were compared using the Steiger modification of Dunn & Clark Z test for dependent correlations. All statistical analyses were performed using SAS v9.4 (SAS Institute, Cary, NC).
Results
From the 1399 individuals with collected Demirjian information (720 F, 679 M), 1038 (523 F and 515 M) had no missing permanent teeth and did not have all permanent teeth of the left mandible fully developed. Only these were included in the analysis. If a specific score level did not occur in the training set, no age prediction could be obtained for that subject. In the Willems KB cross-validation, this was the case for two subjects, reducing this validation sample to 1036 (522 F, 514 M) subjects.
The selected subjects were between 3 and 18 years old, with a mean age of 9.85 years old. A total of 1030 (99.23%) subjects were <16 years old, while eight (0.77%) were 16–18 years old. The sex and age distribution of the selected sample is reported in Table 1.
Age and sex distribution of the selected sample.
SD: standard deviation; M: mean.
Results of the kappa analysis showed a good intra-observer agreement. The simple and weighted kappa values were 0.84 and 0.94, respectively. The regression coefficients from the weighted ANOVA for the Willems KB model are reported in Table 2, while Table 3 reports the results of the ME, absolute error, RMSE and differences between both Willems BC and KB models. Table 4 reports the error and absolute error per age category and sex. Note that a negative error refers to overestimation of the age, while a positive error is interpreted as underestimation of the age.
Regression coefficients from the weighted ANOVA (Willems KB).
ANOVA: analysis of variance; t31: central left-lower incisor; t32: lateral left-lower incisor; t33: left-lower canine; t34: first lower-left premolar; t35: second lower-left premolar; t36: first lower-left molar; t37: second lower-left molar.
Absolute error, RMSE and comparison between Willems BC and KB.
Note that results were reported for 1036 instead of 1038 subjects, since for two subjects, no age prediction was obtained in the leave-one-out cross-validation procedure (i.e. if the case in the test data set has a score which does not appear in the training data set).
Med: median; p=p-value from Wilcoxon signed rank test comparing absolute error Willems BC and absolute error Willems KB; RMSE: root-mean-square error.
Error and absolute error as a function of age and sex.
Note that a negative error refers to overestimation of the age, while a positive error is interpreted as underestimation of the age.
The ME with Willems BC method was found to equal zero. Hence, we can conclude that there is no systematic under- or overestimation of the age. For males and females separately, the MEs are significantly different from zero but negligible in magnitude (–0.04 and 0.04, respectively). The MAE and RMSE between the Willems BC and KB methods were comparable (0.97 vs 0.98 (p=0.4209) and 1.29 vs 1.31, respectively). Although the Willems BC model resulted in a higher percentage of subjects with the predicted age within a one-year difference of the true age (63.3% vs 60.4%, p=0.018), this cannot be considered as clinically relevant. There were no important differences in calibration slope between both approaches (Table 5), although the calibration slope when using Willems KB was closer to 1, but also when using the Willems BC method the slope did not differ significantly from 1 within males and females separately.
Comparison calibration slope.
Calibration slope: regression line of age regressed on predicted age; CI: confidence interval.
In both Willems BC and KB, there was a significant relation between age and the absolute error (r=0.274 vs r=0.247, p=0.13; Table 6), meaning that the older the subject the larger the discrepancy between true and predicted age but without significant difference between both approaches. The same is observed for the relation between the mean score and the absolute error (r=0.285 vs r=0.263, p=0.21).
Dependence of error and absolute error on age (Spearman correlations).
In both approaches there was a significant relation (comparable in magnitude) between age and the error (significantly larger when using Willems BC (0.315) vs Willems KB (0.357; p<0.0001)) which is due to underestimation of the age of older subjects. This does not hold for the relation between the mean score and the error (–0.052 vs 0.006, p<0.001).
Discussion
The Willems BC method for dental age estimation has been validated in many geographical and ethnical populations, such as Bosnia-Herzegovina, 16 Bangladesh, 8 India, 10 Brazil, 11 Japan, 13 Arab Emirates 14 and Malaysia, 17 yielding consistent results. The validations provided overestimation of age between 0.10 18 and 0.46 17 years in females and between 0.05 8 and 0.55 17 years in males.
Due to these validation results, which greatly simplify the practical application, the Willems BC model is one of the most frequently used in forensic age estimation practice in children. The fact that this model is restricted to no missing scores and to subjects without all permanent teeth fully developed, instead of using age as a selection criterion, has been reported as a limitation. However, this actually ensures fair comparison, since in a practical setting age is not known.
Recently, the method was also validated in a SAB sample, where small but significant age overestimation was only found in females. 12 The results from the present study confirm that in another black (Kenyan) population, there is also no need to develop a country-specific dental age estimation model.
In the Kenyan sample, a very slight age overestimation is found between the ages of 8 and 12 (0.08–0.23 for F, 0.17–0.39 for M). The younger the individual, the higher the overestimation and vice versa, which is typical for age prediction using a regression model. From the age of 12 years, there is a slight underestimation that increases from 16 years and is more accentuated for females than males (Table 4). Despite results of particular age categories, no systematic under- or overestimation of the age was detected in the present study when using Willems BC on the Kenyan sample. Also, contrary to the findings in the SAB sample, Willems KB did not outperform Willems BC.
In our study, no linearity assumption has been made with respect to the Demirjian scores, and a weighted ANOVA has been used to handle the non-constant variance (higher variability of age at higher stages), as suggested in the original article of Willems et al. 7 As such, this linear model is the same as an ANOVA model (using the scores as categorical predictors for age). Allowing non-linearity (by using the score as a categorical predictor in the regression model) increases the risk of overfitting. This phenomenon will be stronger the smaller the training data set. The difference between the prediction from Willems BC and the prediction from the model on the current dataset (Willems KB) not only reflects the usefulness of the Belgian population as reference, but also the difference in size of the current data set and the set of subjects used with Willems BC 7 to develop the prediction model.
It has been reported that adding the information of third molars may result in a decrease of the RMSE, especially during the teenage years (i.e. in those 14–16 years old). 11 This can be explained by the fact that in this age range, most permanent teeth are already developed, yielding no extra information regarding dental age. Therefore, adding the third molar information is encouraged for a more accurate age prediction, taking into consideration that this age period shows the largest error and the teenage years may be especially relevant regarding legal issues in several countries, such as those mentioned in the introduction.
Conclusion
No reason was found to use a country-specific model for juveniles from Kenya (Willems KB) instead of using the Willems BC method. The ME with Willems BC method was zero. Hence, no important systematic under- or overestimation of the age was found for the Kenyan black sample. Even more importantly, the absolute error was not increased using the Willems BC method.
Footnotes
Acknowledgements
We would like to thank the participating centres: the School of Dental Sciences of the University of Nairobi, the Dental and Maxillofacial Imaging Centre (DAMIC), the Kenyatta National Hospital, Millennium Dental Services and JB dental clinic (Dr James Ngesa).
Declaration of conflicting interests
The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.
Funding
The authors received no financial support for the research, authorship and/or publication of this article.
