A1 Vertaisarvioitu alkuperäisartikkeli tieteellisessä lehdessä

Early childhood caries risk prediction using machine learning approaches in Bangladesh




TekijätHasan, Fardous; El Tantawi, Maha; Haque, Farzana; Oluwátóyìn, Foláyan Moréniké; Virtanen, Jorma I.

Julkaisuvuosi2025

JournalBMC Oral Health

Artikkelin numero49

Vuosikerta25

eISSN1472-6831

DOIhttps://doi.org/10.1186/s12903-025-05419-2

Verkko-osoitehttps://bmcoralhealth.biomedcentral.com/articles/10.1186/s12903-025-05419-2

Rinnakkaistallenteen osoitehttps://research.utu.fi/converis/portal/detail/Publication/477717407


Tiivistelmä

Background: In the last years, artificial intelligence (AI) has contributed to improving healthcare including dentistry. The objective of this study was to develop a machine learning (ML) model for early childhood caries (ECC) prediction by identifying crucial health behaviours within mother-child pairs.

Methods: For the analysis, we utilized a representative sample of 724 mothers with children under six years in Bangladesh. The study utilized both clinical and survey data. ECC was assessed using ICDAS II criteria in the clinical examinations. Recursive Feature Elimination (RFE) and Random Forest (RF) was applied to identify the optimal subsets of features. Random forest classifier (RFC), extreme gradient boosting (XGBoost), support vector machine (SVM), adaptive boosting (AdaBoost), and multi-layer perceptron (MLP) models were used to identify the best fitted model as the predictor of ECC. SHAP and MDG-MDA plots were visualized for model interpretability and identify significant predictors.

Results: The RFC model identified 10 features as the most relevant for ECC prediction obtained by RFE feature selection method. The features were: plaque score, age of child, mother's education, number of siblings, age of mother, consumption of sweet, tooth cleaning tools, child's tooth brushing frequency, helping child brushing, and use of F-toothpaste. The final ML model achieved an AUC-ROC score (0.77), accuracy (0.72), sensitivity (0.80) and F1 score (0.73) in the test set. Of the prediction model, dental plaque was the strongest predictor of ECC (MDG: 0.08, MDA: 0.10).

Conclusions: Our final ML model, integrating 10 key features, has the potential to predict ECC effectively in children under five years. Additional research is needed for validation and optimization across various groups.


Ladattava julkaisu

This is an electronic reprint of the original article.
This reprint may differ from the original in pagination and typographic detail. Please cite the original version.




Julkaisussa olevat rahoitustiedot
Open access funding provided by University of Bergen. We confirm that our research received no financial support from external sources. We conducted the study independently, without any funding assistance.


Last updated on 2025-13-02 at 12:53