Keywords
dental caries; transmissible disease; health and well-being; machine learning; artificial intelligence
Dental caries is a common chronic oral disease, posing a serious public health issue. By analyzing large datasets, machine learning shows potential in addressing this problem. This study employs bibliometric analysis to explore emerging topics, collaborations, key authors, and research trends in Southeast Asia related to the application of machine learning in dental caries management.
A comprehensive selection using the Scopus database to obtain relevant research, covering publications from inception to July 2024 was done. We employed the Bibliometric approaches, including co-authorship networks, yearly publishing trends, institutional and national partnerships, keyword co-occurrence analysis, and citation analysis, for the collected data. To explore the visualization and network analysis, we employed the tools such as VOSviewer and Bibliometrix in R package.
The final bibliometric analysis included 246 papers. We found that Malaysia became the top contributor with 59 publications, followed by Indonesia (37) and Thailand (29). Malaysia had the highest Multiple Country Publications (MCP) ratio at 0.407. Top institutions including the Universiti Sains Malaysia led with 39 articles, followed by Chiang Mai University (36) and the National University of Singapore (30) became the leader. Co-authorship analysis using VOSviewer revealed six distinct clusters. A total of 1220 scholars contributed to these publications. The top 10 keywords, including ‘human’ and ‘dental caries,’ indicated research hotspots.
We found growing evidence of machine learning applications to address dental caries in Southeast Asia. The bibliometric analysis highlights key authors, collaborative networks, and emerging topics, revealing research trends since 2014. This study underscores the importance of bibliometric analysis in tackling this public health issue.
dental caries; transmissible disease; health and well-being; machine learning; artificial intelligence
We have added a paragraph in both the Introduction and Discussion sections to address the identified gaps and provide a comparative analysis relevant to the current systematic review.
See the authors' detailed response to the review by Juni Handajani
See the authors' detailed response to the review by Heni Susilowati
Dental caries (DC), a common chronic oral disease, is a substantial threat to global health, with high incidence rates despite disease prevention efforts (Cheng et al., 2022). individuals encountering food insecurity are significantly more prone to dental caries, as evidenced by meta-analyses that demonstrate a higher risk of caries in food-insecure individuals compared to those with food security (Drumond, De Arruda, Bernabé, Mesquita, & Abreu, 2023). Both clinical researchers and dental practitioners are rigorously investigating optimal therapeutic interventions for this condition.
The recent systematic study indicated that employing machine learning for diagnosing and predicting the prognosis of dental caries had encouraging outcomes (Reyes et al., 2022). Nevertheless, a significant degree of variation persists across the research, which also do not exclusively concentrate on the Southeast Asia Region (SEAR).
DC continue to be a major oral health concern in the SEAR, impacting children aged 5 to 15 (Kale, Kakodkar, Shetiya, & Rizwan, 2019). Studies have demonstrated an elevated prevalence of DC among children in SEAR countries, with variability among age groups and nations (Kale et al., 2019). In these investigations, the dentition status based on the 1997 WHO criteria was the most widely employed index for evaluating DC (Kale et al., 2019). Furthermore, the burden of DC extends to Cambodian preschool children, with a significant proportion developing carious lesions and pulpally affected teeth, underlining the critical need for intervention to address Early Childhood Caries (ECC) in the region (Turton, Chher, Sabbah, Durward, Hak, & Lailou, 2019). Efforts to integrate oral health into national health systems and establish uniform treatment standards are critical steps toward combating the prevalence of DC in the SEAR area (Acharya, Mathur, Tadakamadla, & Brand, 2024).
To increase diagnostic accuracy and prevent caries from being neglected, computer-based intelligent vision systems employing deep learning techniques have been developed, with a remarkable accuracy rate of 99.13% in automatic caries diagnosis based on periapical images (Imak, Celebi, Siddique, Turkoglu, Sengur, & Salam, 2022). In addition, innovations in caries detection include self-training-based approaches exploiting small sets of labeled images and enormous quantities of unlabeled images, boosting caries detection and segmentation performance without the requirement for extensive annotated data (Qayyum et al., 2023).
The current state of research in Southeast Asia regarding the application of machine learning techniques to DC research is lacking in comprehensive studies that incorporate multimodal data, including behavioral factors, radiographic imaging, and microbial analysis, to improve treatment outcomes and caries risk assessment (Jusman, Anam, Puspita, & Saleh, 2021; Velmurugadass, Rohith, Harish, & Reddy, 2023; Wu et al., 2022). While machine learning models such as Support Vector Machine (SVM) and K-Nearest Neighbors (KNN) have been effectively utilized in dental caries classification research, there remains a dearth of understanding regarding the distinct challenges and determinants that impact DC within Southeast Asian communities, notably in nations such as Thailand and Indonesia (Ula, Anjani, Ulva, Sahputra, & Pratama, 2022; Wu et al., 2022). Researchers can better understand the complex etiology of DC in this region through the application machine learning techniques to a wide range of data sources, including genetic, behavioral, and environmental factors.
Bibliometric analysis connects research and practice, ensuring researchers are informed by pertinent and influential literature, while clinicians are equipped to execute evidence-based interventions efficiently. By conducting a thorough assessment of literature utilizing machine learning methodologies, we have provided a comprehensive overview of the themes, patterns, and bibliometric attributes that define this body of literature from 2002 to 2024. The outcomes derived from our research unveil significant findings and propose innovative pathways for further exploration within this rapidly expanding realm of knowledge acquisition and academic inquiry.
We conducted a search for pertinent articles within the Scopus database, recognized as one of the most extensive international repositories of scholarly publications. Scopus encompasses an estimated 24,000 present-day periodicals sourced from over 5,000 global publishers, encompassing peer-reviewed journals, conference papers, periodicals for commercial purposes, and scholarly book series. The database comprises roughly 75 million entries, spanning various forms of scholarly outputs such as journals, conference proceedings, patents, and publications covering a wide array of disciplines. Our search for relevant literature involved the utilization of specific keywords: (ALL ((“artificial intelligence” OR “machine intelligence” OR “robot*” OR “robot technology” OR “assistant robot” OR “robot-assisted" OR “computational intelligence” OR “computer reasoning” OR “deep learning” OR “computer vision system” OR “sentiment analysis” OR “machine learning” OR “neural network*” OR “data learning” OR “expert* system*” OR “natural language processing” OR “support vector machine*” OR “decision tree*” OR “data mining” OR “deep learning” OR “neural network*” OR “bayesian network*” OR “intelligent learning” OR “feature* learning” OR “feature* extraction” OR “feature* mining” OR “feature* selection” OR “unsupervised clustering” OR “image* segmentation” OR “supervised learning” OR “semantic segmentation” OR “deep network*” OR “neural learning” OR “neural nets model” OR “graph mining” OR “data clustering” OR “big data” OR “knowledge graph”)) AND ALL ("Dental caries” OR “Caries, Dental” OR “Dental Cavity” OR “Dental Decay” OR “Tooth Decay” OR “Decay, Tooth” OR “Tooth Demineralization” OR “Caries, Tooth” OR “Caries, Teeth” OR “Teeth Caries”)) AND PUBYEAR > 2001 AND PUBYEAR < 2025 AND (LIMIT-TO (DOCTYPE, “ar”) OR LIMIT-TO (DOCTYPE, “re")) AND (LIMIT-TO (AFFILCOUNTRY, “Malaysia”) OR LIMIT-TO (AFFILCOUNTRY, “Indonesia”) OR LIMIT-TO (AFFILCOUNTRY, “Thailand”) OR LIMIT-TO (AFFILCOUNTRY, “Singapore”) OR LIMIT-TO (AFFILCOUNTRY, “Brunei Darussalam”) OR LIMIT-TO (AFFILCOUNTRY, “Philippines”) OR LIMIT-TO (AFFILCOUNTRY, “Viet Nam”) OR LIMIT-TO (AFFILCOUNTRY, “Myanmar”) OR LIMIT-TO (AFFILCOUNTRY, “Cambodia”)). Two professionals with a minimum of five years of expertise in conducting systematic reviews endorsed our search strategy subsequent to a meticulous examination of all article titles obtained. They identified all pertinent publications, with any discrepancies being settled through deliberation with an additional author. Ultimately, all articles retrieved were preserved. In addition, we selected data from 2002 to 2024. This timeframe was selected because of technological advancements commencing in 2002, signifying a crucial transformation in research methodologies and data accessibility, enabling the identification of pertinent trends in the domain. Incorporating data through 2024 guarantees the inclusion of the most recent advancements and offers a thorough examination of emerging research. This timeframe offers dependable and accessible data, crucial for a comprehensive bibliometric investigation.
Only articles that utilized Machine Learning in their findings and focused on DC were considered for the bibliometric analysis. There were no restrictions based on language. Excluded from the analysis were letters, editorials, conference abstracts, and book chapters, while research papers and reviews published in peer-reviewed journals were included. The evaluation of the articles was conducted by two experts in bibliometric analysis, who possessed extensive experience and assessed the articles according to predetermined criteria and guidelines (Martinez, Al-Hussein, & Ahmad, 2019).
In order to produce replicable and measurable data that is pertinent to policy administration, bibliometric analysis (Pritchard, 1969) serves as a quantitative assessment of scientific research, examining current research patterns within a specific field (Mooghali, Alijani, Karami, & Khasseh, 2011). Bibliometric analysis has the capability to offer a comprehensive examination of a specific subject of investigation, along with pinpointing research domains that scholars ought to delve into and methodologies that authors have formulated to achieve their objectives (Su & Lee, 2010). A holistic representation of the field allows readers to better comprehend patterns and trends in DC research.
Additionally, the distribution, proportions, and frequency for each journal were presented. The categories, proportions, and occurrence for each periodical were presented. The pertinent author ratios, occurrence, and proportion were computed for each country for both individual and multiple nations. The occurrence and citation rate for each author and organization were also provided. The research impact of each nation, periodical, organization, and author was arranged based on the quantity of publications. All information was formulated and depicted utilizing VOSviewer (Leiden University) and Bibliometrix (an R package).
The final bibliometric investigation consisted of 246 papers after the screening process. The growth in the quantity of DC articles is visualized in Figure 1.
Table 1 demonstrates the geographical distribution of the leading ten countries that have conducted research on DC. Malaysia emerged as the top contributor with 59 publications, succeeded by Indonesia with 37 publications and Thailand with 29 publications. The country with the highest ratio of Multiple Country Publications (MCP) was Malaysia at 0.407, followed by Singapore at 0.375 and Indonesia at 0.27. Supplementary Figure 1 (Extended data) showcases the network of countries that have collaborated on at least one research project.
Supplementary Table 1 (Extended data) showcases the leading ten institutions that have shown high productivity in conducting research on DC. Among these institutions, Universiti Sains Malaysia stands out as the most prolific university with thirty-nine published articles, followed by Chiang Mai University with thirty-six, and the National University of Singapore with thirty publications.
The utilization of the VOSviewer tool (Leiden University) was employed for conducting co-authorship analysis, aiming to present a thorough depiction of the entire network of countries involved in DC research. The level of collaboration between two countries is determined by the quantity of publications they have co-authored. Out of the total of 8 authors, only 3 authors are interconnected, with each of them having published a minimum of 5 articles. The examination of authorship by country resulted in the identification of six distinct clusters, as illustrated in Supplementary Figure 2 (Extended data).
In the discourse concerning research on DC, a total of 1220 scholars collectively contributed to the publication of 246 scholarly articles. Within the realm of DC research, Table 2 provides a comprehensive overview of the top 10 most prolific authors in this field. Noteworthy among these authors is Li Y, who authored a total of eight scholarly papers on the subject matter.
The 246 articles were divided into three keyword clusters. The top ten keywords utilized in the articles that were retrieved can be found in Table 3. It is worth mentioning that human, dental caries, article, child, and artificial intelligence were some of the predominant categories. The examination of the co-occurrence of the top 10 phrases revealed the research hotspots in the field of dental caries ( Figure 2).
Rank | Keywords | Occurrences |
---|---|---|
1 | Human | 114 |
2 | Humans | 79 |
3 | Dental caries | 61 |
4 | Article | 54 |
5 | Child | 52 |
6 | Female | 48 |
7 | Male | 44 |
8 | Artificial intelligence | 41 |
9 | Review | 30 |
10 | Adult | 28 |
We delved deeper into the knowledge repository of the DC research domain. Displayed in Table 4 are the top ten articles with high citation counts. Vos T et al., in 2017, garnered the highest number of citations by June 1, 2024, totaling 5476. This particular article was published in LANCET in 2016.
To the best of our knowledge, this research is the first bibliometric study to use machine learning to investigate DC. SEAR researchers are becoming more interested in using machine learning technologies to combat DC, according to this bibliometric analysis. The number of papers has increased over the previous ten years (>2014), which highlights the significance of this issue for public health as well as the potential of algorithms that use machine learning to provide assistance.
Machine learning has emerged as a powerful tool in combating dental caries, with Malaysia, Indonesia, and Thailand being at the forefront of utilizing this technology (Basri, Yazid, Zain, Yusof, Rani, & Zoolfakar, 2024; Wu et al., 2021, 2022). Research conducted in these countries has shown promising results in early caries detection and risk prediction through the application of machine learning algorithms such as artificial neural networks (ANN), convolutional neural networks (CNN), and statistical models (Basri et al., 2024; Sadegh-Zadeh, Rahmani Qeranqayeh, Benkhalifa, Dyke, Taylor, & Bagheri, 2022; Toledo Reyes et al., 2023; Wu et al., 2022). By integrating multimodal data, including genetic, microbial, demographic, and behavioral factors, these countries have been able to identify key predictors of caries development in both primary and permanent teeth, paving the way for more effective preventive measures and personalized treatment strategies (Sadegh-Zadeh et al., 2022; Toledo Reyes et al., 2023; Wu et al., 2021, 2022). The collaborative efforts between researchers and healthcare professionals in Malaysia, Indonesia, and Thailand highlight the potential of machine learning in revolutionizing oral health care and addressing the global burden of dental caries.
The research landscape on dental caries utilizing machine learning reveals a notable productivity ranking among institutions. Universiti Sains Malaysia emerges as the most prolific university in this field, boasting thirty-nine published articles, showcasing a strong commitment to advancing knowledge in dental caries detection. Following closely behind is Chiang Mai University with thirty-six publications, indicating a significant contribution to the research domain of dental caries and machine learning. Additionally, the National University of Singapore has also demonstrated substantial engagement in this area with thirty publications, further enriching the scholarly discourse on the early detection and management of dental caries through innovative technological approaches. These institutions’ dedication to research in dental caries using machine learning underscores the global effort to enhance diagnostic accuracy and treatment outcomes in oral healthcare.
In line with previous studies, the co-occurrence of the top 10 phrases in dental caries research, as highlighted in the bibliometric studies (Melo, Sanz, Forner, Rodríguez-Lozano, & Guerrero-Gironés, 2022; Patil et al., 2020), reveals a strong relationship with the predominant categories of human, dental caries, article, and child. Terms related to the validity of diagnostic methods, tools, and principles used in caries diagnosis, as well as general aspects of caries detection and diagnosis, are commonly found among the top phrases. This suggests that research in dental caries is heavily centered around human subjects, particularly children, with a significant emphasis on diagnostic methods and tools, as well as the overall understanding of the disease. However, interestingly, we found that the use of the keyword ‘artificial intelligence’ is on the rise.
The bibliometric analysis highlights significant research trends, pivotal studies, and deficiencies in the existing literature, which can guide evidence-based public health interventions. By identifying the most commonly cited therapies or successful public health initiatives in dental caries prevention, researchers and policymakers can prioritize these techniques for widespread implementation. The machine learning methodology facilitates the discovery of emerging research domains, enabling the prompt recognition of novel therapies or preventive strategies. This can inform funding distribution and policy formulation, ensuring resources are allocated to the most effective strategies for enhancing oral health outcomes.
Despite potential improvements in the global application of machine learning for dental caries detection, research in Southeast Asia is very few. Several studies in the region have utilized deep learning models, specifically convolutional neural networks (CNNs), to enhance caries detection from dental radiographs, attaining high accuracy rates (e.g., 99.13%) in diagnosing caries from periapical images (Imak et al., 2022). Notwithstanding these achievements, the quantity of studies concentrating on machine learning applications for dental caries in Southeast Asia remains limited, particularly in comparison to research conducted outside the region, which frequently examines broader demographic factors such as socioeconomic status and parental perceptions (Ramos-Gomez et al., 2021). Enhancing research initiatives in Southeast Asia may close this gap and aid in the formulation of more thorough, region-specific models for dental health forecasting.
Despite the fact that this bibliometric analysis provides interesting information, there are considerable limitations to take into account. The dependency on the Scopus database may result in the omission of significant articles from various sources. The correctness of the metadata associated with the articles also limits the study. Additionally, bibliometric analysis focuses on quantitative rather than qualitative evaluations of research findings and methodologies. Despite these limitations, the results demonstrate the increasing importance of employing machine learning to manage dental caries in Southeast Asia, which may guide future treatments and research areas.
We found the increasing evidence on the application of machine learning to mitigate the issue of dental caries in Southeast Asia. The bibliometric analysis identifies key authors, collaborative networks, and emerging topics, including dental caries and artificial intelligence. It also reveals growing research trends since 2014. This work underscores the significance of bibliometric analysis in tackling this critical public health challenge.
No data are associated with this article. Supplementary materials are available in the supplementary data.
Supplementary data: https://doi.org/10.5281/zenodo.15223304 (Faizul, 2024).
This project contains the following extended data:
Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).
Reporting guidelines: https://doi.org/10.5281/zenodo.15223304 (Faizul, 2024).
STROBE_checklist_v4_combined.docx
Data are available under the terms of the Creative Commons Zero “No rights reserved” data waiver (CC0 1.0 Public domain dedication).
Software: VOSviewer is a free software tool for constructing and visualizing bibliometric networks. To learn more, visit the VosViewer Getting Started page.
Views | Downloads | |
---|---|---|
F1000Research | - | - |
PubMed Central
Data from PMC are received and updated monthly.
|
- | - |
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Oral microbiology and immunology
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Oral Biology, Oral Immunology
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
Not applicable
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
References
1. Reyes L, Knorst J, Ortiz F, Ardenghi T: Machine Learning in the Diagnosis and Prognostic Prediction of Dental Caries: A Systematic Review. Caries Research. 2022; 56 (3): 161-170 Publisher Full TextCompeting Interests: No competing interests were disclosed.
Reviewer Expertise: Oral Microbiology and Immunology
Is the work clearly and accurately presented and does it cite the current literature?
Yes
Is the study design appropriate and is the work technically sound?
Yes
Are sufficient details of methods and analysis provided to allow replication by others?
Yes
If applicable, is the statistical analysis and its interpretation appropriate?
No
Are all the source data underlying the results available to ensure full reproducibility?
Yes
Are the conclusions drawn adequately supported by the results?
Yes
Competing Interests: No competing interests were disclosed.
Reviewer Expertise: Oral Biology, Oral Immunology
Alongside their report, reviewers assign a status to the article:
Invited Reviewers | ||
---|---|---|
1 | 2 | |
Version 3 (revision) 11 Oct 24 |
read | read |
Version 2 (revision) 03 Oct 24 |
||
Version 1 08 Aug 24 |
read | read |
Provide sufficient details of any financial or non-financial competing interests to enable users to assess whether your comments might lead a reasonable person to question your impartiality. Consider the following examples, but note that this is not an exhaustive list:
Sign up for content alerts and receive a weekly or monthly email with all newly published articles
Already registered? Sign in
The email address should be the one you originally registered with F1000.
You registered with F1000 via Google, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Google account password, please click here.
You registered with F1000 via Facebook, so we cannot reset your password.
To sign in, please click here.
If you still need help with your Facebook account password, please click here.
If your email address is registered with us, we will email you instructions to reset your password.
If you think you should have received this email but it has not arrived, please check your spam filters and/or contact for further assistance.
Comments on this article Comments (0)