Introduction

F1000Research

2046-1402

F1000 Research Limited

London, UK

10.12688/f1000research.130042.1

Research Article

Articles

Predicting diabetic ketoacidosis in pediatric patients using machine learning

[version 1; peer review: 1 approved with reservations]

Eid

Waad Mohammed

Conceptualization Data Curation Formal Analysis Methodology Writing – Original Draft Preparation Writing – Review & Editing https://orcid.org/0000-0002-3874-0667 a 1 Alharthi

Hana

Data Curation Supervision Writing – Review & Editing b 1 Aslam

Nida

Formal Analysis 2 Abdur rab

Irfan Ullah

Formal Analysis 2 Madani

Alaa

Data Curation Resources 3 1Department of Health Information Management Technology, College of Public Health, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia 2Department of Computer Science, College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia 3Department of Health Education, King Fahad Medical City, Riyadh, Saudi Arabia

a waadeid@gmail.com b halharthi@iau.edu.sa

No competing interests were disclosed.

6 6 2023

2023

611

14 2 2023

2023

This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Background

Machine learning is a powerful tool to define relationships between large data variables through computing algorithms. In medicine, machine learning can find the association between a given disease and disease-related complications such as the relationship between Diabetes and development of diabetic ketoacidosis (DKA). The aim of this study is to develop and evaluate a predicting model for diabetic ketoacidosis among pediatric cases to define the leading factors that can predict diabetic ketoacidosis.

Methods

We evaluated the medical records of 3737 pediatric patients between the ages of 0 and 18 years who attended diabetic clinics and were diagnosed with diabetes. After the initial data preprocessing, we used Orange, an open source software, for data visualization, and machine learning for data analysis. The study used six prediction models: Decision Tree, Random Forest, kNN, Gradient Boosting, CN2 rule inducer and AdaBoost. Data imbalance was managed using oversampling technique. Variables analyzed included age, sex, hemoglobin A1C level, visits to the diabetic education clinic, and number of appointments to diabetic clinic. Models were evaluated based on the Area under the Curve (AUC), accuracy, precision, recall and F1-score using the stratified 5-fold cross validation technique.

Results

The results show that the Random Forest is the highest performance classifier (AUC=0.98; F1 score=0.92; and recall=0.93). Furthermore, HbA1c was the most contributing factor to the prediction model.

Conclusion

This study shows the importance and effectiveness of machine learning modeling to predict the association between diabetes and the development of DKA. Flagging those patients who are at a higher risk of developing DKA provides a better point of care for these patients.

Predictive analytics classifiers Diabetes DKA

The author(s) declared that no grants were involved in supporting this work.

Introduction

Type 1 diabetes, an autoimmune disease of insulin resistance, is predicted to affect one person per 10 individuals in the world by the year 2040. ¹ It was formerly known as juvenile diabetes because it is typically diagnosed during childhood. Complications with diabetes can adversely affect multiple organs such as the heart, the brain, the kidneys, eyes, and even the limbs, such as diabetic foot ulcers that can lead to foot amputations. Uncontrolled diabetes increases the risk of Alzheimer’s disease. ² One of the most serious complications of type 1 diabetes is ketoacidosis (DKA). DKA occurs when the body has high levels of sugar for a long period of time and the body then produces blood acids called ketones. Ketoacidosis can disrupt the normal body workflow which causes serious complications such as pulmonary and cerebral edema, hypokalemia and organ damages. ³ DKA can cause neurocognitive impairment in children, such as memory loss, poor concentration, and/or deficits in learning and emotional connection. ⁴ High occurrence of DKA can also increase patients’ admission to the hospital which results in higher management cost which creates an economic burden on the healthcare system. ⁵ Machine learning is the scientific branch of artificial intelligence that focuses on how computers learn from data to define relationships between data variables through computing algorithms. ⁶ In medicine, machine learning can be used to study diagnosis and disease patterns in large patient datasets. For example, machine learning can predict how fast a disease can develop. Also, it can predict which patients are at a higher risk of developing a condition or disease progression. These predictions can support physicians in their point of care decisions, whether it is preventive care or disease management, to provide a high level of care to these patients to improve healthcare outcomes. ⁷ As such, machine learning can be used to flag patients with health risks and enable the healthcare team to provide the best course of treatment for their patients. In a study which used machine learning to predict the likelihood of diabetes occurrence in patients. Specifically, three classification algorithms, decision tree (DT), the support vector machine (SVM), and the naive Bayes (NB), were employed. The data used is a diabetes dataset named PIDD which is taken from the UCI machine learning repository. The data included 768 female patients with two values: 1 as positive for diabetes and 0 as negative. In addition, multiple attributes or risk factors, were included such as number of times pregnant, plasma glucose concentration, diastolic blood pressure, skinfold thickness, 2 hours serum insulin level, BMI ratio, diabetes pedigree function and age. Moreover, researchers tested the three algorithms performance evaluating precision, accuracy, F-measures and recall. The result shows NB has the highest accuracy level with 76.30% in comparison with other algorithms. ⁸ However, the attributes included in the test did not include the known diagnostic tests such as hemoglobin (A1C) form of hemoglobin that is chemically linked to a sugar, random blood glucose, and fasting blood glucose; this requires further research using the same algorithms and models. Another study used the National Health and Nutrition Examination Survey data (NHANES) to predict patients at risk of diabetes and cardiovascular diseases. ⁹ NHANES is a comprehensive national program in the United States to assess the status of health and nutrition among its population. Data from NHANES were used to predict diabetes and cardiovascular diseases. In the study, scientists used different models such as SVM,RF,GBT,WEM to classify patients at risk of diabetes and cardiovascular diseases, they provided the program with the training data which contained the observations and labels for the category of the observations. This can give the algorithm the ability to predict the output label associated with a new observation if presented to the program. Results showed that machine learning models based on the survey used can provide an automated identification method for patients at risk for diabetes and cardiovascular diseases and they were also able to identify major contributors to the prediction results. ⁹ Given that this study was based on extraction of variables from a national survey rather than electronic health records data, the findings underscore the challenges of data set for machine learning as data from surveys can point to findings that are different from data extracted from electronic health records data. Additional studies aimed to create a prediction program which can detect high risk group who are more likely to develop type 2 diabetes. One in particular used the Synthetic Minority Over-sampling Technique to balance the dataset and included six features (body mass index (BMI), diet, smoking, blood pressure, sex and geographic region. The study evaluated the algorithms using the balanced data, they used nine classifiers which are, Logistic Regression (LR), Average Perceptron (AP), Naïve Bayes (NB), Neural Network (NN), Support Vector Machine (SVM), LD, Decision Jungle (DJ), Decision Forest (DF), and Boosted Decision tree (BDT). The Decision Forest (DF) model had better performance than other classifiers with an accuracy rate of 83%. The results of this study can help to establish a web-based service to assess a disease risk in preventative medical care. ¹⁰ Another study, aimed to detect diabetic retinopathy where various classifiers were used such as, RF, kNN, SVM, LDA and RRF. The RF model showed the best performance among other classifiers, with an accuracy of 86%. ¹¹ Collectively, these studies underscore the potential of machine learning to be used in preventative medicine as well as in assistive decision making to improve healthcare. Table 1 summarizes the literature gap of machine learning in diabetic field.

Table 1. Summary of the literature gap of machine learning in diabetic field.

Study (arranged in reference number)	Models used	Highest accuracy	Number of features	Sample size	Target
9	DT, SVM, NB	76.30%	9	768	Diabetes
10	SVM, RF, GBT, WEM	73.7%	9	5000	Diabetes
11	LR, AP, NB, NN, SVM, LD, DJ, DF, BDT	83%	9	4896	Type 2 Diabetes Mellitus
12	LDA, SVM, KNN, RF, and Ranger Random Forest (RRF)	86%	14	327	Diabetic retinopathy

In a project conducted in Texas children’s hospital in the United States to provide the best care for high risk patients with type 1 diabetes. The hospital developed a model using machine learning classifiers, which can predict the occurrence of DKA. The project aimed to reduce the number of hospitalizations related to diabetes or DKA from 9.5% to 5% by the year 2018 and to reduce the admissions of DKA by at least 1% every year to reach a goal of maximum 5% DKA admissions per year. A predicting risk model for DKA was developed. The model used data such as risk index for poor glycemic control (RIPGC), socioeconomic status, clinical data such as fasting blood glucose level, hemoglobin A1C, and number of clinical visits per year. The team then proceeded with developing a risk stratification tool and divided patients into four tiers; high risk, moderate risk, mild risk, and lowest risk. They then provided care according to their risk prediction model. This targeted approach resulted in decreasing the recurrent DKA cases admission by 30.9% per year and it showed higher documentation rate of RIPGC in the electronic system. In addition to a risk index for DKA for all the patients. ¹²

In this study we used machine learning as a tool to predict DKA occurrence among a pediatric population and identify the most important factors in predicting DKA.

Methods

This research was ethically reviewed and approved by the institutional review board at Imam Abdulrahman bin Faisal University (IRB-PGS-2020-03-431). It was also approved by the institutional review board at King Fahad Medical city. (IRB Log Number: 21-186E) This study is an experimental study aimed to create a predicting model for diabetic ketoacidosis among pediatrics cases and find the most important factors predicting diabetic ketoacidosis. The target variable is the DKA and the attributes are sex, age, HbA1c levels, number of patient appointments in the diabetic clinic, number of patient appointments in the health education clinic, and the number of patients those who do not attend appointments at the health education clinic. The dataset included the medical records of pediatric patients aged 18 and younger who attended the Diabetic clinic in King Fahad medical city from starting January 2018 to until December 2020. We excluded any patients who were above 18 at the time of data collection and patients who did not have any laboratory results registered in the system. The total sample size was 1537 patients.

Data pre-processing

The dataset was received in excel format from King Fahad Medical City health information system in Riyadh, Saudi Arabia. It was divided into four sections. The first section was the list of appointments in the pediatric Diabetes clinic, which also included whether the patient attended the appointment or was registered as a no-show, and the demographics ( e.g., nationality, sex). The first section data size was 3737. The second section was the laboratory results of the patients, which contained their Hemoglobin A1c levels. The third section was the list of patients who were diagnosed with DKA. The fourth section is the list of patients’ ages. All patient identifications have been removed to ensure patients privacy and confidentiality. We created a new spreadsheet to consolidate this information. It included the lab results where duplicated Medical record number (MRN) numbers were removed using the remove duplicates function in excel. We used the VLOOKUP function which looks up a value in the columns of a table and returns the value in the same row from a column which the user specifies. Using the patient’s MRN, we matched the patients’ age, sex and DKA diagnosis to their laboratory results and the data size was reduced to 1543 data records. we used the COUNTIFS function in excel, which counts the number of cells specified by a given set of conditions or criteria, to count the number of appointment visits versus no shows. The variables assessed included sex, age, HbA1c level, number of appointments, number of health education clinic appointment and number of no shows to health education clinic appointment The target variable was DKA status with two values of yes and no.

Model development and evaluation

To analyze the data we used Orange Data Mining (RRID:SCR_019811) V3.30. Orange is an open source software which is used for data visualization, machine learning and data mining purposes. There are different classifiers available in Orange, which include: Random Forest, which creates a set of decision trees. Every tree is created from a small sample from the training data. When the classifier develops an individual tree, a random subset of attributes is drawn then the best attribute is selected. The final model is based on the majority selected individual developed trees in the forest. KNN, which uses algorithms to search for the closest training examples in a feature and uses the average to form the prediction. AdaBoost, is an algorithm that merges weak learners and adapts to each training sample. CN2 rule inducer uses an algorithm as a classification technique through making of simple, comprehensible rules. ¹³ Tree simply uses an algorithm to separate the data into nodes. It is similar to Random Forest. Gradient Boosting is a technique that produces a prediction model in the form of an joined of weak prediction models, typically decision trees.

Model evaluation

To analyze descriptive statistics for the variables, we used a feature statistics tool in Orange, also ranked the attributes to demonstrate the most contributing factor to DKA among pediatric patients.

Data sampler

dataset showed an imbalance among DKA cases (17.5%) and none DKA (82.5%). To balance the data set, a data sampler tool in Orange V3.30 was used. This tool is used to develop different types of complementary samples from the input data. The fixed sample size method develops a certain number of data instances with replacements, which means always sampling from the entire dataset and does not delete instances from the subset data. We also maintained the sampling pattern by checking replicable sampling settings in the data sampler. This technique to oversample the DKA instances. The positive DKA instances were replicated by 1308 to equalize it with the negative 1308 DKA instances and overcome the data imbalance. Furthermore, Python was used to oversample the data as a comparison method with the data sampler in Orange. The oversampling technique has been used in research which aimed to evaluate the performance of supervised learning algorithms on imbalanced class datasets. ¹⁴

Cross validation

We used stratified 5-folds to cross validate our data which is the default parameters in Orange. This technique splits the dataset into folds such as N. One-fold will be used for testing while the remaining N-1 will be used for training in each N iterations. In the current study the dataset is divided into 5 stratified folds and in each fold there are approximately equal number of samples for each class.

Confusion matrix

This matrix shows the number and proportion of instances in the predicted and actual class. This allows the reporting of cases that were misclassified or were accurately classified.

Rank

It scores variables that can be calculated using the information from the confusion matrix.

Area under the curve (AUC)

Is model performance evaluation technique that indicate the ability of the classifier to distinguish between classes. The higher the AUC score, the better performance for the classifier to distinguish between true positives and true negatives.

Classification accuracy (CA)

Is a measure to evaluate the performance of a classifier by calculating the number of correct predictions divided by the total number of predictions.

Precision

Is the number of true positives among instances classified as positive.

Recall

Is the number of correctly predicted positive class sample, among all the positive class in the dataset.

F1-Score

It represents the harmonic mean among the precision and the recall.

Results

Several Orange classifiers were used to predict the incidences of DKA among a pediatric cohort. Figure 1 illustrates the workflow performed in Orange. The workflow begins with imported data followed by outlier’s extraction. After the extraction, we balanced the data using the data sampler widget. The data is inserted into the six classifiers and evaluated by Area under the curve (AUC) level (test and score widget). A total of 1536 patient data points were imported into the program. Data showed an 82% imbalance. To overcome this imbalance, a data sampler tool in the Orange program was used. Moreover, Python was additionally used as another oversampling technique which showed similar performance results to the Orange data sampler tool with the Random Forest being the best predicting model with an AUC higher than 0.9. Female and male distribution are approximately equal as shown in Table 2 and Figure 2. Incidences of DKA distribution were normalized after applying the over sampling technique as shown in Figure 3. For age, the youngest patient was 2 years old and the oldest was 18 years old with a mean of 12 years old. For the HbA1c the maximum level was 16.3 and the minimum was 4 with a mean of 9.99 as shown in Table 2. To test the prediction performance on our data, we used six classifiers which are Random Forest, AdaBoost, CN2 rule inducer, kNN, Gradient Boosting and Decision Tree. Through the test and score feature we evaluated the classifiers prediction performance through cross validation technique and the AUC score as it is a highly reliable method to evaluate the performance. The result showed that Random Forest had the highest performance result with an AUC score of 0.98 followed by AdaBoost and CN2 rule inducer with a score of 0.97 and 0.93, respectively As shown in Table 3. The confusion matrix for the performance of the Random Forest classifiers is shown in Table 4. The Random Forest classifier made 86.5 % (n=885) correct predictions, 27.16 % (n=33) false prediction for the no incidence of DKA and 88.6% (n=1077) correct prediction and 11.35 (n=138) false prediction for the incidence of DKA. Additionally, the HbA1c level was the most contributing attribute to the occurrence of DKA followed by the health education appointments as shown in Table 5. Furthermore, sex was the least contributing attribute ( Table 5). Figure 1. Workflow of the prediction model in Orange.

Table 2. Descriptive statistics of the results.

Name	Mean	Median	Dispersion	Min.	Max.	Missing
HbA1C	9.992	10.5	0.285	4.0	16.3	113 (4%)
Appointment number	4.80	5	0.49	0	10	0 (0%)
Health education appointments	1.35	1	1.19	0	6	0 (0%)
Health education appointments no-show	0.07	0	3.68	0	2	0 (0%)
Age	12.01	13	0.33	2	20	0 (0%)
Sex		Female	0.691			0 (0%)
DKA		No	0.693			0 (0%)

Figure 2. Sex distribution which shows a balance distribution between male and females. Figure 3. Incidence of DKA distribution after balancing data.

Table 3. Stratified 5-fold Cross validation for the prediction models.

Model	AUC	CA	F1	Precision	Recall
Tree	0.916	0.891	0.891	0.894	0.891
Random Forest	0.983	0.930	0.930	0.934	0.930
kNN	0.898	0.823	0.820	0.842	0.823
Gradient Boosting	0.884	0.807	0.806	0.815	0.807
CN2 rule inducer	0.936	0.922	0.922	0.931	0.922
AdaBoost	0.978	0.949	0.948	0.952	0.949

Table 4. Confusion matrix for Random Forest (showing number of instances).

Actual	Predicted
Actual	No	Yes	∑
No	885	330	1215
Yes	138	1077	1215
∑	1023	1407	2430

Table 5. Attributes ranking.

	#	Gain ratio	Gini
HbA1C	1	0.0987	0.1246
Health Education appointments	2	0.0175	0.0217
Health Education appointments no-show	3	0.0109	0.0026
Appointment number	4	0.0052	0.0071
AGE	5	0.0039	0.0053
Sex	6	0.0029	0.0020

Discussion

Recurrent admission of patients due to DKA can be prevented with proper education and targeted strategic programs. ¹⁵ ^, ¹⁶ In this research, we focused on developing a prediction model which can detect high risk groups for DKA to decrease admission cost to the healthcare organization and prevent the readmission of patients. These predictions can inform better patient quality of life and help to develop better education programs for these patients. Our data showed an 82% imbalance. To overcome this imbalance, a data sampler tool in the Orange program was used. This program is used as a tool to oversample the DKA incidence and replicate the positive DKA instances to equalize it with the non DKA incidence. Moreover, Python was additionally used as another oversampling technique which showed similar performance results to the Orange data sampler tool with the Random Forest being the best predicting model with an AUC higher than 0.9. The ability of different machine learning prediction models to predict DKA incidence using data imported from the electronic medical records. We found that the HbA1c level was the most contributing attribute to the occurrence of DKA followed by the health education appointments. Although six attributes were used in this research, other similar studies have used other clinical attributes such as the fasting blood glucose, HbA1c, vital signs, injection therapy vs pump therapy. It also included sex, race and ethnicity, BMI, healthy diet, smoking status. ¹⁰ ^, ¹⁷ However, not all of these attributes can be found in the electronic medical records such as race and ethnicity, smoking status, healthy diet in our study. Our data is consistent with others. ⁸ ^– ¹¹ Similar research has been conducted previously which explored similar models and their performances on predicting DKA. All demonstrated an AUC level of 0.7 and higher. ¹⁰ ^, ¹⁷ ^, ¹⁸ On the other hand, the highest AUC level scored in the current was by the Random Forest model with a score of 0.97 followed by AdaBoost and CN2 rule inducer with a score of 0.95 and 0.92 respectively. Lastly, although this research showed promising results in the prediction performance, it had some limitations which could be improved in future studies. To refine the prediction model additional clinical attributes should be included such as the fasting blood glucose, BMI, and medication adherence and type of treatment.

Conclusions

DKA is considered to be a serious complication of diabetes which can be prevented with proper education and targeted strategic health care delivery. We aimed to create a predicting model for diabetic ketoacidosis among pediatric cases. Therefore, a real dataset was collected from Fahad Medical City health information system in Riyadh, Saudi Arabia. Several machine learning models have been used such as Random Forest (RF), Decision Tree (DT), kNN, Gradient Boosting (GB), CN2 rule inducer and AdaBoost. Furthermore, several preprocessing and data sampling techniques were applied. We found Random Forest model achieved the highest performance with the AUC of 0.98. Furthermore, HbA1c was the most contributing factor to the prediction model. Further research is required to refine the prediction model with additional clinical attributes such as the fasting blood glucose, BMI, medication adherence and type of treatment. Moreover, it is required to test the model’s performance with the multi-center balanced patient’s sample.

Author contributions

Waad Eid: conception and design of the paper and data analysis

Hana Alharthi: conception and design of the paper, data analysis, critical and final revision of the article

Nida Aslam: Data analysis and model framework.

Irfan Ullah Abdur rab: Data analysis and model framework.

Alaa Madani: acquisition of data.

Consent

The authors report that all patient data used in this research is anonymous, thus no consent for publication was required, and no alterations was done that would distort scientific meaning

Data availability statement

The data that support the findings of this study are available from King Fahad Medical City but restrictions apply to the availability of these data, which were used under license for the current study, and so are not publicly available. Data are however available from the authors (Waad Eid, Email: wmeid@iau.edu.sa & waadeid@gmail.com) upon reasonable request and with permission of King Fahad Medical City.

References 1

Zou

Kaiyang

Luo

: Predicting diabetes mellitus with machine learning techniques. Front. Genet. 2018;9:11. 30459809

10.3389/fgene.2018.00515

PMC6232260

staff Mayo clinic: Diabetic ketoacidosis. 2019.

staff Mayo clinic: Type 1 diabetes in children. 2020.

Ghetti

SPD

Lee

JKBA

Sims

CEBA

: Diabetic ketoacidosis and memory dysfunction in children with type 1 diabetes. J. Pediatr. 2010;156(1):109–114. 19833353

10.1016/j.jpeds.2009.07.054

Reference Source

Maldonado

Chong

Oehl

: Economic impact of diabetic ketoacidosis in a multiethnic indigent population. Diabetes Care. 2003;26(4):1265–1269. 12663608

10.2337/diacare.26.4.1265

Reference Source

Deo Rahul

: Machine learning in medicine. Circulation. 2015;132(20):1920–1930. 26572668

10.1161/CIRCULATIONAHA.115.001593

PMC5831252

Rajkomar

Dean

Kohane

10.1056/NEJMra1814259

Reference Source

Sisodia

: Prediction of diabetes using classification algorithms. Procedia Comput. Sci. 2018;132:1578–1585. 10.1016/j.procs.2018.05.122

Reference Source

Dinh

Miertschin

Young

: A data-driven approach to predicting diabetes and cardiovascular disease with machine learning. BMC Med. Inform. Decis. Mak. 2019;19(1):211. 1472-6947. 31694707

10.1186/s12911-019-0918-5

PMC6836338

Syed

Khan

: Machine learning-based application for predicting risk of type 2 diabetes mellitus (t2dm) in saudi arabia: A retrospective cross-sectional study. IEEE Access. 2020;8:199539–199561. 2169-3536. 10.1109/ACCESS.2020.3035026

Alabdulwahhab

Sami

Mehmood

: Automated detection of diabetic retinopathy using machine learning classifiers. Eur. Rev. Med. Pharmacol. Sci. 2021;25(2):583–590. 10.26355/eurrev_202101_24615

Reference Source

Health Catalyst: Texas children’s take the reins in preventing dka in high risk pediatrics patients. 2016.

orange: widget-catalog. 2021. Reference Source

Kaur

Kumari

: Predictive modelling and analytics for diabetes using a machine learning approach. Appl. Comput. Inform. 2018;18:90–100. 10.1016/j.aci.2018.12.004

Vellanki

Umpierrez

: Increasing hospitalizations for dka: A need for prevention programs. Diabetes Care. 2018;41(9):1839–1841. 30135197

10.2337/dci18-0004

PMC6105328

Reference Source

Dhatariya

Nunney

Higgins

: National survey of the management of diabetic ketoacidosis (dka) in the uk in 2014. Diabet. Med. 2016;33(2):252–260. 26286235

10.1111/dme.12875

Williams

Dass

Bass

: 1303-p: Comparative performance of a recurrent neural network (rnn) and logistic regression (lr) model to predict diabetic ketoacidosis (dka) among youth postdiagnosis with type 1 diabetes (t1d). Diabetes. 2020;69(Supplement 1):1303-P. 10.2337/db20-1303-P

Reference Source

Lee

C-C

Zhou

: Performance assessment of different machine learning approaches in predicting diabetic ketoacidosis in adults with type 1 diabetes using electronic health records data. Pharmacoepidemiol. Drug Saf. 2021;30(5):610–618. 33480091

10.1002/pds.5199

PMC8049019

10.5256/f1000research.142769.r249422

Reviewer response for version 1

Cichosz

Simon

1 Referee https://orcid.org/0000-0002-3484-7571 1Aalborg University, Aalborg, Denmark

Competing interests: No competing interests were disclosed.

30 8 2024

2024

This is an open access peer review report distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

recommendation

approve-with-reservations

The manuscript explores the potential for risk stratification of pediatric patients based on their likelihood of developing Diabetic Ketoacidosis (DKA), a model with significant clinical implications. However, several important concerns warrant attention:

The authors employ oversampling techniques on both the training and test datasets, resulting in an unrealistic performance assessment for real-world applications. Oversampling to address imbalanced datasets should be limited to the training data exclusively.

Authors are strongly encouraged to adhere to & submit the TRIPOD checklist or a similar guideline to ensure essential information is included in the manuscript.

The discussion section requires expansion, including a dedicated segment on the limitations of the study.

The manuscript could benefit from a discussion of its findings in the context of related studies concerning the prediction of adverse events in diabetes, such as DKA. For instance:(Cichosz SL, et al, 2024) (Ref-1)

Is the work clearly and accurately presented and does it cite the current literature?

Partly

If applicable, is the statistical analysis and its interpretation appropriate?

Are all the source data underlying the results available to ensure full reproducibility?

Is the study design appropriate and is the work technically sound?

Are the conclusions drawn adequately supported by the results?

Are sufficient details of methods and analysis provided to allow replication by others?

Reviewer Expertise:

Diabetes, technology, machine learning

I confirm that I have read this submission and believe that I have an appropriate level of expertise to confirm that it is of an acceptable scientific standard, however I have significant reservations, as outlined above.

References 1

: Development of Machine Learning Models for the Identification of Elevated Ketone Bodies During Hyperglycemia in Patients with Type 1 Diabetes. Diabetes Technol Ther .2024; 10.1089/dia.2023.0531

38456910

10.1089/dia.2023.0531