Warning: fopen(/home/virtual/enm-kes/journal/upload/ip_log/ip_log_2025-01.txt): failed to open stream: Permission denied in /home/virtual/lib/view_data.php on line 100 Warning: fwrite() expects parameter 1 to be resource, boolean given in /home/virtual/lib/view_data.php on line 101
1Division of Endocrinology and Metabolism, Department of Internal Medicine, Seoul St. Mary’s Hospital, College of Medicine, The Catholic University of Korea, Seoul, Korea
2NAVER CLOVA AI Lab, Seongnam, Korea
3Department of Medical Informatics, College of Medicine, The Catholic University of Korea, Seoul, Korea
4Department of Biomedicine and Health Sciences, College of Medicine, The Catholic University of Korea, Seoul, Korea
5Health Promotion Center, Seoul St. Mary’s Hospital, Seoul, Korea
Copyright © 2024 Korean Endocrine Society
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
CONFLICTS OF INTEREST
This study was supported by the Daewoong Pharmaceutical company. The opinions expressed in this paper are those of the authors and do not necessarily represent those of Daewoong Pharmaceutical company. While this study received technical guidance from NAVER CLOVA AI Lab for the development of an AI prediction model, it had no effect on the research outcomes.
AUTHOR CONTRIBUTIONS
Conception or design: H.S.K. Acquisition, analysis, or interpretation of data: J.L., Y.C., T.K., K.L., J.S., H.S.K. Drafting the work or revising: J.L., H.S.K. Final approval of the manuscript: J.L., Y.C., T.K., K.L., J.S., H.S.K.
Values are expressed as number (%) or mean±standard deviation unless otherwise indicated. Student’s t test was used for statistical analysis, and a P<0.05 was regarded as statistically significant. The baseline characteristics of patients who developed macrovascular complications and those who did not develop macrovascular complications were compared (2nd–4th column). The data were expressed after the missing values (Supplemental Table S1) were discarded. The patient data for training and testing were distributed in a 7:3 ratio with comparable baseline characteristics (5th–6th columns).
BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure; HbA1c, hemoglobin A1C; BUN, blood urea nitrogen; GFR, glomerular filtration rate; AST, aspartate aminotransferase; ALT, alanine aminotransferase; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; CV, cardiovascular.
Values are expressed as mean±standard deviation.
XGBoost, eXtreme Gradient Boosting; GRU, gated recurrent unit; ODE, ordinary differential equation; AUROC, area under the receiver operating characteristic curve; AUPRC, area under the precision-recall curve; PPV, positive predictive value; NPV, negative predictive value.
GRU-ODE-Bayes | XGBoost | Logistic regression | |
---|---|---|---|
GRU-ODE-Bayes | - | 0.013 | <0.001 |
XGBoost | 0.013 | - | <0.001 |
Logistic regression | <0.001 | <0.001 | - |
The AUROC of each macrovascular risk engine was compared by paired t test. All P values were calculated by paired t test for the results of 100 replicates.
AUROC, area under the receiver operating characteristic curve; GRU, gated recurrent unit; ODE, ordinary differential equation; XGBoost, eXtreme Gradient Boosting.
Characteristic | Macrovascular complications |
Developed prediction model |
||||
---|---|---|---|---|---|---|
Developed | Not developed | P value | Training model | Test model | ||
Number | 1,034 (20.5) | 4,006 (79.5) | 3,528 (70.0) | 1,512 (30.0) | ||
Age | 59 (52–66) | 59 (51–66) | ||||
18–59 yr, % | 38.5 | 55.82 | 51.9 | 51.8 | ||
≥60 yr, % | 61.5 | 44.18 | 48.4 | 48.2 | ||
Male sex | 43.7 | 50.87 | 46.9 | 47.0 | ||
Height, cm | 162.7±8.9 | 162.20±9.08 | 0.388 | 162 (155–170) | 163.0 (156.0–169.0) | |
Weight, kg | 64.6±11.6 | 64.89±12.80 | <0.050 | 63 (56–72) | 63.0 (56.0–71.0) | |
BMI, kg/m2 | 24.3±3.3 | 24.55±3.74 | 0.215 | 24.1 (22.1–26.2) | 24.0 (22.0–26.2) | |
SBP, mm Hg | 136±21 | 129±18 | <0.050 | 130 (120–140) | 130 (120–140) | |
DBP, mm Hg | 80±11 | 77±11 | <0.050 | 80 (70–82) | 79 (70–82) | |
HbA1c, % | 7.8±1.8 | 7.5±1.8 | <0.050 | 6.9 (6.3–7.8) | 6.8 (6.3–7.8) | |
BUN, mg/dL | 19.6±13.4 | 17.0±7.1 | <0.050 | 15.5 (12.7–19.3) | 15.8 (12.9–19.2) | |
Creatinine, mg/dL | 1.1±1.2 | 0.9±0.6 | <0.050 | 0.8 (0.7–1.0) | 0.8 (0.7–1.0) | |
GFR, mL/min/1.73 m2 | 78.1±27.6 | 84.2±22.9 | <0.050 | 84.3 (71.0–97.5) | 83.77 (69.8–96.4) | |
AST, U/L | 25±14 | 27±25 | 0.116 | 22 (18–28) | 22 (18–27) | |
ALT, U/L | 29±20 | 32±34 | 0.058 | 24 (17–35) | 23.0 (17–34) | |
Total cholesterol, mg/dL | 169±44 | 176±42 | <0.050 | 168.0 (144.0–195.0) | 168 (144–196) | |
Triglyceride, mg/dL | 150±132 | 145±111 | 0.457 | 119.0 (80.0–175.0) | 118 (80–167) | |
HDL-C, mg/dL | 44±12 | 46±13 | <0.050 | 45.0 (38.0–53.0) | 45 (39–53) | |
LDL-C, mg/dL | 95±33 | 100±35 | <0.050 | 93.0 (73.0–116.0) | 92 (73–116) | |
CV complications, % | - | - | 20.4 | 19.4 |
AUROC | AUPRC | PPV | NPV | Sensitivity | Specificity | F1 | Accuracy | |
---|---|---|---|---|---|---|---|---|
GRU-ODE-Bayes | 0.812±0.016 | 0.607±0.033 | 0.889±0.041 | 0.868±0.004 | 0.251±0.027 | 0.994±0.003 | 0.39±0.033 | 0.869±0.005 |
XGBoost | 0.78±0.014 | 0.569±0.029 | 0.842±0.065 | 0.851±0.006 | 0.233±0.036 | 0.993±0.004 | 0.364±0.047 | 0.85±0.008 |
GRU-ODE-Bayes | XGBoost | Logistic regression | |
---|---|---|---|
GRU-ODE-Bayes | - | 0.013 | <0.001 |
XGBoost | 0.013 | - | <0.001 |
Logistic regression | <0.001 | <0.001 | - |
Values are expressed as number (%) or mean±standard deviation unless otherwise indicated. Student’s BMI, body mass index; SBP, systolic blood pressure; DBP, diastolic blood pressure; HbA1c, hemoglobin A1C; BUN, blood urea nitrogen; GFR, glomerular filtration rate; AST, aspartate aminotransferase; ALT, alanine aminotransferase; HDL-C, high-density lipoprotein cholesterol; LDL-C, low-density lipoprotein cholesterol; CV, cardiovascular.
Values are expressed as mean±standard deviation. XGBoost, eXtreme Gradient Boosting; GRU, gated recurrent unit; ODE, ordinary differential equation; AUROC, area under the receiver operating characteristic curve; AUPRC, area under the precision-recall curve; PPV, positive predictive value; NPV, negative predictive value.
The AUROC of each macrovascular risk engine was compared by paired AUROC, area under the receiver operating characteristic curve; GRU, gated recurrent unit; ODE, ordinary differential equation; XGBoost, eXtreme Gradient Boosting.