METHOD: In this study, the MNSI data were collected from the Epidemiology of Diabetes Interventions and Complications (EDIC) clinical trials. Two different datasets with different MNSI variable combinations based on the results from the eXtreme Gradient Boosting feature ranking technique were used to analyze the performance of eight different conventional ML algorithms.
RESULTS: The random forest (RF) classifier outperformed other ML models for both datasets. However, all ML models showed almost perfect reliability based on Kappa statistics and a high correlation between the predicted output and actual class of the EDIC patients when all six MNSI variables were considered as inputs.
CONCLUSIONS: This study suggests that the RF algorithm-based classifier using all MNSI variables can help to predict the DSPN severity which will help to enhance the medical facilities for diabetic patients.
METHOD: For designing and modeling the DSPN severity grading systems for MNSI, 19 years of data from Epidemiology of Diabetes Interventions and Complications (EDIC) clinical trials were used. Different Machine learning-based feature ranking techniques were investigated to identify the important MNSI features associated with DSPN diagnosis. A multivariable logistic regression-based nomogram was generated and validated for DSPN severity grading using the best performing top-ranked MNSI features.
RESULTS: Top-10 ranked features from MNSI features: Appearance of Feet (R), Ankle Reflexes (R), Vibration perception (L), Vibration perception (R), Appearance of Feet (L), 10-gm filament (L), Ankle Reflexes (L), 10-gm filament (R), Bed Cover Touch, and Ulceration (R) were identified as important features for identifying DSPN by Multi-Tree Extreme Gradient Boost model. The nomogram-based prediction model exhibited an accuracy of 97.95% and 98.84% for the EDIC test set and an independent test set, respectively. A DSPN severity score technique was generated for MNSI from the DSPN severity prediction model. DSPN patients were stratified into four severity levels: absent, mild, moderate, and severe using the cut-off values of 17.6, 19.1, 20.5 for the DSPN probability less than 50%, 75%-90%, and above 90%, respectively.
CONCLUSIONS: The findings of this work provide a machine learning-based MNSI severity grading system which has the potential to be used as a secondary decision support system by health professionals in clinical applications and large clinical trials to identify high-risk DSPN patients.