A Machine-Learning Model Based on Clinical Features for the Prediction of Severe Dysphagia After Ischemic Stroke

Feng Ye,¹,* Liang-Ling Cheng,¹,* Wei-Min Li,² Ying Guo,³ Xiao-Fang Fan²

¹Wuxi School of Medicine, Jiangnan University, Wuxi, People’s Republic of China; ²Department of Ultrasonography, Affiliated Hospital of Jiangnan University, Wuxi, People’s Republic of China; ³Department of Neurology, Affiliated Hospital of Jiangnan University, Wuxi, People’s Republic of China

*These authors contributed equally to this work

Correspondence: Xiao-Fang Fan, Department of Ultrasonography, Affiliated Hospital of Jiangnan University, Wuxi, People’s Republic of China, Email [email protected]

Background: This study aimed to construct machine-learning models for prediction of severe dysphagia after ischemic stroke based on clinical features and identify significant clinical predictors.
Methods: Patients hospitalized with dysphagia after ischemic stroke at the Affiliated Hospital of Jiangnan University were retrospectively analyzed and randomly divided into training and validation sets at a ratio of 7:3. Additional patients from Huai’an Hospital were selected as the test set. Nineteen relevant clinical characteristics were collected. According to the water swallowing test (WST), patients were divided into a severe dysphagia group and a non-severe dysphagia group. K-nearest neighbor (KNN), decision tree (DT), random forest (RF), support vector machine (SVM), light gradient boosting machine (LGBM), and extreme gradient boosting (XGBoost) were applied to predict severe dysphagia. Receiver operating characteristic (ROC) curves were plotted, the area under the ROC curve (AUC) was calculated to assess predictive power, and DeLong’s test was used to compare the AUCs among the six models. Finally, an optimal model was obtained, and significant clinical predictors of severe dysphagia after stroke were screened.
Results: A total of 724 patients were enrolled, with 422 in the training set, 182 in the validation set and 120 in the test set, and with no statistically significant differences in baseline information (P>0.05). In the training set, the AUCs of KNN, DT, RF, SVM and XGBoost were higher than that of LGBM (P<0.05). In the validation and test sets, the AUCs of XGBoost were higher than those of the other models. The performance metrics of XGBoost were better in terms of accuracy, precision, recall, and F1-score. Therefore, XGBoost was the best model, with good clinical practicality. Furthermore, the top five features based on XGBoost were NIHSS score, BI, BMI, age and time since stroke onset.
Conclusion: Among all clinical feature-based machine-learning models for the prediction of severe dysphagia after ischemic stroke, XGBoost had the best predictive value.

Keywords: ischemic stroke, dysphagia, machine learning, predictive model

Introduction

Stroke is a leading cause of death globally, with ischemic stroke being the most common type, accounting for approximately 80% of all strokes.^1,2 Dysphagia remains one of the most frequent complications of stroke, with an incidence of 37–78%.^3,4 Severe dysphagia not only causes malnutrition due to poor nutritional uptake but may also result in dehydration, aspiration pneumonia, asphyxia, and other serious consequences, thereby leading to poor prognosis and even death.⁵ Therefore, the early prediction, prevention, and treatment of severe dysphagia after stroke are of great clinical importance.

Current research on dysphagia after stroke mainly focuses on evaluation tools, risk factors, prevention, and treatment; however, no effective and unified prediction model has been developed. As a branch of artificial intelligence (AI), machine learning can process a large amount of multidimensional data, which goes beyond traditional statistical methods to some extent.⁶ There is no doubt that machine learning is increasingly used in the medical field in terms of diagnosis, treatment and prognosis of diseases, and has notable predictive advantages.^7–9

Based on clinical features, this study aimed to construct machine-learning models for the prediction of dysphagia grade after ischemic stroke, identify significant clinical predictors of severe dysphagia, and shed light on the clinical optimization of dysphagia management.

Materials and Methods

Study Population

All procedures involving human participants were performed in accordance with the Declaration of Helsinki (2013). The ethics committee of the Affiliated Hospital of Jiangnan University approved this retrospective study (LS2024245). The requirement for individual consent for this retrospective study was waived. We retrospectively recruited 724 hospitalized patients with dysphagia after ischemic stroke from December 2021 to November 2024 in the Department of Neurology of the Affiliated Hospital of Jiangnan University and Huai’an Hospital. The inclusion criteria were as follows: (1) confirmed ischemic stroke on MRI, with typical clinical symptoms; (2) medically stable, with no digestive dysfunction; (3) suspicious or abnormal results of the first swallowing function assessment within 72 h after stroke; (4) normal swallowing function before onset. Patients were excluded if they (1) had disorders of consciousness or cognitive function; (2) had difficulties in hearing, understanding, or communication; (3) had severe organ dysfunction in the heart, brain, or kidney; (4) had dysphagia probably caused by other diseases.

Assessment of Swallowing Function

The WST was used to assess swallowing function within 72 h of admission. All patients were seated comfortably and asked to drink 30 mL of water. Drinking time and signs of choking were recorded: Grade I: no choking on drinking trial without interruption; Grade II: no choking on drinking trial with interruption; Grade III: choking on drinking trial without interruption; Grade IV: choking on drinking trial with interruption; Grade V: unable to finish drinking trial and several choking episodes. Patients in Grades II–III and IV–V were further divided into the non-severe dysphagia and severe dysphagia groups, respectively.
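The grouping rule above can be written as a small lookup. The following is a minimal sketch in Python, not the authors' code; the function name `wst_to_label` and the use of integers 1 to 5 for Grades I to V are assumptions made for illustration. Grade I is rejected because the text assigns only Grades II–V to the two study groups.

```python
def wst_to_label(grade: int) -> int:
    """Map a WST grade (1-5 for Grades I-V) to the binary outcome.

    Returns 0 for non-severe dysphagia (Grades II-III) and
    1 for severe dysphagia (Grades IV-V), as described in the text.
    """
    if grade in (2, 3):
        return 0
    if grade in (4, 5):
        return 1
    if grade == 1:
        # Grade I is not assigned to either study group in the text.
        raise ValueError("Grade I is outside both study groups")
    raise ValueError(f"unknown WST grade: {grade}")


# Illustrative grades only, not patient data.
labels = [wst_to_label(g) for g in (2, 3, 4, 5)]
print(labels)
```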

Baseline Information

Nineteen clinical characteristics were selected on the basis of the literature,^3,10–14 including age, time since stroke onset, sex (male=0, female=1), hypertension (no=0, yes=1), hyperlipidemia (no=0, yes=1), diabetes (no=0, yes=1), muscle strength (higher than level 3=0, lower than level 3=1), smoking history (no=0, yes=1), drinking history (no=0, yes=1), National Institutes of Health Stroke Scale (NIHSS) score, Barthel index (BI), cough (no=0, yes=1), dysarthria (no=0, yes=1), body mass index (BMI), location of the stroke (cortical stroke=1, subcortical stroke=2, cerebellar stroke=3, brainstem stroke=4, mixed/multifocal stroke=5), motor dysfunction (no=0, yes=1), cognitive dysfunction (no=0, yes=1), history of atrial fibrillation (no=0, yes=1) and history of cerebral infarction (no=0, yes=1).
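As an illustration of this coding scheme, the sketch below turns one patient record into a 19-element numeric row. The numeric codes are transcribed from the text; the field names, the column order, and the example record are hypothetical and are not taken from the study data.

```python
# Field names and column order are assumptions; only the codes come from the text.
CONTINUOUS = ["age", "time_since_onset", "nihss", "bi", "bmi"]
YES_NO = [
    "hypertension", "hyperlipidemia", "diabetes", "smoking", "drinking",
    "cough", "dysarthria", "motor_dysfunction", "cognitive_dysfunction",
    "atrial_fibrillation", "prior_cerebral_infarction",
]
CATEGORICAL = {
    "sex": {"male": 0, "female": 1},
    "muscle_strength": {"higher than level 3": 0, "lower than level 3": 1},
    "location": {
        "cortical": 1, "subcortical": 2, "cerebellar": 3,
        "brainstem": 4, "mixed/multifocal": 5,
    },
}
FEATURE_ORDER = CONTINUOUS + YES_NO + list(CATEGORICAL)


def encode(record):
    """Return the record as a list of floats in FEATURE_ORDER."""
    row = []
    for name in FEATURE_ORDER:
        value = record[name]
        if name in CONTINUOUS:
            row.append(float(value))
        elif name in YES_NO:
            row.append(float({"no": 0, "yes": 1}[value]))
        else:
            row.append(float(CATEGORICAL[name][value]))
    return row


# Hypothetical patient, for illustration only.
example = {"age": 68, "time_since_onset": 12, "nihss": 9, "bi": 45, "bmi": 23.4,
           "sex": "female", "muscle_strength": "lower than level 3",
           "location": "brainstem"}
example.update({name: "no" for name in YES_NO})
example["hypertension"] = "yes"

row = encode(example)
print(len(row), row)
```

The location codes 1 to 5 are nominal labels rather than an ordered scale. The text does not say whether they were passed to the models as integers or expanded into indicator variables, so the sketch keeps the integer codes as written.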

Model Construction and Evaluation

Eligible patients from the Affiliated Hospital of Jiangnan University were randomly divided into training and validation sets at a ratio of 7:3. Patients from Huai’an Hospital served as an external test set. Following previous machine-learning studies,^15–19 K-nearest neighbor (KNN), decision tree (DT), random forest (RF), support vector machine (SVM), light gradient boosting machine (LGBM), and extreme gradient boosting (XGBoost) were applied to predict severe dysphagia. Receiver operating characteristic (ROC) curves were plotted against the original propensity score, the area under the ROC curve (AUC) was calculated to assess predictive power, and DeLong’s test was used to compare the AUCs among the six models. Finally, an optimal model was obtained, and significant clinical predictors of severe dysphagia after stroke were screened.
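A random 7:3 split of this kind can be sketched as follows. This is an illustrative example only: the function name, the seed, and the rounding rule (rounding the training share down) are assumptions, since the text does not state how the split was implemented. With 604 patients, rounding down yields 422 and 182, matching the set sizes reported in the Results.

```python
import random


def split_7_3(ids, seed=0):
    """Shuffle identifiers and return (training, validation) at a 7:3 ratio."""
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility
    n_train = (len(shuffled) * 7) // 10    # round the training share down
    return shuffled[:n_train], shuffled[n_train:]


# 604 placeholder identifiers standing in for the single-center cohort.
train_ids, valid_ids = split_7_3(range(604))
print(len(train_ids), len(valid_ids))
```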

Statistical Analysis

Normally distributed continuous variables were expressed as mean ± standard deviation and non-normally distributed variables as median and interquartile range (IQR); differences between two groups were assessed using the independent t-test or the Mann–Whitney U-test, respectively. Categorical variables were presented as counts (%), and differences between two groups were assessed using the chi-square test or Fisher’s exact test. Records with missing data were excluded (listwise deletion). ROC curves were plotted against the original propensity score, AUCs were calculated to assess predictive power, and DeLong’s test was used to compare the performance of the six models. Confounding was limited by applying predefined inclusion criteria and by assigning study subjects to groups at random. All statistical analyses were performed using SPSS, Version 26.0, and R software, Version 4.2.1. The threshold for significance was set at P<0.05.
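For readers less familiar with the AUC, it equals the probability that a randomly chosen positive case receives a higher score than a randomly chosen negative case, with ties counted as one half. The sketch below computes it directly from that definition in plain Python. It is not the R routine used in the study, it does not implement DeLong’s test, and the labels and scores are made up for illustration.

```python
def auc(labels, scores):
    """AUC as P(score of a positive > score of a negative); ties count 0.5."""
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("both classes are required")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Illustrative values only.
y = [0, 0, 1, 1]
print(auc(y, [0.1, 0.4, 0.35, 0.8]))  # 3 of the 4 positive-negative pairs are ordered correctly
print(auc(y, [0.5, 0.5, 0.5, 0.5]))   # a constant score cannot separate the classes
```

An AUC of 1.0 means every positive case is ranked above every negative case, whereas 0.5 corresponds to chance-level discrimination, which is also what a model that assigns the same score to every patient produces.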

Results

Baseline Information Characteristics in Training Set, Validation Set and Test Set

A total of 724 patients were enrolled in this study (308 with severe dysphagia and 416 without). The 604 patients from the Department of Neurology of the Affiliated Hospital of Jiangnan University were randomly divided at a ratio of 7:3 for hold-out validation, with 422 (189 with severe dysphagia and 233 without) in the training set and 182 (74 with severe dysphagia and 108 without) in the validation set. The 120 patients (45 with severe dysphagia and 75 without) from the Department of Neurology of Huai’an Hospital formed the test set. No significant differences in baseline characteristics were found among the three sets (P>0.05). Details are presented in Table 1.

Table 1 Baseline Characteristics in Different Sets (N=724)

Construction and Predictive Power of the 6 Models

The predictive power of the six models is shown in Figure 1. In the training set, ROC analysis revealed that the AUCs of KNN (1.000, 95% CI: 0.999–1.000), DT (0.890, 95% CI: 0.858–0.922), RF (1.000, 95% CI: 1.000–1.000), SVM (0.895, 95% CI: 0.865–0.925) and XGBoost (0.991, 95% CI: 0.985–0.997) were all significantly higher than that of LGBM (0.841, 95% CI: 0.804–0.878), with P<0.05 (Figure 1A).

Figure 1 AUCs of six machine-learning models in the different sets. (A) AUCs in the training set. (B) AUCs in the validation set. (C) AUCs in the test set.

In the validation set, the AUC of XGBoost (0.916, 95% CI: 0.849–0.983) was significantly higher than that of SVM (0.905, 95% CI: 0.834–0.975), KNN (0.787, 95% CI: 0.690–0.885), DT (0.796, 95% CI: 0.706–0.887), RF (0.891, 95% CI: 0.821–0.961) and LGBM (0.500, 95% CI: 0.500–0.500), with P<0.05 (Figure 1B).

In the test set, the AUC of XGBoost (0.959, 95% CI: 0.926–0.992) was also the highest among the six machine-learning models, exceeding those of SVM (0.935, 95% CI: 0.893–0.977), KNN (0.671, 95% CI: 0.589–0.752), DT (0.706, 95% CI: 0.632–0.780), RF (0.850, 95% CI: 0.780–0.919) and LGBM (0.500, 95% CI: 0.500–0.500), with P<0.05 (Figure 1C).

Performance Metrics of the 6 Models

Among the six machine-learning models, RF and KNN demonstrated the best performance in terms of accuracy, precision, recall, and F1-score in the training set but achieved lower scores in the validation and test sets. LGBM performed poorly in the training, validation and test sets. As shown in Table 2, XGBoost performed well across all three sets in terms of accuracy, precision, recall, and F1-score; therefore, XGBoost was considered the optimal model, with good clinical practicality.
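The four metrics named above follow from the counts of true and false positives and negatives. The sketch below shows the standard definitions in plain Python, taking severe dysphagia as the positive class. The function name and the example labels are illustrative assumptions and do not reproduce the values in Table 2.

```python
def classification_metrics(y_true, y_pred):
    """Accuracy, precision, recall and F1-score for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = (tp + tn) / len(pairs)
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return {"accuracy": accuracy, "precision": precision,
            "recall": recall, "f1": f1}


# Made-up labels: 3 true positives, 3 true negatives, 1 false positive, 1 false negative.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 1, 0, 0, 0, 1, 0, 1]
metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Recall is the share of truly severe cases that the model flags, and precision is the share of flagged cases that are truly severe. Which of the two deserves more weight in a screening setting is a clinical judgment that the metrics themselves do not settle.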

Table 2 Performance Metrics of Six Machine Learning Models in Predicting Swallowing Disorders After Ischemic Stroke

Importance of Predictors in the XGBoost and RF Models

The RF and KNN models had the highest AUCs in the training set, but KNN cannot rank the importance of features because of its working principle; therefore, we used the XGBoost and RF models to analyze the importance of the predictors (Figure 2). In the RF model, the top five indicators of importance were NIHSS score, BI, age, BMI and location of ischemic stroke (Figure 2A). In the XGBoost model, the top five indicators of importance were NIHSS score, BI, BMI, age and time since stroke onset (Figure 2B).

Figure 2 The importance of clinical features in machine-learning models. (A) Variable importance in XGBoost model. (B) Variable importance in RF model.

Discussion

Dysphagia remains a common complication of ischemic stroke and can lead to serious clinical consequences. Therefore, early prediction of dysphagia grade after stroke is crucial. As a branch of AI, machine learning includes methods such as logistic regression, SVMs, neural networks, Bayesian networks, DTs, and RFs, and has been applied in the neurosciences.¹⁹ By combining computer science and statistics with medical research to process large amounts of data, machine learning can identify features that are highly correlated with outcomes, thereby reducing subjective differences between observers. We retrospectively analyzed 724 patients with dysphagia after ischemic stroke, applied six machine-learning models (KNN, DT, RF, SVM, LGBM, and XGBoost), and identified XGBoost as the optimal model, with good clinical practicality.

Six machine-learning models were selected to predict severe dysphagia after ischemic stroke: KNN, DT, RF, SVM, LGBM, and XGBoost. ROC analysis revealed that the AUCs of KNN, DT, RF, SVM, and XGBoost were significantly higher than that of LGBM in the training set, indicating that these five models fitted the training data relatively well, although the near-perfect training AUCs of KNN and RF, together with their lower validation and test AUCs, are consistent with overfitting. In the validation and test sets, the AUCs of XGBoost were significantly higher than those of the other five models. Subsequent analysis of the performance metrics revealed that XGBoost was better in terms of accuracy, precision, recall, and F1-score; therefore, XGBoost was the optimal model, with good clinical practicality. XGBoost is an optimized distributed gradient-boosting library designed to implement efficient, flexible, and portable machine-learning algorithms. Its core strength lies in the use of gradient boosting to minimize a loss function: in each iteration, XGBoost learns a new model to fit the residuals of the previous models, thereby gradually reducing prediction errors.^20,21
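The residual-fitting idea described above can be illustrated with a deliberately simplified sketch: gradient boosting with one-split regression trees (stumps) under squared-error loss on a single feature. This is not XGBoost itself, which additionally uses second-order gradient information, regularization, and a logistic loss for classification. The function names, the learning rate, and the toy data are assumptions made for illustration.

```python
def fit_stump(x, residuals):
    """Find the threshold on x whose two leaf means best fit the residuals."""
    best = None
    for threshold in sorted(set(x))[:-1]:  # every split that leaves both sides non-empty
        left = [r for xi, r in zip(x, residuals) if xi <= threshold]
        right = [r for xi, r in zip(x, residuals) if xi > threshold]
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        sse = (sum((r - left_mean) ** 2 for r in left)
               + sum((r - right_mean) ** 2 for r in right))
        if best is None or sse < best[0]:
            best = (sse, threshold, left_mean, right_mean)
    return best[1:]


def boost(x, y, rounds=20, learning_rate=0.5):
    """Each round fits a stump to the current residuals and adds a damped correction."""
    pred = [sum(y) / len(y)] * len(y)  # start from the mean outcome
    history = [sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / len(y)]
    for _ in range(rounds):
        residuals = [yi - pi for yi, pi in zip(y, pred)]
        threshold, left_mean, right_mean = fit_stump(x, residuals)
        pred = [pi + learning_rate * (left_mean if xi <= threshold else right_mean)
                for xi, pi in zip(x, pred)]
        history.append(sum((yi - pi) ** 2 for yi, pi in zip(y, pred)) / len(y))
    return pred, history


# Toy data: a made-up severity score and a binary outcome, not study data.
x = [2, 4, 6, 8, 10, 12, 14, 16]
y = [0, 0, 0, 1, 0, 1, 1, 1]
pred, mse_history = boost(x, y)
print(mse_history[0], mse_history[-1])
```

In this sketch the mean squared error on the training data cannot increase from one round to the next, which is the sense in which each new model gradually reduces the errors left by its predecessors. Performance on unseen data is a separate question and has to be checked on held-out sets, as was done in this study.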

In addition, the top five indicators based on XGBoost were NIHSS score, BI, BMI, age and time since stroke onset. Studies have shown that the NIHSS is reliable, valid, and sensitive, and that its score is positively correlated with the degree of neurological impairment. Moreover, several items in the NIHSS are related to swallowing function,^22–24 so the observation and evaluation of swallowing function should be strengthened in patients with higher NIHSS scores. The NIHSS score has therefore been considered an independent risk factor for the occurrence of dysphagia after stroke, and our study obtained similar results in that the NIHSS score was one of the risk factors for severe dysphagia in patients with ischemic stroke.

The Barthel index is used to measure performance in basic/physical activities of daily living, with a lower score indicating a higher possibility of dysphagia after stroke in patients with changes in motor function. Consistent with previous studies, our results identified BI as a risk factor for severe dysphagia.²⁵ Total or partial impairment of swallowing can lead to reduced food intake, progressive weight loss, and nutrient deficiencies, leading to malnutrition, which is independently associated with poor prognosis in stroke patients.²⁶ Dysphagia and malnutrition are closely related: dysphagia significantly affects the nutritional status of patients, and malnutrition can in turn aggravate dysphagia through neuromuscular dysfunction, forming a vicious cycle. It is recommended that all stroke patients undergo routine nutritional examination to improve their rehabilitation potential and prevent malnutrition during rehabilitation.²⁷

It has been demonstrated that the physiological function of swallowing changes with age. Specifically, the reduction in muscle mass and connective tissue elasticity in older people leads to loss of strength and range of motion, and these age-related changes can negatively affect the effective flow of swallowed substances through the upper aerodigestive tract. Hence, the higher the age, the higher the risk of dysphagia.^28–30

Our study found that time since stroke onset played a significant role in the occurrence of dysphagia after stroke. This has been shown in other studies as well:^31,32 brain damage tends to become more severe over time, which increases the risk of swallowing disorders to some extent. Therefore, patients should not only be taught about typical and less common stroke symptoms and signs but also be provided with the fastest means of transportation to a stroke unit and the best chance of early treatment.

This study had some limitations. First, the retrospective design may have led to selection bias. Second, the information we included was limited; training the model on a cohort with more clinical and pathological characteristics would help further improve its performance. Third, the sample size was relatively small, and thus a prospective study with a larger sample size is needed to verify our results.

Conclusion

In conclusion, machine-learning models can accurately predict severe dysphagia after ischemic stroke, and XGBoost had the best predictive value among the models tested. To make the model easier for others to use, a web application based on the XGBoost model could be developed in the future. Wider adoption in practice will require larger samples and the collection of more relevant patient indicators as model inputs. The model would then output a probability of severe dysphagia after ischemic stroke, thus providing a basis for the clinical assessment of severe dysphagia after ischemic stroke.

Ethical Statement

The authors are accountable for all aspects of this work, and ensure that questions related to the accuracy or integrity of any part of the work are appropriately investigated and resolved. All procedures involving human participants were performed in accordance with the Declaration of Helsinki (2013). The Ethics Committee of the Affiliated Hospital of Jiangnan University approved this retrospective study and waived the requirement for informed consent. Patient information, such as name, age, sex, occupation, address, ID card, related diseases, and treatment plan, was provided by the Affiliated Hospital of Jiangnan University during the treatment period for the illness. Owing to the privacy of patients, the Affiliated Hospital of Jiangnan University maintained the confidentiality of the above information.

Funding

Funding for this study was provided by the Wuxi Maternal and Child Health Promotion Project (FYTG202203).

Disclosure

The authors declare that they have no conflicts of interest.

References

1. Feske SK. Ischemic Stroke. Am J Med. 2021;134:1457–1464. doi:10.1016/j.amjmed.2021.07.027

2. Saini V, Guada L, Yavagal DR. Global epidemiology of stroke and access to acute ischemic stroke interventions. Neurology. 2021;97:S6–S16. doi:10.1212/WNL.0000000000012781

3. Banda KJ, Chu H, Kang XL, et al. Prevalence of dysphagia and risk of pneumonia and mortality in acute stroke patients: a meta-analysis. BMC Geriatr. 2022;22:420. doi:10.1186/s12877-022-02960-5

4. McIntyre M, Chimunda T, Koppa M, et al. Risk factors for postextubation dysphagia: a systematic review and meta-analysis. Laryngoscope. 2022;132:364–374. doi:10.1002/lary.29311

5. Todisco M, Giudice C, Todisco M, Tassorelli C, Alfonsi E et al. Assessment and treatment of neurogenic dysphagia in stroke and Parkinson’s disease. Curr Opin Neurol. 2022;35:741–752. doi:10.1097/WCO.0000000000001117

6. Jiang T, Gradus JL, Rosellini AJ. Supervised machine learning: a brief primer. Behav Ther. 2020;51:675–687. doi:10.1016/j.beth.2020.05.002

7. Costa ÂK, Farias SR, Farias SR, et al. Machine learning for predicting chronic diseases: a systematic review. Public Health. 2022;205:14–25. doi:10.1016/j.puhe.2022.01.007

8. Silva GFS, Fagundes TP, Teixeira BC, et al. Machine learning for hypertension prediction: a systematic review. Curr Hypertens Rep. 2022;24:523–533. doi:10.1007/s11906-022-01212-6

9. Ahsan MM, Siddique Z. Machine learning-based heart disease diagnosis: a systematic literature review. Artif Intell Med. 2022;128:102289. doi:10.1016/j.artmed.2022.102289

10. Baroni AFFB, Fábio SRC, Dantas RO. Risk factors for swallowing dysfunction in stroke patients. Arq Gastroenterol. 2012;49:118–124. doi:10.1590/s0004-28032012000200005

11. Wang L, Qiao J, Sun F, et al. Demographic and clinical factors associated with recovery of poststroke dysphagia: a meta-analysis. Brain Behav. 2023;13:e3033. doi:10.1002/brb3.3033

12. Dispenza F, Kulamarva G, Kulamarva G, et al. Predictive factors of severity and persistence of oropharyngeal dysphagia in sub-acute stroke. Eur Arch Otorhinolaryngol. 2021;278:741–748. doi:10.1007/s00405-020-06429-2

13. D’Netto P, Rumbach A, Dunn K, et al. Clinical predictors of dysphagia recovery after stroke: a systematic review. Dysphagia. 2023;38:1–22. doi:10.1007/s00455-022-10443-3

14. Xu Z, Gu Y, Li J, et al. Dysphagia and aspiration pneumonia in elderly hospitalization stroke patients: risk factors, cerebral infarction area comparison. J Back Musculoskelet Rehabil. 2019;32:85–91. doi:10.3233/BMR-170801

15. Angraal S, Mortazavi BJ, Gupta A, et al. Machine learning prediction of mortality and hospitalization in heart failure with preserved ejection fraction. JACC Heart Fail. 2020;8:12–21. doi:10.1016/j.jchf.2019.06.013

16. Greener JG, Kandathil SM, Moffat L, et al. A guide to machine learning for biologists. Nat Rev Mol Cell Biol. 2022;23:40–55. doi:10.1038/s41580-021-00407-0

17. Heo J, Yoon JG, Park H, et al. Machine learning-based model for prediction of outcomes in acute stroke. Stroke. 2019;50:1263–1265. doi:10.1161/STROKEAHA.118.024293

18. Lin CH, Hsu KC, Johnson KR, et al. Evaluation of machine learning methods to stroke outcome prediction using a nationwide disease registry. Comput Methods Programs Biomed. 2020;190:105381. doi:10.1016/j.cmpb.2020.105381

19. Wagner MW, Namdar K, Biswas A, et al. Radiomics, machine learning, and artificial intelligence-what the neuroradiologist needs to know. Neuroradiology. 2021;63:1957–1967. doi:10.1007/s00234-021-02813-9

20. Hou N, Li M, He L, et al. Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost. J Transl Med. 2020;18:462. doi:10.1186/s12967-020-02620-5

21. Bernard D, Doumard E, Ader I, et al. Explainable machine learning framework to predict personalized physiological aging. Aging Cell. 2023;22:e13872. doi:10.1111/acel.13872

22. Tsarapatsani K, Sakellarios AI, Pezoulas VC, et al. Machine learning models for cardiovascular disease events prediction. 2022 Annu Int Conf IEEE Eng Med Biol Soc. 2022:1066–1069. doi:10.1109/EMBC48229.2022.9871121

23. Wang L, Li G, Hao Y, et al. Intravenous thrombolysis for mild stroke: NIHSS 3–5 Versus NIHSS 0–2. J Stroke Cerebrovasc Dis. 2023;32:107070. doi:10.1016/j.jstrokecerebrovasdis.2023.107070

24. Einstad MS, Saltvedt I, Lydersen S, et al. Associations between post-stroke motor and cognitive function: a cross-sectional study. BMC Geriatr. 2021;21:103. doi:10.1186/s12877-021-02055-7

25. Abd-Elmonem AM, Saad-Eldien SS, Abd El-Nabie WA. Effect of oral sensorimotor stimulation on oropharyngeal dysphagia in children with spastic cerebral palsy: a randomized controlled trial. Eur J Phys Rehabil Med. 2021;57:912–922. doi:10.23736/S1973-9087.21.06802-7

26. Zhao WT, Yang M, Wu HM, et al. Systematic review and meta-analysis of the association between sarcopenia and dysphagia. J Nutr Health Aging. 2018;22:1003–1009. doi:10.1007/s12603-018-1055-z

27. Crary MA, Carnaby-Mann GD, Miller L, et al. Dysphagia and nutritional status at the time of hospital admission for ischemic stroke. J Stroke Cerebrovasc Dis. 2006;15:164–171. doi:10.1016/j.jstrokecerebrovasdis.2006.05.006

28. Rajashekar D, Wilms M, MacDonald ME, et al. Lesion-symptom mapping with NIHSS sub-scores in ischemic stroke patients. Stroke Vasc Neurol. 2022;7:124–131. doi:10.1136/svn-2021-001091

29. Sura L, Madhavan A, Carnaby G, et al. Dysphagia in the elderly: management and nutritional considerations. Clin Interv Aging. 2012;7:287–298. doi:10.2147/CIA.S23404

30. Boaden E, Burnell J, Hives L, et al. Screening for aspiration risk associated with dysphagia in acute stroke. Cochrane Database Syst Rev. 2021;10:CD012679. doi:10.1002/14651858.CD012679.pub2

31. Agyeman O, Nedeltchev K, Arnold M, et al. Time to admission in acute ischemic stroke and transient ischemic attack. Stroke. 2006;37:963–966. doi:10.1161/01.STR.0000206546.76860.6b

32. Turin TC, Kita Y, Rumana N, et al. The time interval window between stroke onset and hospitalization and its related factors. Neuroepidemiology. 2009;33:240–246. doi:10.1159/000229778

Creative Commons License © 2024 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, 3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.
