Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing

Thumbnail Image

Authors

Orangi-Fard, Negar
Akhbardeh, Alireza
Sagreiya, Hersh

Issue Date

2022-01-26

Type

Article

Language

en_US

Keywords

natural language processing , machine learning , intensive care unit , readmission , health informatice

Research Projects

Organizational Units

Journal Issue

Alternative Title

Abstract

Predicting ICU readmission risk will help physicians make decisions regarding discharge. We used discharge summaries to predict ICU 30-day readmission risk using text mining and machine learning (ML) with data from the Medical Information Mart for Intensive Care III (MIMIC-III).We used Natural Language Processing (NLP) and the Bag-of-Words approach on discharge summaries to build a Document-Term-Matrix with 3000 features. We compared the performance of support vector machines with the radial basis function kernel (SVM-RBF), adaptive boosting (AdaBoost), quadratic discriminant analysis (QDA), least absolute shrinkage and selection operator (LASSO), and Ridge Regression. A total of 4000 patients were used for model training and 6000 were used for validation. Using the bag-of-words determined by NLP, the area under the receiver operating characteristic (AUROC) curve was 0.71, 0.68, 0.65, 0.69, and 0.65 correspondingly for SVM-RBF, AdaBoost, QDA, LASSO, and Ridge Regression. We then used the SVM-RBF model for feature selection by incrementally adding features to the model from 1 to 3000 bag-of-words. Through this exhaustive search approach, only 825 features (words) were dominant. Using those selected features, we trained and validated all ML models. The AUROC curve was 0.74, 0.69, 0.67, 0.70, and 0.71 respectively for SVM-RBF, AdaBoost, QDA, LASSO, and Ridge Regression. Overall, this technique could predict ICU readmission relatively well.

Description

AMA Style Orangi-Fard N, Akhbardeh A, Sagreiya H. Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing. Informatics. 2022; 9(1):10. https://doi.org/10.3390/informatics9010010 Chicago/Turabian Style Orangi-Fard, Negar, Alireza Akhbardeh, and Hersh Sagreiya. 2022. "Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing" Informatics 9, no. 1: 10. https://doi.org/10.3390/informatics9010010
AMA Style Orangi-Fard N, Akhbardeh A, Sagreiya H. Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing. Informatics. 2022; 9(1):10. https://doi.org/10.3390/informatics9010010 Chicago/Turabian Style Orangi-Fard, Negar, Alireza Akhbardeh, and Hersh Sagreiya. 2022. "Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing" Informatics 9, no. 1: 10. https://doi.org/10.3390/informatics9010010

Citation

Orangi-Fard, N.; Akhbardeh, A.; Sagreiya, H. Predictive Model for ICU Readmission Based on Discharge Summaries Using Machine Learning and Natural Language Processing. Informatics 2022, 9, 10. https://doi.org/10.3390/informatics9010010

Publisher

MDPI

License

Journal

Informatics

Volume

Issue

PubMed ID

DOI

ISSN

2227-9709

EISSN