Personalized Risk Prediction for 30-Day Readmissions With Venous Thromboembolism Using Machine Learning

Jung In Park1, Doyub Kim2, Jung-Ah Lee3
Kai Zheng4, Alpesh Amin5

  1. Assistant Professor, Sue & Bill Gross School of Nursing, University of California, Irvine, CA.
  2. Principal Software Engineer, NVIDIA, Santa Clara, CA.
  3. Associate Professor, Sue & Bill Gross School of Nursing, University of California, Irvine, CA.
  4. Professor, Donald Bren School of Information & Computer Sciences, University of California, Irvine, CA.
  5. Professor, School of Medicine, University of California, Irvine, CA.

Journal of Nursing Scholarship 2021 Feb 22


Purpose: The aim of the study was to develop and validate machine learning models to predict the personalized risk for 30-day readmission with venous thromboembolism (VTE).

Design: This study was a retrospective, observational study.

Methods: We extracted and preprocessed the structured electronic health records (EHRs) from a single academic hospital. Then we developed and evaluated three prediction models using logistic regression, the balanced random forest model, and the multilayer perceptron.

Results: The study sample included 158,804 total admissions; VTE-positive cases accounted for 2,080 admissions from among 1,695 patients (1.31%). Based on the evaluation results, the balanced random forest model outperformed the other two risk prediction models.

Conclusions: This study delivered a high-performing, validated risk prediction tool using machine learning and EHRs to identify patients at high risk for VTE after discharge.

Clinical relevance: The risk prediction model developed in this study can potentially guide treatment decisions for discharged patients for better patient outcomes.

Keywords: 30-day readmission; Risk Prediction Model; electronic health records; machine learning; venous thromboembolism.