Clinical applications of machine learning in the survival prediction and classification of sepsis: coagulation and heparin usage matter

J Transl Med. 2022 Jun 11;20(1):265. doi: 10.1186/s12967-022-03469-6.

Abstract

Background: Sepsis is a life-threatening syndrome eliciting highly heterogeneous host responses. Current prognostic evaluation methods used in clinical practice are characterized by an inadequate effectiveness in predicting sepsis mortality. Rapid identification of patients with high mortality risk is urgently needed. The phenotyping of patients will assistant invaluably in tailoring treatments.

Methods: Machine learning and deep learning technology are used to characterize the patients' phenotype and determine the sepsis severity. The database used in this study is MIMIC-III and MIMIC-IV ('Medical information Mart for intensive care') which is a large, public, and freely available database. The K-means clustering is used to classify the sepsis phenotype. Convolutional neural network (CNN) was used to predict the 28-day survival rate based on 35 blood test variables of the sepsis patients, whereas a double coefficient quadratic multivariate fitting function (DCQMFF) is utilized to predict the 28-day survival rate with only 11 features of sepsis patients.

Results: The patients were grouped into four clusters with a clear survival nomogram. The first cluster (C_1) was characterized by low white blood cell count, low neutrophil, and the highest lymphocyte proportion. C_2 obtained the lowest Sequential Organ Failure Assessment (SOFA) score and the highest survival rate. C_3 was characterized by significantly prolonged PTT, high SIC, and a higher proportion of patients using heparin than the patients in other clusters. The early mortality rate of patients in C_3 was high but with a better long-term survival rate than that in C_4. C_4 contained septic coagulation patients with the worst prognosis, characterized by slightly prolonged partial thromboplastin time (PTT), significantly prolonged prothrombin time (PT), and high septic coagulation disease score (SIC). The survival rate prediction accuracy of CNN and DCQMFF models reached 92% and 82%, respectively. The models were tested on an external dataset (MIMIC-IV) and achieved good performance. A DCQMFF-based application platform was established for fast prediction of the 28-day survival rate.

Conclusion: CNN and DCQMFF accurately predicted the sepsis patients' survival, while K-means successfully identified the phenotype groups. The distinct phenotypes associated with survival, and significant features correlated with mortality were identified. The findings suggest that sepsis patients with abnormal coagulation had poor outcomes, abnormal coagulation increase mortality during sepsis. The anticoagulation effects of appropriate heparin sodium treatment may improve extensive micro thrombosis-caused organ failure.

Keywords: Coagulation; Convolution neural network; Deep learning; Sepsis; Survival prediction.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Blood Coagulation Disorders*
  • Hematologic Tests
  • Heparin / pharmacology
  • Heparin / therapeutic use
  • Humans
  • Machine Learning
  • Prognosis
  • Retrospective Studies
  • Sepsis*

Substances

  • Heparin