Machine learning-based models to support decision-making in emergency department triage for patients with suspected cardiovascular disease

https://doi.org/10.1016/j.ijmedinf.2020.104326Get rights and content

Highlights

  • Cardiovascular disease, which is an acute time-sensitive condition, is the leading cause of mortality and morbidity.

  • Drawbacks of the Chinese Emergency Triage Scale, such as human dependency and ambiguity of judgement have been highlighted.

  • Machine learning models perform moderately well in predicting the triage levels.

  • eXtreme gradient boosting yielded a more accurate prediction of triage.

  • Blood pressure, pulse rate, oxygen saturation, and age are the most significant variables for making decision of the triage.

Abstract

Background

Accurate differentiation and prioritization in emergency department (ED) triage is important to identify high-risk patients and to efficiently allocate of finite resources. Using data available from patients with suspected cardiovascular disease presenting at ED triage, this study aimed to train and compare the performance of four common machine learning models to assist in decision making of triage levels.

Methods

This cross-sectional study in the second Affiliated Hospital of Guangzhou Medical University was conducted from August 2015 to December 2018 inclusive. Demographic information, vital signs, blood glucose, and other available triage scores were collected. Four machine learning models – multinomial logistic regression (multinomial LR), eXtreme gradient boosting (XGBoost), random forest (RF) and gradient-boosted decision tree (GBDT) – were compared. For each model, 80 % of the data set was used for training and 20 % was used to test the models. The area under the receiver operating characteristic curve (AUC), accuracy and macro- F1 were calculated for each model.

Results

In 17,661 patients presenting with suspected cardiovascular disease, the distribution of triage of level 1, level 2, level 3 and level 4 were 1.3 %, 18.6 %, 76.5 %, and 3.6 % respectively. The AUCs were: XGBoost (0.937), GBDT (0.921), RF (0.919) and multinomial LR (0.908). Based on feature importance generated by XGBoost, blood pressure, pulse rate, oxygen saturation, and age were the most significant variables for making decisions at triage.

Conclusion

Four machine learning models had good discriminative ability of triage. XGBoost demonstrated a slight advantage over other models. These models could be used for differential triage of low-risk patients and high-risk patients as a strategy to improve efficiency and allocation of finite resources.

Introduction

Cardiovascular disease (CVD) is the leading cause of mortality and morbidity worldwide accounting for approximately one third of all deaths [[1], [2], [3]]. It is an acute time-sensitive condition, so it is important to identify high-risk patients at an early stage. The number of patients with CVD visiting emergency departments (ED) in China is enormous [4]. For most of these patients, the first emergency care they encounter is at triage. Triage, a specialized role typically performed by registered nurses, is sorting patients by acuity to prioritize them for full evaluation [5].

The Chinese Emergency Triage Scale (CETS) was originally developed in 2011 and further improved in 2018 [6,7]. The scale is now used in many hospitals in Mainland China based on the Chinese policy. It has been proved that the CETS is a reliable system for ED triage and can promote rapid and effective triage in Mainland China [8].

The CETS classifies patients into four levels based on the assessment of vital complaints and vital parameters [7,8]. Although most criteria are objective indicators, the final triage level assigned to the patient is influenced by the subjective judgment of the triage nurse, even if there are strict requirements for the triage personnel’s ability and maintaining regular education and training for them. Consequently, drawbacks such as human dependency and ambiguity of judgment have been highlighted. These potential problems could worsen when the volume of patients increases and information accumulates.

An algorithmic electronic triage (e-triage) model has been shown to support decision-making and to improve patient risk management in the ED [[10], [11], [12], [13]]. Little e-triage research has been reported in Mainland China. Machine learning models, which were developed based on the electronic algorithm, have been reported to have superior ability for prediction in different disease conditions [14,15]. E-triage essentially is a rule-based expert system, which is difficult to apply to the clinical practice because of the inherent complexity and uncertainty of clinical information [16]. However, machine learning models can learn patterns from large, complex and heterogeneous data and perform practical accurate predictions [17]. Additionally, machine learning used for decision-support systems can potentially alleviate the increased cognitive load of medical professionals, allowing them to focus more on clinical care [18].

In this context, we used ED visit data to develop four machine learning models by using routinely available triage data to accurately predict triage levels for patients with suspected CVD, and to compare the predictive performance of each model.

Section snippets

Study design

A cross-sectional study of ED patients with suspected CVD was conducted from August 2015 to December 2018. Ethical approval was obtained from the Clinical Research Ethics Committee of Guangzhou Medical University.

Study setting

This study was conducted in the ED of the second Affiliated Hospital of Guangzhou Medical University (AHGZMU), which is a teaching hospital with 1500 beds. The ED receives more than 150,000 new patients per year and serves a population of approximately 1.56 million people in the Hai

Characteristics of study samples

There were 28,242 visits with suspected CVD recorded in the ED information system during August 2015 and December 2018. Of these, 429 visits younger than 14 years old and 10,152 with more than a half missing variables were excluded, leaving 17,661 visits in the analytic cohort (Fig. 1). The characteristics of patients in the analytic and non-analytic cohorts were generally similar (Appendix A). In the analytic cohort, the median age of the patients was 65 years old (IQR 51-78 years old) and

Discussion

Using data available at triage, we developed four machine-learning models (i.e., multinomial LR, RF, XGBoost, and GBDT) to make decision of the triage levels of patients with suspected CVD. Machine learning models performed well achieving AUCs greater than 0.90 and had moderately good capability to accurately classify patients with suspected CVD into different triage levels. Among the four models, XGBoost performed slightly well. The major goals of ED triage are to accurately differentiate

Limitations

This study has several limitations. First, visits with more than a half missing variables were excluded; however, the analytic and non-analytic were still generally comparable, which can argue against substantial selection bias. Second, this study was performed at a single institution and without external validation dataset; it might not be generalized to other hospitals. Individualized site-specific machine learning models need to be developed to improve accuracy in the future. Third, as

Conclusion

Machine learning models were successfully developed to support decision-making of triage levels for patients with suspected CVD using only data available at the time of triage. These models had good discriminative ability of triage, and XGBoost demonstrated a slight advantage over other models. Moreover, these models could be used for differential triage of low-risk patients and high-risk patients as a strategy to improve efficiency and allocation of finite resources.

Summary points

What was

Contributions

HJ and HM conceived and designed the study. HJ, PL and XC monitored the entire planning, execution and analysis of the study. HJ and XC obtained the ethical and grant for the study. HJ, PL, XC, HjL and GY participated in the data collection for the study. HM, HmL and WG analyzed and explained the data. HJ, HM, XC and THR provided advice on the study methods and manuscript writing. HJ, HM, HmL, WG and THR wrote the first draft of the manuscript and prepared the manuscript. All authors

Funding

This work was supported by the Major Project of Guangzhou Health Science and Technology [grant number 2020A031005]; and The Key Medical Disciplines and Specialities Program of Guangzhou.

Declaration of Competing Interest

The authors report no declarations of interest.

Acknowledgments

This research study was supported by the Major Project of Guangzhou Health Science and Technology (Grant No. 2020A031005) and The Key Medical Disciplines and Specialities Program of Guangzhou.

References (41)

  • Ministry of Health of the People’s Republic of China

    Guidelines for the classification of emergency patients (draft for solicitation)

    Chin. J. Crit. Care Med. (Electron. Ed.).

    (2011)
  • D. Shi et al.

    Consensus of emergency triage experts

    Chin. J. Emerg. Med.

    (2018)
  • X. Xie et al.

    Prognostic value of modified early warning score generated in a Chinese emergency department: a prospective cohort study

    BMJ Open

    (2018)
  • S. Levin et al.

    Machine-learning-based electronic triage more accurately differentiates patients with respect to clinical outcomes compared with the Emergency Severity Index

    Ann. Emerg. Med.

    (2018)
  • S.W. Choi et al.

    Machine learning-based prediction of Korean Triage and Acuity Scale level in emergency department patients

    Healthc. Inform. Res.

    (2019)
  • E.A. Cicolo et al.

    Electronic and manual registration of Manchester System: reliability, accuracy, and time evaluation

    Rev. Lat.-Am. Enferm.

    (2019)
  • R.A. Taylor et al.

    Prediction of in-hospital mortality in emergency department patients with sepsis: a local big data-driven, machine learning approach

    Acad. Emerg. Med.

    (2016)
  • C. Eken et al.

    Artificial neural network, genetic algorithm, and logistic regression applications for predicting renal colic in emergency settings

    Int. J. Emerg. Med.

    (2009)
  • B. Greg

    IEEE/IBM Watson student showcase

    Computer

    (2016)
  • K. Shameer et al.

    Machine learning in cardiovascular medicine: are we there yet?

    Heart

    (2018)
  • Cited by (38)

    View all citing articles on Scopus
    1

    These authors contributed equally to this work.

    View full text