Introduction

Bronchiolitis is the leading cause of infant hospitalization in the US, accounting for 107,000 infant hospitalizations each year with a direct cost of 734 million US dollars1. Even among hospitalized infants, the severity of bronchiolitis ranges from moderate (requiring observation and supportive therapies, such as supplemental oxygen, fluid, and nutrition) to near-fatal and fatal infection. Previous studies have identified individual risk factors for higher severity of bronchiolitis (e.g., young age, prematurity, viral etiology)2,3,4,5 and developed prediction scoring models (e.g., logistic regression models)6,7,8,9. However, identifying the subgroup of infants with bronchiolitis who require higher-acuity care (e.g., positive pressure ventilation, intensive care unit [ICU] admission) remains an important challenge. The difficulty and uncertainty of predicting acute severity—and, consequently, the appropriate level of care for infants with bronchiolitis—are reflected by the well-documented variability in inpatient management across the nation1,10,11,12.

Machine learning models have gained increasing attention because of their advantages, such as the ability to incorporate high-order, nonlinear interactions between predictors and to yield more accurate and stable predictions. Indeed, recent studies have reported that machine learning models provide high predictive ability in various conditions and settings—e.g., sepsis13,14, asthma exacerbation15, emergency department (ED) triage16,17, and unplanned transfers to the ICU18. Despite this clinical and research promise, no study has yet examined the utility of modern machine learning models for predicting outcomes in infants hospitalized for bronchiolitis—a large population with high morbidity and health resource use.

In this context, we aimed to develop machine learning models that accurately predict acute severity in infants hospitalized with bronchiolitis, and to compare their predictive performance with that of a conventional scoring approach6.

Results

During 2011–2014, 1,016 infants with bronchiolitis were enrolled into a 17-center prospective cohort study. The median age at enrolment was 3.2 months (IQR 1.6–6.0), 40% were female, and 42% were non-Hispanic white. The hospital length-of-stay varied widely, from 0 to 60 days (median, 2 days) (Table 1). The clinical data had a small proportion of missingness; most variables had < 1% missingness (e.g., 0.1% for oxygen saturation with the use of supplemental oxygen), and the maximum proportion was 4.8% (eTable 3 in Additional file 1). Overall, 55 infants (5.4%) underwent positive pressure ventilation and 163 infants (16.0%) had the intensive treatment outcome.

Table 1 Patient characteristics and clinical outcomes in 1,016 infants hospitalized for bronchiolitis.

Predicting positive pressure ventilation outcome

In the prediction of the positive pressure ventilation outcome, the discriminatory abilities of all models are summarized in Fig. 1A and Table 2. All four machine learning models demonstrated significantly superior AUCs compared with the reference model (all P < 0.001). For example, compared with the reference model (AUC 0.62 [95% CI 0.53–0.70]), the AUC was higher in the elastic net regression (AUC 0.89 [95% CI 0.85–0.92]) and gradient boosted decision tree (AUC 0.88 [95% CI 0.84–0.93]) models. Similarly, compared with the reference model, all machine learning models achieved a significant net reclassification improvement (all P < 0.001).

Figure 1

Prediction ability of the reference and machine learning models for the positive pressure ventilation outcome in the overall cross-validation dataset. (A) Receiver-operating-characteristic (ROC) curves. The corresponding values of the area under the receiver-operating-characteristic curve (AUC) for each model are presented in Table 2. (B) Decision curve analysis. The X-axis indicates the threshold probability for the positive pressure ventilation outcome; the Y-axis indicates the net benefit. Compared to the reference model, the net benefit of all machine learning models was larger across the range of clinical thresholds.

Table 2 Prediction performance of the reference and machine learning models in infants hospitalized for bronchiolitis.

Additionally, compared with the reference model, all machine learning models demonstrated a higher sensitivity (e.g., 0.62 [95% CI 0.49–0.75] in the reference model vs. 0.89 [95% CI 0.80–0.96] in the elastic net regression; Table 2) and specificity (e.g., 0.57 [95% CI 0.54–0.60] in the reference model vs. 0.79 [95% CI 0.77–0.82] in the Lasso regression model). More specifically, all machine learning models correctly predicted a larger number of infants who underwent positive pressure ventilation (true-positives) while predicting fewer positives overall (Table 3). For example, the reference scoring system categorized most infants (n = 629, 62%) into the prediction score groups of 2–3. Within these groups, the reference model correctly identified 16 of the 25 infants who underwent positive pressure ventilation, while predicting that 265 infants would undergo positive pressure ventilation. In contrast, the gradient boosted decision tree model correctly identified 23 of the 25 infants in the same patient groups, while predicting that 135 infants would undergo positive pressure ventilation. Considering the low prevalence of the positive pressure ventilation outcome, all models had a high negative predictive value (e.g., 0.96 [95% CI 0.95–0.97] in the reference model vs. 0.99 [95% CI 0.99–0.99] in the Lasso regression model; Table 2).

Table 3 The number of actual and predicted outcomes of prediction models, according to the score of the reference model.

Likewise, in the decision curve analysis (Fig. 1B), all four machine learning models outperformed the reference model, demonstrating a greater net benefit throughout the range of clinical thresholds. This indicates that the machine learning predictions would more accurately identify high-risk infants (true-positives) while taking the trade-off with false-positives into consideration.
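For context when reading the decision curves, the net benefit at a given threshold probability p_t is conventionally computed as follows (this is the standard decision curve analysis formulation, not a study-specific definition):

net benefit(p_t) = TP/n − (FP/n) × p_t / (1 − p_t)

where TP and FP are the numbers of true- and false-positives among the n infants when all those with a predicted probability above p_t are flagged; a model with a higher net benefit captures more true-positives for a given tolerance of false-positives.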

Predicting intensive treatment outcome

In the prediction of the intensive treatment outcome, the discriminatory abilities of all models are shown in Fig. 2A and Table 2. All four machine learning models demonstrated a significantly higher AUC than the reference model (all P < 0.001). For example, compared with the reference model (AUC 0.62 [95% CI 0.57–0.67]), the AUC was higher in the elastic net regression (AUC 0.80 [95% CI 0.76–0.83]) and random forest (AUC 0.79 [95% CI 0.75–0.84]) models. Similarly, compared with the reference model, all machine learning models achieved a significant net reclassification improvement (all P < 0.001).

Figure 2

Prediction ability of the reference and machine learning models for the intensive treatment outcome in the overall cross-validation dataset. (A) Receiver-operating-characteristic (ROC) curves. The corresponding values of the area under the receiver-operating-characteristic curve (AUC) for each model are presented in Table 2. (B) Decision curve analysis. The X-axis indicates the threshold probability for the intensive treatment outcome; the Y-axis indicates the net benefit. Compared to the reference model, the net benefit of all machine learning models was larger across the range of clinical thresholds.

Additionally, all machine learning models demonstrated a higher sensitivity (e.g., 0.58 [95% CI 0.49–0.75] in the reference model vs. 0.75 [95% CI 0.69–0.82] in the Lasso regression; Table 2) and specificity (e.g., 0.58 [95% CI 0.50–0.66] in the reference model vs. 0.78 [95% CI 0.76–0.81] in the random forest model). For example, among the infants categorized into the reference score groups of 2–3 (62% of cohort infants), the reference model correctly identified 39 of the 80 infants who received intensive treatment, while predicting that 275 infants would receive intensive treatment (Table 3). In contrast, the gradient boosted decision tree model correctly identified 52 of the 80 infants with the outcome, while predicting that 162 infants would receive intensive treatment. Likewise, in the decision curve analysis (Fig. 2B), all four machine learning models outperformed the reference model, demonstrating a greater net benefit throughout the range of clinical thresholds.

Variable importance

To yield insights into the relevance of each predictor, eFigures 1 and 2 (Additional file 1) summarize the 15 most important predictors in the random forest and gradient boosted decision tree models for each outcome. In the prediction of the positive pressure ventilation outcome, age, oxygen saturation with the use of supplemental oxygen, and other vital signs at presentation were the most important predictors in both models (eFigures 1A and 2A). Likewise, in the prediction of the intensive treatment outcome, similar predictors were important in both models (eFigures 1B and 2B).

Discussion

In this analysis of multicenter prospective cohort data from 1,016 infants, we applied four modern machine learning approaches (i.e., Lasso regression, elastic net regression, random forest, and gradient boosted decision tree) to the prediction of acute severity outcomes of bronchiolitis. Compared to the reference model that was derived in an ED sample6, these machine learning models consistently demonstrated superior performance in predicting the positive pressure ventilation and intensive treatment outcomes, including a higher AUC and net reclassification improvement. Additionally, the machine learning models achieved higher sensitivity and specificity for the two outcomes, both in the overall cohort and in the majority of cohort infants categorized into the reference score groups of 2–3. Furthermore, the decision curve analysis demonstrated that the net benefit of the machine learning models was greater—i.e., more true-positives after accounting for the trade-off with false-positives—across a range of clinical thresholds. To the best of our knowledge, this is the first study to investigate the performance of modern machine learning models in predicting severity in infants with bronchiolitis.

One of the main objectives in the risk stratification of infants with bronchiolitis is to promptly identify infants at risk for higher severity and to efficiently utilize finite healthcare resources. The American Academy of Pediatrics bronchiolitis guideline2 highlights the importance of assessing risk in infants with bronchiolitis. However, optimal risk stratification and prediction remain a challenge, as the clinical course in this population (even in infants hospitalized for bronchiolitis) is highly variable10,11,12. Previous studies using conventional modeling (e.g., logistic regression models) have reported a moderate ability to predict severity outcomes (e.g., ED-to-hospital admission, hospital length-of-stay, ICU admission, positive pressure ventilation use) in infants with bronchiolitis6,7,8,9,19. Although the use of an expanded set of predictors—e.g., repeated examinations and invasive monitoring during the hospital course—may yield better predictive performance, it is often impractical in real-world acute care settings, where the aim is to promptly risk-stratify these infants. Alternatively, the use of advanced machine learning models may improve the clinician's decision-making ability. Indeed, machine learning models have recently been applied to the prediction of various disease conditions and clinical settings, such as early identification of mortality risk in patients with sepsis13, rehospitalization in patients with heart failure20, intensive treatment outcomes in patients with asthma exacerbation15, unplanned transfer to the ICU18, and escalated care at pediatric ED triage16. Our multicenter study builds on these earlier reports and extends them by demonstrating that modern machine learning models outperform conventional approaches in predicting higher severity in infants with bronchiolitis. While external validation is warranted, these machine learning models using routinely-available predictors could be implemented in clinical practice (e.g., as online risk calculators or built-in risk assessment systems), similar to existing clinical scoring rules.

Clinical prediction systems strive for an appropriate balance between sensitivity and specificity because of the trade-off between these two measures in the context of the prevalence of clinical outcomes. In the present study, the reference score model neither effectively stratified most infants (62% of the cohort were categorized into two score groups) nor appropriately predicted which infants developed the outcomes. By contrast, the machine learning models correctly identified a larger number of true-positives (i.e., higher sensitivity). This finding supports the utility of these models in the target population, for which one of the major priorities is to reduce "missed" high-risk cases. Additionally, the machine learning models had fewer false-positives (i.e., higher specificity) in predicting both outcomes, although they remained imperfect given the relatively low prevalence of the outcome (5.4% for positive pressure ventilation use). This may mitigate excessive resource use in this large population. These findings are further supported by the decision curve analysis, which demonstrated a greater net benefit of the machine learning models after incorporating the trade-off between true-positives and false-positives across a wide range of clinical thresholds.

There are several potential explanations for the observed gains in the predictive abilities of machine learning models. For example, machine learning models incorporate high-order interactions between predictors and nonlinear relationships with outcomes. Additionally, machine learning models are able to mitigate potential overfitting through several methods, such as regularization, out-of-bag estimation, and cross-validation. Furthermore, the use of large multicenter data with rigorous quality assurance might have contributed to low bias and variance in the machine learning models. Although the machine learning models achieved superior predictive ability, their performance remained imperfect. This may be explained, at least partially, by the limited set of predictors, the subjectivity of some data elements (e.g., parent-reported symptoms at home), variable clinical factors after the prehospitalization assessment (e.g., ED management and patient responses), differences in clinicians' practice patterns, and the availability of intensive care resources. Notwithstanding the complexity and challenges of clinical prediction in infants with bronchiolitis, machine learning models have scalable advantages in the era of health information technology, such as automated model refinement through the sequential extraction of electronic health records, continuous non-invasive physiological monitoring, natural language processing, and reinforcement learning21,22,23,24. This scalability has not been attainable with conventional approaches. Taken together, our findings and recent developments support cautious optimism that modern machine learning may enhance the clinician's ability as an assistive technology.

Our study has several potential limitations. Firstly, the data may be subject to measurement bias and missingness. However, the study was conducted by trained investigators using a standardized protocol, which led to a low proportion of missingness in the predictors (eTable 3 in Additional file 1). Secondly, the clinical thresholds for these outcomes may depend on local resources and vary between clinicians and hospitals (e.g., different criteria for admission to the ICU). Yet, the decision curve analysis demonstrated the greater benefit of the machine learning models across a wide range of clinical thresholds. Lastly, the study cohort consisted of a racially/ethnically- and geographically-diverse US sample of infants hospitalized with bronchiolitis. Although severity in this population was highly variable and the models used prehospitalization data, our models might not be generalizable to infants in ambulatory settings. External validation of the models in different populations and settings is necessary. Nonetheless, our data remain highly relevant for the 107,000 infants hospitalized yearly in the US1.

Conclusion

Based on data from a multicenter prospective cohort of 1,016 infants with bronchiolitis, we developed four machine learning models to predict the severity of illness. Using prehospitalization data as predictors, these models consistently yielded superior performance over the reference model6—a higher AUC, net reclassification improvement, sensitivity, and specificity—in predicting the positive pressure ventilation and intensive treatment outcomes. Specifically, these advanced machine learning models correctly predicted, with fewer false-positives, a larger number of infants with higher severity who would not have been identified by the conventional model. Moreover, the machine learning models achieved a greater net benefit across a wide range of clinical thresholds. Although external validation is warranted, the current study lends support to the application of machine learning models to the prediction of acute severity in infants with bronchiolitis. Machine learning models have the potential to enhance clinicians' decision-making ability and hence to improve clinical care and optimize resource utilization in this high-morbidity population.

Methods

Study design, setting and participants

The current study aimed to develop machine learning models that accurately predict acute severity in infants with bronchiolitis, using data from a multicenter prospective cohort study of 1,016 infants hospitalized for bronchiolitis—the 35th Multicenter Airway Research Collaboration (MARC-35) study25,26. MARC-35 is coordinated by the Emergency Medicine Network (EMNet, https://www.emnet-usa.org)27, an international research collaboration with 246 participating hospitals. Briefly, at 17 sites across 14 U.S. states (eTable 1 in Additional file 1), MARC-35 enrolled infants (aged < 1 year) who were hospitalized with an attending physician diagnosis of bronchiolitis during three consecutive bronchiolitis seasons (November 1 to April 30) from 2011 to 2014. The diagnosis of bronchiolitis was made according to the American Academy of Pediatrics bronchiolitis guidelines2, defined as acute respiratory illness with a combination of rhinitis, cough, tachypnea, wheezing, crackles, and retractions. We excluded infants who were transferred to a participating hospital > 24 h after the initial hospitalization, as well as those with preexisting heart or lung disease, immunodeficiency, immunosuppression, or a gestational age of < 32 weeks.

We followed the Standards for Reporting of Diagnostic Accuracy (STARD) statement for the reporting of prediction models28. The institutional review board of each of the 17 participating hospitals (Alfred I. duPont Hospital for Children, Arnold Palmer Hospital for Children, Boston Children's Hospital, Children's Hospital of Los Angeles, Children's Hospital of Philadelphia, Children's Hospital of Pittsburgh, The Children's Hospital at St. Francis, The Children's Mercy Hospital & Clinics, Children's National Medical Center, Cincinnati Children's Hospital and Medical Center, Connecticut Children's Medical Center, Dell Children's Medical Center of Central Texas, Norton Children's Hospital, Massachusetts General Hospital, Phoenix Children's Hospital, Seattle Children's Hospital, Texas Children's Hospital) approved the study. Written informed consent was obtained from the parent or guardian.

Predictors

For the predictors in the machine learning models, we selected variables based on clinical plausibility and a priori knowledge3,6,7,8,9,29,30,31. These predictors—which are available in most prehospitalization settings—included demographics (age, sex, and race/ethnicity), medical history (prenatal maternal smoking, gestational age, birth weight, postnatal ICU admission, history of hospital and ICU admission, history of breathing problems, and history of eczema), parent-reported symptoms (poor feeding, cyanosis, apnea, and duration of symptoms), ED presentation (vital signs [temperature, pulse rate, respiratory rate, oxygen saturation], the interaction between oxygen saturation and supplemental oxygen use, wheezing, retractions, apnea, and dehydration), and detection of respiratory syncytial virus (RSV) by PCR25. These clinical data were obtained through a structured interview and medical record review by trained physicians and investigators using a standardized protocol26. All data were reviewed at the EMNet Coordinating Center at Massachusetts General Hospital (Boston, MA), and site investigators were queried about missing data and discrepancies identified by manual data checks.

Outcomes

The primary outcome was the use of positive pressure ventilation—continuous positive airway pressure ventilation and/or intubation during the inpatient stay32. The secondary outcome was intensive treatment, defined as a composite of ICU admission and/or the use of positive pressure ventilation during the inpatient stay3,31. In this observational study, patients were managed at the discretion of the treating physicians. These two outcomes have previously been employed as severity outcomes in the MARC-35 study.

Statistical analysis

In the training sets (80% randomly-selected samples) of the fivefold cross-validation, we developed five models for each outcome: the reference model6 and four machine learning models. As the reference model, we fit a logistic regression model using the predictors of a previously-established clinical prediction score that was derived in an ED sample6. We selected this prediction score as the reference model because it was recently developed in a large sample and focused on similar clinical outcomes reflecting the acute severity of bronchiolitis6,33. The predictors included age, poor feeding, oxygen saturation, retractions, apnea, and dehydration; nasal flaring/grunting was excluded based on the availability of data in the current study (eTable 2).
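As an illustrative sketch of this step (the column names below are hypothetical; the actual predictor coding follows eTable 2), the reference model for one cross-validation fold could be fit in R as follows:

```r
# Reference model: logistic regression using the predictors of the
# previously-established clinical prediction score (hypothetical column names)
ref_model <- glm(
  ppv_outcome ~ age + poor_feeding + oxygen_saturation +
    retractions + apnea + dehydration,
  data   = train_set,   # the 80% training split of one fold
  family = binomial()
)

# Predicted probabilities on the held-out 20% test split
ref_pred <- predict(ref_model, newdata = test_set, type = "response")
```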

Next, using the prehospitalization predictors, we developed four machine learning models: (1) logistic regression with Lasso regularization (Lasso regression), (2) logistic regression with elastic net regularization (elastic net regression), (3) random forest, and (4) gradient boosted decision tree models. First, Lasso regression is an extension of regression-based models that shrinks (or regularizes) the predictor coefficients toward zero, thereby effectively selecting important predictors and improving the interpretability of the model34. Lasso regression computes, via cross-validation, the optimal regularization parameter (lambda) that minimizes the least-squares loss plus the L1-shrinkage penalty35. Second, elastic net regression is another regression-based model incorporating both Lasso regularization and Ridge regularization34,36. Elastic net regression calculates the optimal regularization parameter that minimizes the least-squares loss plus a weighted L1-shrinkage penalty and a weighted L2-shrinkage penalty. We used the R glmnet and caret packages for the Lasso regression and elastic net regression models37,38. Third, random forest is an ensemble of decision trees generated from bootstrapped training samples with random predictor selection in tree induction34,39. We created a hyperparameter tuning grid to identify the best set of parameters using cross-validation methods. We used the randomForest and caret packages to construct the random forest models38,40. Lastly, gradient boosted decision tree is another ensemble method, which sequentially constructs new simple tree models that predict the errors and residuals of the previous model. When adding a new tree, this method uses a gradient descent algorithm to minimize a loss function41. We performed hyperparameter tuning sequentially using a fivefold cross-validation method. We used the R xgboost and caret packages to construct the gradient boosted decision tree models38,42. To minimize potential overfitting, we utilized several methods—e.g., regularization (or penalization) in the Lasso and elastic net regression models, out-of-bag estimation in the random forest models, and cross-validation in all models.
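To make this step concrete, the following sketch shows how the four models could be trained through the caret interface, assuming a preprocessed data frame train_set whose outcome is a two-level factor; the tuning grids here are illustrative rather than the study's actual grids:

```r
library(caret)

ctrl <- trainControl(method = "cv", number = 5, classProbs = TRUE,
                     summaryFunction = twoClassSummary)

# (1) Lasso regression: glmnet with alpha fixed at 1 (pure L1 penalty);
#     lambda is selected by cross-validation
lasso_fit <- train(outcome ~ ., data = train_set, method = "glmnet",
                   metric = "ROC", trControl = ctrl,
                   tuneGrid = expand.grid(alpha = 1,
                                          lambda = 10^seq(-4, 0, length.out = 20)))

# (2) Elastic net regression: both alpha (the L1/L2 mixing weight) and
#     lambda are tuned
enet_fit <- train(outcome ~ ., data = train_set, method = "glmnet",
                  metric = "ROC", trControl = ctrl, tuneLength = 10)

# (3) Random forest: mtry tuned over a grid; importance = TRUE retains the
#     permutation importance used later
rf_fit <- train(outcome ~ ., data = train_set, method = "rf",
                metric = "ROC", trControl = ctrl, importance = TRUE,
                tuneGrid = expand.grid(mtry = c(2, 4, 8, 16)))

# (4) Gradient boosted decision tree: xgboost with cross-validated tuning
xgb_fit <- train(outcome ~ ., data = train_set, method = "xgbTree",
                 metric = "ROC", trControl = ctrl, tuneLength = 5)
```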

For the predictor engineering of the machine learning models, we preprocessed the predictors sequentially. First, we investigated non-linear relationships between the continuous predictors and the outcomes and created quadratic terms for age, respiratory rate, and temperature. These quadratic terms were used only in the regression-based machine learning models (i.e., the logistic regression models with Lasso regularization and those with elastic net regularization). Second, we retained only one of each pair of highly-correlated predictors (e.g., age and weight at hospitalization). Third, we imputed predictors with missing values (eTable 3) using bagged tree imputation. Fourth, we converted the continuous predictors onto normalized scales using the Yeo-Johnson transformation. Categorical predictors were coded as dummy variables, while birth weight, gestational age, previous breathing problems, and degree of retractions were coded as ordinal variables. Fifth, to incorporate the clinically evident interaction between oxygen saturation level and the use of supplemental oxygen, we created an interaction term between these two predictors. Lastly, we removed predictors that were highly sparse in the dataset. We applied these preprocessing methods to the training sets and the test sets separately, to avoid carrying information from the training sets into the test sets. We used the R recipes package for this predictor preprocessing43.
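A minimal sketch of this pipeline with the recipes package (predictor names are hypothetical; supplemental_oxygen is assumed to be coded 0/1, and the prep/bake pattern shown here is one common way to keep training and test preprocessing separate):

```r
library(recipes)

# Illustrative recipe mirroring the preprocessing sequence described above
rec <- recipe(outcome ~ ., data = train_set) %>%
  step_mutate(age_sq  = age^2,               # quadratic terms, used only in
              rr_sq   = respiratory_rate^2,  # the regression-based models
              temp_sq = temperature^2) %>%
  step_corr(all_numeric_predictors(), threshold = 0.9) %>%  # drop one of each correlated pair
  step_impute_bag(all_predictors()) %>%           # bagged tree imputation
  step_YeoJohnson(all_numeric_predictors()) %>%   # Yeo-Johnson transformation
  step_dummy(all_nominal_predictors()) %>%        # dummy-code categorical predictors
  step_interact(~ oxygen_saturation:supplemental_oxygen) %>%
  step_nzv(all_predictors())                      # remove highly sparse predictors

# Estimate the steps on the training set only, then apply them to each set,
# so no information leaks from the training data into the test data
prepped  <- prep(rec, training = train_set)
train_pp <- bake(prepped, new_data = NULL)
test_pp  <- bake(prepped, new_data = test_set)
```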

To examine variable importance in the random forest models, we used permutation-based variable importance—the normalized average difference between the prediction accuracy of the out-of-bag estimation and that of the same measure after permuting each predictor. For the gradient boosted decision tree models, we computed the variable importance summed over the boosting iterations39. We graphically presented the ranks of variable importance using unscaled values.
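Assuming the caret objects from the sketch above (with importance = TRUE set when growing the forest), the importance rankings could be extracted as follows:

```r
library(randomForest)
library(xgboost)

# Permutation importance from the random forest: type = 1 returns the mean
# decrease in out-of-bag accuracy after permuting each predictor, and
# scale = FALSE keeps the unscaled values used for plotting
rf_imp   <- importance(rf_fit$finalModel, type = 1, scale = FALSE)
rf_top15 <- head(rf_imp[order(-rf_imp[, 1]), , drop = FALSE], 15)

# Importance summed over the boosting iterations for the gradient boosted trees
xgb_imp   <- xgb.importance(model = xgb_fit$finalModel)
xgb_top15 <- head(xgb_imp, 15)
```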

To measure the test performance of each model, we computed the overall cross-validation performance from the test sets (the remaining randomly-selected 20% samples). As measures of predictive performance, we used (1) the area under the receiver-operating-characteristic curve (AUC), (2) the net reclassification improvement, (3) confusion matrix results (i.e., sensitivity, specificity, positive predictive value, and negative predictive value), and (4) the net benefit from the decision curve analysis. To compare the AUC between models, we used DeLong's test44. To compute the AUC and its confidence interval, we used the pROC package45. We used the net reclassification improvement to quantify whether a new model provides clinically relevant improvements in prediction compared to the reference model46; it was computed with the PredictABEL package47. To address the class imbalance in both outcomes, we employed the value with the shortest distance to the top-left corner of the ROC curve as the threshold for the confusion matrix39. The decision curve analysis incorporates information on both the benefit of correctly predicting the outcome (true-positives) and the relative harm of incorrectly labelling patients as if they would have the outcome (false-positives)—i.e., the net benefit48,49,50,51,52. We graphically presented the net benefit of each model over a range of threshold probabilities (or clinical preferences) of the outcome as decision curves. We used the decision curve analysis R source code from Memorial Sloan Kettering Cancer Center53 and plotted the graphs using the ggplot2 package54. We performed all analyses with R version 3.5.1 (R Foundation for Statistical Computing, Vienna, Austria)55.
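A sketch of the discrimination and threshold steps with pROC (assuming vectors of predicted probabilities ml_pred and ref_pred on a test split; the net reclassification and decision curve computations are omitted):

```r
library(pROC)

# ROC curves and AUCs with DeLong 95% confidence intervals
roc_ml  <- roc(test_set$outcome, ml_pred)
roc_ref <- roc(test_set$outcome, ref_pred)
ci.auc(roc_ml)                                # AUC and its confidence interval
roc.test(roc_ml, roc_ref, method = "delong")  # DeLong's test comparing the AUCs

# Classification threshold for the confusion matrix: the point on the ROC
# curve with the shortest distance to the top-left corner
thr <- coords(roc_ml, x = "best", best.method = "closest.topleft",
              ret = "threshold")
pred_class <- ifelse(ml_pred >= as.numeric(thr), "event", "no_event")
```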