Introduction

Crohn’s disease (CD) is a chronic inflammatory disease of the gastrointestinal tract characterized by a relapsing and remitting course1. Without adequate medical treatment, CD often results in irreversible bowel damage. Therefore, monitoring of disease activity and timely treatment is of utmost importance to prevent the complications of CD.

Traditionally, monitoring of patients with CD is based on clinical symptoms. Crohn’s disease activity index (CDAI) or Harvey–Bradshaw index (HBI) are commonly used tools for assessing the symptoms of CD2,3. Symptom resolution assessed using patient-reported outcomes (PRO) is one of the targets of inflammatory bowel disease (IBD) management in the treat-to-target strategy4. Such monitoring is typically performed intermittently (3–4 times a year) during outpatient follow-up. However, intermittent PRO monitoring may not be adequate for assessing the long-term progressive nature of CD as several clinical symptoms that occur at home, school, or workplace are liable to be missed5. Hence, seamless monitoring using remote technology may be an ideal approach for monitoring CD activity.

Self-monitoring via mobile technology has gained unprecedented traction amidst the boom in the development of mobile health applications and restricted health services during the coronavirus disease 2019 (COVID-19) pandemic6,7. For IBD, there are several commercially or noncommercially available apps, such as GI Monitor, GI Buddy, myIBD, HealthPROMISE, and UCLA eIBD8,9,10. These apps provide PRO symptom tracking, food diaries, medication alarms, and disease information. However, not all these apps have been validated, and the benefits of these apps in the management of IBD are not clear. Furthermore, no study has evaluated the utility of disease activity patterns detected by apps in predicting disease outcomes in patients with IBD.

We previously developed and validated a web-based CD symptom diary that can be used by patients via smartphones11. After defining the activity pattern as good (downward or stable activity) or poor (upward or fluctuating activity), we found that poor patterns were independently associated with poor clinical outcomes such as disease-related hospitalization and surgery12. However, the role of this pattern analysis in predicting disease outcomes could not be determined because the incidence of outcomes was included within the period of activity in the pattern analysis12. Furthermore, disease duration varied widely between patients, which may have affected disease outcomes. Therefore, in the present study, we aimed to determine whether activity patterns recorded via a web-based monitoring system can help predict disease outcomes in patients with newly diagnosed CD.

Methods

Patients and study design

This was a prospective observational study conducted at four tertiary referral centers. Consecutive adult patients who were diagnosed with CD within the preceding 3 months from December 2016 to September 2018 were enrolled. CD was diagnosed according to the guidelines based on clinical symptoms, laboratory results, endoscopic findings, and imaging studies13. The location of the disease and its behavior were classified using the Montreal Classification14. Disease activity at enrollment was categorized using the CDAI score. Patients with ileostomy, those unable to use smartphones, and those who used a diary for no more than one month were excluded. All patients recorded their symptoms in a web-based Crohn’s disease symptom diary (CDSD) using their smartphones at least once a week. Demographic and clinical data, including age, sex, disease location and behavior, current medications, baseline disease activity, and serum C-reactive protein (CRP) levels at enrollment were collected. This study was approved by the Institutional Review Boards of all four participating hospitals (Kyungpook National University Hospital; Daegu Catholic University Medical Center; Yeungnam University Hospital; and Keimyung University Dongsan Medical Center). This study was registered at www.clinicaltrials.gov (NCT02760836) and the study protocol complied with the principles enshrined in the Helsinki Declaration. Written informed consent was obtained from all patients prior to enrollment.

CD symptom diary and pattern analysis

Variables in the CDSD were based on HBI, including the degree of abdominal pain, number of stools per day, general well-being, abdominal mass, and CD-related complications11. Patients recorded their symptoms in CDSD by clicking the relevant checkbox in the list of symptoms, and the score was automatically calculated12. In a previous study, the PRO score of CDSD showed a good correlation with the CDAI score11. Disease activity serially recorded via CDSD is graphically depicted in Fig. 112. The X-axis indicated the number of whole activity records during study. This axis was divided into two frames by half of whole number of recordings; 1st frame and 2nd frame.

Figure 1
figure 1

Graphical depiction of Crohn’s disease activity in the mobile monitoring system. The number on the x-axis indicates the number of activity records for the total follow-up period.

Pattern analysis has been described previously12. In brief, the activity pattern was classified as good or poor based on two factors: the degree of activity variation (maximum score–minimum score) and the tendency of activity (increasing, stationary, or decreasing). Based on these two factors in the two timeframes, there are 12 distinct patterns (Supplementary Table 1). When the variation is ≥ 6, it indicates a high variation, while a variation < 6 indicates a low variation. The follow-up period was divided into two timeframes, and variations were assessed within each timeframe. The tendency of activity was determined based on the difference between the mean scores of the two timeframes. Intuitively, increasing (higher mean score in the 2nd frame than that in the 1st) or persistently fluctuating activity (high variation of the scores in both time frames with a stationary trend [limited difference between the mean scores of each time frame]) were classified as poor patterns, whereas decreasing (higher mean score in the 1st frame than that in the 2nd) or stable activity (low variation of scores in both time frames with a stationary trend) were classified as good patterns. The details of the statistical calculations for the pattern analysis are provided in Supplementary Table 1.

The index clinical outcomes were disease-related hospitalization and surgery during the follow-up period. Anti-tumor necrosis factor (anti-TNF) use was assessed as an additional clinical outcome. For patients who experienced index outcomes during follow-up, disease activity from enrollment to the development of these outcomes was used. For patients who did not experience these outcomes, disease activity to the last follow-up was used for pattern analysis. Pattern analysis was separately performed for hospitalization and bowel resection because we surmised that activity patterns varied between these outcomes. The difference in the pattern for each outcome is explained in Supplementary Table 1. The final poor pattern group included patients showing poor patterns for hospitalization or bowel resection. In addition, as potential risk factors of hospitalization or surgery, CDAI at 3–6 months before the development of those outcomes were collected for patients who had index outcomes during follow-up. For patients without hospitalization or surgery, CDAI data at the last follow-up were collected.

Statistical analysis

In our previous retrospective study of CDSD, 19.6% of poor pattern CD patients and 1.8% of good pattern CD patients underwent bowel resection surgery with a significant difference between these two groups12. Based on these figures and factoring in 80% power and a 2-sided α of 0.05, forty-eight patients were required in each group. Student’s t-test or Mann–Whitney U test was used for testing continuous variables, and Fisher’s exact test or χ2 test was used for categorical variables. Continuous variables are presented as mean ± SD or median (interquartile range). Kaplan–Meier analysis with the log-rank test was used to identify the predictors of the outcomes. Variables with a p-value ≤ 0.1 in the log-rank test were included in a multivariate analysis using the Cox regression hazard model. In addition, known risk factors of poor clinical outcomes1, such as disease behavior and age at diagnosis, were included as variables for multivariate analysis. CDAI scores during the follow-up period were also included for multivariate analysis; p values < 0.05 were considered indicative of statistical significance. All analyses were performed using R (v4.1.2; R Core Team 2021).

Results

Baseline characteristics of patients

Of the 135 patients who were newly diagnosed with CD, 102 (69 [67.6%] males) were included in the final analysis (mean age at diagnosis: 25.2 ± 9.6 years). Thirty-three patients were excluded due to the following reasons: seven patients did not record their symptoms; one patient refused to participate; 25 patients recorded data for less than 1 month. The baseline characteristics of the patients are summarized in Table 1. The most common location was L3 (ileocolonic, 44 [43.1%], followed by L1 (ileal, 32 [31.4%]). In terms of disease behavior, B1 (nonstricturing nonpenetrating, 77 [75.5%]) was more common than B2 (stricturing, 13 [12.7%] and B3 (penetrating, 12 [11.8%]). At enrollment, 60 patients were categorized as having clinically active disease (mild, 20 [19.6%]; moderate to severe, 40 [39.2%]), while 42 (41.2%) were considered in clinical remission.

Table 1 Baseline characteristics of the study population.

During a median follow-up of 42 months (interquartile range, 18–51), 25 (24.5%) and 6 (5.9%) patients required disease-related hospitalization and surgery, respectively (Fig. 2). The median durations (month [interquartile range]) from the inclusion to hospitalization and to surgery were 6 (2–21) and 22 (17–27), respectively. We found no significant difference in the rate of hospitalization (28.9% vs. 24.5%, p = 0.665) and surgery (0 vs. 5.9%, p = 0.19) between 33 excluded-patients and 102 enrolled-patients. For hospitalization outcomes, pattern analysis revealed 36 patients with a poor pattern and 66 patients with a good pattern. Regarding bowel surgery outcomes, 37 and 65 patients were found to have poor and good patterns, respectively. Thirty patients showed a poor pattern with regard to both admission and surgery outcomes. Patients who had a poor pattern to admission or surgery were categorized as the final poor pattern group (43 [42.2%]), whereas the remaining patients showing a good pattern both to admission and surgery outcomes were categorized as the final good pattern group (59 [57.8%]). The baseline characteristics were not significantly different between the poor and good pattern groups, except for disease activity at enrollment. The mean CDAI score of the poor pattern group was significantly higher than that of the good pattern group (223.9 ± 101 vs. 174.4 ± 98.1, p = 0.015, Table 1).

Figure 2
figure 2

Flowchart depicting the inclusion of patients in the study.

Predictors of clinical outcomes

Kaplan–Meier analysis revealed a higher risk of hospitalization in patients with a poor pattern than in those with a good pattern (log-rank p = 0.0004, Fig. 3A). Other baseline factors, including age at diagnosis, disease behavior and location, medication use, serum CRP level, perianal disease, previous surgery, and smoking, demonstrated no significant difference (see Supplementary Fig. 1A–H); however, sex (female vs. male, log-rank p = 0.1) and disease activity at enrollment (log-rank p = 0.1) demonstrated marginal significance (Fig. 3B–C). Kaplan–Meier analysis revealed that a poor activity pattern (log-rank p = 0.02), female sex (log-rank p = 0.04), and disease activity at enrollment (log-rank p = 0.04) were associated with a higher risk of surgery (Fig. 4A–C). The other baseline characteristics showed no significant difference (see Supplementary Fig. 2A–H).

Figure 3
figure 3

Kaplan–Meier analysis of the risk of disease-related admissions stratified by activity pattern (A), sex (B), and disease activity at enrollment (C).

Figure 4
figure 4

Kaplan–Meier analysis for the risk of bowel resection stratified by activity pattern (A), sex (B), and disease activity at enrollment (C).

We used a Cox proportional hazard regression model to control for confounding variables in the risk analysis for each outcome. A poor pattern was found to be an independent predictor of both disease-related hospitalization (adjusted hazard ratio, 3.96; 95% confidence interval [CI] 1.50–10.49, p = 0.005, Table 2) and bowel resection surgery (adjusted hazard ratio, 19.48; 95% CI 1.86–203.95, p = 0.013, Table 2). Female sex was an additional independent predictor of surgery (adjusted hazard ratio, 11.28; 95% CI 1.49–85.01, p = 0.018; Table 2). Regarding anti-TNF use outcome (23 [22.5%] patients), poor pattern showed a statistical significance in Kaplan–Meier analysis (Supplementary Fig. 3) while it failed to remain in Cox-regression analysis (Supplementary Table 2).

Table 2 Cox regression hazard analysis for the predictors of disease outcomes.

Discussion

In this prospective multicenter study, the activity pattern described using a mobile monitoring system based on PRO was found to predict adverse outcomes in patients with newly diagnosed CD. A poor pattern was an independent predictor of disease-related admission and bowel resection. To the best of our knowledge, this is the first study to investigate the role of activity patterns in predicting disease outcomes in patients with CD.

Telemedicine or digital health technologies in the form of telemonitoring, tele-education, or telecounseling have not been found to be effective in improving disease outcomes in IBD, such as the risk of hospitalization or surgery and decreasing disease activity. In two randomized controlled trials, telemedicine was found to significantly reduce the hospitalization rate compared with standard care (traditional face-to-face encounters in the clinic) in IBD15,16; however, most other studies have found no difference in the hospitalization risk between telemedicine and standard care17,18,19,20,21. Additionally, no studies have reported the effects of telemedicine in decreasing the risk of surgery15,16,18,22. In a systematic review of randomized controlled studies investigating the effects of telemedicine on various outcomes compared with those of standard care, digital health intervention was found to be effective in reducing health utilization/cost; however, it had no effect on decreasing the relapse rate and improving treatment adherence23. This lack of effectiveness of digital health interventions in improving disease outcomes in IBD may be attributable to the relatively short follow-up for disease outcomes, mild disease activity in the study population, and nonspecific symptom-driven monitoring without objective measures (such as endoscopy or biochemical results) which are associated with better clinical outcomes23. Our findings suggest that the activity pattern based on a continuous assessment of PRO may be more useful than an occasional check-up of IBD symptoms in an outpatient clinic. In the present study, CDAI scores at enrollment (p = 0.697 for admission, p = 0.548 for surgery) or during follow-up (p = 0.323 for admission) were not found to predict disease outcomes (Table 2), supporting the potential advantage of continuous PRO monitoring over random assessment of CDAI. It is reasonable to surmise that patients with a poor activity pattern might benefit from early medical interventions, such as step-up therapy, based on an alarming notification in the remote monitoring telemedicine system. Further studies are required to determine whether digital health technologies using pattern analysis can help alter the disease course in patients with CD.

We analyzed data for anti-TNF use as another disease outcome. Although the poor pattern group showed a significantly higher risk of anti-TNF use than the good pattern group in the univariate analysis (Cox regression analysis p = 0.031, Supplementary Table 2; log-rank p = 0.03, Supplementary Fig. 3), the between-group difference was not statistically significant in the multivariate analysis after controlling for confounding factors (p = 0.124, Supplementary Table 2). Instead, CDAI at enrollment (p = 0.004) and disease behavior (p = 0.013) were found to be independent predictors of anti-TNF use (Supplementary Table 2). Although the reason for this disparate ability of pattern analysis for different clinical outcomes is not clear, the timing of the occurrence of the event is one of the plausible explanations. Anti-TNF initiation was observed earlier than the occurrence of admission or surgery (Supplementary Fig. 4); anti-TNF therapy was initiated in less than 1 year, whereas admission and surgery were generally observed in 20 months. The predictive ability of activity patterns for clinical outcomes in patients with IBD may manifest after a certain period. This suggests that pattern analysis might be useful for predicting long-term clinical outcomes. This finding needs to be confirmed in another independent cohort.

One of the key challenges in implementing telemedicine for IBD is the establishment of robust patient engagement24. In previous studies, approximately 50–70% of eligible patients refused or did not reply to the invitation, and the attrition rate was high (up to 32%)15,17,20,21,25. Most patients in previous studies had a longstanding disease with inactive or mild disease15,17,18,21. Patients in good condition are less likely to record their symptoms using an app because it may constantly remind them of their underlying disease24. The relatively high participation rate in our study (102/135, 75.5%) might be explained by a short disease duration (enrollment within 3 months after diagnosis) and the high proportion of patients with active disease; newly diagnosed patients or those with active symptoms are more likely to be interested in monitoring their disease activity.

The risk factors for surgery in CD include small bowel involvement, penetrating disease behavior, and smoking26. In an Asian cohort study, penetrating disease behavior was found to be an independent predictor of surgery in CD27. However, these variables were not identified as risk factors for surgery in the present study. This may be attributable to differences with respect to the characteristics of the study population and study design. For example, we excluded patients who could not use smartphones or those who recorded symptoms for less than a month. In addition, the small number of events (6 cases of surgery) might have reduced the statistical power. There are conflicting results regarding the influence of sex on outcomes. Some studies28,29 have found female sex as an independent risk factor for surgery in CD, which is consistent with our results.

One might argue that the CDSD use might be different between patients according to their clinical symptoms or well-being. For instance, sick patients might have reported their symptoms more frequently using CDSD than those who were doing well. To address this issue, we compared the median number of reporting symptoms between patients with surgery or hospitalization (25 [24.5%]) and those without these outcomes (77 [75.5%]). We found no significant difference in the median use number of CDSD between these two groups (47 [IQR 25–152] vs. 52 [25–141], p = 0.646) suggesting that patients’ clinical outcomes might not have significantly affected pattern analysis using CDSD.

The strengths of the present study include its prospective design, a homogeneous group of patients with newly diagnosed CD, and a relatively long follow-up period. Nonetheless, some limitations of this study should be acknowledged. The number of patients was relatively small. Although this study was conducted prospectively, the pattern analysis was performed retrospectively, which may have introduced an element of bias. In addition, the predictive ability of this pattern analysis needs to be validated in an independent cohort. As the patients were recruited from tertiary referral centers, the results might not be generalizable to all patients with CD. Lastly, although this study showed the potential usefulness of symptom-based assessment in predicting CD outcomes, there is a significant disconnect between patients’ subjective symptoms and objective measures such as biomarkers (CRP or fecal calprotectin) and endoscopic activity. Indeed, a study investigating the role of CRP in predicting patient prognosis showed a significant association between elevated CRP levels, even in patients with clinical remission, and subsequent CD-related hospitalization and intestinal resection during follow-up, suggesting the potential risk of a symptom-based approach in CD30. In addition, there is an overlap of symptoms between CD and other types of irritable bowel syndrome. Therefore, the results of this study need to be interpreted with caution.

In conclusion, the activity pattern based on a mobile monitoring system may help predict clinical outcomes, such as disease-related hospitalization and bowel resection surgery, in patients with newly diagnosed CD, thus underlining the potential role of telemedicine activity patterns in the management of IBD. Such remote monitoring with unique pattern analysis could be clinically relevant, particularly in the era of the COVID-19 pandemic. Further studies are required to ascertain whether management based on such pattern analysis can alter the disease course in patients with CD.